Contoso has an Azure subscription in North Europe that contains the corporate infrastructure. The current infrastructure contains a Microsoft SQL Server 2017 database. The database contains the following tables.
The FeedbackJson column has a full-text index and stores JSON documents in the following format.
The support staff at Contoso never has the unmask permission.
Requirements
Contoso is deploying a new Azure SQL database that will become the authoritative data store for the following;
Sometimesthe ingestion pipeline fails due to malformed JSON and duplicate payloads.
The engineers at Contoso report that the following dashboard query runs slowly.
SELECT VehicleTd, Lastupdatedutc, EngineStatus, BatteryHealth FROM dbo.VehicleHealthSumary where fleetld- gFleetld ORDER BV LastUpdatedUtc DESC;
You review the execution plan and discover that the plan shows a clustered index scan.
vehicleincidentReports often contains details about the weather, traffic conditions, and location. Analysts report that it is difficult to find similar incidents based on these details
Planned Changes
Contoso wants to modernize Fleet Intelligence Platform to support Al-powered semantic search over
incident reports.
Security Requirements
Contoso identifies the following telemetry requirements:
• Telemetry data must be stored in a partitioned table.
• Telemetry data must provide predictable performance for ingestion and retention operations.
• latitude, longitude, and accuracy JSON properties must be filtered by using an index seek.
Contoso identifies the following maintenance data requirements:
• Ensure that any changes to a row in the MaintenanceEvents table updates the corresponding
value in the LastModif reduce column to the time of the change.
• Avoidrecursive updates.
AI Search, Embedding’s, and Vector indexing
The development learn at Contoso will use Microsoft Visual Studio Code and GitHub Copilot and will
retrieve live metadata from the databases. Contoso identifies the following requirements for
querying data in the FeedbackJson column of the customer-Feedback table:
• Extract the customer feedback text from the JSON document.
• Filter rows where the JSON text contains a keyword.
• Calculate a fuzzy similarity score between the feedback text and a known issue description.
• Orderthe results by similarity score, with the highest score first
View Mode
Q: 1
You need to generate embeddings to resolve the issues identified by the analysts. Which column should you use?
Options
Discussion
No comments yet. Be the first to comment.
Be respectful. No spam.
Correct Answer:
B
Explanation
Embeddings are vector representations that capture the semantic meaning of data, primarily unstructured text. The incidentDescription column contains free-form text detailing the specifics of an incident. Generating embeddings from this column allows analysts to perform sophisticated tasks like semantic search, clustering similar incidents, and anomaly detection based on the narrative content. This provides deeper insights than structured data like type, location, or a numeric score, directly addressing the need to resolve complex analytical issues.
Why Incorrect
A. vehicleLocation is structured geographic data. While it can be embedded, it lacks the rich semantic context of a text description needed for deep analysis.
C. incidentType is categorical data. It's already structured and provides limited semantic richness for embedding compared to a full text description.
D. SeverityScore is a single numerical value. It is already in a machine-readable format and is not a candidate for generating semantic embeddings.
References
1. Microsoft Learn, "What are embeddings?". This document explains that embeddings are numerical representations of concepts, commonly used for text, to capture semantic relationships, making them ideal for search, clustering, and classification.
2. Microsoft Learn, "Tutorial: Use T-SQL to generate and search embeddings". Section "Generate and store embeddings" demonstrates creating embeddings from a text description column in a table to enable semantic analysis.
3. Microsoft Learn, "Generate embeddings with Azure OpenAI". This guide shows that text inputs are the primary source for creating embeddings to be used in applications like semantic search.