📖 About this Domain
This domain covers the execution of model training processes within the Azure Machine Learning workspace. It focuses on creating and running script-based experiments, logging metrics, and managing training jobs on various compute targets.
🎓 What You Will Learn
- You will learn to create and run script-based experiments using the Azure Machine Learning SDK and CLI v2.
- You will learn to consume data from datastores and data assets for model training jobs.
- You will learn to implement model training using frameworks like Scikit-learn and log metrics with MLflow for run tracking.
- You will learn to configure and use compute targets, such as compute clusters, for scalable training jobs.
🛠️ Skills You Will Build
- You will build the skill to write Python scripts that connect to an Azure ML workspace, access data, and train a machine learning model.
- You will build the skill to configure and submit command jobs using YAML definitions for reproducible and scalable training runs.
- You will build the skill to integrate MLflow for autologging or custom logging of parameters, metrics, and artifacts during an experiment run.
- You will build the skill to analyze experiment run history, compare metrics, and retrieve the best model from a completed job.
💡 Top Tips to Prepare
- Master the structure of YAML files for defining command jobs, including inputs, code, environment, and compute specifications.
- Practice using both the Azure ML SDK v2 and CLI v2 to submit and manage training jobs, as the exam covers both.
- Understand the different ways to log metrics, specifically the distinction between logging single values versus logging artifacts like charts.
- Focus on how to retrieve a registered model from a job run and understand the relationship between a job, its outputs, and the model registry.
📖 About this Domain
This domain covers the operationalization of machine learning models using Azure Machine Learning. You will focus on deploying models as real-time and batch endpoints, and automating workflows with pipelines for continuous integration and delivery (CI/CD). It is the core of MLOps practices within the Azure ecosystem.
🎓 What You Will Learn
- Deploy trained models as real-time endpoints on compute targets like Azure Container Instance (ACI) or Azure Kubernetes Service (AKS) for immediate inference.
- Implement batch inference pipelines to process large volumes of data asynchronously using batch endpoints.
- Configure data drift and model performance monitoring to maintain model quality and detect degradation in production.
- Automate the end-to-end ML lifecycle by creating, publishing, and scheduling Azure Machine Learning pipelines using components.
🛠️ Skills You Will Build
- Ability to package models with scoring scripts and environment configurations for containerized deployment.
- Proficiency in creating, managing, and troubleshooting both real-time and batch inference endpoints via the SDK, CLI v2, and studio.
- Competence in building and parameterizing reusable ML pipelines for training and retraining workflows.
- Capability to set up data drift monitors on datasets and interpret results to trigger automated actions like retraining.
💡 Top Tips to Prepare
- Master the differences between ACI for dev/test deployments and AKS for production-scale, high-availability endpoints.
- Practice defining endpoints, deployments, and pipeline components using YAML configuration files for the CLI v2.
- Gain hands-on experience with the Azure CLI v2 for managing all MLOps assets, as it is a key focus of the exam.
- Understand the concepts of a baseline dataset versus a target dataset when configuring a data drift monitor.
📖 About this Domain
This domain covers the operationalization of ethical AI principles using Azure Machine Learning. It focuses on implementing tools and techniques to ensure fairness, interpretability, and privacy in machine learning models. You will work with the Responsible AI dashboard to assess and mitigate potential harms.
🎓 What You Will Learn
- You will learn to use the Responsible AI dashboard to evaluate models for fairness and identify disparities across demographic groups.
- You will learn to implement model explainability techniques, such as SHAP, to interpret model predictions and understand feature importance.
- You will learn to analyze model errors and identify causal relationships to improve model reliability and robustness.
- You will learn the core concepts of differential privacy and how it can be applied to protect sensitive user data during the training process.
🛠️ Skills You Will Build
- You will build the skill to configure and interpret Responsible AI scorecards for a holistic assessment of model performance and fairness.
- You will build the skill to generate both global and local explanations for models to debug and validate their behavior.
- You will build the skill to apply fairness mitigation algorithms from the Fairlearn toolkit to retrain models and reduce bias.
- You will build the skill to identify data drift and other failure modes in production models to ensure ongoing reliability.
💡 Top Tips to Prepare
- Focus on the components of the Responsible AI dashboard, including error analysis, model explainability, and fairness assessment.
- Practice interpreting SHAP values and other explainer outputs to understand a model's decision-making process.
- Understand the different fairness metrics, such as demographic parity and equalized odds, and when to apply them.
- Review how to configure a pipeline job in Azure ML that includes a component for generating a Responsible AI dashboard.
📖 About this Domain
This domain covers the provisioning and configuration of the Azure Machine Learning workspace, the foundational resource for machine learning operations. The workspace is the top-level resource that centralizes all ML artifacts and orchestrates dependent Azure resources.
🎓 What You Will Learn
- Provision an Azure Machine Learning workspace using the Azure portal, Azure CLI, and Python SDK.
- Identify the dependent Azure resources created with a workspace, including Storage Account, Key Vault, Application Insights, and Container Registry.
- Manage workspace access and permissions using Azure Role-Based Access Control (RBAC) for different user roles.
- Navigate the Azure Machine Learning studio to locate and manage workspace assets and properties.
🛠️ Skills You Will Build
- Deploying and configuring the core Azure Machine Learning environment required for any data science project.
- Implementing security controls by assigning built-in RBAC roles like Owner, Contributor, and Reader to users and service principals.
- Managing the lifecycle of a workspace, including understanding its properties and associated resource dependencies.
- Utilizing the Azure Machine Learning studio as the primary user interface for interacting with workspace assets.
💡 Top Tips to Prepare
- Practice creating a workspace using all three methods: Azure portal, Python SDK (Workspace.create), and Azure CLI (az ml workspace create).
- Memorize the four Azure resources automatically provisioned with a workspace and their specific functions.
- Understand the distinction between the Basic and Enterprise workspace editions and their feature differences.
- Know the specific permissions granted by the standard RBAC roles (Owner, Contributor, Reader) within the context of a workspace.