1. Microsoft Azure Documentation
"Troubleshoot model deployment": In the section on autoscaling
the documentation explains that when a service fails because the cluster is at maximum capacity
the solution is to increase the capacity. It states
"To change the maximum number of nodes in the cluster
you need to update the compute target." This directly supports increasing the node count to handle higher loads.
Source: Microsoft Corporation. (2023). Troubleshoot model deployment with Azure Machine Learning. Microsoft Docs. Retrieved from https://docs.microsoft.com/en-us/azure/machine-learning/how-to-troubleshoot-deployment#autoscaling (Refer to the section on Autoscaling).
2. Microsoft Azure Documentation
"Scale cluster in Azure Kubernetes Service (AKS)": This document details the procedures for scaling an AKS cluster. It highlights that scaling the node count is the primary method for adjusting compute capacity to meet application demands. The process can be done manually with a single command (az aks scale)
which exemplifies minimal administrative effort and is designed to be non-disruptive to running applications.
Source: Microsoft Corporation. (2023). Scale the node count in an Azure Kubernetes Service (AKS) cluster. Microsoft Docs. Retrieved from https://docs.microsoft.com/en-us/azure/aks/scale-cluster (Refer to the "Manually scale cluster nodes" section).
3. Microsoft Azure Documentation
"Deploy a model to an Azure Kubernetes Service cluster": This guide explains how to configure an AKS cluster for machine learning workloads
including setting autoscaling parameters. It notes that if autoscaling is not enabled
the cluster operates with a fixed number of nodes
which can be changed manually. If autoscaling is enabled but the maximum is reached
the solution is to increase the autoscalemaxnodes parameter when updating the compute target. Both actions are equivalent to increasing the node count.
Source: Microsoft Corporation. (2023). Deploy a model to an Azure Kubernetes Service cluster. Microsoft Docs. Retrieved from https://docs.microsoft.com/en-us/azure/machine-learning/how-to-deploy-azure-kubernetes-service (Refer to the "Autoscaling" section).