From azure-agent-skills
Expert guidance for Azure Batch: pools, autoscale, containers, MPI jobs, monitoring, and rendering. Includes troubleshooting, architecture, and best practices.
How this skill is triggered — by the user, by Claude, or both
Slash command
/azure-agent-skills:azure-batchThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
This skill provides expert guidance for Azure Batch. Covers troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.
This skill provides expert guidance for Azure Batch. Covers troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.
IMPORTANT for Agent: Use the Category Index below to locate relevant sections. For categories with line ranges (e.g.,
L35-L120), useread_filewith the specified lines. For categories with file links (e.g.,[security.md](security.md)), useread_fileon the linked reference file
IMPORTANT for Agent: If
metadata.generated_atis more than 3 months old, suggest the user pull the latest version from the repository. Ifmcp_microsoftdocstools are not available, suggest the user install it: Installation Guide
This skill requires network access to fetch documentation content:
mcp_microsoftdocs:microsoft_docs_fetch with query string from=learn-agent-skill. Returns Markdown.fetch_webpage with query string from=learn-agent-skill&accept=text/markdown. Returns Markdown.| Category | Lines | Description |
|---|---|---|
| Troubleshooting | L37-L43 | Diagnosing and fixing Azure Batch job, task, pool, and node errors, including failure patterns, error codes, and best practices for handling and recovering from failures. |
| Best Practices | L44-L57 | Guidance on designing efficient, secure, and performant Azure Batch solutions, including scheduling, dependencies, concurrency, output persistence, rendering workloads, and large-scale job/task optimization. |
| Decision Making | L58-L69 | Guidance on choosing VM sizes, images, Spot and ephemeral disks, and planning/migrating pools (custom images, Compute Gallery, comms model) to optimize Azure Batch performance and cost. |
| Architecture & Design Patterns | L70-L75 | Architectures and best practices for bursting on-prem render farms to Azure Batch, including storage layout, data movement patterns, and performance-optimized rendering workflows. |
| Limits & Quotas | L76-L81 | Azure Batch account-level limits, default and adjustable quotas, and how to view, manage, and request quota increases using .NET and the Azure portal. |
| Security | L82-L100 | Securing Batch accounts and pools: identity (Entra ID, managed identities, RBAC), keys/certs rotation, encryption, private endpoints/network perimeters, Key Vault, and Azure Policy governance. |
| Configuration | L101-L133 | Configuring Azure Batch pools, nodes, and tasks: autoscale, OS upgrades, networking, storage mounts, containers, diagnostics events, metrics/logs, and secure task/user environments. |
| Integrations & Coding Patterns | L134-L147 | Client SDKs, CLIs, and patterns for integrating with Azure Batch: managing pools/tasks, containers, MPI, storing task output, and monitoring with Application Insights. |
| Deployment | L148-L152 | Deploying Azure Batch workloads using Azure Pipelines and CLI templates, including end-to-end job setup, automation, and integration into CI/CD workflows. |
| Topic | URL |
|---|---|
| Diagnose and handle Azure Batch job and task errors | https://learn.microsoft.com/en-us/azure/batch/batch-job-task-error-checking |
| Troubleshoot Azure Batch pool and node errors | https://learn.microsoft.com/en-us/azure/batch/batch-pool-node-error-checking |
| Diagnose and handle Azure Batch task errors | https://learn.microsoft.com/en-us/azure/batch/error-handling |
| Topic | URL |
|---|---|
| Reference architectures for bursting render farms to Azure Batch | https://learn.microsoft.com/en-us/azure/batch/batch-rendering-architectures |
| Design storage and data movement for Azure Batch rendering | https://learn.microsoft.com/en-us/azure/batch/batch-rendering-storage-data-movement |
| Topic | URL |
|---|---|
| Manage Azure Batch accounts and quotas with .NET | https://learn.microsoft.com/en-us/azure/batch/batch-management-dotnet |
| Review Azure Batch service quotas and limits | https://learn.microsoft.com/en-us/azure/batch/batch-quota-limit |
| Topic | URL |
|---|---|
| Deploy Azure Batch HPC solutions with Azure Pipelines | https://learn.microsoft.com/en-us/azure/batch/batch-ci-cd |
| Run Azure Batch jobs end-to-end using CLI templates | https://learn.microsoft.com/en-us/azure/batch/batch-cli-templates |
npx claudepluginhub microsoftdocs/agent-skills --plugin azure-agent-skillsProvides Azure Batch client SDK usage for Java, including pool, job, and task operations with authentication examples (Entra ID, shared key). Helpful for running scalable parallel or HPC batch jobs on Azure.
Provides expert guidance for Azure CycleCloud development including troubleshooting, architecture, security, and integration with Slurm/LSF/HTCondor. Use when automating CycleCloud via API/CLI or tuning autoscaling.
Manages Azure Batch pools, jobs, tasks, and nodes using Java SDK for large-scale parallel and HPC workloads.