From grimoire
Designs an internal developer platform (IDP) to reduce cognitive load, standardize infrastructure, and improve developer experience at scale.
How this skill is triggered — by the user, by Claude, or both
Slash command
/grimoire:design-platform-engineering-modelThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Build a self-service internal developer platform (IDP) that abstracts infrastructure complexity, enforces golden paths, and enables product teams to deploy and operate software without deep platform expertise.
Build a self-service internal developer platform (IDP) that abstracts infrastructure complexity, enforces golden paths, and enables product teams to deploy and operate software without deep platform expertise.
Adopted by: Spotify (Backstage, open-sourced 2020), Netflix (PaaS internal platform), Airbnb, Uber, and over 60% of organizations with 500+ engineers (Gartner, 2023); CNCF Platform Engineering Working Group formed 2022 with 100+ member organizations
Impact: DORA 2023 research shows that teams with mature platform capabilities deploy 4x more frequently and have 50% lower change failure rates. Spotify's Backstage reduced time to onboard a new service from 2 weeks to 2 hours. Gartner predicts 80% of software engineering organizations will establish platform teams by 2026 to manage cloud complexity. Team Topologies research found cognitive load reduction from platform teams directly correlates with delivery throughput.
Why best: Without a platform team, every product team independently solves the same infrastructure problems (CI/CD, observability, secrets management, Kubernetes configuration) with inconsistent results. This creates n×m complexity. A platform team solves the problem once, exposes it as a self-service product, and product teams consume it — creating cognitive load reduction and economies of scale that compound as the organization grows.
Sources: Skelton, M. & Pais, M. "Team Topologies" IT Revolution (2019); CNCF Platform Engineering Maturity Model v1.0 (2023); Gartner "Innovation Insight for Platform Engineering" (Feb 2023); DORA "Accelerate State of DevOps Report" (2023); Humanitec "State of Platform Engineering" (2023)
Assess the baseline — quantify developer pain — Survey product teams to measure current cognitive load: time spent on infrastructure vs. product work, frequency of blocked deployments, toil hours per sprint, and onboarding time for new engineers. Use these metrics as the baseline against which platform ROI will be measured. Avoid building a platform based on assumptions — validate the pain points.
Define the platform as a product — adopt product management practices — Assign a product manager (or platform engineer with PM skills) to own the IDP roadmap. Treat internal product teams as customers. Conduct user research, maintain a prioritized backlog, publish a roadmap, and track adoption metrics (API calls, active teams, time-to-first-deployment). A platform nobody uses is waste.
Identify and design golden paths — opinionated defaults, not mandates — A golden path is the recommended, paved way to accomplish a common task (e.g., deploy a web service, add a new microservice, integrate with the message bus). Design 3–5 golden paths covering 80% of product team use cases. Golden paths use sensible defaults while permitting escape hatches for teams with valid reasons to diverge.
Build self-service capabilities — eliminate ticket-driven workflows — Replace manual provisioning requests (Jira tickets to platform/infra team) with self-service APIs, CLIs, or UIs. Use platform abstractions: Backstage software templates, Crossplane XRDs, or custom Kubernetes operators. The test: can a product engineer get a new production-ready service running without filing a ticket?
Implement an internal developer portal — centralize discoverability — Deploy a developer portal (Backstage, Port, or Cortex) as the front door to all platform capabilities. Catalog all services, APIs, documentation, runbooks, and infrastructure resources. Enable engineers to discover existing services before building new ones. Track service ownership, SLOs, and on-call information in one place.
Define a capability model — build in layers — Structure the IDP across layers: (1) infrastructure provisioning (compute, networking, storage), (2) container platform (Kubernetes abstractions, namespaces, quotas), (3) application delivery (CI/CD pipelines, deployment patterns), (4) observability (metrics, logs, traces, alerting), (5) security and compliance (secret management, policy enforcement, vulnerability scanning). Deliver layers incrementally, highest-demand first.
Encode standards as guardrails — shift left on compliance — Use policy-as-code (OPA/Gatekeeper, Kyverno) to enforce security and operational standards at admission time. Engineers get immediate feedback when their configuration violates policy, rather than discovering it at production deployment or in a security audit. Document every policy with a rationale and an exception process.
Measure developer experience — track SPACE metrics — Instrument the platform to collect SPACE metrics (Satisfaction, Performance, Activity, Communication, Efficiency). Monitor: deployment frequency per team, lead time from commit to production, time spent on toil per sprint, P95 time-to-self-service for each golden path. Review monthly. Drop features that aren't improving metrics.
Manage platform team topology — enable, don't gate — Structure the platform team as an enabling team (Team Topologies) that embeds with product teams to accelerate adoption, not a gatekeeper that controls access. Set a ratio target (1 platform engineer per 8–15 product engineers). Define interaction modes: X-as-a-Service for stable capabilities, Collaboration mode during onboarding and feature development.
Plan for evolution — avoid platform monolith lock-in — Document architecture decision records (ADRs) for every significant platform choice. Prefer composable, open-source components (Backstage, Argo CD, Crossplane, Prometheus stack) over closed vendor platforms that cannot be replaced. Revisit the platform architecture every 12–18 months against CNCF maturity model to identify capability gaps.
Spotify Backstage golden path: A product engineer runs backstage-cli create to scaffold a new microservice from a template that pre-configures GitHub Actions CI, Kubernetes deployment manifests, Datadog dashboards, and PagerDuty escalation — all compliant with organizational standards. Time from zero to first production deploy: under 2 hours.
Crossplane self-service databases: Instead of filing a Jira ticket for a production PostgreSQL instance, a product team applies a PostgreSQLInstance custom resource to their Kubernetes namespace. The Crossplane provider provisions an RDS instance with approved size, region, encryption, backup policy, and IAM role — all from a single YAML file reviewed in a PR.
npx claudepluginhub jeffreytse/grimoire --plugin grimoireGuides designing Internal Developer Platforms (IDPs), building platform teams, and improving developer experience. Covers Backstage, portal design, and platform engineering principles.
Initializes and guides enterprise-grade systems with microservices, Kubernetes, Terraform, and AI Native methodology. Supports multi-feature sprint planning with auto-pause triggers.