Agentic AI fundamentals for enterprise. What agents are, when to use them vs. workflows or chatbots, agent anatomy, architecture patterns, decision frameworks, and common failure modes. Use when someone is new to agents, evaluating whether agents fit their use case, or needs conceptual grounding before deploying. Triggers: what is an agent, agent vs workflow, should I use agents, how do agents work, new to agents, agent fundamentals, when to use agents, agent design patterns, explain agentic, agent architecture, learn about agents, agentic AI concepts.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-blueprint-skills:agent-fundamentalsThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
A practical guide to understanding AI agents in enterprise environments.
A practical guide to understanding AI agents in enterprise environments. Grounded in published research from Anthropic, OpenAI, LangChain, McKinsey, and MIT Sloan.
If you already know agents and want to deploy them, skip this and load
agent-deployment instead.
The clearest definition comes from Anthropic ("Building Effective Agents," Dec 2024):
The key distinction is who decides what happens next: the developer (workflow) or the LLM (agent).
An agent is an LLM that can:
This definition is consistent across Anthropic, OpenAI, Google, Microsoft, and LangChain as of 2026.
What an agent is NOT:
Anthropic's core principle: "Find the simplest solution possible, and only increase complexity when needed."
This is backed by industry experience: most successful production deployments use workflows or workflow-plus-LLM, not autonomous agents. Only a small fraction of real production workloads need fully autonomous agents.
The most common failure in enterprise agent projects. An agent that always follows the same steps in the same order is just an expensive, slow, unreliable workflow.
Signs:
Anthropic warns: "Agentic systems often trade latency and cost for better task performance, and you should consider when this tradeoff makes sense."
Every agent has the same building blocks, regardless of platform.
The system prompt. Defines purpose, behavior, rules, and decision criteria. Anthropic's guidance: "Think of this as writing a great docstring for a junior developer."
Specific instructions produce reliable agents:
"When you receive an RFP, extract vendor name, due date, estimated value, and category. If value exceeds $100K, route to senior review. Otherwise, create a standard evaluation record."
Vague instructions produce inconsistent agents:
"Be helpful with procurement tasks."
Actions the agent can take. Each tool has a name, description, and defined inputs/outputs. Tools connect the agent to systems:
An agent without tools is just a chatbot.
Tool count matters. LLM performance degrades as tool count grows. Models pick the wrong tool more often, hallucinate tool names, and spend more tokens on selection logic. Anthropic's heuristic: use dynamic tool loading when you have more than 10 tools or tool definitions consume more than 10K tokens. If an agent needs many tools, split into multiple agents with focused tool sets.
Anthropic also recommends: "Poka-yoke your tools" -- design tool interfaces so it's harder to make mistakes. Rich descriptions with examples improve selection accuracy dramatically.
What the agent knows:
Constraints on what the agent can and cannot do. Every platform vendor (Anthropic, OpenAI, Microsoft) emphasizes guardrails as architectural requirements, not optional add-ons.
88% of organizations with agent deployments report confirmed or suspected security incidents (Gravitee, 2026). Define what the agent CANNOT do.
How the agent fits into a larger system. See Architecture Patterns below.
Anthropic documents a progression from simple to complex. Start at the top and only move down when the simpler option fails.
Sequential LLM calls with programmatic checks between steps. Each step processes the output of the previous one. The developer controls the sequence.
When: Task decomposes into fixed subtasks with clear handoffs. Example: Generate report outline → write each section → review and edit. Not an agent. This is a workflow with LLM steps.
Classifies input and directs to specialized handlers. One LLM call decides where the request goes, then a specific handler processes it.
When: Distinct categories that need different handling. Example: Customer message → classify as billing/technical/sales → route to specialized handler.
A central LLM dynamically breaks down tasks and delegates to specialized workers. The orchestrator decides what subtasks are needed at runtime.
When: Complex tasks where you can't predict the subtasks upfront. Example: Coding agent that decides which files to read, what to modify, and how to test based on a bug report.
Platform note: On most enterprise platforms, the orchestrator maps to the platform's native workflow or pipeline layer, not a separate agent record.
Generator produces output, evaluator assesses it, loop continues until quality threshold is met.
When: Clear evaluation criteria exist and iterative refinement adds value. Example: Code generation where tests provide automatic evaluation.
LLM using tools in a loop based on environmental feedback. The agent decides every step dynamically.
When: Open-ended problems where you can't predict the number of steps. Most complex. Most expensive. Hardest to control. Only use when simpler patterns genuinely can't handle the task.
When a single agent isn't enough:
| Question | Pattern |
|---|---|
| Can the task be decomposed into fixed steps? | Prompt chaining (workflow) |
| Does input need classification before handling? | Routing |
| Can one agent handle it with a few tools? | Single autonomous agent |
| Are subtasks unpredictable at design time? | Orchestrator + workers |
| Does output quality need iterative refinement? | Evaluator-optimizer |
| Are there high-stakes decision points? | Add human-in-the-loop |
Start with the simplest pattern. Most production systems are patterns 1-2, not pattern 5 (LangChain, 2026).
40%+ of agentic AI projects will be abandoned by end of 2027 (Gartner, June 2025). 42% of companies abandoned most AI initiatives in 2025, up from 17% in 2024 (S&P Global). Understanding why helps avoid the same mistakes.
| Failure | What happens | Evidence |
|---|---|---|
| Over-agenting | Using agents for tasks workflows handle better. High cost, low reliability. | Most successful deployments are workflows, not agents (Anthropic 2024, LangChain 2026) |
| Tool sprawl | Too many tools. LLM picks wrong ones, hallucinates tool names. | Anthropic recommends dynamic tool loading at >10 tools or >10K tokens of tool definitions. |
| Vague instructions | Inconsistent behavior. Same input produces different results. | Anthropic: instructions should read like "a great docstring for a junior developer" |
| Missing guardrails | Agent sends wrong emails, updates wrong records, surfaces confidential data. | 88% of orgs with agents report security incidents (Gravitee 2026) |
| Skipping validation | "Works in demos" but fails on real data. No execution trace review. | Only 52% run offline evaluations, 37% run online evaluations (LangChain 2026) |
| Ignoring sociotechnical work | Focus on model, ignore governance, data quality, org readiness. | 80% of implementation effort is sociotechnical, not model work (MIT Sloan) |
| Cost explosions | Agent loops trigger unbounded API costs. No iteration limits. | Agentic workflows have variable, unpredictable costs (MIT Sloan) |
| No fallback | Agent fails silently. Nobody knows until a customer complains. | Only 6% of companies fully trust agents for core processes (HBR/Workato 2026) |
When someone brings a potential agent use case, walk through these questions:
If question 5 produces zero items, build a workflow.
See references/WORKED_EXAMPLES.md for three detailed walkthroughs applying this framework.
From Anthropic's "Building Effective Agents":
From OpenAI's "Practical Guide to Building Agents":
For context on where the industry stands:
The technology works. The challenge is governance, data quality, and organizational readiness.
Once you understand the concepts:
agent-deployment for structured, vendor-agnostic
methodology (pilot-first, test gates, phased rollout)servicenow-ai-agents for platform-specific
data model, entity mapping, and configurationresearch-agent for multi-tool research methodologyThe progression: understand (this skill) → deploy (agent-deployment) → platform (servicenow-ai-agents or equivalent)
Built by Agent Blueprint -- AI advisory for enterprise agent deployment.
npx claudepluginhub agent-blueprint/agent-blueprint-skills --plugin agent-blueprint-skillsDesign AI agents with capabilities, knowledge, and context. Guides agent architecture decisions from simple loops to subagents and planning.
Designs and builds AI agents for business, research, operations, and creative domains. Covers architecture, capabilities, knowledge, context, planning, and subagents.
Provides patterns and principles for building reliable autonomous agents: agent loops (ReAct, Plan-Execute), goal decomposition, reflection, and production guardrails. Useful when designing constrained, domain-specific agents.