What is the Claude Agent SDK?
You've used Claude Code. You've seen it read files, run commands, search the web, and write code autonomously. Now you're going to understand what powers that autonomy—and build your own agents using the same foundations.
The Claude Agent SDK is fundamentally different from the Claude API (the Anthropic Client SDK). If you've worked with OpenAI's SDK or Google's API, the difference will be immediately apparent, and it shapes everything you'll build in this chapter.
Let's start with the core distinction, because it determines which tools you'll reach for in production.
The Mental Model: Two Different Paradigms
When you use the Claude API (Client SDK), you're implementing the agent loop yourself:
Your code:
1. Send prompt to Claude
2. Claude returns tool calls ("write this file")
3. You execute the tools
4. Send results back to Claude
5. Claude responds or requests more tools
6. Repeat until done
You control the loop. Claude generates instructions. You manage execution.
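To make that concrete, here's a minimal sketch of the loop you'd own with the Anthropic Python client. The write_file tool, the run_tool() handler, and the model name are illustrative assumptions, not part of any prescribed setup.

```python
# Minimal sketch of the manual agent loop you own with the Client SDK.
# The write_file tool, run_tool() handler, and model name are illustrative.
import anthropic

client = anthropic.Anthropic()
tools = [{
    "name": "write_file",
    "description": "Write content to a file path",
    "input_schema": {
        "type": "object",
        "properties": {"path": {"type": "string"}, "content": {"type": "string"}},
        "required": ["path", "content"],
    },
}]

def run_tool(name: str, tool_input: dict) -> str:
    """Step 3: you execute every tool call yourself."""
    if name == "write_file":
        with open(tool_input["path"], "w") as f:
            f.write(tool_input["content"])
        return "ok"
    return f"unknown tool: {name}"

messages = [{"role": "user", "content": "Create hello.txt containing 'hello'"}]
while True:
    # Steps 1-2: send the conversation, get back text and/or tool calls
    response = client.messages.create(
        model="claude-sonnet-4-20250514",  # assumed model name; use one you have access to
        max_tokens=1024,
        tools=tools,
        messages=messages,
    )
    if response.stop_reason != "tool_use":
        break  # Step 6: no more tool requests, the task is done
    # Steps 3-4: execute the requested tools and send the results back
    messages.append({"role": "assistant", "content": response.content})
    tool_results = [
        {"type": "tool_result", "tool_use_id": block.id,
         "content": run_tool(block.name, block.input)}
        for block in response.content if block.type == "tool_use"
    ]
    messages.append({"role": "user", "content": tool_results})
```

Every line of that loop is yours to maintain, and none of it is the code review logic, deployment script, or whatever your agent is actually for.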
With Claude Agent SDK, Claude handles the loop:
Your code:
1. Send prompt to Claude
2. Claude observes the filesystem, runs commands, and takes action
3. Claude reports back with results
4. Repeat until task completes
Claude controls the loop. You specify the task. Claude manages execution.
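The equivalent with the Agent SDK collapses to a single call. A minimal sketch, assuming the Python claude-agent-sdk package used throughout this chapter:

```python
# Minimal sketch: the Agent SDK runs the loop; you describe the task.
import asyncio
from claude_agent_sdk import query, ClaudeAgentOptions

async def main():
    options = ClaudeAgentOptions(allowed_tools=["Read", "Write", "Bash"])
    async for message in query(
        prompt="Create hello.txt containing 'hello'",
        options=options,
    ):
        print(message)  # progress and result messages stream back as Claude works

asyncio.run(main())
```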
The Question This Raises: Why does this distinction matter?
When you implement the loop yourself, you're responsible for:
- Parsing Claude's tool requests correctly
- Handling tool execution errors gracefully
- Managing state between iterations
- Deciding when the agent should stop
- Implementing security guardrails
That's a substantial surface area for bugs. You're not writing the agent logic—you're building infrastructure that the agent logic depends on.
With Agent SDK, Claude handles that infrastructure. You focus on:
- What the agent should do (specification)
- What tools it has access to (permissions)
- How to evaluate whether it succeeded (validation)
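In code, those three responsibilities map roughly onto the options you pass in and the check you run afterwards. A sketch under the same assumptions as above (the changelog task and the assertion are illustrative):

```python
# Sketch: specification, permissions, and validation are what you still own.
from claude_agent_sdk import query, ClaudeAgentOptions

options = ClaudeAgentOptions(
    # Specification: what the agent should do
    system_prompt="You are a release-notes writer. Summarize merged changes into CHANGELOG.md.",
    # Permissions: which tools it may use
    allowed_tools=["Read", "Grep", "Write"],
)

async def run_and_validate():
    async for message in query(prompt="Update CHANGELOG.md for this week", options=options):
        pass  # the SDK handles the tool loop; we only care about the outcome here
    # Validation: did the agent actually produce what we asked for?
    with open("CHANGELOG.md") as f:
        assert "##" in f.read(), "CHANGELOG.md has no release section"
```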
This is why the Claude Agent SDK is an agent SDK rather than just another API client. It isn't merely a connection to Claude's models; it's a complete agent runtime that handles the complexity you'd otherwise implement yourself.
Comparison: Claude SDK vs Client SDK vs Competitors
Here's how Claude Agent SDK compares to other approaches:
| Capability | Claude Agent SDK | Claude API (Client SDK) | OpenAI Agents SDK | Google ADK |
|---|---|---|---|---|
| Tool Execution | Claude autonomous | You implement loop | Framework-run loop (function tools) | Gemini native tool calling |
| Skills Ecosystem | Yes (SKILL.md files) | No | No | No |
| File Checkpointing | Yes (rewindFiles()) | No | No | No |
| Runtime Permissions | Yes (canUseTool callback) | No | No (static allowlist) | No (static allowlist) |
| Custom Commands | Yes (.claude/commands/) | No | No | No |
| Cost Tracking | Aggregated cost per result (total_cost_usd) | Per-message token usage | Per-message token usage | Per-message token usage |
| System Prompt Presets | Yes (claude_code preset) | No | No | No |
| Session Management | Persistent (resume=session_id) | Single-turn | Single-turn | Single-turn |
| MCP Integration | Native (mcp_servers=) | Basic | Requires wrapper | Basic |
| Multi-Turn Conversations | ClaudeSDKClient context preservation | Manual context management | Built-in conversation history | Built-in |
What do these differences mean for building Digital FTEs?
Skills Ecosystem: Reusable Intelligence
With Claude Agent SDK, you can package domain expertise into skills—filesystem-based documents that give your agent specialized knowledge.
Instead of including everything in a system prompt:
# WITHOUT SKILLS ECOSYSTEM
from claude_agent_sdk import ClaudeAgentOptions

options = ClaudeAgentOptions(
    system_prompt="""You are a code review expert.
Consider these security patterns: [20 KB of guidelines]
Consider these performance patterns: [15 KB of guidelines]
Consider these testing patterns: [10 KB of guidelines]"""
)
You organize it as reusable skills:
# WITH SKILLS ECOSYSTEM
from claude_agent_sdk import ClaudeAgentOptions

options = ClaudeAgentOptions(
    setting_sources=["project"],  # Load from .claude/skills/
    allowed_tools=["Skill"]
)
Then your agent accesses /skills/security-review.md, /skills/performance-patterns.md, etc. as needed. Skills compose. They're versionable. Teams can share them.
Why this matters for Digital FTEs: Your product can expose different skills to different customers. One Digital FTE loads the "aggressive optimization" skill. Another loads the "conservative reliability" skill. Same agent, different expertise.
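A sketch of that pattern, assuming each customer gets its own workspace with its own .claude/skills/ directory (the cwd option name is an assumption here, not confirmed API):

```python
# Sketch: one agent configuration, different expertise per customer workspace.
from claude_agent_sdk import ClaudeAgentOptions

def options_for(workspace: str) -> ClaudeAgentOptions:
    return ClaudeAgentOptions(
        cwd=workspace,                # assumption: point the agent at the customer's workspace
        setting_sources=["project"],  # load that workspace's .claude/skills/
        allowed_tools=["Skill", "Read", "Grep"],
    )

# One Digital FTE loads the "aggressive optimization" skills, another the
# "conservative reliability" skills: same code, different .claude/skills/ content.
optimizer = options_for("/workspaces/customer-a")
conservative = options_for("/workspaces/customer-b")
```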
File Checkpointing: Undo Capability
Claude Agent SDK tracks file changes and lets you rewind:
# Rewind files to previous checkpoint
await client.rewind_files(checkpoint_id)
This is extraordinarily valuable for agents that might:
- Accidentally delete critical files
- Make speculative changes that need reverting
- Need to try multiple approaches
Why this matters for Digital FTEs: A buggy agent doesn't destroy a customer's codebase—it reverts. The agent can be bold in exploration because failure is recoverable.
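A sketch of the recovery pattern, reusing rewind_files() from above. How you capture checkpoint_id and issue follow-up prompts depends on the SDK's session API; treat both as placeholders here.

```python
# Sketch: attempt risky work, validate it, and rewind the files if it fails.
# checkpoint_id and client.query() are placeholders for the SDK's session API.
import subprocess

async def attempt_risky_refactor(client, checkpoint_id):
    await client.query("Refactor the payment module to use the new tax service")
    # Validate however your domain requires; here, by running the test suite
    tests_pass = subprocess.run(["pytest", "-q"]).returncode == 0
    if not tests_pass:
        # Failure is recoverable: roll the filesystem back to the checkpoint
        await client.rewind_files(checkpoint_id)
    return tests_pass
```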
Runtime Permissions: Dynamic Decision-Making
With canUseTool callback, your agent makes permission decisions at runtime based on context:
async def can_use_tool(tool: str, input: dict, context: dict):
    # DENY writes to config files
    if tool == "Write" and "/config/" in input.get("file_path", ""):
        return {"behavior": "deny", "message": "Config files protected"}
    # ALLOW sandboxed writes under /tmp (updatedInput could also rewrite the path)
    if tool == "Write" and input.get("file_path", "").startswith("/tmp"):
        return {"behavior": "allow", "updatedInput": input}
    return {"behavior": "allow", "updatedInput": input}
OpenAI and Google SDKs use static allowlists; the Claude Agent SDK lets your permission logic reason about the specific call in context.
Why this matters for Digital FTEs: Your product can implement sophisticated permission policies. "Allow writes to test files but not production files." "Allow API calls to staging environment but not production." The agent respects these policies without modification.
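A sketch of that kind of policy, reusing the callback shape from above. The path prefixes and the production URL check are illustrative conventions, not SDK requirements.

```python
# Sketch: environment-aware permission policy using the same callback shape.
PROTECTED_WRITE_PREFIXES = ("src/", "deploy/production/")

async def can_use_tool(tool: str, input: dict, context: dict):
    if tool == "Write":
        path = input.get("file_path", "")
        # Allow writes to test files, deny writes to production code paths
        if path.startswith("tests/"):
            return {"behavior": "allow", "updatedInput": input}
        if path.startswith(PROTECTED_WRITE_PREFIXES):
            return {"behavior": "deny", "message": f"{path} is a protected path"}
    if tool == "WebFetch":
        url = input.get("url", "")
        # Allow calls to staging, deny calls to production
        if "prod.example.com" in url:
            return {"behavior": "deny", "message": "Production endpoints are off-limits"}
    return {"behavior": "allow", "updatedInput": input}
```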
Cost Tracking: Per-Message Economics
All SDKs track token usage. Claude Agent SDK integrates cost tracking:
async for message in query(prompt="Task", options=opts):
    if message.type == "result":
        print(f"Total cost: ${message.total_cost_usd:.4f}")
Why this matters for Digital FTEs: You're selling agent capability. Cost transparency drives pricing decisions. Knowing that a particular workflow consumes $0.47 per execution lets you offer it profitably.
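A sketch of how that feeds pricing: accumulate the per-run cost from result messages (following the pattern above) and price each execution at cost plus margin.

```python
# Sketch: measure average cost per workflow run and derive a price with margin.
from claude_agent_sdk import query

run_costs = []

async def run_workflow(prompt, opts):
    async for message in query(prompt=prompt, options=opts):
        if message.type == "result":          # same result-message pattern as above
            run_costs.append(message.total_cost_usd)

def suggested_price(margin: float = 0.5) -> float:
    """Charge measured cost plus a target margin (50% by default)."""
    if not run_costs:
        raise ValueError("no runs measured yet")
    return (sum(run_costs) / len(run_costs)) * (1 + margin)
```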
Why Claude Agent SDK Is The Top Contender For Digital FTEs
The fundamental insight: Claude Agent SDK is the only SDK designed for production autonomous agents.
OpenAI SDK and Google ADK are designed for:
- Single-turn queries with tool use
- Stateless interactions
- Direct API consumption
Claude Agent SDK is designed for:
- Multi-turn agent workflows
- Stateful sessions with recovery
- Production deployment with guardrails
For building Digital FTEs—agents that work 24/7 for customers—this distinction is critical.
Consider a Digital FTE that reviews pull requests for code quality:
Without Agent SDK (OpenAI approach):
- You implement the agent loop
- Each code review is a separate session
- If the agent makes a mistake modifying code, you recover manually
- Permissions are static (can write or cannot write)
- Cost tracking requires parsing API responses
With Agent SDK:
- Claude handles the loop
- Session persists across multiple PRs
- File changes are checkpointed and reversible
- Permissions adapt based on which PR branch is being reviewed
- Cost is transparent per review
The Agent SDK removes infrastructure burden. You focus on domain expertise (what makes a good code review?). Claude handles the agent machinery.
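As a sketch, the Agent SDK version of that reviewer might look like this. ClaudeSDKClient comes from the comparison table above; the query() and receive_response() method names are assumptions based on its multi-turn role, so check them against the SDK docs.

```python
# Sketch: one persistent session reviewing several PRs in sequence.
# Method names on ClaudeSDKClient are assumptions; verify against the SDK docs.
from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions

async def review_prs(pr_numbers: list[int]):
    options = ClaudeAgentOptions(allowed_tools=["Read", "Grep", "Bash"])
    async with ClaudeSDKClient(options=options) as client:
        for pr in pr_numbers:
            # Earlier findings and repo conventions stay in context across PRs
            await client.query(f"Review PR #{pr} for code quality and summarize findings")
            async for message in client.receive_response():
                print(message)
```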
The Skills Ecosystem: Compound Intelligence
This is where Claude Agent SDK truly differentiates for Digital FTEs.
A skill is a markdown file with:
- Persona: How should the agent think about this domain?
- Logic: What decision frameworks apply?
- Context: What prerequisites matter?
- Data/Knowledge: What patterns exist?
Skills compose. A code review agent might load:
- /skills/security-review.md — Security patterns and threat modeling
- /skills/python-best-practices.md — Python-specific guidance
- /skills/test-driven-development.md — Testing philosophy
Each skill is reusable across agents. Skills are versionable. Teams share and improve them.
Why this matters: You're not building one agent. You're building an agency—a collection of specialized agents sharing domain expertise through skills.
What You're Building In This Chapter
This chapter teaches you to:
- Install and configure the Claude Agent SDK
- Understand the available tools (Read, Write, Edit, Bash, Glob, Grep, WebSearch, WebFetch, Task)
- Manage permissions using permission modes and canUseTool callbacks
- Build session-aware agents that maintain state across interactions
- Compose subagents for parallel processing
- Create skills that encode reusable intelligence
- Deploy production agents with monitoring and cost tracking
By the end, you'll have built a production-grade agent that could become a Digital FTE.
Try With AI
Use Claude Code (or any Claude Agent SDK environment) for these prompts.
Prompt 1: Experience the Agent Loop Difference
I have a directory with 50 Python files. I want to count how many
have a specific function called "validate_input". I'll describe the
task; you execute it autonomously, without me implementing a loop.
Read all files in the current directory, search for "validate_input"
function definitions, and report a summary.
What you're learning: How Claude Agent SDK executes complex multi-step tasks without you managing the iteration loop. Notice that you specified what (find validate_input), not how (iterate through files, parse Python, etc.).
Prompt 2: Understand Tool Composition
Now I want you to create a small test file that validates the validate_input
function exists in at least 3 of those files. Write a script that imports
from those files and tests the function. Show me the test results.
What you're learning: How Agent SDK composes multiple tools (Read → understand files → Write new test file → Bash execute tests). Each tool is autonomous. You don't implement orchestration.
Prompt 3: Compare With Manual API Loop
Explain to me: If I were using the Claude API (Client SDK) to do this same
task, what code would I need to write to implement the agent loop? What's
different about how Agent SDK handles it?
What you're learning: The architectural difference. With the Client SDK, you'd write code to: 1) Parse Claude's tool requests, 2) Execute the tools yourself, 3) Send results back, 4) Repeat until done. The Agent SDK takes all four steps off your plate.
Safety Note: As you build agents with the SDK, remember that agent autonomy comes with responsibility. Always test agents in non-production environments first. Use permission callbacks to restrict what agents can do. File checkpointing exists exactly for this—so you can safely experiment knowing changes are reversible.
Reflect on Your Skill
You built a claude-agent skill in Lesson 0. Test and improve it based on what you learned.
Test Your Skill
Using my claude-agent skill, explain the difference between Claude Agent SDK and Claude API.
Does my skill cover autonomous tool execution vs manual loops?
Identify Gaps
Ask yourself:
- Did my skill explain the SDK vs API distinction clearly?
- Did it cover the unique features (skills ecosystem, file checkpointing, runtime permissions)?
Improve Your Skill
If you found gaps:
My claude-agent skill is missing coverage of SDK architecture and unique features.
Update it to include:
- SDK vs API comparison
- Autonomous tool execution patterns
- Skills ecosystem benefits