Callbacks & Guardrails
Your agent has been researching news, collecting financial data, and maintaining conversation state. But problems are surfacing: researchers are using it to scrape competitor websites, and tool responses come back as raw HTML instead of clean summaries.
This is where callbacks come in. Callbacks are user-defined functions that hook into specific points in your agent's execution lifecycle—before and after each major operation. They give you precise control to enforce safety policies, validate data, enhance responses, and manage execution flow without modifying the core agent code.
In this lesson, you'll implement the two most powerful callbacks for building reliable agents: before_tool_callback for guardrails and after_tool_callback for response enhancement. You'll see how returning None allows normal execution while returning a dict or LlmResponse can block operations or override results entirely.
The ADK Callback System
Google ADK provides six callback types that execute at different points in the agent's lifecycle. Understanding this architecture helps you choose the right callback for each use case.
Six Callback Types
| Callback | Executes | Best For | Return Value |
|---|---|---|---|
| before_agent_callback | Before agent processes request | Setup, request logging | None (usually) |
| after_agent_callback | After agent completes execution | Cleanup, response logging | None (usually) |
| before_model_callback | Before LLM is called | Input validation, prompt modification | None or LlmResponse |
| after_model_callback | After LLM responds | Response sanitization, output guardrails | None or LlmResponse |
| before_tool_callback | Before tool executes | Input validation, policy enforcement | None or dict (block) |
| after_tool_callback | After tool returns result | Response enhancement, logging, caching | None or dict (modify) |
Key principle: Returning None from a before-callback allows normal execution to proceed. Returning a value overrides the default behavior:
- before_tool_callback returns a dict → tool execution is skipped; the dict becomes the result
- after_tool_callback returns a dict → the modified dict replaces the tool's original response
- before_model_callback returns an LlmResponse → the LLM call is skipped; the LlmResponse is processed in its place
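For example, a before_model_callback can short-circuit the model entirely. Here is a minimal sketch of that pattern; it assumes the LlmResponse type from google.adk.models and the content types from google.genai, so verify the import paths and request shape against your installed ADK version:

from google.adk.models import LlmResponse
from google.genai import types

def block_off_topic_requests(callback_context, llm_request):
    """Skip the LLM call when the latest message contains a banned phrase."""
    text = ""
    if llm_request.contents and llm_request.contents[-1].parts:
        text = llm_request.contents[-1].parts[0].text or ""
    if "competitor-intel" in text.lower():  # placeholder policy check
        # Returning an LlmResponse skips the model call; this becomes the reply
        return LlmResponse(
            content=types.Content(
                role="model",
                parts=[types.Part(text="That request is outside my research policy.")],
            )
        )
    return None  # returning None lets the LLM call proceed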
Let's focus on the two most important for building guardrails: before_tool_callback and after_tool_callback.
Before-Tool Callback: Domain Filtering & Policy Enforcement
Use case: You want to block your agent from searching certain domains or enforce rate limits on API calls.
Pattern: Domain Filtering
Here's how to prevent tool execution when search targets prohibited domains:
BLOCKED_DOMAINS = [
    "wikipedia.org",
    "reddit.com",
    "youtube.com",
    "medium.com",
]

def filter_news_sources_callback(tool, args, tool_context):
    """
    Blocks search requests targeting certain domains.

    Args:
        tool: The tool object being invoked (has name, description)
        args: Dictionary of tool arguments, e.g. {"query": "..."}
        tool_context: Provides access to state and session info

    Returns:
        None if the tool should execute normally;
        a dict with "error" and "reason" if the tool should be blocked
    """
    if tool.name == "google_search":
        query = args.get("query", "").lower()
        # Check whether the query targets a blocked domain
        for domain in BLOCKED_DOMAINS:
            if f"site:{domain}" in query:
                print(f"BLOCKED: {query}")
                return {
                    "error": "blocked_source",
                    "reason": f"Searches targeting {domain} are not allowed in this environment."
                }
        print(f"ALLOWED: {query}")
    # Return None to allow tool execution
    return None
How it works:
- Check condition: Examine tool name and arguments
- Return None if allowed: Tool executes normally with its original arguments
- Return dict if blocked: Framework skips tool execution and uses your dict as the result
This callback prevents your agent from accidentally searching competitor sites, scraping Reddit discussions, or accessing YouTube transcripts.
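Before wiring the callback into an agent, you can sanity-check its logic directly; the SimpleNamespace below is just a stand-in for the tool object ADK would pass, not an ADK type:

from types import SimpleNamespace

fake_tool = SimpleNamespace(name="google_search")

# A query targeting a blocked domain is intercepted
print(filter_news_sources_callback(fake_tool, {"query": "site:reddit.com AI news"}, None))
# -> {'error': 'blocked_source', 'reason': 'Searches targeting reddit.com are not allowed ...'}

# A clean query returns None, so the tool would execute normally
print(filter_news_sources_callback(fake_tool, {"query": "AI news this week"}, None))
# -> None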
Agent Configuration
Register the callback with your agent:
from google.adk.agents import Agent
from google.adk.tools import google_search

agent = Agent(
    name="guarded_research_agent",
    model="gemini-2.5-flash",
    instruction="""You are a news researcher with strict source guidelines.
You can search Google News and technical blogs.
You cannot search Wikipedia, Reddit, YouTube, or Medium.
If a query targets a blocked source, you'll receive an error message.
In that case, rephrase your search to target allowed sources only.""",
    tools=[google_search],
    before_tool_callback=[filter_news_sources_callback]
)
Output Example:
When your agent tries to search "site:wikipedia.org machine learning advances":
BLOCKED: site:wikipedia.org machine learning advances
{
    "error": "blocked_source",
    "reason": "Searches targeting wikipedia.org are not allowed in this environment."
}
The agent receives this error, realizes it violated a policy, and reformulates its search without the blocked domain.
After-Tool Callback: Response Enhancement & Transparency
Use case: You want to add metadata to tool responses, extract sources for transparency, or transform raw results into cleaner formats.
Pattern: Source Transparency Logging
Here's how to extract domains from search results and add a transparency log:
import re
from urllib.parse import urlparse

def inject_process_log_after_search(tool, args, tool_context, tool_response):
    """
    Enhances search results by extracting and logging source domains.

    Args:
        tool: The tool object (has name, description)
        args: Tool arguments, e.g. {"query": "..."}
        tool_context: Provides access to state and session info
        tool_response: The raw response from the tool

    Returns:
        An enhanced dict with both results and a source transparency log,
        or None to leave the response unchanged
    """
    if tool.name == "google_search" and isinstance(tool_response, str):
        # Extract all URLs from the response
        urls = re.findall(r'https?://[^\s/]+', tool_response)
        # Reduce to unique domains
        unique_domains = sorted(set(urlparse(url).netloc for url in urls))
        if unique_domains:
            # Create a transparency log entry
            sourcing_log = f"Sources: {', '.join(unique_domains)}"
            # Read the existing process log from state
            current_log = tool_context.state.get('process_log', [])
            # Update state, newest entry first
            tool_context.state['process_log'] = [sourcing_log] + current_log
            # Return an enhanced response with both results and metadata
            return {
                "search_results": tool_response,
                "process_log": tool_context.state.get('process_log', [])
            }
    # Return None to leave responses from other tools unchanged
    return None
How it works:
- Extract metadata: Parse URLs from tool results
- Update state: Store sourcing information in tool_context.state
- Return enhanced dict: Include both original results and transparency metadata
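You can exercise this callback the same way, with stand-ins for the tool and the context (again, the SimpleNamespace objects are test doubles, not ADK types):

from types import SimpleNamespace

fake_tool = SimpleNamespace(name="google_search")
fake_ctx = SimpleNamespace(state={})

raw = "Top stories: https://techcrunch.com/ai-story and https://arxiv.org/abs/2501.00001"
enhanced = inject_process_log_after_search(fake_tool, {"query": "ai news"}, fake_ctx, raw)
print(enhanced["process_log"])  # ['Sources: arxiv.org, techcrunch.com']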
Agent Configuration with Enhanced Instructions
agent = Agent(
    name="transparent_research_agent",
    model="gemini-2.5-flash",
    instruction="""You are a news researcher with transparency requirements.
When you search for information, the google_search tool returns results in a special format:

{
    "search_results": "...",
    "process_log": ["Sources: domain1.com, domain2.com", ...]
}

The process_log shows which domains your searches accessed. Include this transparency information in your final report:
"I researched this topic using: [sources from process_log]"
This builds user trust by showing your research methodology.""",
    tools=[google_search],
    after_tool_callback=[inject_process_log_after_search]
)
Output Example:
When agent searches "AI safety breakthroughs 2025":
{
    "search_results": "[Full search result text...]",
    "process_log": [
        "Sources: news.google.com, techcrunch.com, arxiv.org",
        "Sources: github.com, huggingface.co"
    ]
}
Your agent sees this structured response, extracts the sources, and includes transparency in its report: "I researched this across TechCrunch, ArXiv, and GitHub to ensure current information."
Combining Callbacks: Policy Enforcement + Transparency
Real-world agents often combine multiple callbacks. Here's a complete setup:
# Combine both callbacks
agent = Agent(
    name="controlled_news_agent",
    model="gemini-2.5-flash",
    instruction="""You are an AI News Podcast Agent.
Your research tools have safety guardrails:
- Some domains are blocked (you'll receive error messages)
- All searches are logged for transparency
- You can see which sources you accessed via process_log

When research is blocked, reformulate your queries to access allowed sources.
Always cite your sources in the podcast script.""",
    tools=[google_search, get_financial_context],  # get_financial_context: custom tool from earlier lessons
    before_tool_callback=[filter_news_sources_callback],
    after_tool_callback=[inject_process_log_after_search]
)
Execution flow:
- Agent decides to search "AI news this week"
- before_tool_callback: Validates it doesn't target blocked domains → ALLOWED
- Tool executes: google_search("AI news this week") → returns raw results
- after_tool_callback: Extracts sources, updates process_log → returns enhanced dict
- Agent receives: {"search_results": "...", "process_log": [...]}
- Agent can see which domains it searched and include that transparency in its script (a runnable sketch of this flow follows)
State Management in Callbacks
Callbacks have access to tool_context.state, a dictionary that persists across the agent's session. Use it for:
Audit Trails
Track which tools were called with what arguments:
from datetime import datetime

def audit_trail_callback(tool, args, tool_context):
    """Log all tool calls for audit purposes."""
    if 'audit_log' not in tool_context.state:
        tool_context.state['audit_log'] = []
    tool_context.state['audit_log'].append({
        "tool": tool.name,
        "args": args,
        "timestamp": datetime.now().isoformat()
    })
    return None  # Allow execution to proceed
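To read the trail back, you can dump it when the agent finishes, for example in an after_agent_callback. A sketch, assuming the agent-callback signature (a single callback_context whose state mirrors the session state):

def dump_audit_log_callback(callback_context):
    """Print the audit trail once the agent completes."""
    for entry in callback_context.state.get('audit_log', []):
        print(f"{entry['timestamp']}  {entry['tool']}  {entry['args']}")
    return None  # don't replace the agent's response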
Rate Limiting
Prevent excessive API calls:
def rate_limit_callback(tool, args, tool_context):
    """Allow at most 5 google_search calls per session."""
    if tool.name == "google_search":
        call_count = tool_context.state.get('search_count', 0)
        if call_count >= 5:
            return {
                "error": "rate_limit_exceeded",
                "reason": "Maximum 5 searches per session reached."
            }
        tool_context.state['search_count'] = call_count + 1
    return None
Caching
Avoid redundant API calls:
def caching_callback(tool, args, tool_context):
    """Serve cached search results by query (the before-tool half)."""
    if tool.name == "google_search":
        query = args.get("query", "")
        cache_key = f"search:{query}"
        # Check whether we've run this exact search before
        if cache_key in tool_context.state:
            print(f"Using cached result for: {query}")
            return tool_context.state[cache_key]
    return None

def cache_store_callback(tool, args, tool_context, tool_response):
    """Store successful searches in the cache (the after-tool half)."""
    if tool.name == "google_search":
        query = args.get("query", "")
        cache_key = f"search:{query}"
        tool_context.state[cache_key] = tool_response
    # Return None to keep the original response unchanged
    return None
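The two halves only take effect once both are registered: the read side runs before the tool, the write side after it (agent name and instruction here are illustrative):

agent = Agent(
    name="cached_research_agent",
    model="gemini-2.5-flash",
    instruction="You are a researcher. Identical queries may be served from a cache.",
    tools=[google_search],
    before_tool_callback=[caching_callback],
    after_tool_callback=[cache_store_callback]
)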
Callback Return Behavior Reference
This is the critical distinction between allowing and blocking:
| Callback Type | Return None | Return Value |
|---|---|---|
| before_tool_callback | Tool executes with original args | Tool execution skipped; dict becomes result |
| after_tool_callback | Original response returned unchanged | Response replaced with returned dict |
| before_model_callback | LLM call proceeds normally | LLM call skipped; LlmResponse processed |
Example of blocking vs allowing:
def example_callback(tool, args, tool_context):
    if condition_met:  # condition_met: stand-in for your policy check
        return {"error": "blocked"}  # Block execution
    else:
        return None  # Allow execution
Callback Best Practices
1. Keep Callbacks Focused
Each callback should do one thing well:
# Good: Single responsibility
def filter_domains_callback(tool, args, tool_context):
    """Only filters; doesn't log, cache, or modify."""
    if condition:  # condition: stand-in for a single policy check
        return {"error": "blocked"}
    return None

# Avoid: Multiple concerns
def overloaded_callback(tool, args, tool_context):
    """Filters AND logs AND caches AND validates."""
    ...  # Too much logic in one place
2. Handle Errors Gracefully
Don't let callback exceptions crash your agent:
def safe_callback(tool, args, tool_context):
    try:
        # Your callback logic
        return None
    except Exception as e:
        print(f"Callback error: {e}")
        # Return None to allow tool execution rather than crashing
        return None
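If several callbacks need the same protection, a small decorator avoids repeating the try/except. This is a sketch of an ordinary Python pattern, not an ADK feature:

import functools

def fail_open(callback):
    """Wrap a callback so an exception allows execution instead of crashing."""
    @functools.wraps(callback)
    def wrapper(*args, **kwargs):
        try:
            return callback(*args, **kwargs)
        except Exception as e:
            print(f"Callback {callback.__name__} failed: {e}")
            return None  # fail open: let the operation proceed
    return wrapper

@fail_open
def filter_domains_safe(tool, args, tool_context):
    # ... your filtering logic ...
    return None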
3. Update Agent Instructions
Always document callback behavior in agent instructions so the agent understands its constraints:
agent = Agent(
    name="constrained_agent",
    model="gemini-2.5-flash",
    instruction="""You have access to google_search with the following constraints:
- Blocked domains: Wikipedia, Reddit, YouTube, Medium
- Maximum searches: 5 per conversation
- Rate limit: 1 search per second

All your searches are logged for transparency. Include sources in your final report.""",
    before_tool_callback=[filter_domains, rate_limit],  # callbacks following the patterns above
)
Try With AI
Setup: You'll build a research agent with both guardrails and transparency.
Prompt 1: Implement Domain Filtering
Copy and execute this code in your Python environment:
from google.adk.agents import Agent
from google.adk.tools import google_search

BLOCKED_DOMAINS = ["reddit.com", "twitter.com", "youtube.com"]

def filter_social_media(tool, args, tool_context):
    """Block searches on social media platforms."""
    if tool.name == "google_search":
        query = args.get("query", "").lower()
        for domain in BLOCKED_DOMAINS:
            if f"site:{domain}" in query:
                return {
                    "error": "blocked_domain",
                    "reason": f"{domain} is not allowed for research in this context."
                }
    return None

agent = Agent(
    name="filtered_research_agent",
    model="gemini-2.5-flash",
    instruction="You are a researcher. You can search the web but not social media platforms.",
    tools=[google_search],
    before_tool_callback=[filter_social_media]
)

# Test: the agent tries to search Twitter
# (depending on your ADK version you may need to drive the agent through a
# Runner, e.g. InMemoryRunner, rather than calling agent.run() directly)
response = agent.run("Search site:twitter.com AI trends 2025")
print(response)
What you're learning: How returning a dict from before_tool_callback blocks tool execution and provides feedback to the agent.
Prompt 2: Add Source Transparency
Enhance your agent with after_tool_callback:
import re
from urllib.parse import urlparse

def add_source_transparency(tool, args, tool_context, tool_response):
    """Add source domains to every search result."""
    if tool.name == "google_search" and isinstance(tool_response, str):
        urls = re.findall(r'https?://[^\s/]+', tool_response)
        domains = sorted(set(urlparse(url).netloc for url in urls))
        if domains:
            if 'sources_accessed' not in tool_context.state:
                tool_context.state['sources_accessed'] = []
            tool_context.state['sources_accessed'].extend(domains)
            return {
                "results": tool_response,
                "sources": domains
            }
    return None  # leave other responses unchanged
# Update the agent with both callbacks
agent = Agent(
    name="transparent_research_agent",
    model="gemini-2.5-flash",
    instruction="""You are a transparent researcher.
Your search results include a 'sources' list showing which domains you accessed.
In your final report, cite these sources: "I accessed information from: [sources]"
You cannot search social media (Reddit, Twitter, YouTube).""",
    tools=[google_search],
    before_tool_callback=[filter_social_media],
    after_tool_callback=[add_source_transparency]
)

# Test: combined guardrails + transparency
response = agent.run("Find recent AI safety research and cite your sources")
print(response)
What you're learning: How combining before and after callbacks creates both safety and transparency in agent behavior.
Prompt 3: State Management & Rate Limiting
Add rate limiting to prevent excessive API consumption:
def enforce_search_limit(tool, args, tool_context):
    """Allow a maximum of 5 searches per session."""
    if tool.name == "google_search":
        search_count = tool_context.state.get('search_count', 0)
        if search_count >= 5:
            return {
                "error": "rate_limit_exceeded",
                "reason": f"Maximum 5 searches reached. Current usage: {search_count}/5"
            }
        tool_context.state['search_count'] = search_count + 1
        print(f"Search {search_count + 1}/5")
    return None
# Agent with three callbacks: filtering, rate limiting, transparency
agent = Agent(
    name="fully_controlled_agent",
    model="gemini-2.5-flash",
    instruction="""You are a controlled research agent with three constraints:
1. Domain filtering: Cannot search Reddit, Twitter, YouTube
2. Rate limiting: Maximum 5 searches per conversation
3. Transparency: All sources must be cited

When you hit rate limits or blocked domains, adapt your search strategy.""",
    tools=[google_search],
    before_tool_callback=[filter_social_media, enforce_search_limit],
    after_tool_callback=[add_source_transparency]
)

# Test: multiple searches with rate limiting
response = agent.run("""
Find information about three topics:
1. Latest AI safety research
2. Emerging AI applications
3. AI ethics frameworks
Show me which sources you used for each.""")
print(response)
What you're learning: How to combine multiple before_tool_callbacks to create layered policy enforcement, and how to track state across callback executions.
When you're done:
- Run each prompt and observe how callbacks control behavior
- Modify BLOCKED_DOMAINS and the search limit inside enforce_search_limit; see how the agent adapts
- Check tool_context.state to understand persistence across tool calls
- Write one additional callback (e.g., authentication, cost tracking) to solidify your understanding
Reflect on Your Skill
You built a google-adk skill in Lesson 0. Test and improve it based on what you learned.
Test Your Skill
Using my google-adk skill, implement before_tool_callback and after_tool_callback for guardrails.
Does my skill demonstrate domain filtering, response enhancement, and state-based rate limiting?
Identify Gaps
Ask yourself:
- Did my skill include the six callback types and when each executes in the agent lifecycle?
- Did it explain callback return behavior (None vs dict vs LlmResponse)?
Improve Your Skill
If you found gaps:
My google-adk skill is missing callback and guardrail patterns.
Update it to include:
- before_tool_callback for input validation and policy enforcement
- after_tool_callback for response enhancement and transparency
- Callback return patterns (return None to allow, return dict to block/modify)
- State management in callbacks (tool_context.state for audit trails, rate limiting, caching)
- Agent instructions that reference callback behavior
- Multiple callback registration with proper execution order