Skip to main content
Version: Next

Deployment Overview

Agent Kernel supports multiple deployment modes for different use cases.

Deployment Modes

Quick Comparison

ModeBest ForScalabilityCold StartCostFault Tolerance
Local/CLIDevelopment, testingN/AInstantFreeManual restart
REST APIWeb apps, APIsManual scalingInstantServer costsManual
AWS LambdaVariable loadAuto-scaling1-3sPay per useHigh - Auto-retry, multi-AZ
AWS ECSConsistent loadAuto-scalingInstantRunning containersVery High - Multi-AZ, auto-recovery
MCP ServerAI integrationsManualInstantServer costsManual
A2A ServerAgent networksManualInstantServer costsManual

Local Development

Uses agentkernel.CLI module.

python my_agent.py
  • Interactive CLI
  • Instant feedback
  • No deployment needed

Learn more →

REST API Server

Uses agentkernel.RESAPI module.

python my_agent.py
  • HTTP endpoints
  • Easy integration
  • Self-hosted

Learn more →

AWS Serverless

Uses Agent Kernel terraform modules

# Configure the modules and run
terraform init && terraform apply
  • Lambda functions
  • API Gateway
  • Auto-scaling
  • Pay per request

Learn more →

AWS Containerized

Uses Agent Kernel terraform modules

# Configure the modules and run
terraform init && terraform apply
  • ECS Fargate
  • Application Load Balancer
  • Consistent performance
  • Lower latency

Learn more →

Choosing a Deployment Mode

Development

Local/CLI: Fast iteration, no setup

Small Web App

REST API: Simple, self-hosted

Variable Traffic

AWS Lambda: Auto-scales, pay per use

High Traffic

AWS ECS: Consistent performance

AI Integration

MCP/A2A: Protocol-based integration

Fault Tolerance Considerations

Agent Kernel provides different levels of fault tolerance depending on your deployment mode:

Production-Grade Fault Tolerance

AWS ECS/Fargate offers the highest level of fault tolerance:

  • Multi-AZ task distribution for zone-level failures
  • Automatic task replacement on failures
  • Health check-based routing
  • Configurable auto-scaling
  • Rolling deployments with zero downtime
  • Application Load Balancer with health monitoring

Learn more about ECS fault tolerance →

AWS Lambda provides built-in fault tolerance:

  • Serverless architecture with automatic scaling
  • Multi-AZ execution by default
  • Automatic retry on failures
  • No infrastructure management
  • Inherently resilient to hardware failures

Learn more about serverless fault tolerance →

State Persistence

Both production deployment modes support resilient state management:

  • DynamoDB: Multi-AZ replication, automatic backups, 99.999% SLA
  • Redis: Cluster mode with automatic failover, replication

Learn more about fault tolerance →

Next Steps

💬 Ask AI Assistant

Get instant help with Agent Kernel documentation, examples, and more

AI Assistant