Overview
Each deployment strategy is designed for specific use cases. The system can auto-select the best strategy, or you can specify one explicitly.
LOCAL
Run as a local Python process. Fastest for development and testing.
Features
- No containerization overhead
- Direct file system access
- Easiest debugging
- No additional setup required
Usage
How It Works
- Adapter creates a `run.py` entry point
- Runner imports and calls the main function
- Results returned directly
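
The three steps above can be sketched with the standard library alone. This is a minimal illustration, not the project's actual adapter or runner: the contents of `run.py`, the `main(inputs)` signature, and the helper names `create_entry_point` and `run_local` are all assumptions.

```python
import importlib.util
import tempfile
from pathlib import Path

# Hypothetical entry-point source; the real adapter's output is not shown here.
RUN_PY = '''\
def main(inputs):
    """Entry point called by the runner."""
    return {"echo": inputs}
'''


def create_entry_point(workdir: Path) -> Path:
    """Adapter step: write run.py into the working directory."""
    path = workdir / "run.py"
    path.write_text(RUN_PY)
    return path


def run_local(entry_point: Path, inputs: dict) -> dict:
    """Runner step: import run.py in-process and call its main function."""
    spec = importlib.util.spec_from_file_location("run", entry_point)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    # The result is returned directly, with no serialization in between.
    return module.main(inputs)


def demo() -> dict:
    """Create the entry point in a scratch directory and run it once."""
    with tempfile.TemporaryDirectory() as tmp:
        entry = create_entry_point(Path(tmp))
        return run_local(entry, {"x": 1})
```

Because the module is loaded in the same process, exceptions and breakpoints behave as in any other Python code, which is consistent with the "easiest debugging" point above.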
Configuration
DOCKER
Run in an isolated Docker container.
Features
- Environment isolation
- Reproducible builds
- Easy dependency management
- Cross-platform compatibility
Prerequisites
- Docker installed and running
Usage
How It Works
- Adapter creates `Dockerfile` and `docker-compose.yml`
- Runner builds and starts the container
- Inputs/outputs mapped via volumes
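
As a rough sketch of the steps above, the snippet below writes the two files and records the Docker Compose command a runner could invoke. The base image, container paths, service name `app`, and helper names are assumptions made for illustration; the real templates may differ.

```python
import tempfile
from pathlib import Path

# Illustrative templates only; base image, paths, and service name are assumptions.
DOCKERFILE = """\
FROM python:3.11-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
CMD ["python", "run.py"]
"""

COMPOSE = """\
services:
  app:
    build: .
    volumes:
      - {inputs}:/app/inputs:ro
      - {outputs}:/app/outputs
"""

# Runner step: build the image and start the container in one command.
BUILD_AND_START = ["docker", "compose", "up", "--build"]


def write_docker_files(
    workdir: Path, inputs: str = "./inputs", outputs: str = "./outputs"
) -> list[Path]:
    """Adapter step: write Dockerfile and docker-compose.yml into workdir."""
    dockerfile = workdir / "Dockerfile"
    compose = workdir / "docker-compose.yml"
    dockerfile.write_text(DOCKERFILE)
    # Host directories are mapped into the container as volumes.
    compose.write_text(COMPOSE.format(inputs=inputs, outputs=outputs))
    return [dockerfile, compose]


def demo() -> dict[str, str]:
    """Write both files to a scratch directory and return their contents by name."""
    with tempfile.TemporaryDirectory() as tmp:
        return {p.name: p.read_text() for p in write_docker_files(Path(tmp))}
```

Mounting the input directory read-only is a design choice in this sketch, so the container can only write to the output volume.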
Dockerfile Template
MODAL
Deploy to Modal.com for serverless GPU compute.
Features
- Serverless scaling
- GPU support (A100, H100)
- Pay-per-use pricing
- Fast cold starts
Prerequisites
- Modal account and CLI
- `modal` package installed
Usage
How It Works
- Adapter creates `modal_app.py` with Modal decorators
- Runner deploys to Modal
- HTTP endpoint for inference
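
The snippet below sketches what a generated `modal_app.py` might look like by rendering it from a template. The app name, the `infer` function, and the helper `render_modal_app` are hypothetical. The sketch also assumes a recent Modal release in which the web decorator is `modal.fastapi_endpoint`; older releases used a different decorator name, so check the version you have installed.

```python
import ast

# Assumed template: app name, GPU type, and function name are placeholders.
MODAL_APP_TEMPLATE = '''\
import modal

app = modal.App("{app_name}")
image = modal.Image.debian_slim().pip_install("fastapi[standard]")


@app.function(image=image, gpu="{gpu}")
@modal.fastapi_endpoint(method="POST")
def infer(payload: dict) -> dict:
    # Replace with a call into the wrapped project's main function.
    return {{"received": payload}}
'''


def render_modal_app(app_name: str, gpu: str = "A100") -> str:
    """Adapter step: render the source of modal_app.py."""
    source = MODAL_APP_TEMPLATE.format(app_name=app_name, gpu=gpu)
    ast.parse(source)  # fail early if the rendered file is not valid Python
    return source
```

With the rendered file saved as `modal_app.py`, running `modal deploy modal_app.py` publishes the app, and Modal reports the URL of the HTTP endpoint.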
Modal App Template
BENTOML
Deploy with BentoML for production ML serving.
Features
- Model versioning
- Automatic batching
- Prometheus metrics
- Kubernetes-ready
Prerequisites
- `bentoml` package installed
Usage
How It Works
- Adapter creates `service.py` with BentoML decorators
- Runner builds and serves the Bento
- HTTP API for inference
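
A generated `service.py` could look roughly like the template below, which uses the class-based `@bentoml.service` and `@bentoml.api` decorators from BentoML 1.2 and later. The class name, the `predict` method, and the helpers `render_service` and `serve_command` are assumptions for illustration only.

```python
import ast

# Assumed template; the class name and the predict signature are placeholders.
SERVICE_TEMPLATE = '''\
import bentoml


@bentoml.service
class {class_name}:
    @bentoml.api
    def predict(self, text: str) -> str:
        # Replace with a call into the wrapped project's main function.
        return text.upper()
'''


def render_service(class_name: str) -> str:
    """Adapter step: render the source of service.py."""
    if not class_name.isidentifier():
        raise ValueError(f"not a valid class name: {class_name!r}")
    source = SERVICE_TEMPLATE.format(class_name=class_name)
    ast.parse(source)  # fail early if the rendered file is not valid Python
    return source


def serve_command(class_name: str) -> list[str]:
    """Runner step: the CLI invocation that serves the class from service.py."""
    return ["bentoml", "serve", f"service:{class_name}"]
```

Each method decorated with `@bentoml.api` is exposed as an HTTP route named after the method, so this sketch would serve `POST /predict`.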
BentoML Service Template
LANGGRAPH
Deploy as a LangGraph agent workflow.
Features
- Stateful agent execution
- Tool integration
- Streaming support
- Checkpoint persistence
Prerequisites
- `langgraph` package installed
Usage
How It Works
- Adapter creates `graph.py` with LangGraph nodes
- Runner compiles and runs the graph
- Streaming or batch execution
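
To make the steps above concrete, here is a sketch of a single-node `graph.py`, held as a source string so it can be checked without `langgraph` installed. The `State` schema, the `process` node, and the helper `top_level_names` are hypothetical; a real generated graph would likely contain more nodes and tool calls.

```python
import ast

# Assumed content of a generated graph.py; state schema and node are placeholders.
GRAPH_SOURCE = '''\
from typing import TypedDict

from langgraph.graph import END, START, StateGraph


class State(TypedDict):
    text: str


def process(state: State) -> dict:
    # Replace with a call into the wrapped project's main function.
    return {"text": state["text"].strip()}


builder = StateGraph(State)
builder.add_node("process", process)
builder.add_edge(START, "process")
builder.add_edge("process", END)
graph = builder.compile()

if __name__ == "__main__":
    # Batch execution: run to completion and print the final state.
    print(graph.invoke({"text": "  hello  "}))
    # Streaming execution: print updates as each node finishes.
    for update in graph.stream({"text": "  hello  "}):
        print(update)
'''


def top_level_names(source: str) -> list[str]:
    """Return the classes and functions defined at module level, in order."""
    tree = ast.parse(source)
    return [
        node.name
        for node in tree.body
        if isinstance(node, (ast.ClassDef, ast.FunctionDef))
    ]
```

The same compiled graph serves both modes: `invoke` returns the final state, while `stream` yields intermediate updates.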
AUTO Selection
When `DeployStrategy.AUTO` is used, the selector analyzes the code to choose a strategy.
Selection Criteria
| Factor | LOCAL | DOCKER | MODAL | BENTOML |
|---|---|---|---|---|
| Simple script | ✅ | | | |
| Complex deps | | ✅ | | |
| GPU required | | | ✅ | |
| Production ML | | | | ✅ |
| Isolation needed | | ✅ | ✅ | ✅ |
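
One way to picture the selection logic is a small rule-based function. This sketch assumes the checkmarks map simple scripts to LOCAL, complex dependencies to DOCKER, GPU requirements to MODAL, and production ML to BENTOML. Only `DeployStrategy.AUTO` is named in this document; the other enum members, the `CodeProfile` fields, and the priority order of the rules are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class DeployStrategy(Enum):
    AUTO = "auto"
    LOCAL = "local"
    DOCKER = "docker"
    MODAL = "modal"
    BENTOML = "bentoml"
    LANGGRAPH = "langgraph"


@dataclass
class CodeProfile:
    """Hypothetical summary of what the selector learned from the code."""

    needs_gpu: bool = False
    production_ml: bool = False
    complex_deps: bool = False
    needs_isolation: bool = False


def select_strategy(profile: CodeProfile) -> DeployStrategy:
    """Resolve AUTO to a concrete strategy using the assumed criteria."""
    # The priority order is an assumption; the criteria do not rank the factors.
    if profile.needs_gpu:
        return DeployStrategy.MODAL
    if profile.production_ml:
        return DeployStrategy.BENTOML
    if profile.complex_deps or profile.needs_isolation:
        return DeployStrategy.DOCKER
    # A simple script with no special requirements runs locally.
    return DeployStrategy.LOCAL
```

For example, `select_strategy(CodeProfile(complex_deps=True))` resolves to `DeployStrategy.DOCKER`, and a default `CodeProfile()` resolves to `DeployStrategy.LOCAL`.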
Strategy Comparison
| Strategy | Startup | Cost | Isolation | GPU | Production |
|---|---|---|---|---|---|
| LOCAL | Fast | Free | None | Local | No |
| DOCKER | Medium | Free | Full | Via nvidia-docker | Yes |
| MODAL | Medium | Pay-per-use | Full | Yes | Yes |
| BENTOML | Medium | Self-hosted | Full | Yes | Yes |
| LANGGRAPH | Fast | Depends | None | Via backend | Yes |