HasteKit LLM Gateway is a high-performance, open-source gateway written in Golang that provides a unified interface to interact with large language models from various providers. It sits between your application and LLM providers, offering access control via virtual keys, centralized configuration, and deep observability through OpenTelemetry traces.
Why HasteKit
- No-Code Agent Builder: Compose agents with model parameters, system prompts, MCP tools, skills, and structured output schemas—without writing code
- Multi-Agent Orchestration: Invoke sub-agents as tools or hand off conversations mid-flight to specialist agents instead of overloading a single monolithic prompt
- Durable Execution: Run agents on Temporal or Restate where every step is checkpointed and every retry is automatic—or call them in-process for low-latency workloads
- Versioning & Aliases: Ship immutable agent snapshots with production aliases so you can iterate safely and roll back instantly
- Unified LLM Gateway: Drop-in replacement for OpenAI, Anthropic, and Gemini SDKs with virtual keys, rate limits, and full request/response logging
- Deep Observability: Every agent run, tool call, and LLM request emits OpenTelemetry traces with token usage, latency, and cost attribution
LLM Gateway
Underneath the Agent Builder, HasteKit ships a production LLM gateway that any application can use directly. Point any OpenAI, Anthropic, or Gemini SDK at the gateway and unlock virtual keys, rate limiting, and request logging without code changes.
Virtual Keys
Protect provider API keys. Issue virtual keys with per-project rate limits and revoke them instantly.
Drop-in Replacement
Point your existing OpenAI, Anthropic, or Gemini client at the gateway—just change the base URL.
Cost Tracking
Token usage, latency, and cost tracked per virtual key, per project, and per provider via ClickHouse-powered analytics.
Request Logging
Every LLM call is logged with full request/response details for debugging and compliance.
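To make the virtual key idea above concrete, here is a minimal sketch in Go of how a gateway could map a virtual key to a real provider key, enforce a per-key request limit, and honor revocation. It is an illustration only, not HasteKit's implementation: the `keyStore` type, its methods, the field names, and the fixed-window limiting strategy are all assumptions made for this example.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
	"time"
)

// virtualKey is a hypothetical record; the fields are illustrative,
// not HasteKit's actual schema.
type virtualKey struct {
	project     string
	providerKey string // real provider API key, never handed to clients
	limit       int    // max requests allowed per window
	revoked     bool
	windowStart time.Time
	count       int
}

// keyStore is an assumed in-memory store guarded by a mutex.
type keyStore struct {
	mu     sync.Mutex
	window time.Duration
	now    func() time.Time
	keys   map[string]*virtualKey
}

var (
	errUnknownKey  = errors.New("unknown virtual key")
	errRevoked     = errors.New("virtual key revoked")
	errRateLimited = errors.New("rate limit exceeded")
)

func newKeyStore(window time.Duration) *keyStore {
	return &keyStore{window: window, now: time.Now, keys: make(map[string]*virtualKey)}
}

// issue registers a virtual key for a project with a per-window request limit.
func (s *keyStore) issue(id, project, providerKey string, limit int) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.keys[id] = &virtualKey{project: project, providerKey: providerKey, limit: limit}
}

// revoke disables a virtual key; later resolve calls fail immediately.
func (s *keyStore) revoke(id string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	if k, ok := s.keys[id]; ok {
		k.revoked = true
	}
}

// resolve returns the provider key behind a virtual key, applying a
// simple fixed-window rate limit before doing so.
func (s *keyStore) resolve(id string) (string, error) {
	s.mu.Lock()
	defer s.mu.Unlock()
	k, ok := s.keys[id]
	if !ok {
		return "", errUnknownKey
	}
	if k.revoked {
		return "", errRevoked
	}
	now := s.now()
	if now.Sub(k.windowStart) >= s.window {
		k.windowStart, k.count = now, 0 // start a new window
	}
	if k.count >= k.limit {
		return "", errRateLimited
	}
	k.count++
	return k.providerKey, nil
}

func main() {
	store := newKeyStore(time.Minute)
	store.issue("vk-demo", "demo-project", "sk-provider-secret", 2)
	for i := 0; i < 3; i++ {
		_, err := store.resolve("vk-demo")
		fmt.Println("call", i, "error:", err)
	}
	store.revoke("vk-demo")
	_, err := store.resolve("vk-demo")
	fmt.Println("after revoke:", err)
}
```

The application only ever holds the virtual key, so revoking it cuts off access without rotating the underlying provider key. A production gateway would likely persist keys and use a smoother limiter (for example a token bucket), which this sketch leaves out.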
Drop-in SDK Replacement
Update the base URL and use your virtual key; no other changes are required.
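As a rough illustration in Go, the sketch below builds an OpenAI-style chat completion request with only the standard library, so the two things that change are visible: the base URL and the key. Several details are assumptions rather than documented HasteKit behavior: the gateway address `http://localhost:8080/v1`, the `/chat/completions` path being served under that base URL, the virtual key being sent as a bearer token, and the placeholder values `vk-demo` and `gpt-4o-mini`. The helper `newChatRequest` is hypothetical.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"strings"
)

// chatMessage and chatRequest follow the OpenAI chat completions request shape.
type chatMessage struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type chatRequest struct {
	Model    string        `json:"model"`
	Messages []chatMessage `json:"messages"`
}

// newChatRequest is a hypothetical helper: it targets baseURL (assumed to be
// the gateway) and authenticates with a virtual key instead of a provider key.
func newChatRequest(baseURL, virtualKey, model, prompt string) (*http.Request, error) {
	body, err := json.Marshal(chatRequest{
		Model:    model,
		Messages: []chatMessage{{Role: "user", Content: prompt}},
	})
	if err != nil {
		return nil, err
	}
	url := strings.TrimRight(baseURL, "/") + "/chat/completions"
	req, err := http.NewRequest(http.MethodPost, url, bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	// Assumption: the gateway accepts the virtual key as a bearer token.
	req.Header.Set("Authorization", "Bearer "+virtualKey)
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	// Placeholder gateway address and virtual key; replace with your own values.
	req, err := newChatRequest("http://localhost:8080/v1", "vk-demo", "gpt-4o-mini", "Hello")
	if err != nil {
		fmt.Println("build request:", err)
		return
	}
	fmt.Println(req.Method, req.URL)
	// To send it: resp, err := http.DefaultClient.Do(req)
}
```

With an official SDK the same idea applies: set the client's base URL option to the gateway address and pass the virtual key where the provider API key would normally go.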
Supported Providers
| Provider | Text | Image | Tool Calls | Reasoning |
|---|---|---|---|---|
| OpenAI | ✅ | ✅ | ✅ | ✅ |
| Anthropic | ✅ | ❌ | ✅ | ✅ |
| Gemini | ✅ | ✅ | ✅ | ✅ |
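If an application needs to branch on these capabilities, the table can be mirrored as a small lookup. The Go sketch below is only an illustration: the `capabilities` type, the lowercase provider and feature names, and the `supports` helper are assumptions for this example, and the values simply copy the table above rather than coming from any HasteKit API.

```go
package main

import "fmt"

// capabilities mirrors the columns of the provider table.
type capabilities struct {
	Text, Image, ToolCalls, Reasoning bool
}

// providers copies the table rows; the keys are illustrative identifiers.
var providers = map[string]capabilities{
	"openai":    {Text: true, Image: true, ToolCalls: true, Reasoning: true},
	"anthropic": {Text: true, Image: false, ToolCalls: true, Reasoning: true},
	"gemini":    {Text: true, Image: true, ToolCalls: true, Reasoning: true},
}

// supports reports whether a provider is listed as supporting a feature.
// Unknown providers and unknown features both report false.
func supports(provider, feature string) bool {
	c, ok := providers[provider]
	if !ok {
		return false
	}
	switch feature {
	case "text":
		return c.Text
	case "image":
		return c.Image
	case "tool_calls":
		return c.ToolCalls
	case "reasoning":
		return c.Reasoning
	}
	return false
}

func main() {
	for _, p := range []string{"openai", "anthropic", "gemini"} {
		fmt.Println(p, "image:", supports(p, "image"))
	}
}
```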
Next Steps
LLM Gateway
Connect providers, issue virtual keys, and route SDK calls through the gateway.
SDK Modes
Understand the difference between Gateway and Direct modes.