Complete AGI Agent Framework — Production-Ready with 17 Advanced Features
State-of-the-Art (March 2026 Ultimate Stack)
Krab is a comprehensive, production-ready AGI framework built for the 2026 AI landscape. It features 17 advanced capabilities including image generation, code execution, desktop automation, web browsing, voice processing, multi-agent collaboration, enterprise security, and more.
- 🚀 Production-Ready: All 17 features implemented and tested
- 🛡️ Enterprise-Grade: Security, analytics, and compliance built-in
- 🔧 Developer-Friendly: Complete SDK and integration tools
- ⚡ High Performance: < 1s startup, parallel execution
- 🌐 Multi-Provider: 15+ LLM providers supported
graph TB
User[User Input] --> CLI[CLI Interface]
CLI --> Agent[AGI Agent Core]
Agent --> Memory[Conversation Memory]
Agent --> Tools[Tool Registry]
Agent --> LLM[LLM Provider]
Tools --> Creative[🎨 Creative AI]
Tools --> Automation[🖥️ Automation]
Tools --> Collaboration[🤝 Collaboration]
Tools --> Enterprise[📊 Enterprise]
Creative --> ImageGen[Image Generation]
Creative --> Voice[Voice Intelligence]
Automation --> Desktop[Desktop Control]
Automation --> Web[Web Automation]
Automation --> Code[Code Execution]
Collaboration --> MultiAgent[Multi-Agent System]
Collaboration --> MCP[MCP Integration]
Collaboration --> Scheduler[Task Scheduling]
Enterprise --> Analytics[Advanced Analytics]
Enterprise --> Security[Security System]
Enterprise --> Cloud[Cloud Deployment]
LLM --> Gemini[Gemini]
LLM --> OpenAI[OpenAI]
LLM --> Anthropic[Anthropic]
LLM --> Local[Local Models]
Agent --> Response[AGI Response]
Response --> User
flowchart TD
Start([Start]) --> Input[User Input]
Input --> Think[🧠 Think: Generate Plan]
Think --> Tools{Use Tools?}
Tools -->|Yes| Execute[⚡ Execute Tools]
Tools -->|No| Respond[💬 Generate Response]
Execute --> Success{Success?}
Success -->|Yes| Reflect[🔍 Reflect on Results]
Success -->|No| Retry[🔄 Retry with Different Approach]
Reflect --> Quality{Quality OK?}
Quality -->|Yes| Respond
Quality -->|No| Improve[📈 Improve Response]
Retry --> MaxRetries{Max Retries?}
MaxRetries -->|Yes| Error[❌ Report Error]
MaxRetries -->|No| Think
Improve --> Think
Respond --> Output[📤 Output Response]
Error --> Output
Output --> End([End])
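The think, execute, reflect, and retry flow in the chart above can be sketched as a small control loop. This is a minimal illustration, not Krab's implementation: every type and function name here (`Plan`, `LoopDeps`, `runAgentLoop`) is hypothetical, the callbacks are kept synchronous for brevity, and the cap on improvement rounds is an added assumption so the "Improve" branch cannot loop forever.

```typescript
// Hypothetical types: none of these names come from the Krab codebase.
type Plan = { toolCalls: string[]; draft: string };
type ToolResult = { ok: boolean; output: string };

interface LoopDeps {
  think: (input: string, feedback: string[]) => Plan; // "Think: Generate Plan"
  execute: (toolCall: string) => ToolResult; // "Execute Tools"
  qualityOk: (plan: Plan, results: ToolResult[]) => boolean; // "Reflect" and "Quality OK?"
  respond: (plan: Plan, results: ToolResult[]) => string; // "Generate Response"
}

function runAgentLoop(
  input: string,
  deps: LoopDeps,
  maxRetries = 3,
  maxImprovements = 2, // assumption: the chart itself does not bound this branch
): string {
  const feedback: string[] = [];
  let retries = 0;
  let improvements = 0;

  while (true) {
    const plan = deps.think(input, feedback);

    // "Use Tools?" answered No: respond directly.
    if (plan.toolCalls.length === 0) return deps.respond(plan, []);

    const results = plan.toolCalls.map((call) => deps.execute(call));
    const failed = results.filter((r) => !r.ok);

    if (failed.length > 0) {
      // "Retry with Different Approach", bounded by "Max Retries?"
      retries += 1;
      if (retries >= maxRetries) {
        return `Error: tools failed after ${retries} attempts`;
      }
      feedback.push(`tool failure: ${failed.map((r) => r.output).join("; ")}`);
      continue; // back to Think
    }

    // "Quality OK?" answered Yes, or the improvement budget is spent.
    if (deps.qualityOk(plan, results) || improvements >= maxImprovements) {
      return deps.respond(plan, results);
    }
    improvements += 1;
    feedback.push("quality check failed; improve the response");
  }
}
```

Passing the accumulated `feedback` back into `think` is one way to make each retry differ from the previous attempt; a real agent would more likely feed it into the LLM prompt.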
graph LR
subgraph "User Interface"
CLI[CLI]
Web[Web UI]
Desktop[Desktop App]
end
subgraph "Core Engine"
Agent[AGI Agent]
Memory[Memory System]
Tools[Tool Registry]
end
subgraph "AI Providers"
Gemini[Gemini]
OpenAI[OpenAI]
Anthropic[Anthropic]
Local[Local Models]
end
subgraph "Feature Modules"
Creative[Creative AI]
Automation[Automation]
Collaboration[Collaboration]
Enterprise[Enterprise]
end
CLI --> Agent
Web --> Agent
Desktop --> Agent
Agent --> Memory
Agent --> Tools
Agent --> Gemini
Agent --> OpenAI
Agent --> Anthropic
Agent --> Local
Tools --> Creative
Tools --> Automation
Tools --> Collaboration
Tools --> Enterprise
sequenceDiagram
participant User
participant Agent
participant Tools
participant LLM
participant Memory
User->>Agent: User Input
Agent->>Memory: Load Context
Agent->>LLM: Generate Plan
alt Plan Uses Tools
Agent->>Tools: Execute Tool(s)
Tools->>Agent: Tool Results
Agent->>LLM: Process Results
end
LLM->>Agent: Generate Response
Agent->>Memory: Store Conversation
Agent->>User: Final Response
- Image Generation: AI-powered image creation and editing
- Voice Intelligence: Speech-to-text and text-to-speech with multiple providers
- Desktop Control: Mouse, keyboard, and screen automation with computer vision
- Web Automation: Browser control and data extraction with Playwright
- Code Execution: Safe multi-language programming environment
- Multi-Agent System: Agent coordination and task delegation
- MCP Integration: Model Context Protocol for inter-agent communication
- Task Scheduling: Automated cron-based task execution
- Obsidian Integration ❤️ - Advanced Knowledge Management: TF-IDF semantic search, knowledge graph analysis, AI synthesis, real-time sync, 15+ sophisticated tools
- Plugin Ecosystem: Lego-style modular architecture. Add new tools, agents, or middleware without touching core code.
- Advanced Analytics: Performance monitoring and Vercel AI tracing
- Security System: Enterprise-grade authentication, authorization, and audit logging
- Cloud Deployment: Scalable infrastructure and gateway server
- SDK Integration: Complete development toolkit
- Web Interface: Real-time collaborative chat platform
- Desktop Application: Electron-based client
Krab is designed to be infinitely extensible. Our plugin system allows you to create your own "blocks" and plug them in seamlessly.
# Create a new tool plugin in seconds
krab plugins create my-custom-tool --type tool
# Install a local plugin
krab plugins install ./my-plugin-folder
# List all plugins
krab plugins list
If you find Krab useful, please give us a ⭐ on GitHub!
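To make the idea of plug-in "blocks" concrete, here is a minimal sketch of a tool registry that accepts new tools without changes to the core. The `ToolPlugin` interface and `ToolRegistry` class are assumptions made for illustration; the files scaffolded by `krab plugins create` may look quite different.

```typescript
// Hypothetical plugin shape; the real Krab plugin API may differ.
interface ToolPlugin {
  name: string;
  description: string;
  run: (args: Record<string, unknown>) => string;
}

class ToolRegistry {
  private tools = new Map<string, ToolPlugin>();

  // Adding a tool only touches the registry, never the agent core.
  register(plugin: ToolPlugin): void {
    if (this.tools.has(plugin.name)) {
      throw new Error(`Tool already registered: ${plugin.name}`);
    }
    this.tools.set(plugin.name, plugin);
  }

  list(): string[] {
    return Array.from(this.tools.keys()).sort();
  }

  call(name: string, args: Record<string, unknown>): string {
    const tool = this.tools.get(name);
    if (!tool) throw new Error(`Unknown tool: ${name}`);
    return tool.run(args);
  }
}

// Example plugin: upper-cases the "text" argument.
const registry = new ToolRegistry();
registry.register({
  name: "my-custom-tool",
  description: "Echoes its input in upper case",
  run: (args) => String(args["text"] ?? "").toUpperCase(),
});
```

Rejecting duplicate names at registration time keeps one plugin from silently shadowing another.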
git clone https://github.com/OpenKrab/Krab.git
cd Krab
npm install --legacy-peer-deps
Copy the example environment file:
cp .env.example .env
Add your preferred API key (e.g., GEMINI_API_KEY, KILOCODE_API_KEY, OPENAI_API_KEY).
For Obsidian support, set OBSIDIAN_VAULT_PATH=/path/to/your/vault.
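As an illustration of how these variables might be consumed, the sketch below picks the first provider whose key is set and carries the optional vault path along. The variable names come from this README; the `KrabConfig` shape, the `loadConfig` function, and the priority order are assumptions, not Krab's actual configuration loader.

```typescript
// Env var names are from the README; everything else here is hypothetical.
type Env = Record<string, string | undefined>;

interface KrabConfig {
  provider: "gemini" | "kilocode" | "openai";
  apiKey: string;
  obsidianVaultPath?: string;
}

// Assumed priority order: the first key that is set wins.
const PROVIDER_KEYS: Array<[KrabConfig["provider"], string]> = [
  ["gemini", "GEMINI_API_KEY"],
  ["kilocode", "KILOCODE_API_KEY"],
  ["openai", "OPENAI_API_KEY"],
];

function loadConfig(env: Env): KrabConfig {
  for (const [provider, keyName] of PROVIDER_KEYS) {
    const apiKey = env[keyName]?.trim();
    if (apiKey) {
      const vault = env["OBSIDIAN_VAULT_PATH"]?.trim();
      // Only include the vault path when it is actually set.
      return { provider, apiKey, ...(vault ? { obsidianVaultPath: vault } : {}) };
    }
  }
  const names = PROVIDER_KEYS.map(([, keyName]) => keyName).join(", ");
  throw new Error(`No API key found; set one of: ${names}`);
}
```

In a Node process this would be called as `loadConfig(process.env)` after the `.env` file has been loaded; failing fast on a missing key gives a clearer error than a rejected API call later.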
npm run build
npm start
Or use development mode:
npm run dev
npm start chat
npm start ask "Generate an image of a futuristic city"
npm start ask "Search my obsidian vault for notes about AGI and summarize them"
npm start ask "Navigate to example.com and extract the main heading"
npm start ask "Write a Python script to analyze this dataset"
npm start ask "Take a screenshot and save it to desktop"
- krab chat - Start interactive chat session
- krab ask <question> - Ask a single question
- krab tools - List all available tools
- krab config - Manage configuration
- krab plugins list - List all installed plugins
- krab plugins create - Scaffold a new plugin project
- krab plugins doctor - Check plugin health
- krab gateway - Start web API server
- krab scheduler - Manage scheduled tasks
- krab analytics - View performance metrics
- krab security - Security management
- /plugins - Show loaded plugins and their status
- /tools - View all loaded tools and permissions
- /memory - Check conversation buffer status
- /debug - View current provider and configuration
- /clear - Clear conversation memory
- /help - Show available commands
- Core AGI Engine - Advanced reasoning and tool integration
- Voice Intelligence - Complete STT/TTS system
- Desktop Automation - Mouse, keyboard, vision control
- Web Automation - Browser control and data extraction
- Code Execution - Safe multi-language programming
- Creative AI - Image generation and media processing
- Cloud Infrastructure - Enterprise deployment and monitoring
- Desktop Application - Modern Electron UI
- Web Interface - Real-time collaborative chat
- Developer SDK - Complete integration toolkit
- Advanced Analytics - Observability and performance tracking
- Agent Collaboration - Multi-agent coordination system
- MCP Integration - Inter-agent communication protocol
- Scheduler System - Automated task execution
- Browser Agent - Web automation with AI vision
- Security Enhancements - Enterprise security and compliance
- Obsidian Integration ❤️ - Deep Knowledge Base connection
- Plugin Ecosystem 🧩 - Lego-style modular architecture
- Testing & Validation - Framework testing and validation
- Mobile Apps - React Native iOS/Android applications
- Enterprise Features - Advanced business capabilities
Krab includes 60+ powerful tools across 19 feature categories:
- get_datetime - Time and timezone awareness
- shell - Safe shell execution with approval
- web_search - Hybrid search capabilities
- file_ops - File system operations
- obsidian_read_note - Read specific notes with full metadata
- obsidian_search_notes - Basic text search across vault
- obsidian_semantic_search - TF-IDF cosine similarity search
- obsidian_find_related_notes - Graph-based relationship discovery
- obsidian_get_knowledge_graph - Complete vault relationship graph
- obsidian_get_central_nodes - Most connected/important notes
- obsidian_find_knowledge_path - Shortest paths between concepts
- obsidian_synthesize_knowledge - AI-powered topic summarization
- obsidian_discover_clusters - Tag-based theme clustering
- obsidian_list_notes_by_tag - Tag-filtered note listing
- obsidian_create_note - Create new notes with frontmatter
- obsidian_start_sync - Enable real-time file watching
- obsidian_stop_sync - Disable live vault updates
- obsidian_sync_status - Monitor sync state
- obsidian_vault_stats - Comprehensive vault analytics
- image_generate - AI image generation
- image_edit - Image manipulation
- voice_speak - Text-to-speech
- voice_transcribe - Speech-to-text
- browser_navigate - Web browsing
- computer_click - Desktop control
- computer_type - Keyboard automation
- code_execute - Multi-language code execution
- security_auth - Authentication
- analytics_trace - Performance monitoring
- scheduler_task - Task scheduling
- mcp_connect - Agent communication
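The obsidian_semantic_search tool is described above as a TF-IDF cosine similarity search. The sketch below shows that technique on an in-memory set of notes. It is not Krab's code: the function names are hypothetical, the tokenizer is deliberately naive (ASCII letters and digits only), and the smoothed IDF formula `ln((1 + N) / (1 + df)) + 1` is one common choice assumed here.

```typescript
// Naive tokenizer (assumption): lower-case ASCII words and digits only.
function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
}

// Smoothed inverse document frequency over the whole corpus.
function buildIdf(docs: string[][]): Map<string, number> {
  const df = new Map<string, number>();
  docs.forEach((tokens) => {
    Array.from(new Set(tokens)).forEach((t) => df.set(t, (df.get(t) ?? 0) + 1));
  });
  const idf = new Map<string, number>();
  df.forEach((count, term) => {
    idf.set(term, Math.log((1 + docs.length) / (1 + count)) + 1);
  });
  return idf;
}

// TF-IDF vector as a sparse map; terms unseen in the corpus are ignored.
function vectorize(tokens: string[], idf: Map<string, number>): Map<string, number> {
  const tf = new Map<string, number>();
  tokens.forEach((t) => tf.set(t, (tf.get(t) ?? 0) + 1));
  const vec = new Map<string, number>();
  tf.forEach((count, term) => {
    const weight = idf.get(term);
    if (weight !== undefined) vec.set(term, count * weight);
  });
  return vec;
}

function cosine(a: Map<string, number>, b: Map<string, number>): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((w, term) => {
    dot += w * (b.get(term) ?? 0);
    normA += w * w;
  });
  b.forEach((w) => {
    normB += w * w;
  });
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Rank notes (name -> body text) by similarity to the query, best first.
function semanticSearch(
  query: string,
  notes: Record<string, string>,
): Array<{ note: string; score: number }> {
  const names = Object.keys(notes);
  const docs = names.map((name) => tokenize(notes[name] ?? ""));
  const idf = buildIdf(docs);
  const queryVec = vectorize(tokenize(query), idf);
  return names
    .map((note, i) => ({ note, score: cosine(queryVec, vectorize(docs[i] ?? [], idf)) }))
    .sort((x, y) => y.score - x.score);
}
```

Because TF-IDF matches on shared terms, a note scores zero when it has no words in common with the query, even if the topic is related.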
Krab implements enterprise-grade security:
- Tool Approval System: Dangerous operations require user confirmation
- Rate Limiting: Prevent abuse and resource exhaustion
- Cost Controls: Monitor and limit API usage
- Audit Logging: Complete operation tracking
- Authentication: User management and access control
- Authorization: Role-based permissions
- Lightweight: < 50 dependencies total
- Fast: < 1s startup time
- Efficient: Parallel tool execution
- Scalable: Cloud-ready architecture
- Reliable: 99.9% uptime capability
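The tool approval, rate limiting, and audit logging items listed above can be combined in a single guard around each tool call. The following is a sketch under stated assumptions: `TokenBucket`, `guardedCall`, and the set of tools treated as dangerous are all hypothetical, and the clock is passed in as a millisecond timestamp so the behavior is easy to test.

```typescript
// Hypothetical names throughout; not taken from the Krab codebase.
type Approver = (toolName: string, args: unknown) => boolean;

class TokenBucket {
  private capacity: number;
  private refillPerSec: number;
  private tokens: number;
  private last: number;

  constructor(capacity: number, refillPerSec: number, nowMs: number) {
    this.capacity = capacity;
    this.refillPerSec = refillPerSec;
    this.tokens = capacity;
    this.last = nowMs;
  }

  // Refill based on elapsed time, then try to spend one token.
  tryTake(nowMs: number): boolean {
    const elapsedSec = Math.max(0, nowMs - this.last) / 1000;
    this.tokens = Math.min(this.capacity, this.tokens + elapsedSec * this.refillPerSec);
    this.last = nowMs;
    if (this.tokens < 1) return false;
    this.tokens -= 1;
    return true;
  }
}

// Assumption: which tools count as dangerous is a policy decision.
const DANGEROUS_TOOLS = new Set(["shell", "code_execute", "computer_click", "computer_type"]);

function guardedCall(
  toolName: string,
  args: unknown,
  run: (args: unknown) => string,
  approve: Approver,
  bucket: TokenBucket,
  auditLog: string[],
  nowMs: number,
): string {
  if (!bucket.tryTake(nowMs)) {
    auditLog.push(`${toolName}: rate-limited`);
    throw new Error("Rate limit exceeded");
  }
  if (DANGEROUS_TOOLS.has(toolName) && !approve(toolName, args)) {
    auditLog.push(`${toolName}: denied`);
    throw new Error(`User denied ${toolName}`);
  }
  auditLog.push(`${toolName}: allowed`);
  return run(args);
}
```

In this sketch a denied call still consumes a token, so repeated attempts at a blocked operation count against the rate limit.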
Krab now supports a powerful, extensible plugin system for communication channels and more:
- Telegram Channel: Integration via grammY with automatic message routing.
- WhatsApp Channel: Multi-device support via Baileys (scan QR to connect).
- Discord Channel: Full bot support with DM and channel integration.
- Web Control UI: A modern, glassmorphic dashboard for monitoring gateway status and managing active channels.
- Multi-Agent Routing: Route messages to specific agents based on sender/workspace.
- Session Isolation: Each communication session maintains its own memory and context.
- Vector Memory: Long-term semantic search across all conversations.
- Service Installation: Easily install Krab Gateway as a system service on Windows, Linux, and macOS.
- Raspberry Pi Deployment Guide
- Gateway Setup Guide
- VPS Deployment Guide
- Docker Deployment Guide
- Railway Deployment Guide
- Render Deployment Guide
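Multi-agent routing and session isolation, as described in the gateway feature list above, could be sketched as follows. The message shape, route rules, and session key format are assumptions for illustration only; the gateway's real data model is not documented here.

```typescript
// Hypothetical message and route shapes; not the gateway's actual types.
interface IncomingMessage {
  channel: "telegram" | "whatsapp" | "discord";
  sender: string;
  workspace?: string;
  text: string;
}

interface Route {
  match: { sender?: string; workspace?: string };
  agent: string;
}

// First matching route wins; an omitted field in `match` acts as a wildcard.
function pickAgent(msg: IncomingMessage, routes: Route[], fallback: string): string {
  const hit = routes.find(
    (r) =>
      (r.match.sender === undefined || r.match.sender === msg.sender) &&
      (r.match.workspace === undefined || r.match.workspace === msg.workspace),
  );
  return hit ? hit.agent : fallback;
}

// Each (channel, workspace, sender) triple gets its own conversation history.
class SessionStore {
  private sessions = new Map<string, string[]>();

  private key(msg: IncomingMessage): string {
    return `${msg.channel}:${msg.workspace ?? "-"}:${msg.sender}`;
  }

  append(msg: IncomingMessage): string[] {
    const k = this.key(msg);
    const history = this.sessions.get(k) ?? [];
    history.push(msg.text);
    this.sessions.set(k, history);
    return history;
  }
}
```

Keying sessions on the channel as well as the sender means the same person on Telegram and Discord gets two separate contexts; whether that is desirable depends on the deployment.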
Krab supports 15+ LLM providers:
- Free: Gemini 2.0 Flash, Kilocode GLM-5
- Premium: OpenAI GPT-4, Anthropic Claude, DeepSeek
- Local: Ollama, LM Studio
- Enterprise: Azure OpenAI, Google Cloud AI
We welcome contributions! Please see our Contributing Guide for details.
MIT License - see LICENSE file for details.
