Open-Source Alternative to OpenAI Operator
LLM-Powered Browser Automation Agent
Get Started โข Documentation โข Examples โข Contributing โข Demo
OperatorNext is an open-source AI agent platform that understands and executes complex browser tasks through natural language processing and visual reasoning. By combining state-of-the-art LLM technologies (including GPT-4o, Claude, and more) with browser automation, we provide developers and users with a powerful Computer-Using Agent (CUA) for web automation, data collection, UI testing, and various other scenarios.
๐ฅ๏ธ Modern Interface with Real-time Task Monitoring
๐ค Intelligent Task Execution with Visual Feedback
โ ๏ธ Project StatusThis project is in early development stage. Core features are under active development and not yet implemented.
Please note that breaking changes may occur frequently during this phase.
| Feature | OperatorNext | OpenAI Operator |
|---|---|---|
| License | MIT Open Source | Proprietary |
| Deployment | Self-hosted & Cloud | Cloud-only |
| Data Privacy | Local Processing | Cloud Processing |
| Customization | Full Control | Limited |
| Cost | Free & Self-hosted | Usage-based Pricing |
| API Integration | Flexible & Open | Restricted |
OperatorNext empowers developers with:
- ๐ Web Scraping & Data Extraction - Automated data collection with pixel-level accuracy
- ๐งช End-to-End Testing - Modern alternative to Selenium for UI/UX testing
- ๐ค RPA (Robotic Process Automation) - Chain-of-Thought planning for complex tasks
- ๐ Web Testing & QA - Visual reasoning based quality assurance
- ๐ Data Mining & Analytics - Intelligent web data gathering with self-correction
- ๐ Workflow Automation - Custom workflow design with plugin ecosystem
Perfect for:
- DevOps and QA Teams (Automated Testing)
- Data Scientists and Researchers (Web Scraping)
- Digital Marketing Professionals (Form Automation)
- Business Process Automation (RPA Solutions)
- Web Developers and Testers (GUI Testing)
- Enterprise Automation Solutions (Custom Workflows)
- ๐ค AI Agent & Visual Reasoning - Complete complex browser operations through natural language and visual understanding, powered by GPT-4o multimodal capabilities
- ๐ง Chain-of-Thought Planning - Advanced task planning and execution with reinforcement learning for optimal automation
- ๐ฏ Precise GUI Interaction - Pixel-perfect DOM operations, XPath navigation, and complex interaction scenarios using computer vision
- ๐ Real-time Task Tracking - WebSocket-based monitoring system with CPU, memory, and network metrics for execution insights
- ๐ Privacy-First Design - Local processing of sensitive data with comprehensive error handling and self-correction mechanisms
- ๐ Cross-Platform & Multilingual - Full i18n support with Chinese/English interfaces, works on Windows, macOS, and Linux
- ๐ Extensible Architecture - REST API, WebSocket endpoints, and plugin system for seamless integration
- ๐ Cloud & Self-Hosted - Deploy on your infrastructure or use our cloud solution for maximum flexibility
- โก High Performance - Parallel task execution with optimized resource management
- ๐จ Modern Developer Experience - Beautiful UI/UX built with Next.js and Tailwind CSS, extensive API documentation
Try our online demo at operatornext.com or set up your own instance:
- Docker & Docker Compose
- Node.js 18+
- pnpm 10+
- Chrome/Chromium browser
- Clone the repository
git clone https://github.com/OperatorNext/OperatorNext.git
cd OperatorNext- Copy environment variable templates
# Copy frontend environment variables
cp frontend/.env.local.example frontend/.env.local
# Copy Docker environment variables
cp .env.example .env- Install frontend dependencies
cd frontend
pnpm install- Initialize database and generate types
# Push database schema
sudo pnpm db:push
# Generate Prisma client and types
sudo pnpm db:generateNote:
sudomight be required for database operations depending on your system configuration.
- Start Docker services
docker-compose up -dThis will start the following services:
| Service | URL | Description |
|---|---|---|
| Web Application | http://localhost:3000 | Next.js frontend application |
| PgAdmin | http://localhost:5051 | PostgreSQL database management |
| Maildev | http://localhost:8026 | Email testing interface |
| MinIO Console | http://localhost:9003 | Object storage management |
| MinIO API | http://localhost:9002 | S3-compatible API endpoint |
| PostgreSQL | localhost:5438 | Database (connect via psql or GUI) |
โ ๏ธ These are development credentials. Do NOT use in production!
-
PostgreSQL:
- User: operatornext_prod_user
- Database: operatornext_production
-
PgAdmin:
- Email: admin@operatornext.dev
- Password: See
.envfile
-
MinIO:
- Access Key: See
MINIO_ROOT_USERin.env - Secret Key: See
MINIO_ROOT_PASSWORDin.env
- Access Key: See
- Start frontend development server
cd frontend
pnpm devVisit http://localhost:3000 to use the application.
# Create a new browser task
task = {
"task_description": "Login to GitHub and star a repository"
}
response = requests.post("http://localhost:8000/api/tasks", json=task)
task_id = response.json()["task_id"]
# Monitor task status via WebSocket
ws = websockets.connect(f"ws://localhost:8000/ws/tasks/{task_id}")For more examples, please visit our documentation.
- LLM Support - Compatible with GPT-4o, Claude, and other language models
- LangChain - Large Language Model (LLM) orchestration framework
- Computer Vision - Pixel-level DOM interaction and visual analysis
- Reinforcement Learning - Self-improving task execution strategies
- Chain-of-Thought - Advanced planning and decision making
- FastAPI - High-performance Python web framework for building scalable APIs
- WebSocket - Real-time bidirectional communication for task monitoring
- Playwright - Modern web testing and automation with superior stability
- PostgreSQL - Advanced open-source database for task management
- MinIO - S3-compatible object storage for artifact management
- Redis - In-memory data structure store for caching and queuing
- Docker - Containerization and deployment automation
- Next.js 15 (App Router) - React framework with server-side rendering
- React 19 - Latest version with concurrent features and Suspense
- TypeScript - Type-safe JavaScript development for reliability
- Tailwind CSS - Utility-first CSS framework for modern UI
- Shadcn UI - Modern and accessible component library
- Prisma - Next-generation ORM for type-safe database access
- Turbo Repo - High-performance monorepo build system
- WebSocket - Real-time updates and task monitoring
- Biome - Fast and reliable code formatter
- Docker Compose - Multi-container orchestration
- GitHub Actions - CI/CD automation pipeline
- Playwright - End-to-end testing framework
- Prisma - Database schema management and migrations
- Biome - Code quality and formatting tools
- pnpm - Fast, disk space efficient package manager
- Local Processing - Sensitive data handling
- End-to-End Encryption - Secure communication
- Role-Based Access - Fine-grained permissions
- Audit Logging - Comprehensive activity tracking
For detailed documentation, please visit our documentation.
We welcome all forms of contributions, whether it's new features, documentation improvements, or bug reports. Please check our Contributing Guide for more information.
This project is licensed under the MIT License.
This project is inspired by and built upon:
Thanks to all the developers who have contributed to this project!
Join our growing community:
- Discord - Join our community for discussions, support, and updates
- Slack - Join our Slack workspace for team collaboration
- GitHub Issues - Bug reports and feature requests
- GitHub Discussions - Technical discussions and questions
- Email: hi@operatornext.com
- Telegram: @HaiPro_2025
Company: CyberPoet LLC (Position: CEO)