Skip to content

OperatorNext is an AI-powered intelligent browser automation platform that understands and executes complex browser tasks through natural language processing.

License

Notifications You must be signed in to change notification settings

getbyxpo/OperatorNext

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

46 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

OperatorNext ๐Ÿค–

OperatorNext Logo

GitHub license GitHub stars GitHub issues GitHub pull requests Discord Version

English | ็ฎ€ไฝ“ไธญๆ–‡


Open-Source Alternative to OpenAI Operator

LLM-Powered Browser Automation Agent

Get Started โ€ข Documentation โ€ข Examples โ€ข Contributing โ€ข Demo

OperatorNext is an open-source AI agent platform that understands and executes complex browser tasks through natural language processing and visual reasoning. By combining state-of-the-art LLM technologies (including GPT-4o, Claude, and more) with browser automation, we provide developers and users with a powerful Computer-Using Agent (CUA) for web automation, data collection, UI testing, and various other scenarios.

๐Ÿ–ฅ๏ธ Modern Interface with Real-time Task Monitoring

Operator Next Hero

๐Ÿค– Intelligent Task Execution with Visual Feedback

Operator Next Screenshot

โš ๏ธ Project Status

This project is in early development stage. Core features are under active development and not yet implemented.

Please note that breaking changes may occur frequently during this phase.

๐ŸŒŸ Why Choose OperatorNext?

Feature OperatorNext OpenAI Operator
License MIT Open Source Proprietary
Deployment Self-hosted & Cloud Cloud-only
Data Privacy Local Processing Cloud Processing
Customization Full Control Limited
Cost Free & Self-hosted Usage-based Pricing
API Integration Flexible & Open Restricted

OperatorNext empowers developers with:

  • ๐Ÿ” Web Scraping & Data Extraction - Automated data collection with pixel-level accuracy
  • ๐Ÿงช End-to-End Testing - Modern alternative to Selenium for UI/UX testing
  • ๐Ÿค– RPA (Robotic Process Automation) - Chain-of-Thought planning for complex tasks
  • ๐ŸŒ Web Testing & QA - Visual reasoning based quality assurance
  • ๐Ÿ“Š Data Mining & Analytics - Intelligent web data gathering with self-correction
  • ๐Ÿ”„ Workflow Automation - Custom workflow design with plugin ecosystem

Perfect for:

  • DevOps and QA Teams (Automated Testing)
  • Data Scientists and Researchers (Web Scraping)
  • Digital Marketing Professionals (Form Automation)
  • Business Process Automation (RPA Solutions)
  • Web Developers and Testers (GUI Testing)
  • Enterprise Automation Solutions (Custom Workflows)

โœจ Features

  • ๐Ÿค– AI Agent & Visual Reasoning - Complete complex browser operations through natural language and visual understanding, powered by GPT-4o multimodal capabilities
  • ๐Ÿง  Chain-of-Thought Planning - Advanced task planning and execution with reinforcement learning for optimal automation
  • ๐ŸŽฏ Precise GUI Interaction - Pixel-perfect DOM operations, XPath navigation, and complex interaction scenarios using computer vision
  • ๐Ÿ“Š Real-time Task Tracking - WebSocket-based monitoring system with CPU, memory, and network metrics for execution insights
  • ๐Ÿ”’ Privacy-First Design - Local processing of sensitive data with comprehensive error handling and self-correction mechanisms
  • ๐ŸŒ Cross-Platform & Multilingual - Full i18n support with Chinese/English interfaces, works on Windows, macOS, and Linux
  • ๐Ÿ”Œ Extensible Architecture - REST API, WebSocket endpoints, and plugin system for seamless integration
  • ๐Ÿš€ Cloud & Self-Hosted - Deploy on your infrastructure or use our cloud solution for maximum flexibility
  • โšก High Performance - Parallel task execution with optimized resource management
  • ๐ŸŽจ Modern Developer Experience - Beautiful UI/UX built with Next.js and Tailwind CSS, extensive API documentation

๐Ÿš€ Getting Started

Try our online demo at operatornext.com or set up your own instance:

Prerequisites

  • Docker & Docker Compose
  • Node.js 18+
  • pnpm 10+
  • Chrome/Chromium browser

Installation

  1. Clone the repository
git clone https://github.com/OperatorNext/OperatorNext.git
cd OperatorNext
  1. Copy environment variable templates
# Copy frontend environment variables
cp frontend/.env.local.example frontend/.env.local

# Copy Docker environment variables
cp .env.example .env
  1. Install frontend dependencies
cd frontend
pnpm install
  1. Initialize database and generate types
# Push database schema
sudo pnpm db:push

# Generate Prisma client and types
sudo pnpm db:generate

Note: sudo might be required for database operations depending on your system configuration.

Start Services

  1. Start Docker services
docker-compose up -d

This will start the following services:

Service URL Description
Web Application http://localhost:3000 Next.js frontend application
PgAdmin http://localhost:5051 PostgreSQL database management
Maildev http://localhost:8026 Email testing interface
MinIO Console http://localhost:9003 Object storage management
MinIO API http://localhost:9002 S3-compatible API endpoint
PostgreSQL localhost:5438 Database (connect via psql or GUI)

Default Credentials

โš ๏ธ These are development credentials. Do NOT use in production!

  • PostgreSQL:

    • User: operatornext_prod_user
    • Database: operatornext_production
  • PgAdmin:

  • MinIO:

    • Access Key: See MINIO_ROOT_USER in .env
    • Secret Key: See MINIO_ROOT_PASSWORD in .env
  1. Start frontend development server
cd frontend
pnpm dev

Visit http://localhost:3000 to use the application.

๐Ÿ“– Usage Example

# Create a new browser task
task = {
    "task_description": "Login to GitHub and star a repository"
}
response = requests.post("http://localhost:8000/api/tasks", json=task)
task_id = response.json()["task_id"]

# Monitor task status via WebSocket
ws = websockets.connect(f"ws://localhost:8000/ws/tasks/{task_id}")

For more examples, please visit our documentation.

๐Ÿ”ง Technology Stack

AI & Automation

  • LLM Support - Compatible with GPT-4o, Claude, and other language models
  • LangChain - Large Language Model (LLM) orchestration framework
  • Computer Vision - Pixel-level DOM interaction and visual analysis
  • Reinforcement Learning - Self-improving task execution strategies
  • Chain-of-Thought - Advanced planning and decision making

Backend Infrastructure

  • FastAPI - High-performance Python web framework for building scalable APIs
  • WebSocket - Real-time bidirectional communication for task monitoring
  • Playwright - Modern web testing and automation with superior stability
  • PostgreSQL - Advanced open-source database for task management
  • MinIO - S3-compatible object storage for artifact management
  • Redis - In-memory data structure store for caching and queuing
  • Docker - Containerization and deployment automation

Frontend Technologies

  • Next.js 15 (App Router) - React framework with server-side rendering
  • React 19 - Latest version with concurrent features and Suspense
  • TypeScript - Type-safe JavaScript development for reliability
  • Tailwind CSS - Utility-first CSS framework for modern UI
  • Shadcn UI - Modern and accessible component library
  • Prisma - Next-generation ORM for type-safe database access
  • Turbo Repo - High-performance monorepo build system
  • WebSocket - Real-time updates and task monitoring
  • Biome - Fast and reliable code formatter

DevOps & Quality

  • Docker Compose - Multi-container orchestration
  • GitHub Actions - CI/CD automation pipeline
  • Playwright - End-to-end testing framework
  • Prisma - Database schema management and migrations
  • Biome - Code quality and formatting tools
  • pnpm - Fast, disk space efficient package manager

Security & Privacy

  • Local Processing - Sensitive data handling
  • End-to-End Encryption - Secure communication
  • Role-Based Access - Fine-grained permissions
  • Audit Logging - Comprehensive activity tracking

๐Ÿ“ Documentation

For detailed documentation, please visit our documentation.

๐Ÿค Contributing

We welcome all forms of contributions, whether it's new features, documentation improvements, or bug reports. Please check our Contributing Guide for more information.

๐Ÿ“„ License

This project is licensed under the MIT License.

๐Ÿ™ Acknowledgments

This project is inspired by and built upon:

Thanks to all the developers who have contributed to this project!

๐ŸŒŸ Star History

๐Ÿ“ฎ Contact & Community

Join our growing community:

Company: CyberPoet LLC (Position: CEO)

About

OperatorNext is an AI-powered intelligent browser automation platform that understands and executes complex browser tasks through natural language processing.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •