VisionUX is a video analytics platform powered by an Agentic RAG framework incorporating OpenAI GPT-4o-mini and Google Gemini-1.5-Flash models. This proof-of-concept application combines computer vision, AI analysis, and interactive visualization for video understanding through an agent-based approach to content processing and user interaction.
- Python 3.11
- Node.js 20.17.0
- npm 10.8.2
- Conda (for environment management)
- OpenAI API key
- Google Gemini API key
- Create and activate a Python virtual environment:
conda create --name visionux python=3.11
conda activate visionux
cd backend
pip install -r requirements.txt- Copy and configure environment variables:
cp .env.example .envRequired environment variables:
API_HOST=localhost
API_PORT=8000
OPENAI_API_KEY=your_openai_key_here
GEMINI_API_KEY=your_gemini_key_here- In a new terminal (without Python virtual environment):
conda deactivate
cd frontend
npm install- Copy and configure environment variables:
cp .env.example .envConfigure with these settings:
REACT_APP_API_URL=http://localhost:8000
REACT_APP_WS_URL=http://localhost:8000
REACT_APP_API_VERSION=/api/v1
REACT_APP_WS_PORT=8000
REACT_APP_WS_HOST=localhost
REACT_APP_STREAM_PORT=8001
REACT_APP_STREAM_WS_URL=http://localhost:8001- Start the backend server:
cd backend
python run.py --config config.yaml- In a separate terminal, start the frontend:
cd frontend
npm startThe application will be available at http://localhost:3000
- Recommended video length: 10-20 seconds for optimal performance
- Primary supported format: MP4
- Other formats (AVI, MOV, WEBM) may work but might have compatibility issues
- Maximum file size: 100MB
- This is a proof-of-concept implementation
- Camera functionality is disabled (planned for future releases)
- Video processing is optimized for short clips
- Users should test thoroughly and report any issues
- Video file upload and management
- AI-powered scene analysis
- Object detection
- Edge detection
- Chat interface with RAG (Retrieval Augmented Generation)
- Analysis results visualization
For support questions, please open an issue in the repository.
Licensing Terms for VisionUX
Commercial License
If you intend to use VisionUX to develop commercial sites, themes, projects, or applications, the Commercial License is the appropriate option. This license allows you to keep your source code proprietary.
For inquiries regarding the Commercial License, please contact us at contact@dynamworks.com.
Open Source License
VisionUX is also available under the GNU Affero General Public License v3 (GNU AGPL v3). If you are developing an open-source application under a license compatible with the GNU AGPL v3, you may use VisionUX under the terms of this license.
For more information on the GNU AGPL v3, please visit GNU AGPL v3 Overview.
