WDBX: Vector Database for AI Applications

WDBX is a flexible vector database system designed for AI applications with an extensible plugin architecture.

Features

🚀 High-performance vector storage and similarity search with multiple indexing options
🔄 Asynchronous API for non-blocking operations
🔌 Extensible plugin architecture for easy integration with external services
🌐 RESTful API server for remote access
🤖 Built-in support for various embedding models and LLM providers
📊 Advanced visualization and analytics capabilities
🔄 Distributed architecture with sharding and replication
🔒 Secure storage with support for authentication and encryption
💻 Command-line interface for easy management

Installation

pip install wdbx

To install with specific components:

pip install wdbx[api]          # Install with API server
pip install wdbx[security]     # Install with security features
pip install wdbx[visualization] # Install with visualization tools
pip install wdbx[indexing]     # Install with advanced indexing
pip install wdbx[webscraper]   # Install with web scraper plugin
pip install wdbx[ollama]       # Install with Ollama integration
pip install wdbx[all]          # Install with all components

Quick Start

Basic Usage

from wdbx import WDBX

# Create a WDBX instance
wdbx = WDBX(
    vector_dimension=384,  # Common dimension for modern embedding models
    num_shards=2,
    data_dir="./wdbx_data",
    enable_plugins=True,
)

# Initialize the instance
import asyncio
asyncio.run(wdbx.initialize())

# Store a vector
vector = [0.1, 0.2, ...] * 384  # Your vector data
metadata = {"source": "example", "content": "Sample text"}
vector_id = wdbx.vector_store(vector, metadata)

# Search for similar vectors
results = wdbx.vector_search(vector, limit=5)
for vector_id, similarity, metadata in results:
    print(f"Vector ID: {vector_id}, Similarity: {similarity:.4f}")
    print(f"Content: {metadata.get('content')}")

# Don't forget to close the database
asyncio.run(wdbx.shutdown())

Asynchronous API

import asyncio
from wdbx import WDBX

async def main():
    # Create and initialize WDBX instance
    wdbx = WDBX(vector_dimension=384)
    await wdbx.initialize()

    # Store vectors asynchronously
    vector_id = await wdbx.vector_store_async([0.1] * 384, {"text": "Example"})

    # Search asynchronously
    results = await wdbx.vector_search_async([0.1] * 384, limit=5)

    # Clean up
    await wdbx.shutdown()

# Run the async function
asyncio.run(main())

Using Plugins

from wdbx import WDBX

# Create WDBX with plugins enabled
wdbx = WDBX(vector_dimension=384, enable_plugins=True)

# Initialize the instance
import asyncio
asyncio.run(wdbx.initialize())

# Get a plugin instance
webscraper = wdbx.get_plugin("webscraper")

# Use the plugin to extract content and create an embedding
content = asyncio.run(webscraper.extract_content("https://example.com"))
embedding = asyncio.run(webscraper.create_embedding(content))

# Store in the database
metadata = {"url": "https://example.com", "content": content}
vector_id = wdbx.vector_store(embedding, metadata)

# Clean up
asyncio.run(wdbx.shutdown())

Using the CLI

The Command-Line Interface provides easy access to WDBX functionality:

# Display help
wdbx help

# Store a vector from text
wdbx store --from-text "This is a sample text to embed"

# Search for similar vectors
wdbx search --from-text "sample text" --limit 5

# Start the API server
wdbx serve --port 8000

Starting the API Server

from wdbx import WDBX
from wdbx.api import WDBXAPIServer
import asyncio

async def main():
    # Create and initialize WDBX
    wdbx = WDBX(vector_dimension=384, enable_plugins=True)
    await wdbx.initialize()

    # Create and start API server
    server = WDBXAPIServer(wdbx, port=8000)
    await server.initialize()
    await server.start()

# Run the server
asyncio.run(main())

Components

Core System

Vector Storage: High-performance storage for vector embeddings
Indexing: Multiple indexing options (HNSW, Faiss) for efficient similarity search
Distributed Architecture: Sharding and replication for scalability and fault tolerance
Configuration Management: Flexible configuration system with environment variables and config files

Plugins

WDBX includes several plugins for integration with external services:

Plugin	Description	Status
WebScraper	Web content extraction and analysis	Stable
Ollama	Local LLM integration via Ollama API	Stable
LMStudio	OpenAI-compatible local API integration	Stable
Discord	Chat integration with Discord	Stable
Twitch	Twitch chat and API integration	Stable
YouTube	YouTube data and analytics	Stable
SocialMedia	Cross-platform social media integration	Stable

Utilities

Visualization: Tools for visualizing vector spaces and relationships
Security: Authentication, encryption, and access control features
API Server: RESTful API for remote access to WDBX functionality
CLI: Command-line interface for easy management

Documentation

Comprehensive documentation is available in the docs directory:

API Reference: Detailed class and method references
Plugin System: How the plugin system works
Security Guide: Authentication and encryption features
Visualization Guide: Tools for visualizing vector data
CLI Reference: Command-line interface documentation

Development

To set up the development environment:

# Clone the repository
git clone https://github.com/donaldfilimon/wdbx-py.git
cd wdbx-py

# Create and activate a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install development dependencies
pip install -r requirements.txt -U

# Set up pre-commit hooks
pre-commit install

Testing

Run the test suite:

# Run core tests
pytest

# Run plugin-specific tests
python wdbx/tests.test_core.py -v
python wdbx/tests.test_plugins.py -v

Contributing

Contributions are welcome! Please see our Contributing Guide for details.

License

WDBX is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
config		config
examples		examples
tests		tests
venv		venv
wdbx		wdbx
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WDBX: Vector Database for AI Applications

Features

Installation

Quick Start

Basic Usage

Asynchronous API

Using Plugins

Using the CLI

Starting the API Server

Components

Core System

Plugins

Utilities

Documentation

Development

Testing

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

donaldfilimon/wdbx-py

Folders and files

Latest commit

History

Repository files navigation

WDBX: Vector Database for AI Applications

Features

Installation

Quick Start

Basic Usage

Asynchronous API

Using Plugins

Using the CLI

Starting the API Server

Components

Core System

Plugins

Utilities

Documentation

Development

Testing

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages