Bigbytes is a hybrid framework for transforming and integrating data. It combines the best of both worlds: the flexibility of notebooks with the rigor of modular code.
- Extract and synchronize data from third-party sources.
- Transform data with real-time and batch pipelines using Python, SQL, and R.
- Load data into your data warehouse or data lake using our pre-built connectors.
- Run, monitor, and orchestrate thousands of pipelines without losing sleep.
The recommended way to install the latest version of Bigbytes is through Docker:

```bash
docker pull getbigbytes/bigbytes:latest
```
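Once the image is pulled, starting the container is typically a single `docker run`. A minimal sketch, assuming the web UI listens on port 6789 (the port and default command are assumptions, not confirmed here; check the documentation for the exact invocation):

```bash
# Sketch: run Bigbytes and expose its web UI.
# Port 6789 is an assumption for illustration only.
docker run -it -p 6789:6789 getbigbytes/bigbytes:latest
```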
You can also install Bigbytes using pip or conda, though this may cause dependency issues without a properly isolated environment:

```bash
pip install bigbytes
```

```bash
conda install -c conda-forge bigbytes
```

Looking for help? The fastest way to get started is by checking out our documentation.
Looking for quick examples? Open a demo project right in your browser or check out our guides.
Build and run a data pipeline with our demo app.
WARNING
The live demo is public to everyone, so please don't save anything sensitive (e.g., passwords or secrets).
A sample data pipeline defined across 3 files ➝
- Load data ➝
```python
import polars as pl


@data_loader  # decorator provided by Bigbytes at runtime
def load_csv_from_file() -> pl.DataFrame:
    # Read the raw Titanic dataset from the project directory.
    return pl.read_csv('default_repo/titanic.csv')
```
- Transform data ➝
```python
import polars as pl


@transformer  # decorator provided by Bigbytes at runtime
def select_columns_from_df(df: pl.DataFrame, *args) -> pl.DataFrame:
    # Keep only the columns needed downstream.
    return df[['Age', 'Fare', 'Survived']]
```
- Export data ➝
```python
import polars as pl


@data_exporter  # decorator provided by Bigbytes at runtime
def export_titanic_data_to_disk(df: pl.DataFrame) -> None:
    # polars uses write_csv, not pandas' to_csv.
    df.write_csv('default_repo/titanic_transformed.csv')
```
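Put together, the three blocks form a linear load ➝ transform ➝ export pipeline that Bigbytes wires up for you. For illustration, the same flow reduces to a short standalone polars script (a sketch reusing the paths from the example above; it assumes polars is installed and the input CSV exists):

```python
# Sketch: the demo pipeline's three steps chained as plain Python.
import polars as pl

df = pl.read_csv('default_repo/titanic.csv')           # load
df = df[['Age', 'Fare', 'Survived']]                   # transform
df.write_csv('default_repo/titanic_transformed.csv')   # export
```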