Skip to content
View ugo96's full-sized avatar

Block or report ugo96

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ugo96/README.md

ugo96

Senior Data Engineer focused on streaming data systems with regular exposure to ML, NLP, and backend engineering.

I design and operate pipelines where reliability, scalability, and correctness matter — often at the intersection of real-time data, machine learning inference, and downstream analytics.


What I work on

  • Real-time and near-real-time data pipelines
  • Streaming architectures (event-driven, stateful processing)
  • Large Scale Batch Processing
  • Formalizing requirements for supporting critical processes
  • ML & NLP integration in production data systems
  • Data quality, observability, and backfills
  • Bridging data engineering with backend services

Core technologies

Languages

  • Python, SQL
  • Java, Scala
  • Rust (systems & streaming)

Streaming & Data

  • Kafka, Redis, RabbitMQ
  • Spark / Flink
  • Airflow / Dagster

ML & NLP

  • Feature & inference pipelines
  • Text processing & embeddings
  • Model integration and monitoring

Storage & Analytics

  • PostgreSQL
  • Cloud object storage (S3/GCS)
  • NoSQL DBs (Cassandra, ScyllaDB, DynamoDB, FireBase)
  • Iceberg / Delta Lake

Infra

  • Docker
  • CI/CD (GitHub Actions)
  • Linux

Current interests

  • Streaming systems in Rust
  • Event-time correctness and state management
  • ML observability in data pipelines
  • Vector search & retrieval in production

Contact

Popular repositories Loading

  1. CreditCard-DataModeling-Classification CreditCard-DataModeling-Classification Public

    Jupyter Notebook 1

  2. papers-we-love papers-we-love Public

    Forked from papers-we-love/papers-we-love

    Papers from the computer science community to read and discuss.

    Shell 1

  3. sqlx sqlx Public

    Forked from launchbadge/sqlx

    🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.

    Rust 1

  4. craigslistscraper craigslistscraper Public

    Forked from ryanirl/CraigslistScraper

    Simple webscraper for Craigslist.

    Python 1

  5. flink flink Public

    Forked from apache/flink

    Apache Flink

    Java 1

  6. sparkcluster-docker sparkcluster-docker Public template

    Python