Skip to content
View HazelAbraham's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report HazelAbraham

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
HazelAbraham/README.md

Hi I'm Hazel Abraham

Data & Analytics Engineer Professional | Building Scalable Data Solutions

LinkedIn GitHub


🎯 About Me

I'm a Data Engineer & Analytics professional with 2+ years of experience building robust data pipelines, automations, and turning complex data into actionable insights. I hold a B.Tech in Mechanical Engineering (2021) and a PGP in Data Science and Engineering from Great Learning.

What I do:

  • πŸ”§ Design and build end-to-end data pipelines on Databricks
  • ⚑ Optimize PySpark & Spark SQL for large-scale data processing
  • πŸ–₯️ Build Databricks Apps and Streamlit applications for data tools
  • πŸ“Š Create interactive dashboards with Power BI
  • πŸ€– Automate workflows using Python & Selenium

What drives me: The challenge of building data systems that teams can depend on - from raw ingestion to polished dashboards. There's something satisfying about transforming chaotic data into clean pipelines and clear insights.


πŸš€ Currently Learning

Deepening my expertise in building production-grade data systems:

Area Focus
Data Pipelines Delta Live Tables, Structured Streaming, Batch & Real-time Ingestion
Databricks Platform Performance Tuning, Unity Catalog, Workflows Orchestration, Databricks Apps
Delta Lake Z-ordering, Compaction, CDC, Liquid Clustering, VACUUM
Spark Optimization Partitioning, Caching, Broadcast Joins, Adaptive Query Execution

πŸ’» Tech Stack

Data Engineering

Databricks Apache Spark Delta Lake Snowflake

Languages

Python SQL PySpark

Cloud & Infrastructure

Azure Azure Event Hub Azure Blob Docker Git

Analytics & Visualization

Power BI Pandas Plotly Matplotlib

Apps & Automation

Streamlit Databricks Apps Selenium FastAPI


πŸ›  Core Skills

β”œβ”€β”€ Data Engineering
β”‚   β”œβ”€β”€ PySpark & Spark SQL
β”‚   β”œβ”€β”€ ETL/ELT Pipeline Design
β”‚   β”œβ”€β”€ Delta Lake & Data Lakehouse
β”‚   β”œβ”€β”€ Data Ingestion (Event Hub, Blob Storage)
β”‚   └── Databricks Workflows & Jobs
β”‚
β”œβ”€β”€ Data Processing
β”‚   β”œβ”€β”€ Large-scale Transformations
β”‚   β”œβ”€β”€ Data Quality & Validation
β”‚   β”œβ”€β”€ Performance Optimization
β”‚   └── Structured Streaming
β”‚
β”œβ”€β”€ Apps & Automation
β”‚   β”œβ”€β”€ Databricks Apps
β”‚   β”œβ”€β”€ Streamlit Applications
β”‚   β”œβ”€β”€ Selenium Web Automation
β”‚   └── Python Scripting
β”‚
└── Analytics & Visualization
    β”œβ”€β”€ Power BI Dashboards
    β”œβ”€β”€ SQL Modeling & Optimization
    β”œβ”€β”€ Python Data Analysis (Pandas, NumPy)
    └── Data Storytelling & Reporting

πŸ“š Areas of Expertise

βœ… Data Pipeline Design - Building reliable ETL/ELT workflows on Databricks
βœ… Databricks Apps & Streamlit - Building internal data tools and applications
βœ… SQL Modeling & Optimization - Complex queries, performance tuning, data modeling
βœ… Data Visualization & BI - Power BI dashboards, Python visualization libraries
βœ… Automation - Selenium web automation, Python scripting to reduce manual work
βœ… Cloud - Azure (Event Hub, Blob Storage), Databricks, Snowflake


πŸŽ“ Education & Certifications

  • PGP in Data Science and Engineering - Great Learning
  • B.Tech in Mechanical Engineering - 2021
  • πŸ“š Preparing for: Databricks Data Engineer Associate Certification

🀝 Open to Collaborate On

  • πŸ”§ Data Pipeline Projects - Building scalable ETL/ELT solutions
  • ⚑ Spark & Delta Lake - Performance tuning and lakehouse patterns
  • πŸ–₯️ Databricks Apps - Building data tools and applications
  • πŸ“Š Analytics & Dashboards - BI solutions and data visualization
  • πŸš€ Open Source - Contributing to data engineering projects

🌱 Beyond Tech

When I'm not building pipelines, you can find me:

  • 🌿 Gardening - Growing vegetables and experimenting with plant care
  • 🧁 Baking - Creating desserts and exploring new recipes
  • πŸ’Ž Diamond Painting - Relaxing with some sparkle therapy

πŸ“Š GitHub Stats

GitHub Stats

Top Languages


"From raw data to reliable pipelines to clear insights." πŸ”§πŸ“Š

Popular repositories Loading

  1. HazelAbraham HazelAbraham Public

    Config files for my GitHub profile.

  2. Data-Visualization-Notes-seaborn. Data-Visualization-Notes-seaborn. Public

    Jupyter Notebook

  3. Machine-learning-Project Machine-learning-Project Public

    Jupyter Notebook

  4. Hackathon Hackathon Public

    Hackathon Conducted by Technology for all

    Jupyter Notebook

  5. Diamond-price-prediction Diamond-price-prediction Public

  6. its-kumar its-kumar Public

    Forked from its-Kumar/its-kumar

    GitHub Special Repository of its-kumar