Skip to content
View akhilpatlolla's full-sized avatar
:octocat:
:octocat:

Organizations

@NIU-DATA-Lab

Block or report akhilpatlolla

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
akhilpatlolla/README.md

Hi πŸ‘‹, I'm Akhil Patlolla

Senior Data Engineer | Cloud Data Architect | ML Infrastructure Builder

akhilpatlolla


πŸ™‹β€β™‚οΈ About Me

  • πŸ”­ I’m currently building real-time, scalable data lakes using Apache Iceberg, Airflow, Glue, and AWS.
  • 🌱 Exploring advanced ML Infra for Retrieval-Augmented Generation (RAG) using vector databases and LLMs (OpenAI, Bedrock).
  • πŸ’¬ Ask me about Cloud Data Pipelines, Streaming Infrastructure, and Lakehouse Architectures.
  • ⚑ Fun fact: I love designing reusable data frameworks, optimizing cost-performance, and mentoring data engineers.

πŸ§‘β€πŸ’» Technical Stack

πŸ‘¨β€πŸ’» Programming & Scripting

Python TypeScript JavaScript SQL

☁️ Cloud Platforms

AWS Azure GCP

πŸ“¦ Infrastructure as Code

CDK Terraform Pulumi

πŸ” Orchestration & ETL

Apache Airflow AWS Glue EMR

πŸ›’οΈ Databases & Warehousing

Redshift Snowflake DynamoDB PostgreSQL MySQL

πŸ“Š BI & Visualization

Tableau Power BI QuickSight

🧠 ML/AI Enablers

LangChain OpenAI Pinecone Bedrock


πŸ“ˆ GitHub Stats


πŸ† GitHub Trophies


πŸ† Contributions & Highlights

  • πŸ›  Built a generic ETL utility on AWS Glue integrating MySQL, Oracle, Redshift, and SQL Server.
  • 🧊 Adopted Apache Iceberg to modernize data storage and time-travel capabilities.
  • πŸ“¦ Delivered reusable ingestion framework with config-driven design for fast analytics onboarding.
  • βš™οΈ Migrated legacy pipelines to Pulumi/CDK, increasing infra reproducibility and cost transparency.
  • πŸ”„ Implemented scalable Redshift & Snowflake data models (Star/Snowflake schemas).
  • πŸš€ Architected streaming + batch systems using Kinesis, EventBridge, and Lambda.
  • πŸ”’ Enforced strong data security practices β€” encryption, access controls, masking.
  • πŸ“‘ Integrated 8+ third-party SaaS APIs (Salesforce, Sage Intacct, Import.io) for real-time business analytics.

πŸ“« Connect With Me


β€œTurning complexity into clarity β€” one pipeline at a time.”

Pinned Loading

  1. OpenWeatherMap OpenWeatherMap Public

    Jupyter Notebook

  2. Generic_ETL_Utility_AWS_GLUE Generic_ETL_Utility_AWS_GLUE Public

    AWS Glue - Incremental Pull Script

    Python

  3. Random-Good-Data-Science-Stuff Random-Good-Data-Science-Stuff Public

    Random Good Data Science Stuff

    Jupyter Notebook 1

  4. spack/spack spack/spack Public

    A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

    Python 4.9k 2.4k

  5. Research-Assistant Research-Assistant Public

    custom Conversational and QnA BOT which will summarize document content for you.

    TypeScript 1

  6. Match-Prediction-in-DOTA-2 Match-Prediction-in-DOTA-2 Public

    Match Prediction using Random Forest Classification in Defense of the Ancients 2(DOTA 2)

    R