Skip to content
View Marc-Aradillas's full-sized avatar

Block or report Marc-Aradillas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Marc-Aradillas/README.md

👋🏼 Hi, I'm Marc Anthony Aradillas

Cloud Engineer | Data & Analytics Engineering | Software Developer
U.S. Air Force Veteran | B.S. Computer Science (Software Engineering)

I'm a technologist with experience in cloud operations (AWS), data engineering pipelines, analytics workflows, and applied machine learning. I enjoy solving complex problems, building clean and reliable systems, and turning raw data into something meaningful.

My background includes:

  • Cloud engineering (AWS EC2, S3, RDS, IAM, SSM, automation)
  • Data engineering & analytics using Python, SQL, and ETL pipelines
  • Machine learning and NLP projects built end-to-end
  • Software engineering foundations (OOP, web apps, secure coding, Git)
  • Military discipline, leadership, and mission-critical operations

I am actively expanding into Analytics Engineering and Data Engineering, including Palantir Foundry concepts, modern pipelines, and scalable data systems.


🔧 Technologies & Tools

Languages & Scripting

Python • SQL • C • C++ • JavaScript • Bash

Data / ML

Pandas • NumPy • Scikit-Learn • NLP (NLTK / spaCy) • Matplotlib • Seaborn

Cloud & DevOps

AWS (EC2, S3, IAM, SSM) • Linux • Git/GitHub • Containers • CI/CD foundations

Analytics / Engineering

ETL Pipelines • Data Modeling • Jupyter • APIs • JSON/XML • Automation


📌 Featured Projects

🔹 1. GitHub Language Prediction (NLP Project)

Machine learning model that predicts the primary programming language of GitHub repos using web-scraped text and NLP.

  • Python • Pandas • NLP • ML • BeautifulSoup
  • Identifies tech stacks using regex + TF-IDF + classification

➡️ Useful to show you can build ML + pipelines end-to-end.


🔹 2. Restaurant Inspection Score Prediction (NYC Open Data)

Forecasts restaurant health inspection results using ML and sentiment extraction from violations.

  • EDA, cleaning, NLP, XGBoost
  • Demonstrates strong applied DS + data engineering

🔹 3. Tesla Stock Price Prediction (Time Series)

Full ML workflow predicting monthly TSLA closing prices.

  • ARIMA-like pipeline • Visualization • Feature engineering

🔹 4. Treasure Hunt Reinforcement Learning Agent

Solves a grid-based environment using RL concepts.


🔹 5. System Analysis & Design (DriverPass Project)

UML diagrams, system architecture, and process flows.


🎯 What I'm Learning Now

  • Data engineering workflows (ETL, modeling, orchestration)
  • Palantir Foundry concepts (Ontologies, Code Workbooks, Pipeline orchestration)
  • Advanced Python engineering practices
  • Cloud automation + infrastructure concepts

📫 Connect With Me

📍 Dallas–Fort Worth
LinkedIn: https://www.linkedin.com/in/marc-aradillas
Email: aradillasm@gmail.com


Profile Activity

Pinned Loading

  1. github_nlp_project github_nlp_project Public

    A complete NLP pipeline that predicts the primary programming language used in GitHub repositories. The project includes automated web scraping, text extraction from READMEs and source files, featu…

    Jupyter Notebook

  2. Somerville-2023/New-York-health-Inspection-Prediction Somerville-2023/New-York-health-Inspection-Prediction Public

    Predicts NYC restaurant health inspection outcomes by integrating structured and unstructured data. Includes ingestion from NYC Open Data, advanced cleaning pipelines, feature engineering, and sent…

    Python

  3. Tesla-Monthly-Closing-Price-Prediction Tesla-Monthly-Closing-Price-Prediction Public

    A time-series ML project predicting monthly TSLA closing prices. Includes historical financial data ingestion, trend decomposition, feature engineering, and baseline forecasting models.

    Jupyter Notebook

  4. TreasureHunt-PirateAgent TreasureHunt-PirateAgent Public

    A reinforcement-learning agent designed to navigate a grid-based treasure hunt environment. The notebook demonstrates environment design, reward modeling, exploration strategies, and model training.

    Jupyter Notebook

  5. System-Analysis-and-Design System-Analysis-and-Design Public

    A complete system design package for a mock DriverPass application. Includes UML diagrams, workflow documentation, architecture decisions, and requirements mapping.

  6. CS470-FS-Cloud CS470-FS-Cloud Public

    Course capstone demonstrating principles of full-stack cloud development. Focuses on building and deploying an application using cloud-native services and modern development patterns.