Skip to content

koachgg/Projects

Repository files navigation

Projects

A repository of all the small and personal projects created by Belo Abhigyan

Project 2 : Suicidal Intention Detection in Text Using BERT-Based Transformers - Fine Tuning Based

Objective:

Developed a suite of BERT-based transformer models to detect suicidal intention in textual data, contributing to efforts to prevent suicide by analyzing social media posts for signs of ideation.

Dataset:

The dataset was sourced from the "SuicideWatch" subreddit on Reddit, consisting of 232,074 posts categorized into suicide and non-suicide classes. It was stratified into training (162,451) and testing (69,623) sets using stratified random sampling.

Preprocessing:

  • Text preprocessing included:

  • Converting capital letters to lowercase.

  • Removing broken Unicode, URLs, and extra spaces.

  • Expanding contractions and correcting special characters.

  • Filtering out HTML tags, punctuation, and emoticons.

  • Implemented the BERT preprocessing module for additional processing and tokenization.

    Models Implemented:

  • BERT-Based Models: Utilized various pre-trained transformer models to classify text based on suicidal intention.

  • BERT-Base

  • ALBERT

  • BERT Experts

  • BERT with Talking-Heads Attention and Gated GELU

  • ELECTRA Each model was fine-tuned on the dataset with a one-cycle learning rate policy and specific configurations such as maximum length, batch size, and transformer parameters.

Results:

  • The BERT with Talking-Heads Attention and Gated GELU model achieved the highest performance with 90.64% training accuracy and 90.27% testing accuracy, along with a response time of 12 seconds for detecting suicidal intention in text.
  • Other models also performed well:
  1. BERT-Base: 89.75% training accuracy and 89.49% testing accuracy.
  2. ALBERT: 88.34% training accuracy and 87.85% testing accuracy.
  3. BERT Experts: 88.83% training accuracy and 87.58% testing accuracy.
  4. ELECTRA: 87.17% training accuracy and 87.06% testing accuracy.

Model performance was evaluated using metrics such as accuracy, precision, recall, and F1-score.

Tools & Technologies Used:

  • Languages: Python
  • Libraries: TensorFlow, PyTorch, Hugging Face Transformers
  • Platforms: Google Colab Pro with T4 GPU and 51GB RAM

Project 3 : MILK QUALITY DETECTION USING MACHINE LEARNING ALGORITHMS

Objective:

Developed a machine learning model to predict and prevent milk spoilage, reducing financial losses and mitigating health risks.

Dataset :

Utilized a dataset from Kaggle with 1059 rows and 8 columns, consisting of seven independent features: pH, temperature, taste, odor, fat content, turbidity, and color. The target variable was the milk grade, classified into low, medium, and high.

ML Techniques :

Implemented multiple machine learning algorithms for milk quality classification, including:

  • AdaBoost
  • Artificial Neural Networks (ANN)
  • Support Vector Machines (SVM)
  • Random Forest (RF)
  • K-Nearest Neighbors (KNN)
  • XGBoost
  • Gradient Boosting (GBM)
  • Decision Trees (DT)

Data pre-processing involved:

  • Label encoding of categorical data.
  • Feature scaling using Min-Max scaling and z-score scaling.
  • Addressed skewness with PowerTransformer.
  • Achieved data normalization through feature-wise scaling to ensure model convergence and prevent feature bias

Results:

  • AdaBoost outperformed other models with a classification accuracy of 99.9%.
  • ANN achieved a classification accuracy of 95.4%.
  • Random Forest, XGBoost, and KNN also showed high accuracy scores of 98.58%.
  • Utilized confusion matrices to visualize and compare results among different algorithms.

Tools & Technologies Used:

  • Languages: Python
  • Libraries: scikit-learn, pandas, numpy, matplotlib, seaborn, xgboost
  • Platforms: Kaggle

Project 4 : Helmet Detection using Deep Learning

Objective

Implemented a deep learning-based helmet detection system to identify whether individuals in images or videos are wearing helmets, providing a valuable tool for safety monitoring in construction sites, sports events, and other domains.

Technologies & Libraries:

  • Python,
  • TensorFlow,
  • Keras
  • OpenCV
  • PIL (Python Imaging Library),
  • sklearn
  • numpy
  • pandas
  • matplotlib

Key Features:

  • Deep Learning Architecture: Developed a convolutional neural network (CNN) for object detection, specifically designed to detect helmets. Utilized TensorFlow and Keras for model training and optimization.
  • Dataset Preparation: Created utilities for collecting and annotating helmet images. Implemented data augmentation techniques and divided the dataset into training and validation sets to improve model robustness.
  • Model Training: Trained the helmet detection model with configurable hyperparameters. Applied iterative training to enhance model accuracy and generalization.
  • Model Evaluation and Metrics: Incorporated evaluation metrics such as precision, recall, and F1-score to assess the model's performance, providing detailed insights into detection accuracy and effectiveness.
  • Inference on Images and Videos: Enabled real-time or batch processing of visual data for helmet detection. Implemented functionalities to draw bounding boxes around detected helmets or generate heatmaps for visualization.
  • Visualization and Reporting: Utilized matplotlib and OpenCV to create informative visualizations, including annotated images with detected helmets, precision-recall curves, and detection heatmaps.
  • Customization and Integration: Designed the project with a modular structure, allowing for easy customization and integration into existing applications. Supported extensions for specific helmet detection requirements and integration into real-time monitoring systems.
  • Documentation and Examples: Provided comprehensive documentation with step-by-step instructions for installation, usage, and customization. Included examples for training the model, applying it to images/videos, and adapting to different datasets or scenarios.

Outcome:

The project resulted in a robust helmet detection system capable of real-time inference with high accuracy, providing a valuable safety monitoring tool. The flexible design and comprehensive documentation make it suitable for a wide range of applications and adaptable to various environments.

Use Cases:

  • Construction site safety monitoring
  • Sports event safety compliance
  • Industrial safety checks

About

A repository of all the small and personal projects created by me

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages