Skip to content

AHANAAF/CCFD-ML

Repository files navigation

The notebook provides a step-by-step guide on how to import data, perform exploratory data analysis (EDA), and apply machine learning models to detect fraudulent transactions.

The key aspects of the analysis include:

  1. Data Importation and Preprocessing:

    • The script imports data, handles missing values, and checks for class imbalances.
    • The dataset used contains transactions, including 492 fraudulent transactions out of 284,807 total transactions, making it highly imbalanced.
  2. Exploratory Data Analysis (EDA):

    • Visualizations are used to explore the distribution of transactions and correlations between features.
    • KDE plots and a heatmap help identify patterns and correlations in the data.
  3. Modeling:

    • Two models are applied: Isolation Forest and Local Outlier Factor (LOF).
    • These models are unsupervised and are commonly used for anomaly detection, which is suitable given the imbalanced nature of the data.
    • The script evaluates the performance of the models using metrics like accuracy, precision, and recall.
  4. Evaluation:

    • The script calculates the performance of the models using the f1-score, precision, recall, and accuracy metrics.
    • The Isolation Forest method produces better results in detecting fraudulent transactions.

This project appears to be a good practice in handling imbalanced datasets and applying machine learning models for anomaly detection. If you need any further assistance with this project, such as optimizing the model or understanding specific parts of the code, feel free to ask!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published