DeepLearning_ComputerVision

Deep Learning for Computer Vision

1. Automatic_Image_Captioning

This project implements a CNN-RNN model for automatic image captioning. The model takes an image as input and generates a sequence of text that describes the image content.

Key features:

Uses a pre-trained ResNet model as the CNN backbone.
Employs a LSTM network as the RNN for sequence generation.
Trained on the COCO dataset.

Usage:

Install the required dependencies (e.g., TensorFlow, Keras, OpenCV).
Download the pre-trained weights.
Run the Image_Captioning.ipynb script to generate captions for images.

2. Human Intrusion Detection with Real-time Tracking

This project implements a real-time human intrusion detection system using a YOLOv3 deep learning model. It utilizes OpenCV for video processing and object tracking. Key functionalities include:

Human Detection: Detects humans within an image/video stream. Object Tracking: Tracks the detected humans using a Euclidean distance tracker.
Real-time Intrusion Detection: Defines a Region of Interest (ROI) and triggers an alert if a human enters the ROI.
Data Recording: Records human trajectories including bounding box coordinates and frame numbers for further analysis (optional).

Features:

Utilizes YOLOv3 model for efficient human detection.
Employs Euclidean distance tracker for robust human tracking.
Supports real-time video processing with ROI definition.
Generates human trajectory data (optional).

Requirements:

Python 3.x
OpenCV
NumPy
Tensorflow/Keras (for custom model usage)
YOLOv3 pre-trained weights and configuration files

Usage:

Install the required libraries.
Download the YOLOv3 pre-trained weights and configuration files (coco.names, yolov3-320.cfg, yolov3-320.weights).
Define the ROI coordinates in the code (refPt variable).
Run the script: python human_intrusion_detection.py

Note:

This project can be extended to support additional object classes by modifying the required_class_index list and potentially retraining the YOLOv3 model.
The script currently saves human trajectories to a CSV file ("Trajectory.csv"). This functionality can be disabled by commenting out the relevant lines.
This project provides a starting point for building a real-time human intrusion detection system with tracking capabilities.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
1_Automatic_Image_Captioning_PyTorch		1_Automatic_Image_Captioning_PyTorch
2_Human_Intrusion_Detection		2_Human_Intrusion_Detection
3_Image_Caption_Generator_TensorFlow		3_Image_Caption_Generator_TensorFlow
1_Linear_Regression_for_Predicting_Price_of_second_hand_Cars_by_Neuralearn_ai.ipynb		1_Linear_Regression_for_Predicting_Price_of_second_hand_Cars_by_Neuralearn_ai.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DeepLearning_ComputerVision

1. Automatic_Image_Captioning

2. Human Intrusion Detection with Real-time Tracking

About

Uh oh!

Releases

Packages

Languages

SohaibAShah/DeepLearning_ComputerVision

Folders and files

Latest commit

History

Repository files navigation

DeepLearning_ComputerVision

1. Automatic_Image_Captioning

2. Human Intrusion Detection with Real-time Tracking

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages