This repository utilizes deep learning models for video processing tasks, focusing on face detection, gender classification, and indoor/outdoor scene classification. It includes implementations using OpenCV and pretrained models such as MobileNetV2.
For face detection and gender classification, OpenCV and deep learning models were employed. The models achieve significant accuracy in identifying faces and classifying genders in images and video streams.
checkout this notebook for demo Face detection and gender classification.ipynp
Using a pretrained MobileNetV2 model on the SUN dataset, this project performs indoor/outdoor scene classification. The model was trained on 60,000 images, achieving high accuracy rates:
- Training Accuracy: 95.25%
- Validation Accuracy: 93.33%
- Testing Accuracy: 93.66%
Complete training details can be found in the Jupyter Notebook training classification model.ipynb
Here, 1000 videos have been tested for human detection, gender classification, indoor/outdoor classification, and also to provide video information such as resolution, frame rates, and duration. testing on 1000 videos.ipynb
Model weights are available (idod.weights.h5).
To use this repository, follow these steps:
-
Clone the repository:
git clone https://github.com/kishan9999/video-processing-using-deep-learning.git
-
Install the required dependencies:
pip install -r requirements.txt
Ensure the following packages are installed:
- pandas==2.0.3
- tensorflow==2.15.0
- opencv-python==4.8.0.76
- numpy==1.25.2
- matplotlib==3.7.1
- flask==2.2.5
-
Download All weights and place it in the weights folder
-
For video processing, use
single_inference.py:python single_inference2.py --video_path
OR replace
pathwith your video file in python single_inference.py.
run this code video_info.py to display information about the video file.

For detailed documentation, refer to Document
This project is licensed under the MIT License.
- Kishan Joshi
- Keras Models
- SUN dataset
- Age Gender Code Guide
- Age Gender Code Github
- Dataset used for action recognition: UCF101 Action Recognition


