Project Phoenix Domain

AI_WQ Projects

6 Computer Vision Projects

Project Overview

Project Domain Key Techniques Data Type
Proj1 Image Classification (Wildlife) CNN, Binary/Multiclass Images
Proj2 Transfer Learning (Cassava) Fine-tuning, Callbacks Images
Proj3 Object Detection YOLOv8, Data Augmentation Images + Annotations
Proj4 Face Recognition MTCNN, InceptionResNet, Flask Face Images
Proj5 GANs Generator/Discriminator Training Images
Proj6 Diffusion Models Stable Diffusion, Prompting Text Prompts

Proj1: Image Classification

Camera Traps

CNN from scratch for wildlife camera trap image classification. Covers binary classification (animal/no animal) and multiclass species identification.

Notebooks

Techniques

CNN from scratch Custom training loops Data preprocessing

Proj2: Transfer Learning

Cassava Disease

Fine-tuning pretrained models for cassava leaf disease detection. Demonstrates transfer learning with callbacks for early stopping.

Notebooks

Models

Techniques

EfficientNet ResNet50 VGG16 Early stopping

Proj3: Object Detection

YOLOv8

Object detection using YOLOv8 with custom training and data augmentation pipelines.

Notebooks

Techniques

YOLOv8 Bounding boxes Augmentation pipelines

Proj4: Face Recognition

MTCNN + FaceNet

Face detection and recognition with MTCNN and InceptionResNet, deployed as a Flask API.

Notebooks

Modules

Techniques

MTCNN InceptionResNet Embeddings Flask API

Proj5: Generative Adversarial Networks

GANs

Training GANs with generator and discriminator networks for image generation.

Notebooks

Models

Techniques

DCGAN Generator Discriminator Adversarial training

Proj6: Diffusion Models

Stable Diffusion

Text-to-image generation using Stable Diffusion with prompt engineering techniques.

Notebooks

Techniques

Stable Diffusion Prompt engineering Text-to-image Pipeline configs