CoreVisionTasks

This repository contains my implementation of a multi-part computer vision course project completed during my university studies. The tasks and datasets were provided by the instructors, while all implementation work was done independently unless otherwise noted.

The project spans a range of classical and modern computer vision techniques, from 3D estimation using depth data to face recognition and object detection.

Figure 1: Box estimation result
Figure 2: Enhanced HDR image result
Figure 3: Face recognition result
Figure 4: Object detection result (balloon)

🧩 Project Tasks (in Course Order)

1. 3D Box Estimation from Kinect Data

Goal: Estimate real-world box dimensions using Kinect amplitude and depth images.
Key Features:
- RANSAC-based plane detection (floor and box top)
- Morphological filtering and mask creation
- Largest connected component extraction
- Corner detection and computation of box length, width, and height
Tools: numpy, scipy, matplotlib, OpenCV, Jupyter Notebook
Implementation
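
As an illustration of the RANSAC-based plane detection step, the following is a minimal sketch and not the repository's actual code. It assumes the depth image has already been converted to an (N, 3) point cloud; the function name `fit_plane_ransac` and the parameters `n_iters`, `threshold`, and `seed` are hypothetical.

```python
import numpy as np

def fit_plane_ransac(points, n_iters=200, threshold=0.01, seed=0):
    """Fit a plane n.x + d = 0 to an (N, 3) point cloud with RANSAC.

    Hypothetical helper: the iteration count and the inlier threshold
    (in the same units as the points) are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(points), dtype=bool)
    best_model = None
    for _ in range(n_iters):
        # Three random points define a candidate plane.
        sample = points[rng.choice(len(points), size=3, replace=False)]
        normal = np.cross(sample[1] - sample[0], sample[2] - sample[0])
        norm = np.linalg.norm(normal)
        if norm < 1e-12:
            continue  # degenerate (collinear) sample, try again
        normal = normal / norm
        d = -normal @ sample[0]
        # Point-to-plane distance for every point in the cloud.
        inliers = np.abs(points @ normal + d) < threshold
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
            best_model = (normal, d)
    return best_model, best_inliers
```

Under these assumptions, the floor could be found by running the function on the full cloud, and the box top by running it again on the points that were not floor inliers. The box height would then be the distance between the two roughly parallel planes.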

2. Image Demosaicing & HDR Imaging

Goal: Convert raw sensor data into enhanced RGB images using demosaicing and HDR techniques.
Key Features:
- Bayer pattern analysis and sensor linearity validation
- Demosaicing (basic and advanced)
- Gamma correction and gray-world white balancing
- HDR fusion using exposure stacking and iCAM06 tone mapping
Tools: rawpy, numpy, matplotlib, Python
Implementation
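
The gray-world white balancing and gamma correction steps can be sketched as below. This is a simplified illustration, assuming a demosaiced, linear RGB image stored as a float array in [0, 1]; the function names and the default gamma of 2.2 are assumptions, not values taken from the project.

```python
import numpy as np

def gray_world_white_balance(rgb):
    """Scale each channel so that all channel means match the global mean.

    Gray-world assumption: the average colour of the scene is neutral gray.
    """
    means = rgb.reshape(-1, 3).mean(axis=0)
    # Guard against division by zero for an all-black channel.
    gains = means.mean() / np.maximum(means, 1e-12)
    return rgb * gains

def gamma_correct(rgb, gamma=2.2):
    """Apply display gamma to linear values (gamma=2.2 is an assumed default)."""
    return np.clip(rgb, 0.0, 1.0) ** (1.0 / gamma)
```

White balancing is applied while the data is still linear, and gamma correction comes last, because the per-channel gains are only meaningful on values proportional to light intensity.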

3. Writer Retrieval using VLAD Encoding

Goal: Identify writers from handwritten historical document images.
Dataset: ICDAR17 Historical Writer Identification (WI) Dataset
Key Features:
- Codebook generation with MiniBatchKMeans
- VLAD encoding and power normalization
- Exemplar SVM-based classification
- PCA whitening and Multi-VLAD (bonus experiments)
Tools: scikit-learn, numpy, OpenCV
Implementation
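
For reference, VLAD encoding with power normalization can be sketched as follows. This is a minimal NumPy version, not the repository's implementation. It assumes the codebook `centers` is already available (for example from the `cluster_centers_` attribute of a fitted scikit-learn `MiniBatchKMeans`); the function name `vlad_encode` and the exponent 0.5 are illustrative assumptions.

```python
import numpy as np

def vlad_encode(descriptors, centers, power=0.5):
    """Encode (N, D) local descriptors against a (K, D) codebook as a K*D VLAD vector."""
    # Hard-assign every descriptor to its nearest cluster center.
    d2 = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    assign = d2.argmin(axis=1)

    # Sum the residuals (descriptor minus center) per cluster.
    vlad = np.zeros(centers.shape, dtype=float)
    for k in range(centers.shape[0]):
        members = descriptors[assign == k]
        if len(members):
            vlad[k] = (members - centers[k]).sum(axis=0)
    vlad = vlad.ravel()

    # Power normalization (assumed exponent 0.5), then L2 normalization.
    vlad = np.sign(vlad) * np.abs(vlad) ** power
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad
```

Power normalization dampens components dominated by bursts of similar descriptors, so a few repeated strokes do not dominate the encoding of a page.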

4. Face Recognition with Open-Set Evaluation

Goal: Build a face recognition system for known and unknown individuals.
Key Features:
- Face detection, tracking, and alignment
- Open-set recognition using SVM classifiers
Implementation
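
One common way to make SVM-based recognition open-set is to reject a face as unknown when no classifier is confident enough. The sketch below illustrates that idea only; this README does not state how the project rejects unknown faces, so the thresholding rule, the function name `open_set_predict`, and the default threshold are all assumptions. The score matrix is assumed to hold one decision value per known identity, such as the output of an SVM's decision function.

```python
import numpy as np

def open_set_predict(scores, class_names, threshold=0.0, unknown="unknown"):
    """Label each row with its best-scoring class, or `unknown` if below threshold.

    scores: (N, C) array of per-class decision values (assumed input format).
    threshold: assumed rejection threshold; would be tuned on validation data.
    """
    scores = np.asarray(scores, dtype=float)
    best = scores.argmax(axis=1)
    best_score = scores[np.arange(len(scores)), best]
    return [
        class_names[i] if s >= threshold else unknown
        for i, s in zip(best, best_score)
    ]
```

For example, with the known identities `["alice", "bob"]`, a row of scores in which every value is negative would be labeled `unknown` at the default threshold.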

5. Object Detection with Selective Search

Goal: Implement an object detection pipeline based on region proposals.
Key Features:
- Selective Search for candidate region generation
- Modular pipeline structure
- Visualization and analysis of proposal quality
Implementation
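
Proposal quality is often analyzed with intersection over union (IoU) and the recall of ground-truth boxes. The following is a small sketch of such an analysis, not the project's own evaluation code. Boxes are assumed to be `(x1, y1, x2, y2)` tuples, and the names `iou` and `proposal_recall` as well as the 0.5 threshold are illustrative assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def proposal_recall(gt_boxes, proposals, iou_threshold=0.5):
    """Fraction of ground-truth boxes covered by at least one proposal."""
    if not gt_boxes:
        return 0.0
    hits = sum(
        any(iou(gt, p) >= iou_threshold for p in proposals) for gt in gt_boxes
    )
    return hits / len(gt_boxes)
```

Tracking recall as the number of proposals grows shows how many Selective Search regions are needed before most objects are covered.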

🛠️ Technologies & Libraries

Languages & Environments: Python 3.8 or later, Jupyter Notebook
Core Libraries: NumPy, SciPy, OpenCV, scikit-learn, rawpy, matplotlib
