GitHub - mac999/computer_vision_deeplearning: computer vision based on deep learning lecture materials

Computer Vision with Deep Learning

This repository provides in-depth lecture materials on computer vision using deep learning. It is systematically structured to guide learners through the entire workflow of a computer vision developer, from fundamental theory to real-world application development.

This course goes beyond simply running pre-existing code. The core objective is to foster a deep understanding by having you implement the internal mechanisms of key deep learning models—such as CNN, ResNet, R-CNN, and YOLO—from the ground up. With hands-on exercises in PyTorch and Keras, you will gain proficiency in translating complex theories into functional code.

Moreover, you will experience the complete development lifecycle: from setting up a professional development environment (including tools like VS Code, Docker, and CUDA) and optimizing model performance through hyperparameter tuning, to ultimately building and deploying a computer vision web service using YOLO and Flask. These materials are designed to provide a solid foundation for your journey to becoming a professional in the field of deep learning and computer vision.

History

Aug 2022: Started preparing lecture materials upon request for a practical course on deep learning and computer vision technology for industry professionals.
Sep 2022: Recorded, reviewed, and supplemented lecture videos based on the prepared materials.
Oct 2022: Developed comprehensive lecture materials for training developers in deep learning and computer vision applications.

Core Concepts & Code:
- Provides hands-on source files (Python, Jupyter Notebook) for key computer vision models like CNN, R-CNN, ResNet, YOLOv1, YOLOv3, and YOLOv5 using Keras and PyTorch.
- Explains and implements the core mechanisms of deep learning architectures from the ground up.
Development Environment Setup:
- Covers how to set up a deep learning development environment using essential tools such as VS Code, Ubuntu, CUDA, Virtual Environments, Docker, and Google Colab.
Model Optimization:
- Demonstrates practical hyperparameter tuning techniques to improve model performance.
Application Development:
- Guides you through building a simple computer vision web service using YOLO, Flask, and Python.

Collaboration & Research

This repository is part of my ongoing work on AI, LLMs, and Transformer-based architectures. I am open to research collaboration, academic exchange, and joint projects with universities, public institutions, company and research labs.

For collaboration inquiries, please feel free to reach out: 📧 [[email protected]] | 🌐 [LinkedIn or Personal Website]

Reference

The Fastest Object Detection Algorithm YOLO! A Complete Guide Through Model Implementation, FAST CAMPUS

License

This repository is licensed under the MIT License. You are free to use, modify, and distribute the code for personal or commercial projects.

Author

Ph.D, Taewook Kang ([email protected])

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
0_tools		0_tools
1_data		1_data
2_vision		2_vision
3_classification		3_classification
4_object_detection		4_object_detection
README.md		README.md
Vision and YOLO.pdf		Vision and YOLO.pdf
app_example.PNG		app_example.PNG
fastcamp1.png		fastcamp1.png
fastcamp2.png		fastcamp2.png
fastcamp3.png		fastcamp3.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Computer Vision with Deep Learning

History

Contents

Collaboration & Research

Reference

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Languages

mac999/computer_vision_deeplearning

Folders and files

Latest commit

History

Repository files navigation

Computer Vision with Deep Learning

History

Contents

Collaboration & Research

Reference

License

Author

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages