Skip to content

mac999/computer_vision_deeplearning

Repository files navigation

Computer Vision with Deep Learning

This repository provides in-depth lecture materials on computer vision using deep learning. It is systematically structured to guide learners through the entire workflow of a computer vision developer, from fundamental theory to real-world application development.

This course goes beyond simply running pre-existing code. The core objective is to foster a deep understanding by having you implement the internal mechanisms of key deep learning models—such as CNN, ResNet, R-CNN, and YOLO—from the ground up. With hands-on exercises in PyTorch and Keras, you will gain proficiency in translating complex theories into functional code.

Moreover, you will experience the complete development lifecycle: from setting up a professional development environment (including tools like VS Code, Docker, and CUDA) and optimizing model performance through hyperparameter tuning, to ultimately building and deploying a computer vision web service using YOLO and Flask. These materials are designed to provide a solid foundation for your journey to becoming a professional in the field of deep learning and computer vision.


History

  • Aug 2022: Started preparing lecture materials upon request for a practical course on deep learning and computer vision technology for industry professionals.
  • Sep 2022: Recorded, reviewed, and supplemented lecture videos based on the prepared materials.
  • Oct 2022: Developed comprehensive lecture materials for training developers in deep learning and computer vision applications.

Contents

This course provides a comprehensive guide to deep learning for computer vision, covering the following topics:

  • Core Concepts & Code:

    • Provides hands-on source files (Python, Jupyter Notebook) for key computer vision models like CNN, R-CNN, ResNet, YOLOv1, YOLOv3, and YOLOv5 using Keras and PyTorch.
    • Explains and implements the core mechanisms of deep learning architectures from the ground up.
  • Development Environment Setup:

    • Covers how to set up a deep learning development environment using essential tools such as VS Code, Ubuntu, CUDA, Virtual Environments, Docker, and Google Colab.
  • Model Optimization:

    • Demonstrates practical hyperparameter tuning techniques to improve model performance.
  • Application Development:

    • Guides you through building a simple computer vision web service using YOLO, Flask, and Python.

Collaboration & Research

This repository is part of my ongoing work on AI, LLMs, and Transformer-based architectures. I am open to research collaboration, academic exchange, and joint projects with universities, public institutions, company and research labs.

For collaboration inquiries, please feel free to reach out: 📧 [[email protected]] | 🌐 [LinkedIn or Personal Website]

Reference

License

This repository is licensed under the MIT License. You are free to use, modify, and distribute the code for personal or commercial projects.

Author

Ph.D, Taewook Kang ([email protected])

About

computer vision based on deep learning lecture materials

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published