Computer Vision Masterclass
About Course
Computer Vision is a subarea of Artificial Intelligence focused on creating systems that can process, analyze and identify visual data in a similar way to the human eye. There are many commercial applications in various departments, such as: security, marketing, decision making and production. Smartphones use Computer Vision to unlock devices using face recognition, self-driving cars use it to detect pedestrians and keep a safe distance from other cars, as well as security cameras use it to identify whether there are people in the environment for the alarm to be triggered.
In this course you will learn everything you need to know in order to get in this world. You will learn the step-by-step implementation of the 14 (fourteen) main computer vision techniques. If you have never heard about computer vision, at the end of this course you will have a practical overview of all areas. Below you can see some of the content you will implement:
- Detect faces in images and videos using OpenCV and Dlib libraries
- Learn how to train the LBPH algorithm to recognize faces, also using OpenCV and Dlib libraries
- Track objects in videos using KCF and CSRT algorithms
- Learn the whole theory behind artificial neural networks and implement them to classify images
- Implement convolutional neural networks to classify images
- Use transfer learning and fine tuning to improve the results of convolutional neural networks
- Detect emotions in images and videos using neural networks
- Compress images using autoencoders and TensorFlow
- Detect objects using YOLO, one of the most powerful techniques for this task
- Recognize gestures and actions in videos using OpenCV
- Create hallucinogenic images using the Deep Dream technique
- Combine style of images using style transfer
- Create images that don’t exist in the real world with GANs (Generative Adversarial Networks)
- Extract useful information from images using image segmentation
You are going to learn the basic intuition about the algorithms and implement some project step by step using Python language and Google Colab
What Will You Learn?
- Understand the basic intuition about Cascade and HOG classifiers to detect faces
- Implement face detection using OpenCV and Dlib library
- Learn how to detect other objects using OpenCV, such as cars, clocks, eyes, and full body of people
- Compare the results of three face detectors: Haarcascade, HOG (Histogram of Oriented Gradients) and CNN (Convolutional Neural Networks)
- Detect faces using images and the webcam
- Understand the basic intuition about LBPH algorithm to recognize faces
- Implement face recognition using OpenCV and Dlib library
- Recognize faces using images and the webcam
- Understand the basic intuition about KCF and CSRT algorithms to perform object tracking
- Learn how to track objects in videos using OpenCV library
- Learn everything you need to know about the theory behind neural networks, such as: perceptron, activation functions, weight update, backpropagation, gradient descent and a lot more
- Implement dense neural networks to classify images
- Learn how to extract pixels and features from images in order to build neural networks
- Learn the theory behind convolutional neural networks and implement them using Python and TensorFlow
- Implement transfer learning and fine tuning to get incredible results when classifying images
- Use convolutional neural networks to classify the following emotions in images and videos: happy, anger, disgust, fear, surprise and neutral
- Compress images using linear and convolutional autoencoders
- Detect objects in images in videos using YOLO, one of the most powerful algorithms today
- Recognize gestures and actions in videos using OpenCV
- Learn how to create hallucinogenic images with Deep Dream
- Learn how to revive famous artists with style transfer
- Create images that don't exist in the real world with GANs (Generative Adversarial Networks)
- Implement image segmentation do extract useful information from images and videos
Course Content
Subtitle Guide – Hướng dẫn thêm phụ đề
01 – Introduction
02 – Face detection
-
-
04:01
-
04:27
-
04:52
-
04:52
-
12:05
-
04:15
-
04:15
-
02:07
-
02:07
-
07:05
-
04:25
-
04:25
-
03:28
-
03:28
-
03:28
-
02:11
-
11:18
-
05:23
-
05:23
-
05:52
-
05:04
-
03:02
-
04:03
-
04:03
03 – Face recognition
-
-
04:30
-
09:26
-
09:56
-
07:53
-
07:53
-
04:38
-
08:22
-
11:24
-
04:36
-
04:12
-
05:53
-
02:56
-
02:56
-
07:11
-
07:11
-
07:52
-
07:52
-
06:49
-
06:49
-
05:58
-
05:57
-
04:14
-
05:44
-
04:48
04 – Object tracking
05 – Neural networks for image classification
-
-
03:04
-
05:16
-
07:19
-
09:41
-
11:29
-
13:21
-
03:52
-
05:01
-
05:28
-
03:59
-
04:40
-
04:54
-
03:53
-
08:56
-
05:45
-
07:27
-
06:30
-
06:23
-
07:39
-
11:16
-
10:47
-
04:37
-
05:00
-
03:05
-
06:34
-
04:21
-
10:51
-
09:59
-
07:07
-
07:29
-
03:54
-
05:08
-
11:45
-
12:38
-
07:06
-
06:17
-
11:02
-
06:20
-
07:53
-
07:53
-
07:07
-
06:57
-
07:07
-
04:16
-
07:29
-
08:35
-
05:16
-
09:27
06 – Convolutional neural networks for image classification
-
-
01:55
-
07:18
-
10:04
-
05:29
-
06:31
-
05:10
-
03:59
-
04:53
-
05:42
-
05:42
-
06:58
-
06:58
-
08:59
-
02:43
-
06:34
-
05:00
-
05:00
07 – Transfer learning and fine tuning
-
-
02:14
-
06:41
-
05:12
-
03:36
-
12:04
-
07:32
-
06:17
-
06:13
-
02:44
-
06:09
-
03:03
-
09:41
08 – Neural networks for classification of emotions
-
-
03:44
-
05:35
-
04:20
-
14:18
-
01:29
-
04:58
-
08:17
-
08:02
-
05:42
-
05:42
-
05:39
09 – Autoencoders
-
-
02:34
-
06:43
-
05:57
-
09:13
-
05:28
-
11:03
-
08:29
-
08:40
-
09:41
-
06:05
-
09:09
-
09:09
-
08:21
-
09:28
-
05:50
-
05:49
10 – Object detection with YOLO
-
-
02:05
-
06:07
-
05:51
-
05:17
-
05:17
-
08:40
-
04:03
-
04:03
-
07:28
-
02:25
11 – Recognition of gestures and actions
-
-
02:56
-
07:02
-
08:34
-
04:32
-
08:42
-
08:42
-
05:17
-
05:53
-
05:53
-
06:06
-
06:35
-
04:17
12 – Deep dream
-
-
02:45
-
06:10
-
06:03
-
06:03
-
08:58
-
06:19
-
07:55
-
09:14
-
05:11
-
01:55
-
01:55
-
03:51
-
02:30
13 – Style transfer
-
-
02:50
-
05:45
-
05:56
-
10:58
-
08:17
-
08:17
-
05:21
-
05:21
-
07:34
-
07:34
-
11:00
-
15:40
-
07:25
-
03:42
-
03:42
-
03:51
-
03:59
14 – GANs (Generative adversarial networks)
-
-
03:18
-
10:48
-
07:14
-
07:14
-
07:53
-
07:53
-
06:52
-
10:19
-
08:41
-
12:30
-
05:37
-
05:37
-
04:24
15 – Image segmentation
-
-
05:19
-
04:39
-
04:39
-
03:49
-
11:33
-
09:01
-
05:34
-
05:34
-
03:12
-
03:11
-
06:23
-
06:25
-
09:17
-
05:40
-
02:41