Pattern Tracking - Machine Learning
Over the last few years, the computing power of GPUs has increased rapidly, enabling us to compute complex operations in parallel at high speed. Combined with the availability of Big Data, this has made Machine Learning, and in particular convolutional neural networks, increasingly important.
Identifying objects in complex images plays a fundamental role in Machine Learning and is already used in many ways to optimise processes. During my year at EPFL I had the chance to participate in many interesting courses about Machine Learning. In this article I want to show the process of modifying an existing state-of-the-art Machine Learning algorithm, named openpifpaf, to identify predefined patterns in images. This model can then be used for a "followMe" mode on a robot. The project is based on the EPFL course "Data and Artificial Intelligence for Transportation".
The first step in modifying an existing Machine Learning algorithm is to analyse its function and outputs. In this case, the openpifpaf architecture outputs a multi-person 2D human pose estimation, as shown below. In the next step, we therefore want to change this output to an object detection task.
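To get a feel for this default output, the short sketch below runs the openpifpaf predictor on a single image and prints the detected keypoints. It is a minimal sketch, assuming the openpifpaf Python API (version 0.12 or later); the checkpoint name and image file are just examples.

```python
# Minimal sketch: inspect openpifpaf's default multi-person pose output.
# Assumes openpifpaf >= 0.12 and Pillow are installed; 'shufflenetv2k16'
# and 'example.jpg' are placeholder names.
import openpifpaf
import PIL.Image

predictor = openpifpaf.Predictor(checkpoint='shufflenetv2k16')

image = PIL.Image.open('example.jpg').convert('RGB')
predictions, gt_anns, image_meta = predictor.pil_image(image)

# Each prediction is one detected person with COCO keypoints (x, y, confidence).
for i, annotation in enumerate(predictions):
    keypoints = annotation.data  # numpy array of shape (num_keypoints, 3)
    visible = (keypoints[:, 2] > 0).sum()
    print(f'person {i}: {visible} visible keypoints')
```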
In our second step we dive into the data input structure of the given architecture. There we can observe that the architecture is based on keypoint detection: the algorithm takes keypoints such as "left shoulder" or "right leg" as input and tries to learn these specific points for multiple objects in an image. The information about the keypoints and many other parameters is provided through the open-source COCO dataset. Since we do not need 15 keypoints (left shoulder, right leg, etc.) for each object, we reduce the number of keypoints per object to one. This single keypoint is located at the center of the object and indicates its class (person, car, etc.).
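A hypothetical sketch of this reduction step is shown below: it collapses a COCO-style annotation to a single center keypoint derived from the bounding box. The helper reduce_to_center is our own illustration, not part of openpifpaf or the COCO tools.

```python
# Hypothetical sketch: collapse a COCO-style annotation to a single "center"
# keypoint per object, derived from its bounding box. Field names follow the
# standard COCO annotation format.
def reduce_to_center(coco_annotation):
    x, y, w, h = coco_annotation['bbox']        # COCO bbox: top-left corner + size
    cx, cy = x + w / 2.0, y + h / 2.0           # object center
    return {
        'image_id': coco_annotation['image_id'],
        'category_id': coco_annotation['category_id'],  # class: person, car, ...
        'bbox': coco_annotation['bbox'],
        'keypoints': [cx, cy, 2],                # single keypoint, visibility flag 2
        'num_keypoints': 1,
    }
```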
If you just need an object detector, you are done at this point. But since our goal is to design a model that detects a very specific pattern, we need to make one more modification. Instead of using the objects in the dataset, we create our own pattern and paste this image at random positions onto the images of the dataset, as sketched below. As input for the architecture we again only use the location of the center of the pasted image. In a final step we simply train this algorithm on around 40,000 images for around 5 epochs, which already yields very good results.
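The following sketch illustrates how such a synthetic sample could be generated with Pillow: the pattern is pasted at a random position and the center of the pasted region is recorded as the single keypoint. File names, the category id, and the annotation layout are assumptions for illustration, not the exact pipeline used in the project.

```python
# Hypothetical sketch of the synthetic data step: paste a predefined pattern
# at a random position on a background image and record the pasted center as
# the single training keypoint. Assumes the pattern is smaller than the image.
import random
import PIL.Image

def paste_pattern(background_path, pattern_path, image_id):
    background = PIL.Image.open(background_path).convert('RGB')
    pattern = PIL.Image.open(pattern_path).convert('RGBA')

    max_x = background.width - pattern.width
    max_y = background.height - pattern.height
    x, y = random.randint(0, max_x), random.randint(0, max_y)

    background.paste(pattern, (x, y), pattern)   # use the alpha channel as mask

    cx, cy = x + pattern.width / 2.0, y + pattern.height / 2.0
    annotation = {
        'image_id': image_id,
        'category_id': 1,                        # single "pattern" class
        'keypoints': [cx, cy, 2],                # center keypoint, visible
        'num_keypoints': 1,
    }
    return background, annotation
```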
As a result, we built in a few steps a very well performing model that is able to track a predefined pattern in an unknown environment. In this case we did not even need to create our own architecture, but instead used an existing open-source, state-of-the-art architecture. For more projects and a more detailed report, please check my GitHub (https://github.com/yklose).