Monocular Depth Estimation
Purchasing a 3D camera is a costly endeavour. A cheaper alternative is to pair two inexpensive cameras and estimate depth with the stereo camera technique. However, there are five common methods for estimating depth that do not require a stereo camera at all.
(a) Monocular Depth Estimation: This method estimates depth from a single camera using a variety of computer vision techniques. Supervised learning approaches, in which deep convolutional neural networks (CNNs) are trained on large datasets to predict depth from a single image, are among the most popular.
(b) Focus-based Depth Estimation: This technique gauges depth by examining how focus varies across an image. By analyzing the sharpness or blur at different locations, the depth of objects in the scene can be inferred (see the sharpness sketch after this list).
(c) Motion-based Depth Estimation: This approach estimates depth from the motion observed between successive frames of a video. By analyzing the optical flow or motion vectors, depth can be estimated using techniques such as structure from motion or visual odometry (see the optical-flow sketch after this list).
(d) LiDAR or Time-of-Flight (ToF) Sensors: LiDAR (Light Detection and Ranging) sensors and ToF cameras emit laser or infrared light and measure the time taken for the light to bounce back from objects in the scene. This round-trip time can be used to estimate depth.
(e) Depth from Defocus: This technique estimates depth by analyzing the defocus blur in an image. By capturing multiple images of the same scene with different focus settings, depth can be estimated based on the amount of defocus blur in each image.
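As a concrete illustration of the focus-based ideas in (b) and (e), sharpness can be quantified with the variance of the Laplacian: in-focus regions produce stronger second-derivative responses than blurred ones. Below is a minimal Python/OpenCV sketch; the file name and the left/right split of the image are placeholders for illustration.

    import cv2

    def sharpness(gray_patch):
        # Variance of the Laplacian: higher values mean sharper content,
        # i.e. the patch lies nearer the focal plane
        return cv2.Laplacian(gray_patch, cv2.CV_64F).var()

    img = cv2.imread("scene.jpg", cv2.IMREAD_GRAYSCALE)  # placeholder file name
    h, w = img.shape
    left = sharpness(img[:, : w // 2])
    right = sharpness(img[:, w // 2 :])
    print(f"left sharpness = {left:.1f}, right sharpness = {right:.1f}")

Comparing such scores across different focus settings, as in (e), turns relative blur into a relative depth ordering.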
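For the motion-based approach in (c), dense optical flow provides a simple cue: under a translating camera, nearby points move farther between frames than distant ones (motion parallax). The sketch below uses OpenCV's Farneback optical flow; the frame file names are placeholders, and treating inverse flow magnitude as relative depth assumes a purely translating camera and a static scene.

    import cv2
    import numpy as np

    # Two consecutive frames from a moving camera (placeholder file names)
    prev = cv2.imread("frame_0.jpg", cv2.IMREAD_GRAYSCALE)
    curr = cv2.imread("frame_1.jpg", cv2.IMREAD_GRAYSCALE)

    # Dense optical flow; flow[y, x] holds the (dx, dy) displacement per pixel
    flow = cv2.calcOpticalFlowFarneback(prev, curr, None,
                                        0.5, 3, 15, 3, 5, 1.2, 0)
    magnitude = np.linalg.norm(flow, axis=2)

    # Larger flow implies a closer point, so inverse magnitude
    # serves as a rough relative-depth map
    relative_depth = 1.0 / (magnitude + 1e-6)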
Let's discuss monocular depth estimation in more depth. Its applications include robotics, autonomous vehicles, augmented reality, and 3D reconstruction.
The goal of this computer vision method is to infer a scene's depth information from a single photograph. In other words, it is the task of determining an object's distance within a scene using only one camera viewpoint.
Depth estimation is an essential first step in deriving scene geometry from 2D images. Given only one RGB image as input, the objective of monocular depth estimation is to infer depth information, i.e. to predict the depth value of each pixel. The example below demonstrates how a depth estimation model can be built using a convolutional neural network and basic loss functions.
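Below is a minimal sketch of such a model in Keras. It is illustrative rather than definitive: the input size, the layer widths, the sigmoid output (which assumes depth maps normalized to [0, 1]), and the combination of an L1 term with a structural-similarity (SSIM) term in the loss are all assumptions chosen for brevity; practical models typically use a deeper encoder-decoder such as a U-Net with tuned loss weights.

    import tensorflow as tf
    from tensorflow.keras import layers

    def build_depth_model(height=256, width=256):
        inputs = layers.Input(shape=(height, width, 3))
        # Encoder: progressively downsample the image
        x = layers.Conv2D(32, 3, strides=2, padding="same", activation="relu")(inputs)
        x = layers.Conv2D(64, 3, strides=2, padding="same", activation="relu")(x)
        x = layers.Conv2D(128, 3, strides=2, padding="same", activation="relu")(x)
        # Decoder: upsample back to the input resolution
        x = layers.Conv2DTranspose(64, 3, strides=2, padding="same", activation="relu")(x)
        x = layers.Conv2DTranspose(32, 3, strides=2, padding="same", activation="relu")(x)
        x = layers.Conv2DTranspose(16, 3, strides=2, padding="same", activation="relu")(x)
        # One depth value per pixel, assumed normalized to [0, 1]
        outputs = layers.Conv2D(1, 1, activation="sigmoid")(x)
        return tf.keras.Model(inputs, outputs)

    def depth_loss(y_true, y_pred):
        # L1 term keeps predictions close to ground truth;
        # SSIM term encourages locally consistent structure
        l1 = tf.reduce_mean(tf.abs(y_true - y_pred))
        ssim = tf.reduce_mean(1.0 - tf.image.ssim(y_true, y_pred, max_val=1.0))
        return l1 + ssim

    model = build_depth_model()
    model.compile(optimizer="adam", loss=depth_loss)

Training then only requires pairs of RGB images and normalized ground-truth depth maps, e.g. model.fit(images, depths).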
The task is difficult because it requires the model to understand the complex relationships between the objects in the scene and the associated depth information, which are influenced by several factors such as texture, occlusion, and illumination. Compared to stereo cameras or depth sensors, depth information is lost when a scene is recorded as a 2D image.
Below are four photographs of a yellow cylinder, taken at distances ranging from 18 cm down to 9 cm from its base. One advantage of monocular depth estimation here is that it suppresses the background noise.
There is no obvious change in the colour of the cylinder as the distance shrinks, but the background becomes bluer at short distances. The intensity of colour is therefore a function of distance and can be used to estimate depth, as the sketch after the captions below illustrates.
(1) Distance of cylinder from base = 18 cm
(2) Distance of cylinder from base = 15 cm
(3) Distance of cylinder from base = 12 cm
(4) Distance of cylinder from base = 9 cm
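As a rough sketch of that idea, one could measure the mean blue-channel intensity of a background region in each photograph and fit a simple linear model against the four known distances. The file names, the choice of background region, and the linear form of the model are all assumptions for illustration; a real system would calibrate this relationship carefully.

    import cv2
    import numpy as np

    # Known cylinder-to-base distances (cm) and placeholder names
    # for the four photographs above
    distances = np.array([18, 15, 12, 9], dtype=float)
    files = ["cylinder_18cm.jpg", "cylinder_15cm.jpg",
             "cylinder_12cm.jpg", "cylinder_9cm.jpg"]

    blue_means = []
    for f in files:
        img = cv2.imread(f)                   # OpenCV loads images as BGR
        background = img[0:50, :, :]          # assume the top rows show only background
        blue_means.append(background[:, :, 0].mean())

    # Fit distance ≈ a * blue_intensity + b
    a, b = np.polyfit(blue_means, distances, deg=1)
    print(f"estimated distance = {a:.3f} * blue_intensity + {b:.3f}")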
It is noteworthy that although these methods can yield depth estimates without the need for a stereo camera, their accuracy and robustness may be limited depending on the particular application and environmental conditions.