登录查看更多内容

Image-based 3D Object Reconstruction State-of-the-Art and trends in the Deep Learning Era

Hitesh Jhamtani

Product Consultant | Consulting @ RateGain

发布日期: 2022年10月19日

3D reconstruction is one of the most complex issues of?deep learning systems. There have been multiple types of research in this field, and almost everything has been tried on it — computer vision, computer graphics and machine learning, but to no avail. However, that has resulted in CNN or convolutional neural networks foraying into this field, which has yielded some success.

The Main Objective of the 3D Object Reconstruction

Developing this deep learning technology aims to infer the shape of 3D objects from 2D images. So, to conduct the experiment, you need the following:

Highly calibrated cameras that take a photograph of the image from various angles.
Large training datasets can predict the geometry of the object whose 3D image reconstruction needs to be done. These datasets can be collected from a database of images, or they can be collected and sampled from a video.

By using the apparatus and datasets, you will be able to proceed with the 3D reconstruction from 2D datasets.

State-of-the-art Technology Used by the Datasets for the Reconstruction of 3D Objects

The technology used for this purpose needs to stick to the following parameters:

Input

Training with the help of one or multiple RGB images, where the segmentation of the 3D ground truth needs to be done. It could be one image, multiple images or even a video stream.

The testing will also be done on the same parameters, which will also help to create a uniform, cluttered background, or both.

Output

The volumetric output will be done in both high and low resolution, and the surface output will be generated through parameterisation, template deformation and point cloud. Moreover, the direct and intermediate outputs will be calculated this way.

Network architecture used

Data & Analytics 5 个月前

Top 10 Domains of Deep Learning

Umair Inayat 7 个月前

Demystifying AutoEncoders: The Architects of Data…

Rany ElHousieny, PhD??? 8 个月前

The architecture used in training is 3D-VAE-GAN, which has an encoder and a decoder, with TL-Net and conditional GAN. At the same time, the testing architecture is 3D-VAE, which has an encoder and a decoder.

Training used

The degree of supervision used in 2D vs 3D supervision, weak supervision along with loss functions have to be included in this system. The training procedure is adversarial training with joint 2D and 3D embeddings. Also, the network architecture is extremely important for the speed and processing quality of the output images.

Practical applications and use cases

Volumetric representations and surface representations can do the reconstruction. Powerful computer systems need to be used for reconstruction.

Given below are some of the places where 3D Object Reconstruction Deep Learning Systems are used:

3D reconstruction technology can be used in the Police Department for drawing the faces of criminals whose images have been procured from a crime site where their faces are not completely revealed.
It can be used for re-modelling ruins at ancient architectural sites. The rubble or the debris stubs of structures can be used to recreate the entire building structure and get an idea of how it looked in the past.
They can be used in plastic surgery where the organs, face, limbs or any other portion of the body has been damaged and needs to be rebuilt.
It can be used in airport security, where concealed shapes can be used for guessing whether a person is armed or is carrying explosives or not.
It can also help in completing DNA sequences.

So, if you are planning to implement this technology, then you can rent the required infrastructure from?E2E Networks?and avoid investing in it. And if you plan to learn more about such topics, then keep a tab on the?blog section of the website.?

Reference Links

https://tongtianta.site/paper/68922

https://github.com/natowi/3D-Reconstruction-with-Deep-Learning-Methods

要查看或添加评论，请登录

Hitesh Jhamtani的更多文章

Monitoring Kubernetes with Prometheus and Grafana

2022年11月3日

Monitoring Kubernetes with Prometheus and Grafana

Monitoring your Kubernetes cluster is critical for ensuring that your services are always available and running. And…
DKM Differentiable K-Means Clustering Layer for Neural Network Compression

2022年11月2日

DKM Differentiable K-Means Clustering Layer for Neural Network Compression

DKM casts forth K-means clustering as an attention problem, and then joint optimisation of the DNN parameters and…
Demystifying Cloud GPUs for AI & ML

2022年11月1日

Demystifying Cloud GPUs for AI & ML

https://www.e2enetworks.
Top 5 Open source monitoring tools for Kubernetes

2022年10月31日

Top 5 Open source monitoring tools for Kubernetes

Introduction Distributed computing and orchestration have solved many problems, but they also have created new…
Improving Word Representations via Global Context and Multiple Word Prototypes

2022年10月20日

Improving Word Representations via Global Context and Multiple Word Prototypes

Introduction The improvement in Natural Language Processing (NLP) in modern times is vital in developing Artificial…
Data Incubation— Synthesizing Missing Data For Handwriting Recognition

2022年10月17日

Data Incubation— Synthesizing Missing Data For Handwriting Recognition

Research into the topic of handwritten character recognition spans several disciplines, including AI, CV (Computer…
Using Natural Language Processing to understand and identify risks

2022年10月14日

Using Natural Language Processing to understand and identify risks

Through the analysis of massive volumes of data, machine learning and AI are poised to revolutionize the business and…
GAUDI: A Neural Architect for Immersive 3D Scene Generation

2022年10月13日

GAUDI: A Neural Architect for Immersive 3D Scene Generation

The evolution of artificial intelligence in the past decade has been staggering, and now the focus is shifting towards…
What is RIVA Speech clients container in NVIDIA GPU Cloud?

2022年10月12日

What is RIVA Speech clients container in NVIDIA GPU Cloud?

Introduction: The corporate world has struggled with custom voices that resonate with their brand voice. While the…
Malware: One of the Most Common Types of Cyberattacks

2022年10月11日

Malware: One of the Most Common Types of Cyberattacks

Cybercrime is increasing every year as attackers are getting better at attacking. Cybercriminals try to exploit the…

See all articles

Image-based 3D Object Reconstruction State-of-the-Art and trends in the Deep Learning Era

Hitesh Jhamtani

Product Consultant | Consulting @ RateGain

领英推荐

Hitesh Jhamtani的更多文章

社区洞察

其他会员也浏览了

Understanding Different Multi-GPU Training

Machine Learning – Neural Networks and Artificial Intelligence – Is the situation seen in “The Matrix/Her/Minority Report” becoming a reality?

Deep Learning

Object Detection from Traditional Techniques to Modern Deep Learning Approaches

Automating Neural Network Configuration with Keras-Tuner

How to Use Deep Learning-Based OCR: A Technical Deep-Dive Into Implementation

Scaling of Deep Networks

Applications Deep Learning in GIS

Using Deep Learning to Improve Dead Reckoning IMUs: A Practical Approach

What is Computer Vision??

领英推荐

Hitesh Jhamtani的更多文章

Monitoring Kubernetes with Prometheus and Grafana

DKM Differentiable K-Means Clustering Layer for Neural Network Compression

Demystifying Cloud GPUs for AI & ML

Top 5 Open source monitoring tools for Kubernetes

Improving Word Representations via Global Context and Multiple Word Prototypes

Data Incubation— Synthesizing Missing Data For Handwriting Recognition

Using Natural Language Processing to understand and identify risks

GAUDI: A Neural Architect for Immersive 3D Scene Generation

What is RIVA Speech clients container in NVIDIA GPU Cloud?

Malware: One of the Most Common Types of Cyberattacks

社区洞察

其他会员也浏览了

Understanding Different Multi-GPU Training

Machine Learning – Neural Networks and Artificial Intelligence – Is the situation seen in “The Matrix/Her/Minority Report” becoming a reality?

Deep Learning

Object Detection from Traditional Techniques to Modern Deep Learning Approaches

Automating Neural Network Configuration with Keras-Tuner

How to Use Deep Learning-Based OCR: A Technical Deep-Dive Into Implementation

Scaling of Deep Networks

Applications Deep Learning in GIS

Using Deep Learning to Improve Dead Reckoning IMUs: A Practical Approach

What is Computer Vision??