#6 Coding an Object Detection Model with FasterRCNN + InceptionResNetV2
Revisiting the fundamentals of an image
To dive into the basics of image processing, we start by treating an image as a function f, where each pixel holds a value representing its intensity. In grayscale images, this value ranges from 0 to 255, where 0 signifies full black and 255 denotes full white. Hence, f(x,y) gives us the intensity at a pixel location (x,y).
When we talk about coloured images, each pixel has three channels, one each for Red, Green and Blue. So, a pixel is no longer a single value between 0 and 255, but a triple of RGB intensities, each in that range.
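The idea above can be sketched with a tiny synthetic image in NumPy (the array values here are made up purely for illustration):

```python
import numpy as np

# A tiny synthetic grayscale image standing in for a real photo:
# f(x, y) is the intensity at pixel (x, y), from 0 (black) to 255 (white).
gray = np.array([
    [0,   64, 128],
    [192, 255, 32],
], dtype=np.uint8)

def f(x, y):
    """Intensity of the grayscale image at column x, row y."""
    return int(gray[y, x])

print(f(0, 0))  # 0   -> full black
print(f(1, 1))  # 255 -> full white

# A coloured image adds a third axis: each pixel is an (R, G, B) triple.
rgb = np.zeros((2, 3, 3), dtype=np.uint8)
rgb[0, 0] = (255, 0, 0)  # a pure red pixel at row 0, column 0
```

Note that NumPy indexes row-first, so `gray[y, x]` corresponds to the function notation f(x, y).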
Common Image Edits
In image manipulation, we typically alter either the range of the image — changing pixel values, and hence its colours — or its domain, moving pixels to new positions without changing their values.
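Both kinds of edit can be shown side by side on a toy array (the specific operations, brightening and flipping, are just illustrative choices):

```python
import numpy as np

img = np.array([[10, 20],
                [30, 40]], dtype=np.uint8)

# Range edit: pixel VALUES change (brighten by 50), positions stay fixed.
# Widen the dtype before adding so values above 255 can be clipped safely.
brightened = np.clip(img.astype(np.int16) + 50, 0, 255).astype(np.uint8)

# Domain edit: pixel POSITIONS change (horizontal flip), values stay fixed.
flipped = img[:, ::-1]
```

After the flip the image contains exactly the same set of values as before, just rearranged, whereas brightening keeps the layout and shifts every value.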
Filters
Filters are a fundamental part of image processing. They alter the pixel values within an image, thereby changing its appearance. Common filters include blurring, sharpening, and edge detection.
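As one concrete example of a filter, here is a minimal mean (box) blur written from scratch; the function name and padding strategy are my own choices, not from the original article:

```python
import numpy as np

def box_blur(img, k=3):
    """Apply a k x k mean (box) filter with edge padding -- a simple blur.

    Each output pixel is the average of the k x k neighbourhood around
    the corresponding input pixel, which smooths out sharp transitions.
    """
    pad = k // 2
    padded = np.pad(img.astype(float), pad, mode="edge")
    out = np.zeros(img.shape, dtype=float)
    # Sum the k*k shifted copies of the image, then divide once.
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)

# A single bright pixel gets spread across its neighbourhood.
spike = np.array([[0, 0, 0],
                  [0, 255, 0],
                  [0, 0, 0]], dtype=float)
blurred = box_blur(spike)
```

The centre value drops from 255 to the neighbourhood mean 255/9, which is exactly the "softening" effect a blur filter produces.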
Project:
Object Detection Model using FasterRCNN+InceptionResNetV2
The InceptionResNetV2 feature extractor was trained on ImageNet and fine-tuned with a FasterRCNN head on the OpenImages V4 dataset, which contains 600 object classes.
The module performs non-maximum suppression internally and outputs at most 100 detections, drawn from the 600 boxable categories.
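To make the non-maximum suppression step concrete, here is a greedy NMS sketch in plain NumPy. This is an illustration of the technique, not the module's actual internal code; the box layout `[ymin, xmin, ymax, xmax]` matches the usual TensorFlow convention:

```python
import numpy as np

def nms(boxes, scores, iou_thresh=0.5, max_out=100):
    """Greedy non-maximum suppression (a sketch, not the module's code).

    boxes:  (N, 4) array as [ymin, xmin, ymax, xmax]
    scores: (N,) confidence per box
    Returns indices of the boxes kept, best-first.
    """
    order = np.argsort(scores)[::-1]  # highest score first
    keep = []
    while order.size and len(keep) < max_out:
        i = order[0]
        keep.append(i)
        # IoU of the top box against all remaining boxes.
        yx1 = np.maximum(boxes[i, :2], boxes[order[1:], :2])
        yx2 = np.minimum(boxes[i, 2:], boxes[order[1:], 2:])
        inter = np.prod(np.clip(yx2 - yx1, 0, None), axis=1)
        area_i = np.prod(boxes[i, 2:] - boxes[i, :2])
        areas = np.prod(boxes[order[1:], 2:] - boxes[order[1:], :2], axis=1)
        iou = inter / (area_i + areas - inter)
        # Drop boxes that overlap the kept box too much.
        order = order[1:][iou <= iou_thresh]
    return keep
```

Two heavily overlapping boxes collapse to the higher-scoring one, while a distant box survives — which is why the detector never reports the same object twice.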
Input and Output
The model takes a three-channel image of variable size as input. The output includes bounding-box coordinates, detection class names, class indices, and detection scores.
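A typical way to consume that output is to keep only confident detections. The snippet below works on a mock of the detector's result dict; the key names follow the TF Hub OpenImages V4 FasterRCNN module's documented outputs, but treat them as assumptions here, and the boxes and labels are invented for illustration:

```python
import numpy as np

# Mock of the detector's output dict (values invented for illustration).
result = {
    # Boxes are [ymin, xmin, ymax, xmax], normalised to [0, 1].
    "detection_boxes": np.array([[0.1, 0.1, 0.5, 0.5],
                                 [0.2, 0.3, 0.9, 0.8]]),
    "detection_class_entities": np.array([b"Dog", b"Person"]),
    "detection_scores": np.array([0.92, 0.18]),
}

def confident_detections(result, min_score=0.5):
    """Return (label, score, box) triples whose score clears the threshold."""
    mask = result["detection_scores"] >= min_score
    return [(entity.decode("utf-8"), float(score), box)
            for entity, score, box in zip(result["detection_class_entities"][mask],
                                          result["detection_scores"][mask],
                                          result["detection_boxes"][mask])]

detections = confident_detections(result)
```

The class entities arrive as byte strings from the module, hence the `decode("utf-8")` before display.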
Implementation
GitHub Repo:
Sources: