“Do you see what I see?”: Taking enterprise computer vision to edge
Gerard Suren Saverimuthu
Regional Technical Leader based in Singapore | Helping clients to infuse Hybrid Cloud and AI for digital transformation | Cyclist and Photographer
Until November 2019, much of the IBM PowerAI Vision (PAIV) capability was focused on training (unique NVLink, Large Model Support, Distributed Deep Learning, etc.) and ease of use. Now PAIV has extended its reach to the edge with a well-thought-out offering: the announcement of IBM Visual Inspector (IVI). I did some light reading over the weekend and learned about IVI. It is a native iOS/iPadOS application complementary to PAIV (https://www.ibm.com/sg-en/marketplace/ibm-powerai-vision) and geared towards enterprise users at the edge. IVI uses models trained on PAIV and performs inferencing using the integrated camera on an iOS/iPadOS device and the CoreML* models stored locally on the edge device.
Anyone with access to the Apple App Store can download IVI (https://apps.apple.com/sg/app/ibm-visual-inspector/id1486600972) as a free app, and it's ready for a demo with two preinstalled inference models. The demo models are part of the downloaded app and do not require access to PowerAI Vision (i.e. a connection to the server side).
IVI capabilities:
- Users can gather data (in 'collection' mode) and perform inferencing (in 'inspect' mode)
- Inferencing can be done in disconnected mode (the CoreML model is stored locally on the mobile device) or connected mode (data is uploaded to the server)
- Remote management of connected devices and CoreML models
- The device can be placed in a ‘kiosk mode’ where the end user can access only IVI on the mobile device.
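For readers curious what on-device inferencing with a locally stored CoreML model looks like in practice, here is a minimal, illustrative Swift sketch using Apple's Vision framework. This is not IVI's actual implementation; `MyInspectionModel` is a placeholder for whatever class Xcode generates from an exported .mlmodel file.

```swift
import Vision
import CoreML
import UIKit

// Illustrative sketch: classify a single frame with a locally stored CoreML model.
// `MyInspectionModel` is a placeholder; Xcode generates the real class from the
// .mlmodel file bundled with the app.
func classify(image: CGImage) throws {
    let model = try VNCoreMLModel(for: MyInspectionModel().model)
    let request = VNCoreMLRequest(model: model) { request, _ in
        guard let results = request.results as? [VNClassificationObservation],
              let top = results.first else { return }
        // Top prediction, e.g. "scratch" with a confidence score
        print("\(top.identifier): \(top.confidence)")
    }
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([request])
}
```

Because Vision handles resizing and cropping of the input image to match the model's expected input, a sketch like this stays short; the work is in training and exporting a good model on the PAIV side.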
What I like:
- Inferencing is possible in both connected and disconnected modes. The disconnected mode is very handy when a user (e.g. an insurance agent visiting a disaster zone, a healthcare worker visiting a remote village) has to work in areas with no or limited connectivity.
- IVI has individual authorizations for classes, devices, and models (e.g. person A can have authorization for model X while person B can have authorization to access models X and Y, etc.)
- You can push the annotated data back to the server for new model development or for improving existing models
- When in connected mode, the app sends data back to the server. If the app loses its connection to the server, it caches the data locally and sends it back at a later time.
Use cases
Here are some ideas:
- Manufacturing: Quality inspection across manufacturing industries. AI-based inferencing in this context delivers higher-quality products faster through feedback and alerts. This workflow is also a way to continuously improve model accuracy.
- Healthcare: Capturing images related to a symptom or treatment in remote areas where communication is a challenge.
- Financial services: Insurance claims inspections (e.g. a field worker inspecting hurricane damage)
- Construction industry: inspection of buildings and surroundings.
At the end of the day, the CoreML models have to be within the recommended sizes (see notes) for these use cases to be executed at the edge.
Requirements, Licenses:
- Access to an on-premises or cloud instance of PAIV v1.1.5. Production use requires a PAIV license.
- An Apple mobile device running iOS 13.1 or later. A license is required for each mobile device on which IVI will be used in production.
Further reading:
1. Article by Scott Soutter: https://www.dhirubhai.net/pulse/ibm-visual-inspector-computer-vision-ai-inference-your-scott-soutter/
2. Announcement of IVI: https://www.ibm.com/downloads/cas/EM-ENUSZP19-0592-CA
3. IBM Knowledge Center: https://www.ibm.com/support/knowledgecenter/SSRU69_1.1.5/base/vision_inspector.html
Next step:
Seeing is believing. If you haven’t seen IVI in action, please get in touch with me and I will be happy to share more details and/or show a demo.
What use case do you want to explore?
Notes:
- Video streaming is not supported in the initial release.
- As I understand it, an object detection model on the edge device can be a maximum of 60 MB, and classification models can be up to 10 MB in size.
- * CoreML is a machine learning framework introduced by Apple. It lets you integrate trained models into your iOS apps and run inference on-device.
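As a quick sanity check against the sizes above, a build or deployment step might verify the exported model file before pushing it to devices. A minimal Swift sketch follows; note that the 60 MB / 10 MB figures are my understanding rather than official limits, and the function name is illustrative.

```swift
import Foundation

// Sketch: check an exported CoreML model file against the (assumed)
// per-type size limits before pushing it to edge devices.
let limits: [String: Int] = [
    "detection": 60 * 1024 * 1024,      // object detection: up to ~60 MB
    "classification": 10 * 1024 * 1024  // classification: up to ~10 MB
]

func fitsOnDevice(modelPath: String, type: String) -> Bool {
    guard let limit = limits[type],
          let attrs = try? FileManager.default.attributesOfItem(atPath: modelPath),
          let size = attrs[.size] as? Int else {
        return false  // unknown type or unreadable file: reject
    }
    return size <= limit
}
```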