ç™»å½•æŸ¥çœ‹æ›´å¤šå†…å®¹

Building a cashier-less checkout GPT

Inder Singh

Just build

å‘å¸ƒæ—¥æœŸ: 2024å¹´1æœˆ13æ—¥

Back in 2019, I worked on a startup, we were building technology to provide a pickup and go checkout experience for retail customers. The core elements to build this system included solving the following problems -

Track individuals with overhead depth sensing cameras - We solved this through object detection(SSD and YOLO models) and object tracking(graph theory algorithms that join tracked objects across frames) using Intel Real Sense cameras. You can see the output here - Object Tracking
Find out who picked up what - We solved this using pose estimation(who was near the object) and object detection(which object got picked up). You can see the output in this video - Pose Estimation

We had to build different models for each element that included object tracking, object detection, pose estimation and orchestrate the same through a distributed engineering system.

In 2024, I tried the same with GPT-V and I could get most of elements to work with zero short learning using simple prompts as shown below -

AI that is capable of language understanding, human like perception and in some cases reasoning has arrived. The future is to reduce cost/request and deploy this at scale.

Simplifying Complexity

1,924 ä½å…³æ³¨è€…

è®¢é˜…

è¦æŸ¥çœ‹æˆ–æ·»åŠ è¯„è®ºï¼Œè¯·ç™»å½•

Inder Singhçš„æ›´å¤šæ–‡ç«

The intuition and nuts & bolts of transformers explained visually

2024å¹´2æœˆ13æ—¥

The intuition and nuts & bolts of transformers explained visually

The whole memo in PDF form is available here -â€¦

3 æ¡è¯„è®º
Introducing your personal email GPT

2024å¹´1æœˆ20æ—¥

Introducing your personal email GPT

In a past memo, I discussed using GPT to enhance your experience with unstructured data, like emails, photos, andâ€¦
A vision for online shopping powered by GPT

2024å¹´1æœˆ16æ—¥

A vision for online shopping powered by GPT

Tenet Show not tell - This will help the reader understand that the future is already here. Let's go! These optionsâ€¦

1 æ¡è¯„è®º
How to get better at R&D as an organization?

2023å¹´10æœˆ7æ—¥

How to get better at R&D as an organization?

Many organizations start out full of energy and creativity, without a clear plan. This can be uncomfortable, but it's aâ€¦
Unlocking the Power of Strategy: How Everyday Choices Drive Remarkable Returns

2023å¹´8æœˆ16æ—¥

Unlocking the Power of Strategy: How Everyday Choices Drive Remarkable Returns

The significance of strategy becomes evident when we dissect the saying "All returns in life come from compounding"â€¦
Unveiling an Effective Analytical Framework: Navigating Success and Operationalizing Principles

2023å¹´7æœˆ15æ—¥

Unveiling an Effective Analytical Framework: Navigating Success and Operationalizing Principles

A significant portion of success is determined by your daily routine. If you don't actively consider what and how youâ€¦

1 æ¡è¯„è®º
How should business/product/tech leaders think when the bar to build an Alexa/Siri like platform for your business is now english?

2023å¹´4æœˆ30æ—¥

How should business/product/tech leaders think when the bar to build an Alexa/Siri like platform for your business is now english?

If you are a leader involved in defining/leading strategy, product, technology, or operations in a business, it'sâ€¦

1 æ¡è¯„è®º
How to think about the Generative AI market?

2023å¹´4æœˆ23æ—¥

How to think about the Generative AI market?

One key insight/learning that product and tech folks should instill is that a great explosive market matters more thanâ€¦
What will be the trend in GPT - bigger models or infra optimization?

2023å¹´4æœˆ21æ—¥

What will be the trend in GPT - bigger models or infra optimization?

To be right in using SOTA AI, one should be close to numbers to avoid fatal outcomes. When I was building Pikup.

2 æ¡è¯„è®º
ChatGPT - A master at data comprehension, a pretty good analyst and an entry level Data Scientist

2023å¹´4æœˆ17æ—¥

ChatGPT - A master at data comprehension, a pretty good analyst and an entry level Data Scientist

Chilling on a Sunday, I thought i'll take Chat-GPT for a spin to uncover how good it can understand structure fromâ€¦

6 æ¡è¯„è®º

See all articles

Building a cashier-less checkout GPT

Inder Singh

Just build

Simplifying Complexity

1,924 ä½å…³æ³¨è€…

Inder Singhçš„æ›´å¤šæ–‡ç«

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Artificial Intelligence #37

?? AI in the News: A Plethora of New Tools, Can AI Keep Our Secrets?, and the Meaning of Life (Nothing Less!)

Reasoning and Simulations.

DeepSeek and the Week That Changed AI

Our new cutting-edge multi-language model for Agentic AI use cases

#E1I39: Savory Silicon Servings

AI and the Magnificent 7: How Everybody Else can NOT get left in the dust......

Chat GPT | Mass market adoption within days...

DeepSeek aftershocks: Innovation, efficiency, market shifts, and geopolitics

Intelligencer Vol 4:

Simplifying Complexity

1,924 ä½å…³æ³¨è€…

Inder Singhçš„æ›´å¤šæ–‡ç«

The intuition and nuts & bolts of transformers explained visually

Introducing your personal email GPT

A vision for online shopping powered by GPT

How to get better at R&D as an organization?

Unlocking the Power of Strategy: How Everyday Choices Drive Remarkable Returns

Unveiling an Effective Analytical Framework: Navigating Success and Operationalizing Principles

How should business/product/tech leaders think when the bar to build an Alexa/Siri like platform for your business is now english?

How to think about the Generative AI market?

What will be the trend in GPT - bigger models or infra optimization?

ChatGPT - A master at data comprehension, a pretty good analyst and an entry level Data Scientist

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Artificial Intelligence #37

?? AI in the News: A Plethora of New Tools, Can AI Keep Our Secrets?, and the Meaning of Life (Nothing Less!)

Reasoning and Simulations.

DeepSeek and the Week That Changed AI

Our new cutting-edge multi-language model for Agentic AI use cases

#E1I39: Savory Silicon Servings

AI and the Magnificent 7: How Everybody Else can NOT get left in the dust......

Chat GPT | Mass market adoption within days...

DeepSeek aftershocks: Innovation, efficiency, market shifts, and geopolitics

Intelligencer Vol 4:

1,924 ä½å…³æ³¨è€…

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†