Location Enabling AI without Computer Vision
The Question
In our previous work at Pixel8earth and Snap we spent a lot of time trying to drive down the cost of city-scale Visual Positioning Systems (VPS) for Augmented Reality (AR). While we were able to reduce the cost of compute and enable the use of commodity cameras, there is no way around the fact that perpetually mapping the earth in 3D is resource intensive and expensive.
As we’ve worked on improving positioning accuracy with Zephr, a little voice has kept asking whether it could enable AR without computer vision and VPS. Is it possible to construct a geographic pose with just the sensors on a phone and/or smart glasses?
The Background
The question has become particularly intriguing with Ray-Bans by Meta quickly surpassing sales of mixed reality headsets.
While the progress by Meta and Snap on true AR glasses is incredible, a low-cost and fashionable product doesn’t look imminent. In addition, there is a growing sentiment that “The coolest thing about smart glasses is not the AR. It’s the AI.” This sentiment further refined our question: can we location enable AI for portable compute?
The Challenge
In order to location enable AI we need to be able to determine what the user is looking at. In computer vision parlance this means determining the “geographic pose” of the user. To do so we need highly precise geographic coordinates (latitude, longitude, altitude) as well as orientation (pose). Both geographic location and orientation are available on smartphones today, but they are notoriously inaccurate, especially in urban areas. This was one of the primary drivers for creating VPS. As we’ve been building, the team has pushed to see if software-augmented GPS + IMU could deliver a viable geographic pose. Based on our latest testing, the answer looked to be yes. So we built an app to test the concept.
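To make the idea concrete, here is a minimal sketch of what a geographic pose and a viewshed probe could look like, assuming position comes from (software-augmented) GPS and orientation from the IMU and compass. The names and the flat-earth math are illustrative only, not Zephr's implementation.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_M = 6_371_000  # mean Earth radius; fine for short-range math

@dataclass
class GeoPose:
    lat: float      # degrees, from GPS
    lon: float      # degrees, from GPS
    alt: float      # meters, from GPS
    heading: float  # degrees clockwise from true north, from IMU/compass
    pitch: float    # degrees above the horizon, from IMU

def look_at_point(pose: GeoPose, distance_m: float) -> tuple[float, float]:
    """Project a point `distance_m` ahead along the view direction.

    Uses a flat-earth approximation, which is adequate for the tens of
    meters a viewshed query cares about.
    """
    ground_range = distance_m * math.cos(math.radians(pose.pitch))
    d_north = ground_range * math.cos(math.radians(pose.heading))
    d_east = ground_range * math.sin(math.radians(pose.heading))
    d_lat = math.degrees(d_north / EARTH_RADIUS_M)
    d_lon = math.degrees(d_east / (EARTH_RADIUS_M * math.cos(math.radians(pose.lat))))
    return pose.lat + d_lat, pose.lon + d_lon
```

The better the underlying position and heading, the tighter the probe, which is why the accuracy of the GPS + IMU solution is the whole ballgame here.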
The App
Given the lack of smart glasses SDKs, we opted for a smartphone-based demonstration. The goal is simple: point your phone at a “place” and see if the app can determine what you are looking at. Specifically, the app queries the excellent Google Places API as a function of the viewshed given by the geographic pose. Then we have a Large Language Model (LLM), ChatGPT, generate a custom profile for the place. Last but not least, we let the user ask the LLM ad hoc questions about the “place”. The video below shows the app in action:
[Video: the app in action]
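For a sense of how the pieces fit together, here is a rough sketch of that flow: probe the viewshed with the geographic pose, ask Google Places what sits there, and build a prompt for the LLM. It reuses the GeoPose sketch above; the endpoint and fields come from the public Places Nearby Search API, but the ranking, error handling, and the app's actual logic are omitted.

```python
import requests

PLACES_URL = "https://maps.googleapis.com/maps/api/place/nearbysearch/json"

def place_in_view(pose: GeoPose, api_key: str, radius_m: int = 50) -> dict | None:
    """Ask Google Places what sits where the user is looking."""
    lat, lon = look_at_point(pose, distance_m=30)  # probe ~30 m ahead
    resp = requests.get(PLACES_URL, params={
        "location": f"{lat},{lon}",
        "radius": radius_m,
        "key": api_key,
    })
    results = resp.json().get("results", [])
    return results[0] if results else None  # naive: take the top hit

def profile_prompt(place: dict) -> str:
    """Build the kind of prompt an LLM could turn into a custom profile."""
    return (
        f"Write a short profile of {place['name']} at "
        f"{place.get('vicinity', 'an unknown address')} for someone "
        "standing in front of it. Mention what it is and why it matters."
    )
```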
For a smart glasses use case we’d leverage the IMU from the glasses for orientation and the position from the phone for location, as sketched below. Then we’d use audio for the interactions: “Glasses, what place am I looking at?” The possibilities of location enabling AI-powered smart glasses are super exciting. Audio is a wonderful interface and format, and it is developing quickly. Features like routing and tours work exceptionally well with this combination of location, AI and audio immersion. The opportunities for gaming are also quite exciting.
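The sensor split is simple to express, again reusing the GeoPose sketch above. The dictionaries here stand in for whatever the eventual smart glasses SDKs expose; they are placeholders, not a real API.

```python
def pose_from_glasses_and_phone(phone_fix: dict, glasses_imu: dict) -> GeoPose:
    """Fuse phone position and glasses orientation into one geographic pose."""
    return GeoPose(
        lat=phone_fix["lat"],
        lon=phone_fix["lon"],
        alt=phone_fix["alt"],
        heading=glasses_imu["heading"],  # where the head is pointed
        pitch=glasses_imu["pitch"],
    )
```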
The Long Term
Removing the need for computer vision as the primary layer for geographic interaction can really drive down the cost of smart glasses and the requisite infrastructure. Street view capture for the world is incredibly expensive. Processing global- or city-scale feature databases to power VPS systems takes a tremendous amount of compute and energy. Constantly running the camera to operate a VPS is a significant battery drain.
One of the most precious commodities for smart glasses is battery life. This is a big reason GPS is often left off early smart glasses initiatives. In Zephr’s work with IoT partners we’ve discovered that moving the GPS solver to the cloud, and using the local GPS chip just to send raw satellite measurements, can significantly reduce battery consumption while improving accuracy. Given the near future where constellations like AST and Starlink can provide 5G connectivity to devices anywhere in the world, this approach could hold promise.
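An illustrative sketch of that cloud-solver pattern: the device ships raw satellite observables instead of running the position solver locally. The endpoint and payload shape here are hypothetical, not Zephr's actual API.

```python
import requests

SOLVER_URL = "https://solver.example.com/v1/fix"  # hypothetical endpoint

def request_fix(raw_measurements: list[dict], device_id: str) -> dict:
    """Send raw GNSS observables (e.g. pseudorange, carrier phase, Doppler
    per satellite) to a cloud solver and get a computed position back."""
    payload = {"device": device_id, "measurements": raw_measurements}
    resp = requests.post(SOLVER_URL, json=payload, timeout=5)
    resp.raise_for_status()
    # e.g. {"lat": ..., "lon": ..., "alt": ..., "accuracy_m": ...}
    return resp.json()
```

The radio transmission is cheap relative to running a full solver on-device, and the cloud solver can bring corrections and models to bear that a phone chip cannot.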
Last but not least, none of this means abandoning AR. You can use a geographic pose generated with sensors to render AR objects on a device, and it is possible to provide occlusion using the vast 3D building databases available today. Will sensors alone solve all the problems? No. But they can provide an important bridge between the demo-able and the deployable. I think the biggest lesson in the success of Ray-Bans by Meta and the struggle of mixed reality to reach mainstream adoption is the importance of practicality over gadgetophilia. The future is exciting, but there is a lot of opportunity for location enabling AI that consumers are hungry for today.
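Even a sensor-only geographic pose is enough to place geo-anchored content. As a minimal flat-earth sketch, reusing the GeoPose and EARTH_RADIUS_M names from earlier, here is how an object's coordinates could be turned into the camera-relative east/north/up offsets a rendering engine needs:

```python
import math

def geo_to_enu(pose: GeoPose, obj_lat: float, obj_lon: float,
               obj_alt: float) -> tuple[float, float, float]:
    """Offsets in meters from the camera to the object: (east, north, up)."""
    d_lat = math.radians(obj_lat - pose.lat)
    d_lon = math.radians(obj_lon - pose.lon)
    north = d_lat * EARTH_RADIUS_M
    east = d_lon * EARTH_RADIUS_M * math.cos(math.radians(pose.lat))
    up = obj_alt - pose.alt
    return east, north, up
```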