AI-Readiness to Address the Missed Promise of Open Data
Arnaud Sahuguet
I invent, architect, build, and ship products that leverage technology to solve meaningful problems and have a large social impact. Currently working on GenAI applied to financial services (hedge fund).
On May 9, 2013, the White House published the "Making Open and Machine Readable the New Default for Government Information" Executive Order, promising that "making information resources easy to find, accessible, and usable can fuel entrepreneurship, innovation, and scientific discovery that improves Americans' lives and contributes significantly to job creation."
Eleven years later, speaking as both an open data producer and an open data consumer, I would argue we are still a long way from that advertised promise.
I am sure some open data pundits will disagree with me, while others have plenty of theories about what went wrong.
If I had to zero in on the root cause, I would blame the original phrasing:
To promote continued job growth, Government efficiency, and the social good that can be gained from opening Government data to the public, the default state of new and modernized Government information resources shall be open and machine readable.
Instead of "machine readable", "machine understandable" should have been used.
You can download CSV files, but you need humans to extract meaning from them or to stitch them together into something coherent. Machines can read them; machines cannot understand them. Even worse, with the advent of large language models (LLMs), machines often misunderstand them.
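To make that point concrete, here is a minimal sketch in Python; the file contents, column names, and values are hypothetical, not taken from any real government dataset. The CSV parses cleanly, which is all "machine readable" asks for, yet nothing in it tells a machine what the columns mean.

```python
import csv
import io

# A hypothetical open-data extract: perfectly machine *readable*...
raw = """area_code,indicator,value,period
36061,UNEMP,4.1,2023Q4
36047,UNEMP,5.3,2023Q4
"""

rows = list(csv.DictReader(io.StringIO(raw)))
print(rows[0])
# {'area_code': '36061', 'indicator': 'UNEMP', 'value': '4.1', 'period': '2023Q4'}

# ...but nothing here tells a machine (or an LLM) that area_code is a county
# FIPS code, that UNEMP stands for an unemployment rate, that value is a
# percentage, or which other datasets these identifiers can be joined against.
# That context usually lives in a PDF data dictionary, a web page, or someone's head.
```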
The good news is that this issue is being tackled as we speak: last week, the US Department of Commerce released a "Request for Information: AI-Ready Open Government Data Assets".
"AI-ready" is a better moniker than "machine understandable" as it does not distinguish between humans and machines.
My hope is to soon see statements like:
the default state of new and modernized Government information resources shall be open and AI-ready
And I think that ontologies and knowledge graphs have a key role to play in making data AI-ready. The other good news is that the National Science Foundation is looking into this (NSF 23-571).
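As a sketch of what "AI-ready" could look like in practice (the vocabulary choices and values below are illustrative on my part, not a proposal from either agency), the same rows could be published alongside machine-understandable metadata, here expressed as JSON-LD using schema.org terms, so that the meaning, units, and identifiers travel with the data instead of living in a separate data dictionary.

```python
import json

# Hypothetical JSON-LD metadata accompanying the CSV sketch above, using
# schema.org vocabulary. The point is that column semantics (what the
# identifiers are, what the values measure, and in which unit) are stated
# explicitly and linked to shared ontologies, rather than implied by column names.
dataset = {
    "@context": "https://schema.org",
    "@type": "Dataset",
    "name": "County unemployment rates (illustrative)",
    "license": "https://creativecommons.org/publicdomain/zero/1.0/",
    "variableMeasured": [
        {
            "@type": "PropertyValue",
            "name": "area_code",
            "description": "US county, identified by its 5-digit FIPS code",
        },
        {
            "@type": "PropertyValue",
            "name": "value",
            "description": "Unemployment rate for the period",
            "unitText": "percent",
        },
    ],
}

print(json.dumps(dataset, indent=2))
```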
PS: I mentioned two initiatives from the US government. This reflects my ignorance of what is happening in other regions; I am sure other countries are doing the same, and some might even be further along. Comments are welcome.