Have We Run Out of Data for AI?
Back in December,?Ilya Sutskever, AI-guru and ex OpenAI co-founder, said that we have reached?"peak data."?Elon Musk echoed the sentiment in January, saying that AI has “gobbled up” all human-produced data to train itself.
We only have one Internet they said, implying that we ran out of human data.
These make for punchy headlines for all of us who follow the AI boom. But for enterprises, it could lead to a false complacency, or worse, to cause them to ignore what they are sitting on: the goldmine of their own proprietary enterprise data.
The Three Layers
Modern AI, particularly generative AI, is built on the interplay of three critical layers: Compute Infrastructure, Foundational Models, and Data.
The Power of Proprietary Data in AI
If you are a regular user of LLMs, you will know that their true magic happens when they’re fine-tuned or augmented with?your specific context. (let's put aside the scary part about privacy for now, which deserves many other posts on their own). The full power of LLMs is on display when you ask it questions, and it already has all the finer points of your situation in memory and can then couple that with its foundational knowledge to generate the specific right answer for you.
The same principle applies to enterprises:
AI delivers its full potential when it’s trained and continuously aligned with your organization's specifics, quirks, edge cases, and industry nuances.
Several emerging AI strategies make this possible:
These approaches ensure that your Enterprise AI doesn’t just rely on broad, generic knowledge—it integrates with your organization’s unique insights, producing tailored, high-value outputs.
The Goldmine of Enterprise Data
Your organization has proprietary data that no one else does.
领英推荐
Here are just a few categories:
This proprietary data is what makes you different and can help you win or lose in your market and industry.
When you hire top-tier consultancies like McKinsey, Bain or BCG - they come to you with their deep knowledge of industry and strategy. But they still need to immerse themselves in the specifics of your organization. That is why you have a discovery phase where they interview your key players for hours and turn every stone they can find.
Modern AI works the same way, it will need first to absorb and learn from all your proprietary data in order to deliver maximum value.
The Real Question: Is Your Enterprise AI- Data Ready?
So no, -we haven't not run out of data to train AI, we never will!
The more pressing question for enterprise leaders is:?What is your AI data readiness?
In all this AI frenzy and exuberance, data is more important than ever—especially proprietary data.
The organizations that succeed will be those that recognize the power of their data and invest in making it AI-ready. This means:
In the AI era,?every company is a data company.
So the real question is: What are you doing with your data?
Eric's Note:
I enjoy writing these posts and I want to warmly thank you for being one of my faithful readers. Want to read all my best posts, on less restrained subjects, without LinkedIn between us? Subscribe (absolutely free!) to my other Newsletter on Substack. Click here and See you there!
Founder & CEO @ Captova Technologies Inc | Intelligent Document Processing | captova.com
2 周Yes to that ??—the goldmine of their own proprietary enterprise data.
IT Technical Officer, Founder of GDG Antananarivo the first GDG in Madagascar, Design Thinking facilitator
2 周More fine-tuning !! Especially on neglected languages : African dialects, Malagasy, etc.
CEO @ SmartOne.ai | Smarter Data Services for Smarter AI
2 周ok, so there's a typo on the banner title ??
Senior Software Developer at Kracht GmbH, Werdohl, Germany
2 周When AI scavenges the internet … does it, or better can it, differentiate between Human generated content and AI generated content? hmmm … there are opportunities