Is Big Tech wrong to train AI models on 'messy' public data?

The rapid adoption of Artificial Intelligence (AI) across many parts of a business is complicating contractual agreements with customers and third parties, as well as mergers and acquisitions (M&A) transactions. Risks related to data ownership and licensing may not be addressed in existing contract reviews, legal reviews, and other due diligence activities, including:

  • Training Data: AI systems rely heavily on training data, which is often scraped or open-source material whose ownership and usage rights are unclear. Understanding and documenting training data flows is the first step.
  • Data Quality: AI data pollution occurs when the data used to train or operate AI models is flawed, incomplete, or biased, potentially leading to biased predictions, unreliable recommendations, and inaccurate insights.
  • Clarity of Ownership: Determining ownership of training data can be complex and uncertain. It might be subject to claims by third parties, infringement claims, privacy issues or other legal restrictions. This uncertainty could impact not only the use of training data, but also ownership of algorithms built using that data and any synthetic data created.
  • Use Limitations: If training data has use limitations, it can restrict how a company commercializes and licenses the data, develops technology, and applies algorithms.
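One practical way to act on the first step above — documenting training data flows — is to keep a structured provenance record per dataset. The sketch below is a minimal, hypothetical illustration in Python; the field names and the risk-flagging rule are assumptions for illustration, not any established standard.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class DatasetProvenance:
    """Hypothetical provenance record for one training dataset."""
    name: str
    source: str                    # where the data was obtained (URL, vendor, scrape)
    license: str                   # e.g. "CC-BY-4.0", "proprietary", "unknown"
    ownership_verified: bool       # has legal review confirmed ownership claims?
    use_restrictions: List[str] = field(default_factory=list)

    def has_open_risks(self) -> bool:
        # Flag datasets that need legal review before commercialization:
        # unverified ownership, unknown licensing, or any use restriction.
        return (not self.ownership_verified
                or self.license == "unknown"
                or bool(self.use_restrictions))
```

A record like `DatasetProvenance("web-crawl-2024", "public scrape", "unknown", False)` would be flagged by `has_open_risks()`, surfacing it for the due diligence activities described above.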

Synthetic Data

Relying on public data to train AI models exposes companies to significant copyright and privacy risks. Synthetic data is a promising alternative, though it can be limited in scope and accuracy when derived from insufficient original data. The push for AI models to handle large-scale data also creates operational challenges, including data freshness, regulatory pressure, and the need for real-time insights, all of which require robust, secure technology infrastructure to manage and mitigate these risks effectively.

Ali Golshan is CEO and cofounder of Gretel, a platform that lets companies experiment and build with synthetic data. Golshan says synthetic data is a safer, more private alternative to "messy" public data, and that it can shepherd most companies into the next era of generative AI development.
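To make the idea of synthetic data concrete, here is a deliberately simple sketch: generating synthetic numeric values that match the mean and standard deviation of a real column. This is a toy illustration only; production tools such as Gretel's platform use far richer generative models and add privacy guarantees, neither of which this sketch attempts.

```python
import random
import statistics

def synthesize_column(real_values, n, seed=0):
    """Generate n synthetic values mimicking the distribution of real_values.

    Toy approach: fit a Gaussian to the real column (mean and standard
    deviation) and sample from it. The synthetic values resemble the
    original statistically without copying any individual record.
    """
    mu = statistics.mean(real_values)
    sigma = statistics.stdev(real_values)
    rng = random.Random(seed)  # seeded for reproducibility
    return [rng.gauss(mu, sigma) for _ in range(n)]
```

Even this crude version shows the appeal: the output carries the shape of the original data, not the original records themselves, which is what reduces the privacy exposure described above.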


More articles by Mark Carey