登录查看更多内容

How AI Creates Synthetic Speech

Bernard Marr

?? Internationally Best-selling #Author?? #KeynoteSpeaker?? #Futurist?? #Business, #Tech & #Strategy Advisor

发布日期: 2021年11月10日

Having machines turn text into speech is nothing new.

Professor Stephen Hawking communicated with a computerized voice for many years, and by now, we're used to our GPS devices or smart speakers asking questions and responding to our queries.

What is different these days is that the quality of synthesized speech is improving, thanks to several companies using AI to create voice skins for enterprise companies and content creators that give more options for turning text into speech.

LOVO , an AI voice and synthetic speech startup company, uses a voiceover API to turn text into speech in real-time using 200+ human-like voices in 33 languages using their “voice library.” Users also can clone their own voices to create their own skins, simply by reading 15 minutes of a script.

LOVO recently announced the close of a $4.5 million pre-Series A round, led by South Korean Kakao Entertainment. See here my full conversation with Tom Lee, Co-founder, and COO of LOVO (including a demo)

What Is AI Speech Synthesis?

Speech synthesis is simply the computer-generated production of audible human words.

Traditional text-to-speech robotic voices you hear on software or hardware products like Amazon Echo, Google Home, your GPS, or your ebook reader are fast and cheap for companies to create, but they can also be unoriginal and unrealistic.

Artificial intelligence or AI voice operates a little differently. AI voice uses deep learning to create higher-quality synthetic speech that more accurately mimics the pitch, tone, and pace of a real human voice.

For example, if you wanted to use LOVO AI to generate synthetic text, you can upload a script that you want to turn into audio content. Then choose one of the voices in their library, based on language, style, and character. With a click of a button, LOVO turns your script into audio that sounds pretty lifelike.

You can also clone your own voice by reading a short script, and LOVO will generate a custom voice skin you can use over and over again for videos, audiobooks, or anything else that requires voiceover.

Here’s a side-by-side comparison of original voices and voice clones:

Will AI voice technology replace voiceover professionals? Tom Lee, Co-founder and COO of LOVO, says no.

Towards AI 1 个月前

Impact of AI in Public Speaking with Generative AI

Dr. Hemachandran K 9 个月前

Deep Dive into ASR Systems

Nitin Bhatnagar 6 个月前

“I believe that isn’t going to happen. If you think about how humans and how AIs work, we can complement each other. As a voice actor, you can only do 6 or 7 hours of work a day. You can't work 24/7, and you want to focus your energy on the most important gigs, or maybe you want to have a day job, and then you want your AI voice to make money while you sleep. You can record once with us, then take the revenue shares. One of our most famous voices is raking in a couple of grand a month without doing any work."

The Many Potential Uses of Synthetic Speech

AI voice has a myriad of use cases, including:

Translation:?Papercup is using AI voice to translate videos by generating voices that sound like the original speaker.

Video or audio ads: You can upload a script and create an ad without the added expense and time involved in hiring a voiceover artist. Descript has a collaborative audio/video editor that works just like a regular Word document.

E-learning (for kids, or for corporate training): Teachers and trainers will be able to make written materials more accessible for different types of learners with the help of AI voice automation.

Augmented reality and virtual reality: With the AR and VR markets exploding right now, there is a huge need for realistic, authentic human voices for apps and websites.

The global text to speech (TTS) market is estimated to reach $5.0 billion by 2026 , according to marketsandmarkets.com – so the sky's the limit for this exciting new technology.

To find out more about the latest trends in AI and machine learning, check out the rest of my website or subscribe to my YouTube channel .

Thank you for reading my post.?Here?at LinkedIn ?and at?Forbes ?I regularly write about management and technology trends. To read my future?posts simply?join my network here ?or click 'Follow'. Also feel free to connect with me via?Twitter ,??Facebook ,?Instagram ,?Slideshare ?or?YouTube .

About Bernard Marr

Bernard Marr ?is a world-renowned futurist, influencer and thought leader in the field of business and technology. He is the author of 18 best-selling books, writes a regular column for Forbes and advises and coaches many of the world’s best-known organisations. He has over 2 million social media followers and was ranked by LinkedIn as one of the top 5 business influencers in the world and the No 1 influencer in the UK.

AI & Future Tech Trends

824,517 位关注者

Akram Khan (MA KAN)

Sr. Operations Admin_FedEx

3 年

thanks FOR SHARING!

Muddasar Khan

Lecturer & Full Stack Web Developer (MERN,Laravel & WordPress themes and plugins Development)

3 年

LOVO turns your script into audio that sounds pretty lifelike... it was the main sentence of ur articles.. can u share training algo of the same product

Raymond L. Newkirk, Psy.D., Ph.D., Ph.D.

Entrepreneur ~ Educator ~ Executive ~ Consulting Specialist ~ Author ~ Executive Coach ~ Speaker ~ Presenter ~ Podcaster of "All Things Intriguing" ~ Founder of Systems Management Institute

3 年

I was the COO of the leading Speech Synthesis company in the country back in 1999. WE won "Best of Comdex" two years consecutively. No one much cared back then. Timing matters. Now people are noticing it. Ray Newkirk

Mumbie Fredson-cole

Professional Engineer & Risk Manager

3 年

I am fascinated by the number of areas to which artificial intelligence can be applied. It seems like there is no end to the number of human activities that it can be used to replace. I often wonder however what effect will AI have on the quality of life in our communities. The significant growth of technology has not been accompanied by a proportional decrease in poverty or homelessness. Are there applications of AI that are directly targeting these very important human needs?

1 次回应

查看更多评论

要查看或添加评论，请登录

Bernard Marr的更多文章

Why You Should Be Polite To ChatGPT And Other AIs

2024年11月25日

Why You Should Be Polite To ChatGPT And Other AIs

Thank you for reading my latest article Why You Should Be Polite To ChatGPT And Other AIs. Here at LinkedIn and at…

87 条评论
The 7 Revolutionary Cloud Computing Trends That Will Define Business Success In 2025

2024年11月24日

The 7 Revolutionary Cloud Computing Trends That Will Define Business Success In 2025

Thank you for reading my latest article The 7 Revolutionary Cloud Computing Trends That Will Define Business Success In…

24 条评论
AI And The Global Economy: A Double-Edged Sword That Could Trigger Market Meltdowns

2024年11月22日

AI And The Global Economy: A Double-Edged Sword That Could Trigger Market Meltdowns

Thank you for reading my latest article AI And The Global Economy: A Double-Edged Sword That Could Trigger Market…

26 条评论
Why Artificial Superintelligence Could Be Humanity's Final Invention

2024年11月20日

Why Artificial Superintelligence Could Be Humanity's Final Invention

Thank you for reading my latest article Why Artificial Superintelligence Could Be Humanity's Final Invention. Here at…

44 条评论
The 10 Most Powerful Data Trends That Will Transform Business In 2025

2024年11月18日

The 10 Most Powerful Data Trends That Will Transform Business In 2025

Thank you for reading my latest article The 10 Most Powerful Data Trends That Will Transform Business In 2025. Here at…

46 条评论
The Future Of Retail: 10 Game-Changing Trends That Will Define 2025

2024年11月17日

The Future Of Retail: 10 Game-Changing Trends That Will Define 2025

Thank you for reading my latest article The Future Of Retail: 10 Game-Changing Trends That Will Define 2025. Here at…

28 条评论
The Best Smartwatches In 2025: From AI Health Tracking To Adventure-Ready Timepieces

2024年11月15日

The Best Smartwatches In 2025: From AI Health Tracking To Adventure-Ready Timepieces

Thank you for reading my latest article The Best Smartwatches In 2025: From AI Health Tracking To Adventure-Ready…

23 条评论
The Future Of Corporate Learning And Employee Engagement: Why Traditional Training Is Dead

2024年11月13日

The Future Of Corporate Learning And Employee Engagement: Why Traditional Training Is Dead

Thank you for reading my latest article The Future Of Corporate Learning And Employee Engagement: Why Traditional…

28 条评论
4 AI-Powered Strategies For Your Ultimate Job Search

2024年11月11日

4 AI-Powered Strategies For Your Ultimate Job Search

Thank you for reading my latest article 4 AI-Powered Strategies For Your Ultimate Job Search. Here at LinkedIn and at…

30 条评论
The Impact Of Microsoft's New AI Employees On Your Job

2024年11月10日

The Impact Of Microsoft's New AI Employees On Your Job

Thank you for reading my latest article The Impact Of Microsoft's New AI Employees On Your Job. Here at LinkedIn and at…

43 条评论

See all articles

How AI Creates Synthetic Speech

Bernard Marr

?? Internationally Best-selling #Author?? #KeynoteSpeaker?? #Futurist?? #Business, #Tech & #Strategy Advisor

What Is AI Speech Synthesis?

领英推荐

The Many Potential Uses of Synthetic Speech

AI & Future Tech Trends

824,517 位关注者

Bernard Marr的更多文章

社区洞察

其他会员也浏览了

The Timeless Power of the Written Word: Enhancing AI with Transcription

AI Voice & Speech Generation - Latest Breakthroughs

Text to Speech vs. Speech to Text: What’s the difference?

Embracing Emotional Intelligence in the Age of AI.

And you thought Generative AI was slowing down? Latest news that say otherwise!

Introducing the Vulavula API: here’s a comprehensive overview of its features

Top Use Cases of Speech-to-text API

See how I materialize language with?AI ...

?AI, language, and me ...

Two Phase Modality Fusion

What Is AI Speech Synthesis?

领英推荐

The Many Potential Uses of Synthetic Speech

AI & Future Tech Trends

824,517 位关注者

Bernard Marr的更多文章

Why You Should Be Polite To ChatGPT And Other AIs

The 7 Revolutionary Cloud Computing Trends That Will Define Business Success In 2025

AI And The Global Economy: A Double-Edged Sword That Could Trigger Market Meltdowns

Why Artificial Superintelligence Could Be Humanity's Final Invention

The 10 Most Powerful Data Trends That Will Transform Business In 2025

The Future Of Retail: 10 Game-Changing Trends That Will Define 2025

The Best Smartwatches In 2025: From AI Health Tracking To Adventure-Ready Timepieces

The Future Of Corporate Learning And Employee Engagement: Why Traditional Training Is Dead

4 AI-Powered Strategies For Your Ultimate Job Search

The Impact Of Microsoft's New AI Employees On Your Job

社区洞察

其他会员也浏览了

The Timeless Power of the Written Word: Enhancing AI with Transcription

AI Voice & Speech Generation - Latest Breakthroughs

Text to Speech vs. Speech to Text: What’s the difference?

Embracing Emotional Intelligence in the Age of AI.

And you thought Generative AI was slowing down? Latest news that say otherwise!

Introducing the Vulavula API: here’s a comprehensive overview of its features

Top Use Cases of Speech-to-text API

See how I materialize language with?AI ...

?AI, language, and me ...

Two Phase Modality Fusion