The Voices in the Machine

The Voices in the Machine

Last year I narrated a fascinating book for MIT Press that explores various applications of machine learning in creative work,?The Artist in the Machine: Inside the World of AI-Powered Creativity, by Arthur I. Miller. The author discusses the nature of creativity and how we know it when we see it, but then goes on to give numerous examples of how computers are now composing original music, generating expressive visual art, even writing surprisingly interesting poetry, prose, and plays.

One thing that wasn’t included in Miller’s book was the fact that machine learning and AI are now being used to create synthetic models of the human voice, too.?Incredibly good ones.

How good? Good enough that yours truly completely failed to select the human voice during recent demonstrations of technology from VocaliD and Scribe Audio. And these ears have been trained for the past fifteen years to pick up on all sorts of incredibly subtle nuances of speech!

Unlike previous generations of text-to-speech that were merely reassembling little recorded slices from a large bank of various phonemes to form words and phrases, this new technology is actually generating audio from scratch. Essentially, it does this by analyzing many hours of recording and creating an intricate set of rules regarding all of the little nuances of a person’s voice tone and texture, patterns of speech, and emotional voice modulation.

Some of the potential pitfalls of this sort of technology have come up in the news recently thanks to how it was (mis)used to?deepfake Anthony Bourdain posthumously?and to narrate many thousands of TikTok videos in my pal?Bev Standing’s voice?without her knowledge.

These two high profile cases are just the beginning, however. The quality and availability of such technology is increasing rapidly. It will not only radically alter the voiceover industry that I know and love but it will also pose some unique challenges for us socially, politically, and psychologically.?The comfort and assurance we get from voices we know and trust are wired deeply into our psyche- what happens when such voices can be operated by another?

In order to help explore this wild frontier and advocate for the interests of our fellow voice talent, several of my colleagues and I are currently part of a study group within the?Open Voice Network. OVON is an arm of the Linux Foundation “dedicated to the communal development and adoption of industry standards and usage guidelines, development, and documentation of voice-centric value propositions, and education and advocacy initiatives.” We are helping to produce a white paper that will help inform policy makers and executives alike of the?myriad impacts of synthetic voice to consider as they decide how to implement and regulate the use of these technologies.?

I have plenty more thoughts but that’s probably enough for now. This may be the first time you’ve heard me speak on this subject, but I’m quite sure it won’t be the last!

Gary "Max" McGill

SEASONED VO ACTOR - AI AND ENTERTAINMENT MEDIA CONSULTANT -2023 RECIPIENT OF DEPARTMENT OF DEFENSE MEDAL FOR DISTINGUISHED PUBLIC SERVICE

3 年

Very interesting article. VocaliD is by far, in my opinion, the best AI voice producer in the world.

Thanks for the the shout out Adam Lofbomm. We are thankful for the relationships we have been building in the voice talent industry and believe if we all work together in its design (AI voice)... this technology is going to be a huge asset for the industry and of great benefit to many!

Claire Fry

Helping leaders sound like leaders.

3 年

This is great, Adam! My sense is that many voice actors worry that the AI voice revolution is a tidal wave that will sweep us all away, but of course there are ethical and intentional ways to harness this technology to the benefit of all. Can't wait to read the white paper when it comes out!

Thanks Adam, it's been a pleasure working with you and the Open Voice Network as we build a safer future for voice.

Thank you for including us in this list, Adam. We are thrilled about our recent launch of our MARVEL.ai solution and look forward to seeing what content creators make through synthetic voice!

要查看或添加评论,请登录

Adam Lofbomm的更多文章

  • AI vs. Bigger, Better Stories of Us

    AI vs. Bigger, Better Stories of Us

    Like many of us, AI has been on my mind a lot lately. For the past two years, in fact, I’ve been serving as a member of…

  • Wow Thanks Yay No. 24

    Wow Thanks Yay No. 24

    In today’s edition: an island of safety in the storm, the presents of the past, and a new hybrid chapter for Alfa WOW…

  • In a sense, you are the star.

    In a sense, you are the star.

    While we were passing through NYC during our whirlwind visit to the States, I was so grateful to come across a magical…

  • ? SES: What's Your Story?

    ? SES: What's Your Story?

    READ: On Thursday night my dad took me to an event that his church put on to help promote the sort of healthy men's…

    1 条评论
  • ? SES: Passion Projects, Discarded Treasures & Jaaring Sounds

    ? SES: Passion Projects, Discarded Treasures & Jaaring Sounds

    READ: I got called out a few times recently by a several of my soul brothers and sisters. Although they came at it from…

    1 条评论
  • Getting Together and Getting Real

    Getting Together and Getting Real

    Last weekend I was extremely fortunate to land a spot at FaffCon 9, a very special and unique gathering of 150 of my…

社区洞察

其他会员也浏览了