The Voices in the Machine
Last year I narrated a fascinating book for MIT Press that explores various applications of machine learning in creative work, The Artist in the Machine: Inside the World of AI-Powered Creativity, by Arthur I. Miller. The author discusses the nature of creativity and how we know it when we see it, but then goes on to give numerous examples of how computers are now composing original music, generating expressive visual art, even writing surprisingly interesting poetry, prose, and plays.
One thing that wasn’t included in Miller’s book was the fact that machine learning and AI are now being used to create synthetic models of the human voice, too. Incredibly good ones.
How good? Good enough that yours truly completely failed to select the human voice during recent demonstrations of technology from VocaliD and Scribe Audio. And these ears have been trained for the past fifteen years to pick up on all sorts of incredibly subtle nuances of speech!
Unlike previous generations of text-to-speech that were merely reassembling little recorded slices from a large bank of various phonemes to form words and phrases, this new technology is actually generating audio from scratch. Essentially, it does this by analyzing many hours of recording and creating an intricate set of rules regarding all of the little nuances of a person’s voice tone and texture, patterns of speech, and emotional voice modulation.
Some of the potential pitfalls of this sort of technology have come up in the news recently thanks to how it was (mis)used to deepfake Anthony Bourdain posthumously and to narrate many thousands of TikTok videos in my pal Bev Standing’s voice without her knowledge.
These two high-profile cases are just the beginning, however. The quality and availability of such technology are increasing rapidly. It will not only radically alter the voiceover industry that I know and love, but also pose some unique challenges for us socially, politically, and psychologically. The comfort and assurance we get from voices we know and trust are wired deeply into our psyche. What happens when such voices can be operated by another?
To help explore this wild frontier and advocate for the interests of our fellow voice talent, several of my colleagues and I are currently part of a study group within the Open Voice Network. OVON is an arm of the Linux Foundation “dedicated to the communal development and adoption of industry standards and usage guidelines, development, and documentation of voice-centric value propositions, and education and advocacy initiatives.” We are helping to produce a white paper that will inform policymakers and executives alike of the myriad impacts of synthetic voice they should consider as they decide how to implement and regulate these technologies.
I have plenty more thoughts but that’s probably enough for now. This may be the first time you’ve heard me speak on this subject, but I’m quite sure it won’t be the last!
Very interesting article. VocaliD is by far, in my opinion, the best AI voice producer in the world.
Thanks for the shout-out, Adam Lofbomm. We are thankful for the relationships we have been building in the voice talent industry and believe that if we all work together on the design of AI voice, this technology is going to be a huge asset for the industry and of great benefit to many!
This is great, Adam! My sense is that many voice actors worry that the AI voice revolution is a tidal wave that will sweep us all away, but of course there are ethical and intentional ways to harness this technology to the benefit of all. Can't wait to read the white paper when it comes out!
Thanks Adam, it's been a pleasure working with you and the Open Voice Network as we build a safer future for voice.
Thank you for including us in this list, Adam. We are thrilled about our recent launch of our MARVEL.ai solution and look forward to seeing what content creators make through synthetic voice!