A Steady Wish & Consistent Whisper
Bill Kirst
Leading Change in the Era of AI | Storyteller | Poet | Adobe | Podcast Host - "Coffee & Change" | ex-Microsoft, IBM
Your Voice, Their Reign: The Rise of Algorithmic Autocracy in Podcasting
There is an untouchable intimacy to podcasting – an auditory embrace of sorts that can transform someone’s outlook. I recently heard a quote by C.S. Lewis that spoke to how consistency brings about intimacy. Like a “steady wish,” consistency creates intimacy. And that is all I have ever wanted for the platform I created eight and a half years ago.
“…a steady wish for the loved person's ultimate good as far as it can be obtained.” – C.S. Lewis
For each person who listened, and every guest that shared their story, I want nothing more than the ultimate good as far as it can be obtained. And that path is only reached with each step in vulnerability and every breath of humanity. The same breath that forms syllables, sounds and a safe haven for storytelling.
It is hard to come close to the feeling of having a personal conversation with a guest, as a trusted host. For eight years I have had the honor of doing this work, and still, each time, it brings me a sense of joy and healing. And perhaps that is why I grow wary of what lies ahead for this medium…measured in moments, not machines.
I fear podcasting is at risk of becoming a simulated experience. As shared in a previous article, I believe anything worth broadcasting should be required to have a heartbeat. But we are seeing humanity lose this arms race. We won’t realize what we have lost until it is too late.
Advanced AI technology is enabling companies like Riverside.fm to entice podcasters with the convenience of voice cloning. While this might seem like a dream come true for busy creators, it's a deal many may come to regret making. I think it is pivotal that we creators examine the implications of surrendering our voices to the so-called "algorithmic autocrats."
Voice Cloning Chokes Our Consciousness
Voice cloning's roots go back decades, stemming from research and development into early text-to-speech (TTS) systems, where rudimentary machines produced robotic and unnatural voices. However, with the advent of deep learning and neural networks, AI models became adept at analyzing and mimicking the nuances of human speech. And when nuance may be our only remaining distinguishing characteristic, we shouldn’t give it over so easily.
As we plunge into what is proving to be the most unpredictable season of political implication in our generation, voice cloning software is increasingly being used, possibly for nefarious reasons. The technologies create convincing replicas from just a few minutes of audio. And while this technology has found benevolent applications in various fields, including accessibility tools for individuals with speech impairments, it is being forced on those who may be the last bastion of the imperfect and essential human voice…podcasters.
The Techno Siren’s Song is Efficiency
Leading platforms for remote podcast recording have integrated AI voice cloning tools into new workflows. With Magic AI editors, promises are made to clean up any audio, with injected crispness and clarity. Sadly, the result is warbled and dissonant, leaving the listener craving more natural breath patterns and a sense of real diction. And podcasters using these platforms can now use their cloned voices for repetitive tasks like introductions, advertisements, or even entire episodes based on scripts. The appeal appears to be undeniable, and is advertised as such in their marketing materials.
On the surface, this appears to be a win-win scenario. But if we are not careful, we are likely to cross a threshold into a world where things no longer make sense. Where we lose our senses.
Perhaps this is a world where podcasters are promised the morsel of saved time with fewer resources, and listeners are told to enjoy a polished and consistent listening experience. But beneath the shiny veneer lies a darker truth. Because if consistency begets intimacy, I don’t think this is the consistency we are all seeking in our lives in order to feel a more human connection.
Woeful Words of Woland: Faustian Bargains Before Us
How do we confront the challenges to our human expression in a society so seductively swayed by efficiency, time savings and promised perfection? I am reminded of an expression from a malevolent protagonist in a famous Russian novel, The Master and Margarita. It translates loosely to “Manuscripts don’t burn.”
The line speaks directly to the enduring power of human spirit inoculated by creativity and resilience, audible through the human voice. Even when faced with the threat of destruction and adversity, the essence of human expression and experience cannot be erased. In the context of AI, it serves as a reminder that the value of human creativity lies in its audible authenticity and the truthful struggles and emotions and syllables and breaths that shape it.
A machine cannot replicate the depth and complexity of human experiences. And it certainly can’t quiver to find words through heartbreak. If we hand over our voices to AI algorithms, are we inadvertently entering into a Faustian bargain? We sacrifice our individuality and authenticity for the allure of efficiency. Then what? Our voices, once unique and irreplaceable, become mere data points for algorithmic autocrats to replicate, manipulate and monetize.
Advocate for Authenticity…Always
The beauty of podcasting lies in its raw and unfiltered nature, and its almost punk-like, rebellious beginnings. It never aspired to be anything more than a place for honoring the authentic voice, truth unscripted in all its glory. It's the unscripted laughter, the heartfelt pauses, the occasional tears, and the subtle inflections in a host's voice and a guest’s story that forge a genuine connection with listeners. And AI-generated voices, while technically impressive, will consistently lack the emotional depth and human touch that define great podcasting, no matter which aspect of the work we turn over to them.
Listeners tune in because they want to hear you, your story – your unique perspective, your experiences, your passion, your truth. When any voice is replaced by an algorithm, that connection is severed. It's the difference between a heartfelt conversation and a sterile serving of 1s and 0s.
Phaethon of Podcasting
If we are not careful and critical of our use of technology, we stand to end up as modern-day Phaethons, driving our chariots too close to the sun. We will lose control and scorch the Earth, pulling all the water from her rivers to cool our data centers. And for what? Something inhuman?
Remember, once your voice is digitized, you relinquish control over how it's used. The algorithmic autocrats can alter your tone, inflections, or even the words you say. Your voice could be used to endorse products you don't believe in, spread misinformation, or even create deepfakes that tarnish your reputation. This is not hyperbole; this is happening to people as you read this very article.
A Race to the Replica
AI voice cloning is being made to seem like a harmless time-saver, but it is leading to a devaluation of the entire podcasting industry. We can already hear it in content on YouTube, where the visuals, the transcript, and the audio reading of the transcript are all created by machines. I can tell in about three seconds, and I turn away. When listeners realize that the voices they're hearing are not authentic, trust evaporates, and so does the idea of compelling content.
Embracing the Imperfect
I am here to encourage podcasters to resist the urge to surrender their voices. Your voice is your most powerful tool for expression, connection, and compassion. It is a tool for our embodied humanity, so beautifully shaped by our flaws, and our gifts of imperfection…in diction and dedication.
So, celebrate the imperfections, the unscripted moments, and the raw emotions that make podcasting consistently compelling. And consistently intimate. That is my “steady wish” for the wavemakers out there as we find our way around here. Embrace the human element and reject the soulless efficiency of algorithmic addiction. Because, in a world increasingly dominated by AI, leading change means using our voices more than ever. Because they are more valuable than ever.
Career Development Manager at Microsoft Military Affairs, Empowering Veterans to Achieve Their Full Potential | Veteran | IT Manager | Relationship Builder
4 months ago
Great insight -- I'm sure that the same feelings existed from the advent of radio and television. Embracing technology without surrendering control is key. It's similar to Orwell's "1984" about manipulation of truth and voice, but I'm glad to see most people/companies working to ensure AI tools serve to enhance, not replace, our human essence because integrity in voice and podcasting should be preserved so creators maintain control over their own voices.