#103 - The Real Time Revolution


Imagine your glasses whispering the name of every stranger you pass, or having a real-time conversation with your browser to get your daily dose of news while you're on your way to work, or that article you're working on coming to life, eager to chat about its contents. It's closer than you think. It's been about a week since OpenAI announced its real-time voice API, and Meta unveiled its latest Ray-Ban AR glasses with real-time visual analysis. Since then, developers and hackers have been pushing the boundaries of what's possible.

* Voice-controlled web browsing lets you chat with any website.

* Voice chat for PDFs lets you argue with documents.

* A voice-controlled painting app combines your inner Hemingway with your inner Picasso.

* And of course, someone tried to couple animated AI companions with real-time voices.

These early experiments point to something more substantial: a shift away from clicks and linear delivery, and with it, away from the business models that depend on them. However, this shift won't happen overnight. Real-time audio output tokens are priced at $200 per 1 million tokens, roughly translating to $0.15 per minute of conversation for businesses. At scale, this could quickly burn a hole in any startup's budget, let alone consumers'.
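To make "at scale" a bit more tangible, here is a back-of-the-envelope sketch in Python. It only uses the two figures quoted above ($200 per 1 million audio output tokens and roughly $0.15 per minute); the user counts and minutes per day are purely hypothetical inputs, and the function names are made up for illustration. It ignores input tokens, text tokens and any discounts, so treat it as a rough order of magnitude rather than a billing model.

```python
# Figures quoted in the text; everything else is a hypothetical input.
PRICE_PER_MILLION_OUTPUT_TOKENS = 200.00  # USD per 1M audio output tokens
COST_PER_MINUTE = 0.15                    # USD, the rough per-minute estimate


def implied_tokens_per_minute() -> float:
    """Audio output tokens per minute implied by the two quoted figures."""
    price_per_token = PRICE_PER_MILLION_OUTPUT_TOKENS / 1_000_000
    return COST_PER_MINUTE / price_per_token


def monthly_cost(users: int, minutes_per_user_per_day: float, days: int = 30) -> float:
    """Estimated monthly spend in USD, rounded to cents."""
    total_minutes = users * minutes_per_user_per_day * days
    return round(total_minutes * COST_PER_MINUTE, 2)


if __name__ == "__main__":
    print(f"Implied tokens per minute: {implied_tokens_per_minute():.0f}")
    # Hypothetical scenario: 10,000 users talking 5 minutes a day.
    print(f"Monthly cost: ${monthly_cost(10_000, 5):,.2f}")
```

Under these assumptions, 10,000 users chatting five minutes a day would add up to about $225,000 a month in voice output alone.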

A more fundamental reason for caution is the clear and present privacy risk. Two Harvard students have already demonstrated that by modifying Meta's glasses, they could identify strangers in real time. And Evan Ratliff's magnificent, comic but unsettling "Shell Game" podcast shows how easy it is to fool people with cloned voices in real-time phone conversations.

One thing is clear: real-time AI is here - and so are its real-life challenges.

AI

The internet is getting flooded with AI-made stuff. Try searching for "baby peacock" on Google - you'll see tons of images that AI created. While this specific search might be uncommon, it highlights a growing challenge in distinguishing AI-created content from human-made work. To address this issue, Adobe has launched a free web app that helps creators protect their content and ensure proper attribution.

A recent study reveals that people tend to distrust content labeled as "AI-generated", even when it's accurate and vetted, leading to decreased sharing of such material.

Short:

  • Cove.ai & Miro introduce new AI tools that blend mind mapping, brainstorming, and chatbot functionality. Link & Link
  • AI detectors can be unreliable. One detector, claiming to be the most advanced on the market, assessed the US Declaration of Independence as 97% AI-generated. Link
  • Nick Diakopoulos analyzes 872 AI future scenarios. Link
  • And two models to start with AI in your organisation: the lab & the crowd. Link

NEWS

OpenAI has secured another content partnership, this time with Hearst, one of the largest newspaper and magazine holding groups in the US. Meanwhile, OpenAI's head of media partnerships has stated the company isn't planning to share ad revenue from SearchGPT with publishers.

Related: Researcher Felix Simon argues for more clarity on AI companies' deals with news publishers.

Short:

  • A new Pew Research Center study reveals that U.S. adults are turning to TikTok for news, but they're not getting it from the accounts of traditional media outlets or journalists. Link & Link
  • BR's AI Lab has developed a tool to verify if the content of AI-generated summaries matches the original text. Link
  • Related: Six tips for keeping AI hallucinations at bay. Link

VIDEO/AUDIO

ITV has introduced a generative AI-powered ad production service aimed at small and medium-sized enterprises. The idea is to democratize access to high-quality video advertising.

The Economist did a write-up on YouTube's self-taught filmmakers and how they are challenging the traditional television industry & streaming giants like Netflix and Disney.

The French data institute INA, which receives data from 184 TV and radio channels, has analyzed 700,000 hours of content with AI. The new data visualisations are worth exploring!

Short:

  • Hailuo AI, by Chinese startup MiniMax, has launched an Image-to-Video feature online. Link
  • A growing number of spammers are submitting fake, NotebookLM-generated podcasts to podcast platforms. Link
  • Audacy releases its State of Audio: The Trends Report, exploring shifts in creativity, AI, measurement, and audience activation. Link
  • Storyrabbit, a new app, lets users hear AI-generated stories based on their map location or current position. Link

SHORT

READ

Bain & Co released its Technology Report 2024, offering insights into the latest tech industry trends with the main focus on AI - what else.

A new Media Viability Manifesto provides a common framework for action for global media development.

404media - the best independent voice in critical AI coverage - reports on the rise of AI in job applications and hiring processes, with bots applying to thousands of jobs at the same time.

And my find of this week! How to uncover data journalism stories that are hidden in complexity: three examples of using AI to find stories in messy data.


Thank you for reading! Wayfinder is made to travel - feel free to share it with friends & colleagues.

