NotebookLM - Early Thoughts

NotebookLM - Early Thoughts

First impressions of Google's NotebookLM+AutoPodcast ( notebooklm.google.com ):?

  • The Notebook part: works well. Great info management tool. Feels inevitable.
  • The auto-generated podcast part: jaw-dropping, but … addictive, useful, and awful.


I based my initial exploration on 3 different sets of data:

  1. A big set of my own messy jottings about strategy, capabilities, etc
  2. A maths paper that is way beyond my comprehension
  3. A large collection of jokes from a variety of sources

The Notebook part

The basic idea of the NotebookLM is great, and works well. Feels like a good, sensible, useful, and inevitable use of LLMs. You get accustomed to the ask, pause, response approach. Each response can persist as a note in the set.

I can imagine a power user running with this. The initial set of “you could do XYZ” buttons nudges the user towards its strengths in info management and analysis. I favour the taxonomy route, so I would want a hot key for that.?

The UI is a bit confusing. e.g., I keep losing the podcast button. But experimental, so totally fine.?

I was able to interrogate the datasets, and its comprehension was generally good.?

By dataset

  1. Messy notes. Added useful structure and overviews to the messy strategy notes. Was the best on this.
  2. Hard maths. Provided an ‘in’ to the complicated maths paper. The generated analogies were helpful, and it did not matter that a real mathematician would have laughed (and did) at their wrongness. They helped me.
  3. Messy jokes. It needed help with the jokes. The inputs were messy, and included some jokes labelled as coming from the Edinburgh Fringe, but the system seemed to assume the jokes were all from there. Via the chat dialog, I was able to phrase questions that corrected this misunderstanding, but it needed re-correcting. There seemed to be no way to amend high level misunderstandings like this. It was very judgemental about the jokes, combined with a lack of understanding of the nuance. And sometimes simply wrong about interpretations. But it was able to create a useful taxonomy and classify the jokes according to that taxonomy, so not useless. Except, on closer inspection, while the taxonomy seemed ok, the classification of jokes into that taxonomy had numerous mistakes. Hm. Needs more digging.

Questions / Thoughts?

  • Will be interesting to see if the generated notes themselves start distorting its view of the original content.
  • I wonder how this will scale up to having 100s or 1000s of notes in a set. Note management will presumably become an issue.
  • The length of pause waiting for a response to a meaty question is significant. Will that become increasingly frustrating? Perhaps ensuring the GUI allows the user to be getting on with something else in the meanwhile will be sufficient.
  • Need to be wary of high-level misunderstandings of the data that distort all subsequent interpretations of it. I would guess the system could benefit from some user-supplied context before it takes in the various sources. Or, assume there would be two steps to working on a new set of data: 1. read in the data, generate an overview, 2. User sense-checks the overview, corrects the context, go back to 1.
  • The “create a taxonomy, then classify all the pieces according to the taxonomy” seems like a generally useful step.
  • I may have missed it in the UI, but I couldn’t find the fallback option of a simple keyword search.

Overall, big thumbs up. Some questions about the reliability of classifying large datasets using a taxonomy, and correcting high level misunderstandings. NotebookLM will keep getting better, faster, more useful.

The auto-generated podcast part

Jaw dropping. After a few minutes of the spinny thing, it was expressing “concentrated essence of smug podcasters”. If you overheard little snippets, there is no way to tell this is not real. And when you listen more closely, they are talking about your info. Jaw drops again.

But when you listen more closely still, there are cracks. Again, experimental, so totally fine. But I noticed the following:

  • The speaking heads seem to alternate being The One Who Understands The Info, which can feel odd, and increasingly fake/glitchy.
  • The relentless positivity and smugness is nauseating - face-punchingly annoying in fact. Feels like a satire. Cannot listen to this for long. I guess being a non-American that may be more on me than them, but FFS, tone it down a bit.

By dataset

I had very different experiences with the podcasts for each of the 3 datasets:

  1. Messy notes: This was scary. Properly worrying. Confirmation bias on steroids. My own notes being discussed and agreed with enthusiastically, with nuances and phrasing that did not come directly from me, but oh how very much I agreed with. I was “yes, yes, at last, I’m right, ha, see, they love it”. I could listen to people agreeing with me all day. I felt like I was on a slippery slope into some psychological issues. Thoroughly and instantly addictive. Specific benefits included hearing my own points rephrased into a better, more relatable form, so I’ll take those. But, yes. This needs care, psychologically-speaking.?
  2. Hard maths: This was fascinating and useful. Whilst being wrong about some of the nuances in the maths paper (so a disgruntled expert informs me), the overall approach did help me wrap my head around the paper. I can now begin to ask less stupid questions. It means subsequent explanations have a better chance to stick. The podcast pacing helped with this. So, a net positive.?
  3. Messy jokes: Oh dear. Fell apart on this one. The podcast personas came across as pompous, judgemental, wrong-headed ignoramuses. They just did not get it, but were going to smug their way through 10mins of talking anyway. Their opinionated lack of understanding was painful. When a chat response indicates a misunderstanding, it is fine. You can restate the question, perhaps re-structure the data, change the context, etc, and ask again. But the podcast keeps on trucking. Felt like a brutal satire of polished podcasting ignorance. Speaking for the sake of speaking. Funny though.

Questions / Thoughts

  • I wonder if it will become easy (and annoying) to spot re-use of mannerisms in the podcast speech. TBF, it is anyway with normal human podcasters...
  • The option of non-American accents and mannerisms would be nice. Might help me be less judgemental ;-)
  • I can imagine a dashboard of multiple sliders to configure the podcast personas and their respective ‘understanding’ of the data.?
  • I wonder what would happen if one of the personas was explicitly contrarian. Or a positive persona who does not fully understand the data debating with a contrarian who really does.
  • Make it easy (perhaps the default) to have a sceptical persona.
  • Make it easy to configure the podcast with a slider from objective to subjective.
  • Being able to ‘tune’ the podcast in these subtle but profound ways would be fascinating.
  • When the basic NotebookLM gets an answer ‘wrong’, it does not matter so much. Just rephrase the question and try again. But the podcast being ‘wrong’ was a horror show. It takes the 'wrong' to a whole new level.
  • The addictive aspect was properly worrying.

Overall, wow. But…

Jason Alafgani

Jellypod - Customizable AI Podcasts

4 个月

if you want to get people disagreeing, less annoying (or more annoying), and other controls over voice, structure, content, etc, check out Jellypod

回复
Ian Griffin

Publisher at Booch News

4 个月

I agree about the American accents. And are *all* of the NotebookLM podcasts going to feature Bob & Sally (as I named them)? https://www.boochnews.com/2024/10/15/a-i-bots-discuss-the-science-of-kombucha/

回复
Juan Felipe Márquez Soto

Communications and Marketing | Digital Marketing

4 个月

Hey Chris, I believe you hit the nail on the head with this observation: "Confirmation bias on steroids. My own notes being discussed and agreed with enthusiastically, with nuances and phrasing that did not come directly from me, but oh how very much I agreed with. I was “yes, yes, at last, I’m right, ha, see, they love it”. I could listen to people agreeing with me all day. I felt like I was on a slippery slope into some psychological issues. Thoroughly and instantly addictive. Specific benefits included hearing my own points rephrased into a better, more relatable form, so I’ll take those. But, yes. This needs care, psychologically-speaking." You've said it better than the podcasters, gemini and myself! Very addicting! I thought I was the only one feeling this. I've been having this conversation with myself over how much I enjoyed to hear my points being talked about by third parties. And my brain/ego, didn't seem to mind too much the fact that it was all AI. Looking forward to more of your observations into NotebookLM. Cheers, Juan

Hari Om Vashishtha

3MistakesOfMyLife.in (MVP) #ATributeToTheMentors #NothingElseMatterz.com

4 个月

Wait a second Chris, I beg to differ. You seem to have missed looking at the possibility where someone curiously simplifies your thoughts better than yourself. The virtual AI beings talked about My 11-page document and explained what I was thinking better than myself. Here's my specific context as a sample: https://youtu.be/Ohr1i_dhT4Q?si=b6E_hUUKE5uVrHsN On the top of that, in my recent observation, realisation and need that I felt, I even want to talk to those virtual folks to find the gaps in my own thought processes. Possibly, it's already in their pipeline OR maybe, is that option already there? I will even try to submit that as a wish to the team behind it. If someone can connect me to the team lead, I'd be happy to be the test subject on my way to... try to do my thinking, learning, creating, brainstorming, & note-taking.

Harold Mansfield

IT & AI support that grows your business without breaking the bank.

4 个月

I had the same critiques of the curated podcast feature. My use case is as an AI discussion of my blog posts. I also wanted more control over voices so mine doesn't sound like everyone elses, and which points they focused on. And since it's for my own website wanted to add some promotion at the end...I could not get NotebookLM to do this. So I went down a few rabbit holes (I posted about it today, but won't link to it so I don't come off as spammy). Descript is great for separating the speakers and out putting a transcript of the conversation. With that transcript you can use 11Labs creator plan to assign voices to the transcript. You can also edit the content which is awesome. You may want to further edit for time, and gaps in responses. 11labs is not as conversational as NotebookLM but you can make it work. NotebookLM created the base, but customization with the other tools is possible. It would make it easier (for me) if Google just purchased 11labs and integrated it all together ?? .

要查看或添加评论,请登录

Chris Gathercole的更多文章

  • Capability-First: A Different Take On Product Strategy And Prioritisation

    Capability-First: A Different Take On Product Strategy And Prioritisation

    tl;dr First identify a strategically relevant portfolio of versatile product capabilities, and only then prioritise the…

    9 条评论
  • So, you have a huge internal appetite for LLMs?

    So, you have a huge internal appetite for LLMs?

    In an article published on 26 July 2024, the Financial Times described how JP Morgan now has “about 50,000 staff [..

    1 条评论
  • So, you want to organise a Hackathon?

    So, you want to organise a Hackathon?

    TL;DR?—?Start at the end, with the intended outcomes, and work backwards. But also start with, “Are you sure what you…

    1 条评论
  • So, (you think) you have a strategy?

    So, (you think) you have a strategy?

    There are many things called “a strategy”, but how can you assess them? What if you want to construct your own…

    2 条评论
  • So, you have an FAQ?

    So, you have an FAQ?

    Whether yes or no, this is your way in to using AI. Do you have an FAQ? You know, that always out of date collection of…

    4 条评论
  • Mind the (AI) Gap

    Mind the (AI) Gap

    You can’t plan and prioritise properly if you don’t understand the range of AI possibilities, or how they can relate to…

    2 条评论
  • The LLM-ephants In The Room

    The LLM-ephants In The Room

    In any discussion of Large Language Models (LLMs), it felt necessary to list a variety of recurring caveats, all of…

    1 条评论

社区洞察

其他会员也浏览了