Intent and Iteration are the new Click and Drag

The current generation of LLMs is beginning to let us move from the world of syntax and process to semantics and intent. This is incredibly powerful and valuable for a bunch of reasons. Intent is much higher leverage and much more robust than process. Saying “go into the kitchen and get the milk” is denser than saying “lift your left foot 3cm at .5cm/s, swing forward at 4 degrees per ms, …” etc. And a precise process for walking into the kitchen to get the milk gets even bigger and harder once you have to write all the edge cases (“if cat present, then…”). It’s not really feasible.

Intent is higher density, by a lot, but the underlying system has to be capable of understanding and executing that intent. That’s where there are some opportunities in recent LLMs like GPT-4. But there are still challenges here. One of them is precision: the models are very rich, and very capable, but not always repeatable, and not always accurate (famously, they hallucinate).

At the intersection of these two ideas is an approach to designing programs and UX that sounds simple, but it probably as foundational as “click and drag” has become: iteration. We aren’t used to designing this way - oh, sure, we iterate a lot on our own content. We rewrite posts and images, try things and back out and go again.

But we’re not used to doing that with something we think of as a program. You don’t get to change the icon you clicked on if it doesn’t do the right thing - it’s static, it either works or it doesn’t. But in the realm of language or meaning, things aren’t that simple. Maybe you didn’t express yourself well. Maybe the compression worked too well and you need to clarify (“which milk? Oat or Almond?”). Maybe you were mistaken even, about what you wanted and need to correct (“Actually, maybe I’ll have tea”).

There will almost certainly be best practices that emerge, just as there have for click-and-drag interfaces. There will be a balance between giving the user breadth, and giving guidance, just like there is a balance between having lots of functionality and having too cluttered a graphical design - and there will probably be both good and less good practitioners of design in this space of iteration.

I believe that most of what we will wind up doing eventually will be “talking” to the computer - where talking right now is mostly defined as “text chat” but will shortly be multimodal: images, gestures, voice, etc. As those interactions get more complex, it will be harder and harder to build “rigid” interfaces like we do now. Why build a bunch of static buttons that reflect and underlying, rigid schema somewhere, when you can let the user tell you what you want, find the result in a vector database, and iterate together?

Intent and Iteration will be a foundational metaphor for user experience in the next wave of software. We got click-and-drag windows interfaces when the tech was advanced enough to give us high resolution screens and fast processors that could handle the real-time interaction. We now have new capabilities that let us handle interaction with intent and meaning in real-time. Now it’s time to build the experiences that are native to those capabilities.

Julien Genestoux

Founder, CEO @unlockProtocol

1 年

I have been thinking a lot about feedback loops in the context of these LLMs. It seems that this is still primarily unexplored, yet IMO has a lot of potentials. In your drag-and-drop example, the UI actually has something really interesting: it shows the "file" as stuck to your mouse pointer. This is a critically important way of knowing that the operation will complete successfully. I think a lot of hallucinations could be avoided if the LLM was more often looking for clarification before actually completing the request.

要查看或添加评论,请登录

Sam Schillace的更多文章

  • AI analogies and historical lessons

    AI analogies and historical lessons

    How to make sense of it all. I've decided to keep the primary posts over on Substack.

    1 条评论
  • Motion, Thought, Systems and AI

    Motion, Thought, Systems and AI

    In which I ponder how motion is like thought, why LLMs are like early steam engines (hitting things and pumping water),…

    4 条评论
  • Looking back at the Schillace "laws"

    Looking back at the Schillace "laws"

    Way back in, I think, March or so of 2023, after I’d spent a little while trying to build things with GPT-4, I wrote…

    5 条评论
  • A strange tech parable

    A strange tech parable

    In my role at Microsoft, part of what I do is spend time with the leadership team that runs M365, Office, and Windows…

    12 条评论
  • Simplicity, Clarity, Humility

    Simplicity, Clarity, Humility

    There is an old joke: “sorry this letter is so long, I didn’t have time to write a shorter one”. It’s funny, but it’s…

    4 条评论
  • A matter of context

    A matter of context

    It’s interesting that, as we talk about using AI more and more, the phrase we use is “human in the loop” instead of “AI…

    3 条评论
  • The tension between Chaos and Order

    The tension between Chaos and Order

    I’ve been spending the last week in Japan, meeting with makers and crafts people. as always, it’s a humbling…

    4 条评论
  • No Prize for Pessimism

    No Prize for Pessimism

    A book! I’ve been writing these letters for about 12 years now. I started writing them when I was at Box, as a way to…

    10 条评论
  • Adding Value in the Age of AI

    Adding Value in the Age of AI

    If you wrote out all possible combinations of, say, 1000 letters, the vast number of them would be nonsense. And the…

    3 条评论
  • Don't use AI to make work for humans

    Don't use AI to make work for humans

    I’ve started to notice an interesting pattern. More enlightened teams and people are using AI to get lots of work done…

    5 条评论

社区洞察

其他会员也浏览了