thinking (creatively) with machines
gen 1 vs. gen 2 DALLE creations

“An armchair in the shape of an avocado” is a weird thing to have shake the world. But when an AI can draw it in hi-def, it might.

The release of the second-generation model DALL-E 2, which generates imagery from a textual description, is striking a chord. The visuals it generates are inspiring and scary-good. It’s higher-resolution, lower-latency, and more capable (edits too!) than its predecessor. Dare we say it’s more creative? Here are a few of my favorites so far:

"Teddy bears shopping for groceries in ancient Egypt."

“A rabbit detective sitting on a park bench and reading a newspaper in a Victorian setting.”

With models like DALL-E 2, everyone can now create visuals of high expressive quality at much lower cost. Visuals make communication more impactful. Whether you think it is art or not, DALL-E is about to make us all more powerful communicators.

The Age of Assistive, Creative AI

We are entering the age of assisted knowledge work and play. After I first wrote about the conversational economy and the potential of AI agents, there was a period of disillusionment. Bots deployed in the wild, for example to replace human support agents, didn’t live up to revolutionary expectations.

But we have seen a flurry of AI applications that have had counterintuitive success. Instead of fully automating repetitive work, they’ve excelled at higher-level, creative, nuanced, and empathetic tasks as varied as art generation, language translation, and personalized mental health coaching (try WoeBot).

I’m particularly interested in ways AI can augment humans attempting different modes of creative knowledge work, where we don’t need to see superhuman performance before deploying in the wild, and where some level of unreliability while models improve isn’t fatal.

There have been several breakthroughs besides DALL-E 2:

Our Future Writing Partners

Writing is the way we share knowledge, or create knowledge in the first place. We cannot have fully formed thoughts without organizing them in language. But writing is also incredibly hard.

Existing AI that helps us write comes in the form of either narrow utilities, such as rule and style checkers (Grammarly), auto-summarizers (see Quillbot), and short-form suggestions (see Google smart replies), or more specialized support (see Cresta, for helping call center reps chat effectively). But we don’t yet have a general-purpose writing assistant, the way Codex is a general-purpose programming assistant. A writing partner is really a thought partner. This will be a huge unlock for humankind.

How can we picture the path to that? A few skills we can picture for this partner:

  1. Keep track of and structure our ideas
  2. Generate new ideas
  3. Contextualize our ideas in the search space of all existing work
  4. Search for and suggest new, related knowledge while we are thinking
  5. Summarize that existing, related work
  6. Search for and suggest people we should discuss our ideas with
  7. Summarize the structure and storyline of what we have written for review
  8. Rate components for quality, suggest revisions

How might we think and write better if our first reader is someone who has all the knowledge of the internet? An agent that has read everything in a particular field?

In the above list, the second concept, generating new ideas, might be the least obvious. However, this is already within reach. In the field of purely creative writing, I used GPT-3 to help me create “lore” at scale (think fanfiction, but it can actually become part of the canon in web3) for a cyberpunk NFT project I love, Chain Runners, by feeding it structured prompts. I was inspired by Crypto Covens, which did something similar: read from keridwen about the process, with some examples of her lovely writing.
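As a rough illustration of what a “structured prompt” could look like (this is not the prompt actually used for Chain Runners), here is a minimal Python sketch. The `Character` fields, the `build_lore_prompt` helper, and the sample values are all hypothetical. The sketch only assembles the prompt text; sending it to a text-completion model such as GPT-3 is left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Character:
    # Hypothetical trait schema; a real project would use its own metadata fields.
    name: str
    faction: str
    traits: List[str]


def build_lore_prompt(character: Character, setting: str, max_sentences: int = 4) -> str:
    """Assemble a structured prompt that a text-completion model could continue."""
    trait_list = ", ".join(character.traits)
    return (
        f"Setting: {setting}\n"
        f"Character: {character.name}\n"
        f"Faction: {character.faction}\n"
        f"Traits: {trait_list}\n"
        f"Write a backstory of at most {max_sentences} sentences.\n"
        "Backstory:"
    )


# Illustrative values only.
runner = Character(name="Runner #42", faction="Renegades", traits=["cyborg", "courier"])
prompt = build_lore_prompt(runner, setting="a neon-lit megacity")
print(prompt)
```

Keeping the fields in a fixed order means the same template can be reused across many characters, which is what makes generating lore “at scale” practical.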

If an AI agent can coach us to write, it’s a small step to picture coaching for other types of communication, from the mundane (“don’t text when you’re angry”) to encouragement to use specific language (“how do you think about this?”). These agents can be thought partners if we want to test out a specific framing or practice a difficult conversation. They can give us guidance for cross-cultural interactions.

AI is on an exponential curve, which is always hard for humans to picture. OpenAI’s advancement of DALL-E has made it easier to see the future. I believe the next surprising advancement will be in language.

If wordcels rule the world, then we should all accept any help we can get from AIs to become our best wordcel selves. And if you’re an entrepreneur, don’t be afraid to be ambitious with what you assume of AI over the coming years. Plan to be surprised.

*Can’t wait for DALL-E 2 access? Check out Lore and mint your own piece of story with an AI-generated art NFT (no affiliation).

Mrudali Birla

Product Nerd | Consulting@KPMG | Grad student @ UW

2 years ago

Writing assistants that could convert heavy texts into pictures for people who have trouble reading large blocks of text would be super useful. Avocado-shaped chairs would be great for cheering up patients if a serious surgery chair were given this form.

Andrew Unger

Investor at Conversion Capital

2 years ago

This is mind blowing.

More articles by Sarah Guo

  • Pace

    The most undervalued trait in startup hiring is pace. This is especially true today, when the floor is lava.

    31 comments
  • Distributed Spectrum

    Winning the Invisible War If software is eating the world, electronic warfare is devouring the battlefield. The next…

    17 comments
  • Harvey

    In August 2022, when we first met with Winston Weinberg and Gabe Pereyra, we were struck by a few different things:…

    12 comments
  • Mike Vernal @Conviction

    I am thrilled to welcome Mike Vernal as a General Partner at Conviction. At Conviction, at the eve of the AI…

    80 comments
  • How Fast to Hire

    “I think it’s working, but I don’t know how fast to go on hiring. How much burn is acceptable?” Startup hiring is…

    12 comments
  • Why Embed

    At this moment in time, there is a gap in the technology ecosystem. With access to large-scale general AI models, you…

    2 comments
  • "Runway" is the wrong way to plan

    1/ “Runway” is really a cursed way to think about startups and the remnant disease of the 2017-2022 VC bubble. If cash…

    13 comments
  • Temporary Markets and “Easy” Problems: The Suddenly Popular Idea of LLMOps

    Sometimes, all of sudden, micro-markets emerge. They can be triggered by all sorts of things, for example an external…

    7 comments
  • Launching With Conviction

    Occasionally, a technology comes along that changes everything. AI is that kind of foundational technology.

    146 comments
  • Conflict Avoidance is Dishonesty

    It's really hard to tell the truth when the truth isn’t positive. Telling the truth requires recognition of risks…

    16 comments
