This post was not written by an AI
DiffusionBee prompts: “A person writing a letter, Japanese Woodcut”; “A woman writing a letter, 1960s tv show”; “a person writing a letter on Twitter”

Creative work generated by artificial intelligence is having its moment. From DALL-E 2, which builds highly detailed images from almost any prompt, even an outlandish one, to writing tools like Jasper that create original written content from small inputs, AI content creation is here to stay.

If you look at the example above, the model generates different images based on the prompt. The first one uses “A person writing a letter, Japanese woodcut” to steer the model; the next is based on a 1960s TV show, and the third is a bit more free-form (what does writing a letter on Twitter mean, anyway?). At a glance, these images are pretty good. I don’t pretend to know how AI models like this work. Suffice it to say you train one by letting it look at lots of images. Then the model can calculate the similarity between your prompt and the images it learned from. In the end, you write some prompts and get some images!

Why should you avoid using AI to create images?

AI models raise a few ethical questions for content creators. This computer code builds new images from information scanned from the internet and does not pay any of the authors whose work trained it. Without this training data, it wouldn’t know anything.

Does this mean it’s not capital A “Art”? That’s a question for philosophers. Art is made by artists (and by people who make art), and there are undoubtedly questions about whether computer intelligence can be truly generative. But AI generation is not going away. It might make very small adjustments without your knowledge (this happens inside your iPhone when you “take a picture”), or it might create entirely new options for you to choose from.

There’s a catch. Simply saying “I’m not going to use AI” may not be possible unless you go back to a film camera. It’s going to be really hard to avoid in the future. So we need to mitigate the problems this sort of technology can and will create. That includes figuring out how to signal when AI is being used and how to license this information for fair use.

You are going to be using AI to create images

There are a lot of good reasons to use an AI helper for knowledge work (yes, photos are just the start of this change). Most of them come down to generating a set of ideas that a human can then edit.

Here are a few helpful ways to use AI image prompts:

  • as a “scratchpad” to iterate ideas
  • to generate a specific type of image for a presentation or a prompt
  • to produce almost limitless iterations of similar content

AI models build, well, anything. They also help skilled operators produce work in a fraction of the time it took before. AI is going to cause changes for almost any knowledge work business because it produces “good enough” work very quickly with a low barrier to entry. (Here’s a list of companies working on this problem, curated by Elaine Zelby.)

It will be difficult to create a “moat” (a defensible advantage) by using only AI models as the lever. Knowledge work is going to evolve into something like prompt engineering where the best operators are the best “AI whisperers.” Because AI models start from human knowledge, we also need to be cautious and vigilant about declaring the bias inherent in these prompts.

The reward when we do this right? New business models. For example, Shutterstock is beginning to sell AI-generated stock images. This means they have realized that there is money to be made in generating this sort of image, and also that the software is not good enough to do it on demand. I believe that the software will be proficient at this task eventually and that there will remain an arbitrage between the work needed to make a “good” result and the work a prospect is willing to do.

Trying AI out: some practical examples

A simple example will help illustrate some of the ways AI-generated image models are effective today, and where they have room to grow to approach a creative human editor.

Where does AI struggle today?

AI models do not reproduce logos or text well in the way that we expect. You would think these models could index font families, read text, and be able to generate new text in a readable format. Likewise, you expect an AI model to be able to search for brand logos in the same way we use Google Images to search for an example.

AI models fail at this task, producing “words” that don’t look like readable words and logos and brand images that look like strange photocopy overlays of the expected thing. Both of these tasks feel like things that could be improved quickly with dedicated or human-assisted operation.

What AI models do well

Models do well with generic prompts and a style (for example, “cat, Japanese woodcut”). When you use a subject with more examples on the internet, the outcomes also seem better. This makes sense when you consider the importance of many overlapping instances of similar images for producing a decent outcome. The model is copying the stylistic cues of many images on the internet, and it copies them better the more training images exist.

To improve the likelihood of success, don’t ask the model to do too much, yet. There is a tendency to look at some of the amazing images this method can produce and think great results happen automatically.
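
The “subject, style” pattern described above is easy to keep consistent with a tiny helper. This is only a sketch in Python: the function name `build_prompt` is hypothetical, and the assumption is simply that the tool accepts a comma-separated prompt like the ones used in this post.

```python
def build_prompt(subject, styles):
    """Join one subject and a short list of style cues into a single prompt.

    Keeping the list of styles short follows the advice above:
    don't ask the model to do too much at once.
    """
    # Drop empty entries and stray whitespace so the prompt stays clean.
    parts = [subject.strip()] + [s.strip() for s in styles if s.strip()]
    return ", ".join(parts)


# One generic subject plus one style, as in the example above.
print(build_prompt("cat", ["Japanese woodcut"]))
```

Starting with one subject and one style, then adding cues one at a time, makes it easier to see which cue changed the result.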

An Example: portraying Spider-Man’s J. Jonah Jameson

Let’s try it out with DiffusionBee, a desktop app using the AI model Stable Diffusion. I asked this app to create a picture of J. Jonah Jameson, the newspaper publisher from the Spider-Man comics. The goal was to recreate an image in the style of Steve Ditko and John Romita, comic book artists who drew Spider-Man in the 1960s.

Here’s an example of panels from the actual comic books:

Panel from a Spider-Man comic illustrated by Steve Ditko

And here’s an example panel created by the AI model using a comic book prompt.

DiffusionBee “J. Jonah Jameson as a cat writing a blog post on using AI to generate blog posts, comic book style, line drawing, in the style of steve ditko and john romita, 1960s marvel comic book” Seed : 37656 | Scale : 7.5 | Steps : 25 | Img Width : 512 | Img Height : 512

It’s got the right idea, rendering some comic panels, capturing Ditko’s signature style, and creating a character that looks reasonably like JJJ. (If you’re a Spider-Man fan, I think this looks more like Norman Osborn, but I will give the AI a pass on this one.) The model fails miserably at creating copy for the speech bubbles.

Compare this example to the AI generating a portrait of J. Jonah Jameson as a cat.

DiffusionBee “J. Jonah Jameson as a cat, 8k, hyperrealistic, SLR, F1.8,Nikon, film, portrait” Seed : 81483 | Scale : 7.5 | Steps : 25 | Img Width : 512 | Img Height : 512

Is it “believable”? Part of the reason that our brains want to accept this as a believable outcome is that it looks like a portrait, follows the conventions of photography, and is relatively seamless as an outcome. You won’t get the same outcome twice, though. Also, the same prompts produce different results on different generative models.
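
Because outcomes vary from run to run, it helps to keep the settings that came with each image. Here is a minimal Python sketch that turns the settings text from the captions above into structured values. It assumes the “Key : value | Key : value” layout shown in this post; `parse_settings` is a hypothetical helper, not part of DiffusionBee.

```python
def parse_settings(caption_tail):
    """Parse text like 'Seed : 81483 | Scale : 7.5' into a dictionary."""
    settings = {}
    for field in caption_tail.split("|"):
        key, _, value = field.partition(":")
        key = key.strip().lower().replace(" ", "_")
        value = value.strip()
        if not key or not value:
            continue  # skip empty or malformed fields
        try:
            number = float(value)
        except ValueError:
            settings[key] = value  # keep non-numeric values as text
            continue
        # Store whole numbers (seed, steps, sizes) as integers.
        settings[key] = int(number) if number.is_integer() else number
    return settings


caption = "Seed : 81483 | Scale : 7.5 | Steps : 25 | Img Width : 512 | Img Height : 512"
print(parse_settings(caption))
```

Saved next to the prompt, these values at least make it possible to compare two runs side by side.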

A Proposal for AI Metadata

The elephant in the room for AI-generated images is that they offer no credit to the original author or to the work that's used to generate a seed for that data. We need a way to identify how images are generated that is displayed and easily readable along with those images. There are many reasons for this, chief among them the need to compensate creators, limit copyright infringement, and present the option to filter images people don’t want to see because of their content or generation model.

A modest proposal here would be to adopt a standard like the one already in use for JPEG images. EXIF is a standard for metadata documentation that helps people determine the original image criteria (for example, the camera settings used). A version of this for AI-generated images might include the original prompt, the engine and version used to produce the image, and the URL of the seed that produced the image. While it’s hard to speculate on any changes that might happen with IP law and copyright for existing images, adding this data lineage would help allocate any money that accrues from selling these images. (Makes you wonder how Shutterstock and other providers are going to address this question.)
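
To make the proposal concrete, here is a rough Python sketch of what such a record could look like if it were stored as JSON alongside an image. The field names are my own assumptions and are not part of EXIF or any existing standard. The engine version was not recorded for the image above, so it is left as a placeholder.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class GenerationRecord:
    """Hypothetical provenance record for one AI-generated image."""
    prompt: str          # the original prompt, verbatim
    engine: str          # the model or app that produced the image
    engine_version: str  # placeholder: not recorded in this post
    seed: int            # the seed reported by the tool

    def to_json(self):
        # Sorted keys keep the output stable, which makes records easy to compare.
        return json.dumps(asdict(self), sort_keys=True)


record = GenerationRecord(
    prompt="J. Jonah Jameson as a cat, 8k, hyperrealistic, SLR, F1.8,Nikon, film, portrait",
    engine="Stable Diffusion (via DiffusionBee)",
    engine_version="unknown",
    seed=81483,
)
print(record.to_json())
```

A real standard would need to settle where this data lives (inside the file or beside it) and how to link it back to the creators whose work trained the model. This sketch only shows the shape of the data.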

This post was not written by an AI. Why is that? The primary reason is that these models can’t reason like people yet. Will they ever get there? It comes down to better understanding when we encounter original thought. In the future almost everyone will use AI assistance for creative work. I’m confident that we’ll still be able to tell the difference, though we might require AI-detecting assistance to help us.

What’s the takeaway? AI generation for creative work is becoming more common, and is likely to provide at least one option in most creative software going forward. To respond, operators (and creators) need to learn the parameters that help AI to deliver the best results, and invent new ways to be unique.


Alex Eben Meyer

freelance illustrator

2 years ago

an ethically/opt-in sourced data set. some set up that resembles ASCAP in music, where artists get paid when even parts of their work are used.

John Mulhollen

Customer experience, Revenue Operations

2 years ago

#1 - that’s exactly what an AI would say. #2 - Thought provoking. If art I create is an aggregate of everything that ever left an impression on me, am I not doing the same thing a machine is doing? It would be difficult to track the sources of my inspiration, whereas for an AI-generated image it is likely doable.
