SI Art With Short Prompts

<SI = Simulated Intelligence. True AI is still out of reach>

While preparing a review of a story with a feudal setting, I tried one short prompt with six generative art engines. The image above was generated in Bing by the Bing Image Creator, powered by Dall-E3. I'll state up front that Dall-E3 typically produces the most human-like images of the group. Or perhaps I should say, the images that come closest to what I had in mind. The prompt, as shown in the caption, was "The essence of feudalism". This is my favorite of the four images presented, as re-rendered in wide format by Dall-E3. It can do that now, but the result is not just an expanded copy of the original; the engine takes certain liberties.

A few anomalies indicate that Dall-E3 doesn't "think" like we do. First, there would be no forest of yew trees surrounding a castle like that. They breach the defenses automatically! Second, the livery of the mounted knights varies. The castle rooftops are blue for a reason, and the primary color of the livery should be a similar blue. I see three colors on the knights' doublets, and only one of the two squires depicted is wearing blue. Smaller points include:

  • The serfs' clothing would all have been in drab earth tones, from sepia to brown.
  • There's a staff in mid-air near the leftmost knight.
  • Two of the banners are blowing in the wrong direction. Even gusty winds at that elevation would not produce such a wide swing in direction.

Now, as for other engines:

Gemini produced nothing of interest, just random interiors of ramshackle barns. No humans or animals were visible; Google is still working out how to depict people without getting in trouble for the giant historical anomalies the Bard imager produced.

Strangely, Leonardo AI also didn't produce anything I wanted to show off.

Playground has three engines. The newest, Playground 3.0, is claimed to have the strictest adherence to a prompt. There are no controls or filters in the free version. Here are a couple of results:


I call this over-adherence to the prompt! A table from a textbook, and a bestiary of armor styles.

Onward to the Playground 2.5 engine, which has a variety of controls, including variable prompt adherence. I used the Delicate Detail filter for this:


This engine produced architecture. This baronial interior is fetching, but nothing like what I wanted for the illustration.

Playground's third engine is Stable Diffusion XL, which produced this:


This has a manor house rather than a castle, and focuses on the nearby village. Another offering by this engine was an overview of a large village of thatch-roofed cottages.

I hesitate to say that these engines have "personalities", but the differences in the ways they depict an idea reflect significant differences in their training datasets, at the very least.

I could have saved time by going with the Dall-E3 image, which I did use for the review, but checking with the other engines has been instructive.

