登录查看更多内容

Can AI Replace Designers and Artists?

Tagglabs

Engineering Goosebumps Through Experiences, Your Experiential Tech Partner.

发布日期: 2022年8月19日

Ever seen a polar bear play the bass? Or a robot that looks like a Picasso? If not, there is a new Al system from OpenAl called DALL-E2 that can take straightforward text descriptions like "a koala dunking a basketball" and turn them into never seen photorealistic or stylised visuals.

DALL-E2 can also edit and retouch images accurately. It may fill in or replace a portion of an image based on a basic natural language description. Using Al-generated imagery that effortlessly blends in with the original by a process called “inpainting.” OpenAl debuted DALL-E in January 2021, a system that could produce visuals from words, such as "an avocado armchair." DALL-E2 pushes the technology even further with a sharper resolution, more comprehension, more compact design and new capabilities like inpainting.

It can also take a picture as an input and generate versions with various angles and styles. By exposing a neural network to photographs and their textual descriptions, DALL-E was developed. Through deep learning, it studies relationships between objects in addition to understanding specific objects like koala bears and motorcycles. Additionally, DALL-E is capable of producing any image that has a connection to another thing or action when you ask it to make a picture of a koala bear riding a motorcycle. The DALLE 2 algorithm has figured out how images relate to the text.?

Commercial Use

Last month on 20 July 2022 Open AI announced that they will invite 1 million people from the waitlist over the coming weeks. Users can create with DALL·E beta using free credits that refill every month, and buy additional credits in 115-generation increments for $15. The users will have complete usage rights for all media they produce on DALL-E 2, such as the ability to sell, reproduce, and merchandise it. For a variety of reasons, this announcement is highly intriguing. Firstly, we are beginning to see how AI is being employed in some business models. Companies will be selling access to their tools in an attempt to recoup some of their investment. This may also be a glimpse into the creative processes of the future when humans will increasingly rely on AI.

This implies that you will be able to use the images you produce with this software to illustrate children's books or to make movie storyboards for your commercial projects. as well as to resell them. Everyone in the sector is currently praising this as a true game-changer. The future of visual material appears to be here, and photographers are poised to revolutionise the market with their AI-generated graphics.

After explaining what DALLE 2 is, how it functions, and how it is a real game changer in terms of a commercial standpoint let's talk about some other features such as variations, inpainting, and text differences.

Variations?

DALLE 2 contains syntactic and semantic variations. DALLE 2 typically preserves both stylistic and semantic details. The output image can also be changed by DALLE 2 to reflect a change in syntactic-semantic structure. Syntactic components can be encoded separately. Images that accurately reflect the input sentence with sufficient visual semantic linkages can be produced without having seen all of the distinct syntactic elements combined in the dataset.

Inpainting

Additionally, DALLE 2 can perform automatic inpainting on already-existing images. It can match the newly added object's style to that area of the image. To adapt the existing image to the presence of the new object, it also modifies textures and reflections. This may imply that DALLE 2 employs causal reasoning.?

Text Diffs

Another cool feature of DALLE 2 is its ability of interpolation. DALLE 2 can change one image into another using a method known as text diffs. The model may additionally alter objects by improving upon interpolations. The interpolated images' most infamous characteristic is that they maintain a decent level of semantic coherence. Consider the advantages of an advanced text diffs technique. By altering a word in the prompt, you might request changes in people, places, things, apparel, etc. and see the consequences immediately.?

DALL E 2 might seem like a silver bullet to your graphic design/ art woes. However, it has several drawbacks such as :?

Unavailability for public

It is regrettable that members of the public who want to try it cannot. Similar to how GPT-3 was only revealed to a few groups of people, DALLE 2 will be released to a handful of people at a time.

Bernard Marr 6 个月前

The age of co-creation with AI has begun

Nando Costa 2 年前

9 AI Art Prompt Tips for Getting the Best Results

Vishal Chandra 2 个月前

Inconsistency

In general, DALLE 2 products look great, but consistency is sometimes lacking due to the inhuman work. DALLE 2 is very adept at acting as if it understands how the world works, but it doesn't. People can't paint like DALLE 2, but they aren't likely to make mistakes like these on accident.

The text prompt for the image below was to show a red cube over a blue cube.

Text

DALLE 2 excels at drawing but fails miserably at spelling. There may be a lack of spelling information in the DALLE 2 dataset photos. A CLIP embedding is required for DALLE 2 to draw something accurately. As you can see in the image below the text prompt was to show a sign with "Deep learning" written on it.

Not as smart?

DALLE 2 delivers on its purpose. It doesn't always get them right; rather, its compositional logic is usually unreliable. In current circumstances, it's risk-free, but in other, more serious circumstances, it might not be.

While there are drawbacks DALLE 2 is definitely worth giving a try. DALL E 2 is a marvellous technological achievement that was recommended over its predecessor DALL E 1 for its description matching and photorealism. We have joined the waitlist and hope to be lucky enough to try it one day.

What to know more about how it works ?

We a sharing some links to the articles and documents we read in order to write the article. Don’t forget to read the actual research paper on DALL E 2.

https://openai.com/dall-e-2/

https://medium.com/ai-network/dall-e-2-meaning-limitations-and-solutions-a988c87ddeae#:~:text=Limits%20of%20DALL%C2%B7E%202&text=For%20example%2C%20if%20data%20labeling,results%20may%20be%20too%20different.

https://www.creativebloq.com/news/how-to-use-dall-e

https://www.assemblyai.com/blog/how-dall-e-2-actually-works/

Research paper - https://arxiv.org/pdf/2204.06125.pdf

要查看或添加评论，请登录

Tagglabs的更多文章

See all articles

Can AI Replace Designers and Artists?

Tagglabs

Engineering Goosebumps Through Experiences, Your Experiential Tech Partner.

Ever seen a polar bear play the bass? Or a robot that looks like a Picasso? If not, there is a new Al system from OpenAl called DALL-E2 that can take straightforward text descriptions like "a koala dunking a basketball" and turn them into never seen photorealistic or stylised visuals.

领英推荐

Tagglabs的更多文章

社区洞察

其他会员也浏览了

AI Art Generator: Creating Stunning AI Art

The best AI generators in 2024

Is AI an Artist or being artistic?

Creativity in the AI Age: Embracing the Intersection of Art and Artificial Intelligence

The Book I Never Published

Can we stop being so binary about the use of AI in creative work?

Generative AI Changes the Nature of Work

???????? № 8 – Artificial

The Ai (R)evolution issue n°9

AI: Chat GPT Prompts for Artists

Ever seen a polar bear play the bass? Or a robot that looks like a Picasso? If not, there is a new Al system from OpenAl called DALL-E2 that can take straightforward text descriptions like "a koala dunking a basketball" and turn them into never seen photorealistic or stylised visuals.

领英推荐

Tagglabs的更多文章

Brands have an opportunity to be a part of this culture-defining moment and make history.

Designing for 3D storytelling

How To Design For Immersive VR Experiences?

Enhance Event Engagements With NFTs

The Metaverse And AR Are Changing The Beauty Industry

Metaverse and AR - The Future Pillars Of E-Commerce

Building Better AR Experiences For Events With - NIANTIC LIGHTSHIP

Why do our AR experiences stand out from the crowd?

Mixed Reality -The Future of Experiential Marketing?

社区洞察

其他会员也浏览了

AI Art Generator: Creating Stunning AI Art

The best AI generators in 2024

Is AI an Artist or being artistic?

Creativity in the AI Age: Embracing the Intersection of Art and Artificial Intelligence

The Book I Never Published

Can we stop being so binary about the use of AI in creative work?

Generative AI Changes the Nature of Work

???????? № 8 – Artificial

The Ai (R)evolution issue n°9

AI: Chat GPT Prompts for Artists