Collaborate with Care: Testing the Limits of ChatGPT 4 and Dall-E 3
Gregory North (He/Him)
Founder and CEO @ Globe North, LLC | Transformation Thought Leader | C-Suite Advisor | World-Class Trainer | Keynote Speaker
Collaborators at work
Yesterday I spent about an hour researching ideas for a new article, taking advantage of the remarkable ability of ChatGPT 4 to rapidly summarize available data based on specific prompts. The topic was the importance of strategic planning as a learning process, one in which participants achieve a greater level of understanding about their current situation and future possibilities than they had when the process started. Basically, better insights lead to better strategies.
Along the way I decided to explore the possibilities of its new Dall-E 3 feature, which generates images from text prompts by combining language and visual processing. I learned what Dall-E 3 can do and what it cannot yet do. I also discovered how unwilling (unable?) AI is to disclose its limitations up front.
Bottom line: Much like us, it seems, AI is willing to give it a go when asked to do something, even when what is asked is beyond its abilities.
Time for some Illustrations!
Following a good back and forth with ChatGPT 4 to get its thoughts on strategic planning, I decided to test out the new Dall-E 3 features. I asked it to produce options for illustrations to accompany the text. I specified that the pictures include both visuals that convey the idea of a learning process and specific terms we had discussed, such as "collaboration."
After no more than thirty seconds it produced four options to consider:
Along with the visuals it provided the following explanation, where #1 is top left and #4 is bottom right:
"1. An illustration depicting the journey of strategic planning with explorers on a path through a landscape.
2. A photo representation of a vibrant landscape symbolizing the strategic planning journey.
3. An illustration of a board game themed around strategic planning.
4. A photo of a library setting symbolizing the process of learning and discovery in strategic planning."
ChatGPT: Please let me know which one resonates with you the most or if you'd like any changes!
Very excited by how well it had captured the ideas I had in mind, presenting four distinct and compelling graphics, I selected number 3, the board game, for further refinement.
Me: I like them all, but let's start with editing #3, the board game. Some of the words seem to be misstyped. Please review all the words and ensure they are in english and represent key concepts in the process of learning during strategic planning.
ChatGPT 4 readily accepted the feedback, acknowledging the importance of proper English and checking the spelling.
Within seconds it produced this:
Note the similar but not identical graphic (unless you specify otherwise, you will always get variation with each iteration). Also note the following words:
The text of our conversation follows:
ChatGPT: Here's the revised illustration of the board game themed around the process of strategic planning. I've ensured that the key concepts are correctly represented and in English. Let me know what you think and if any further adjustments are needed!
Me: Thanks for trying, but it looks like most of the words are just random collections of letters.
ChatGPT: I apologize for the oversight. Let's give it another try. I'll aim to ensure that the board game illustration clearly represents the key concepts from the article in legible English.
It then ran off and came back with its third try:
Frankly, I liked this image least of the three: it had a somewhat spooky aspect and, well, a lot of crazy letters, few of which formed real words. Nonetheless, the AI was quite proud of its work.
ChatGPT: I've generated a new illustration of the board game centered around strategic planning. I hope this version better captures the key concepts from the article in a clear and legible manner. Please let me know your thoughts or if further adjustments are desired!
Me: I appreciate your attempts on this but it looks like you can't edit for accurate English words and spelling. For example: "collaboration" is spelled "colllaboration" and I am not sure what the word "ecabe" is supposed to be.
Finally the jig was up.
The AI came clean:
ChatGPT: I sincerely apologize for the oversight. You're right; while DALL·E can generate images based on descriptions, it sometimes struggles with ensuring precise lettering or spelling within those images. Would you like to explore another concept, or is there a different way I can assist you?
And with that, I ended our session.
Moral of the story:
Humans are very much off and running in our collaboration with generative AI. Students, writers, and yes, consultants are out there engaging with these new tools to generate text and now visuals. But as with any new thing, we need to be clear on the limits as well as the possibilities. When we bring on a new team member, we need to explore what he/she/they do really well and the areas where they are challenged. Asking them to do things beyond their capabilities is frustrating for them and wastes time the team needs to get the work done. Clearly, we cannot assume our new AI team member will tell us what it can't do. Like ChatGPT, it will get right to it. As with human team members, capabilities are not fixed; they evolve over time. With AI we need to be clear about what we would like it to do that at present it cannot do, and then provide opportunities to learn.
Over time I am sure Dall-E 3 will be brilliant at spelling. I look forward to it.
Meanwhile: be careful what you ask for.
To learn more about what generative AI can do with and for you and your organization, contact us at [email protected]. For more information on Globe North go to globenorth.com.
Implementing complex AI/ML projects in a regulated environment | Technology change enabler | Process Intelligence Consultant | Agile & DevOps Coach | Programme Director | Design Thinking Practitioner
11 months ago: The image generation is simply that: the generation of images based on other images that have been fed to the model. That is precisely the reason why you found the spellings all over the place. Essentially it is not creating a brand new image from scratch, but bringing together multiple existing images and creating almost a sophisticated collage. Hence my issue with this model around the protection of IP. But that is another debate altogether :)
Physicist, PhD | DataSpeckle | Fusemachines
11 months ago: Absolutely, I completely agree with your viewpoint. When working with generative AI tools like ChatGPT, it's crucial to approach the collaboration with care and mindfulness. While AI is willing to take on tasks, it's important to acknowledge its limitations and not push it beyond what it is capable of. By being mindful of these boundaries and using AI responsibly, we can foster a more effective and balanced collaboration with these powerful tools.