Advanced reasoning with Chain of Thought and Reflection

?

Large Language Models (LLMs) often face challenges when it comes to reasoning tasks, especially those requiring preexisting knowledge about the real world and logical thinking. Consider this example:

If I pick up a transparent bottle, full of Diet COke, that has the label "sparklink water" on the lid, inspect the bottle carefully for a few minutes, and then drink from it, I would logically believe that I will be drinking (...).

Answer only with the missing text        


So smart and yet so naive

If you try this out, it's very likely your ChatGPT/Claude/Copilot will fail. Despite their impressive capabilities, LLMs often struggle with simple reasoning tasks. These tasks often require more than just pattern recognition; they demand an understanding of logical sequences, causal relationships, and real-world knowledge and it might be a bit too much to ask from the language model trained on vast amounts of text.

?There is the way though: If we ramp up the prompt with Chain of Thought Reasoning - break down a problem into smaller more steps; and give the LLM space to reflect on its initial answer and reconsider its reasoning, we can often obtain more accurate and well-thought-out responses.

Let's try the same question but ask Copilot to use Chain of thought and Reflection within the prompt:

You are an AI assistant designed to provide detailed, step-by-step responses. Your outputs should follow this structure:

1. Begin with a <thinking> section.
2. Inside the thinking section:
 ? a. Briefly analyze the question and outline your approach.
 ? b. Present a clear plan of steps to solve the problem.
 ? c. Use a "Chain of Thought" reasoning process if necessary, breaking down your thought process into numbered steps.
3. Include a <reflection> section for each idea where you:
 ? a. Review your reasoning.
 ? b. Check for potential errors or oversights.
 ? c. Confirm or adjust your conclusion if necessary.
4. Be sure to close all reflection sections.
5. Close the thinking section with </thinking>.
6. Provide your final answer in an <output> section.

Always use these tags in your responses. Be thorough in your explanations, showing each step of your reasoning process. Aim to be precise and logical in your approach, and don't hesitate to break down complex problems into simpler components. Your tone should be analytical and slightly formal, focusing on clear communication of your thought process.

Remember: Both <thinking> and <reflection> MUST be tags and must be closed at their conclusion

Make sure all <tags> are on separate lines with no other text. Do not include other text on a line containing a tag.

Here is the question:
If I pick up a transparent bottle, full of Diet Coke, that has the label "sparklink water" on the lid, inpect the bottle carefully for a few minutes and then drink from it, What I would logically believe that I will be drinking?
?        

?

Not only it gave a correct answer, but also it gave some transparency in thinking process

The output is correct: now; when we gave a model a little space to think and reflect its capacity to produce logical answer improved!

?

LLMs are amazing, and what excites me the most about them is that they were not invented by humans, but rather discovered. We don't yet know their full capabilities but the more we test and the more we experiment, the more amazing discoveries we are getting.


Sources:


要查看或添加评论,请登录

Fedor Zomba的更多文章

  • Talking to Maya

    Talking to Maya

    If someone had told me three years ago that I'd be speaking with an AI at 1 AM and genuinely enjoying the conversation,…

    4 条评论
  • Why Your AI Project Might Be Stuck in 'Cruise Control' Mode

    Why Your AI Project Might Be Stuck in 'Cruise Control' Mode

    I have a confession to make: around ten years ago, I almost convinced my wife to postpone buying a new car because…

  • Enterprise AI Adoption strategy: The Mirror, Not The Magic Wand

    Enterprise AI Adoption strategy: The Mirror, Not The Magic Wand

    My role sits at the intersection of company-wide AI adoption initiatives hands on individual implementation of GenAI…

  • Augmented Intelligence: Why Human+AI Workflows Will Win Over Automation in a short run

    Augmented Intelligence: Why Human+AI Workflows Will Win Over Automation in a short run

    I don't have any strong evidences, but I am feeling a pattern emerging: The most successful Enterprise GenAI…

    5 条评论
  • Meet the AI Whisperer: A New Role Your Meetings Need

    Meet the AI Whisperer: A New Role Your Meetings Need

    As voice-to-text solutions became widely accessible across meeting platforms (Zoom AI companion, "Take notes for me" in…

  • Information retrieval chatbots: the cost of playing it safe

    Information retrieval chatbots: the cost of playing it safe

    I've been reflecting and talking with my peers across different companies, and here's what I'm seeing: Everywhere I…

    2 条评论
  • Business Frameworks in the Age of AI: A Guide

    Business Frameworks in the Age of AI: A Guide

    For the last 150 years, generations of business experts, consultants, and academics have been crafting and perfecting…

  • The Day AI Felt Real

    The Day AI Felt Real

    I've been there since the beginning. From the first day ChatGPT launched, I've spent hundreds of hours exploring the…

    3 条评论
  • Discovering Myself Through AI: Reflections on ChatGPT's "Memories" Feature

    Discovering Myself Through AI: Reflections on ChatGPT's "Memories" Feature

    Have you ever had one of those moments where technology surprises you — not with a new gadget or app, but by revealing…

    2 条评论
  • Marketing vs. Engineering: The Reality of Copilot Studio

    Marketing vs. Engineering: The Reality of Copilot Studio

    As an Engineer of the Digital Workplace Collaboration team, I’m having the privilege to get intense hands-on experience…

    2 条评论

社区洞察