4 Methods of Prompt Engineering
Bahram Khanlarov
Azure Certified Data Analyst | Python | SQL | Power BI | Tableau | R | Hospitality Trained
Large language models are used for various tasks. Most people are familiar with chatbots, which we encounter frequently. These models are also employed for summarization, and another common use case is information retrieval. These are three distinct applications. But how does this relate to prompt engineering?
Prompt engineering is crucial for effectively communicating with large language models. What does it mean? It involves designing and crafting the right questions to elicit the responses you want from the model. This is important because we want to avoid hallucinations, which occur when a large language model generates false or misleading results. These hallucinations happen because large language models are primarily trained on internet data, which can contain conflicting information and inaccuracies.
RAG
As noted above, large language models are trained on Internet data; they have no awareness of your domain-specific knowledge base. When you query an LLM, you therefore want to bring that knowledge base to the model's attention. By knowledge base, we mean content specific to your industry or your company that the model should draw on when it answers. So how does this work?
Retrieval-Augmented Generation (RAG) is a prompt engineering approach that involves adding domain-specific knowledge to your LLM. To do this, two components are necessary: a retriever component that brings the context of your knowledge base to the generator part of the LLM, and the generator part that responds to your questions based on both the input and the knowledge base. This approach helps ensure accurate responses by grounding the LLM in your specific domain.
Here is a practical example of how Retrieval-Augmented Generation (RAG) can be used:
Let's say you work at a biotechnology company that is developing a new drug. You have a large amount of internal research and clinical trial data about the drug's development and performance. You want to use a large language model to assist with answering questions about the drug, but you know the model has not been trained on your company's specific data.
Using the RAG approach, you would first set up a retrieval component that can access your company's internal knowledge base containing all the details about the drug. This might include information like the drug's composition, the results of animal and human trials, potential side effects, and manufacturing details.
When a user asks the large language model a question like "What are the key side effects of Drug X based on the clinical trials?", the retrieval component would first find the relevant information from your company's drug data. It would then pass that context to the generation component of the language model.
The generation component would then use both the user's question and the retrieved data about the drug to produce a response. This allows the language model to provide an informed, accurate answer about the drug's side effects, drawing on the real data from your company's research rather than just general medical knowledge.
The RAG approach ensures the language model's responses are grounded in the specific domain knowledge your company has about this particular drug candidate. This makes the model much more useful and reliable for answering questions compared to relying solely on its broad, generic training data.
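The retriever-plus-generator flow described above can be sketched in a few lines of Python. This is a minimal illustration, not a production pipeline: the knowledge-base entries about "Drug X" are invented placeholders, the retriever is a simple word-overlap scorer standing in for a real embedding-based search, and the assembled prompt would normally be sent to an actual LLM.

```python
# Minimal RAG sketch: a toy retriever grounds the prompt in a small
# in-memory "knowledge base" before the prompt reaches the LLM.
# The entries below are illustrative placeholders, not real data.

KNOWLEDGE_BASE = [
    "Drug X composition: active compound XB-12 at 50 mg per tablet.",
    "Drug X clinical trials: most common side effects were mild nausea and headache.",
    "Drug X manufacturing: tablets are produced under GMP conditions.",
]

def retrieve(question: str, kb: list[str], top_k: int = 2) -> list[str]:
    """Retriever component: score each document by word overlap with the question."""
    q_words = set(question.lower().split())
    scored = sorted(kb, key=lambda doc: len(q_words & set(doc.lower().split())),
                    reverse=True)
    return scored[:top_k]

def build_rag_prompt(question: str, kb: list[str]) -> str:
    """Combine retrieved context with the user question for the generator component."""
    context = "\n".join(retrieve(question, kb))
    return (f"Answer using only the context below.\n\n"
            f"Context:\n{context}\n\n"
            f"Question: {question}")

prompt = build_rag_prompt("What are the key side effects of Drug X?", KNOWLEDGE_BASE)
print(prompt)
```

In a real system, the retriever would use vector similarity over embedded documents, but the structure is the same: retrieve relevant context, then place it in the prompt so the generator is grounded in your domain data.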
Chain-of-Thought
Chain-of-thought (CoT) prompting is a technique used in prompt engineering to improve the reasoning and accuracy of large language models, especially in tasks that require complex, multi-step reasoning (Wei et al., 2022). The idea behind CoT prompting is to guide the model to break down a problem into a series of logical steps or intermediate stages, which it then solves one by one, leading to a final answer.
The key aspects of the CoT approach are:
1. Decomposition: the problem is answered through explicit intermediate reasoning steps rather than in a single jump.
2. Demonstration: few-shot exemplars in the prompt show worked-out reasoning chains that the model imitates.
3. Accuracy gains: Wei et al. report the largest improvements on arithmetic, commonsense, and symbolic reasoning tasks.
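A chain-of-thought prompt can be as simple as a string template. The sketch below, in the few-shot style of Wei et al. (2022), shows one worked exemplar followed by a new question; the example arithmetic problems are illustrative, and in practice the prompt would be sent to an LLM.

```python
# Few-shot chain-of-thought prompt: one exemplar shows a worked-out
# reasoning chain, so the model answers the new question step by step.

cot_exemplar = (
    "Q: A cafeteria had 23 apples. It used 20 and bought 6 more. "
    "How many apples does it have now?\n"
    "A: The cafeteria started with 23 apples. It used 20, leaving 23 - 20 = 3. "
    "It bought 6 more, so 3 + 6 = 9. The answer is 9.\n"
)

def build_cot_prompt(question: str) -> str:
    """Prepend the worked exemplar and cue the model to reason step by step."""
    return f"{cot_exemplar}\nQ: {question}\nA: Let's think step by step."

print(build_cot_prompt(
    "Roger has 5 tennis balls. He buys 2 cans of 3 balls each. "
    "How many balls does he have?"
))
```

The trailing cue "Let's think step by step" is the zero-shot variant of the same idea: even without exemplars, it nudges the model to produce intermediate reasoning before the final answer.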
ReAct: Reasoning and Acting
ReAct (Reason + Act; Yao et al., 2023)
It is a prompt engineering approach that combines the reasoning of Chain-of-Thought with the ability to gather additional information from external sources. ReAct goes beyond content grounding in your private knowledge base by also accessing public resources to supplement its responses. This allows for more comprehensive and informed answers, especially when the necessary information is not available in your private knowledge base.
ReAct operates in a three-step process:
1. Thought: Defining the goal of the prompt.
2. Action: Specifying where the necessary information can be found.
3. Observation: Combining the information from both the private and external knowledge bases to arrive at a final response.
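The Thought → Action → Observation cycle above can be sketched as a simple loop. This is a toy illustration: the `lookup` tool and the scripted reasoning steps are mock placeholders standing in for a real LLM and a real external search tool.

```python
# Toy ReAct loop: alternate Thought -> Action -> Observation until an
# answer is reached. A real system would have the LLM generate each
# Thought/Action and would call a real tool (search engine, KB API).

TOOL_DATA = {"capital of France": "Paris"}

def lookup(query: str) -> str:
    """Mock external tool, standing in for a search engine or knowledge-base API."""
    return TOOL_DATA.get(query, "no result")

# Scripted trace that a real LLM would generate one step at a time.
steps = [
    ("Thought", "I need to find the capital of France."),
    ("Action", "lookup[capital of France]"),
]

trace = []
for kind, content in steps:
    trace.append(f"{kind}: {content}")
    if kind == "Action" and content.startswith("lookup["):
        # Parse the tool call and append its result as an Observation.
        query = content[len("lookup["):-1]
        trace.append(f"Observation: {lookup(query)}")

trace.append("Answer: Paris")
print("\n".join(trace))
```

The key design point is the interleaving: each Observation is fed back into the prompt, so the model's next Thought can build on what the tool actually returned rather than on what the model assumes.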
Directional Stimulus Prompting (DSP)
Directional Stimulus Prompting (DSP), introduced by Z. Li in 2023, is an innovative approach for steering black-box large language models (LLMs) towards specific desired outcomes. Rather than modifying the LLMs directly, this technique uses a smaller, adjustable policy model to create an auxiliary prompt (or hints) that guides the LLM's response for each input.
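The hint mechanism can be sketched as follows. In the actual DSP method the policy model is a small trained language model tuned with supervised learning and reinforcement learning; here a trivial keyword-frequency extractor stands in for it, purely to show how the generated hint is attached to the prompt. The sample article text is invented.

```python
# Directional Stimulus Prompting sketch: a small "policy model" (here, a
# trivial keyword extractor standing in for a trained model) produces a
# hint that is appended to the prompt to steer the black-box LLM.

from collections import Counter

STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "was", "on"}

def policy_hint(article: str, n: int = 3) -> str:
    """Stand-in policy model: pick the most frequent content words as hint keywords."""
    words = [w.strip(".,").lower() for w in article.split()]
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 3)
    return "; ".join(word for word, _ in counts.most_common(n))

def build_dsp_prompt(article: str) -> str:
    """Attach the auxiliary hint to the task prompt; the LLM itself is untouched."""
    return (f"Summarize the article.\n\n"
            f"Article: {article}\n\n"
            f"Hint (keywords): {policy_hint(article)}")

article = ("The new vaccine trial enrolled 500 patients. The vaccine reduced "
           "infections. Patients tolerated the vaccine well.")
print(build_dsp_prompt(article))
```

Because only the hint-generating policy model is trained, the approach works with black-box LLMs accessed through an API: the large model's weights are never modified, only its input.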
References
Weng, Lilian. (Mar 2023). Prompt Engineering. Lil’Log.
IBM. (Jan 2024). 4 Methods of Prompt Engineering.
Su, Jeff. (Aug 2023). Master the Perfect ChatGPT Prompt Formula.