登录查看更多内容

Generative AI - Learnings 2023

Mahtab Syed

Data and AI Leader | AI Solutions | Cloud Architecture(Azure, GCP, AWS) | Data Engineering, Generative AI, Artificial Intelligence, Machine Learning and MLOps Programs | Coding and Kaggle

发布日期: 2023年12月21日

+ 关注

This year 2023 has been the year of Generative AI using Large Language Models both closed source and open source.

Like many of us I have been learning via blogs, courses and using prompt engineering for building Generative AI apps and key takeaways are here:

1. Use LLM as a thought partner

It’s a new way to find creative information but be careful to check on incorrect information (hallucinations)

2. Examples of tasks LLMs can carry out

Writing - Brainstorming, Press Release, Translation
Reading - Proofreading, Summarizing, Reputation Monitoring (Sentiment Analysis), Topic Modelling, Extract entities, Moderation for harmful content
Chatting - Specialized customer service chatbot with internal company data

3. What LLMs can and cannot do?

Ask this question, and if the fresh grad can do the following task, then an LLM can also do it.

4. Prompting best practices

Be detailed and specific
Give sufficient context for LLM to complete the task
Guide the model to think through the answer
“Chain of thought reasoning”, is the step-by-step reasoning to give the model time to think to get to a final conclusive answer 1. Fast thinking vs slow thinking for complex thinking 2. Give Steps 1, 2, 3, etc and how you want the answer and in which format
Experiment and iterate - No perfect prompt for every person or situation Instead develop a process for improving prompts through iteration
Be careful with confidential information which you post as part of your prompt - Check how the prompt provider deals with the privacy of the information you post
Double check if you can trust the output of the LLM

5. Iterative Prompt Engineering

Idea
Iteration 1
Implementation
Experimental result, Error Analysis
Iteration 2 - Repeat

6. Design considerations

Lifecycle of a Generative AI project

Scope project
Estimate volume of tokens for Cost estimation
Choose a model (based on size, closed or open source, and cost) - Refer list of LLMs in the Leaderboards section below
Build/improve system
Internal Evaluation
Deploy and monitor
Repeat Internal evaluation
Repeat Build/improve system

7. Advanced Technologies – Beyond Prompting

- Retrieval Augmented Generation (RAG) - Ground model on additional internal proprietary data

- Part 1

Take Knowledge base, break to chunks

Create embeddings for each chunk

For each chunk -> store the embedding with the corresponding chunk in a Vector Database

领英推荐

Ahead of AI #1: A Diffusion of Innovations

Sebastian Raschka, PhD 2 年前

The Rise of Generative AI and Exploring Generative AI…

Pratibha Kumari J. 11 个月前

Dreaming AI's Future: The Ascent of Q-Learning in the…

Shail Khiyara 1 年前

- Part 2

Take a prompt and create its embedding

Compare the embedding with the embeddings stored in Vector Database

Get the chunks for the matched embedding and send the chunks as part of the prompt to the LLM

- 3 stages of LLM training

Pretraining - via Supervised Learning on trillions of tokens on all public data on internet
Fine-tuning - via Supervised Learning Adapt LLM to your task by fine tuning on high quality data
Reinforcement Learning from Human Feedback (RLHF) - via Classification, Reward Model and Reinforcement Learning

- Cost Considerations - Think through the Cost considerations of using a cloud based LLM to power software applications.

8. Process of building an application

Tune prompts on some examples
Add additional "tricky" examples
Develop metrics to measure performance on examples
Collect randomly selected set of examples to tune to (development set, hold out cross validation set)
Collect and use a hold out test set

9. Challenges with LLM

Ambiguous inputs and outputs
Hallucination vs facts
Compatibility amongst models
Maintaining Data Privacy
Safeguarding against Prompt Injection

Code

Refer my Github repo for few GenAI projects on

Leaderboards

Check these Leaderboards to compare against various LLMs like OpenAI GPT, Meta LLaMa, Google PaLM, Microsoft Phi-2, Mistral SC56

Best Models Leaderboard - https://huggingface.co/collections/open-llm-leaderboard/llm-leaderboard-best-models-652d6c7965a4619fb5c27a03
Least Hallucination Leaderboard - https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard

Acknowledgement:

Andrew Ng and DeepLearning.ai for so many wonderful courses
Chip Huyen for her blogs in https://huyenchip.com/blog/

Mahtab Syed - Melbourne - 21 Dec 2023

ManyMangoes ??

1 年

Absolutely fascinating insights on Generative AI! ?? As Albert Einstein once said, "The true sign of intelligence is not knowledge but imagination." Your exploration into #genai and #llms really showcases the power of imagination in driving innovation. Keep pushing the boundaries! ?

Manish Choudhary

#Digital Transformation - #Machine Learning, #AI, #Big Data, #Mobility #Deeplearning | #Investment Banking| #Insurance #P&C #F&A#ScaledAgilePractioner

1 年

Well laid down Mahtab!

1 次回应

查看更多评论

要查看或添加评论，请登录

Mahtab Syed的更多文章

AI Agents or Agentic Systems

2025年3月10日

AI Agents or Agentic Systems

In the new year 2025 we see everyone talking about “Agents” or Agent like systems called “Agentic Systems”. I recently…

1 条评论
Develop your career in AI in 2025

2023年12月27日

Develop your career in AI in 2025

The hype of AI, especially in 2023 and continuing in 2024 and now in 2025, has created a supply of various courses. And…

1 条评论
On Emotional Intelligence

2023年10月3日

On Emotional Intelligence

From my old archives - published on Tue 02 Nov 2010 in https://mahtabsyed.blogspot.

1 条评论
What is Data Governance? And why is it necessary especially now?

2023年3月26日

What is Data Governance? And why is it necessary especially now?

With the advent of Machine Learning and Artificial Intelligence for Predictions (Business metrics like Inventory…
Its end of year again… And I have no new year resolutions…

2022年12月31日

Its end of year again… And I have no new year resolutions…

Its 31 Dec 2022, an end of a year again… And I am quite happy and contented. ?? I have a clear vision of what I will do…

3 条评论
Machine Learning Blog – 9

2022年10月7日

Machine Learning Blog – 9

Machine Learning using 3 ways - Full code vs. No Code vs.

3 条评论
Winning with life which keeps throwing new challenges every day...

2022年3月27日

Winning with life which keeps throwing new challenges every day...

I had written this self care tip few months back which I thought its better to be published as an article..

2 条评论
The Silence within

2022年2月7日

The Silence within

Its peak winter in Melbourne and early morning of Wed 29 May 2019, and so far it’s the coldest day this year. I am at…
This year 2021… was in the trenches of worries

2022年1月1日

This year 2021… was in the trenches of worries

This year 2021… was in the trenches of worries due to Covid lockdowns, number of daily cases, economic slowdown…

1 条评论
Machine Learning Blog – 8

2021年11月20日

Machine Learning Blog – 8

Multi-Layer Stacking Ensemble and Optuna Hyperparameter Tuning In this blog I will illustrate and link to the code of a…

1 条评论

See all articles

Generative AI - Learnings 2023

Mahtab Syed

Data and AI Leader | AI Solutions | Cloud Architecture(Azure, GCP, AWS) | Data Engineering, Generative AI, Artificial Intelligence, Machine Learning and MLOps Programs | Coding and Kaggle

This year 2023 has been the year of Generative AI using Large Language Models both closed source and open source.

Like many of us I have been learning via blogs, courses and using prompt engineering for building Generative AI apps and key takeaways are here:

1. Use LLM as a thought partner

2. Examples of tasks LLMs can carry out

3. What LLMs can and cannot do?

4. Prompting best practices

5. Iterative Prompt Engineering

6. Design considerations

7. Advanced Technologies – Beyond Prompting

领英推荐

8. Process of building an application

9. Challenges with LLM

Code

Leaderboards

Acknowledgement:

Mahtab Syed的更多文章

社区洞察

其他会员也浏览了

AI from Rote Learning to Meaningful Learning, Understanding is what True AI requires?

Learnings on the Journey of GenAI Adoption

The Promise and Progress of Generative AI

Generative AI versus Traditional AI and How we can shape tomorrow's future

The Secret to Smarter AI: Harnessing Human Feedback with RLHF

True Story Behind DeepSeek's Success: AI Learning to Think Slowly Without Human Supervision

Mastering AI Reasoning: The Training Evolution of DeepSeek R1

Tools vs. Agents: Revised Theory of AI Agency

The Power of Human Feedback in Enhancing Generative AI Images: A Deep Dive

This year 2023 has been the year of Generative AI using Large Language Models both closed source and open source.

Like many of us I have been learning via blogs, courses and using prompt engineering for building Generative AI apps and key takeaways are here:

1. Use LLM as a thought partner

2. Examples of tasks LLMs can carry out

3. What LLMs can and cannot do?

4. Prompting best practices

5. Iterative Prompt Engineering

6. Design considerations

7. Advanced Technologies – Beyond Prompting

领英推荐

8. Process of building an application

9. Challenges with LLM

Code

Leaderboards

Acknowledgement:

Mahtab Syed的更多文章

AI Agents or Agentic Systems

Develop your career in AI in 2025

On Emotional Intelligence

What is Data Governance? And why is it necessary especially now?

Its end of year again… And I have no new year resolutions…

Machine Learning Blog – 9

Winning with life which keeps throwing new challenges every day...

The Silence within

This year 2021… was in the trenches of worries

Machine Learning Blog – 8

社区洞察

其他会员也浏览了

AI from Rote Learning to Meaningful Learning, Understanding is what True AI requires?

Learnings on the Journey of GenAI Adoption

The Promise and Progress of Generative AI

Generative AI versus Traditional AI and How we can shape tomorrow's future

The Secret to Smarter AI: Harnessing Human Feedback with RLHF

True Story Behind DeepSeek's Success: AI Learning to Think Slowly Without Human Supervision

Mastering AI Reasoning: The Training Evolution of DeepSeek R1

Tools vs. Agents: Revised Theory of AI Agency

The Power of Human Feedback in Enhancing Generative AI Images: A Deep Dive