GenAI Build, Fine-Tune, RAG... and the many confusing options to roll out!

Dina Kamal

Field CTO/Canada GM at DataBahn

Published: April 29, 2024

This is article 2 in the AI for Business Leaders series.

Generative AI models use neural networks to identify the patterns and structures within existing data, then generate new and original content based on the patterns and insights learned from a massive training dataset. Examples of generative AI models and tools include:

· ChatGPT: An AI language model developed by OpenAI that can answer questions and generate human-like responses from text prompts.

· DALL-E 3: Another AI model by OpenAI that can create images and artwork from text prompts.

· Sora: An AI model from OpenAI that can create realistic and imaginative video scenes from text instructions.

· Google Gemini: Google’s generative AI chatbot, which can answer questions and generate text from prompts.

· Claude: Built by Anthropic, which collaborates closely with Amazon, it is another generative AI model, described as a “friendly, enthusiastic colleague or personal assistant.”

· Midjourney: This GenAI model interprets text prompts to produce images and artwork, similar to DALL-E.

· GitHub Copilot: An AI-powered coding tool that suggests code completions.

· Llama: Meta’s open-source large language model, which can be used to create conversational AI models for chatbots and virtual assistants.

· Grok: Another generative AI virtual assistant, from Elon Musk’s new generative AI venture xAI.

· BLOOM: From the Hugging Face-led BigScience project, it is a ChatGPT-like model trained on 46 natural languages and 13 programming languages.

· Cohere: Provides pre-built LLMs (large language models) for common natural language processing (NLP) tasks on text input, such as summarization, classification, and finding similarities in content.

Generative AI platforms are built on top of large language models (LLMs) and foundation models:

· LLMs are deep learning models that train on massive datasets to create new combinations of text that mimic natural language based on their training data.

A key factor in how LLMs work is the way they represent words. Earlier forms of machine learning used a numerical table to represent each word, but this form of representation could not recognize relationships between words, such as words with similar meanings. LLMs overcome this limitation by representing words as multi-dimensional vectors, commonly referred to as word embeddings, so that words with similar meanings or related contexts sit close to each other in the vector space.
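To make the idea of word embeddings concrete, here is a minimal sketch in Python. The three-dimensional vectors below are hand-picked, hypothetical values used purely for illustration; real models learn vectors with hundreds or thousands of dimensions. The helper names `cosine_similarity` and `most_similar` are also assumptions introduced for this example.

```python
import math

# Hypothetical, hand-picked embeddings for illustration only.
# Real models learn these vectors from data.
EMBEDDINGS = {
    "king": [0.90, 0.80, 0.10],
    "queen": [0.88, 0.82, 0.15],
    "banana": [0.10, 0.05, 0.95],
}


def cosine_similarity(a, b):
    """Return the cosine of the angle between two vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(word):
    """Return the other word whose vector points in the most similar direction."""
    others = (w for w in EMBEDDINGS if w != word)
    return max(
        others,
        key=lambda w: cosine_similarity(EMBEDDINGS[word], EMBEDDINGS[w]),
    )


if __name__ == "__main__":
    print(most_similar("king"))
```

With these toy values, "king" lands closer to "queen" than to "banana", which is the kind of relationship a simple lookup table of word IDs cannot express.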

· Foundation models are machine learning models trained on a broad spectrum of generalized and unlabeled data and capable of performing a wide variety of general tasks such as understanding language, generating text and images, and conversing in natural language.

A unique feature of foundation models is their adaptability. These models can perform a wide range of disparate tasks with a high degree of accuracy based on input prompts. Some tasks include natural language processing (NLP), question answering, and image classification. The size and general-purpose nature of FMs make them different from traditional ML models, which typically perform specific tasks, like analyzing text for sentiment, classifying images, and forecasting trends.

So, what do we mean when we discuss implementing GenAI?

There are five main modes of adopting generative AI.

1. Off-the-Shelf: Use an existing foundation model directly by inputting prompts (e.g., interacting with ChatGPT). For example, ask the model to create a job description for a software engineer or suggest alternative subject lines for marketing emails.

2. Prompt Engineering: This is currently the most common approach. Use connectors (i.e., APIs) to build applications that leverage an existing foundation model. For example, use the OpenAI API to develop an internal knowledge management solution with a virtual assistant for employee support. When done right, this approach enables better security and intellectual property protection for an enterprise.

3. Retrieval-Augmented Generation (RAG): Combine an LLM with external knowledge retrieval. RAG is an architecture that augments the capabilities of an LLM like ChatGPT by adding an information retrieval system that provides grounding data, which gives you control over the data the LLM uses when it formulates a response. For an enterprise solution, a RAG architecture means that you can constrain generative AI to your enterprise content, sourced from vectorized documents, images, audio, and video.

4. Fine-Tuning: Organizations can significantly tune and customize an existing model to meet their unique needs. For example, adding a new data layer that customizes the model to better understand the context and nuances of cybersecurity terminology, making the model more relevant to cybersecurity applications.

5. Build Your Own: Building a completely new foundation model is probably not feasible for most organizations due to the sheer volume of data and computational power required to pre-train the model from scratch.
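To illustrate how retrieval and prompting fit together in the RAG mode above, here is a minimal sketch in Python. It is a toy under stated assumptions, not a production design: simple word counts stand in for the vector embeddings a real system would get from an embedding model, an in-memory list stands in for a vector database, and the sample documents and the function names `vectorize`, `retrieve`, and `build_prompt` are hypothetical. The final step, sending the prompt to an LLM through its API, is deliberately left out.

```python
import math
import re
from collections import Counter

# Hypothetical enterprise content; a real system would index actual documents.
DOCUMENTS = [
    "Employees accrue 20 vacation days per year.",
    "The VPN must be used when accessing internal systems remotely.",
    "Expense reports are due by the fifth business day of each month.",
]


def vectorize(text):
    """Turn text into a bag-of-words vector (a stand-in for a learned embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question, documents, top_k=1):
    """Return the top_k documents most similar to the question."""
    q = vectorize(question)
    ranked = sorted(documents, key=lambda d: cosine(q, vectorize(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(question, documents):
    """Assemble a prompt that grounds the model in the retrieved content."""
    context = "\n".join(f"- {doc}" for doc in retrieve(question, documents))
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    print(build_prompt("How many vacation days do employees get?", DOCUMENTS))
```

The design point is that the model only sees the content you choose to retrieve, which is what gives an enterprise control over the grounding data.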

The first three options of GenAI adoption offer the benefit of having the models pre-trained. This allows for use cases that can be delivered using off-the-shelf tools or prompt engineering. For use cases where GenAI can provide value, the time to value can therefore be reduced from many months to a matter of weeks, if not days.

This is a key difference between traditional machine learning and generative AI. Traditional machine learning could deliver many of the same use cases, but only with significant effort: training on massive amounts of data, developing the model, and paying for specialized, expensive compute. GenAI leapfrogs that effort because the pre-training and the prerequisite technology infrastructure for building the model are already done.

This is akin to hiring a CPA to do your organization's accounting versus hiring a complete novice, whom you must first bring up to speed on accounting basics and foundational knowledge before teaching them how to apply it to your organization.

Hopefully this gave you some ideas on the various options to roll out GenAI!

The next article will double-click on these deployment approaches and related considerations. Another article will follow on AI, privacy, and cybersecurity. Stay tuned!
