Large Language Models and Google's BARD: A Speech at GDG Nuremberg

On October 24, 2023, I had the pleasure of speaking at the Google Developer Group Nuremberg AI/ML event about large language models (LLMs) and Google's latest product, BARD. I'd like to thank the organizer, Lukas Himsel, and the audience for the wonderful event.

In my speech, I covered the following main points:

  • Discriminative models and generative models
  • What are LLMs and how do they work?
  • What are the benefits and limitations of LLMs?
  • LLM Development (using API) and Traditional ML Development
  • What is BARD?

1. Discriminative models and generative models

Discriminative Models

People are more familiar with discriminative models, which are trained to predict a target variable based on a set of features. Discriminative models are often used for classification tasks. For example, a discriminative model can be trained to distinguish dogs from cats, or to predict whether a customer is likely to churn based on their behaviours.

Discriminative models are trained using a variety of different machine learning algorithms, such as logistic regression, SVM, and decision trees. These algorithms work by learning a decision boundary that can best separate the training data points into their respective classes. Once the decision boundary has been learned, the model can then be used to predict the class label for new data points.
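
As a minimal sketch of that idea, the snippet below trains a logistic regression classifier on a toy two-class dataset and uses the learned decision boundary to label unseen points. It assumes scikit-learn is installed, and the synthetic data merely stands in for something like churn features:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Toy two-class data, e.g. "will churn" vs "won't churn".
X, y = make_classification(n_samples=500, n_features=2, n_informative=2,
                           n_redundant=0, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

clf = LogisticRegression()
clf.fit(X_train, y_train)  # learn a decision boundary separating the classes
print("test accuracy:", clf.score(X_test, y_test))  # classify unseen points
```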


Generative Models

Generative models, on the other hand, are trained to generate new content. They learn the underlying patterns and distributions of the data and then use that knowledge to create new data that is similar to the training data. Generative models can be used to generate realistic and coherent images and videos, produce a variety of text formats, and, of course, translate languages.
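
To make the contrast concrete, here is a deliberately tiny sketch of the generative idea: estimate the distribution of some training data, then sample new data from it. It assumes numpy; a real generative model of course learns far richer distributions than a single Gaussian:

```python
import numpy as np

rng = np.random.default_rng(0)
training_data = rng.normal(loc=5.0, scale=2.0, size=1000)  # stand-in for real data

# "Learn" the underlying distribution; here just its mean and spread.
mu, sigma = training_data.mean(), training_data.std()

# Create new data that is similar to, but not a copy of, the training data.
new_samples = rng.normal(loc=mu, scale=sigma, size=5)
print(new_samples)
```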

2. Large Language Models

Large language models refer to large, general-purpose language models that can be pre-trained and then fine-tuned for specific purposes.

How large is large?

People may have different definitions of "large", but the term has been used to describe BERT (110 million parameters) as well as GPT-3 (up to 175 billion parameters). "Large" can refer either to the number of parameters in the model or, sometimes, to the number of words in the training dataset.
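
If you want to see such a number for yourself, one way (assuming the Hugging Face transformers library is installed and the model weights can be downloaded) is to load BERT and count its parameters:

```python
from transformers import AutoModel

model = AutoModel.from_pretrained("bert-base-uncased")
n_params = sum(p.numel() for p in model.parameters())
print(f"BERT-base: {n_params / 1e6:.0f}M parameters")  # roughly 110 million
```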


PaLM and its use cases

In April 2022, Google released PaLM (Pathways Language Model). PaLM is trained on a massive dataset of text and code, which includes books, articles, code repositories, and other forms of text. It has over 540 billion parameters, more than any other LLM at the time of its release, and it achieved state-of-the-art performance across multiple language tasks.


PaLM is capable of a wide range of tasks, including:

  • Generating text, translating languages, writing different kinds of creative content, and answering questions in an informative way.
  • Performing tasks that require commonsense reasoning, such as understanding and responding to analogies and metaphors.
  • Learning new tasks from scratch, without being explicitly programmed.

The beauty of pre-training and fine-tuning

Large language models are pre-trained on a massive dataset of text and code. The pre-training process allows the LLM to learn the patterns and relationships between words and phrases in the language. It also means that you don't have to suffer the pain of handling complex engineering challenges or bear the cost of the compute: you can simply use the model by calling an API.
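
As an illustration, a call to Google's PaLM API (via the google-generativeai package, as it looked in 2023) was roughly this simple; the API key below is a placeholder and the model name is one of the text models available at the time:

```python
import google.generativeai as palm

palm.configure(api_key="YOUR_API_KEY")  # placeholder key

response = palm.generate_text(
    model="models/text-bison-001",
    prompt="Summarise the difference between discriminative and generative models.",
)
print(response.result)
```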

When you need a more customised model, you can also fine-tune it on a dataset that is specific to your needs. Once you have fine-tuned the LLM, you can use it to generate text, translate languages, write different kinds of creative content, or answer questions about the topic you trained it on.
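
As a rough sketch of what fine-tuning looks like in practice (assuming the Hugging Face transformers and datasets packages; the model choice and the two-example dataset are purely illustrative):

```python
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

# Tiny stand-in for a task-specific dataset (e.g., labelled support tickets).
data = Dataset.from_dict({"text": ["great product", "cancel my account"],
                          "label": [0, 1]})
data = data.map(lambda x: tokenizer(x["text"], truncation=True,
                                    padding="max_length", max_length=32))

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=data,
)
trainer.train()  # adapts the pre-trained weights to your specific task
```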

3. What are the benefits and limitations of LLMs?

Benefits

  • A single model can be used for many different tasks: generating text, translating languages, writing different kinds of creative content, and answering questions.
  • Because the heavy lifting happens during pre-training, you need comparatively little task-specific data and domain expertise; a modest fine-tuning dataset is often enough to adapt the model.
  • You can access a powerful pre-trained model through an API instead of building and training one yourself.

Limitations

  • Bias: LLMs are trained on massive datasets of text and code, which can reflect the biases that exist in the real world. This can lead to LLMs generating text that is biased or offensive.
  • Hallucination: LLMs can sometimes generate text that is factually incorrect or misleading. This is because LLMs are trained to generate text that resembles their training data, and the training data may itself contain incorrect or misleading information.
  • Lack of explainability: LLMs are complex models, and it can be difficult to explain why they generate certain outputs. This can make it difficult to trust LLMs for critical tasks.

4. LLM Development (using API) and Traditional ML Development

Traditional machine learning development can be time-consuming and complex, requiring extensive data preprocessing, model training and optimization, and domain expertise. In contrast, LLM development is much simpler, thanks to the power of pre-training. LLMs have learned a vast amount of knowledge during pre-training, so they require fewer training examples and less domain knowledge. Instead, the focus is on prompt design and engineering, which involves creating clear, concise, and informative prompts.
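
Since the work shifts to prompt design, here is a small illustration of a few-shot prompt: the instruction and the examples do the job that labelled training data and a training loop would do in traditional ML. The reviews are made up:

```python
few_shot_prompt = """Classify the sentiment of each review as positive or negative.

Review: "The battery lasts all day." Sentiment: positive
Review: "It broke after a week." Sentiment: negative
Review: "Setup was effortless." Sentiment:"""

# Sending this prompt to an LLM (e.g. via the API call shown earlier) should
# yield "positive"; no model training was needed.
print(few_shot_prompt)
```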


5. What is BARD?

BARD was launched in March 2023 as an early experiment with generative AI. It is still young, but powerful.

Despite the capitalisation, BARD is not an acronym: the name refers to a bard, a storyteller and poet. Under the hood, it is powered by a neural network model that is trained to predict the next word in a sequence.

BARD is powered by PaLM 2 (Pathways Language Model 2), the most advanced language model that Google has developed to date.

  • It is trained on a dataset that is specifically designed to be informative and comprehensive. This means that BARD is better at generating text that is factually accurate and complete.
  • It is trained using Google's Pathways system, which allows it to learn the relationships between different concepts and ideas in a more comprehensive way. This means that BARD is better at understanding the context of a query and generating a response that is relevant and informative.
  • It is trained using a technique called "Sparrow dialogue fine-tuning", which allows it to learn how to have more natural and engaging conversations. This means that BARD is better at interacting with users in a way that is similar to how a human would.

BARD is very good at having natural and engaging conversations. For example, if you ask it to tell you a joke, it can generate one that is funny and relevant to the conversation.

Well, to be honest, I regret asking for a joke. BARD knows too much, and the answer was painful.

Overall, BARD is a powerful LLM that is designed to be informative, comprehensive, and engaging. It is a valuable tool for anyone who needs to generate text, translate languages, write different kinds of creative content, or get answers to questions.

Sabrina Jodexnis

Web Dev | Accessibility | GDE Web Technologies | Speaker | CEO Social Developers Club | WTM ambassador | Into all things about diversity, innovation and maps

Looking forward to the talk tomorrow!

Illia Dorosh

8-10 Hz Machine learning Engineer @ illigen.fun

Google knows so much about us; if only we were more aligned financially. I know I can download all my data at any time, but it would be nice if it were just there for me, responding to my "Hey Google" call. It is probably coming soon, and I'd like things to move faster. It's important that we are proactive about it and make sure we all get what we want out of it. Having a YouTube and Google Play subscription is a step in the right direction, as is paying for cloud and storage. Having something like an OpenAI API secret key that's easy to use on all third-party websites would be nice too.

Lukas Himsel

Flutter & Full Stack Developer, Industrial Software, GDG Meetup Organizer

Thank you for that post! It was great having you this week, and we look forward to having you at devfest.gdg.nu again!

We are looking forward to your talk!
