Foundation Models In GenAI
Hashan Wickramasingha Wadanambi (H.W.W)
IT Infrastructure Specialist | IT Infrastructure Services Management | IT Project Management | Cybersecurity | ISO/IEC 27001 Information Security Internal Auditor | Scrum Master | Strategy Implementation Professional
Generative Artificial Intelligence (GenAI) stands at the forefront of contemporary discourse, not solely within the confines of the IT Industry but resonating across global landscapes. Thus, H.W.W Articles has diligently delved into the realms of GenAI, disseminating insightful knowledge about its construction and application within professional spheres. Noteworthy definitions and conceptual elucidations have been drawn from authoritative sources such as the Technical Foundations and Terminology for Generative AI under the AWS Skill Builder program, enhancing the depth and credibility of the discourse.
As a general summary first: unlabeled data is fed into a process called pretraining to create the foundation model. Then, given prompts, the foundation model performs specific tasks.
A Foundation Model is a prebuilt, machine learning (ML) model trained on a large amount of data. The result is a model that can be adapted to a wide range of downstream tasks.
Foundation models are created by taking large amounts of unlabeled data, training a model with that data, and then using that model for a wide range of tasks. The mechanism of transforming unlabeled data into a foundation model is called pretraining.
Pretraining is the creation of an FM by training a model with terabytes of unlabeled text or multimodal data (such as images, audio, or video).
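To make the pretraining idea concrete, here is a deliberately tiny toy sketch: it "learns" next-word statistics from raw, unlabeled text and then continues a prompt. Real foundation models learn billions of neural-network parameters, not simple word counts; this only mimics the self-supervised objective of predicting the next word.

```python
from collections import defaultdict

def pretrain(corpus: str) -> dict:
    """Count word -> next-word frequencies from raw, unlabeled text."""
    model = defaultdict(lambda: defaultdict(int))
    words = corpus.lower().split()
    for current, nxt in zip(words, words[1:]):
        model[current][nxt] += 1
    return model

def generate(model: dict, prompt: str, length: int = 3) -> str:
    """Continue a prompt by always picking the most frequent next word."""
    out = prompt.lower().split()
    for _ in range(length):
        candidates = model.get(out[-1])
        if not candidates:
            break
        out.append(max(candidates, key=candidates.get))
    return " ".join(out)

corpus = "the model learns from data and the model adapts to tasks"
model = pretrain(corpus)
print(generate(model, "the", length=2))  # prints "the model learns"
```

Note that no labels were needed at any point: the "training signal" comes from the text itself, which is why unlabeled data can be used at such scale.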
In creating foundation models, there are two vital factors. They are as follows.
1) Unlabeled data: Unlabeled data can be used at scale for pretraining because it is much easier to obtain than labeled data.
2) Large model: A large model with billions of parameters can store richer, deeper context across large amounts of data compared to a smaller model trained on a smaller dataset.
The important factors in developing a large model are the quality and quantity of the training data and the training infrastructure.
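To see where "billions of parameters" comes from, here is a rough back-of-the-envelope count for a decoder-only transformer. This is an illustrative sketch, not an exact formula: it ignores biases, layer norms, and position embeddings, and uses the common assumption of a feed-forward hidden size of 4× the model dimension.

```python
def transformer_params(vocab: int, d_model: int, n_layers: int) -> int:
    """Rough parameter count for a decoder-only transformer.

    Per layer: 4*d^2 for the attention projections (Q, K, V, output)
    plus 8*d^2 for a feed-forward block with hidden size 4*d.
    Embeddings add vocab*d. Biases and layer norms are ignored.
    """
    per_layer = 4 * d_model**2 + 8 * d_model**2  # = 12 * d^2
    return vocab * d_model + n_layers * per_layer

# A GPT-2-small-like shape (vocab 50257, width 768, 12 layers):
print(f"{transformer_params(50257, 768, 12):,}")  # prints 123,532,032
```

Scaling the width and depth in this formula quickly pushes the count into the billions, which is why training infrastructure becomes as important a factor as the data itself.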
The transformer architecture (transformer models) plays a vital role in developing a foundation model from unlabeled data. The transformer architecture is a type of neural network that is efficient, easy to scale and parallelize, and can model interdependence between input and output data.
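The mechanism that lets a transformer model this interdependence is scaled dot-product attention: every position can look at every other position at once. Below is a minimal sketch using plain Python lists in place of the large tensors a real model would use.

```python
import math

def softmax(xs):
    """Turn raw scores into weights that sum to 1."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention(queries, keys, values):
    """For each query, mix all value vectors, weighted by query-key similarity."""
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [dot(q, k) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, values))
                    for i in range(len(values[0]))])
    return out

# Two token vectors attending over each other (self-attention)
x = [[1.0, 0.0], [0.0, 1.0]]
result = attention(x, x, x)
```

Because each output row is computed independently of the others, every position can be processed at the same time, which is exactly what makes transformer training so parallelizable.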
Transformer models do not process words sequentially, one at a time. Instead, they process the entire input all at once during the learning cycle, which makes the training process highly parallelizable. Transformer models also capture positional information. In other words, if a sentence is input, the model identifies each word in the sentence and the relationship between the words, using processes encoded in mathematics. Position encoders allow transformer models to avoid ambiguous meanings when the same word is used in different parts of a sentence. For example, "The new lamp had good light for reading." and "Magnesium is a light metal." both contain the word light, but its meaning differs, and the word's position and surrounding context help the model tell the two apart.
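A common way to encode position is the sinusoidal scheme from the original transformer paper: each position gets a unique vector of sines and cosines that is added to the word embedding, so the same word carries different positional information depending on where it appears. A minimal sketch:

```python
import math

def positional_encoding(position: int, d_model: int) -> list:
    """Sinusoidal position encoding: alternating sin/cos at varying frequencies."""
    pe = []
    for i in range(0, d_model, 2):
        angle = position / (10000 ** (i / d_model))
        pe.append(math.sin(angle))
        pe.append(math.cos(angle))
    return pe[:d_model]

p0 = positional_encoding(0, 8)
p5 = positional_encoding(5, 8)
print(p0 != p5)  # prints True: distinct positions get distinct encodings
```

The position vector alone does not disambiguate a word like "light"; it tags each token with where it sits, and the attention mechanism then combines that with the surrounding context to resolve the meaning.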
Foundation models can be categorized according to their capabilities, such as:
1) Code Generation
2) Content Generation
3) Content Summarization
4) Questions and Answers
Each of these capability categories can be described in more detail.
Code Generation: This category generates code outside the developers' integrated development environment (IDE), and developers can choose whether to use it.
Content Generation: This category encompasses everything from creating engaging marketing content, such as blog posts, social media updates, or email newsletters, to generating unique, high-quality images, art, logos, and designs.
Content Summarization: Foundation models can take inputs, such as reporting data, call minutes, and long-form articles, and generate summaries. This can help save time and reduce errors.
Questions and Answers: This includes chatbots, which are one of the more popular uses of FMs from a consumer perspective. Businesses can use the Q&A capability of FMs to streamline the customer experience and help reduce operational costs.
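As a hedged sketch of how a business chatbot might wrap a foundation model for Q&A, the example below shows only the prompt construction; `call_foundation_model` is a hypothetical placeholder, not a real API, and the prompt wording is an assumption for illustration.

```python
def build_qa_prompt(context: str, question: str) -> str:
    """Combine retrieved business context with a customer question."""
    return (
        "Answer the customer's question using only the context below.\n\n"
        f"Context: {context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )

prompt = build_qa_prompt(
    context="Orders ship within 2 business days.",
    question="How long does shipping take?",
)
# The prompt would then be sent to a foundation model endpoint, e.g.:
# answer = call_foundation_model(prompt)  # hypothetical client, not a real API
```

Grounding the model in retrieved business context like this is one common way to keep answers on-topic and reduce the operational cost of human support.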
In the current context, we use these capability categories in our day-to-day work. In the next article, H.W.W Articles intends to explain, with simple examples, how a generative AI model encodes and decodes words, in keeping with its aim of making complex things simple and sharing that knowledge with the world.