Shorticle 988 – Reinforcement Learning from Human Feedback using autoregressive model behind ChatGPT design strategy

Dr. Magesh Kasthuri

Distinguished Member Of Technical Staff at Wipro Limited

Published: January 17, 2023

In the last two decades, I have not seen a solution or technology trend get as much hype and attention as ChatGPT did after its official launch as a free preview release in November/December 2022. In fact, some friends asked me when I would publish a Shorticle on ChatGPT, which surprised me: it showed their curiosity about both Shorticle (and their expectations from it) and ChatGPT (and its reach). I wanted to take some time to understand ChatGPT technically before writing anything about it.

Artificial Intelligence (AI) has been growing by leaps and bounds for more than two decades, and chat services have long been a strong use case for AI solutions. The large language model (LLM) is an important breakthrough in Natural Language Processing (NLP): it is trained on large volumes of text to learn the structure of, and the relationships between, characters, words and sentences in a language.

This kind of LLM design leads to advanced AI solutions, including web-crawler-based search engines (e.g., Google and Bing) that provide suggestive text and order search results based on the user's locality, interests and frequent searches. For a chat-based querying service, we need both supervised and reinforcement learning to prepare large domain models.

Autoregressive models are built on a time-varying process: they forecast future values from past values of the series (in this case, user questions). Supervised learning is a very popular AI approach in which the user trains the system on labelled examples to build a knowledge base, and the trained model is later used to develop a focused AI service, such as signature verification on cheques deposited in banks or handwritten character recognition, to name a few. Reinforcement learning improves a model through feedback on its outputs; it is important, for example, in a web-crawling solution where we collect results from one or more sources (webpages) and use previous results to predict future queries from users.
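To make the autoregressive idea concrete, here is a minimal sketch in Python. It is not how ChatGPT is implemented; it is a toy word-pair (bigram) model, and the corpus, the function names `train_bigram` and `generate`, and the greedy choice of the next word are all assumptions made for illustration. What it shows is the loop itself: every new word is predicted from what has already been produced.

```python
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count, for every word, which words follow it in the corpus."""
    follows = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            follows[prev][nxt] += 1
    return follows

def generate(follows, start, max_words=5):
    """Autoregressive loop: each new word is predicted from the previous output."""
    output = [start]
    for _ in range(max_words - 1):
        candidates = follows.get(output[-1])
        if not candidates:
            break  # no known continuation for the last word
        # Greedy choice: most frequent follower, ties broken alphabetically
        next_word = min(candidates, key=lambda w: (-candidates[w], w))
        output.append(next_word)
    return " ".join(output)

# Hypothetical toy corpus of past user questions
corpus = [
    "how does chatgpt work",
    "how does chatgpt learn",
    "how does chatgpt work today",
]
model = train_bigram(corpus)
result = generate(model, "how", max_words=4)
print(result)
```

A real language model replaces the word-pair counts with a neural network that looks at the whole preceding text, but the generate-one-step-then-feed-it-back pattern is the same.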

ChatGPT is an extension of the earlier InstructGPT, an autoregressive model that responds to user queries. New-age digital assistant devices like Alexa and Google Home work on a related idea, where human voice intents are interpreted by AI engines to prepare answers. OpenAI was started as a non-profit research organization for AI research and development, and one of its projects is the Generative Pre-trained Transformer (GPT) series of models, which can be used to develop tools such as intelligent search engine optimization (SEO).

ChatGPT is definitely a breakthrough solution that uses Reinforcement Learning from Human Feedback (RLHF), and in the near future it could be used as an intelligent technology advisor to suggest design, business and sales approaches for many industrial applications.
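For readers who want a feel for the "human feedback" part of RLHF, the following is a rough sketch in Python of its reward-modelling step: people compare two answers, and a reward model is fitted so that the preferred answer scores higher (a pairwise, Bradley-Terry style loss). Everything here is an assumption for illustration: the two hand-made features, the example answers, and the function names are hypothetical. The full method then fine-tunes the language model against this reward with a reinforcement learning algorithm; this sketch replaces that step with simply picking the highest-scoring candidate.

```python
import math

def features(text):
    """Hypothetical features: [word count / 10, 1.0 if the answer gives a reason]."""
    words = text.lower().split()
    return [len(words) / 10.0, 1.0 if "because" in words else 0.0]

def reward(weights, text):
    """Linear reward model: weighted sum of the features."""
    return sum(w * f for w, f in zip(weights, features(text)))

def train_reward_model(preferences, steps=200, lr=0.5):
    """Fit weights so that each human-preferred answer outscores the rejected one."""
    weights = [0.0, 0.0]
    for _ in range(steps):
        for chosen, rejected in preferences:
            margin = reward(weights, chosen) - reward(weights, rejected)
            # Gradient of log(sigmoid(margin)) w.r.t. margin is 1 - sigmoid(margin)
            scale = 1.0 - 1.0 / (1.0 + math.exp(-margin))
            fc, fr = features(chosen), features(rejected)
            weights = [w + lr * scale * (a - b) for w, a, b in zip(weights, fc, fr)]
    return weights

# Hypothetical human feedback: (preferred answer, rejected answer)
preferences = [
    ("the sky looks blue because air scatters blue light", "the sky is blue"),
    ("use a cache because repeated reads are slow", "use a cache"),
]
weights = train_reward_model(preferences)

# Stand-in for the reinforcement learning step: pick the best-scoring candidate
candidates = ["restart it", "restart it because the config changed"]
best = max(candidates, key=lambda c: reward(weights, c))
print(best)
```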

#magtechbytes #wipro #shorticle #shorticleaiml #shorticlegeneral
