ChatQA - NVIDIA's GPT-4-Level Conversational QA Models & Meta AI's Self-Rewarding Language Models -
Aditi Khare
AWS & AI Research [LLMs & Vision]-Principal Machine Learning Scientist & AI Architect | IIM-A | Author | Inference Optimization | Hyperspectral Imaging | Open-Source Dev | Build Production-Grade AI Products from Scratch
This paper presents a family of ChatQA models ranging in size from 7B to 70B parameters. Comprehensive evaluations on 10 conversational QA datasets show that the best model, ChatQA-70B, remarkably outperforms GPT-3.5-turbo and performs on par with GPT-4, without using any synthetic data from ChatGPT.
In addition, fine-tuning a single-turn query retriever on curated conversational QA data performs comparably to the state-of-the-art LLM-based query rewriting model, without the extra computation time and potential API cost of rewriting. The paper also shows that incorporating a small amount of "unanswerable" samples significantly enhances the model's ability to handle scenarios where answers are unavailable. The unanswerable-case evaluation highlights that the best model, ChatQA-70B, has only a slight gap compared to GPT-4.
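A minimal sketch of the retrieval contrast described above, assuming a generic dense retriever (the `sentence-transformers` model name, toy documents, and rewritten query are illustrative, not from the paper): the fine-tuned retriever takes the concatenated dialogue history plus the latest question as a single query, whereas the rewriting baseline needs a separate LLM call to turn the last turn into a standalone question.

```python
# Hedged sketch: contrasts the two ways of obtaining a retrieval query.
from sentence_transformers import SentenceTransformer, util

retriever = SentenceTransformer("all-MiniLM-L6-v2")  # stand-in for a fine-tuned retriever

documents = [
    "NVIDIA reported data-center revenue growth in Q3.",
    "The ChatQA models range from 7B to 70B parameters.",
]
doc_emb = retriever.encode(documents, convert_to_tensor=True)

dialogue = [
    ("user", "What sizes do the ChatQA models come in?"),
    ("assistant", "They range from 7B to 70B parameters."),
    ("user", "Which one performs on par with GPT-4?"),
]

# Option A (the paper's fine-tuned retriever): feed the whole conversation as the query.
conversational_query = " ".join(f"{role}: {text}" for role, text in dialogue)

# Option B (the rewriting baseline): an LLM rewrites the last turn into a standalone
# question, which costs an extra API call and extra latency.
rewritten_query = "Which ChatQA model performs on par with GPT-4?"

for query in (conversational_query, rewritten_query):
    scores = util.cos_sim(retriever.encode(query, convert_to_tensor=True), doc_emb)
    print("top document:", documents[int(scores.argmax())])
```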
Paper Key Highlights -
1. NVIDIA's ChatQA models for conversational QA.
2. A two-stage instruction-tuning approach boosts zero-shot conversational QA accuracy (see the sketch after this list).
3. The fine-tuned retriever matches SOTA query rewriting at lower cost.
4. ChatQA-70B matches GPT-4-level accuracy without any synthetic data from ChatGPT.
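Highlight 2 refers to the two-stage recipe: stage 1 is supervised instruction tuning, and stage 2 is context-enhanced instruction tuning on conversational QA data with the relevant context prepended. Below is a minimal sketch of how a stage-2 training example might be assembled; the system message and separators are assumptions for illustration, not the paper's verbatim template.

```python
# Hedged sketch of a stage-2 ("context-enhanced") training example.
def build_stage2_example(context: str, turns: list[tuple[str, str]], answer: str) -> str:
    # Assumed system message; the paper's exact wording may differ.
    system = ("System: This is a chat between a user and an AI assistant. "
              "The assistant answers questions based on the given context.")
    history = "\n".join(f"{role.capitalize()}: {text}" for role, text in turns)
    return f"{system}\n\n{context}\n\n{history}\nAssistant: {answer}"

example = build_stage2_example(
    context="ChatQA models range from 7B to 70B parameters; ChatQA-70B is the largest.",
    turns=[("user", "How many model sizes are there?"),
           ("assistant", "They range from 7B to 70B."),
           ("user", "Which one matches GPT-4?")],
    answer="ChatQA-70B performs on par with GPT-4.",
)
print(example)
```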
Evaluation Metrics -
F1 score is the most commonly used automatic metric for assessing QA models, and it is used for all datasets except ConvFinQA. ConvFinQA involves extracting numbers from documents and performing arithmetic calculations, so an answer only makes sense when it exactly matches the gold answer. When a model generates an arithmetic formula, the final result is computed with a calculator and compared against the gold answer.
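A minimal sketch of these two metrics, assuming simple whitespace tokenization for F1 and restricting ConvFinQA formulas to basic arithmetic; the helper names and tolerance are illustrative, not from the paper.

```python
# Hedged sketch of the evaluation described above.
from collections import Counter

def token_f1(prediction: str, gold: str) -> float:
    """Token-level F1 between a predicted and a gold answer."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

def convfinqa_match(predicted_formula: str, gold_answer: float, tol: float = 1e-4) -> bool:
    """Exact match for ConvFinQA: evaluate the generated arithmetic formula
    with a 'calculator' and compare the result against the gold answer."""
    allowed = set("0123456789.+-*/() ")
    if not set(predicted_formula) <= allowed:
        return False
    try:
        result = eval(predicted_formula)  # acts as the calculator
    except (SyntaxError, ZeroDivisionError):
        return False
    return abs(result - gold_answer) < tol

print(token_f1("the answer is 42", "42"))        # partial token overlap -> 0.4
print(convfinqa_match("(100 - 80) / 80", 0.25))  # True
```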
Reference Paper Link - https://arxiv.org/pdf/2401.10225v1.pdf
2. Meta AI's Self-Rewarding Language Models -
Meta's "Self-Rewarding Language Models" are designed to improve themselves and complement or, in the future, completely replace human-dependent feedback methods.
This paper presents an approach that assumes access to a base pretrained language model and a small amount of human-annotated seed data, and then develops a model that possesses two skills simultaneously: instruction following (generating high-quality, helpful responses to user requests) and self-instruction creation (generating and evaluating new instruction-following examples to add to its own training set).
These skills are used so that the model can perform self-alignment, i.e., they are the components used to iteratively train itself using AI Feedback (AIF).
The result is Self-Rewarding Language Models, where the language model itself is used via LLM-as-a-Judge prompting to provide its own rewards during training. The paper suggests that during iterative DPO training, not only does instruction-following ability improve, but so does the model's ability to provide high-quality rewards to itself.
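A compact sketch of one self-rewarding iteration as described above, with generation and the DPO update stubbed out; the judge-prompt wording, candidate count, and helper names are assumptions for illustration. In the paper, the same model both generates the candidates and scores them, and the resulting preference pairs feed the next DPO round.

```python
# Hedged sketch of one self-rewarding iteration.
import random

JUDGE_TEMPLATE = (
    "Review the response to the instruction below and rate it from 0 to 5, "
    "awarding points for relevance, coverage, helpfulness, clarity, and expertise.\n"
    "Instruction: {instruction}\nResponse: {response}\nScore:"
)

def generate(model, prompt: str, n: int = 4) -> list[str]:
    """Stub: sample n candidate responses from the model."""
    return [f"candidate answer {i} to: {prompt}" for i in range(n)]

def judge_score(model, instruction: str, response: str) -> float:
    """Stub: the model scores its own response via the LLM-as-a-Judge prompt."""
    _prompt = JUDGE_TEMPLATE.format(instruction=instruction, response=response)
    return random.uniform(0, 5)  # placeholder for parsing the model's numeric score

def self_rewarding_iteration(model, instructions: list[str]) -> list[dict]:
    """Build DPO preference pairs: best-scored response = chosen, worst = rejected."""
    pairs = []
    for instruction in instructions:
        candidates = generate(model, instruction)
        scored = sorted(candidates, key=lambda r: judge_score(model, instruction, r))
        pairs.append({"prompt": instruction, "chosen": scored[-1], "rejected": scored[0]})
    return pairs  # run a DPO update on these pairs, then repeat with the updated model

print(self_rewarding_iteration(model=None, instructions=["Summarize the ChatQA paper."]))
```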
Fine-tuning Llama 2 70B with three iterations of this approach yields a model that outperforms many existing systems on the AlpacaEval 2.0 leaderboard, including Claude 2, Gemini Pro, and GPT-4 0613.
Reference Paper Link - https://arxiv.org/abs/2401.10020
For more information on AI research papers, you can visit my GitHub profile -
For receiving the latest updates on advancements in AI research (Gen-AI, Quantum AI & Computer Vision), you can subscribe to my AI Research Papers Summaries newsletter using the link below -
Thank you & Happy Reading !