Top LLM Papers of the Week (August Week 3, 2024)
[1] SelectLLM
This paper introduces SelectLLM, a novel algorithm developed to overcome the limitations of individual LLMs by choosing the most appropriate LLMs for a given query. SelectLLM uses the predictions and confidence scores of a multilabel classifier to select those LLMs. It outperforms individual LLMs and achieves competitive results compared to top-performing LLM subsets. [Tweet] and [Paper]
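A minimal sketch of the selection step, assuming a trained multilabel classifier with a scikit-learn-style predict_proba interface (the model pool, threshold, and fallback rule below are illustrative, not the paper's exact implementation):

```python
import numpy as np

LLM_POOL = ["llm_a", "llm_b", "llm_c"]  # hypothetical candidate models

def select_llms(query_features: np.ndarray, classifier, threshold: float = 0.5):
    """Select LLMs whose predicted suitability for the query clears a threshold."""
    # One probability per LLM from the multilabel classifier.
    probs = classifier.predict_proba(query_features.reshape(1, -1))[0]
    chosen = [llm for llm, p in zip(LLM_POOL, probs) if p >= threshold]
    # Fall back to the single most confident LLM if none clears the bar.
    return chosen or [LLM_POOL[int(np.argmax(probs))]]
```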
[2] GraphRAG (Survey)
Retrieval-Augmented Generation (RAG) addresses several LLM challenges but struggles to handle complex entity relationships in databases. GraphRAG tackles this by leveraging structural information for more precise retrieval and context-aware responses. This paper presents the first comprehensive overview of GraphRAG methodologies and also explores applications, evaluation methods, and future research directions. [Tweet] and [Paper]
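The core idea can be sketched with a toy entity graph: retrieve a query entity's neighborhood as structured context rather than flat text chunks (the graph and relations below are invented for illustration):

```python
import networkx as nx

# Toy knowledge graph; a real GraphRAG system would build this from data.
kg = nx.Graph()
kg.add_edge("Marie Curie", "Radium", relation="discovered")
kg.add_edge("Marie Curie", "Nobel Prize", relation="awarded")
kg.add_edge("Radium", "Radioactivity", relation="exhibits")

def graph_context(entity: str, hops: int = 1) -> list[str]:
    """Return relation triples within `hops` of the entity, to feed the prompt."""
    sub = nx.ego_graph(kg, entity, radius=hops)
    return [f"{u} --{d['relation']}--> {v}" for u, v, d in sub.edges(data=True)]

print(graph_context("Marie Curie"))
```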
[3] LLMs for Finance Applications
FinLLaMA is pre-trained on a 52-billion-token financial corpus that includes text, tables, and time-series data. FinLLaMA-instruct is developed by fine-tuning FinLLaMA with 573K financial instructions. FinLLaMA-instruct achieves state-of-the-art results, outperforming GPT-4 and other financial LLMs on a number of datasets. [Tweet] and [Paper]
[4] CommunityKG-RAG
This paper introduces CommunityKG-RAG, which integrates the community structure of Knowledge Graphs into Retrieval-Augmented Generation systems to enhance fact-checking. CommunityKG-RAG adapts to new domains and queries without additional training, making it highly versatile and applicable across various contexts. [Tweet] and [Paper]
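A rough sketch of the community-based retrieval idea, assuming an existing knowledge graph and off-the-shelf community detection (the graph and matching logic are stand-ins, not the paper's pipeline):

```python
import networkx as nx

kg = nx.karate_club_graph()  # placeholder for a real knowledge graph
# Precompute communities once; no task-specific training is required.
communities = nx.community.louvain_communities(kg, seed=0)

def community_evidence(entity) -> set:
    """Return all nodes sharing a community with the claim's entity."""
    return next((set(c) for c in communities if entity in c), set())

# A claim mentioning node 0 retrieves its whole community as the evidence scope.
print(community_evidence(0))
```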
[5] LLM Pruning and Distillation
The report focuses on compressing popular open-source LLMs, namely Llama 3.1 8B and Mistral NeMo 12B, to 4B and 8B parameters, respectively, using pruning and distillation techniques. This process yields a notable 4B model from Llama 3.1 8B and a state-of-the-art MN-Minitron-8B model from Mistral NeMo 12B. The model weights are open-sourced on Hugging Face under a permissive license. [Tweet] and [Paper]
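For a flavor of the distillation half, here is a standard logit-distillation loss in PyTorch (a generic sketch, not the report's exact recipe, which also covers pruning and retraining details):

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature: float = 2.0):
    """KL divergence between softened teacher and student distributions."""
    s = F.log_softmax(student_logits / temperature, dim=-1)
    t = F.softmax(teacher_logits / temperature, dim=-1)
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    return F.kl_div(s, t, reduction="batchmean") * temperature ** 2
```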
[6] W-RAG
Training dense retrievers in RAG systems is challenging due to the scarcity of ground-truth evidence. This paper introduces W-RAG, which utilizes the ranking capabilities of LLMs to create weakly labeled data for training dense retrievers. W-RAG enhances both retrieval and OpenQA performance compared to baseline models. [Tweet] and [Paper]
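A sketch of the weak-labeling step: rank candidate passages by how likely the LLM is to produce the known answer given each one, and keep the top passage as a positive for retriever training (the scoring helper is hypothetical, not the paper's code):

```python
def weak_label(question: str, answer: str, passages: list[str], score_fn) -> dict:
    """Pick the passage that best supports generating the gold answer.

    `score_fn(question, passage, answer)` is an assumed helper returning the
    LLM's log-probability of the answer given the passage.
    """
    scored = [(score_fn(question, p, answer), p) for p in passages]
    best_score, best_passage = max(scored)  # highest log-probability wins
    return {"query": question, "positive": best_passage}
```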
[7] RAGLab
RAGLab is a modular, research-oriented open-source library that includes implementations of 6 existing RAG algorithms. It provides a comprehensive ecosystem for investigating RAG algorithms, addressing the constraints in RAG development. [Tweet] and [Paper]
[8] Combining PLMs and LLMs for Text Classification
Open LLMs moderately outperform or match pretrained language models (PLMs) only when fine-tuned, raising questions about their cost-effectiveness. This paper introduces a confidence-based approach that combines PLMs with open LLMs for text classification. The proposed solution outperforms PLMs as well as zero-shot and few-shot LLMs, while competing closely with fine-tuned LLMs at a significantly lower cost. [Tweet] and [Paper]
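A minimal sketch of the confidence-based routing, assuming a fine-tuned PLM with a scikit-learn-style interface and an LLM fallback (the threshold and helper names are illustrative):

```python
import numpy as np

CONFIDENCE_THRESHOLD = 0.9  # would be tuned on validation data in practice

def classify(text: str, plm, llm_classify) -> str:
    """Use the cheap PLM when it is confident; escalate otherwise."""
    probs = plm.predict_proba([text])[0]
    if probs.max() >= CONFIDENCE_THRESHOLD:
        return plm.classes_[int(np.argmax(probs))]
    # Low-confidence examples go to the (more expensive) LLM.
    return llm_classify(text)
```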
[9] Flexora
LoRA is one of the most popular parameter-efficient fine-tuning techniques. However, LoRA can underperform on certain tasks due to potential overfitting. Flexora overcomes this limitation by automatically selecting the most important layers for fine-tuning, and it outperforms LoRA on various downstream tasks. [Tweet] and [Paper]
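With the PEFT library, restricting LoRA to a subset of layers looks roughly like this; Flexora's contribution is choosing that subset automatically, whereas the layer indices below are placeholders:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
config = LoraConfig(
    r=8,
    target_modules=["q_proj", "v_proj"],
    layers_to_transform=[4, 5, 20, 21],  # hypothetical "important" layers
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only the selected layers get adapters
```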
[10] JSON Response Formatting with LLMs
StructuredRAG is a new benchmark introduced to assess LLMs' ability to generate structured outputs such as JSON. Across 24 experiments, an average success rate of 82.55% was observed. Llama 3 8B-instruct often performed competitively with Gemini 1.5 Pro, despite being a smaller model. The findings highlight the need for further research to improve the reliability and consistency of structured output generation in LLMs. [Tweet] and [Paper]
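A simple validate-and-retry loop of the kind such benchmarks measure (`call_llm` is a hypothetical stub for any completion API):

```python
import json

def get_json_response(call_llm, question: str, retries: int = 2):
    """Request JSON output and verify that it actually parses."""
    prompt = f"Answer in JSON with keys 'answer' and 'confidence'.\n{question}"
    for _ in range(retries + 1):
        raw = call_llm(prompt)  # `call_llm` is an assumed stub, not a real API
        try:
            return json.loads(raw)  # well-formed JSON: success
        except json.JSONDecodeError:
            continue  # malformed output counts as a failure; retry
    return None
```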
If you like this, do subscribe to the newsletter so that you don't miss any of the interesting LLM and RAG-related papers.