Do Trillions Of Parameters Help In LLM Effectiveness?
By Venkat Ramakrishnan


"The more, the merrier" - A great saying to reflect on while organizing a party. Does the same apply for the number of parameters in a large language model (LLM) in increasing its effectiveness?

The Number Of Parameters Game

The more parameters a model has, the more connections can be formed among them, the goal being richer associations that lead to the right answer. When I see LLM releases that market accuracy based on the number of parameters, I become skeptical. 'Our LLM has a trillion parameters!' a release note will say, without mentioning whether that trillion actually makes the LLM's answers more accurate.

These parameters operate on data. If the underlying data is inaccurate, the output will be wrong irrespective of the number of parameters. It all depends on what we train the neural network with. If we train the LLM on, say, the whole Internet, chances are that its answers will be not only incorrect but inconsistent too (meaning if I ask the same question tomorrow, I will get a different answer!).

Whereas if I train the LLM on a focused domain, say car manufacturing, where I have control over what data I feed it, there is a much higher chance of getting the right answer. Here, increasing the number of parameters helps, because with more parameters the associations between them develop well.

But that holds only to a certain extent! There is a threshold above which increasing the number of parameters does not lead to more accurate results, no matter how correct your data is! Beyond that point, more parameters only lead to overfitting: your model will not work for future data that is not already represented in the training data!
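A minimal pure-Python sketch of this effect, using a toy regression rather than an actual LLM: a "huge" model with one parameter per training point memorizes the training set perfectly, yet still makes errors on held-out points that fall between the ones it memorized. The target function, noise level, and grid here are all illustrative assumptions.

```python
import random

random.seed(0)

def target(x):
    return 2 * x

# Noisy training points on a coarse grid; validation points fall between them.
train = [(x, target(x) + random.gauss(0, 0.5)) for x in range(1, 10)]
val = [(x + 0.5, target(x + 0.5)) for x in range(1, 10)]

# "Small" model: a single learned parameter (the slope).
slope = sum(y / x for x, y in train) / len(train)

# "Huge" model: one parameter per training point -- it memorizes everything.
memo = dict(train)

def small_model(x):
    return slope * x

def huge_model(x):
    # Answer with the memorized value of the nearest training point.
    nearest = min(memo, key=lambda t: abs(t - x))
    return memo[nearest]

def mse(model, data):
    return sum((model(x) - y) ** 2 for x, y in data) / len(data)

print("huge  train:", mse(huge_model, train), " val:", mse(huge_model, val))
print("small train:", mse(small_model, train), " val:", mse(small_model, val))
```

The memorizer's training error is exactly zero, but its validation error is not: perfect training fit bought by extra parameters does not transfer to unseen data, which is the overfitting trap described above.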

Software Testing and Quality Angle

What does this mean for a Software Testing or Quality person testing an LLM trained on curated data (not the entire Internet)?

  1. Validate the data - that is the first and foremost step for quality. Keep a diverse set of data relevant to your domain or business to test with, and keep integrating it into your training dataset until you have a comprehensive one. Data augmentation helps in generating variations of the data. This dataset should represent a focused task.
  2. Start testing with a limited number of parameters. If the results do not reach the necessary accuracy, increase the number of parameters and see if it helps.
  3. Keep a performance chart of LLM effectiveness versus the number of parameters. You will find the threshold beyond which increasing the parameters further does not increase the LLM's effectiveness. Keep in mind the cost involved in increasing the parameters and record that in the chart too.
  4. Test with various datasets that accomplish the same task, see if your model works well for them, and repeat steps 1 to 3.
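The data-augmentation idea in step 1 can be sketched as simple template expansion: one seed question fans out into many phrasing and entity variations. The templates and part names below are hypothetical examples for the car-manufacturing domain mentioned earlier, not from any real dataset.

```python
# Hypothetical seed templates and slot values for a focused
# car-manufacturing domain; in practice these come from your curated data.
templates = [
    "What is the torque spec for the {part}?",
    "Give me the torque specification of the {part}.",
    "How tight should the {part} be torqued?",
]
parts = ["head bolt", "lug nut", "crankshaft pulley bolt"]

# One seed intent expands into len(templates) * len(parts) test inputs.
augmented = [t.format(part=p) for t in templates for p in parts]

for question in augmented:
    print(question)
```

Real augmentation pipelines go further (paraphrasing models, back-translation, typo injection), but even this template form quickly grows a focused test set.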
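Steps 2 and 3 amount to sweeping model size, recording accuracy and cost, and spotting where the gains flatten out. A minimal sketch of that chart and threshold search, where every number in `runs` is illustrative and would come from your own evaluation runs:

```python
# Hypothetical evaluation results: accuracy and training cost per model size.
# Fill these rows in from your own runs; the numbers are illustrative only.
runs = [
    {"params": 100e6, "accuracy": 0.71, "cost_usd": 2_000},
    {"params": 500e6, "accuracy": 0.79, "cost_usd": 9_000},
    {"params": 1e9,   "accuracy": 0.84, "cost_usd": 20_000},
    {"params": 5e9,   "accuracy": 0.86, "cost_usd": 95_000},
    {"params": 10e9,  "accuracy": 0.86, "cost_usd": 210_000},
]

def find_threshold(runs, min_gain=0.01):
    """Smallest size after which the accuracy gain drops below min_gain."""
    for prev, curr in zip(runs, runs[1:]):
        if curr["accuracy"] - prev["accuracy"] < min_gain:
            return prev["params"]
    return runs[-1]["params"]

threshold = find_threshold(runs)
for run in runs:
    marker = "  <- diminishing returns past here" if run["params"] == threshold else ""
    print(f'{run["params"]:>14,.0f} params  acc={run["accuracy"]:.2f}  '
          f'cost=${run["cost_usd"]:,}{marker}')
```

With these sample numbers the threshold lands at 5 billion parameters: doubling the size again doubles the cost while adding nothing to accuracy, which is exactly the trade-off step 3 asks you to chart.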

Conclusion

Blindly increasing the number of parameters does not lead to LLM effectiveness and quality. We need to be conscious of data quality, the task we are focused on, performance criteria, and cost effectiveness. An optimal balance is achieved by thoroughly testing accuracy against the number of parameters.


