Bigger is not better in machine learning

I heard an interview today featuring Jonathan Frankle, the Chief Scientist of MosaicML. One of the things he said is that bigger is not better. He is, of course, talking about machine learning model size (e.g., the number of parameters).

I posit that we can agree that bigger models may not be better, and that when it comes to model size we generally have a good mental model and a set of metrics for the "bigger" part. But what about the "better" part of a machine learning model? It is much harder to define, and we have to peel back that onion a lot more.

One way to define it is that the model delivers superior results. Another is that the model is more explainable. Yet another is that the model is performant enough for the use case at a much lower cost. The list goes on and on. You can visualize this as a step function or a "build."

In my view, bigger can be better for some use cases, but with significantly diminishing returns over time: diminishing returns on time, financial investment, performance gain, and interpretability. On the other hand, there is some minimum model complexity needed for most enterprise use cases. So finding that equilibrium, the balance of size and performance for a specific use case, is a key part of the "art of data science and AI."
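To make that equilibrium concrete, here is a minimal Python sketch of one possible selection rule: pick the cheapest model that clears the quality bar the use case demands. The model names, accuracies, and costs are entirely hypothetical placeholders, not measurements from any benchmark.

```python
# Toy sketch of the size/performance equilibrium described above.
# All figures are hypothetical, purely for illustration.

candidates = [
    # (name, parameter count, task accuracy, cost per 1M tokens in USD)
    ("small", 1e9, 0.81, 0.10),
    ("medium", 7e9, 0.86, 0.50),
    ("large", 70e9, 0.88, 2.50),
    ("xl", 400e9, 0.89, 12.00),
]

REQUIRED_ACCURACY = 0.85  # minimum quality bar for this use case

# Keep only the models that clear the bar, then pick the cheapest one.
viable = [m for m in candidates if m[2] >= REQUIRED_ACCURACY]
best = min(viable, key=lambda m: m[3])

name, params, acc, cost = best
print(f"Chosen model: {name} ({params:.0e} params, "
      f"accuracy {acc:.2f}, ${cost:.2f}/1M tokens)")
```

In this toy example, accuracy climbs only from 0.81 to 0.89 while cost grows more than a hundredfold, which is the diminishing return in miniature; "medium" wins because it is the smallest model that is good enough.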

Also in my view, there is a lot of work to be done, and opportunity, in unpacking this "better" notion: both for enterprises looking to establish and mature their AI strategy and program, and for startups looking to build AI tools for enterprise customers.

Exciting times!

Ranganath Venkataraman

Automation and Innovation | Enterprise-wide value creation | Consulting Director

1y

Thanks for sharing, Joyce J. Shen - agree that model complexity, usability, and ability to interpret results are all factors influencing the bigger/better connection. Also of note are the environmental impacts of AI as computers churn through ever-increasing volumes of data - an MIT Technology Review article found that training just one AI model can emit more than 626,000 pounds of carbon dioxide equivalent.

Janusz (John) J.

(Consultant | PM | PhD) :: (ML | n-D Visualization | Typology & Meta-Languages | Algorithms | Libraries | SOTA & beyond)

1y

Excellent point

CHESTER SWANSON SR.

Realtor Associate @ Next Trend Realty LLC | HAR REALTOR, IRS Tax Preparer

1y

Thanks for Sharing.
