
The war for large language models

Bhasker Gupta

Founder & CEO at AIM

Published: December 20, 2021

Last week, the buzz was around large language models. A day after DeepMind came out with Gopher, a 280-billion-parameter transformer language model, Google introduced the Generalist Language Model (GLaM), a trillion-weight model that uses sparsity.

The full version of GLaM has 1.2T total parameters across 64 experts per mixture-of-experts (MoE) layer, with 32 MoE layers in total, but it activates only a subnetwork of 97B parameters (8% of 1.2T) per token prediction during inference.
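To make the sparsity figures concrete, here is a minimal Python sketch. The parameter counts, expert count, and layer count are the ones quoted above. Everything else is an assumption for illustration only: the function names are hypothetical, the gate scores are random numbers, and routing each token to the top 2 experts is an assumed setting that the text above does not state. It is not GLaM's actual implementation.

```python
import random

# Figures quoted in the text above.
TOTAL_PARAMS = 1.2e12    # 1.2T total parameters
ACTIVE_PARAMS = 97e9     # 97B parameters activated per token
EXPERTS_PER_LAYER = 64   # experts in each MoE layer
MOE_LAYERS = 32          # number of MoE layers


def active_fraction(active: float, total: float) -> float:
    """Share of the model's parameters used for one token prediction."""
    return active / total


def route_top_k(gate_scores: list[float], k: int) -> list[int]:
    """Toy router: return the indices of the k highest-scoring experts."""
    ranked = sorted(
        range(len(gate_scores)), key=lambda i: gate_scores[i], reverse=True
    )
    return sorted(ranked[:k])


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"active share per token: {fraction:.1%}")

# Hypothetical gate scores for one token in one MoE layer.
# k=2 is an assumption, not a figure from the text.
rng = random.Random(0)
scores = [rng.random() for _ in range(EXPERTS_PER_LAYER)]
chosen = route_top_k(scores, k=2)
print(f"experts used in this layer: {chosen} of {EXPERTS_PER_LAYER}")
```

The computed share comes out at roughly 8.1%, which matches the rounded 8% quoted above. The router shows the general idea behind sparse activation: every expert's weights exist in the model, but only the few experts selected for a given token do any work in that layer.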

As if this were not enough, in another development in the large language model space, LG AI Research has revealed its new artificial intelligence language model, “Exaone”, which has 300 billion tunable parameters.

Andrej Karpathy Believes AI Models Will Consolidate

Andrej Karpathy recently wrote a compelling Twitter thread arguing that AI model architectures are consolidating. In the last few years, the neural network architectures used across application areas have begun to look similar.

Transformers are one example. They were introduced for language processing, leading to the launch of models like BERT, XLNet, and GPT, but their scope is more far-reaching and they can be used for more diverse applications.

Also: Tesla’s Karpathy On The Tech Behind Its Autopilot Project

+ Big Tech & Their Favourite Deep Learning Techniques

Our Discussion with Elad Ziklik Of Oracle

Recently, for an episode of Simulated Reality, a video podcast series by Analytics India Magazine, we had Elad Ziklik, VP, Product Management for AI Services and Data Science at Oracle, in the house.

During the session, Elad delved deep into some critical aspects of artificial intelligence and how organisations can leverage its full potential, the challenges involved, and Oracle’s key AI capabilities and strategies.

Also: Chris Chelliah, Oracle JAPAC

+ In The Google Vs Oracle Fight, API Developers Win

Goa Institute of Management's PGDM in Big Data Analytics

The two-year residential Postgraduate Diploma in Management programme has an intake capacity of 120 students. The main intention of the programme is to create future-ready and data-fluent professionals equipped with capabilities for data-driven decision-making.

The programme structure is split in a 40:60 ratio between business knowledge and Big Data Analytics-related experience.

Also: Top Postgraduate Data Science Programmes In India

+ Top Under-graduate Data Science Programmes In India

Council Posts

AIM Leaders Council is an invitation-only forum of senior executives in the Data Science and Analytics industry. To check if you are eligible for a membership, please fill out the form here.

+ Data Engineering Advancements By 2025 - Anish Agarwal

+ Redefining The Customer Experience With Analytics And Technology In 2022 - Sunil Mirani

+ Envisioning The Future of Deepfakes - Aishwarya Srinivasan

+ The Democratisation Of Data—Everyone wants to know what’s happening NOW! - Gaurav Dhall

+ Building Multilingual Chatbots In India—Overcoming Cultural Challenges - Ankush Sabharwal

Featured Video: Xenobots | The Living Robots - That Reproduce

Hands-on Guides for ML Developers

+ How to Hide Objects in Images using Large-Mask Inpainting (LaMa)?

+ A Guide to Term-Document Matrix with Its Implementation in R and Python

+ A Deep Dive into PyeCharts, A Python Tool For Data Visualization

+ What is Dithering in Image Processing and How it Maintains Image Quality?

+ A Hands-On Guide to Automatic Music Generation using RNN

Data Science Hiring Process At Khatabook

At Khatabook, the analytics and data science team consists of 30 members. Analytics and data science is a centralised function in Khatabook. For instance, Khatabook has five products, and each product has a data science leader and a team of 4-5 data analysts/scientists.

Also: Data Science Hiring Process At Myntra

+ Data Science Hiring Process At Pine Labs

The Personal Data Protection Bill

The Indian Personal Data Protection Bill is expected to be taken up for a much-awaited discussion in the winter session of the Parliament, after almost two years of deliberations. But a few of its provisions have come under heavy scrutiny from dissenters and opposition parties across the country, leaving the future of the PDP Bill uncertain.

Also: All You Need To Know About India’s Upcoming Personal Data Protection Bill

+ Industry Verdict On Personal Data Protection Bill 2018: Noble Intentions, But Rocky Road Ahead

Player Of Games

After back-to-back innovations in the gaming space, DeepMind has taken a step further and created a system called Player of Games (PoG), whose structure and mechanism it has described in a research paper.

What makes Player of Games stand out is that it can perform well at both perfect-information and imperfect-information games.

Also: DeepMind Now Wants To Study The Behaviour Of Electrons, Launches An AI Tool

+ DeepMind & Mathematicians Use AI To Solve The Knot Problem

BOTTOM OF THE NEWS

Here's what happened last week.

+ Improving GPT’s Factual Accuracy Of Language Models

+ NVIDIA Builds Framework That Can Generate Motion Capture Animation Using Only Video Inputs

+ MeitY Offers Incentives For Semiconductor Production In India

+ Now Developers Can Train GPT-3 On Their Data

+ MLCommons Makes 2 Large Speech Datasets Freely Available

The Belamy
