The war for large language models

The war for large language models

Last week, the buzz was around large language models. A day after?DeepMind came out with Gopher, a 280 billion parameter transformer language model, Google introduced the Generalist Language Model (GLaM)?– a trillion weight model that uses sparsity.

The?full version of GLaM has 1.2T total parameters?across 64 experts per mixture of experts (MoE) layer with 32 MoE layers in total but only activates a subnetwork of 97B (8% of 1.2T) parameters per token prediction during inference.

As if this was not enough, in another innovation in the large language model generation space,?LG AI Research has revealed its new artificial intelligence language model “Exaone”, with capabilities of tuning 300 billion different parameters or variables.


1

Andrej Karpathy Believes AI Models Will Consolidate

Andrej Karpathy, recently wrote a very compelling Twitter thread, making an?argument about the consolidation of AI model architecture. In the last few years, the neural network architecture across areas of applications have begun to look similar.

An example is Transformers – traditionally linked to building language models, their scope is more far-reaching. Transformers were introduced for processing language models – leading to the launch of models like BERT, XLNet, and GPT. However, transformers can be used for more diverse applications.

Also:?Tesla’s Karpathy On The Tech Behind Its Autopilot Project

+?Big Tech & Their Favourite Deep Learning Techniques


2

Our Discussion with Elad Ziklik Of Oracle

Recently, for an episode of Simulated Reality, a video podcast series by Analytics India Magazine, we had?Elad Ziklik, VP, Product Management for AI Services and Data Science at Oracle, in the house.

During the session, Elad delved deep into some critical aspects of artificial intelligence and how organisations can leverage its full potential, the challenges involved, and Oracle’s key AI capabilities and strategies.

Also:?Chris Chelliah, Oracle JAPAC

+?In The Google Vs Oracle Fight, API Developers Win


3

Goa Institute of Management's PGDM in Big Data Analytics.

The?two-years residential Postgraduate Diploma in Management programme?has an intake capacity of 120 students. The main intention of the programme is to create future-ready and data-fluent professionals equipped with capabilities for data-driven decision-making.

The programme structure is divided into a 40 to 60 ratio of business knowledge and Big Data Analytics related experience.

Also:?Top Postgraduate Data Science Programmes In India

+?Top Under-graduate Data Science Programmes In India


4

Council Posts

AIM Leaders Council is an invitation-only forum of senior executives in the Data Science and Analytics industry. To check if you are eligible for a membership, please fill the form?here.

+?Data Engineering Advancements By 2025?- Anish Agarwal

+?Redefining The Customer Experience With Analytics And Technology In 2022?- Sunil Mirani

+?Envisioning The Future of Deepfakes?- Aishwarya Srinivasan

+?The Democratisation Of Data—Everyone wants to know what’s happening NOW!?- Gaurav Dhall

+?Building Multilingual Chatbots In India—Overcoming Cultural Challenges?- Ankush Sabharwal


5

Featured Video:?Xenobots | The Living Robots - That Reproduce


6

Hands-on Guides for ML Developers

+?How to Hide Objects in Images using Large-Mask Inpainting (LaMa)?

+?A Guide to Term-Document Matrix with Its Implementation in R and Python

+?A Deep Dive into PyeCharts, A Python Tool For Data Visualization

+?What is Dithering in Image Processing and How it Maintains Image Quality?

+?A Hands-On Guide to Automatic Music Generation using RNN


7

Data Science Hiring Process At Khatabook

At?Khatabook, the analytics and data science team consists of 30 members. Analytics and data science is a centralised function in Khatabook. For instance, Khatabook has five products, and each product has a data science leader and a team of 4-5 data analysts/scientists.

Also:?Data Science Hiring Process At Myntra

+?Data Science Hiring Process At Pine Labs


8

The Personal Data Protection Bill

The?Indian Personal Data Protection Bill?is expected to be taken up for a much-awaited discussion in the winter session of the Parliament, after almost two years of deliberations. But a few of the provisions have come under great scrutiny by dissenters and opposition parties across the country, leaving the future of the PDP bill a question mark.

Also:?All You Need To Know About India’s Upcoming Personal Data Protection Bill

+?Industry Verdict On Personal Data Protection Bill 2018: Noble Intentions, But Rocky Road Ahead


9

Player Of Games

After back to back innovations in the gaming space, DeepMind has taken a step further and created a system called?Player of Games (PoG), whose structure and mechanism it has released in a research paper.

What makes Player of Games stand out is that it can perform well at both perfect and imperfect information games.

Also:?DeepMind Now Wants To Study The Behaviour Of Electrons, Launches An AI Tool

+?DeepMind & Mathematicians Use AI To Solve The Knot Problem


10

BOTTOM OF THE NEWS

Here's what all happened last week.

+?Improving GPT’s Factual Accuracy Of Language Models

+?NVIDIA Builds Framework That Can Generate Motion Capture Animation Using Only Video Inputs

+?MeitY Offers Incentives For Semiconductor Production In India

+?Now Developers Can Train GPT-3 On Their Data

+?MLCommons Makes 2 Large Speech Datasets Freely Available

要查看或添加评论,请登录

Bhasker Gupta的更多文章

  • The End of the Coding Era

    The End of the Coding Era

    AI might take over coding sooner than anyone expects. And we’re not talking a cursory (pun intended!) 20-30%, but a…

    3 条评论
  • Can IndiaAI Deliver LLM in 6 Months? ??

    Can IndiaAI Deliver LLM in 6 Months? ??

    Is India’s wait for its own LLM finally coming to an end? Or are we on a 6-month reset? Last week, MeitY launched AI…

    2 条评论
  • Bengaluru is the AI Capital

    Bengaluru is the AI Capital

    A quiet yet decisive shift is happening in Namma Bengaluru. The ‘IT City’ is no longer just a hub for outsourcing or IT…

    3 条评论
  • The Week That Felt Like a Decade in Tech

    The Week That Felt Like a Decade in Tech

    Last week felt like a decade compressed into days with AI, quantum computing, and robotics all taking giant leaps…

    2 条评论
  • Karnataka Bags $115 Billion, What’s Next?

    Karnataka Bags $115 Billion, What’s Next?

    “Reimagine Growth” was the theme of this year’s Invest Karnataka, and the state has delivered on its promise in a big…

    1 条评论
  • What Unfolded at MLDS-2025

    What Unfolded at MLDS-2025

    “The human brain is an extremely efficient AGI. It runs on potatoes.

  • India Bets Big on AI in Budget 2025-26

    India Bets Big on AI in Budget 2025-26

    India had high hopes for the Union Budget 2025-26, especially after the Chinese DeepSeek model took the world by storm,…

  • India’s AI Mission: The Talk of Davos 2025

    India’s AI Mission: The Talk of Davos 2025

    As we celebrate the 76th Republic Day, AIM reflects on India’s incredible progress and the boundless opportunities…

    1 条评论
  • Indian IT ‘Agentic AI’ Mode: ON

    Indian IT ‘Agentic AI’ Mode: ON

    A few days ago, in a conference call with journalists and analysts, the CEO of a top Indian IT company was asked a…

    3 条评论
  • The Agentic AI Madness Begins

    The Agentic AI Madness Begins

    A few days ago, the AI world quietly reached a significant milestone. Dharmesh Shah, founder and CTO of HubSpot…

    5 条评论

社区洞察

其他会员也浏览了