AI/ML news summary: week 23

Marco van Hurne

AI & ML advisory | Author of The Machine Learning Book of Knowledge | Building AI skills & organizations | Data Science | Data Governance | AI Compliance Officer | AI Governance

Published: June 9, 2024

Here are the weekly articles, guides, and news about AI, curated so you won't have to.

Subscribe to the TechTonic Shifts newsletter

Hottest News

1. NVIDIA Announces Financial Results for First Quarter Fiscal 2025

NVIDIA reported revenue of $26.0 billion for the first quarter, up 18% from the previous quarter and 262% from a year ago. Results were driven by sales of H100 chips for training and running AI models, underscoring how much the sector has grown since the launch of ChatGPT.

2. Microsoft Build 2024: Everything Announced

Microsoft announced several new features, including updates to its AI chatbot Copilot, new Microsoft Teams tools, and more. Most notable are the Copilot Agents, AI assistants that promise to “independently and proactively orchestrate tasks for you.” The company also rolled out Phi-3-vision, a new version of the Phi-3 AI model announced in April.

3. Amazon Plans to Give Alexa an AI Overhaul and a Monthly Subscription Price

Amazon is updating Alexa with advanced generative AI capabilities and launching a subscription service separate from Prime to stay competitive with chatbots from Google and OpenAI. The move reflects the company’s strategic emphasis on AI amid internal and leadership changes.

4. Here’s What’s Really Going On Inside an LLM’s Neural Network

New research from Anthropic opens a window into the “black box” of the Claude LLM. The company’s latest paper, “Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet,” describes a method that partially explains how the model’s millions of artificial neurons fire to produce surprisingly lifelike responses to general queries.

5. Meta Introduces Chameleon — A Multimodal Model

Meta’s AI research lab just introduced Chameleon, a new family of ‘early-fusion token-based’ AI models that can understand and generate text and images in any order. Chameleon shows the potential for a different type of architecture for multimodal AI models, with its early-fusion approach enabling more seamless reasoning and generation across modalities.

Five 5-minute reads/videos to keep you learning

1. Build with Meta Llama

This is a series of step-by-step video tutorials from Meta to help you get started with their Llama models. It primarily covers how to run Llama 3 on Linux, Windows, and Mac and shows other ways of running it.

2. PaliGemma: Open Source Multimodal Model by Google

Google has introduced PaliGemma, an open-source vision language model with multimodal capabilities that outperforms its contemporaries in object detection and segmentation. This blog walks you through its specifications, capabilities, limitations, use cases, how to fine-tune and deploy it, and more.

3. The Foundation Model Transparency Index After 6 Months

The Foundation Model Transparency Index, launched in October 2023, is an ongoing initiative to measure and improve transparency in the foundation model ecosystem. This follow-up study finds that developers have become more transparent, though ample room for improvement remains. The paper and transparency reports are available on the index's website.

4. Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI

OpenAI has launched GPT-4o, a multimodal model that folds the capabilities of several separate models into one. This blog post discusses how GPT-4o works and how you can build a similar model.

5. GPU Poor Savior: Revolutionizing Low-Bit Open Source LLMs and Cost-Effective Edge Computing

The article explores progress in developing low-bit quantized large language models optimized for edge computing, highlighting the creation of over 200 models that can run on consumer GPUs such as the RTX 3090. These models achieve notable resource efficiency through advanced quantization methods, aided by new tools like Bitorch Engine and green-bit-llm for streamlined training and deployment.
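
To make the low-bit idea concrete, here is a minimal sketch of symmetric uniform quantization in plain Python. It is not the method used by Bitorch Engine or green-bit-llm. The function names `quantize` and `dequantize`, the single per-list scale, and the sample weights are all assumptions made for illustration.

```python
def quantize(weights, bits=4):
    """Map floats to signed integer codes in [-qmax, qmax] using one shared scale."""
    qmax = 2 ** (bits - 1) - 1              # 7 for 4-bit codes
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest code and clamp to the representable range.
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate floats from integer codes."""
    return [c * scale for c in codes]


# Made-up weights, chosen only to show the round trip.
weights = [0.70, -0.32, 0.10, 0.0]
codes, scale = quantize(weights, bits=4)
restored = dequantize(codes, scale)
```

Each weight is stored as a 4-bit code plus one shared float scale, and the round-trip error per weight is at most half a quantization step. Real low-bit schemes typically use finer-grained scales and training-aware tricks on top of this.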

Repositories & Tools

Mistral-7B-Instruct-v0.3 is an instruct fine-tuned version of the Mistral-7B-v0.3.
Mistral Fine-tune is the official repo to fine-tune Mistral open-source models using LoRA.
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
Verba is an open-source RAG tool with customizable frameworks.
Taipy turns data and AI algorithms into production-ready web applications.

Top Papers of The Week

1. Retrieval-Augmented Generation for AI-Generated Content: A Survey

This paper reviews existing efforts to integrate the RAG technique into AIGC scenarios. It first classifies RAG foundations according to how the retriever augments the generator, distilling the fundamental abstractions of the augmentation methodologies for various retrievers and generators.

2. Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

The paper reports on successfully scaling sparse autoencoders to extract diverse, high-quality features from Claude 3 Sonnet, Anthropic’s medium-sized AI model. These features, which are multilingual, multimodal, and highly abstract, include safety-relevant ones related to bias, deception, and security vulnerabilities. The features can also be used to steer the model's behavior.

3. Chain-of-Thought Reasoning Without Prompting

The study investigates the presence of Chain-of-Thought reasoning in pre-trained large language models by altering the decoding process to consider multiple token options. It reveals that this approach can uncover intrinsic reasoning paths, resulting in an improved understanding of the models’ capabilities and linking reasoning to greater output confidence, as demonstrated across different reasoning benchmarks.
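
The selection step can be sketched roughly as follows, as one reading of the idea rather than the authors' implementation. Each candidate path starts from a different first token, and the path whose answer tokens are predicted with the widest margin between the two most likely tokens is kept. The `Path` class, the function names, and all probabilities below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Path:
    text: str
    # (top-1 probability, top-2 probability) at each answer token; values are made up.
    answer_probs: list


def confidence(path):
    """Average gap between the two most likely tokens over the answer span."""
    gaps = [p1 - p2 for p1, p2 in path.answer_probs]
    return sum(gaps) / len(gaps)


def pick_path(paths):
    """Return the decoded path with the highest answer confidence."""
    return max(paths, key=confidence)


paths = [
    Path("direct guess", [(0.45, 0.40)]),
    Path("step-by-step reasoning, then answer", [(0.95, 0.02)]),
]
best = pick_path(paths)
```

In this toy data the path that reasons before answering wins because its answer token is far more certain, which mirrors the link between reasoning and output confidence described above.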

4. Thermodynamic Natural Gradient Descent

The paper presents a novel hybrid digital-analog algorithm that imitates natural gradient descent for neural network training, promising the better convergence rates of second-order methods while keeping computational cost close to that of first-order methods. By exploiting the properties of thermodynamic analog systems, the approach avoids the expensive computations typical of current digital techniques.
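
For readers new to natural gradient descent, here is a minimal, purely digital sketch of a single update, theta <- theta - lr * F^{-1} * grad, for two parameters. It says nothing about the paper's analog hardware. The helper names and the curvature matrix `fisher` are made-up assumptions, and the linear solve shown is the step that becomes expensive at scale.

```python
def solve_2x2(F, g):
    """Solve F x = g for a 2x2 matrix F using the closed-form inverse."""
    (a, b), (c, d) = F
    det = a * d - b * c
    if det == 0:
        raise ValueError("F is singular")
    return [(d * g[0] - b * g[1]) / det, (a * g[1] - c * g[0]) / det]


def natural_gradient_step(theta, grad, fisher, lr=0.1):
    """One update: theta <- theta - lr * F^{-1} grad."""
    direction = solve_2x2(fisher, grad)
    return [t - lr * d for t, d in zip(theta, direction)]


theta = [1.0, 1.0]
grad = [4.0, 1.0]
# Made-up curvature: the loss is four times steeper along the first coordinate.
fisher = [[4.0, 0.0], [0.0, 1.0]]
new_theta = natural_gradient_step(theta, grad, fisher, lr=0.5)
```

Rescaling by the curvature moves both coordinates by the same amount here, whereas plain gradient descent (an identity matrix in place of `fisher`) would overshoot along the steep coordinate.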

5. Not All Language Model Features Are Linear

A recent study disputes the linear representation hypothesis in language models by revealing multi-dimensional representations through sparse autoencoders, notably circular representations for time concepts in GPT-2 and Mistral 7B. These representations have proven beneficial for modular arithmetic tasks, and intervention experiments on Mistral 7B and Llama 3 8B underscore their significance in language model computations.
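
A small sketch shows why a circular representation suits modular arithmetic. It does not reproduce the study's experiments; it only places the days of the week on a circle so that "add n days" becomes a rotation. The function names and the seven-day example are illustrative assumptions.

```python
import math

DAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]


def embed(index, period=7):
    """Place a cyclic value on the unit circle."""
    angle = 2 * math.pi * index / period
    return (math.cos(angle), math.sin(angle))


def rotate(point, steps, period=7):
    """Advance a point by `steps` positions around the circle."""
    angle = 2 * math.pi * steps / period
    x, y = point
    return (x * math.cos(angle) - y * math.sin(angle),
            x * math.sin(angle) + y * math.cos(angle))


def decode(point, period=7):
    """Read the nearest cyclic index back from a point on the circle."""
    angle = math.atan2(point[1], point[0])
    return round(angle / (2 * math.pi) * period) % period


start = DAYS.index("Sat")
result = decode(rotate(embed(start), 4))   # four days after Saturday
```

The wrap-around from Sunday back to Monday comes for free from the geometry, which a single linear direction cannot provide.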

Quick Links

1. Microsoft introduces Phi-Silica, a 3.3B parameter model made for Copilot+ PC NPUs. It will be embedded in all Copilot+ PCs when they go on sale starting in June. Phi-Silica is the fifth and smallest variation of Microsoft’s Phi-3 model.

2. Cohere announced the open weights release of Aya 23, a new family of state-of-the-art multilingual language models. Aya 23 builds on the original model Aya 101 and serves 23 languages.

3. IBM announced it will open-source its Granite AI models and will help Saudi Arabia train an AI system in Arabic. The Granite tools are designed to help software developers complete computer code faster.

Think a friend would enjoy this too? Share the newsletter and let them join the conversation.

Well, that's it for now. If you like my article, subscribe to my newsletter or connect with me. LinkedIn rewards your likes by showing my articles to more readers.

Signing off - Marco


