Which AI Model Has the Most Rizz? Evaluating GitHub Models for Maximum Sauce
Asha Holla
Analytics, Automation, AI @Bloom · Data Nerd · Speaker · Technical Writer · Open Source · DE&I
GitHub Models on the marketplace is a platform that lets developers discover and test different AI models directly within GitHub. It not only allows developers to assess and choose the models that best fit their project requirements, but also provides an environment where users can experiment with different models by adjusting parameters and testing various prompts.
It features an interactive playground for experimentation, seamless integration with tools like Copilot Chat and GitHub CLI, and built-in best practices for responsible AI use.
You can experiment with a plethora of models, including newer ones like DeepSeek R1 and GPT-4o. The playground also lets you tweak parameters based on the kind of output you expect: you can limit the output tokens to get crisper responses, or adjust the temperature to control how random the responses are.
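Outside the playground UI, the same knobs can be set programmatically. The sketch below builds an OpenAI-style chat-completions request against the GitHub Models inference endpoint. Treat it as a minimal, hedged example: the endpoint URL, the `gpt-4o` model identifier, and authenticating with a `GITHUB_TOKEN` environment variable are assumptions about the service, not something this article verified.

```python
import json
import os
import urllib.request

# Assumed GitHub Models inference endpoint (OpenAI-compatible chat API).
GITHUB_MODELS_ENDPOINT = "https://models.inference.ai.azure.com/chat/completions"


def build_chat_request(model: str, prompt: str,
                       temperature: float = 1.0, max_tokens: int = 256) -> dict:
    """Mirror the playground knobs described above:
    temperature controls randomness, max_tokens caps response length."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,  # lower = more predictable lyrics
        "max_tokens": max_tokens,    # keep the remix short and punchy
    }


def send_chat_request(payload: dict) -> dict:
    """POST the payload, authenticating with a GitHub token
    (assumed to live in the GITHUB_TOKEN environment variable)."""
    req = urllib.request.Request(
        GITHUB_MODELS_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['GITHUB_TOKEN']}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


# Example: a lower-temperature, token-capped remix request (built, not sent).
payload = build_chat_request(
    "gpt-4o",
    "Rewrite 'Blank Space' in brainrot slang",
    temperature=0.7,
    max_tokens=200,
)
```

Dropping `temperature` toward 0 and shrinking `max_tokens` is the API-side equivalent of the playground sliders mentioned above.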
For our experiment today, let's see which of these LLMs can generate peak brainrot renditions of popular songs off the Billboard chart.
Testing GitHub’s AI Models for Maximum Sauce
To find out, I grabbed a few AI models from the GitHub Models Marketplace and put them to the test. My goal? To see which one could take a regular pop song and remix it into pure brainrot. I gave each model the same prompt and judged how well it understood the assignment, the quality of the lyrics, and how quickly it responded.
Model Showdown
Here's how the models stacked up:
1. DeepSeek R1
The model did not understand what was meant by brainrot. I supplied a simple prompt to brainrot Taylor Swift's "Blank Space," and even supplied brainrot words like "fanum tax" and "skibidi" to set the context. Here was the output:
Verdict: The model struggles to output just song lyrics, instead providing reasoning for each line, doesn’t understand "brainrot," responds slowly, and occasionally fails.
2. OpenAI GPT-4o
For the same prompt, GPT-4o delivered a good result. It faltered in a few places and messed up some rhyme schemes, but overall the output was usable.
Verdict: Promising and the right amount of brainrot
3. Meta Llama 3.1
I'm a bit on the fence about this one. It picked the right mix and variety of brainrot words, and the breakdown of the song into verse and chorus really helps set the tone.
Verdict: Impressive job on the lyrics, quick to respond, doesn't think for long - strong contender
4. Mistral Large 24.11
I'm not sure which song this model tried to summarize—it seems like a mix of random lyrics, and none of it quite aligns with any recognizable song.
Verdict: Strong start, but it ends up messing up halfway through. Wrong tool for the job.
Final Take
For peak brainrot song generation, GPT-4o had the best mix of coherence and absurdity, but Llama 3.1 had raw chaotic energy. For meme-tier remixes, I'd use Llama 3.1, but GPT-4o gets brownie points for presentation.
DeepSeek and Mistral excel on many fronts but brainrot isn't really their strong suit.
If GitHub continues expanding its model marketplace, who knows? We might one day get a dedicated brainrot AI model—until then, I’ll keep experimenting.
#ADSBlogs #AzureDeveloperCommunity