DeepSeek R1: The New Kid on the Language-Model Block
Where Did My Data Go, and Why Is This Model So Good at Puns?

Yes, this is a late post! But hey, I'm still a lazy guy posting at my own speed and comfort!

DeepSeek R1 waltzed into the world of large language models (LLMs) the same way a mystery dessert appears at a potluck: everyone’s excited, but we’re all secretly wondering what’s in it. Compared to the bigger, more famous dinners served by OpenAI (think GPT-3.5, GPT-4, etc.), DeepSeek R1 might feel like a homemade treat. It promises a fresh approach, some delightfully new flavors, and a few unknown ingredients.

The question is, is it finger-lickin’ good or did someone sneak in tofu? Let’s slice it up and see.

How Is DeepSeek R1 Different from OpenAI Models?

  • Team Player vs. VIP Member: OpenAI’s GPT models are like VIP club members—slick, well-advertised, and sometimes behind a velvet rope (subscription or invite only). DeepSeek R1, on the other hand, wants to be the friend who shows up to your open mic night and offers to help you write your next chart-topper. Basically, it’s open-source, and we’ll talk about what that means in a bit.
  • Data & Privacy: OpenAI’s models are known for stringent content policies, but where the actual data is stored and how it’s used often feels like it’s locked in a black box in the middle of San Francisco. DeepSeek R1 says it’s more transparent about data handling (though how much more transparent is up for debate).
  • Size Matters: DeepSeek R1 claims to be smaller, more nimble, and easier to run on your own hardware—like that microbrewery that just opened down the street. It’s not a behemoth that requires hundreds of GPU servers to spin up (allegedly).

The Mysterious Cost (Do We Really Know?)

This is the million (or billion?) parameter question. One of the most whispered conversations in AI circles is: “Sure, DeepSeek R1 is open source, but what about the hidden costs?”

  • Computational Overhead: Even though it’s “lighter,” you’ll still want a decent GPU or a potent CPU to run it well. So your wallet might protest if you decide to host it on your own and need a hardware upgrade.
  • Electricity Costs: Training or even fine-tuning an LLM can be more expensive than fueling a Formula 1 car. DeepSeek R1’s creators claim it’s more efficient, but you might need to keep your electricity company on speed-dial just in case.
  • Cloud Provider Fees: If you don’t have a souped-up local setup (like 99% of us), you might rely on a cloud provider. The pricing for GPU hours can make your credit card cry. So, do we really know the cost? Not exactly, but let’s just say it’s cheaper than paying Elon Musk to launch it on a rocket, and more expensive than that comfy new gaming chair you’ve been eyeing.
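For the cloud-fee bullet in particular, a quick back-of-envelope calculation goes a long way before you commit. Here's a minimal sketch with completely made-up hourly rates (check your provider's real pricing; every number below is a placeholder, not a quote):

```python
# Back-of-envelope cloud cost estimate for hosting an LLM yourself.
# All rates are hypothetical placeholders -- check your provider's
# actual GPU pricing before budgeting anything.

def monthly_gpu_cost(gpu_hourly_rate: float, hours_per_day: float, days: int = 30) -> float:
    """Estimate the monthly bill for renting one GPU instance."""
    return gpu_hourly_rate * hours_per_day * days

# Example: one mid-range cloud GPU at an assumed $1.50/hour,
# running 8 hours a day.
cost = monthly_gpu_cost(gpu_hourly_rate=1.50, hours_per_day=8)
print(f"Estimated monthly cost: ${cost:.2f}")  # -> Estimated monthly cost: $360.00
```

Even at modest assumed rates, the numbers add up fast once you run the instance around the clock, which is exactly why "open source and free to download" is not the same as "free to operate."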

Where Is the Data Stored? (Are We Snooping?)

One of the best parts about open-source projects is that in theory you can take a look at the code and see how the sausage is made. But in practice, sometimes “open source” just means you can see the model weights, not necessarily every detail of the original dataset or the exact process of turning that data into a magical text-generating machine.

DeepSeek R1 is rumored to have been trained on a mix of public domain texts, curated web content, and (maybe) other open datasets. Where exactly all that data is stored can vary:

  • If you host DeepSeek R1 yourself, that data is on your hardware or in your cloud instance.
  • If you use a community-provided version, your data might pass through their servers—so it’s worth reading the fine print (yes, the dreaded TOS).

Basically, it’s not quite the “we’re all BFFs sharing secrets over a campfire” scenario. But it’s more open than many big-name, black-box models.


What Does Open Source Really Mean?

“Open source” is often thrown around like “organic” at a grocery store. It generally means:

  1. You can access the code (or in this case, the model weights).
  2. You can modify it, adapt it, fine-tune it.
  3. You can share it with others under specific license conditions.

In the LLM world, open source means you’re not entirely dependent on the whims of a private company to fix bugs or add features. You can roll up your sleeves and do it yourself—or rely on that friendly neighborhood AI developer who just loves pulling all-nighters.

However, licensing can vary. Some open-source LLMs come with permissive licenses (like Apache or MIT) that let you do almost anything. Others have restrictions (like requiring you to share improvements back or limiting commercial use). So open source can be as free as a streaker on a beach, or it can come with a few fig leaves. DeepSeek R1’s weights are released under the MIT license, which puts it toward the beach end of that spectrum.

Variants, Anyone? (All the Flavors on Ollama & Hugging Face)

DeepSeek R1 might be the base recipe, but the community (and DeepSeek itself, with its official R1-Distill versions built on Qwen and Llama bases) is known to create spinoffs, forks, and super-secret special sauce versions. On platforms like Hugging Face, you might find:

  • DeepSeek R1-Base
  • DeepSeek R1-Turbo (someone fine-tuned it for speed or specialized tasks)
  • DeepSeek R1-Humor (fine-tuned to deliver next-level puns)
  • DeepSeek R1-Privacy (more emphasis on data security)

On Ollama, a tool for downloading and running language models on local machines (macOS, Linux, and Windows), you might stumble across variants specifically quantized to run comfortably on consumer hardware like Apple’s M-series chips. So, whether you want the “vanilla” DeepSeek or a full double-chocolate with sprinkles edition, you’ll probably find a flavor to suit your taste.
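Those quantized variants exist for a reason: weight precision largely determines how much memory you need just to load the model. A minimal sketch of the arithmetic (weight storage only; real inference needs extra headroom for the KV cache and activations, so treat these as floor values):

```python
# Rough memory needed just to hold a model's weights at different precisions.
# Real inference uses more (KV cache, activations); these are floor values.

def weight_memory_gb(n_params_billion: float, bits_per_weight: int) -> float:
    """Approximate gigabytes needed to store the weights alone."""
    bytes_total = n_params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9  # decimal GB

for bits, label in [(16, "fp16"), (8, "int8"), (4, "4-bit quantized")]:
    print(f"7B model @ {label}: ~{weight_memory_gb(7, bits):.1f} GB")
# fp16 needs ~14 GB, int8 ~7 GB, 4-bit ~3.5 GB
```

This is why a 4-bit quantized 7B model fits on a laptop while the full-precision version wants a workstation GPU, and why the smaller distilled variants are the ones most people actually run locally.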

The Bottom Line (With a Side of Laughs)

DeepSeek R1 is an intriguing new LLM that’s aiming to shake up the AI scene:

  • It’s open-source (meaning you can tinker under the hood).
  • It’s (allegedly) cost-effective to run (but you might still want a prayer circle for your next cloud bill).
  • Variants are popping up everywhere, from Hugging Face to Ollama, so you can choose your own adventure.
  • Its data handling is more transparent than some, but still not as clear as your grandma’s recipe card.

Will DeepSeek R1 dethrone OpenAI anytime soon? Probably not this afternoon—but it’s definitely worth keeping on your radar (or hugging face, or ollama directory, or wherever you get your LLM fix). If nothing else, it’s another example of how quickly the open-source AI community is innovating, democratizing, and occasionally setting off fireworks in the process.

So if you’re feeling brave (and possibly comedic), give DeepSeek R1 a spin. Who knows? Maybe you’ll discover it’s the next best thing to a robot butler—or at the very least, a pretty decent co-author for your next big article.

Disclaimer:

Don’t actually feed it your banking info. Or do—just don’t blame me if it suggests investing in dog-themed cryptocurrencies. The joys of open source, folks!

And in the End: Market Losses and Nvidia’s Relevance

So, let’s talk about the big, green elephant in the room: the market meltdown. Markets—especially tech—are moody. One minute they’re popping champagne with all-time highs; the next, they’re crying into their cappuccinos because someone sneezed on the other side of the world.

  • Overblown Expectations: AI hype got turned up to 11. Then reality tapped us on the shoulder and reminded everyone that real profits take time, stable supply chains, and enormous amounts of electricity. Cue the market correction.
  • Macro Challenges: Inflation, interest rates, global conspiracies involving an alien chip shortage—take your pick. Market swings don’t usually come from a single cause, but from a symphony of negative news.
  • The NVDA Factor: Despite the turbulence, Nvidia (NVDA) remains the valedictorian of the GPU graduating class. Why? Because every LLM, from DeepSeek R1 to the biggest GPT-whatever, still needs a monstrous amount of GPU power to train (and often, even to run). Whether it’s data centers, gaming, or AI research, Nvidia’s chips are like the A-list celebrities everyone wants on their talk show. So even in a down market, Nvidia has more staying power than a pop star with eight world tours.

In short, the market might be sobbing into its pillow, but Nvidia’s still got the big piece of the GPU pie—and with AI demands only growing, they’re not about to be dethroned by next week’s software meltdown.


