DEEP SEEK -- DOWN THE WHALE'S GULLET

Deep Seek

One word: Don't.

Let's leave aside that it's a Chinese model and that you have to agree to terms that hand your personal information and data to an adversarial nation. Apparently, that isn't enough to scare off some corporations and a whole lot of people. It's like TikTok: you have to be insane to be on that platform, but it's your life....

Anyway, Deep Seek does several things, from a technical perspective, that let it run (in its smaller incarnations) on your average laptop or desktop computer with no special hardware, GPUs in particular. The two biggies are MoE and mixed/low precision. MoE stands for "Mixture of Experts," a technique in which a bunch of small neural networks (the "experts") cooperate to solve a problem, with a router deciding which few of them handle each piece of input. These take the place of one huge NN that takes a lot of computing horsepower to run; only a small share of the model is working at any moment while the rest of the landscape "sleeps."
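To make the "most of the model sleeps" idea concrete, here is a minimal sketch of top-k expert routing in plain Python. It is an illustration only, not Deep Seek's actual code: the names `make_expert` and `moe_forward`, the toy "experts" (each just scales its input), and the router weights are all hypothetical stand-ins for real trained networks.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def make_expert(scale):
    # Hypothetical "expert": a trivial function standing in for a small network.
    return lambda x: [scale * v for v in x]

def moe_forward(x, router_weights, experts, top_k=2):
    # Router score for each expert: dot product of the input with that expert's row.
    scores = [sum(w * v for w, v in zip(row, x)) for row in router_weights]
    probs = softmax(scores)
    # Only the top_k highest-scoring experts run; the rest stay idle.
    chosen = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    out = [0.0] * len(x)
    for i in chosen:
        weight = probs[i] / norm          # renormalize over the chosen experts
        y = experts[i](x)
        out = [o + weight * v for o, v in zip(out, y)]
    return out, chosen

# Four toy experts, made-up router weights (assumptions for illustration).
experts = [make_expert(s) for s in (0.5, 1.0, 2.0, 4.0)]
router_weights = [[1.0, 0.0], [0.0, 1.0], [3.0, 0.0], [-1.0, 0.0]]
output, active = moe_forward([1.0, 0.0], router_weights, experts, top_k=2)
print(f"experts that ran: {active} out of {len(experts)}")
```

With `top_k=2`, only two of the four experts do any work for this input, which is where the compute savings come from. The total parameter count still has to be stored somewhere, so MoE reduces computation per input more than it reduces memory.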

The other is "Mixed Precision," or "Low Precision." In machine learning, and NNs in particular, a model's weights are "quantized," meaning they are stored as numbers of a chosen bit width. Think of data word lengths by way of analogy: you have 64 bit, 32 bit... all the way down to 4 bit. You save a LOT of memory and computing resources when you use the lower widths, or "low precision," but the trade-off is that your AI model is less accurate. It's actually a common technique, most notably used by model providers like Meta and Google to make small models that will run on conventional PCs; I have used this technique in my own work with a research team that needs to reduce its GPU memory footprint. But again, what you gain in resource conservation you lose in accuracy. So although the Internet is effing gaga over the wonders of Deep Seek, it completely neglects this little caveat.
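The trade-off is easy to see with a few numbers. The sketch below assumes a simple symmetric, one-scale-per-list quantization scheme and a made-up list of weights; real toolchains use more refined schemes, so treat the function names (`quantize`, `dequantize`) and the values as illustrative only.

```python
def quantize(weights, bits):
    # Map floats onto signed integers in [-qmax, qmax] using a single scale factor.
    qmax = 2 ** (bits - 1) - 1
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs else 1.0
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    # Recover approximate floats from the stored integers.
    return [v * scale for v in q]

def mean_abs_error(a, b):
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

# Made-up weights standing in for a slice of a real model (assumption).
weights = [0.82, -0.41, 0.07, -0.93, 0.55, 0.30, -0.12, 0.68]

errors = {}
for bits in (8, 4):
    q, scale = quantize(weights, bits)
    errors[bits] = mean_abs_error(weights, dequantize(q, scale))
    storage_bytes = len(weights) * bits / 8   # integer payload only, ignoring the scale
    print(f"{bits}-bit: {storage_bytes:.0f} bytes, mean abs error {errors[bits]:.4f}")
```

Going from 8 bits to 4 bits halves the storage for the weights, and the rounding error on each weight grows accordingly. Multiply that across billions of weights and you get a model that fits on a laptop but answers less precisely.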

Speaking of the Internet, it's gone nuts over the Big Splash Deep Seek has made and touts it as the latest wonder. Yeah, well, Deep Seek has been available for nearly a year on a website called "ollama.com," along with dozens of other models that do natural language processing, computer vision, coding, web scraping... just about anything you can think of. And nearly all of them will run on your very own laptop. Did I mention the models Deep Seek was built on? Deep Seek's foundation was laid with Meta's Llama and Alibaba's Qwen models (also available on ollama.com). There were a few others, but those were the primary bricklayers. So yes, most of Deep Seek's code is open-sourced because it's, well, somebody else's.

The upshot is this: Deep Seek may be wonderful at siphoning your personal data and shipping it off to Beijing. But would you trust it with a cancer diagnosis, an analysis of the global economy or even a recipe to bake bread?

Caveat Emptor.
