Docker AI Catalog: The Future of Curated AI Models and LLMs
Turja N Chaudhuri
Global Lead, Platform Success, EY Fabric | Views are my own
TL;DR
Context
AI, and specifically GenAI, is progressing at a tremendous pace; it is almost impossible to keep track of, even for someone dedicated full-time to the ecosystem. AI newsletters are brimming with details about the next new LLMs, SLMs, AI orchestrators, AI gateways, and so on.
This innovation in AI is extremely promising, but it presents a new challenge to most enterprises: how to process all this information and decide on the correct course of action.
Enterprises cannot afford to experiment with every single LLM or library that is out there. They need someone to guide them on what is trusted, secure, and approved for use within the context of an enterprise ecosystem.
The AI Conundrum: Too Much to Choose From
As part of the Internal Platform Adoption team at EY, when I speak to my enterprise customers, most of the questions around GenAI revolve around topics like:
"Is this LLM allowed to be used by my company?"
"Can I download LLMs from Hugging Face? Is that allowed?"
"How can I be sure this LLM does not have any vulnerabilities? Can I use my client's data with it?"
Obviously, every solution team will need to do its own due diligence on what is secure or allowed within its usage context, but enterprises also need some generic guidance to help teams distinguish what is approved from what is blocked within the confines of the enterprise itself.
Basically, they need someone to filter this ever-increasing catalog of AI models, LLMs, SLMs, AI orchestrators, AI gateways, and more, and tell them which ones they can safely use.
The Need for Curation within the AI Ecosystem
The AI ecosystem is now too fragmented for any single person to navigate. Beyond individuals, entire portfolios and teams are having a difficult time identifying what to use and what to avoid.
In the vast and growing landscape of AI, not all AI models are created equal. Businesses often struggle to identify which AI models meet their needs, ensure compatibility with existing systems, and maintain ethical and reliable performance.
Curated AI models solve this problem by offering pre-vetted, high-quality options that have already been screened for security, compatibility, and reliability.
From my perspective, this is where AI Catalogs have a huge role to play.
They offer a single pane of glass within an enterprise context, letting consumers choose from a curated subset of models and LLMs that are vetted to be more secure and compliant than others. This provides the confidence that engagement teams need to get started on a piece of work.
Docker AI Catalog
Docker is a trusted name in the industry, and Docker Hub has long been the gold standard for teams to find Docker images and Helm charts for a given use case. With the addition of the GenAI catalog, teams can also search a curated list of offerings that they can then use confidently across the entire GenAI application development lifecycle.
Docker AI Catalog is a curated repository of AI and machine learning (ML) models, and other associated artifacts, designed to simplify the process of integrating advanced AI capabilities into applications.
Let's take an example:
Say an enterprise user wants to start using Llama by deploying it on their company's private cluster, but they don't know where to get started, what the latest secure LLM image is, and so on.
They can come to Docker Hub and search for Llama.
Once they find what they are looking for, they can get more details:
As you can see, there is a lot to unpack here:
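The search step in the walkthrough above can also be done programmatically. The sketch below queries Docker Hub's public v2 search API; the endpoint path and the `repo_name`/`short_description` response fields are based on the publicly documented Hub API, but treat them as assumptions rather than a guaranteed contract:

```python
import json
import urllib.parse
import urllib.request

HUB_SEARCH = "https://hub.docker.com/v2/search/repositories/"

def build_search_url(query: str, page_size: int = 10) -> str:
    """Build a Docker Hub v2 search URL for the given query string."""
    params = urllib.parse.urlencode({"query": query, "page_size": page_size})
    return f"{HUB_SEARCH}?{params}"

def search_hub(query: str) -> list:
    """Fetch matching repositories from Docker Hub (requires network access)."""
    with urllib.request.urlopen(build_search_url(query)) as resp:
        payload = json.load(resp)
    return payload.get("results", [])

if __name__ == "__main__":
    # Mirrors the manual "search for Llama" step on the Docker Hub homepage.
    for repo in search_hub("llama"):
        print(repo.get("repo_name"), "-", (repo.get("short_description") or "")[:60])
```

In practice most users will simply use the Docker Hub UI, but this kind of query is handy if a platform team wants to surface the curated catalog inside its own internal portal.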
Docker AI Catalog: Different Artifact Types
Docker has been in the business of managing, verifying, and offering containerized workloads for a long time, and Docker Hub remains the de facto image repository for most of us.
So, with the GenAI ecosystem in play, it's only natural for Docker to put skin in the game and leverage its existing investments in this space to provide a similar experience for GenAI assets.
But it's not only LLM model images that Docker provides a standard consumption pattern for; you can also search for and consume the other artifacts involved in building a GenAI stack, such as orchestrators, vector databases, and AI gateways.
So, essentially, it's a one-stop catalog: a curated, verified collection of the components one might need to build a GenAI stack, from the models, to the orchestration layer, to the eventual deployment engine.
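To make that concrete, a GenAI stack assembled from catalog artifacts might be wired together with Docker Compose along these lines. This is a minimal sketch: the image names (`ai/llama3.2`, `example/vector-db`, `example/genai-app`) and ports are hypothetical placeholders, not specific catalog entries:

```yaml
services:
  llm:
    image: ai/llama3.2              # placeholder for a curated model image from the catalog
    ports:
      - "11434:11434"
  vector-db:
    image: example/vector-db:latest # placeholder for a catalog-listed vector database
    ports:
      - "6333:6333"
  app:
    image: example/genai-app:latest # your application / orchestration layer
    depends_on:
      - llm
      - vector-db
```

The point is less the specific services and more that every layer of the stack can, in principle, be sourced from the same curated, verified catalog.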
Where Can I Find the Docker GenAI Catalog?
You can access the Docker GenAI Catalog right from the homepage of Docker Hub (https://hub.docker.com/):
Recently, it has also been integrated into Docker Desktop, so you no longer need to navigate out to Docker Hub; you can find the GenAI-related artifacts within the Desktop UI itself.
Conclusion
Docker, Inc. has been around for a long time now, as have Docker Hub and its other associated offerings.
Until now, they provided an easy-to-understand way of searching for and consuming container artifacts like Docker images and Helm charts. Now, Docker has taken it up a notch and extended that framework to support GenAI assets (LLM models, orchestrators, etc.) as well, following the same consumption pattern.
By offering curated AI models and LLMs in a user-friendly, secure, and scalable format, the Docker AI Catalog paves the way for businesses to harness the full potential of artificial intelligence. As the industry continues to embrace AI, platforms like it will undoubtedly become integral to the next wave of innovation.
Anybody who has spent time in an enterprise setting knows how difficult it is to agree on anything. Most teams just want to consume the latest and greatest but don't want to invest effort in researching the different options. For such cases, having a trusted repository like the Docker AI Catalog makes a real difference.
If the enterprise already has an enterprise agreement with Docker, Inc., it becomes even less of a discussion. In those cases, the security team has already vetted Docker and approved its usage within the firm, so using the catalog simply becomes an extension of the same process.
In any case, when it comes to AI, the gap in the industry today is not a technology gap but a trust gap.
With so many companies coming up with the next big orchestrator technology, vector database, LLM, or SLM, nobody really knows whom they can trust with their client data and whom they cannot. Any enterprise-grade company that tries to close this gap by bringing authenticity to the consumption process will always have my support.