Will GenAI app vendors win even the Gen AI infra market

Will GenAI app vendors win even the Gen AI infra market

I published this article (https://www.dhirubhai.net/pulse/possible-profit-pools-gen-ai-stack-pramod-gosavi-vkeec/ ) last year speculating the profit pools in different parts of AI value chain. My bet was the LLM providers and Vertical apps will capture most of the value

  • Platform first or application first?

Infrastructure platform plays are seductive:? “hey, instead of fighting the war, let’s just sell the bullets!”. ?Another popular spin on this is “During a gold rush, sell shovels.”

Age old debate when dealing with a new platform or technology is what comes first - is it application or the platform? I think most platforms have evolved from a successful app, where someone woke up and said, “wow, let’s generalize this and build some APIs.”? Can you think of a successful platform that didn’t come about this way?

The problem is platforms take all of the risks of a startup, and double them.? You have to not only build a successful platform, but you have to convince folks to build (or build yourself) successful apps.? You may enjoy some artificial success at first, but unless you have app builders enjoying sustainable success, your platform won’t enjoy sustainable success. A platform play is an opportunity you earn the right to have.

Amazon/AWS is the best example of cloud platform/e-commerce platform. As the app became popular they started investing in the platform both on the infra side (which is today's AWS) and also on the fulfillment side (which is today's FBA - fulfillment by Amazon). On the social media side, Facebook could have become a platform but instead offering the new apps (marketplace, dating, etc.)

  • OpenAI: App or platform?

OpenAI was founded to build AGI (artificial general intelligence). ChatGPT, a chatbot app released in Nov 2022 was a research preview and a way to showcase the capabilities of GPT4 model they built internally. If OpenAI launched itself as a platform offering LLM models, customers would not know how to use it and its capabilities. By releasing ChatGPT OpenAI showcased the capabilities. Now customers are licensing its APIs but its limited to raw inference only on a prompt.

OpenAI is on its way to becoming a platform for LLM based apps creating a marketplace like Apple App Store, adding fine tuning, RAG support, but limited to "LLM wrapper" apps for now. It is not ready for enterprise apps now as they did not have an enterprise app yet

  • GenAI infrastructure stack

Frameworks such as LangChain exploded in popularity late 2022 as developers rushed to these frameworks to build GenAI apps. Vector Databases followed along with app frameworks such as Adept to build copilots/ workflows. We also saw a few "AI in the box" to help reduce your development time from months to weeks.

Soon a cottage industry of infrastructure startups grew in 2023.

This is my simple stack

  • Early GenAI apps

Glean: Was founded in 2019 (pre chatGPT era) as an AI powered enterprise search. Core technology was building connectors to data/information sources such as cloud apps + on premise apps/Sharepoint and running an indexed search. Used BERT models initially but switched to new LLM in 2022

Writer: Was founded in 2020 as an AI writing assistant for professional users. Core technology was brand compliance, collaboration as well as data privacy and security for the enterprise

  • Glean and Writer as a Platform?

Glean: Glean is built on five layers consisting of infrastructure, connectors, ?a governance engine, the company’s knowledge graph, and an adaptive AI layer. In order to connect to an enterprise’s applications and content repositories, Glean Chat uses its self-developed connectors to link to applications and data sources such as Salesforce, Zendesk, Jira, GitHub, Slack, Figma, Workday, Okta, Outlook, OneDrive, Google Drive, Box, Dropbox, SharePoint, as well as storage offerings from AWS, Google Cloud, and Microsoft among others.

The governance layer ensures that the generative AI follows an enterprise’s set boundaries and security policies such as identity and access management, the company said.

The knowledge graph layer, which the company has developed over the last few years, understands relationships between content and employees and internal language in an enterprise, Jain said, adding that “this enables Glean to recognize nuances like how people collaborate, how each piece of information relates to another, and what information is most relevant to each user.”

The knowledge graph layer is trained on an enterprise’s data along with large language models once it becomes a Glean subscriber, according to Jain.

The adaptive AI layer uses the information from the knowledge graph and runs it through LLM embeddings for semantic understanding and large language models for generative AI, the company said. LLM embeddings are vectors or arrays that are used to give context to artificial intelligence models, a process known as grounding. This process allows enterprises to avoid having to fully train or finetune AI models using the enterprise information corpus, said Bradley Shimmin, chief analyst at Omdia. Currently, Glean is using a mix of large language models including OpenAI’s GPT-4 and transformer models from Google, such as BERT

Writer: Writer is built on three layers - Knowledge Graph, AI guardrails, customized LLMs

Knowledge graph is a graph-based retrieval-augmented generation (RAG), achieves higher accuracy than traditional RAG approaches that use vector retrieval. Writer does not use vector databases

AI guardrails help ensure data security, regulatory compliance, brand reputation, and ethical AI usage by mitigating biases, providing verification systems, and ensuring alignment with enterprise values.

Palmyra, Writer's own family of LLMs are designed specifically for production workloads, with 72 billion parameters compared to GPT-4’s 1.8 trillion. Palmyra excels in specialized scenarios such as legal, medical, and mathematical reasoning, making it a versatile choice for various industries and use cases

Platform is born: both writer and glean recently announced platforms for customers to build their own apps on top of their existing platform built for the core app.



  • How are pure play Gen AI infrastructure vendors doing?

Infrastructure vendors such as in data management or vector databases or LLMops or frameworks or inferencing APIs have traction from customers building in house applications. They all have heavy POC/"experimental" usage, while some have production deployments but very early. Some are being used by GenAI SaaS vendors to quickly bring their apps to market instead of building these features in house.

  • Wrapping up: Large scaled app vendors have solved and scaled infra across real production use cases while pure play genAI vendors are dependent on enterprises and other apps deploying and scaling their apps in production. So if you were to build your next sales or marketing workflow app would you build it on top of a Glean platform or integrate 4-5 infrastructure vendors and build a platform in-house


Barkha Bhatia

Product Manager - Data Security, IAM, Platform as a Service

2 个月

This also reminds me how Steve Jobs launched Next computer without operating systems (platform) because he was in hurry to launch so it’s a very old trick to not invest in platform but then you need to be Jobs to pull it off

Barkha Bhatia

Product Manager - Data Security, IAM, Platform as a Service

2 个月

The platform makes the products and business stick! ServiceNow is an excellent example of a very sticky platform. The only thing I can add is that the sooner anyone shifts from apps to platforms (if that’s the vision), the better. Say the vision is to have 100 apps on platform; when someone reaches 10 percent, they should start building a platform before investing more in 90 other apps. I have seen companies aspiring to become platform-y and not remain a collection of disjointed apps but lacking the force to become one. Ultimately, they keep swinging between because of platform adoption and backward compatibility issues. There is no middle ground; you are either a platform company or not.

回复

Pramod, Great post.. I largely concur that application folks are ending up building their own stack because we are still discovering the various needs and no framework or infra completely addresses these evolving needs. The challenge is therefore the pending evolution and learning from it before we start seeing stable frameworks that works for a wide design space. Beyond that, I would break the applications into 2 design spaces. System 1 type agentic applications and System 2 type agents with more complex cognitive abilities. Our understanding of what's needed for System 1 type agentic applications is sort of becoming clear.. and we are moving to System 1 plus. The stack or approach would need to account for which design space is one building.

Muhammad S Tahir

ASK ME HOW CloudKitect can build a Generative AI platform for you in an hour? Cloud Architect as a Service differs from the usual DevOps as a Service.

2 个月

CloudKitect is another name in that list ??

回复
Addy Sharma ??

Cloud Security Architect | Azure & AWS Certified | SANS | IAM | CASB | CWPP | DLP | EDR | SIEM Expert ?? Cloud Security Assessments ?? Architecting Cloud Security Controls ?? Incident Response

2 个月

Interesting points raised. The interplay between infra and apps is crucial for growth. Would early app vendors create comprehensive platforms?

回复

要查看或添加评论,请登录

Pramod Gosavi的更多文章

社区洞察