OctoAI (now NVIDIA)

Software Development

Seattle, Washington · 14,082 followers

Run, tune, and scale the models that power AI applications.

View all 23 employees

About us

OctoAI is an efficient, customizable, and reliable generative AI platform for building and running production applications at the best price and performance. OctoAI puts customers in control with end-to-end solutions for text and media generation, with the ability to run open source models (e.g., Llama 3.1, SD3), custom models, or a mix of both. OctoAI's private deployment, known as OctoStack, allows customers to run generative AI in their own environment, whether on any cloud platform, in a VPC, or on-premises, offering full control over data. OctoAI is based in Seattle, Washington, and is backed by Madrona Venture Partners, Amplify Partners, Tiger Global, and Addition Capital.

Website: https://octo.ai
Industry: Software Development
Company size: 51-200 employees
Headquarters: Seattle, Washington
Type: Privately held
Founded: 2019
Specialties: machine learning, artificial intelligence, Stable Diffusion, SDXL, LLMs, and Generative AI

Products

OctoAI (now NVIDIA)

Data Science & Machine Learning Platforms

OctoAI is an efficient, customizable, and reliable generative AI platform for building and running production applications at the best price and performance. OctoAI puts customers in control with end-to-end solutions for text and media generation, with the ability to run open source models (e.g., Llama 3, Mixtral, SDXL), custom models, or a mix of both. OctoAI's private deployment, known as OctoStack, allows customers to run generative AI in their own environment, whether on any cloud platform, in a VPC, or on-premises, offering full control over data. OctoAI is based in Seattle, Washington, and is backed by Madrona Venture Partners, Amplify Partners, Tiger Global, and Addition Capital.

Locations

Primary

Northlake Way

Seattle, Washington 98101, US

Updates

OctoAI (now NVIDIA)

2 months ago
What's your favorite GenAI use case? How has generative AI made tasks in your life easier? Alyss Noland details her favorite in the clip below. Check out the full episode of AI Unscripted on YouTube: https://bit.ly/3AJuAJe

1 comment

OctoAI (now NVIDIA)

2 months ago
At OctoAI, we're committed to transparency and control through open-source innovations. We believe that cross-stack, interoperable services are the key to unlocking the full potential of GenAI. Our large multimodal inference engine is designed to be flexible and adaptable, supporting a full range of enterprise tools, data types, and use cases. It also supports the broader organizational needs of enterprise customers, from fine-tuning to evaluation and more. Contact us to request a no-cost proof of concept that demonstrates the value of the OctoAI Inference Engine. Read the full article here: https://bit.ly/47bfIzo
1 comment

OctoAI (now NVIDIA)

2 months ago
What if you could transcribe conversations in real time, extract key information, and generate summaries without writing a single word? Pedro T. shows the Electronic Health Record Demo, which has some key features:
- Real-time audio transcription
- Named Entity Recognition for crucial medical data
- Automatic summary generation
- Secure deployment using Snowflake Container Services
This solution goes beyond healthcare. Any industry requiring customer support or process documentation can customize it to fit its needs. Watch now: https://bit.ly/3z7AKm4

OctoAI (now NVIDIA)

2 months ago
"Before AI, only programmers were able to get computers to do what they wanted by writing arcane programming language texts. OctoAI was created to accelerate our path to that reality so that more people can use and benefit from AI. And people, in turn, can use AI to create yet more benefits by accelerating the sciences, medicine, art, and more." - Jason Knight
Thanks to Unite.AI for this great interview! Catch the full interview here: https://lnkd.in/es8Rnuky

Jason Knight is Co-founder and VP of ML at OctoAI - Interview Series - Unite.AI

https://www.unite.ai

1 comment

OctoAI (now NVIDIA)

2 months ago
As enterprises look to leverage generative AI models, they face unique challenges integrating sensitive data and ensuring security. Rodney Shetler breaks down some of the key concerns we've been hearing about the enterprise use case. Find out how you can run your choice of models in your own environment, whether on any cloud platform, in a VPC, or on-premises, ensuring full control over your data with OctoStack: https://bit.ly/3z0ZrAu. Watch the full Builder's Roundtable on Secure GenAI for the Enterprise on YouTube: https://bit.ly/3XoCCQx

1 comment

OctoAI (now NVIDIA)

2 months ago
Get more out of smaller open source models with fine-tuning! Watch the full video on YouTube to find out how a fine-tuned Llama 3.1 8B model can outperform a proprietary model like GPT-4o when it comes to cost and quality: https://bit.ly/3Z6dsYh

OctoAI (now NVIDIA)

2 months ago
What if we could give LLMs the power to make decisions, take actions, and go beyond chat interfaces? Function calling allows the model to decide when it needs additional information or when to use external functionality, bridging the gap between language understanding and real-world actions. Using Llama 3.1 8B and 70B, we demonstrate how you can use function calling in a customer support use case, streamlining your workflows and saving you valuable time. Read more here: https://bit.ly/4dKC4KC
OctoAI (now NVIDIA) reposted

Subho Majumdar, PhD

Co-founder and Head of AI, Vijil | AI Security and Safety Leader | Scientist, Author, Board Member, Advisor, Angel Investor
2 months ago
What does it mean to elicit trust in AI? In yesterday's OctoAI panel on secure GenAI for enterprises, I made the point that answering this question amounts to two things.

**Maximizing upside** Can we trust AI to attain desired outcomes reliably and generate value for us?

**Minimizing downside** Can we trust AI not to attain undesired outcomes that may result in financial or other loss?

The definition and metrics for desired and undesired outcomes will depend on the usage context of an AI system. Within a certain usage context,
- Business goals will dictate the minimum required desired outcome.
- Regulatory and compliance requirements plus ethical guidelines will inform the maximum acceptable undesired outcome (i.e., risk tolerance).

Eliciting trust in AI broadly amounts to achieving the balance between these two factors in a particular context.

---

Did that make sense? What desired outcomes do you prioritize most when it comes to AI, and what potential risks are you most concerned about? Share your thoughts in the comments below!

4 comments

OctoAI (now NVIDIA)

2 months ago
Inference should be designed to support the diverse needs of developers, from model tuning and evaluation to routing. OctoAI's multimodal inference engine delivers value to customers that goes beyond the model endpoint, aligning closely with the essential needs of enterprise customers:
- Inference as the cornerstone of the GenAI flywheel
- Balancing unit economics with system requirements for latency, throughput, and quality
- Customization capabilities to adapt a set of models to solve user problems
- Greater transparency and control driven by open source innovations
- Flexible and adaptable support for a full range of enterprise tools, data types, and use cases without increasing engineering overhead
Jared Roesch takes a deep dive into the OctoAI Inference Engine: https://bit.ly/3Xo0TGo

OctoAI: secure, reconfigurable, natively multimodal | OctoAI

octo.ai

OctoAI (now NVIDIA) reposted

Luis Ceze

VP at NVIDIA & Lazowska Endowed Professor at University of Washington
2 months ago (edited)
Check out the latest post from our CTO Jared Roesch that digs deep into the OctoAI Inference Engine. There are a lot of model endpoints out there, but what sets OctoAI apart is a relentless focus on building toward enterprise needs:
* Inference is the cornerstone of the GenAI flywheel, but it must be designed to support the full scope of developer needs, such as model tuning, evaluation, and routing.
* Balancing of unit economics with system requirements for latency, throughput, and quality.
* Customization capabilities to adapt a model or set of models to solve user problems.
* Greater transparency and control driven by open source innovations, optimized for the enterprise.
* Flexible and adaptable support for a full range of enterprise tools, data types, and use cases without increasing engineering overhead.
This philosophy has driven the development of the OctoAI platform since day one and enables us to deliver features that are unique in the market:
* Efficiently run a large number of PEFTs on a single node
* Deploy on diverse hardware, including legacy GPUs
* Leverage MLC-LLM for leading performance
* Large context sizes crucial for multimodality
* Configurable and flexible to support next-gen models
I am super proud of this effort and the whole team! Hope you enjoy learning more about it.

OctoAI: secure, reconfigurable, natively multimodal | OctoAI

octo.ai

Funding

OctoAI (now NVIDIA): 4 total rounds

Last round

Series C, December 1, 2021

US$85,000,000

Investors

Tiger Global Management + 2 other investors

Employees at OctoAI (now NVIDIA)

David Messina

Chief Marketing Officer and Business Development Leader

Jeff Nappi

Senior Manager, DGX Cloud @ NVIDIA - Building the cloud-first way to get the best of NVIDIA AI.

Janisha Anand

GenAI Product Lead - NVIDIA | ex-OctoAI | ex-AWS (SageMaker, Data Lakes)

Zachary Tatlock

Associate Professor at University of Washington

Similar pages

NVIDIA

Modular

Snorkel AI

Etched

Fixie.ai

Groq

Mistral AI

Madrona

Databricks Mosaic Research

Perplexity
