Cerebrium

软件开发

Enabling businesses to build and deploy ML products quickly and easily. Creating a AWS Sagemaker alternative.

关注

查看全部 6 位员工

关于我们

Cerebrium is a platform to deploy machine learning models to serverless GPUs with sub-5 second cold-start times. Customers typically experience 40% in cost savings when compared to using traditional cloud providers and can scale models to more than 10K requests per minute with minimal engineering overhead. Simply write your code in Python and Cerebrium takes care of all infrastructure and scaling. Cerebrium is being used by companies and engineers from Twilio, Rudderstack, Matterport and many more.

网站: https://cerebrium.ai/
Cerebrium的外部链接
所属行业: 软件开发
规模: 2-10 人
总部: Remote
类型: 私人持股
创立: 2021
领域: AI和Machine Learning

地点

主要

US，Remote

获取路线

Cerebrium员工

查看全部员工

动态

Cerebrium

1,142 位关注者
3 年已编辑
举报此动态
We're excited to announce that we're part of the?Y Combinator W22 batch! Sign up for early access here: https://www.cerebrium.ai/ Y Combinator is the most renowned startup accelerator in the world and has backed companies like?Airbnb,?Stripe,?Coinbase?and?Reddit, Inc.?and we’re honored to be joining this community of founders. We would like to thank our group partners, Tim Brady, Dalton Caldwell, Harj Taggar, and Tom Blomfield for helping us on this journey as well as Charl Jacobs and our customers that continue to believe in us and have shown us continued support. Please follow us as we work to enable businesses to be data and AI-driven without a single line of code or an entire data team. Additionally, if you want to be a part of something special, check out our careers page below (link in comments)
3 条评论

赞评论分享
Cerebrium

1,142 位关注者
5 天前
举报此动态
The way we write code is changing faster than ever - and with tools like?v0.dev, bolt.new, Lovable or Cursor, we get to be a part of that revolution. But, have you ever wondered how they all work under the hood? Have you ever wanted to build your own AI coding assistant? We've just published a tutorial to help you do just that. In the tutorial, we break down how to build a system that: - Plans frontend component architecture - Generates code in real-time based on your commands - Deploys your code to a secure sandboxed environment (On E2B) for testing You can use any LLM that fits your needs - We're using Qwen 2.5. The guide covers everything: model setup, prompt engineering, streaming responses, and deployment; It's all built on top of Cerebrium and E2B (Both have generous free tiers). Demo:?https://lnkd.in/dBzrXFc4 Tutorial:?https://lnkd.in/dfiEajFW Source Code:?https://lnkd.in/dQh5RUjZ #AICodeGeneration #DeveloperTools #ai

赞评论分享
Cerebrium

1,142 位关注者
1 周已编辑
举报此动态
We’ve built a real-time AI commentator that delivers instant analysis and dynamic narration for live events. This demo, featuring a 1980s radio-style voice, provides basketball insights in just ~600ms, showcasing what’s possible in media and entertainment. Powered by Cerebrium for scalable AI inference, LiveKit for real-time video streaming, and Cartesia for expressive voice synthesis, this system pushes the boundaries of AI-driven broadcasting. While there’s room for improvement, this is a glimpse into the future of real-time AI commentary. ?? Demo here: https://lnkd.in/dcR4j8Kg Full tutorial here: https://lnkd.in/dWcXWRJD Source code here: https://lnkd.in/dvmFz-UZ

2 条评论

赞评论分享
Cerebrium

1,142 位关注者
3 周
举报此动态
Our clients keep asking for lower latency and higher throughput when it comes to LLMs for various use cases. Using their optimized GPU kernerls from nCompass, vLLM can run 3x more efficiently! Glad to have collaborated with the team at to make this possible!

Aditya Rajagopal PhD

Co-Founder at nCompass Technologies | YC W24
3 周

?? With nCompass x Cerebrium, you can enhance the efficiency of ?????????? ?????? ???????????? ?????????????? ???? ???????? ???? ????, without compromising accuracy. That's right—you can integrate nCompass's library of optimized GPU kernels within your existing inference engine (eg. vLLM), eliminating migration pains! nCompass's optimization approach focuses on building a library of AI inference operators, utilizing techniques like operator fusion to minimize global memory reads and writes. ?????? ????????????????: - ???? ???????????????? ????????????????????: Process 3x more tokens per second, enabling support for more users on existing infrastructure. - ??.???? ?????????????? ????????-????-??????????-??????????: Users receive responses 4.6x faster on average, improving engagement. - ??.???? ?????????? ??????-????-?????? ??????????????: Users experience a 2.1x reduction in wait time for complete responses, boosting satisfaction. Together with Cerebrium's advanced serverless infrastructure, which offers low cold start times and a variety of GPU options, we're making it simpler, faster, and more cost-effective to deploy models like Llama 70B at scale. Discover how this collaboration can accelerate your AI initiatives by reading our blog post. Link in comments! For more information, comment below or reach out to us at [email protected] or the Cerebrium team at [email protected]. Huge thanks to Michael Louis and his team at Cerebrium for making this collaboration possible. #AI #MachineLearning #nCompass #CerebriumAI #Llama70B #vLLM #GPUOptimization #AIInfrastructure #TechInnovation

1 条评论

赞评论分享
Cerebrium转发了
Kwindla Hultman Kramer
3 个月
举报此动态
In June we shipped the (then) world's fastest voice AI agent — 500ms voice-to-voice latency including network transport. The (not-so-) secret sauce behind that demo was Pipecat + Cerebrium's serverless AI infrastructure + Daily's WebRTC infrastructure. The team at Cerebrium is live on Product Hunt today. They've built infrastructure that is super impressive: very fast cold-start times, access to a bunch of different GPUs, support for realtime AI, seamless scaling. I've learned a *lot* from working with Cerebrium over the past few months on customer-facing deployments and several fun demos. Go check out what Cerebrium is doing if you're interested in deploying your own AI models and services! https://lnkd.in/gntv8GsF

Cerebrium - Serverless infra for AI/ML apps - Build faster and cheaper | Product Hunt

producthunt.com

2 条评论

赞评论分享
Cerebrium转发了
Anh-Tho Chuong

CEO @ Lago
3 个月
举报此动态
?????? I love merch ?? ?? Thanks for the hat ("Built different") Michael Louis, and above all big shoutout to the Cerebrium team, who just launched on Product Hunt! They’ve built a serverless AI infrastructure platform with ultra-low cold start times, support for real-time applications, and access to 12+ types of GPUs. It’s a game-changer for building and scaling AI applications effectively. If you believe in supporting an incredible team on a mission, please check out Cerebrium on Product Hunt and drop an upvote! Proud of what they’re doing, honored to have them as users of Lago as well!! Link to vote in the 1st comment!
6 条评论

赞评论分享
Cerebrium转发了
Tom Shapland

YC Alum. Mixpanel for Voice AI. @tom_shapland
3 个月已编辑
举报此动态
There are two parts of Cerebrium's Product Hunt launch that feel like magic. 1. The real-time diffusion model image generation. Every word Michael Louis types into the prompt instantly creates a new image. 2. How easy it is to set up GPU inference. Maybe magic isn't the right word. It feels like cheating. Check out the link in the comments.
3 条评论

赞评论分享
Cerebrium转发了
Michael Louis

Founder at Cerebrium (YC W22) | Prev CTO at OneCart (Acquired by Walmart)
3 个月已编辑
举报此动态
?? Cerebrium is Live on Product Hunt! ?? Today marks a big milestone for our team! Cerebrium, our serverless AI infrastructure platform, just went live on Product Hunt, and we’d be so grateful for your support and upvote. ?? From humble beginnings in South Africa to competing on a global stage and going up against giants with hundreds of engineers, we’ve had our share of long nights and big questions about whether we were crazy to take this on. But seeing the impact we make for our customers and how big the opportunity is in front of us — reminds us why we’re here. Cerebrium makes it easy to build, deploy, and scale AI applications with low cold start times, over 12 GPU types to choose from, support for large-scale batch jobs, real-time voice applications, and more. ?? Today, we’re proudly supporting companies and engineering teams across every continent, in many industries. If you’ve ever believed in underdogs or want to see us change the way business build and implement AI, we’d love an upvote, a comment, or a review! Upvote us here: https://lnkd.in/dA-qH8Ch Thank you to our early customers who’ve believed in us from the start — this launch is for you. ?? #startup #AI #machineLearning #productHunt

Cerebrium - Product Information, Latest Updates, and Reviews 2024 | Product Hunt

producthunt.com

10 条评论

赞评论分享
Cerebrium转发了
Cartesia

7,475 位关注者
3 个月
举报此动态
Cerebrium built a demo that lets you practice everything from handling angry customers to preparing for YC interviews – with AI voices that respond as fast as humans do. Key capabilities powered by Cartesia:? ? Sub-100ms voice generation (fastest in the market) ? End-to-end responses in under 500ms ? Ultra-realistic voices (ranked the most human-like over every alternative by Artificial Analysis) ? Dynamic emotion & speed control ? Natural conversation handling Built in collaboration with:? Cerebrium - Serverless AI infrastructure? Tavus - AI avatars? Mistral AI - Language model Huge kudos to the incredible Cerebrium team for pushing the boundaries of what's possible with AI voices! ?? Try the demo and read the full deep-dive - link in comments ??

Cerebrium and Cartesia Demo

3 条评论

赞评论分享
Cerebrium

1,142 位关注者
3 个月
举报此动态
In preparation for our Product Hunt launch on the 11th, we're giving away awesome Windbreakers to those who contribute to our examples Github repo! In order to submit an example: 1. Fork the repo 2. Add your example and let us know here or on our communities 3. Get your PR merged 4. Fill out the shipping form We are giving away bonus swag for blog posts or tutorial videos! Looking for some ideas? - RAG app that can chat with a PDF document? - Someone say Discord Bot? - TensorTRT example - Using Blender to render 3D graphics - Integrating Cerebrium with LlamaIndex or Tailscale - Realtime SDXL image generator using WebSockets Subscribe and be notified of PH Launch here: https://lnkd.in/dsXkRGfS Examples repo: https://lnkd.in/dX8qG99u
1 条评论

赞评论分享

相似主页

查看职位

融资

Cerebrium 共 1 轮

上一轮

种子前 2022年2月15日

US$500,000.00

在 Crunchbase 上查看更多信息

登录看看您认识Cerebrium的哪些人

Cerebrium

软件开发

Enabling businesses to build and deploy ML products quickly and easily. Creating a AWS Sagemaker alternative.

关于我们

地点

Cerebrium员工

Dan Marcus

Gaming and Data

Kyle Gani

AI | Product | Developer experience

Eli Roussos

Founding ML Engineer @ Cerebrium (YC W22)

Michael Louis

Founder at Cerebrium (YC W22) | Prev CTO at OneCart (Acquired by Walmart)

动态

Cerebrium - Serverless infra for AI/ML apps - Build faster and cheaper | Product Hunt

producthunt.com

Cerebrium - Product Information, Latest Updates, and Reviews 2024 | Product Hunt

producthunt.com

Cerebrium and Cartesia Demo

立即加入，查看您错过的职场动态

相似主页

OneCart

Daily

Decart

Cartesia

Deepgram

Tavus

Sxuirrel

Kyutai

Suger

Mistral AI

查看职位

顾问职位

营销主管职位

融资