Exciting insights into Neural Magic leadership! Mark Kurtz, CTO of Neural Magic, shares his journey to a successful AI career, the importance of hands-on expertise, understanding AI’s limitations, and the latest advancements in neural network efficiency. He also highlights the power of mentorship and open-source projects in democratizing AI education. Catch his key takeaways on creating impactful AI solutions: https://lnkd.in/eBdZyUYe #AI #CareerGrowth #MachineLearning #TechLeadership #AIInnovation
Neural Magic
Software Development
Somerville, Massachusetts · 16,360 followers
We are on a mission to bring open-source LLMs and vLLM to every enterprise on the planet. The future of AI is open.
About us
Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.
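For a sense of what "software-delivered AI" looks like in practice, here is a minimal sketch of running a sparsified model on CPU with Neural Magic's open-source DeepSparse runtime. The task and SparseZoo model stub below are illustrative assumptions, not a specific recommendation.

```python
# Minimal sketch: CPU inference with a sparsified model via DeepSparse.
# Assumes `pip install deepsparse`; the task and model stub are illustrative
# assumptions, not an endorsement of a particular checkpoint.
from deepsparse import Pipeline

# Create a sentiment-analysis pipeline backed by a pruned + quantized model.
pipeline = Pipeline.create(
    task="sentiment-analysis",
    model_path="zoo:nlp/sentiment_analysis/obert-base/pytorch/huggingface/sst2/pruned90_quant-none",  # hypothetical stub
)

print(pipeline("Sparse models can run fast on commodity CPUs."))
```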
- Website
- https://neuralmagic.com/
External link for Neural Magic
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- Somerville, Massachusetts
- Type
- Privately Held
- Founded
- 2018
- Specialties
- machine learning, deep learning, and artificial intelligence
Locations
- Primary
55 Davis Sq
Floor 3
Somerville, Massachusetts 02144, US
Employees at Neural Magic
Updates
-
Neural Magic reposted this
Ever wondered how your LLM deployments perform under real-world conditions? Curious if there’s a better hardware configuration to optimize your model’s performance? Or what the financial impact of different deployment strategies could be, and how to make smarter decisions to reduce costs? We've been there too, and that's why our team at Neural Magic developed GuideLLM – an open-source tool that helps you answer all these questions and more! Feel free to check it out at https://lnkd.in/dNnmwG6r
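To make the idea concrete, here is a rough sketch of the kind of measurement GuideLLM automates: timing completions against an OpenAI-compatible vLLM server. The endpoint URL, model name, and prompt are assumptions for illustration; GuideLLM itself handles load patterns, rate sweeps, and reporting for you.

```python
# Rough sketch of the measurement GuideLLM automates: time completions against
# an OpenAI-compatible server (e.g., one started with `vllm serve <model>`).
# The URL, model id, and prompt below are illustrative assumptions.
import time
import requests

URL = "http://localhost:8000/v1/completions"  # assumed local vLLM server
payload = {
    "model": "meta-llama/Meta-Llama-3.1-8B-Instruct",  # assumed model id
    "prompt": "Summarize the benefits of efficient LLM inference.",
    "max_tokens": 128,
}

latencies = []
for _ in range(10):
    start = time.perf_counter()
    resp = requests.post(URL, json=payload, timeout=120)
    resp.raise_for_status()
    latencies.append(time.perf_counter() - start)

print(f"mean latency: {sum(latencies) / len(latencies):.2f}s over {len(latencies)} requests")
```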
-
Excited to be at the #AIConf2024! It’s packed with innovative solutions and talks. Come by and say hello to our team!
-
The recording from last week's vLLM office hours is ready! In this session, Tyler Michael Smith shared how to leverage NVIDIA CUTLASS for high-performance inference in #vLLM. Michael Goin then covered the exciting updates in vLLM v0.6.0 that led to a 2.7x throughput increase and a 5x latency improvement. Watch the recording: https://lnkd.in/eq5RWA6K View the slides: https://lnkd.in/e9kPK4QA Explore and join our future bi-weekly vLLM office hours: https://lnkd.in/euF8m73q
vLLM Office Hours - Using NVIDIA CUTLASS for High-Performance Inference - September 05, 2024
https://www.youtube.com/
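For anyone new to vLLM, here is a minimal offline-inference sketch of the code path these kernels and v0.6.0 improvements sit behind; the model id and quantization choice are illustrative assumptions, not part of the office hours material.

```python
# Minimal vLLM offline-inference sketch. Assumes `pip install vllm` and a CUDA
# GPU; the quantized checkpoint named below is an illustrative assumption.
from vllm import LLM, SamplingParams

llm = LLM(model="neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8")  # assumed checkpoint
params = SamplingParams(temperature=0.7, max_tokens=64)

outputs = llm.generate(["What does CUTLASS provide for GPU inference?"], params)
for out in outputs:
    print(out.outputs[0].text)
```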
-
Neural Magic reposted this
Calling all vLLM users and contributors: there's a new track dedicated to vLLM at Ray Summit. Expect great talks from companies including Apple, Uber, Handshake, Databricks, and Hinge, plus vLLM contributor talks on speedups from quantization (Robert Shaw, Michael Goin) and multi-modality (Roger Wang). The conference runs from Sep 30-Oct 2. Reach out if you have questions :) #RaySummit2024 https://lnkd.in/g8ZhnkyY
Anyscale | Ray Summit 2024
raysummit.anyscale.com
-
Excited to attend the AI Conference at Pier 27 in San Francisco this week, September 10-11. Don't miss Nir Shavit, MIT Professor and our Co-Founder at Neural Magic, as he presents "The Role of Sparsity in ML" at 9:05 AM on Wednesday in the City View room. Nir will explore how sparsity in neural networks can improve efficiency without impacting accuracy. A must-see talk on using parallel algorithms to unlock the full potential of sparsity in machine learning. Looking forward to connecting with fellow AI innovators and practitioners. Let us know you are attending by commenting on this post - we'd love to meet up! #AIConf2024 #MLInnovation https://aiconference.com
The AI Conference 2024 - Join The AI Community
https://aiconference.com
-
Our team open-sourced GuideLLM, a tool for evaluating and enhancing LLM deployments! GuideLLM simulates real-world inference to gauge the performance, resource needs, and cost implications of deploying LLMs across different hardware configurations. Key GuideLLM features and benefits include:
- Performance Evaluation: Test LLMs under real-world conditions
- Resource Optimization: Find the best hardware setups for effective inference
- Cost Efficiency: Optimize resources and cut expenses
- Scalability Testing: Ensure models handle high user loads
And this is just the beginning! We're actively enhancing GuideLLM with an intuitive UI and new features like deployment cost analysis and accuracy evaluation. Your feedback is crucial as we continue to push AI efficiency. Explore the GitHub repo and share your thoughts! https://lnkd.in/gUWMfhGE
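As a rough illustration of the cost angle (this is not GuideLLM's own reporting), here is how measured throughput translates into serving cost; the GPU price and throughput figures below are made-up assumptions.

```python
# Back-of-the-envelope cost estimate of the kind GuideLLM's benchmarks feed into.
# All numbers below are made-up assumptions for illustration only.
GPU_COST_PER_HOUR = 2.00          # assumed $/hour for a single GPU instance
THROUGHPUT_TOKENS_PER_SEC = 900   # assumed sustained output tokens/sec at target latency

tokens_per_hour = THROUGHPUT_TOKENS_PER_SEC * 3600
cost_per_million_tokens = GPU_COST_PER_HOUR / tokens_per_hour * 1_000_000

print(f"~${cost_per_million_tokens:.2f} per 1M output tokens at this configuration")
```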
-
Happening this Thursday! Join our vLLM office hours for the latest vLLM updates, an in-depth session on NVIDIA CUTLASS, and an interactive Q&A/feedback session: https://hubs.li/Q02Nr3W-0
-
Neural Magic reposted this
GuideLLM Released by Neural Magic: A Powerful Tool for Evaluating and Optimizing the Deployment of Large Language Models (LLMs)
GuideLLM is a comprehensive solution that helps users gauge the performance, resource needs, and cost implications of deploying large language models on various hardware configurations. By simulating real-world inference workloads, GuideLLM enables users to ensure that their LLM deployments are efficient and scalable without compromising service quality. This tool is particularly valuable for organizations looking to deploy LLMs in production environments where performance and cost are critical factors.
Key Features of GuideLLM:
- Performance Evaluation: GuideLLM allows users to analyze the performance of their LLMs under different load scenarios. This feature ensures the deployed models meet the desired service level objectives (SLOs), even under high demand.
- Resource Optimization: By evaluating different hardware configurations, GuideLLM helps users determine the most suitable setup for running their models effectively. This leads to optimized resource utilization and potentially significant cost savings.
- Cost Estimation: Understanding the financial impact of various deployment strategies is crucial for making informed decisions. GuideLLM gives users insights into the cost implications of different configurations, enabling them to minimize expenses while maintaining high performance.
- Scalability Testing: GuideLLM can simulate scaling scenarios to handle large numbers of concurrent users. This feature is essential for ensuring the deployment can scale without performance degradation, which is critical for applications that experience variable traffic loads.
Read our full take on this: https://lnkd.in/gsGBGjpj
GitHub: https://lnkd.in/gfQeCA-x
Neural Magic Mark Kurtz Saša Zelenović Alex Matveev Nicole Kim Robert Shaw Brian Stevens #opensource #ai #llms
-
Neural Magic reposted this
Join the Future of Efficient LLM Deployments with GuideLLM! Exciting news! With Neural Magic, we're thrilled to introduce GuideLLM, an open-source tool that simplifies evaluating and optimizing LLM deployments for everyone. Whether you're developing chatbots, summarization tools, or any other LLM application, GuideLLM offers crucial insights to streamline your development process.
Key features and benefits:
- Performance Evaluation: Ensure your LLMs perform under real-world conditions.
- Cost Efficiency: Optimize deployments to save resources and cut expenses.
- Scalability Testing: Get your models ready to handle high user loads.
We're actively enhancing GuideLLM with an intuitive UI and more features like accuracy evaluation. Your feedback and contributions are vital as we push the boundaries and ensure its benefits reach every user and company. Explore the GitHub repository, test it out, and join us on this journey of innovation! Check out the project here: https://lnkd.in/ef8MSJWQ #opensource #efficientai #llms #community #llama31