"LLMs and RAGs are powerful, but without a strong evaluation pipeline, you're flying blind." Here at Comet, our team has been obsessing over what it takes to build, test, and ship a GenAI application you can truly rely on. Check out these six critical LLM evaluation steps from founder and CEO Gideon Mendels, and try them for yourself with our open source eval tool, Opik: https://lnkd.in/dW4D6xMt
About us
Comet is an end-to-end model evaluation platform built with developers in mind. Track and compare your training runs, log and evaluate your LLM responses, version your models and training data, and monitor your models in production — all in one platform. Backed by thousands of users and multiple Fortune 100 companies, Comet provides insights and data to build better, more accurate AI models while improving productivity, collaboration, and visibility across teams.
- Website
-
https://www.comet.com
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- New York, NY
- Type
- Privately Held
- Founded
- 2017
- Specialties
- Machine Learning, Data Science, Developer Tools, and Software
Products
Comet
Data Science & Machine Learning Platform
Comet provides an end-to-end model evaluation platform for AI developers, with best-in-class LLM evaluations, experiment tracking and production monitoring.
- Debug and evaluate your LLM applications with Opik
- Track and visualize your training runs with Experiment Management
- Monitor ML model performance in production with Production Monitoring
- Store and manage your models with Model Registry
- Create and version datasets with Artifacts
The best part? Comet is free for individuals and academics!
Locations
-
Primary
100 6th Ave
New York, NY 10013, US
Employees at Comet
Updates
-
Comet reposted this
Here is a guide on how to evaluate your LLM's performance using OpenAI's Python API. The guide uses Opik, an open-source platform for evaluating, testing, and monitoring LLM applications. Guide: https://lnkd.in/eSKdK884
Use this to:
- Detect hallucinations
- Evaluate RAG applications
- Determine answer relevance
- Measure context recall
- Create and store test cases
- Integrate it with your CI/CD pipeline using Pytest
-
We're excited to connect with the SF developer community next month alongside Yujian Tang, Pinecone & Komodo AI. RSVP: https://lu.ma/x8lcld46
In just a few weeks, we'll be back in SF with another round of Awesome AI Dev Tools, this time with Pinecone, Comet, and Komodo AI. Along with that, as part of our mission to provide developers a complete view of the AI Dev Tool ecosystem, we've got our 2 minute demos as usual. The first five are:
- TrustGraph
- CloudAEye
- CopilotKit
- API-Rex
- Stably AI
Come join 150+ devs who've already signed up! RSVP below.
-
Today, our Head of Research, Douglas Blank, (virtually) joined Swarthmore College students to discuss the impact of open-source software, sharing some behind-the-scenes stories from projects like Comet’s Opik and Kangas. Always exciting to see the next generation of OSS developers in the making!
-
Building an LLM application is tough, especially when you don't have a way to confidently test its performance. In this code tutorial, you'll learn how you can leverage Opik’s native OpenAI integration to have more visibility into your LLM workflows:
- Log traces and spans
- Manually annotate LLM responses
- Automatically score responses using Opik's out-of-the-box evaluation metrics
-
We’ll be joining Intel Software at their Seattle Developer Meetup this Thursday, November 14th! It's shaping up to be a great opportunity to connect with fellow AI developers and participate in workshops focused on AI PCs and MLOps best practices. We'll also be chatting about #Opik, our open-source LLM eval framework that's quickly gaining momentum in the #GenAI space. If you're in the area, we’d love for you to join us. You can register here: https://intel.ly/3YDmoCJ #AI #MLOps #LLMEval
-
Comet reposted this
Opik Weekly Changelog
Highlight of the Week: We just launched a Prompt Library! Now you can easily create and manage your prompts in the Opik platform. But there's a twist! We heard time and time again that folks preferred to have their prompts stored in code, so we built functionality to automatically sync the prompts stored in code with the Opik library. Just wrap the prompt with the `Prompt` class and you're done.
We also released:
- Bedrock integration
- SDK method to search and return spans
-
Opik has officially hit 2,000 GitHub Stars! For the last several months, the Comet team has been working on our open source LLM evaluation framework, Opik. And since releasing it in September, we’ve been absolutely blown away by the support the ML community has sent our way. Since launching, we’ve had:
- Dozens of feedback sessions, feature requests, opened issues, and PRs from individual practitioners
- Integrations from our friends in the open source community, including LiteLLM (YC W23) and Ragas
- Tutorials, projects, and even an entire course (by the amazing Elvis S., no less) dedicated to running LLM evaluations with Opik
And all of this has culminated in 2,039 GitHub Stars, 530 closed PRs, and thousands of users, from hobbyists to researchers to engineers at Fortune 500 companies, using Opik to build their LLM evaluation pipelines. Thank you all so much for your support so far. We're so excited to continue building with you!
-
Join us and Intel Corporation on November 14th for a hands-on developer workshop in #Seattle!
- Dive into AI application development and MLOps best practices
- Connect with fellow developers over drinks
- Learn about #opik, the open-source LLM eval framework trusted by top GenAI teams
Grab your spot: https://lu.ma/er85bzb7
-
Exciting to see that the LLM Engineer’s Handbook is now live! This essential guide equips developers with a full framework for building #LLM systems, and we’re proud to see #Opik included as a tool in their LLMOps tutorials. Big congrats to Paul Iusztin and Maxime Labonne for delivering this incredible resource. It couldn’t have come at a better time for the #GenAI community. Find limited-time discount links in the comments!