?? We’re making AI smarter by tackling hallucinations with smarter sampling! How do you make AI models more reliable without blowing up costs? We explored a cutting-edge approach using LLM activations, linear probes, and active learning—and the results are exciting. We designed a sampling method (Log10-ACF) that allows us to select as few as 10 logs for the application developer to inspect for better hallucination detection across the entire application. This lightweight solution achieves state-of-the-art results while avoiding fine-tuning. The payoff? Lower annotation costs, faster deployment, and AI systems you can trust. Want the full breakdown of how we achieved this? Read our post to dive into the details and see how techniques like Log10-ACF could transform AI for businesses. #AI #MachineLearning #ActiveLearning #Innovation
关于我们
AI-powered LLMOps platform to improve and monitor LLM powered applications via logging, debugging, evaluations and fine-tuning. We train custom evaluation models on your data to make accuracy improvements 10x cheaper, 100x faster, and 1000x more scalable
- 网站
-
https://log10.io
Log10.io的外部链接
- 所属行业
- 软件开发
- 规模
- 2-10 人
- 总部
- San Francisco
- 类型
- 私人持股
- 创立
- 2023
地点
-
主要
US,San Francisco
Log10.io员工
动态
-
? Critical applications require higher recall; accuracy is an incomplete metric. For hallucination detection, recall and precision are the relevant metrics to characterize evaluation performance. Latent space readout, or LSR, provides an easily tunable knob for setting recall and precision to a configuration that’s appropriate for the application. For critical applications, a higher recall (hallucination detection rate) is desirable, even if that involves a tradeoff of lower precision (more “false alarms”), so long as the false alarm rate is tolerable. With LSR, the detection threshold can be set for a higher hallucination detection rate, such as in the last row of the table below–a knob that would otherwise have to be laboriously (and less effectively) tuned with prompt engineering. This is similar to the promise of more fine-grained control in model steering. Read more: https://bit.ly/3YSXRes #LLM #AI #LLMOps
-
?? Q: Why isn't there a single solution for LLM evaluation? There are a few reasons: ? Every company has different data/access ? Most evaluators require a lot of data to train ? Use cases vary In this video, Log10 Co-Founder and CEO Arjun Bansal breaks down what's needed to ensure LLM accuracy. ? #LLM #LLMOps
-
Trey Doig, Co-Founder and CTO of Echo AI chats with Arjun Bansal, Co-Founder and CEO of Log10, about navigating LLM accuracy issues and what it takes to deploy generative AI apps to enterprises. Watch today: https://bit.ly/3z2nCyr #GenAI #LLM #LLMOps
What It Actually Takes to Deploy GenAI Applications to Enterprises: Arjun Bansal and Trey Doig
https://www.youtube.com/
-
Why can't LLMs reliably self-evaluate accuracy? ?? Our CEO and Co-Founder, Arjun Bansal, provides an explanation in this quick video. #LLM #LLMOps
-
?? Now hiring: We are seeking a Founding Account Executive who is motivated and tech-savvy to join our growing team! You'll play a crucial role in driving our go-to-market strategy and expanding our customer base in the AI developer tooling space. If you're a self-starter with a passion for AI technology and a knack for leveraging cutting-edge AI-powered tools to optimize the sales process, we want to hear from you! Learn more about the role, including responsibilities and qualifications: https://bit.ly/3zbATVA #hiring #AI #LLMOps
-
Log10.io转发了
?? A game-changing session just concluded at the AI in Healthcare & Pharma Summit 2024 with Niklas Quarfot Nielsen, Co-founder & CTO of Log10.io! ?? AI-Powered Expert Review: Speeding Up Care Delivery - AI isn’t here to replace healthcare professionals—it’s here to empower them. But even with their expertise, human oversight can slow care delivery. Log10 tackles this challenge head-on with AI accuracy at its core. - Our LLMOps platform scales expert review, achieving domain-specific precision and helping professionals complete tasks faster and more effectively. - Demoed live: Our report-generation tool that learns from edits and corrections, continuously improving through a self-refining feedback loop. From tone to accuracy, this tool enhances reports while reducing manual effort. - Seamlessly integrated into workflows, Log10.io’s platform is driving better care, faster, with increased trust and reliability. AI is amplifying human expertise to transform healthcare delivery. It’s about precision, speed, and efficiency! ?? ?? Couldn't join us in person? Register for On-Demand Access to watch back all the presentations in your own time: https://hubs.ly/Q02Y6xlB0 #reworkAI
-
?? AI doesn’t replace healthcare professionals—it empowers them. That's why our experts are heading to the RE?WORK AI in Healthcare & Pharma Summit 2024 this week! Our CTO and Co-Founder Niklas Quarfot Nielsen will be presenting a talk on speeding up the delivery of care through AI-powered expert review. ?? Log10 focuses on AI accuracy. Our LLMOps platform scales expert review, achieving domain-specific precision while enabling experts to complete their work in a fraction of the time. In this talk, we’ll demo our report-generation tool, which scales expert review by learning from your edits and corrections. This tool automatically refines prompts and creates a self-improving feedback loop. Join us this week and talk to our experts about improving your services and products built on AI. #ReWorkAI #LLM #AI
-
In this video, Co-Founder and CTO of Log10 Niklas Quarfot Nielsen demonstrates how to measure the accuracy of an LLM application using a few pieces of human feedback and turning it on autopilot. #LLM #LLMOps #AI