Patronus AI转发了
Had an awesome time last week hosting 100+ AI researchers and engineers from NVIDIA, Databricks, Meta, Palantir Technologies, and more! At Patronus AI, AI research is a big part of what we do. We've begun to develop an important 10-year research agenda, led by the one and only Rebecca Qian. More to come on this soon. When we started the company, we wanted to balance research and product. Product is how we quantify the true impact of research, so research should ultimately serve customers. This means that the research we do is "applied", in the purest sense of the word. We are usually bottlenecked by engineering skill and speed, not simply research ideation (and maybe we're bottlenecked by compute too sometimes ??). Applied research might not be for everyone. But the most exciting part about applied research is you get to see your ideas come to life so quickly. That feeling is irreplaceable. At Patronus AI, we created: - The first widely popular open source LLM judge (Lynx) - The first standardized domain-specific LLM benchmark (FinanceBench) - The first SLM judge to beat GPT-4o mini (Glider) - The first Multimodal LLM judge - The first explainable evaluations And it's still Day -1 for us. If you're excited to join our team and push the frontiers of AI evaluation, let’s talk! Apply here: https://lnkd.in/gQHxW-RR