登录查看更多内容

Inside the Vals.ai report: where does AI already beat lawyers?

Richard Mabey

CEO at Juro - intelligent contract automation

发布日期: 2025年3月4日

The Vals.ai report is an excellent piece of research and it raises really interesting questions for those of us using or building AI solutions in the legal space. I’ve linked it in the comments below and tagged its authors.

The report is a collaboration between some of the leading minds in this space, particularly the Legaltech Hub , courtesy of Nicola Shaver and Jereon Plink; and project lead Tara L. Waters , who I’m excited to say is joining our next webinar to discuss which AI solutions fit best for which tasks (sign up here - Agents vs Copilots: which works best for in-house legal?)

Let’s dig into the report. It evaluates the performance of four tools - CoCounsel , Vincent AI, Harvey Assistant from Harvey , and Oliver from Vecflow - across seven legal tasks:

Data extraction
Document Q&A
Document summarization
Redlining
Transcript analysis
Chronology generation
EDGAR research

AI performance was benchmarked against a ‘Lawyer Baseline’ - a control group assembled by Cognia Law and including Reed Smith LLP , Fisher Phillips , McDermott Will & Emery , and Ogletree Deakins (plus four more anonymous firms).

It’s great to see pioneering firms take the lead here and grasp the nettle of evaluating people vs AI. I believe that the firms and in-house teams who see this wave coming and ride it will thrive compared to those who pretend it won’t affect them.

So what are the toplines? Here are the summarised results:

The first thing to note is that AI is already outperforming the human lawyer baseline in a whole range of cases.?

The horse has bolted

For data extraction, document Q&A, summarization and transcript analysis, you can already buy multiple solutions (including Juro) that outperform a baseline set by some of the world’s best law firms.?

This underlines what many (including us) have been saying for some time, since Goldman Sachs first dropped that dramatic report a year or two ago. But to see it in black and white is still quite something.?

It’s hardly surprising though. If I think back to my time as an M&A lawyer, sitting in a windowless room with piles of paper documents and a clipboard, performing what we’d now call data extraction - finding relevant dates, values, change of control clauses and so on - I would have been astonished to see what is possible today.?

It makes sense that an AI so well-suited to text comprehension should outperform a single tired, inexperienced, junior lawyer like me. Our customers’ adoption of AI Extract validates this too - usage of that feature alone is growing more than 100% every month.

Similarly for document Q&A and summarization, these are exemplary applications of what generative AI can do.?

We saw in our webinar last week (The limits of AI: are there any legal tasks AI should never do? - watch on demand) that there are still pockets of healthy skepticism regarding AI in legal.

But if AI is decisively outperforming top law firms, in a matter of moments, for a fraction of the cost (and without value being measured in six-minute increments), I don’t see how that skepticism can hold for much longer.

AI can’t do everything - yet

That said, we are clearly at the bottom of the maturity curve for some applications of AI. For redlining, for those solutions brave enough to throw their hat in the ring (in this study, Harvey and Vincent), performance is some way behind the law firm baseline. And given the potential consequences of bad redlining, this is definitely a challenge for AI adoption.

Why is redlining so hard? The big difference vs text interpretation tasks is really the amount of context required to do that job well. If we compare extraction, if AI is asked to find an effective date on a vendor contract, it doesn’t really need any contex other than the document itself.

There are probably some numbers and letters that look like a date, near to the words ‘effective’ and ‘date’, or some simple reasoning that relates back to a date, and AI can figure it out.

But to redline a contract well, there’s so much you need to know:

your organisational risk appetite
preferred fallbacks
financial and commercial information
the limits of what you’re allowed to agree
supplier power
house drafting style

… and so on. Even nuanced factors like the real or perceived bargaining power of each side can have a material impact on how you mark up that document.

A lawyer who’s navigated not just that document but that professional scenario dozens or hundreds of times still has the edge…

… for now. Redlining is just at a different point on the maturity curve. As models get more powerful and solutions become more integrated, it’s not hard to imagine the lawyer advantage eroding.?

With integrations and APIs, it would be possible for AI to understand:?

previous documents and how they were drafted
the context of the Zoom calls during the sales negotiation
the sentiment analysis of the emails sent backwards and forwards
external data sources on financial performance

… and so on. Ultimately the context that AI is missing is just data it doesn’t have yet. Given the pace of development, I would bet on it having that data sooner than we think. It’s just a question of time, and then what will the delivery of legal advice look like?

Check out the Vals.ai report in the comments and do share your experiences of tackling these tasks with AI - which do you like, and which aren’t quite there yet?

ICYMI - we're hosting a webinar featuring results from this survey and our own which you can sign up by clicking below.

Brief Encounters

2,283 位关注者

Tara L. Waters

Multi-award winning Digital strategist | Legal innovator | Start-up adviser | Emerging tech evangelist

1 周

Thank you for the thoughtful analysis, Richard Mabey, and I am looking forward to sharing further thoughts and insights on 20 March: https://juro.com/events/agents-vs-copilots-which-ai-works-best-for-in-house-legal

2 次回应

James Farnfield

CEO @ Shake Content | LinkedIn B2B Brand and Content consultancy. Clients include Series B/C SaaS firms, VC-backed startups, and 7-figure software agencies.

1 周

Fascinating stuff, Richard! It's wild to think AI is already outpacing lawyers in some areas. What do you see as the next big step for the legal industry in adapting to this change? Not sure my lawyer is going to be too happy about this haha.

1 次回应

Richard Mabey

CEO at Juro - intelligent contract automation

1 周

https://www.vals.ai/vlair With thanks for the report's key contributors - Nicola, Jeroen, Tara L., Stephanie, Cate, Arthur S., Christian, Sean, Rebecca, Kyle Turner, Emily Nick (Plus anybody I have missed)

7 次回应

查看更多评论

要查看或添加评论，请登录

Richard Mabey的更多文章

Agents vs Copilots: which do you need?

2025年2月19日

Agents vs Copilots: which do you need?

There’s a huge amount of talk about agents right now. But what’s the difference between an agent and a copilot in the…

5 条评论
Here’s what I’d do if I was starting as a GC in 2025

2025年2月6日

Here’s what I’d do if I was starting as a GC in 2025

? Global pandemic ? Inflation crisis ? AI revolution Lawyers have had an eventful few years. COVID forced 5 years of…

16 条评论
AI startup CEO vs UK AI Action Opportunities Plan: in detail

2025年1月17日

AI startup CEO vs UK AI Action Opportunities Plan: in detail

It’s not every day that the UK government pins its plan for economic growth and national revival on the technology you…

11 条评论
Lawyers don’t just want faster contracting from AI. They want something far bigger

2025年1月9日

Lawyers don’t just want faster contracting from AI. They want something far bigger

Is this the year the jobs bomb goes off? (Spoiler - no). The abstract idea that AI will replace jobs suddenly became…

7 条评论
Does using AI at work make you look lazy?

2024年12月12日

Does using AI at work make you look lazy?

Let’s talk about Slack’s Autumn 2024 Workplace Index. The 10,000-person survey found that AI adoption amongst desk…

4 条评论
Why would anyone become a lawyer in 2024?

2024年10月30日

Why would anyone become a lawyer in 2024?

If Goldman Sachs is right and 45% of legal tasks can be automated, then by taking the traditional route - university…

17 条评论
Juro’s vision for intelligent contract automation

2024年9月13日

Juro’s vision for intelligent contract automation

We started Juro to help the world agree faster. Frustrated by the absurdly slow process of taking a contract from…

1 条评论
Can AI help lawyers avoid burnout?

2024年7月3日

Can AI help lawyers avoid burnout?

What do AI and burnout have in common? Well, they’re both facts of life in the 2024 legal profession. The difference is…

5 条评论
In-house lawyers’ AI usage revealed - but are you a truck or a hybrid?

2024年6月11日

In-house lawyers’ AI usage revealed - but are you a truck or a hybrid?

The results from our annual survey of Juro community members are out (read them in full at the State of In-house 2024:…

2 条评论
Reflections on Scaleup GC: bridging the AI trust gap

2024年5月28日

Reflections on Scaleup GC: bridging the AI trust gap

In a week or two we’ll be releasing the findings of our annual survey, and boy have things moved quickly when it comes…

8 条评论

See all articles

The horse has bolted

AI can’t do everything - yet

Brief Encounters

2,283 位关注者

Richard Mabey的更多文章

Agents vs Copilots: which do you need?

Here’s what I’d do if I was starting as a GC in 2025

AI startup CEO vs UK AI Action Opportunities Plan: in detail

Lawyers don’t just want faster contracting from AI. They want something far bigger

Does using AI at work make you look lazy?

Why would anyone become a lawyer in 2024?

Juro’s vision for intelligent contract automation

Can AI help lawyers avoid burnout?

In-house lawyers’ AI usage revealed - but are you a truck or a hybrid?

Reflections on Scaleup GC: bridging the AI trust gap