⚡

Technology & InnovationNeutral

90% confidence

AI Outperforms Law Professors in Legal Reasoning Study

A Stanford-led study found that AI-generated answers to contract law questions were preferred over human-written ones 75% of the time. Google's Gemini 2.5 Pro and NotebookLM outperformed professors, flagged less often for harmfulness, suggesting LLMs align with professional legal standards.

Jun 3, 2026, 8:40 PM UTCDecryptJason Nelson

Quick Take

AI answers preferred by law professors 75% of the time in 2,918 comparisons.

Gemini 2.5 Pro won 75.92% of matchups against human instructors.

AI flagged harmful less (3.41%) vs. human (12.06%) responses.

Results indicate LLMs can meet professional legal standards.

Market Impact Analysis

Neutral

Article discusses AI legal reasoning, not directly impacting crypto markets.

Timeframeshort

Speculation Analysis

Factuality95/100

RumorsVerified

Speculation Trigger10/100

MinimalExtreme FOMO

Key Takeaways

AI-generated contract law answers were preferred by law professors 75% of the time in a blinded study.
Google's Gemini 2.5 Pro won 75.92% of matchups against human instructors across 2,918 comparisons.
AI responses had a harmfulness rate of just 3.41%, far below the 12.06% for professor-written answers.
The results indicate LLMs align with professional legal standards, not just simple fact-based tasks.

AI Preference Rate75%over human answers

Blind Comparisons2,918total evaluations

Gemini Win Rate75.92%against humans

AI Harmfulness3.41%vs 12.06% human

What Happened

A Stanford-led study put AI to the test on contract law—and the machines won. Sixteen professors from 14 top U.S. law schools wrote 40 questions spanning doctrine, case law, and policy. In 2,918 blinded comparisons, they consistently chose AI-generated answers over those from their human peers. Google’s Gemini 2.5 Pro and NotebookLM both surpassed humans, with win rates above 74%. The findings mark a stark shift: AI can now match—and often exceed—professional legal reasoning, not just basic Q&A.

The Numbers

Gemini 2.5 Pro won 75.92% of its matchups, while NotebookLM took 74.75%. The AI harmfulness rate was just 3.41% for Gemini, versus 12.06% for human instructors. Analysis showed AI outperformed across all question types—recall, hypotheticals, and policy discussions. Even after controlling for writing style, the LLMs’ edge held up, confirming substance over surface. The study also tested Claude Opus 4.7 and ChatGPT 5.4; both beat the human average, leaving no doubt about the trend.

Why It Happened

Law isn’t about single right answers—it demands judgment, ambiguity handling, and defensible reasoning. The researchers designed the evaluation to mirror that reality. LLMs succeeded because they aligned with shared disciplinary criteria, not just individual taste. Inter-professor agreement was high, indicating the models tapped into common legal standards. This suggests AI can now navigate the nuanced judgment calls that define professional domains, moving far beyond trivia and into real-world application.

Broader Impact

The results could accelerate AI adoption in legal education. If LLMs consistently produce answers that professors prefer, they might serve as tutors or first-draft tools for students and practitioners. Law schools may need to rethink curriculum and assessments. The study also sets a precedent for testing AI in other judgment-heavy fields like medicine or finance, where professional consensus drives quality.

What to Watch Next

Will law schools integrate AI into classrooms, or push back to preserve traditional training?
Watch for similar benchmarks in medicine, accounting, and other licensed professions.
Future model updates—especially from Google, OpenAI, and Anthropic—could push legal reasoning even further.

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt

Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

AI Outperforms Law Professors in Legal Reasoning Study

Quick Take

Market Impact Analysis

Speculation Analysis

Key Takeaways

What Happened

The Numbers

Why It Happened

Broader Impact

What to Watch Next

Always late to trends?

TAGS

Read Next

KelpDAO $292M Exploit Triggers Aave Bank Run, DeFi in Crisis

Ethereum Risks $1.5K Drop from Vitalik's ETH Sales

Most Read

Wyoming Governor Signs Order for AI Data Center Development

AI Chatbots Still Promote Harmful Intimacy, USC Study Finds

Treasury Secretary: Bitcoin Reserve Moving Forward, CLARITY Act This Summer

Revolut Plans Stablecoins for US Bank Launch Next Year

Cardano Hits 5-Year Low as Hoskinson Warns of Ecosystem Failures

BTC Risks Sub-$65K as Leveraged Liquidations Hurt Sentiment

Tether Debuts Gold-Backed Visa Card Paying Crypto Rewards

Platform

Company

Legal