Technology & InnovationNeutral
38

AI Outperforms Law Professors in Legal Reasoning Study

A Stanford-led study found that AI-generated answers to contract law questions were preferred over human-written ones 75% of the time. Google's Gemini 2.5 Pro and NotebookLM outperformed professors, flagged less often for harmfulness, suggesting LLMs align with professional legal standards.

DecryptJason Nelson

Quick Take

1

AI answers preferred by law professors 75% of the time in 2,918 comparisons.

2

Gemini 2.5 Pro won 75.92% of matchups against human instructors.

3

AI flagged harmful less (3.41%) vs. human (12.06%) responses.

4

Results indicate LLMs can meet professional legal standards.

Market Impact Analysis

Neutral

Article discusses AI legal reasoning, not directly impacting crypto markets.

Timeframeshort

Speculation Analysis

Factuality95/100
RumorsVerified
Speculation Trigger10/100
MinimalExtreme FOMO

Key Takeaways

  • AI-generated contract law answers were preferred by law professors 75% of the time in a blinded study.
  • Google's Gemini 2.5 Pro won 75.92% of matchups against human instructors across 2,918 comparisons.
  • AI responses had a harmfulness rate of just 3.41%, far below the 12.06% for professor-written answers.
  • The results indicate LLMs align with professional legal standards, not just simple fact-based tasks.
AI Preference Rate75%over human answers
Blind Comparisons2,918total evaluations
Gemini Win Rate75.92%against humans
AI Harmfulness3.41%vs 12.06% human

What Happened

A Stanford-led study put AI to the test on contract law—and the machines won. Sixteen professors from 14 top U.S. law schools wrote 40 questions spanning doctrine, case law, and policy. In 2,918 blinded comparisons, they consistently chose AI-generated answers over those from their human peers. Google’s Gemini 2.5 Pro and NotebookLM both surpassed humans, with win rates above 74%. The findings mark a stark shift: AI can now match—and often exceed—professional legal reasoning, not just basic Q&A.

The Numbers

Gemini 2.5 Pro won 75.92% of its matchups, while NotebookLM took 74.75%. The AI harmfulness rate was just 3.41% for Gemini, versus 12.06% for human instructors. Analysis showed AI outperformed across all question types—recall, hypotheticals, and policy discussions. Even after controlling for writing style, the LLMs’ edge held up, confirming substance over surface. The study also tested Claude Opus 4.7 and ChatGPT 5.4; both beat the human average, leaving no doubt about the trend.

Why It Happened

Law isn’t about single right answers—it demands judgment, ambiguity handling, and defensible reasoning. The researchers designed the evaluation to mirror that reality. LLMs succeeded because they aligned with shared disciplinary criteria, not just individual taste. Inter-professor agreement was high, indicating the models tapped into common legal standards. This suggests AI can now navigate the nuanced judgment calls that define professional domains, moving far beyond trivia and into real-world application.

Broader Impact

The results could accelerate AI adoption in legal education. If LLMs consistently produce answers that professors prefer, they might serve as tutors or first-draft tools for students and practitioners. Law schools may need to rethink curriculum and assessments. The study also sets a precedent for testing AI in other judgment-heavy fields like medicine or finance, where professional consensus drives quality.

What to Watch Next

  • Will law schools integrate AI into classrooms, or push back to preserve traditional training?
  • Watch for similar benchmarks in medicine, accounting, and other licensed professions.
  • Future model updates—especially from Google, OpenAI, and Anthropic—could push legal reasoning even further.

Source: Decrypt

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt
Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

© 2026 Bytewit. All Rights Reserved. This article is for informational purposes only.

Read Next

Most Read

⚖️
Regulatory UpdatesBullish
49

Wyoming Governor Signs Order for AI Data Center Development

Wyoming Governor Mark Gordon signed an executive order providing a framework for large-scale data center projects, aiming to attract AI infrastructure investment. The order emphasizes sustainability and resident protections, aligning with crypto miners pivoting to high-performance computing.

70% confidence
Jun 3, 2026, 10:04 PM UTC · Cointelegraph
AI Outperforms Law Professors in Legal Reasoning | Bytewit