⚡

Technology & InnovationNeutral

95% confidence

35% of New Websites Are Now AI-Generated, Study Finds

A study by Stanford and Imperial College reveals that by mid-2025, 35% of new websites were AI-generated. Effects include reduced semantic diversity and heightened artificial positivity, with no impact on factual accuracy yet.

Apr 28, 2026, 8:32 PM UTCDecryptJose Antonio Lanz

Quick Take

AI-generated content now comprises 35% of new websites, up from zero pre-ChatGPT.

Semantic similarity among AI pages is 33% higher, narrowing topic diversity.

Positive sentiment score 107% higher due to LLMs' sycophantic tendencies.

No statistical evidence of increased misinformation or stylistic homogeneity found.

Market Impact Analysis

Neutral

The article discusses AI content prevalence with no direct crypto market connection.

Timeframelong

Speculation Analysis

Factuality90/100

RumorsVerified

Speculation Trigger20/100

MinimalExtreme FOMO

Key Takeaways

AI now produces 35% of new websites, up from near zero before ChatGPT's late 2022 debut.
Semantic similarity among AI pages is 33% higher, narrowing the range of online ideas.
Positive sentiment in AI text is 107% higher, creating an artificially upbeat web.
No measurable increase in factual errors or stylistic uniformity, despite public belief.
At 35% saturation, model collapse risk becomes an empirical concern for future AI training.

AI-Generated Websites35%of new sites by mid-2025

Semantic Similarity33% higheramong AI-written pages

Positive Sentiment107% higherin AI content vs human

Factual Error RateNo correlationwith AI prevalence

What Happened

A study from Stanford, Imperial College London, and the Internet Archive found that by mid-2025, 35% of newly published websites were AI-generated or AI-assisted. That figure was essentially zero before ChatGPT launched in November 2022. The research, drawing on 33 months of Wayback Machine snapshots, used an AI text detector to classify each page. The speed of AI's takeover has stunned researchers, with a significant portion of the internet shaped by machines in just three years. This rapid shift marks a fundamental change in who—or what—creates the web's content.

The Numbers

AI-generated sites showed 33% higher pairwise semantic similarity scores compared to human-written ones, indicating less diversity of expression. Positive sentiment scores were more than 107% higher in AI content, attributed to the overly agreeable nature of large language models. Despite widespread concerns, the study found no statistically significant correlation between AI prevalence and factual error rates. Similarly, the fear of a stylistic monoculture—where AI flattens individual voices—was not supported by character-level analysis. These findings challenge common assumptions about AI's impact on the web.

Why It Happened

The explosion of AI content traces back to ChatGPT's release in late 2022, which made large language models accessible to millions. These models are trained to produce outputs close to their training data, inadvertently reducing semantic diversity. Their tendency toward sycophantic, positive text comes from optimization for human approval signals. As adoption accelerated, the web's composition shifted from predominantly human-created to a growing AI share, with profound effects on the tone and range of online discourse. The speed of this transition has outpaced oversight, leaving the web's character transformed in under three years.

Broader Impact

At 35% AI prevalence, the risk of model collapse—where AI models degrade when trained on AI-generated data—moves from theory to reality. A web flooded with homogenous, cheerful content could narrow the Overton window, marginalizing dissent without explicit censorship. This has implications for everything from public opinion to the training of future AI systems. Researchers are now building a live monitoring tool to continuously track AI's share of the web, aiming to provide early warnings as the digital landscape evolves.

What to Watch Next

The launch of a live monitoring tool by researchers to track AI-generated content share in real time.
Signs of model collapse in next-generation AI models as they increasingly train on synthetic data.
Regulatory or platform responses to the growing AI content saturation, especially around transparency and labeling.

This article is for informational purposes only and does not constitute financial advice.

SourceRead the full article on Decrypt

Read full article

Always late to trends?

Join for the latest news, insights & more.

Disclaimer: Bytewit is an independent media outlet that delivers news, research, and data.

35% of New Websites Are Now AI-Generated, Study Finds

Quick Take

Market Impact Analysis

Speculation Analysis

Key Takeaways

What Happened

The Numbers

Why It Happened

Broader Impact

What to Watch Next

Always late to trends?

TAGS

Read Next

KelpDAO $292M Exploit Triggers Aave Bank Run, DeFi in Crisis

Ethereum Risks $1.5K Drop from Vitalik's ETH Sales

Most Read

CFTC Sues Wisconsin to Defend Prediction Markets

35% of New Websites Are Now AI-Generated, Study Finds

Robinhood Crypto Revenue Crashes 47%, But Event Betting Surges

Robinhood Shares Slide 6% as Crypto Revenue Drops 34%

DeepMind Veteran Raises $1.1B for Self-Learning AI Startup

Coinbase Premium Turns Negative as BTC Losses Exceed $829M

Paul Tudor Jones Calls Bitcoin the 'Best Inflation Hedge'

Platform

Company

Legal