More AI Doesn't Guarantee Better Software But It Seems to Guarantee Faster Iteration

GitQuick

More AI Doesn't Guarantee Better Software But It Seems to Guarantee Faster Iteration

A data-driven analysis of AI-assisted code review, review automation, merge times, and pull request velocity across ten major open-source organizations.

"More AI will make us better" is the easiest thing to promise and the hardest thing to verify.

Bot reviewers, AI code assistants, and automated approvals are everywhere now. The pitch is always the same:

Adopt AI and you'll ship faster.

That sounds plausible. But is it actually true?

I went looking for evidence in data I could inspect.

GitQuick's public showcase runs the same PR analysis pipeline across well-known open-source organizations. I pulled the most recent one-month window (May–June 2026) for all ten showcase organizations and asked a narrow question:

Do organizations with more automated and AI-assisted review actually move faster through the review pipeline?

The short answer:

Yes. Consistently.

But does that automatically mean better software?

How I measured it

Every number here comes from public GitHub metadata only:

Pull request timestamps
Review events
Merge events
Addition/deletion counts

No source code. No internal tooling. No private repositories.

Two metrics act as proxies for review automation:

Bot review rate — the share of reviews from accounts GitQuick classifies as automation.
AI review rate — a narrower share attributed to AI-assisted reviewers such as Copilot, Gemini Code Assist, and similar tools.

These are proxies, not a complete measure of AI adoption. An organization can use AI heavily for authoring, testing, or CI/CD while showing relatively little AI activity in reviews.

The dataset

Org	Bot reviews	AI reviews	Median merge	Median 1st review	Merged/wk	Merged w/o review	Median PR size	Avg reviewers/PR
Netflix	58.0%	15.3%	3.4h	0.04h	32.8	52.4%	43	1.66
Microsoft	33.9%	18.1%	3.9h	1.4h	4,484	8.9%	70	1.90
OpenAI	30.2%	30.2%	8.1h	0.3h	234	13.1%	189	1.75
Google Cloud	27.8%	26.8%	5.5h	0.5h	473	22.0%	56	1.49
Google	12.3%	5.5%	11.7h	1.9h	688	45.4%	56	1.48
AWS	11.9%	1.7%	13.5h	0.7h	315	9.5%	47	1.69
Meta Open Source	8.7%	7.2%	7.0h	5.5h	35	31.1%	60	1.43
Anthropic	4.1%	2.8%	3.2h	1.7h	165	0.1%	16	1.07
Apple	2.8%	2.8%	10.1h	1.4h	101	4.8%	46	1.53
Kubernetes	1.4%	1.8%	15.7h	6.1h	197	35.9%	42	1.84

More review automation tracks with faster pipelines

Split the organizations at the median bot-review rate.

The top five form a "high-automation" group:

Netflix
Microsoft
OpenAI
Google Cloud
Google

The remaining five form a "low-automation" group:

AWS
Meta Open Source
Anthropic
Apple
Kubernetes

Group	Median time to merge	Median time to first review
High automation	5.5h	0.5h
Low automation	10.1h	1.7h

Organizations with higher automation:

Receive first reviews over 3× faster
Merge changes in roughly half the time

Across all ten organizations, the rank correlation between bot-review rate and time-to-merge is -0.48, indicating a moderate relationship between more review automation and faster merges. No single organization perfectly follows the trend, but the overall direction is difficult to ignore.

The strongest effect appears at the very beginning of the review process. That makes intuitive sense.

A bot doesn't wait for calendar availability, nor does it finish another review first. A bot can react instantaneously. The result is shorter feedback loops.

Interesting signals in the data

Netflix: the automation extreme

Netflix stands out immediately:

58% bot reviews
Near-instant first reviews (0.04h)
3.4h median merge time

More than half of its merged pull requests show no recorded review event.

Automation is not merely assisting the process—it appears to be actively clearing low-risk work.

Microsoft: speed without sacrificing governance

Microsoft demonstrates that automation and process discipline can coexist.

Despite operating at enormous scale:

4,484 merges per week
33.9% bot review rate

It maintains only:

8.9% merged without review

Fast does not necessarily mean uncontrolled.

OpenAI and Google Cloud: AI-heavy review cultures

These organizations show the highest AI-specific review rates in the dataset.

OpenAI is particularly interesting because it combines:

Fast first reviews
High AI participation
Large median PR size (189 lines)

Despite rapid feedback, larger changes still take longer to land.

AI helps teams start the conversation faster.

It doesn't eliminate the work required to review complex changes.

Kubernetes: deliberate friction

Kubernetes sits at the opposite end of the spectrum:

Lowest automation rate
Longest merge times
Large maintainer-driven review culture

That slower pace is not necessarily a failure.

In many mature open-source communities, careful review is a deliberate choice.

The counterexample: Anthropic

Anthropic is perhaps the most surprising organization in the dataset.

Despite being one of the companies most associated with AI, its public repositories show very little AI-assisted review activity.

Anthropic has:

Only 4.1% bot reviews
Only 2.8% AI reviews

Yet it achieves the fastest median merge time in the dataset:

3.2 hours.

Why?

The answer appears elsewhere in the table.

Anthropic's median PR size is only:

16 lines.

By far the smallest in the entire dataset.

This is a textbook example of the power of small changes.

Tiny pull requests are:

Easier to understand
Easier to review
Easier to approve
Easier to merge

Anthropic demonstrates that disciplined engineering practices can sometimes outweigh automation volume.

The lesson isn't that AI doesn't matter.

The lesson is that AI is not the only lever.

What this analysis does not prove

Before drawing broader conclusions, it's important to recognize what this dataset cannot tell us.

It does not measure:

Code quality
Reliability
Security
Architecture
Product success
Customer outcomes

Faster merges do not automatically mean better software.

Likewise, slower merges do not automatically mean higher quality software.

This analysis measures workflow speed, not engineering excellence.

The real takeaway

The most important conclusion from this data is not:

"AI creates better software."

This dataset cannot support that claim.

The stronger conclusion is:

AI appears to create faster iteration.

Organizations with more review automation consistently move changes through the pipeline faster.

They receive feedback sooner. They begin reviews sooner. They merge sooner.

Why does that matter?

Because software quality rarely emerges from a single perfect decision.

Quality emerges from repeated cycles of:

Building
Reviewing
Learning
Correcting
Improving

The faster a team can safely move through those cycles, the more opportunities it has to improve.

That doesn't mean faster is always better.

It means faster iteration creates the conditions under which better outcomes can emerge.

The organizations in this dataset provide evidence that AI and automation are effective tools for reducing friction in those feedback loops.

Whether that speed ultimately becomes higher product quality still depends on engineering discipline, testing, architecture, review standards, and culture.

In other words:

More AI does not guarantee better software.

It does, however, appear to be strongly associated with faster iteration.

And when faster iteration is combined with strong engineering practices, it becomes a powerful advantage.

That's exactly why measuring your review process matters.

Not because AI is automatically good.

Because understanding where time is actually spent is the first step toward improving how your team works.

Disclaimer: This analysis uses public GitHub metadata only and reflects review-process signals rather than code quality or internal engineering practices. Figures come from one-month showcase windows (May–June 2026) and will change as the underlying data refreshes. GitQuick is not affiliated with or endorsed by any organization mentioned.