What is Google Lighthouse and how does it work?

Google Lighthouse is an open-source tool built into Chrome DevTools. It loads a page under controlled conditions and runs a series of automated checks across five categories: Performance, Accessibility, Best Practices, SEO, and (in some versions) Progressive Web App compliance. It produces a score from 0 to 100 for each category. The scores are weighted averages of individual metrics. For example, the Performance score combines things like: - **Largest Contentful Paint (LCP):** how long until the main content is visible - **Total Blocking Time (TBT):** how long the page is unresponsive to input - **Cumulative Layout Shift (CLS):** how much the layout jumps around while loading You can run it directly in Chrome (DevTools > Lighthouse tab), via the command line, or through third-party tools that use it under the hood.

Why do I get different scores each time I run the same test?

Because conditions vary. Even for the same page, results can differ based on: - The device and CPU the test runs on - Network conditions (real or simulated) - Whether other tabs or processes are using resources - The version of the tool - Whether the page has dynamic content (ads, A/B tests, personalization) This is especially true for Performance scores. Accessibility and SEO scores tend to be more stable, because they check for the presence or absence of specific elements rather than measuring timing. To get reliable comparisons, always run tests in the same environment. Lighthouse CI — integrated into a deployment pipeline — is a good way to do this consistently.

Can a website score 100 and still have accessibility problems?

Yes. Automated accessibility checks catch a subset of issues — estimates suggest around 50–57% of common problems. A perfect score means the tool found nothing wrong. It does not mean the site is fully accessible. Issues that tools typically miss: - Whether a screen reader announces content in a logical, useful order - Whether form error messages actually describe what went wrong - Whether interactive elements are operable in a reasonable way with keyboard navigation - Whether language is plain enough for people with cognitive disabilities A score of 100 is a good starting point. It is not a substitute for testing with assistive technologies or with users who have disabilities.

What is Goodhart's Law and why does it matter for website audits?

Goodhart's Law is an observation from economics, often summarized as: > *When a measure becomes a target, it ceases to be a good measure.* Applied to website audits: if your goal is a high score rather than a better website, you may end up optimizing for the score specifically — without improving the real user experience. Examples of this in practice: - Deferring or removing scripts to improve Performance scores, without checking whether those scripts are needed for core functionality - Adding alt text to every image just to clear the flag, without checking whether the text is actually useful - Compressing images aggressively to reduce file size, introducing visible quality loss The score is a signal, not the goal. Keep the actual user experience in focus.

How often should I run a website audit?

It depends on how often the site changes. A general approach: - **After every significant deployment:** check that nothing regressed - **Monthly:** a routine check for stable sites with infrequent updates - **After major changes:** new features, redesigns, CMS upgrades, or new third-party scripts all warrant a fresh audit For sites where performance, accessibility, or sustainability matter a lot — public services, e-commerce, health information — more frequent automated checks integrated into the development workflow are worth the effort. A one-time audit is better than no audit. But regular audits catch problems before they become serious. ```

Are paid audit tools better than free ones?

Not necessarily. Many free tools are very capable: - **Google Lighthouse** — performance, accessibility, SEO, best practices - **WAVE** — accessibility-focused, good for visual feedback - **WebPageTest** — detailed performance testing with real browsers and locations - **webaudits.org** — multi-category overview including sustainability and security Paid tools often add value through: - Scheduled monitoring and alerts - Crawling multiple pages at once - Historical tracking and reporting - Team collaboration features For a single site with a small team, free tools used consistently will get you most of the way there. Paid tools are more useful when you need to manage audits at scale or need audit history over time.

What should I do if I don't understand what a flagged issue means?

Don't fix it blindly. A few steps that help: 1. Read the tool's explanation. Lighthouse, for example, links to documentation for each flagged issue. 2. Look up the underlying concept. MDN Web Docs is a reliable reference for web standards and accessibility guidelines. 3. Ask whether the issue affects real users. Some flagged items are minor or even irrelevant to your specific case. 4. Check whether fixing it could break something else. Some optimizations have unintended side effects. If you're working with a developer or agency, ask them to explain the finding in plain language before agreeing to fix it. A good explanation should include what the problem is, why it matters, and what the fix involves.

Performance•

Accessibility•

SEO•

Sustainability•

Tools•

Best Practices

Tools & Best Practices

Automated Website Tests: Useful Signal or False Comfort?

Tools like Google Lighthouse can check a website in seconds and hand you a score. That's genuinely useful — but it's easy to misread what the score means. A high number doesn't always mean a good website. Here's what these tools can and can't do, and how to get real value from them.

Web Audits|Published on May 19, 2026| 8 min readAI-assisted writing•Claude

A diverse group of people engrossed in their smartphones, highlighting modern social connectivity. — cottonbro studio / Pexels

What automated testing tools actually do

Tools like Google Lighthouse, WebPageTest, WAVE, or the audits on webaudits.org run a set of automated checks against a URL. They test things like:

How fast the page loads (or appears to load)
Whether images have alt text
If the page uses HTTPS
Whether certain accessibility rules are met
How much data the page transfers
Basic SEO signals like meta tags and heading structure

The result is usually a score — often between 0 and 100 — broken down by category. It looks authoritative. The problem is that it can be misleading.

The score is a proxy, not the truth

There's a principle sometimes called Goodhart's Law: when a measure becomes a target, it ceases to be a good measure. In other words, optimizing for a score is not the same as improving the thing the score is supposed to reflect.

This happens with website audits all the time.

A developer can get a near-perfect Lighthouse performance score on a page that still feels slow to real users — because the tool measures metrics under controlled lab conditions, not actual network environments. A page can score 100 on accessibility and still be unusable by someone relying on a screen reader, because many real accessibility problems require human judgment, not automated detection. Research by Deque suggests that automated tools catch only around 57% of accessibility issues — the rest require manual testing.

A high score is a good sign. It is not a guarantee.

What these tools genuinely can't check

Automated tools work by running scripts. They can check what's present and what's measurable. They can't judge what matters to users.

Here are real things they miss:

Usability. A button can be large enough, have sufficient color contrast, and still be confusing. Tools don't know if users understand the interface.

Content quality. Whether the text is useful, honest, or readable is beyond any automated check.

Real-world performance. Lab tests use fixed conditions. Real users have different devices, connections, locations, and browser states. A tool might show a fast page that loads slowly for most of your actual visitors.

Deep accessibility. A form field can have a label (which the tool checks) and still be described in a way that makes no sense to a screen reader user (which the tool misses).

Sustainability in full context. Tools can estimate the carbon footprint of a page load, but they can't account for how often the page is visited, whether it replaces something more energy-intensive, or what happens on the server side.

What they're genuinely good for

That said, automated tools are valuable — especially when used with the right expectations.

Speed of feedback. A quick scan can surface obvious problems in seconds. That's useful whether you're auditing your own site or evaluating a vendor's work.

Consistency. Tools apply the same rules every time. That's harder to achieve with manual review.

Catching regressions. Run the same test regularly, and you'll notice if something got worse after a deploy.

Starting a conversation. For clients who aren't technical, a tool report with concrete numbers can open a discussion about priorities and trade-offs more easily than abstract descriptions.

Low-hanging fruit. Missing alt text. Images that aren't compressed. Pages without a meta description. No HTTPS. Render-blocking scripts. These are real problems, and tools find them fast.

From a client's point of view

If you're paying someone to build or maintain a website, automated audit results can be a useful reference — but treat them as a starting point, not a report card.

A few things worth knowing:

A score can be inflated by focusing only on what the tool measures. Ask what was actually improved, not just what the score is.
Different tools give different results. A score of 90 on Lighthouse and a score of 60 on another tool for the same page isn't a contradiction — they measure different things.
Ask for a comparison over time, not just a one-off number. Did performance improve after the work you paid for?

If someone promises a perfect 100, be curious about how. Some optimizations that help scores have no real-world benefit, or even trade one problem for another.

From a developer's point of view

Tools are fast, cheap, and automatable. Use them as part of a regular workflow, not just before a launch.

A few practical habits:

Run audits in a consistent environment. Lighthouse results vary depending on where and how you run it. Use the same setup each time (for example, Lighthouse CI in your deployment pipeline) to make results comparable.
Don't fix issues in isolation. Understand what a flagged issue actually means before acting. Some recommendations conflict with each other or have negligible real-world impact.
Use multiple tools. Different tools catch different things. A combination of Lighthouse, WAVE, and a manual review gets you much closer to the truth than any single tool.
Test with real users where you can. Even a small round of usability testing will surface things no automated tool will find.

Working with limited budgets

Not every website has the resources for a full audit, a round of user testing, and a development sprint to fix everything. That's fine. The question is how to prioritize.

A useful frame: what problems cause real harm, and which are easy to fix?

Priority	Problem type	Example
Fix first	High impact, low effort	Missing alt text, no HTTPS, broken links
Plan for	High impact, higher effort	Poor color contrast throughout, slow server
Deprioritize	Low impact, high effort	Minor score improvements with no user-facing effect
Skip for now	Low impact, low effort	Tweaks that move the score but nothing else

Start with what matters most for real users. For accessibility, that often means making sure core functions (navigation, forms, key content) work without a mouse and with a screen reader — before worrying about edge cases. For performance, fixing a 4 MB image is more useful than fine-tuning cache headers.

When budget is tight, a structured audit can help you identify what to tackle first. Tools like the ones on webaudits.org can give you an overview across multiple categories — sustainability, performance, security, SEO, and accessibility — so you can see where the biggest gaps are before deciding where to spend effort.

What to do with audit results

An audit report is only useful if someone acts on it. Here's a simple approach:

Triage. Go through the findings and sort them: critical, important, and minor.
Understand before you fix. For each issue, make sure you understand what it means in practice. Some flagged items are real problems. Some are false positives. Some are genuine trade-offs.
Estimate effort. A small team with limited time needs to know which fixes are quick wins and which are projects.
Fix and verify. After making changes, run the audit again. Make sure the problem is actually resolved — not just that the score went up.
Make it a habit. A one-time audit is a snapshot. Regular checks catch new problems before they become serious.

A useful tool, not an answer

Automated tests are one input. They're faster and cheaper than manual review. They're good at catching a specific class of problems consistently. But they measure what's measurable, not what matters most.

Use them as a starting point. Combine them with manual testing, real user feedback, and common sense. Be skeptical of scores — including high ones.

The goal isn't a better number. It's a better website.

Frequently Asked Questions About Automated Website Testing

Automated testing tools raise a lot of practical questions — especially for those new to website audits. Here are answers to the most common ones.

Modified on June 3, 2026

All Posts