Visual Regression Testing Without Code: A Practical Guide
Visual regression testing has a reputation for being hard to set up. Percy, Chromatic, BackstopJS, Playwright visual comparisons — all powerful tools, all requiring meaningful engineering investment to configure, maintain, and run reliably in CI.
For teams without a dedicated QA engineer or a mature CI pipeline, that investment is often too high. The tests never get written. Visual regressions ship.
But visual regression testing doesn't have to mean automated screenshot diffing in CI. There's a manual approach that catches the same class of bugs, takes 20–30 minutes per release, and requires zero test code.
What Is Visual Regression Testing?
Visual regression testing is the practice of comparing the current UI against a known-good baseline to detect unintended visual changes.
Common visual regressions include:
- Layout shifts caused by a CSS change in a shared component
- Colour changes from a design token update
- Spacing drift introduced by a dependency upgrade
- Font rendering differences across browsers
- Dark mode screens that inherit wrong background variables
- Elements that overlap or truncate at specific viewport widths
Visual regressions are distinct from functional bugs. The button still works — it just looks wrong. They're easy to miss in code review and easy to catch by looking.
Why Automated Visual Regression Testing Is Hard
Automated visual regression tools work by:
- Taking a screenshot of every page in a baseline state
- Taking a new screenshot after every code change
- Diffing the two images pixel-by-pixel
- Flagging any pixels that changed
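The diffing step these tools automate can be sketched in a few lines. Here is a minimal, illustrative pixel diff in pure Python, operating on screenshots already decoded into 2D lists of RGB tuples (an assumption for self-containment; real tools work on image files and add anti-aliasing detection and ignore regions):

```python
def pixel_diff(baseline, current, tolerance=0):
    """Compare two screenshots (2D lists of (r, g, b) tuples) pixel by pixel.

    Returns the fraction of pixels whose largest channel difference exceeds
    `tolerance`. Real tools layer anti-aliasing heuristics and ignore regions
    on top of this core idea; this sketch shows only the raw comparison.
    """
    if len(baseline) != len(current) or len(baseline[0]) != len(current[0]):
        raise ValueError("screenshots must have identical dimensions")
    changed = 0
    total = 0
    for row_a, row_b in zip(baseline, current):
        for (r1, g1, b1), (r2, g2, b2) in zip(row_a, row_b):
            total += 1
            if max(abs(r1 - r2), abs(g1 - g2), abs(b1 - b2)) > tolerance:
                changed += 1
    return changed / total
```

With `tolerance=0`, even a single anti-aliased edge counts as a change, which is exactly why false positives dominate in practice.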
The theory is sound. The practice is painful:
High false positive rate. Any animation, dynamic content, or font rendering difference triggers a diff. You spend more time approving false positives than reviewing real regressions.
Baseline management is ongoing work. Every intentional UI change requires updating baselines. If you have 50 screens and 3 developers pushing daily, baseline drift becomes a full-time job.
CI dependency. Most tools require tests to run in a headless browser in CI. Setting up the environment, handling flaky renders, and managing the pipeline adds engineering overhead before you've caught a single bug.
Cost. Percy, Chromatic, and similar SaaS tools charge per screenshot. At scale, costs add up fast.
For many teams — especially smaller ones shipping fast — the ROI doesn't justify the investment until they're mature enough to absorb the overhead.
Manual Visual Regression Testing: The Practical Alternative
Manual visual regression testing follows the same logic as automated testing, but replaces the automation with a structured human review process.
The tradeoff: it doesn't scale to 500 screens, but it's fast enough to cover the 10–20 screens that actually matter in a typical release, and you can start doing it today with no setup.
The baseline
Your baseline is the last known-good screenshot of each screen. Capture these after each release, before you start the next round of changes.
Store baselines in a named folder, tagged by release version: v1.4.2-baseline.
The comparison
After implementing changes, capture new screenshots of every screen you touched. Then compare the new screenshots against the baseline side-by-side.
What to look for:
- Any element that moved position
- Any colour that changed
- Any spacing that increased or decreased
- Any text that truncated, wrapped differently, or changed weight
- Any component that appeared or disappeared unexpectedly
The threshold
Not every pixel difference is a regression. Apply a human threshold:
- Intentional change — the diff matches what the PR was supposed to do. Approve and update the baseline.
- Unintentional change — something moved that shouldn't have. This is a regression. File a bug.
- Acceptable drift — minor sub-pixel rendering differences across environments. Note it, don't block the release.
A Step-by-Step Manual Visual Regression Process
Before the release
- Identify which screens will be affected by the current release
- Capture baseline screenshots of each affected screen at viewport widths of 1440px (desktop) and 375px (mobile)
- Label them clearly:
checkout-desktop-v1.4.2-baseline.png
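A small helper can keep the naming consistent across the release. This is a sketch; the function name and the screen/viewport values are illustrative, not part of any tool:

```python
def screenshot_name(screen: str, viewport: str, version: str, stage: str) -> str:
    """Build a filename following the screen-viewport-version-stage
    convention, e.g. checkout-desktop-v1.4.2-baseline.png."""
    if stage not in ("baseline", "new"):
        raise ValueError("stage must be 'baseline' or 'new'")
    return f"{screen}-{viewport}-{version}-{stage}.png"
```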
During development
- Implement changes
- Capture new screenshots of the same screens at the same viewports
- Label them:
checkout-desktop-v1.4.2-new.png
Before merging
- Load each pair side-by-side in a comparison tool
- Enable sync zoom — zoom into specific sections to inspect alignment at 2x or 4x
- Use a grid overlay to check that elements align to the layout grid
- For each diff, classify: intentional, regression, or acceptable
After the release
- Promote the new screenshots to baseline
- Archive the previous baseline for one release cycle
Tools for Manual Visual Regression Testing
Side-by-side comparison tools
The minimum you need is a way to view two screenshots at the same time, at the same zoom level. Even two Finder or File Explorer windows side-by-side work — but that's slow and imprecise.
Better options:
- Captur — desktop app for macOS and Windows with built-in side-by-side comparison, sync zoom, grid overlay, and overlay diff. Designed specifically for this workflow.
- Figma — works if you're comparing against a design spec, not against a previous screenshot
- Browser DevTools — useful for inspecting computed styles, not for visual comparison
Sync zoom is the feature that matters most. When you zoom into the left screenshot, the right screenshot zooms to the same position. Without it, you're constantly re-navigating between screenshots trying to line them up.
Grid overlay shows a layout grid on top of both screenshots simultaneously. It makes spacing regressions — the hardest to spot visually — immediately obvious.
When to Move to Automated Visual Regression Testing
Manual visual regression testing works well for:
- Teams with fewer than 50 screens
- Release cycles of 1–2 weeks or longer
- Products where most screens are stable and changes are targeted
Consider investing in automated tools when:
- You're releasing multiple times per day
- Your test coverage includes hundreds of screens
- You have a dedicated QA engineer to manage baseline updates
- False positive management is acceptable as a workflow cost
The transition isn't either/or. Many teams run both: automated testing for stable screens (catching unexpected regressions from dependency updates), and manual review for new features and active development areas.
Getting Started Today
You don't need to set up Percy or write a single line of Playwright code to start catching visual regressions.
The process:
- Before your next release, screenshot every screen you touched
- After implementing changes, screenshot the same screens
- Compare them side-by-side before merging
On macOS or Windows, Captur is built for exactly this workflow — screenshots stay organised by project, side-by-side comparison with sync zoom, and one-click Jira or ClickUp tickets when you find a regression.
No CI. No baselines to manage. No false positives. Just look at your UI before you ship it.