Heuristic Evaluation: The Audit Before the Test

Before you schedule participants, before you set up recordings, before you spend a single euro on incentives—ask yourself: are there obvious problems you could find without users?

Heuristic evaluation is the answer. It is a systematic expert review that catches the low-hanging fruit, so your user testing budget goes toward the problems only real users can reveal.

The "Save Your Budget" Rule

Do not test basic hygiene issues with real users. It is waste.

The Problem

Imagine running a usability test and discovering:

The "Submit" button is gray and looks disabled
Error messages do not explain what went wrong
Users cannot get back to the homepage from the checkout flow
The search icon is labeled "Find" in one place and "Search" in another

These are real problems—but they are obvious problems. You did not need to pay participants and spend hours in sessions to find them. An experienced practitioner could spot them in 20 minutes.

The Solution

Run a heuristic evaluation first to catch the obvious issues. Clean them up. Then run user testing to discover the complex, domain-specific problems that require real user context.

The ROI

Approach	Cost	Findings
Jump straight to user testing	High (recruiting, incentives, time)	Mix of obvious and complex issues
Heuristic evaluation first	Low (expert time only)	Catches ~60% of obvious issues
User testing after cleanup	High	Focuses on complex, valuable insights

The combined approach costs only slightly more than user testing alone but produces dramatically cleaner findings. Users encounter a product that at least makes basic sense, revealing deeper issues instead of stumbling over obvious flaws.

The Framework: Nielsen's 10 Heuristics

The most widely used framework is Jakob Nielsen's 10 Usability Heuristics. These are not rigid rules—they are principles for identifying common usability failures.

The 10 Heuristics

#	Heuristic	Key Question
1	Visibility of System Status	Does the system keep users informed about what is happening?
2	Match Between System and Real World	Does the system use language and concepts familiar to users?
3	User Control and Freedom	Can users easily undo, redo, and exit unwanted states?
4	Consistency and Standards	Does the interface follow platform conventions and internal consistency?
5	Error Prevention	Does the design prevent errors before they occur?
6	Recognition Rather Than Recall	Are options visible rather than requiring users to remember information?
7	Flexibility and Efficiency of Use	Are there shortcuts for expert users without confusing novices?
8	Aesthetic and Minimalist Design	Is irrelevant information eliminated to reduce noise?
9	Help Users Recognize, Diagnose, and Recover from Errors	Are error messages clear, specific, and constructive?
10	Help and Documentation	Is help available when needed, focused on the user's task?

Using the Heuristics

Work through one heuristic at a time. For each heuristic, review every screen or component and ask: "Does this violate the principle?" If it does, document what the violation is, where it occurs, and how severe it is. If it does not, move to the next screen.

For how accessibility heuristics extend this evaluation framework, see Accessibility Testing: Beyond Compliance.

How to Execute

The Process

Step 1: Select Evaluators (3-5 experts)

More evaluators find more problems, with diminishing returns:

Evaluators	Problems Found	Marginal Gain
1	~35%	—
3	~60%	+25%
5	~75%	+15%
10	~85%	+10%

For most projects, 3-5 evaluators provide the best cost-benefit ratio.

Step 2: Brief the Evaluators

Provide:

The interface (prototype, staging environment, or production)
Target user personas and key tasks
The heuristic framework to use
Severity rating scale
Documentation template

Step 3: Independent Evaluation

Each evaluator reviews the interface independently, without discussing with others. This prevents groupthink and ensures diverse perspectives.

Typical time: 1-2 hours per evaluator for a moderately complex interface.

Step 4: Document Findings

Each finding should include:

Field	Example
Location	Checkout > Payment screen > Credit card form
Heuristic violated	#9: Help users recover from errors
Description	Error message says "Invalid input" without explaining which field or why
Severity	High (blocks task completion)
Recommendation	Specify the field and the format expected: "Card number must be 16 digits"

Step 5: Aggregate and Deduplicate

After independent evaluation:

Collect all findings into a single list
Merge duplicates (same issue found by multiple evaluators)
Note agreement level (issues found by 3+ evaluators are high confidence)
Prioritize by severity and frequency

Severity Ratings

Rating	Definition	Example
Critical	Prevents task completion	Cannot submit form due to validation error
Major	Causes significant difficulty	Unclear labels require trial and error
Minor	Causes minor friction	Inconsistent button styling
Cosmetic	Does not affect usability	Alignment is slightly off

When to Use Heuristic Evaluation

Before User Testing (Prototype Cleanup)

The most common use: clean your prototype before showing it to participants.

Timeline:

Design Complete → Heuristic Eval (1-2 days) → Fix Issues (1-3 days) → User Testing

Benefits:

Users encounter fewer obvious issues
User testing reveals deeper, domain-specific problems
Sessions are more efficient (less time on basic issues)
Findings are more actionable (focused on real complexity)

On Legacy Products (Building a Backlog)

For existing products with no recent evaluation, heuristic evaluation builds a prioritized improvement backlog.

Timeline:

Legacy Product → Heuristic Eval (2-3 days) → Prioritized Backlog → Incremental Fixes

Benefits:

Quick baseline of current usability state
Objective prioritization for improvement work
Does not require user recruitment
Can be done on short notice

After Major Redesigns (Quick Validation)

When you have redesigned a significant portion of the interface, heuristic evaluation provides fast feedback before investing in user testing.

Benefits:

Catches regressions from the previous version
Identifies new issues introduced by the redesign
Faster turnaround than recruiting participants

For how heuristic evaluation feeds into the broader research plan, see The Research Plan.

What Heuristic Evaluation Cannot Do

Heuristic evaluation is powerful but limited:

Cannot Assess	Why	Alternative Method
Task completion rates	No real users attempting tasks	Usability testing
Actual user behavior	Experts predict, not observe	Usability testing, analytics
Learnability	Experts are not novices	First-time user testing
Domain-specific issues	Experts may lack domain knowledge	Domain expert review or user testing
Emotional response	Experts evaluate logic, not feeling	User testing, surveys

The Evaluation Template

Use a structured template to ensure consistency:

HEURISTIC EVALUATION REPORT

Product: [Name/Version]
Evaluator: [Name]
Date: [Date]
Scope: [Screens/flows evaluated]

FINDINGS

#1
Location: [Specific screen/component]
Heuristic: [Number and name]
Issue: [Description of the problem]
Severity: [Critical/Major/Minor/Cosmetic]
Recommendation: [Suggested fix]
Screenshot: [If applicable]

#2
...

Running a Group Debrief

After independent evaluation, bring evaluators together:

Share findings (round-robin, each evaluator presents their top issues)
Discuss disagreements (different severity ratings, missed issues)
Merge duplicates (combine identical findings, note agreement level)
Prioritize (vote on top issues to address before user testing)
Assign owners (who will fix what, by when)

What This Means for Practice

Heuristic evaluation is the audit before the test. Use it to:

Save budget: Catch 60%+ of obvious issues without recruiting participants
Clean prototypes: Fix basic problems before users encounter them
Focus user testing: Let real users reveal complex, domain-specific issues
Build backlogs: Quickly assess legacy products for improvement opportunities
Use 3-5 evaluators: One finds 35%, five find 75%

The best usability test is one where users can actually engage with the core experience—not one where they stumble over obvious flaws you could have found yourself.

For the next step after heuristic evaluation, see Usability Test Scripting: Copy-Paste Templates.

Before you schedule participants, before you set up recordings, before you spend a single euro on incentives—ask yourself: are there obvious problems you could find without users?

Heuristic evaluation is the answer. It is a systematic expert review that catches the low-hanging fruit, so your user testing budget goes toward the problems only real users can reveal.

The "Save Your Budget" Rule

Do not test basic hygiene issues with real users. It is waste.

The Problem

Imagine running a usability test and discovering:

The "Submit" button is gray and looks disabled
Error messages do not explain what went wrong
Users cannot get back to the homepage from the checkout flow
The search icon is labeled "Find" in one place and "Search" in another

These are real problems—but they are obvious problems. You did not need to pay participants and spend hours in sessions to find them. An experienced practitioner could spot them in 20 minutes.

The Solution

Run a heuristic evaluation first to catch the obvious issues. Clean them up. Then run user testing to discover the complex, domain-specific problems that require real user context.

The ROI

Approach	Cost	Findings
Jump straight to user testing	High (recruiting, incentives, time)	Mix of obvious and complex issues
Heuristic evaluation first	Low (expert time only)	Catches ~60% of obvious issues
User testing after cleanup	High	Focuses on complex, valuable insights

The Framework: Nielsen's 10 Heuristics

The most widely used framework is Jakob Nielsen's 10 Usability Heuristics. These are not rigid rules—they are principles for identifying common usability failures.

The 10 Heuristics

#	Heuristic	Key Question
1	Visibility of System Status	Does the system keep users informed about what is happening?
2	Match Between System and Real World	Does the system use language and concepts familiar to users?
3	User Control and Freedom	Can users easily undo, redo, and exit unwanted states?
4	Consistency and Standards	Does the interface follow platform conventions and internal consistency?
5	Error Prevention	Does the design prevent errors before they occur?
6	Recognition Rather Than Recall	Are options visible rather than requiring users to remember information?
7	Flexibility and Efficiency of Use	Are there shortcuts for expert users without confusing novices?
8	Aesthetic and Minimalist Design	Is irrelevant information eliminated to reduce noise?
9	Help Users Recognize, Diagnose, and Recover from Errors	Are error messages clear, specific, and constructive?
10	Help and Documentation	Is help available when needed, focused on the user's task?

Using the Heuristics

For how accessibility heuristics extend this evaluation framework, see Accessibility Testing: Beyond Compliance.

How to Execute

The Process

Step 1: Select Evaluators (3-5 experts)

More evaluators find more problems, with diminishing returns:

Evaluators	Problems Found	Marginal Gain
1	~35%	—
3	~60%	+25%
5	~75%	+15%
10	~85%	+10%

For most projects, 3-5 evaluators provide the best cost-benefit ratio.

Step 2: Brief the Evaluators

Provide:

The interface (prototype, staging environment, or production)
Target user personas and key tasks
The heuristic framework to use
Severity rating scale
Documentation template

Step 3: Independent Evaluation

Each evaluator reviews the interface independently, without discussing with others. This prevents groupthink and ensures diverse perspectives.

Typical time: 1-2 hours per evaluator for a moderately complex interface.

Step 4: Document Findings

Each finding should include:

Field	Example
Location	Checkout > Payment screen > Credit card form
Heuristic violated	#9: Help users recover from errors
Description	Error message says "Invalid input" without explaining which field or why
Severity	High (blocks task completion)
Recommendation	Specify the field and the format expected: "Card number must be 16 digits"

Step 5: Aggregate and Deduplicate

After independent evaluation:

Collect all findings into a single list
Merge duplicates (same issue found by multiple evaluators)
Note agreement level (issues found by 3+ evaluators are high confidence)
Prioritize by severity and frequency

Severity Ratings

Rating	Definition	Example
Critical	Prevents task completion	Cannot submit form due to validation error
Major	Causes significant difficulty	Unclear labels require trial and error
Minor	Causes minor friction	Inconsistent button styling
Cosmetic	Does not affect usability	Alignment is slightly off

When to Use Heuristic Evaluation

Before User Testing (Prototype Cleanup)

The most common use: clean your prototype before showing it to participants.

Timeline:

Design Complete → Heuristic Eval (1-2 days) → Fix Issues (1-3 days) → User Testing

Benefits:

Users encounter fewer obvious issues
User testing reveals deeper, domain-specific problems
Sessions are more efficient (less time on basic issues)
Findings are more actionable (focused on real complexity)

On Legacy Products (Building a Backlog)

For existing products with no recent evaluation, heuristic evaluation builds a prioritized improvement backlog.

Timeline:

Legacy Product → Heuristic Eval (2-3 days) → Prioritized Backlog → Incremental Fixes

Benefits:

Quick baseline of current usability state
Objective prioritization for improvement work
Does not require user recruitment
Can be done on short notice

After Major Redesigns (Quick Validation)

When you have redesigned a significant portion of the interface, heuristic evaluation provides fast feedback before investing in user testing.

Benefits:

Catches regressions from the previous version
Identifies new issues introduced by the redesign
Faster turnaround than recruiting participants

For how heuristic evaluation feeds into the broader research plan, see The Research Plan.

What Heuristic Evaluation Cannot Do

Heuristic evaluation is powerful but limited:

Cannot Assess	Why	Alternative Method
Task completion rates	No real users attempting tasks	Usability testing
Actual user behavior	Experts predict, not observe	Usability testing, analytics
Learnability	Experts are not novices	First-time user testing
Domain-specific issues	Experts may lack domain knowledge	Domain expert review or user testing
Emotional response	Experts evaluate logic, not feeling	User testing, surveys

The Evaluation Template

Use a structured template to ensure consistency:

HEURISTIC EVALUATION REPORT

Product: [Name/Version]
Evaluator: [Name]
Date: [Date]
Scope: [Screens/flows evaluated]

FINDINGS

#1
Location: [Specific screen/component]
Heuristic: [Number and name]
Issue: [Description of the problem]
Severity: [Critical/Major/Minor/Cosmetic]
Recommendation: [Suggested fix]
Screenshot: [If applicable]

#2
...

Running a Group Debrief

After independent evaluation, bring evaluators together:

Share findings (round-robin, each evaluator presents their top issues)
Discuss disagreements (different severity ratings, missed issues)
Merge duplicates (combine identical findings, note agreement level)
Prioritize (vote on top issues to address before user testing)
Assign owners (who will fix what, by when)

What This Means for Practice

Heuristic evaluation is the audit before the test. Use it to:

Save budget: Catch 60%+ of obvious issues without recruiting participants
Clean prototypes: Fix basic problems before users encounter them
Focus user testing: Let real users reveal complex, domain-specific issues
Build backlogs: Quickly assess legacy products for improvement opportunities
Use 3-5 evaluators: One finds 35%, five find 75%

The best usability test is one where users can actually engage with the core experience—not one where they stumble over obvious flaws you could have found yourself.

For the next step after heuristic evaluation, see Usability Test Scripting: Copy-Paste Templates.

Heuristic Evaluation: The Audit Before the Test

Summary

The "Save Your Budget" Rule

The Problem

The Solution

The ROI

The Framework: Nielsen's 10 Heuristics

The 10 Heuristics

Using the Heuristics

How to Execute

The Process

Severity Ratings

When to Use Heuristic Evaluation

Before User Testing (Prototype Cleanup)

On Legacy Products (Building a Backlog)

After Major Redesigns (Quick Validation)

What Heuristic Evaluation Cannot Do

The Evaluation Template

Running a Group Debrief

What This Means for Practice

Free Research Handbook

Related Resources

The Perfect Usability Test Script

AI-Moderated Interviews: The 'Rag Rug' Data Problem

Accessibility Testing: Beyond Compliance

Ready to Take Action?

Heuristic Evaluation: The Audit Before the Test

Summary

The "Save Your Budget" Rule

The Problem

The Solution

The ROI

The Framework: Nielsen's 10 Heuristics

The 10 Heuristics

Using the Heuristics

How to Execute

The Process

Severity Ratings

When to Use Heuristic Evaluation

Before User Testing (Prototype Cleanup)

On Legacy Products (Building a Backlog)

After Major Redesigns (Quick Validation)

What Heuristic Evaluation Cannot Do

The Evaluation Template

Running a Group Debrief

What This Means for Practice

Free Research Handbook

Related Resources

The Perfect Usability Test Script

AI-Moderated Interviews: The 'Rag Rug' Data Problem

Accessibility Testing: Beyond Compliance

Ready to Take Action?