CRO Strategy Intermediate

PIE Framework

A CRO prioritization model that scores potential tests by Potential, Importance, and Ease to identify the highest-ROI experiments to run first.

By Mario Kuren Updated

The PIE Framework is a structured method for prioritizing which A/B tests and CRO experiments to run first, developed by Chris Goward at WiderFunnel.

Each test idea is scored across three dimensions:

  • Potential — how much improvement is possible?
  • Importance — how significant is this page or element to the business?
  • Ease — how simple is it to implement and run?

PIE Score = (Potential + Importance + Ease) ÷ 3

Tests are ranked by PIE score and run in order from highest to lowest.

Scoring Each Dimension

Potential (1–10)

How much room for improvement exists on this page or element?

Signals of high potential:

  • Conversion rate significantly below industry benchmark
  • High exit rate on pages that should retain visitors (>60%)
  • Rage clicks or dead clicks visible in session recordings
  • User testing reveals persistent confusion at this step
  • Visitor survey responses mention frustration with this element
  • Heatmaps show visitors ignoring the primary CTA

Score 9–10: Page or element has obvious, severe problems. Major improvement opportunity. Score 5–6: Some issues present but page performs reasonably well. Score 1–2: Page already performing well, limited upside.

Importance (1–10)

How much revenue or traffic flows through this page or element?

Signals of high importance:

  • High monthly sessions (this is where most of your audience lands)
  • Direct connection to conversion (checkout, pricing page, primary CTA)
  • High revenue per visitor (paid traffic landing page)
  • Part of the critical path — visitors must pass through this page to convert

Score 9–10: Your single highest-traffic or highest-revenue page. Every 1% improvement here multiplies across thousands of monthly visitors. Score 5–6: Moderate traffic; contributing to conversion pipeline but not the primary bottleneck. Score 1–2: Low traffic, niche page with minimal revenue impact.

Ease (1–10)

How easily can this change be implemented and tested?

Score 9–10: Copy change, button text update, trust badge addition — minimal development, deployed in hours. Score 5–6: Layout change, new section, modest design work — few days development. Score 1–2: Full page redesign, new checkout flow, major back-end change — weeks of development and QA.

PIE Scoring Example

Test ideaPotentialImportanceEasePIE Score
Rewrite checkout page headline8998.7
Add trust badges to checkout7988.0
Simplify homepage navigation7856.7
Add social proof to pricing page6877.0
Redesign product category page6735.3
A/B test blog CTA copy4395.3
Reduce form fields on lead gen page8687.3

The checkout headline rewrite scores first: high potential (headline is a known high-leverage element), highest importance (checkout is where purchases happen), and easy to implement. The blog CTA test scores identically to the category page redesign but represents dramatically less potential revenue — a reminder that PIE score alone doesn’t tell the whole story.

PIE vs ICE Framework

FrameworkDimensionsBest for
PIEPotential, Importance, EaseIdentifying which pages to focus on first
ICEImpact, Confidence, EaseRanking specific hypotheses by certainty

Many CRO programs use PIE to select pages and ICE to prioritize individual tests within a page.

ICE scoring in practice:

  • Impact: expected CVR lift if the hypothesis is correct (1–10)
  • Confidence: strength of evidence backing the hypothesis (1–10)
  • Ease: implementation effort (1–10)

A hypothesis with strong research backing (session recordings, user testing, heatmaps all pointing to the same problem) scores high on Confidence. A hypothesis based on a gut feeling scores low. ICE builds research-backing into the prioritization system.

Data Sources for PIE Scoring

PIE DimensionData sourceWhat to look for
PotentialGA4 exit rate>60% exit rate = high potential
PotentialSession recordingsRage clicks, form abandonment patterns
PotentialCVR vs industry benchmark50%+ below average = high potential
PotentialHeatmapsIgnored CTAs, scroll depth drop-offs
ImportanceGA4 page trafficMonthly sessions count
ImportanceRevenue flowPages in direct checkout path
ImportanceAd spend destinationPaid traffic landing pages score highest
EaseDevelopment estimatesTime to implement + QA

Grounding each dimension in actual data before scoring reduces the team disagreement that undermines PIE’s usefulness as a prioritization tool.

Building and Maintaining a PIE Backlog

A well-run PIE backlog follows this cycle:

  1. Discovery — Pull low-CVR pages from GA4, identify rage clicks in Hotjar/FullStory, review user survey responses
  2. Hypothesis formation — Convert each problem into a testable hypothesis: “If we [change X], then [metric Y] will improve because [reason Z]”
  3. Scoring — Score each hypothesis on PIE (or ICE) using data anchors, not intuition
  4. Ranking — Sort by score, review top 5 for feasibility
  5. Execution — Run the highest-scored test that’s ready to build
  6. Review — Quarterly re-score of inactive backlog items as site data changes

A backlog of 10–30 items is manageable. Beyond 30, low-priority items become stale and the backlog loses credibility as a prioritization tool.

Limitations of PIE

PIE is a subjective scoring system — different team members will score the same test differently. Reduce subjectivity by:

  • Anchoring Potential scores to actual analytics data (exit rate, CVR relative to benchmark)
  • Anchoring Importance scores to actual traffic or revenue numbers
  • Having multiple team members score independently and averaging

Common PIE mistake: Over-weighting Ease. A high-importance, low-ease test (checkout redesign) will often produce more revenue than 10 high-ease, low-importance tests (blog sidebar CTAs). Ease is a tiebreaker between otherwise equal options — not a reason to skip high-importance work.

Weighted PIE: Some teams assign weights to dimensions (e.g., Importance × 2, Potential × 1.5, Ease × 1) to prevent Ease from dominating the ranking. This is a reasonable modification for teams where Ease scores consistently crowding out high-importance, high-potential tests.

PIE is a structured way to have an informed argument about priorities — not a mathematical truth.

PIE in the Context of a Full CRO Programme

PIE is most effective when paired with a strong research phase. Running PIE scoring without analytics data, session recordings, and user research produces subjective guesses formatted as numbers. Running it with solid data produces informed prioritization.

The research → PIE scoring → testing → learning → research cycle is the engine of a systematic CRO programme. For running tests once priorities are set, see A/B Testing Best Practices. For how PIE fits within the broader CRO strategy, see the full methodology overview.

Frequently Asked Questions

What is the PIE framework in CRO?

The PIE framework is a test prioritization method developed by Chris Goward at WiderFunnel. It scores each potential test idea on three dimensions: Potential (how much improvement is possible on this page), Importance (how much traffic and revenue does this page drive), and Ease (how simple is it to implement and test). Each dimension is scored 1–10, scores are averaged, and tests are ranked by PIE score. The highest PIE score gets tested first.

How do I calculate a PIE score?

Score each test candidate on three dimensions from 1 to 10: Potential — how much room for improvement is there? (low CVR, high exit rate, clear UX problems = high potential), Importance — how much of your revenue or traffic flows through this page? (homepage, checkout = high importance), Ease — how easy is this to design, build, and test? (copy change = easy, full redesign = hard). Add the three scores and divide by 3 to get the PIE score. Prioritize the highest scores.

What is the difference between PIE and ICE framework?

Both PIE and ICE are CRO prioritization frameworks. PIE (Potential, Importance, Ease) focuses on page-level opportunity. ICE (Impact, Confidence, Ease) focuses on hypothesis-level certainty — Impact is the expected conversion uplift, Confidence is how sure you are the change will work (based on data and research), and Ease is implementation effort. ICE is often preferred when you have strong research backing individual hypotheses; PIE is more useful when ranking pages or areas to investigate first.

What data should I use to score PIE dimensions objectively?

Potential: use exit rate (pages with >60% exit rate have high potential), CVR relative to your average (pages converting 50% below average have high potential), and session recording observations (pages with rage clicks or form abandonment have high potential). Importance: use traffic volume from GA4 (high-traffic pages score higher) and revenue attribution (pages in the direct checkout path score highest). Ease: use development time estimates from your team. Anchoring scores to actual data reduces the subjectivity that makes PIE assessments inconsistent between team members.

What are the main limitations of the PIE framework?

PIE is a subjective scoring system — different team members will score the same test differently. It also weights all three dimensions equally, but Importance is often more critical than Ease for revenue impact. A high-importance, low-ease test (checkout redesign) may produce better ROI than a high-PIE, low-importance test (blog sidebar CTA). Some teams use weighted PIE — giving Importance a higher multiplier — to account for this. PIE is best used as a structured conversation tool rather than a definitive mathematical ranking.

How many test ideas should be in a PIE backlog?

A practical PIE backlog has 10–30 scored ideas at any time. Fewer than 10 gives you too little to prioritize from. More than 30 becomes difficult to maintain and the low-priority items rarely get re-evaluated as data changes. Review and re-score the backlog quarterly — a page that had low Potential 6 months ago may have become high-Potential after a UX change. Ideas can come from: analytics data (exit rates, CVR by page), session recordings (rage clicks, form abandonment), user surveys, heatmaps, and competitive analysis.