/Docs

Viewing Rule Results

Every A/B test rule has a dedicated results page that opens when you click Results on a running rule. The page shows per-variation visitor and conversion counts, a time-series chart of the conversion rate over time, a summary of winning probability and observed lift, and a health-guardrails section that flags sample ratio mismatch, low traffic, and statistical confidence concerns.

Opening the results page

Results are available for any rule whose type is A/B Test. There are two entry points:

  • From the ruleset list, click the Results button on a running A/B test rule card.
  • From the rule detail panel, click View results in the actions menu.

Both entry points open a dedicated page at /projects/<projectId>/flags/<flagId>/rules/<ruleId>/results?env=<envKey>. The breadcrumb back-link returns you to the flag detail with the same env tab pre-selected.

Reading the summary cards

Three summary cards (or two, when the rule's primary metric doesn't track revenue) sit above the chart:

  • Winning Probability / Statistical Significance / Sequential Decision— engine-aware. Bayesian rules report the leading variation's probability-to-beat-control. Frequentist rules report the p-value and a Significant badge once the threshold is crossed. Sequential rules report “Safe to stop” or “Inconclusive” based on always-valid bounds.
  • Observed Lift— the leading variation's improvement over control, with the credible-interval range below.
  • Revenue Impact — the rolled-up incremental revenue when the rule has a revenue-tracking primary metric. Hidden when the metric is not revenue-related.

Reading the time-series chart

The conversion-rate chart bins exposures and conversions into 4-hour buckets, fixed across all date ranges. The bucket interval is finer than the experiments page (which uses daily buckets) so flag rules with steady traffic show a smoother trend line. Buckets with zero exposures render as gaps in the line — the chart never interpolates over absent data.

Date range cap
The custom-range picker caps at 90 days end-to-end. At the 90-day cap with five variations the chart draws ~2,700 points — comfortable for the chart engine but noticeable on slower devices.

Reading the variation table

The Primary Metric card expands to show a per-variation table. Each row reports the variation name, total visitors, total conversions, conversion rate, observed lift over the baseline, and the engine-appropriate significance signal. Use the Baseline dropdown in the filter bar to change which variation other rows are compared to — useful when you want to see how variation B performs relative to variation A instead of control.

Continuous metrics (e.g. time-on-page, average order value) shift the column headers from “Conversions” / “Conv. Rate” to “Events” / “Value / Visitor” so the numbers read correctly for the underlying measurement.

Filtering by attributes

Click the Attributes dropdown in the filter bar to slice variation performance by registered attributes — plan, country, deviceType, or any custom attribute your team has registered. The picker is grouped by Built-in (auto-detected from the user context) and Custom (registered explicitly in your project).

Each attribute group lists the distinct values observed in this rule's exposureswithin the active date range, merged with any suggested values you registered for that attribute. Pick one or more values to slice on. Attributes that haven't been observed yet (or whose exposures all fall outside your date range) are hidden from the picker.

Searching long lists

Groups with more than 20 values render the first 20 by default with a per-attribute search field above the list. Type to filter through up to the 200 most-frequent values for that attribute. For values past the cap (or values you know but haven't been recorded yet), use the Add custom valueinput below the list — type the value, press Enter, and it applies as a filter even if it isn't in the rendered list.

Combining filters

Multiple values for the same attribute combine with OR — selecting plan: pro and plan: enterprise shows visitors on either plan. Different attributes combine with AND — adding country: US on top of the plan filter narrows the slice to US visitors on those plans.

Bookmarking and sharing

Active filters appear in the URL as repeated ?attribute=key:value params. You can bookmark a sliced view, share it with a teammate, or paste it into a doc — opening the link restores the same filters on load. The breadcrumb back-link does not carry the filter forward; filters are page-scoped to the current rule.

Empty slice
If a slice has no visitors, the page shows “No visitors match these filters” in place of the variation tables, with a Clear filters button to return to the unfiltered view.

Health guardrails

Below the metric tables, the Flag Rule Health Guardrails section reports three signals:

  • Sample Ratio Mismatch (SRM) — chi-square test of observed vs configured traffic split. Green at p &geq; 0.01, yellow between 0.001 and 0.01, red below 0.001.
  • Statistical Confidence— engine-aware. Reports whether the leading variation has crossed the engine's decision threshold yet, or whether more data is needed.
  • Traffic Health — visitor-count floor. Green above 1,000 total visitors per variation; yellow between 100 and 1,000; red below 100.

See the Health Guardrails reference for the full description of each signal and what the red / yellow / green statuses mean in context.

Segment Lift on flag rules

Below the secondary metrics, the Segment Lift section answers “for whom did this rule help, and for whomdid it hurt?” — broken down by every registered flag attribute (plan, country, deviceType, custom attributes) instead of experiment segments. The same panel runs on the experiment results page and uses identical statistics (Benjamini-Hochberg correction across the family, chi-square interaction p-value per attribute, minimum-sample-size masking), so a single explanation covers both surfaces. See the Segment Lift reference for how the numbers are computed and how to interpret them.

On the flag-rule results page specifically, the panel respects any active attribute filter from the picker — HTE runs within the filtered slice. Exporting the rule's CSV (below) appends a Segment Lift section with per-cell rows whenever the panel is populated.

Exporting results

Click Export CSV in the page header to download a CSV of every variation across every metric, including continuous metrics with their distinct column labels. Filenames are derived from the rule name with non-alphanumeric characters replaced by dashes (e.g. New-Checkout-V2-results.csv). When the Segment Lift panel has data, the export also appends per-cell rows and Top Movers sections under a Segment Lift heading.