Variant B’s conversion rate (1.14%) was 14% higher than variant A’s conversion rate (1.00%). You can be 95% confident that variant B will perform better than variant A.

Power

86.69%

p value

0.0157

Statistical significance is important when running A/B tests because it ensures your results are certain and didn’t happen by chance.

Gather quick answers using the SurveyMonkey A/B testing calculator above.

What is A/B testing, and why is it important?

Overview of A/B testing and its purpose

A/B testing, or split testing, compares the performance of two versions—such as a product concept or ad creative—to identify which variant is more appealing to your target audience.

Why do we use A/B testing?

Researchers, CX professionals, and marketing experts use A/B testing to test small changes, like a new website button or homepage design. It provides direct feedback and data to guide decisions on which variant to choose.

What is statistical significance?

In A/B tests, statistical significance measures the likelihood that the difference between the control and test versions is genuine and not due to error or random chance.

For example, if you run a test with a 95% significance level, you can be 95% confident that the differences are authentic.

Statistical significance is used to observe how experiments affect your business’s conversion rates. In surveys, statistical significance is used to ensure confidence in results.

For example, if you asked whether people preferred ad concept A or ad concept B in a survey, you’d want to ensure the difference in their results was statistically significant before deciding which ad concept to use.

Let us do the math for you. Get automated statistical significance with an Advantage plan. See pricing.

How to calculate statistical significance in 7 steps

Step 1: Form a hypothesis

First, you must form a hypothesis. For any experiment, there is a null hypothesis, which states there’s no relationship between the two things you’re comparing, and an alternative hypothesis.

An alternative hypothesis typically tries to prove that a relationship exists and supports the statement you’re trying to make.

For example, if you are doing conversion rate A/B testing, your hypotheses may be:

Null hypothesis (H₀): Adding a new button to the webpage does not affect conversion rates.
Alternative hypothesis (H₁): Adding a new button to the webpage increases conversion rates

After formulating null and alternative hypotheses, statisticians sometimes do tests to ensure their hypotheses are sound.

A z-score represents confidence level and evaluates the validity of your null hypothesis, which can tell you if there is, in fact, no relationship between the things you’re comparing. A p-value tells you whether the evidence you have to prove your alternative hypothesis is strong.

Step 2: Choose the type of test

Next, decide whether your test will be one-sided or two-sided (sometimes called one-tailed or two-tailed). A one-sided test assumes that your alternative hypothesis will have a directional effect, while a two-sided test accounts for if your hypothesis could have a negative effect on your results.

For instance, in the conversion rate A/B testing example, your test could be:

One-sided: Assumes the effect will be in one direction (e.g., an increase in conversion rates).
Two-sided: Assumes the effect could be in either direction (e.g., an increase or decrease in conversion rates).

Step 3: Collect and prepare data

Next, you will collect the results from your A/B test, including the relevant metrics for both the control (A) and test (B) versions.

In our example, the results of the A/B test could be:

Variant A: Out of 50,000 visitors, 500 users converted. Conversion rate of 1.00%
Variant B: Out of 50,000 visitors, 570 users converted. Conversion rate of 1.14%.

Step 4: Perform statistical tests

Then, you will calculate the z-score which measures how far the observed results are from the null hypothesis to determine if the difference between A and B is statistically significant.

Additionally, you will calculate the p-value, which indicates the probability that the observed difference is due to random chance. A smaller p-value suggests stronger evidence against the null hypothesis.

In our example:

z-score is 14%
p-value is 0.0157

Step 5: Determine statistical significance

To determine statistical significance, set a significance level (alpha). This is commonly set at 0.05 (5%), representing the acceptable risk level for incorrectly rejecting the null hypothesis.

Next, compare your p-value to the alpha level. If the p-value is less than the alpha level, reject the null hypothesis and conclude the difference is statistically significant.

In our example, the p-value is less than the alpha level, meaning the difference of 14% is statistically significant.

Step 6: Interpret results

Now, it’s time to interpret the results. If you receive significant results, it indicates that the observed difference is unlikely due to chance, providing evidence supporting the alternative hypothesis. Non-significant results indicate insufficient evidence to reject the null hypothesis, meaning the observed difference could be due to random variations.

Step 7: Use tools for calculation

For the most efficient process, use tools for calculation such as:

Calculator: Utilize the A/B testing calculator at the top of the page for quick and accurate results.
Statistical software: For more complex analyses, consider using statistical modeling software.

In summary, statistical significance validates your A/B testing results. Using statistical significance is important in making informed decisions based on A/B tests.

Check out the calculator at the top of the page to automatically calculate the significance of your survey results.