A/B Test Statistical Significance Calculator

Want to run an online A/B test? Calculate its significance with our Bayesian-powered calculator built for experimentation, CRO, and UX Optimization.

Zahl der Besucher Zahl der Conversions

Control

Zahl der Besucher

Zahl der Conversions

Variation

Zahl der Besucher

Zahl der Conversions

Testing Objective: Better

MDE: 20%

ROPE: 1%

Testing Objective

Better

Better or Equivalent

Minimum Detectable Effect (MDE)

A value for this field is required.

Region of Practical Equivalence (ROPE)

A value for this field is required.

Variations	Unique Conversions / Visitors	Expected Conversion Rate	Improvement	Decision Probabilities Decision probabilities show the chance of your variation being worse, equivalent or better.
C Control			Baseline	-
V Variation				Worse Equivalent Better

An improvement of ↑ 4.1% can be expected with a 95% probability of being better.

Variations	Unique Conversions / Visitors	Expected Conversion Rate	Improvement	Significance Value This value represents the probability that the observed improvement is simply due to random chance. A lower value indicates greater confidence that the difference is statistically significant.
C Control			Baseline	-
V Variation				-

P-Value

(Bereich von 0-1)

0.334

Significance

What is statistical significance?

Statistical significance quantifies whether a result obtained is likely due to chance or some factor of interest. You can utilize a significance calculator to confidently evaluate and interpret your results. The data-driven approach empowers you to make informed decisions for optimization and ultimately achieve your desired outcomes.

Different approaches to calculate statistical significance

The two commonly used approaches are Frequentist and Bayesian, which power different test statistics calculators. Here is a brief information on each.

Frequentist

Frequentist statistics involves examining the frequency of events. It looks at how often something happens in a given situation.

Bayesian

Bayesian statistics is all about changing your opinion. It starts with an initial belief called the prior, which comes from domain expertise. Then, you gather evidence to support or challenge that belief. After examining the evidence, your opinion should be updated based on the new information you’ve acquired.

How do we calculate statistical significance?

Here is how you can calculate statistical significance using the Frequentist and Bayesian approaches.

Frequentist approach

The Frequentist approach to statistical significance is based on the p-value. To determine the p-value using the Frequentist approach, you will need the following key pieces of information:

The mean difference between the two hypotheses
The standard deviation of the difference between the two hypotheses
The sample size of each hypothesis

Once you have gathered this necessary data, you can easily compute the p-value using a significance testing tool or significance level calculator. If the p-value you get is 0.05, the probability of the variation is 5%.

Bayesian Approach

In this, you do statistical significance calculation through posterior probability. The posterior probability considers the available data and represents the probability of hypothesis A or B being true. The approach takes into account both prior beliefs and the evidence observed in the data.

To calculate the posterior probability, Bayes' theorem is employed. Bayes' theorem is a mathematical formula that combines the prior probability of a hypothesis with the likelihood of the data to derive the updated or posterior probability.

A posterior probability of greater than 95% is considered to be strong evidence in favor of the hypothesis.

Know more about our Bayesian approach in the article series we created on Bayesian A/B testing at VWO.

Darum verwenden wir Bayes'sche Statistik

Intuitive Testberichte

At VWO, we recognize that non-statistical users often misinterpret the frequentist p-value as a Bayesian posterior probability, which leads to incorrect conclusions about the superiority of one variation over another. To address this issue, we developed the industry's first Bayesian statistical engine.

Our Bayesian statistical engine provides users with easily understandable results, eliminating the risk of making mistakes while conducting A/B tests on revenue or other crucial key performance indicators (KPIs). By adopting a more intuitive approach, we ensure that our users obtain accurate insights from our A/B test statistical significance calculator.

With our solution, you can have confidence in the statistical significance of your results, making informed decisions to optimize your testing, revenue generation, and overall user experience. Say goodbye to misinterpretations and embrace the power of the Bayesian A/B test calculator for reliable and actionable A/B testing outcomes.

Get clear comparisons against the baseline

Identify if variations outperform, match, or fall below the baseline with our advanced stats engine. Results are declared at a 95% probability threshold. Detailed graphs offer clear insights for effective decision-making.

Kommen Sie schneller an umsetzbare Resultate

VWO offers two testing objectives, "Better" and "Better or Equivalent," both built on ROPE (Region of Practical Equivalence), which offers asymmetric power in tests.

The "Better" objective helps you quickly identify underperforming variations. This speeds up your evaluations and lets you focus on implementing successful changes faster.

The "Better or Equivalent" objective delivers quicker experiment results when you're looking for either an improvement over the baseline or a performance that's essentially the same.

Häufig gestellte Fragen

Die Nullhypothese gibt an, dass zwischen der Kontrolle und der Variation kein Unterschied besteht. Das bedeutet im Prinzip, dass die Konversionsrate der Variation ähnlich sein wird wie die Konversionsrate der Kontrolle.

Der P-Wert ist definiert als die Wahrscheinlichkeit, ein mindestens so extremes Ergebnis wie die beobachteten zu erhalten, vorausgesetzt, die Nullhypothese ist korrekt, wobei die Variante und Kontrolle bei der Nullhypothese beim A/B-Testing gleich sind.

Die statistische Signifikanz misst, ob ein erhaltenes Ergebnis aufgrund einer Möglichkeit oder einer interessanten Tatsache wahrscheinlich ist. Wenn ein Ergebnis signifikant ist, bedeutet das im Prinzip, dass man darauf vertrauen kann, dass eine Differenz real ist (und man nicht einfach nur Glück oder Pech bei der Auswahl des Beispiels hatte).

Die statistische Trennschärfe ist die Wahrscheinlichkeit, eine Folge zu finden, wenn die Folge real ist. Eine statistische Trennschärfe von 80 % bedeutet also, dass bei 100 Tests mit unterschiedlichen Variationen 20 Tests zu dem Schluss kommen werden, dass die Variationen gleich sind und keine Folge existiert.

Features (+125 more)

Features (+120 more)

A/B Test Statistical Significance Calculator

What is statistical significance?

Different approaches to calculate statistical significance

Frequentist

Bayesian

How do we calculate statistical significance?

Frequentist approach

Bayesian Approach

Darum verwenden wir Bayes'sche Statistik

Intuitive Testberichte

Get clear comparisons against the baseline

Kommen Sie schneller an umsetzbare Resultate

Häufig gestellte Fragen

A/B-Tests schnell und einfach durchführen. Jetzt VWO kostenlos ausprobieren.

Obwohl unsere Demo, die gesamte VWO-Plattform umfasst, geben Sie uns bitte einige Details, damit wir die Demo für Sie personalisieren können.

Select the capabilities that you would like us to emphasise on during the demo.

Welche dieser Optionen trifft auf Sie zu?

Bitte teilen Sie mit uns die Use-Cases, Ziele oder Bedürfnisse, die Sie zu lösen versuchen.

Bitte geben Sie die URL Ihrer Website oder App ein.