{"id":347,"date":"2023-12-06T15:29:40","date_gmt":"2023-12-06T15:29:40","guid":{"rendered":"https:\/\/vwo.com\/stats-blog\/?p=347"},"modified":"2023-12-13T13:21:50","modified_gmt":"2023-12-13T13:21:50","slug":"the-fourth-factor-in-statistical-significance","status":"publish","type":"post","link":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/","title":{"rendered":"The Fourth Factor in Statistical Significance"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"585\" src=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-1024x585.png\" alt=\"\" class=\"wp-image-348\" style=\"width:840px;height:auto\" srcset=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-1024x585.png 1024w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-300x171.png 300w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-768x439.png 768w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-1536x878.png 1536w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-700x400.png 700w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png 1792w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Generated via Open AI Dall-E 3<\/figcaption><\/figure>\n\n\n\n<p>Let me take you through an interesting statistical puzzle. Suppose that there is a rare disease that infects 1 in 1000 people and scientists develop a test that has a 5% false positive rate. This means that the disease will be detected for sure if the patient is infected, but the test will also detect the disease with a 5% chance if the patient is not infected. The question is <strong>if you test positive for this disease, what is the chance that you are actually infected? <\/strong>The answer is 2%.<\/p>\n\n\n\n<p>So, let us break this down. If 1000 people get tested for this disease, there will be 1 truly infected person who will be detected as positive. Also, there will be 999 uninfected people, out of which almost 50 will be detected as positive due to the 5% false positive rate. So, 51 people in total will be detected as positive but only 1 of them will actually be infected. 1\/51 comes out to be roughly 2% which is the chance that you have the disease after you have been tested as positive by a fairly accurate testing mechanism.<\/p>\n\n\n\n<p>Most readers who are not well-versed in Bayesian statistical thinking will tend to find this result odd. But you can try sitting with this for a while and you will realize that the result is statistically sound. This is just how statistics work. It looks odd to us because it goes against our intuition. The key factor in the above problem that is creating the counter-intuition is the extremely low prevalence of the disease in the population.<\/p>\n\n\n\n<p>This is the fourth factor that affects all tests of statistical judgment. It is fundamentally different from the other <a href=\"https:\/\/vwo.com\/stats-blog\/the-forces-behind-statistical-significance\/\">three forces of statistical significance<\/a>, is highly counterintuitive to our understanding of probability, and is the cornerstone to understanding a distinct statistical ideology called the Bayesian statistics. In this post, I will explain this fourth factor and how it comes into play. To build intuition, I will use the context of testing for a disease. However, the ideas presented below apply to all statistical tests including the test of significance in experimentation.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Breaking down Statistical Accuracy<\/strong><\/h2>\n\n\n\n<p>At the very base, a statistical testing procedure is designed for two kinds of inputs: The subjects that are positive (have the disease) and the subjects that are negative (do not have the disease). A testing procedure usually starts by defining a statistic that is used to decide if the subject is positive or negative. In the case of a disease, this can be some observable property of the patient\u2019s body. Advanced tests will use a statistic derived from multiple parameters, but still they will be finally reduced to one number called the test statistic.&nbsp;<\/p>\n\n\n\n<p>Note that the underlying truth being tested is absolute, it is either positive or negative but not both. But it is hidden and what is observable is only the test statistic which is correlated to the truth but not an absolute representation. Hence, the test statistic is distributed over a range of values for the two types of subjects. The chosen test statistic is effective only if it separates the positives and negatives into two lumps. Ideally, the distribution of the test statistic should be something like the following:\u00a0<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"483\" src=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1-1024x483.jpeg\" alt=\"\" class=\"wp-image-355\" srcset=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1-1024x483.jpeg 1024w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1-300x141.jpeg 300w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1-768x362.jpeg 768w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1-1536x724.jpeg 1536w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1-800x377.jpeg 800w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_1-1.jpeg 1796w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">The x-axis represents the statistic value, the y-axis represents its frequency.<\/figcaption><\/figure>\n<\/div>\n\n\n<p>The statistical test has to define a cutoff which will be used to predict if the subject is positive or negative. This cutoff gives rise to two forms of statistical accuracy that are common to all testing procedures.<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Accuracy for the positive subjects: <\/strong>The positive subjects are divided into two groups by the cutoff. These are the true positives (above the cutoff) and the false negatives (below the cutoff). Together the true positive rate and the false negative rate sum up to 1.<\/li>\n<\/ol>\n\n\n\n<ol class=\"wp-block-list\" start=\"2\">\n<li><strong>Accuracy for the negative subjects: <\/strong>The negative subjects are also divided into two groups. These are the true negatives (below the cutoff) and the false positives (above the cutoff). Similarly, the true negative rate and the false positive rate sum up to 1.<\/li>\n<\/ol>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"512\" src=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_2-1024x512.jpeg\" alt=\"\" class=\"wp-image-350\" srcset=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_2-1024x512.jpeg 1024w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_2-300x150.jpeg 300w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_2-768x384.jpeg 768w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_2-800x400.jpeg 800w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_2.jpeg 1185w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">The x-axis represents the statistic value, the y-axis represents its frequency.<\/figcaption><\/figure>\n<\/div>\n\n\n<p>This is the conventional approach for studying the accuracy of any statistical test. False Positives and False Negatives are also known as Type-1 and Type-2 errors in hypothesis testing. Reporting one of the measures from each category defines the accuracy of the testing mechanism.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>A More Relevant Question for Accuracy<\/strong><\/h2>\n\n\n\n<p>However, if you look at things from a different lens there is a very relevant question of accuracy that gets missed out in the conventional approach. Observe the two questions below and think about how they are different.&nbsp;<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>What is the chance that the patient is detected as positive if he actually has the disease?&nbsp;<\/strong><\/li>\n<\/ol>\n\n\n\n<ol class=\"wp-block-list\" start=\"2\">\n<li><strong>What is the chance that a patient has the disease if he has been detected as positive?<\/strong><\/li>\n<\/ol>\n\n\n\n<p>Note that while the answers to the two questions look similar, the answers can be drastically different. Let us first decipher what the second question asks for and how it is different from the first. Observe the image below.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"855\" height=\"883\" src=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_3-2.jpeg\" alt=\"\" class=\"wp-image-358\" srcset=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_3-2.jpeg 855w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_3-2-290x300.jpeg 290w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_3-2-768x793.jpeg 768w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_3-2-387x400.jpeg 387w\" sizes=\"(max-width: 855px) 100vw, 855px\" \/><figcaption class=\"wp-element-caption\">The x-axis represents the statistic value, the y-axis represents its frequency.<\/figcaption><\/figure>\n\n\n\n<p>The first question asks how many of the red cases fall above the cutoff. On the other hand, the second question asks how many of the cases that fall above the cutoff are actually positive (red). It is known as the True Discovery Rate.&nbsp;<\/p>\n\n\n\n<p>The True Discovery Rate can be drastically different from the True Positive Rate depending on how many red items are there in the population. If the mix is densely populated with blue items, the true discovery rates are low. If the mix is densely populated with red items, the true discovery rates are high. The figure below shows what happens when we change the proportion of the red and blue items in the population.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"792\" height=\"1024\" src=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4-792x1024.jpeg\" alt=\"\" class=\"wp-image-352\" srcset=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4-792x1024.jpeg 792w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4-232x300.jpeg 232w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4-768x993.jpeg 768w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4-1188x1536.jpeg 1188w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4-309x400.jpeg 309w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_4.jpeg 1193w\" sizes=\"(max-width: 792px) 100vw, 792px\" \/><figcaption class=\"wp-element-caption\">The x-axis represents the statistic value, the y-axis represents its frequency.<\/figcaption><\/figure>\n<\/div>\n\n\n<p>The proportion of this mix can be referred to by different names, one of them being the base rate of the phenomenon being tested. Base Rates are essentially the proportion of positives in the total population. They do not matter to the conventional measures of accuracy by design, but they matter a lot to the second type of question. The second type of question is more relevant in practice because you do not know in advance which subjects are positive and which subjects are negative.<\/p>\n\n\n\n<p>Base Rates are the fourth factor in the determination of statistical significance but they get overlooked because they are very hard to measure. In future posts, we will discuss more on base rates and their implications on statistics and experimentation.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Conclusion<\/h2>\n\n\n\n<p>Base Rates form the fundamental difference between two distinct schools of thought in statistics, the Frequentists and the Bayesians. Frequentists have deep philosophical problems with the base rates because the base rates can never be empirically tested. In other words, there is no way to test if a chosen base rate is correct or not. Bayesians are more fond of addressing the second type of questions rather than the first. Bayesians hence have to define a prior that in practice refers to the base rates. But priors have often been misinterpreted and if defined wrongly, they bias the results. The debate between the Frequentists and the Bayesians has gone on ever since and we will explore it further in future posts.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"676\" height=\"1024\" src=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_5-676x1024.png\" alt=\"\" class=\"wp-image-353\" srcset=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_5-676x1024.png 676w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_5-198x300.png 198w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_5-768x1163.png 768w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_5-264x400.png 264w, https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_fourth_factor_5.png 936w\" sizes=\"(max-width: 676px) 100vw, 676px\" \/><figcaption class=\"wp-element-caption\">Source: <a href=\"https:\/\/xkcd.com\/1132\/\">xkcd<\/a><\/figcaption><\/figure>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>An introduction to base rates &#8211; the fundamental difference between Bayesian and Frequentist schools of thought. Understand when and why base rates matter in statistical testing.<\/p>\n","protected":false},"author":528,"featured_media":348,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[6],"tags":[],"class_list":["post-347","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-statistics"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>The Fourth Factor in Statistical Significance - VWO Stats Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Fourth Factor in Statistical Significance - VWO Stats Blog\" \/>\n<meta property=\"og:description\" content=\"An introduction to base rates - the fundamental difference between Bayesian and Frequentist schools of thought. Understand when and why base rates matter in statistical testing.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\" \/>\n<meta property=\"og:site_name\" content=\"VWO Stats Blog\" \/>\n<meta property=\"article:published_time\" content=\"2023-12-06T15:29:40+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-12-13T13:21:50+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-1024x585.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"585\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Ishan Goel\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ishan Goel\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\"},\"author\":{\"name\":\"Ishan Goel\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/e2b5d8a142dd1f391ee6a9aee14caf72\"},\"headline\":\"The Fourth Factor in Statistical Significance\",\"datePublished\":\"2023-12-06T15:29:40+00:00\",\"dateModified\":\"2023-12-13T13:21:50+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\"},\"wordCount\":1243,\"commentCount\":0,\"image\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png\",\"articleSection\":[\"Statistics\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\",\"url\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\",\"name\":\"The Fourth Factor in Statistical Significance - VWO Stats Blog\",\"isPartOf\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png\",\"datePublished\":\"2023-12-06T15:29:40+00:00\",\"dateModified\":\"2023-12-13T13:21:50+00:00\",\"author\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/e2b5d8a142dd1f391ee6a9aee14caf72\"},\"breadcrumb\":{\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage\",\"url\":\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png\",\"contentUrl\":\"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/vwo.com\/stats-blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Fourth Factor in Statistical Significance\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/#website\",\"url\":\"https:\/\/vwo.com\/stats-blog\/\",\"name\":\"VWO Stats Blog\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/vwo.com\/stats-blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/e2b5d8a142dd1f391ee6a9aee14caf72\",\"name\":\"Ishan Goel\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/6117b4ba8d739a72b4b89659e16d0e34e11ce78a6718d78b71743eee5628bc9d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/6117b4ba8d739a72b4b89659e16d0e34e11ce78a6718d78b71743eee5628bc9d?s=96&d=mm&r=g\",\"caption\":\"Ishan Goel\"},\"description\":\"Ishan Goel leads research in statistical significance at VWO. In the past few years, he has researched over the most pressing problems of experimentation and designed a new stats engine at VWO. He actively shares his learnings on the VWO Stats Blog with the broader experimentation community.\",\"sameAs\":[\"https:\/\/www.linkedin.com\/in\/i-goel\/\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The Fourth Factor in Statistical Significance - VWO Stats Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/","og_locale":"en_US","og_type":"article","og_title":"The Fourth Factor in Statistical Significance - VWO Stats Blog","og_description":"An introduction to base rates - the fundamental difference between Bayesian and Frequentist schools of thought. Understand when and why base rates matter in statistical testing.","og_url":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/","og_site_name":"VWO Stats Blog","article_published_time":"2023-12-06T15:29:40+00:00","article_modified_time":"2023-12-13T13:21:50+00:00","og_image":[{"width":1024,"height":585,"url":"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image-1024x585.png","type":"image\/png"}],"author":"Ishan Goel","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Ishan Goel"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#article","isPartOf":{"@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/"},"author":{"name":"Ishan Goel","@id":"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/e2b5d8a142dd1f391ee6a9aee14caf72"},"headline":"The Fourth Factor in Statistical Significance","datePublished":"2023-12-06T15:29:40+00:00","dateModified":"2023-12-13T13:21:50+00:00","mainEntityOfPage":{"@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/"},"wordCount":1243,"commentCount":0,"image":{"@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage"},"thumbnailUrl":"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png","articleSection":["Statistics"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/","url":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/","name":"The Fourth Factor in Statistical Significance - VWO Stats Blog","isPartOf":{"@id":"https:\/\/vwo.com\/stats-blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage"},"image":{"@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage"},"thumbnailUrl":"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png","datePublished":"2023-12-06T15:29:40+00:00","dateModified":"2023-12-13T13:21:50+00:00","author":{"@id":"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/e2b5d8a142dd1f391ee6a9aee14caf72"},"breadcrumb":{"@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#primaryimage","url":"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png","contentUrl":"https:\/\/static.wingify.com\/gcp\/uploads\/sites\/20\/2023\/12\/12_cover_image.png","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/vwo.com\/stats-blog\/the-fourth-factor-in-statistical-significance\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/vwo.com\/stats-blog\/"},{"@type":"ListItem","position":2,"name":"The Fourth Factor in Statistical Significance"}]},{"@type":"WebSite","@id":"https:\/\/vwo.com\/stats-blog\/#website","url":"https:\/\/vwo.com\/stats-blog\/","name":"VWO Stats Blog","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/vwo.com\/stats-blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/e2b5d8a142dd1f391ee6a9aee14caf72","name":"Ishan Goel","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/vwo.com\/stats-blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/6117b4ba8d739a72b4b89659e16d0e34e11ce78a6718d78b71743eee5628bc9d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/6117b4ba8d739a72b4b89659e16d0e34e11ce78a6718d78b71743eee5628bc9d?s=96&d=mm&r=g","caption":"Ishan Goel"},"description":"Ishan Goel leads research in statistical significance at VWO. In the past few years, he has researched over the most pressing problems of experimentation and designed a new stats engine at VWO. He actively shares his learnings on the VWO Stats Blog with the broader experimentation community.","sameAs":["https:\/\/www.linkedin.com\/in\/i-goel\/"]}]}},"_links":{"self":[{"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/posts\/347","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/users\/528"}],"replies":[{"embeddable":true,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/comments?post=347"}],"version-history":[{"count":5,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/posts\/347\/revisions"}],"predecessor-version":[{"id":362,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/posts\/347\/revisions\/362"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/media\/348"}],"wp:attachment":[{"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/media?parent=347"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/categories?post=347"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/vwo.com\/stats-blog\/wp-json\/wp\/v2\/tags?post=347"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}