Bayesian Weighted Averages, and Why You Care

The search algorithms on CannaCentral were recently upgraded to use Bayesian weighted averages for results. When results are sorted by rating, this weighted value is used rather than the raw rating of each result.

Why do we do this, and why should you care?

Let’s say there are two and only two dispensaries in a given location. One has been in operation for years and is well liked. It has received 25 reviews, 20 of them five stars and five of them four stars, for an average rating of 4.8. When you are viewing the detailed listing page, this average is the number that matters most, because you aren’t comparing the dispensary to other results.

Now let’s say dispensary #2, which previously had no reviews, is updated to include a single five-star review. If we simply sorted by rating, this dispensary would be listed before the dispensary with a rating of 4.8 from 25 reviews. This would be great if we knew that the second dispensary was worthy of a five-star rating, but what if that review came from the dispensary owner’s mom?

To sort all of this out so you don’t have to, we have implemented Bayesian weighted averages on the site using this algorithm:
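In the notation of Wikipedia’s Bayesian average page, for a listing with n ratings x_1, ..., x_n, the formula takes this standard form (the specific values of C and m used on the site are not stated in this post):

$$
\bar{x} = \frac{C\,m + \sum_{i=1}^{n} x_i}{C + n}
$$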

This may look like Greek to you, but for you math fans out there, Wikipedia’s Bayesian average page describes this formula as follows:

Calculating the Bayesian average uses the prior mean m and a constant C. C is assigned a value that is proportional to the typical data set size. The value is larger when the expected variation between data sets (within the larger population) is small. It is smaller, when the data sets are expected to vary substantially from one another.

Using CannaCentral’s five-star rating system and the example from above, the first dispensary would receive a Bayesian value of 3.28 and the second dispensary a Bayesian value of 3.14. Although the second dispensary has the higher raw rating (5.0 versus 4.8), the formula gives more weight to the first dispensary’s larger number of reviews. The dispensary with the highest weighted average is listed first. Magic!
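For the curious, here is a minimal Python sketch of the two-dispensary example. It is not the site’s actual code: the `Listing` class is made up for illustration, and the values chosen for m (`PRIOR_MEAN`) and C (`PRIOR_WEIGHT`) are assumptions, since the real ones aren’t stated here.

```python
from dataclasses import dataclass

# Assumed values for illustration only; the site's real C and m are not given.
PRIOR_MEAN = 3.0    # m: assumed average rating across all listings
PRIOR_WEIGHT = 10   # C: assumed "typical" number of reviews per listing


@dataclass
class Listing:
    """Hypothetical container for a dispensary and its star ratings."""
    name: str
    ratings: list[int]

    @property
    def raw_average(self) -> float:
        # Plain mean of the ratings; 0.0 when there are no reviews yet.
        return sum(self.ratings) / len(self.ratings) if self.ratings else 0.0

    @property
    def bayesian_average(self) -> float:
        # (C * m + sum of ratings) / (C + number of ratings)
        total = PRIOR_WEIGHT * PRIOR_MEAN + sum(self.ratings)
        return total / (PRIOR_WEIGHT + len(self.ratings))


listings = [
    Listing("Dispensary #2", [5]),                 # one five-star review
    Listing("Dispensary #1", [5] * 20 + [4] * 5),  # 25 reviews, raw average 4.8
]

# Sorting by raw rating puts the single-review listing first...
by_raw = sorted(listings, key=lambda l: l.raw_average, reverse=True)
# ...while sorting by the weighted value puts the well-reviewed listing first.
by_bayes = sorted(listings, key=lambda l: l.bayesian_average, reverse=True)

for listing in by_bayes:
    print(f"{listing.name}: raw {listing.raw_average:.2f}, "
          f"weighted {listing.bayesian_average:.2f}")
```

With these assumed values, dispensary #1 scores about 4.29 and dispensary #2 about 3.18. The exact numbers differ from the ones quoted above because C and m are guesses, but the ordering comes out the same: the listing with 25 reviews ranks first.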


