Stats raving mad

The blog

A bayesian cinefeel…(*)

by M. Parzakonis on November 26, 2009

How imbd ranks movies?

The formula for calculating the Top Rated 250 Titles gives a true Bayesian estimate:

$latex WR=\frac{\nu }{\nu +m}R+\frac{m}{\nu +m}C$

where:

$latex R$ = average for the movie (mean) = (Rating)

$latex \nu$ = number of votes for the movie = (votes)

$latex m$ = minimum votes required to be listed in the Top 250 (currently 1500)

$latex C$ = the mean vote across the whole report (currently 6.9)

for the Top 250, only votes from regular voters are considered. (source)

Now that’s something unexpected! Going further than the simple arithmetic mean is something exciting, right?

The formula is the well-known decomposition of the posterior mean -distributed a priori as normal- compromising the prior guess ($latex C$) and the data ($latex R$) weighted by the sample ($latex \nu$) and the pretend-to-be initial sample ($latex m$) under the conjugate prior setting.

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="" highlight="">