This is something very detailed for how Yahoo!'s today module is optimized for overall CTR. Internally the project was first named COKE but later rebranded as CORE (content optimization and relevence engine).
Here I only include some key points from the paper. Overall, the paper is written in a very succinct way and worth your time.
- mini batch learning in 5 minute segs;
- online models to track CTR including EMP, SS and OLR
- E&E setting with a random bucket
- it is a Bayesian version of logistic regression, Gaussian prior on parameters
- approximate inference with Laplace approximation scheme
- it may be simplified with uncorrelation assumption but the performance will suffer;
No comments:
Post a Comment