[1]Hagai Attias. A variational Bayesian framework for graphical models. In Sara A. Solla, Todd K. Leen, and Klaus-Robert Müller, editors, Advances in Neural Information Processing Systems 12, 209–215. Denver, Colorado, USA, 2000.
[2]David Barber. Bayesian Reasoning and Machine Learning. Cambridge University Press, 2012.
[3]Matthew J. Beal. Variational algorithms for approximate Bayesian inference. PhD thesis, Gatsby Computational Neuroscience Unit, University College London, 2003.
[4]Christopher M. Bishop. Pattern Recognition and Machine Learning. Information Science and Statistics. Springer, 2nd edition, 2006.
[5]Richard T. Cox. The Algebra of Probable Inference. Johns Hopkins University Press, 1961.
[6]Charles W. Fox and Stephen J. Roberts. A tutorial on variational Bayesian inference. Artificial Intelligence Review, 38(2):85–95, jun 2012. doi:10.1007/s10462-011-9236-8.
[7]Andrew Gelman, John B. Carlin, Hal S. Stern, and Donald B. Rubin. Bayesian Data Analysis. Texts in Statistical Science. Chapman and Hall/CRC, Florida, 2nd edition, 2003.
[8]James Hensman, Magnus Rattray, and Neil D. Lawrence. Fast variational inference in the conjugate exponential family. In F. Pereira, C.J.C. Burges, L. Bottou, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems 25, 2888–2896. Lake Tahoe, Nevada, USA, 2012.
[9]James Hensman, Max Zwiessele, and Neil Lawrence. Tilted variational Bayes. In Samuel Kaski and Jukka Corander, editors, Proceedings of the 17th International Conference on Artificial Intelligence and Statistics, 356–364. Reykjavik, Iceland, apr 2014.
[10]Matthew D. Hoffman, David M. Blei, Chong Wang, and John Paisley. Stochastic variational inference. Journal of Machine Learning Research, 14:1303–47, 2013.
[11]Antti Honkela, Tapani Raiko, Mikael Kuusela, Matti Tornio, and Juha Karhunen. Approximate Riemannian conjugate gradient learning for fixed-form variational Bayes. Journal of Machine Learning Research, 11:3235–3268, 2010.
[12]E. T. Jaynes. Probability Theory: The Logic of Science. Cambridge University Press, 2003.
[13]Michael I. Jordan, Zoubin Ghahramani, Tommi S. Jaakkola, and Lawrence K. Saul. An introduction to variational methods for graphical methods. In Machine Learning, 183–233. MIT Press, 1998.
[14]David A. Knowles and Tom Minka. Non-conjugate variational message passing for multinomial and binary regression. In J. Shawe-Taylor, R.S. Zemel, P.L. Bartlett, F. Pereira, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems 24, 1701–1709. Granada, Spain, 2011.
[15]Thomas Minka. Expectation propagation for approximate Bayesian inference. In Jack S. Breese and Daphne Koller, editors, Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 362–369. Seattle, Washington, USA, aug 2001.
[16]Kevin P. Murphy. Machine Learning: A Probabilistic Perspective. MIT Press, 2012.
[17]H. Rue, S. Martino, and N. Chopin. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society Series B, 71(2):319–392, apr 2009.
[18]Yee W. Teh, David Newman, and Max Welling. A collapsed variational Bayesian inference algorithm for latent Dirichlet allocation. In Bernhard Schölkopf, John Platt, and Thomas Hoffman, editors, Advances in Neural Information Processing Systems 19, 1353–1360. Vancouver, Canada, 2007.
[19]John Winn and Christopher M. Bishop. Variational message passing. Journal of Machine Learning Research, 6:661–694, 2005.