Pattern Recognition and Machine Learning

REFERENCES 727

G. Dorffner, H. Bischof, and K. Hornik (Eds.), Artificial Neural Networks – ICANN 2001, pp. 421–428. Springer.

Tipping, M. E. (1999). Probabilistic visualisation of
high-dimensional binary data. In M. S. Kearns,
S. A. Solla, and D. A. Cohn (Eds.),Advances
in Neural Information Processing Systems, Vol-
ume 11, pp. 592–598. MIT Press.

Tipping, M. E. (2001). Sparse Bayesian learning and
the relevance vector machine.Journal of Ma-
chine Learning Research 1 , 211–244.

Tipping, M. E. and C. M. Bishop (1997). Probabilis-
tic principal component analysis. Technical Re-
port NCRG/97/010, Neural Computing Research
Group, Aston University.

Tipping, M. E. and C. M. Bishop (1999a). Mixtures
of probabilistic principal component analyzers.
Neural Computation 11 (2), 443–482.

Tipping, M. E. and C. M. Bishop (1999b). Prob-
abilistic principal component analysis.Journal
of the Royal Statistical Society, Series B 21 (3),
611–622.

Tipping, M. E. and A. Faul (2003). Fast marginal
likelihood maximization for sparse Bayesian
models. In C. M. Bishop and B. Frey (Eds.),
Proceedings Ninth International Workshop on
Artificial Intelligence and Statistics, Key West,
Florida.

Tong, S. and D. Koller (2000). Restricted Bayes op-
timal classifiers. InProceedings 17th National
Conference on Artificial Intelligence, pp. 658–

AAAI.

Tresp, V. (2001). Scaling kernel-based systems to
large data sets.Data Mining and Knowledge Dis-
covery 5 (3), 197–211.

Uhlenbeck, G. E. and L. S. Ornstein (1930). On the
theory of Brownian motion.Phys. Rev. 36 , 823–
841.

Valiant, L. G. (1984). A theory of the learnable.
Communications of the Association for Comput-
ing Machinery 27 , 1134–1142.

Vapnik, V. N. (1982).Estimation of dependences based on empirical data. Springer. Vapnik, V. N. (1995).The nature of statistical learning theory. Springer. Vapnik, V. N. (1998).Statistical learning theory.Wi- ley. Veropoulos, K., C. Campbell, and N. Cristianini (1999). Controlling the sensitivity of support vector machines. InProceedings of the Interna- tional Joint Conference on Artificial Intelligence (IJCAI99), Workshop ML3, pp. 55–60. Vidakovic, B. (1999). Statistical Modelling by Wavelets. Wiley. Viola, P. and M. Jones (2004). Robust real-time face detection.International Journal of Computer Vi- sion 57 (2), 137–154. Viterbi, A. J. (1967). Error bounds for convolu- tional codes and an asymptotically optimum de- coding algorithm.IEEE Transactions on Infor- mation Theory IT-13, 260–267. Viterbi, A. J. and J. K. Omura (1979).Principles of Digital Communication and Coding. McGraw- Hill. Wahba, G. (1975). A comparison of GCV and GML for choosing the smoothing parameter in the gen- eralized spline smoothing problem.Numerical Mathematics 24 , 383–393. Wainwright, M. J., T. S. Jaakkola, and A. S. Willsky (2005). A new class of upper bounds on the log partition function.IEEE Transactions on Infor- mation Theory 51 , 2313–2335. Walker, A. M. (1969). On the asymptotic behaviour of posterior distributions.Journal of the Royal Statistical Society, B 31 (1), 80–88. Walker, S. G., P. Damien, P. W. Laud, and A. F. M. Smith (1999). Bayesian nonparametric inference for random distributions and related functions (with discussion).Journal of the Royal Statisti- cal Society, B 61 (3), 485–527. Watson, G. S. (1964). Smooth regression analysis. Sankhya: The Indian Journal of Statistics. Series ̄ A 26 , 359–372.

Pattern Recognition and Machine Learning

REFERENCES 727

Get our desktop app

Company

Features

Documentation

Resources