Seminar Computational Intelligence A (708.111)

SS 2012

Institut für Grundlagen der Informationsverarbeitung (708)

Assoc. Prof. Dr. Robert Legenstein

Office hours: by appointment (via e-mail)

E-mail: robert.legenstein@igi.tugraz.at
Homepage: www.igi.tugraz.at/legi/




Location: IGI-seminar room, Inffeldgasse 16b/I, 8010 Graz
Date: starting from March 14th, 2012, every Wednesday, 16:15 - 18:00


Content of the seminar:

This year, Seminar A is devoted to scientific articles on deep learning. Deep learning has emerged in recent years as a very powerful method for training deep neural architectures. It overcomes many problems of traditional neural network approaches and shows excellent performance in practice.
Each student in this seminar can choose a paper of his or her interest and, after preparation and discussion with Robert Legenstein, present it in a seminar talk.
Alternatively, students can also choose to present one of the recently published papers from our institute.


Slides from the first session: Slides

Doodle poll for the talks: Doodle Poll

Update

Resources for deep learning are available at http://deeplearning.net/

A paper by Hinton that discusses practical aspects of working with deep architectures: Practical Guide. It can also be chosen for a seminar talk.

Talks:

Date          Speaker             Paper
Mar 28, 2012  Robert Legenstein   A quick introduction to Boltzmann Machines
Apr 25, 2012  Daniel Markl        Reducing the dimensionality of data with neural networks, Slides
May 23, 2012  Stefan Grabuschnig  Learning Deep Architectures for AI (1), Slides
May 23, 2012  Teresa Klatzer      Learning Deep Architectures for AI (2), Slides
Jun 06, 2012  Florian Hubner      Unsupervised learning of image transformations, Slides
Jun 13, 2012  Markus Eger         The Recurrent Temporal Restricted Boltzmann Machine, Slides
Jun 20, 2012  Gernot Griesbacher  Neural sampling: A model for stochastic computation in recurrent networks of spiking neurons, Slides
Jun 20, 2012  Michael Rath        Probabilistic inference in general graphical models through sampling in stochastic networks of spiking neurons, Slides
Jul 04, 2012  Philipp Singer      Discovering Binary Codes for Documents by Learning Deep Generative Models, Slides



Papers on Deep Learning:

Basic papers:

Hinton, G. E. and Salakhutdinov, R. R.
Reducing the dimensionality of data with neural networks.
Science, Vol. 313, No. 5786, pp. 504-507, 28 July 2006.
[full paper] [supporting online material (pdf)] [Matlab code]
The Science paper that made deep networks popular.

Hinton, G. E., Osindero, S. and Teh, Y.
A fast learning algorithm for deep belief nets
Neural Computation, Vol. 18, pp. 1527-1554, 2006. [pdf]
The basis for deep learning: the contrastive divergence learning algorithm (see the sketch below).
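
Since most of the talks build on restricted Boltzmann machines (RBMs) trained with contrastive divergence, here is a minimal sketch of one CD-1 update as a reference point. It is plain NumPy written for this page, not the authors' code (their Matlab code is linked above); all names and the toy data are illustrative.

    import numpy as np

    rng = np.random.default_rng(0)

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def cd1_update(v0, W, b, c, lr=0.1):
        """One contrastive-divergence (CD-1) step for a binary RBM.
        v0: batch of visible vectors, shape (n, n_vis); W: weights,
        shape (n_vis, n_hid); b, c: visible and hidden biases."""
        # Positive phase: hidden probabilities given the data.
        ph0 = sigmoid(v0 @ W + c)
        h0 = (rng.random(ph0.shape) < ph0).astype(float)
        # Negative phase: one Gibbs step down to the visibles and up again.
        pv1 = sigmoid(h0 @ W.T + b)
        v1 = (rng.random(pv1.shape) < pv1).astype(float)
        ph1 = sigmoid(v1 @ W + c)
        # Gradient estimate: data correlations minus reconstruction correlations.
        n = v0.shape[0]
        W += lr * (v0.T @ ph0 - v1.T @ ph1) / n
        b += lr * (v0 - v1).mean(axis=0)
        c += lr * (ph0 - ph1).mean(axis=0)
        return W, b, c

    # Toy usage on random binary data.
    n_vis, n_hid = 6, 4
    W = 0.01 * rng.standard_normal((n_vis, n_hid))
    b, c = np.zeros(n_vis), np.zeros(n_hid)
    data = (rng.random((20, n_vis)) < 0.5).astype(float)
    for _ in range(100):
        W, b, c = cd1_update(data, W, b, c)

Greedily stacking RBMs trained this way, one layer at a time, is the pretraining strategy of the deep belief net paper above.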

Applications:

Taylor, G. W., Hinton, G. E. and Roweis, S.
Modeling human motion using binary latent variables
Advances in Neural Information Processing Systems 19,
MIT Press, Cambridge, MA, 2007. [pdf]

Memisevic, R. and Hinton, G. E.
Unsupervised learning of image transformations
Computer Vision and Pattern Recognition (CVPR-07), 2007. [pdf]
A longer version: Technical Report UTML TR 2006-005. [pdf]

Salakhutdinov, R. R., Mnih, A. and Hinton, G. E.
Restricted Boltzmann Machines for Collaborative Filtering
International Conference on Machine Learning, Corvallis, Oregon, 2007 [pdf]

Extensions of deep networks:

Sutskever, I., Hinton, G. E. and Taylor, G. W.
The Recurrent Temporal Restricted Boltzmann Machine
Advances in Neural Information Processing Systems 21, MIT Press, Cambridge, MA. [pdf]

Memisevic, R. and Hinton, G. E.
Learning to represent spatial transformations with factored higher-order Boltzmann machines
Neural Computation, Vol. 22, pp. 1473-1492, 2010. [pdf]

Cognitive-Science Applications:

Hinton, G. E. and Salakhutdinov, R.
Discovering Binary Codes for Fast Document Retrieval by Learning Deep Generative Models
Topics in Cognitive Science, Vol. 3, pp. 74-91, 2011. [pdf]


Salakhutdinov, R., Tenenbaum, J. and Torralba, A.
Learning to Learn with Compound Hierarchical-Deep Models
Neural Information Processing Systems (NIPS 25), 2012. [pdf]

Misc:

Salakhutdinov, R. and Hinton, G. E.
An Efficient Learning Procedure for Deep Boltzmann Machines
MIT Technical Report MIT-CSAIL-TR-2010-037, 2010. [pdf]

Bengio, Y.
Learning Deep Architectures for AI
Foundations and Trends in Machine Learning, Vol. 2, No. 1, pp. 1-127, 2009. [pdf]

IGI-Papers:

L. Büsing, J. Bill, B. Nessler, and W. Maass
Neural dynamics as sampling: A model for stochastic computation in recurrent networks of spiking neurons.
PLoS Computational Biology, published 03 Nov 2011. doi:10.1371/journal.pcbi.1002211 (pdf)
This paper shows how Boltzmann machines can be implemented by networks of spiking neurons; a minimal sampling sketch follows at the end of this section.

D. Pecevski, L. Büsing, and W. Maass 
Probabilistic inference in general graphical models through sampling in stochastic networks of spiking neurons.
PLoS Computational Biology, 7(12):e1002294, 2011 (pdf)
Shows how networks of spiking neurons can implement probabilistic inference in graphical models.
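
Both IGI papers rest on the idea that the stochastic dynamics of a network can visit states with the frequencies prescribed by a Boltzmann distribution. As a point of reference for what such sampling means, here is a minimal Gibbs sampler for a tiny Boltzmann machine in plain NumPy. This is an illustration written for this page, not the spiking-network implementation of the papers, which replaces these discrete updates with continuous neuron and synapse dynamics.

    import numpy as np

    rng = np.random.default_rng(1)

    def gibbs_sample(W, b, n_steps=5000):
        """Gibbs sampling from a Boltzmann machine with symmetric weights W
        (zero diagonal) and biases b; returns the visited binary states."""
        n = len(b)
        z = rng.integers(0, 2, n).astype(float)   # random initial state
        samples = []
        for _ in range(n_steps):
            i = rng.integers(n)                   # pick one unit to update
            p_on = 1.0 / (1.0 + np.exp(-(W[i] @ z + b[i])))
            z[i] = float(rng.random() < p_on)
            samples.append(z.copy())
        return np.array(samples)

    # Two units with a positive coupling: states (0,0) and (1,1) dominate.
    W = np.array([[0.0, 2.0], [2.0, 0.0]])
    b = np.array([-1.0, -1.0])
    s = gibbs_sample(W, b)
    print(s.mean(axis=0))   # empirical marginals, close to 0.5 for each unit

In this toy example the states (0,0) and (1,1) together carry most of the probability mass; reproducing such statistical structure with networks of spiking neurons is what the two papers above analyze.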

 




