Friday, October 24, 2003

12 noon

Redwood Neuroscience Institute

 

Title:  "Energy-based Models for Sparse Overcomplete Representations"

 

Yee Whye Teh

Computer Science, University of California at Berkeley

 

Abstract:

I will present a new way of extending independent components analysis (ICA) to overcomplete representations. In contrast to the usual causal generative extensions of ICA which maintain marginal independence of sources, we define features as deterministic (linear) functions of the inputs. This assumption results in marginal dependencies among the features, but conditional independence of the features given the inputs. By assigning energies to the features a probability distribution over the input states is defined through the Boltzmann distribution. Free parameters of this model are trained using the contrastive divergence objective (Hinton 2002).  When the number of features is equal to the number of input dimensions this energy-based model reduces to noiseless ICA and we show experimentally that the proposed learning algorithm is able to perform blind source separation on speech data. In additional experiments we train overcomplete energy-based models to extract features from various standard data-sets containing speech, natural images, hand-written digits and faces.

 

This is work done in collaboration with Geoff Hinton, Max Welling and Simon Osindero.  A draft of the paper on which this talk is based on can be obtained at:

 

http://www.cs.berkeley.edu/~ywteh/research/ebm/ebmica.ps.gz