Extreme learning machines and deep learning are slightly related, but advocate quite adversary concepts.
ELMs are neural nets with a single hidden layer, where the first weight matrix is initialized randomly. This allows the output matrix to be estimated via least squares, which is very quickly done.
Deep learning, on the other hand, is the learning of deep architectures (e.g. deep neural nets). Depending on the strategy, all the layers are optimized jointly or greedily.
Long story short. ELM says: "only learn the last layer". Deep learning says: "Learn all the layers." It seems that DL is much more successfull than ELMs.
Best Answer
Autoencoder is basically a technique to find fundamental features representing the input images. A simple autoencoder will have 1 hidden layer between the input and output, wheras a deep autoencoder will have multiple hidden layers (the number of hidden layer depends on your configuration). Refering the first figure below (deep autoencoder), the demsion of the input image (e.g. size=100x100 pixels) is reduced to 2000,1000,500,30(e.g. size = 10x3) respectively; this part is called encoder. Then, the reduced coder layer is reconstruced back to the original image; this part is called the decoder. Typically, you may use the features produced in any layer of the encoder (i.e. 2000,1000,500, or 30) for building a classifier (e.g. softmax classifier). And this classifier performance is generally better than a classifier using original features without any decoding.
You can imagine each hidden layer represent some form of fundamental features that construct the next layer of features. (See below figure; the first layer learn the colour formation; the second layer learns the edges; the third layer learn different parts of face; the fourth layer learn combination of parts to represent face)
References:
Jones, N. (2014). Computer science: The learning machines. Nature, 505(7482), 146-148. doi:10.1038/505146a
Hinton, G., & Salakhutdinov, R. (2006). Reducing the dimensionality of data with neural networks. Science, 313(5786), 504--507.
A good tutorial learning deep learning: http://ufldl.stanford.edu/wiki/index.php/UFLDL_Tutorial
A github for sample solutions for the UFLDL tutorial: https://github.com/johnny5550822/Ho-UFLDL-tutorial