1. Deep learning: an overview and main paradigms
- Author
-
Vladimir Golovko
- Subjects
0209 industrial biotechnology ,General Computer Science ,Computer science ,Computer Science::Neural and Evolutionary Computation ,02 engineering and technology ,Machine learning ,computer.software_genre ,Deep belief network ,020901 industrial engineering & automation ,0202 electrical engineering, electronic engineering, information engineering ,Electrical and Electronic Engineering ,Quantitative Biology::Neurons and Cognition ,Artificial neural network ,business.industry ,Time delay neural network ,Deep learning ,Rectifier (neural networks) ,Electronic, Optical and Magnetic Materials ,ComputingMethodologies_PATTERNRECOGNITION ,Multilayer perceptron ,Feedforward neural network ,020201 artificial intelligence & image processing ,Artificial intelligence ,Types of artificial neural networks ,business ,computer - Abstract
In the present paper, we examine and analyze main paradigms of learning of multilayer neural networks starting with a single layer perceptron and ending with deep neural networks, which are considered regarded as a breakthrough in the field of the intelligent data processing. The baselessness of some ideas about the capacity of multilayer neural networks is shown and transition to deep neural networks is justified. We discuss the principal learning models of deep neural networks based on the restricted Boltzmann machine (RBM), an autoassociative approach and a stochastic gradient method with a Rectified Linear Unit (ReLU) activation function of neural elements.
- Published
- 2017