Deep learning has created a resurgence of interest in neural networks and their application to everything from Internet search to self-driving cars. Results published in the scientific and technical literature show better-than-human accuracy on real-world tasks that include speech and facial recognition.
Fueled by modern massively parallel computing technology, it is now possible to train very complex multi-layer neural network architectures on large data sets to an acceptable degree of accuracy. This is referred to as deep-learning, as these multi-layer networks interpose many neuronal processing layers between the input data and the output results calculated by the neural network — hence the use of the word deep in the deep-learningcatchphrase. The resulting trained networks can be extremely valuable, as they have the ability to perform complex, real-world pattern recognition tasks very quickly on a variety of low-power devices including sensors, mobile phones, and FPGAs, as well as quickly and economically in the data center.
Generic applicability, high accuracy (sometimes better than human), and ability to be deployed nearly everywhere explains why scientists, technologists, entrepreneurs and companies are all scrambling to take advantage of deep-learning technology.
Machine learning went through a similar bandwagon stage in the 1980s where superlatives were lauded on the technology and futurists discussed how machine learning was going to change the world. The genesis of the 1980s machine- earning revolution was a seminal paper by Hopfield and Tank, “Neural Computation of Decisions in Optimization Problems,” which showed that good solutions to a wide class of combinatorial optimization problems could be found using networks of biology-inspired neurons. In particular, Hopfield and Tank demonstrated they could find good solutions to intractably large versions of the NP-Complete traveling salesman problem.
The advent of backpropagation by Rummelhart, Hinton and Williams allowed the adjustment of weights in a ‘neurone-like’ network so the network could be trained to solve a computational problem from example data. In particular, the ability of neural networks to adjust their weights to learn all the logic functions required to build a general purpose computer — including the non-linear XOR truth table — showed that artificial neural networks (ANNs) are computationally universal devices that can, in theory, be trained to solve any computable problem. I like to joke that machine learning made me one of the hardest working lazy men you would ever meet, as I was willing to work very hard to make the computer teach itself to solve a complex problem.
Nettalk, a beautiful example by Terry Sejnowski and Charles Rosenberg, showed that it was possible to teach a neural network to perform tasks at a human-level of complexity — specifically to read English text aloud. Even grade-school children immediately grasp the implications of machine learning through the NetTalk example, as people can literally hear the ANN learn to read aloud. Further, it is easy to show that the ANN had ‘learned’ a general solution to the problem of reading aloud, as it could correctly pronounce words that it had never seen before. I use NetTalk as a stellar example of how scientists can create simple and intuitively obvious examples to communicate their research to anyone.
The bandwagon faded for ANNs during the mid-1990s as overblown claims and a lull in the development of parallel computing exceeded both the patience of funding agencies and limited the size and complexity of the problems that could be addressed. Neural networks faded from the scientific limelight, while research continued to both expand and mature the technology. Still, examples such as Star Trek’s Commander Data preserved the popular perception of the potential of neural network technology.
The development of low-cost massively parallel devices like GPUs sparked a resurgence in the popularity of neural network research. Instead of spending $30M to purchase a 60 GF/s (billion flop/s) Connection Machine, modern researchers can now purchase a TF/s (trillion flop/s) capable GPU for around a hundred dollars. The parallel mapping pioneered by Farber on the Connection Machine at Los Alamos allows the computationally expensive training step to very efficiently map any SIMD architecture, be it a GPU or the vector architecture of an Intel Xeon or Intel Xeon Phi processor. Near-linear scalability in a distributed computing environment means that most computational clusters can achieve very efficient, near-peak performance during the training phase. For example, the 1980s mapping used on a Connection Machine was able to achieve over 13 PF/s (1015 flop/s) average sustained performance on the OakRidge National Laboratory Titan supercomputer. The ability to run efficiently on large numbers of either vector or GPU devices means that researchers can work with complex neural networks and large data sets to solve problems — sometimes as well or better than humans.
“The ability to run efficiently on large numbers of either vector or GPU devices means that researchers can work with complex neural networks and large data sets to solve problems — sometimes as well or better than humans.”
Convolutional neural networks (CNNs) are a form of ‘deep’ neural network architecture popularized by Yann LeCun and others in 1998. CNNs are behind many of the deep-learning successes that have been reported recently in image and speech recognition. Again inspired by biology, these neural networks find features in the data that permit correct classification or recognition of the training images without the help of a human. The lack of dependence on prior knowledge and human effort is considered a major advantage of CNNs over other approaches.
The Latest on: Deep Learning
via Google News
The Latest on: Deep Learning
- Neural architecture search automates the development of deep learning-based models for cancer researchon November 18, 2019 at 12:58 pm
Argonne researchers have created a neural architecture search that automates the development of deep-learning-based predictive models for cancer data. While increasing swaths of collected data and ...
- Unifying machine learning and quantum chemistry with a deep neural network for molecular wavefunctionson November 15, 2019 at 2:22 am
Here we present a deep learning framework for the prediction of the quantum mechanical wavefunction in a local basis of atomic orbitals from which all other ground-state properties can be derived.
- Deep learning segmentation of major vessels in X-ray coronary angiographyon November 15, 2019 at 2:22 am
In the present study, we proposed a robust method for major vessel segmentation using deep learning models with fully convolutional networks. When angiographic images of 3302 diseased major vessels ...
- Deep Learning Market 2019 Drivers, Opportunities, Trends, and Forecast by 2025 – MRE Research Reporton November 14, 2019 at 11:01 pm
Nov 15, 2019 (Heraldkeepers) -- New York, November 15, 2019: The Deep Learning Market is segmented on the Basis of End-User Type, Application Type, Solution Type and Regional Analysis. By End-User ...
- Going deeper: Moveworks snags $75M in new money to bolster deep learning in enterpriseon November 14, 2019 at 4:30 pm
Unlike some applications of BERT or Transformer, which "fine tune" the system, a relatively simpler task, Moveworks is "pre-training" the BERT model, which means developing the initial corpus of text ...
- Amazon Saw 15-Fold Jump In Forecast Accuracy With Deep Learning And Other AI Statson November 14, 2019 at 6:04 am
When Amazon switched from traditional machine learning techniques to deep learning in 2015, it saw a 15-fold increase in the accuracy of its forecasts, a leap that has enabled it to roll-out its ...
- Deep Learning Market by Top Players -Advanced Micro Devices, ARM Ltd, Clarifai, Entilic, Google, & Moreon November 13, 2019 at 9:59 pm
Nov 14, 2019 (Hitech News Daily via COMTEX) -- The World Deep Learning Market is bound to growth by 2025, as per the latest report by Big Market Research (BMR). Deep learning is a part of machine ...
- Deep learning expands study of nuclear waste remediationon November 13, 2019 at 3:38 am
A research collaboration between Lawrence Berkeley National Laboratory (Berkeley Lab), Pacific Northwest National Laboratory (PNNL), Brown University, and NVIDIA has achieved exaflop performance on ...
- Deep learning assists in detecting malignant lung cancerson November 12, 2019 at 11:08 am
Radiologists assisted by deep-learning based software were better able to detect malignant lung cancers on chest X-rays, according to research published in the journal Radiology. "The average ...
- Deep Learning on Summit Supercomputer Powers Insights for Nuclear Waste Remediationon November 9, 2019 at 8:57 am
A research collaboration between LBNL, PNNL, Brown University, and NVIDIA has achieved exaflop (half-precision) performance on the Summit supercomputer with a deep learning application used to model ...
via Bing News