Neural Networks and Deep Learning – Charu C.Aggarwal

Neural networks were developed to simulate the human nervous system for machine learning tasks by treating the computational units in a learning model in a manner similar to human neurons. The grand vision of neural networks is to create artificial intelligence by building machines whose architecture simulates the computations in the human nervous system. This is obviously not a simple task because the computational power of the fastest computer today is a minuscule fraction of the computational power of a human brain. Neural networks were developed soon after the advent of computers in the fifties and sixties. Rosenblatt’s perceptron algorithm was seen as a fundamental cornerstone of neural networks, which caused an initial excitement about the prospects of artificial intelligence. However, after the initial euphoria, there was a period of disappointment in which the data hungry and computationally intensive nature of neural networks was seen as an impediment to their usability. Eventually, at the turn of the century, greater data availability and increasing computational power lead to increased successes of neural networks, and this area was reborn under the new label of “deep learning.” Although we are still far from the day that artificial intelligence (AI) is close to human performance, there are specific domains like image recognition, self-driving cars, and game playing, where AI has matched or exceeded human performance. It is also hard to predict what AI might be able to do in the future. For example, few computer vision experts would have thought two decades ago that any automated system could ever perform an intuitive task like categorizing an image more accurately than a human.

Neural networks are theoretically capable of learning any mathematical function with sufficient training data, and some variants like recurrent neural networks are known to be Turi ng compl ete. Turing completeness refers to the fact that a neural network can simulate any learning algorithm, given sufficient training data. The sticking point is that the amount of data required to learn even simple tasks is often extraordinarily large, which causes a corresponding increase in training time (if we assume that enough training data is available in the first place). For example, the training time for image recognition, which is a simple task for a human, can be on the order of weeks even on high-performance systems. Furthermore, there are practical issues associated with the stability of neural network training, which are being resolved even today. Nevertheless, given that the speed of computers is expected to increase rapidly over time, and fundamentally more powerful paradigms like quantum computing are on the horizon, the computational issue might not eventually turn out to be quite as critical as imagined.

Related posts:

Neural Networks - A visual introduction for beginners - Michael Taylor
Fundamentals of Deep Learning - Nikhil Bubuma
Python Artificial Intelligence Project for Beginners - Joshua Eckroth
Deep Learning with Hadoop - Dipayan Dev
Deep Learning with Applications Using Python - Navin Kumar Manaswi
Python Deep Learning Cookbook - Indra den Bakker
Deep Learning with Python - Francois Chollet
Grokking Deep Learning - MEAP v10 - Andrew W.Trask
Applied Text Analysis with Python - Benjamin Benfort & Rebecca Bibro & Tony Ojeda
Artificial Intelligence with an introduction to Machine Learning second edition - Richar E. Neapolit...
Understanding Machine Learning from theory to algorithms - Shai Shalev-Shwartz & Shai Ben-David
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Aurelien Geron
Natural Language Processing with Python - Steven Bird & Ewan Klein & Edward Loper
Machine Learning with Python for everyone - Mark E.Fenner
Artificial Intelligence by example - Denis Rothman
Introducing Data Science - Davy Cielen & Arno D.B.Meysman & Mohamed Ali
Artificial Intelligence - A Very Short Introduction - Margaret A.Boden
Hands-on Machine Learning with Scikit-Learn, Keras & TensorFlow - Aurelien Geron
Scikit-learn Cookbook Second Edition over 80 recipes for machine learning - Julian Avila & Trent Hau...
Deep Learning dummies second edition - John Paul Mueller & Luca Massaronf
Practical computer vision applications using Deep Learning with CNNs - Ahmed Fawzy Gad
Machine Learning - The art and science of alhorithms that make sense of data - Peter Flach
Python Data Analytics with Pandas, NumPy and Matplotlib - Fabio Nelli
Deep Learning dummies first edition - John Paul Mueller & Luca Massaron
Foundations of Machine Learning second edition - Mehryar Mohri & Afshin Rostamizadeh & Ameet Talwalk...
Machine Learning Applications Using Python - Cases studies form Healthcare, Retail, and Finance - Pu...
R Deep Learning Essentials - Dr. Joshua F.Wiley
Python for Programmers with introductory AI case studies - Paul Deitel & Harvey Deitel
Learning scikit-learn Machine Learning in Python - Raul Garreta & Guillermo Moncecchi
Python Machine Learning Third Edition - Sebastian Raschka & Vahid Mirjalili
Machine Learning with spark and python - Michael Bowles
Deep Learning for Natural Language Processing - Palash Goyal & Sumit Pandey & Karan Jain