Introduction to Deep Learning – Eugene Charniak

Your author is a long-time artificial-intelligence researcher whose field of expertise, natural-language processing, has been revolutionized by deep learning. Unfortunately, it took him (me) a long time to catch on to this fact. I can rationalize this since this is the third time neural networks have threatened a revolution but only the first time they have delivered. Nevertheless, I suddenly found myself way behind the times and struggling to catch up. So I did what any self-respecting professor would do, scheduled myself to teach the stuff, started a crash course by surfing the web, and got my students to teach it to me. (This last is not a joke. In particular, the head undergraduate teaching assistant for the course, Siddarth (Sidd) Karramcheti, deserves special mention.)

This explains several prominent features of this book. First, it is short. I am a slow learner. Second, it is very much project driven. Many texts, particularly in computer science, have a constant tension between topic organization and organizing material around specific projects. Splitting thedifference is often a good idea, but I find I learn computer science materialbest by sitting down and writing programs, so my book largely re ects mylearning habits. It was the most convenient way to put it down, and I amhoping many in the expected audience will find it helpful as well.

Which brings up the question of the expected audience. While I hopemany CS practitioners will find this book useful for the same reason I wroteit, as a teacher my first loyalty is to my students, so this book is primarilyintended as a textbook for a course on deep learning. The course I teach atBrown is for both graduate and undergraduates and covers all the materialherein, plus some \culture” lectures (for graduate credit a student must adda significant final project). Both linear algebra and multivariate calculus arerequired. While the actual quantity of linear-algebra material is not thatgreat, students have told me that without it they would have found thinkingabout multilevel networks, and the tensors they require, quite dificult.Multivariate calculus, however, was a much closer call. It appears explicitly only in Chapter 1, when we build up to back-propagation from scratch andI would not be surprised if an extra lecture on partial derivatives would do.Last, there is a probability and statistics prerequisite. This simplifies theexposition and I certainly want to encourage students to take such a course.I also assume a rudimentary knowledge of programming in Python. I do notinclude this in the text, but my course has an extra \lab” on basic Python.That your author was playing catch-up when writing this book alsoexplains the fact that in almost every chapter’s section on further readingyou will find, beyond the usual references to important research papers,many reference to secondary sources |others’ educational writings. I wouldnever have learned this material without them.

Related posts:

Intelligent Projects Using Python - Santanu Pattanayak
Deep Learning with Keras - Antonio Gulli & Sujit Pal
Building Chatbots with Python Using Natural Language Processing and Machine Learning - Sumit Raj
Pro Deep Learning with TensorFlow - Santunu Pattanayak
Deep Learning from Scratch - Building with Python form First Principles - Seth Weidman
Deep Learning with Python - Francois Chollet
Python for Programmers with introductory AI case studies - Paul Deitel & Harvey Deitel
Deep Learning and Neural Networks - Jeff Heaton
Learn Keras for Deep Neural Networks - Jojo Moolayil
Machine Learning with spark and python - Michael Bowles
Python Deeper Insights into Machine Learning - Sebastian Raschka & David Julian & John Hearty
Deep Learning with Python - Francois Cholletf
Artificial Intelligence with an introduction to Machine Learning second edition - Richar E. Neapolit...
Statistical Methods for Machine Learning - Disconver how to Transform data into Knowledge with Pytho...
Superintelligence - Paths, Danges, Strategies - Nick Bostrom
Neural Networks - A visual introduction for beginners - Michael Taylor
Natural Language Processing in action - Hobson Lane & Cole Howard & Hannes Max Hapke
Generative Deep Learning - Teaching Machines to Paint, Write, Compose and Play - David Foster
Neural Networks and Deep Learning - Charu C.Aggarwal
TensorFlow for Deep Learning - Bharath Ramsundar & Reza Bosagh Zadeh
Machine Learning - The art and science of alhorithms that make sense of data - Peter Flach
Hands-on Machine Learning with Scikit-Learn, Keras & TensorFlow - Aurelien Geron
Introducing Data Science - Davy Cielen & Arno D.B.Meysman & Mohamed Ali
Understanding Machine Learning from theory to algorithms - Shai Shalev-Shwartz & Shai Ben-David
Deep Learning for Natural Language Processing - Jason Brownlee
Artificial Intelligence - A Very Short Introduction - Margaret A.Boden
Building Machine Learning Systems with Python - Willi Richert & Luis Pedro Coelho
Machine Learning - A Probabilistic Perspective - Kevin P.Murphy
Introduction to the Math of Neural Networks - Jeff Heaton
Deep Learning in Python - LazyProgrammer
An introduction to neural networks - Kevin Gurney & University of Sheffield
Deep Learning with Applications Using Python - Navin Kumar Manaswi