Mathematics of Deep Learning: An Introduction to Foundational Mathematics of Neural Nets

Leonid Berlyand; Pierre-Emmanuel Jabin

Book

Mathematics of Deep Learning

An Introduction to Foundational Mathematics of Neural Nets

Leonid Berlyand and Pierre-Emmanuel Jabin

Language: English

Published/Copyright: 2026

Published by

Become an author with De Gruyter Brill

Explore this Subject How to publish with us

This book is in the series

De Gruyter Textbook

About this book

This course aims at providing a mathematical perspective to some key elements of the so-called deep neural networks (DNNs). Much of the interest on deep learning has focused on the implementation of DNN-based algorithms. Our hope is that this compact textbook will offer a complementary point of view that emphasizes the underlying mathematical ideas. We believe that a more foundational perspective will help to answer important questions that have only received empirical answers so far.

Our goal is to introduce basic concepts from deep learning in a rigorous mathematical fashion, e.g. introduce mathematical definitions of deep neural networks (DNNs), loss functions, the backpropagation algorithm, etc.

We attempt to identify for each concept the simplest setting that minimizes technicalities but still contains the key mathematics.

The book focuses on deep learning techniques and introduces them almost immediately. Other techniques such as regression and SVM are briefly introduced and used as a steppingstone for explaining basic ideas of deep learning.

Throughout these notes, the rigorous definitions and statements are supplemented by heuristic explanations and figures. The book is organized so that each chapter introduces a key concept. When teaching this course, some chapters could be presented as a part of a single lecture whereas the others have more material and would take several lectures.

Easily accessible for students with no prior knowledge of deep learning and with minimal background in linear algebra and calculus.

Focuses on the foundational mathematics of deep learning.

New chapter on kernel methods.

Additional examples.

Author / Editor information

Leonid Berland received his Ph. D. in 1985 from Kharkiv University (Ukraine). He joined the Pennsylvania State University (PSU) in 1991, and he is currently a Professor of Mathematics and a member of the Materials Research Institute at PSU. He is a founding co-director of PSU Centers for Interdisciplinary Mathematics and for Mathematics of Living and Mimetic Matter. He is known for his works at the interface between mathematics and other disciplines such as physics, materials sciences, life sciences, and most recently, computer science. He co-authored three books and more than 100 publications. His interdisciplinary works received research awards from leading research agencies in the USA, such as NSF, the US Department of Energy, and the National Institute of Health as well as internationally (Bi-National Science Foundation and NATO). Most recently his work was recognized with the Humboldt Research Award of 2021. His teaching excellence was recognized by C.I. Noll Award for Excellence in Teaching by Eberly College of Science at Penn State.

Pierre-Emmanuel Jabin is currently a distinguished professor at the Pennsylvania State University since August 2020. He was a student of École Normale Supérieure from 1995 to 1999; he earned his Ph.D. in 2000 and his HRD in 2003 both at Université Pierre et Marie Curie (Paris VI). He was more recently a professor at the University of Maryland from 2011 to 2020, where he was also director of the Center for Scientific Computation and Mathematical Modeling from 2016 to 2020. Jabin‘s work in applied mathematics is internationally recognized and he has made seminal contributions to the theory and applications of many-particle/multi-agent systems together with advection and transport phenomena. Jabin was an invited speaker at the International Congress of Mathematicians in Rio de Janeiro in 2018.

Topics

Publicly Available

Frontmatter
I

Download PDF
Publicly Available

Contents
V

Download PDF
Requires Authentication Unlicensed

Licensed

1 About this book
1
Requires Authentication Unlicensed

Licensed

2 Introduction to machine learning: what and why?
3
Requires Authentication Unlicensed

Licensed

3 Classification problem
5
Requires Authentication Unlicensed

Licensed

4 The fundamentals of artificial neural networks (ANNs)
7
Requires Authentication Unlicensed

Licensed

5 Supervised, unsupervised, and semi-supervised learning
22
Requires Authentication Unlicensed

Licensed

6 The regression problem
27
Requires Authentication Unlicensed

Licensed

7 Support vector machine
43
Requires Authentication Unlicensed

Licensed

8 Kernel methods
54
Requires Authentication Unlicensed

Licensed

9 Gradient descent method in the training of DNNs
71
Requires Authentication Unlicensed

Licensed

10 Backpropagation
86
Requires Authentication Unlicensed

Licensed

11 Convolutional neural networks (CNNs)
112
Requires Authentication Unlicensed

Licensed

A Review of the chain rule
143
Requires Authentication Unlicensed

Licensed

Bibliography
Requires Authentication Unlicensed

Licensed

Index
149

Publishing information

Pages and Images/Illustrations in book

eBook published on:

February 2, 2026

eBook ISBN:

9783112218211

Paperback published on:

February 2, 2026

Paperback ISBN:

9783119144117

Edition:

2nd revised and extended edition

Pages and Images/Illustrations in book

Front matter:

8

Main content:

150

Illustrations:

55

Tables:

1

https://doi.org/10.1515/9783112218211

eBook ISBN: 9783112218211

Paperback ISBN: 9783119144117

Keywords for this book

Deep Learning; Machine Learning; Artificial Neural Networks; Deep Neural Networks; Kernel methods

Audience(s) for this book

Undergraduates, Postgraduates, Teachers and Instructors, Professionals and Practitioners

Safety & product resources

Manufacturer information:
Walter de Gruyter GmbH
Genthiner Straße 13
10785 Berlin
productsafety@degruyterbrill.com

Mathematics of Deep Learning

Overview

About this book

Author / Editor information

Topics

Table of contents

Frontmatter

Contents

1 About this book

2 Introduction to machine learning: what and why?

3 Classification problem

4 The fundamentals of artificial neural networks (ANNs)

5 Supervised, unsupervised, and semi-supervised learning

6 The regression problem

7 Support vector machine

8 Kernel methods

9 Gradient descent method in the training of DNNs

10 Backpropagation

11 Convolutional neural networks (CNNs)

A Review of the chain rule

Bibliography

Index

Bibliographic data