New
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
A practical, project-driven introduction to machine learning and deep learning with Python
Pages
858
Published
2022
A rigorous foundation in Bayesian reasoning, probabilistic models, and modern machine learning methods
Build the mathematical intuition and practical toolkit to understand and apply probabilistic machine learning from first principles to modern deep models.
Probabilistic Machine Learning by Kevin P. Murphy gives you a unified, mathematically grounded treatment of the core ideas behind modern ML systems. Starting from probability theory and Bayesian inference, the book builds through linear models, deep neural networks, latent variable models, and Monte Carlo methods. At 858 pages, it is a serious reference and a teachable text, covering both the theory that explains why methods work and the practical details that matter when you apply them.
Most machine learning books teach you what to run. This one teaches you why it works. Kevin P. Murphy's Probabilistic Machine Learning builds a coherent framework rooted in probability theory and Bayesian reasoning, then uses that framework to derive, explain, and connect virtually every major class of ML algorithm.
The book opens with the mathematical foundations you actually need: probability, statistics, decision theory, and information theory. From there it moves through linear and logistic regression, feedforward and convolutional networks, recurrent models, attention and Transformers, and then into territory that most introductory texts skip: latent variable models, variational inference, normalizing flows, diffusion models, and Markov chain Monte Carlo.
What distinguishes this treatment is coherence. Each new model is introduced within the same probabilistic language, so you are always building on what came before rather than picking up disconnected techniques. A Gaussian mixture model, a VAE, and a Bayesian neural network all speak the same underlying dialect. That coherence pays off when you encounter a real problem and need to choose or design the right model rather than copy-paste a script.
Murphy writes for readers who are willing to engage with the mathematics. Derivations are shown, not hidden. But the exposition is careful and the notation consistent, so the density is earned rather than gratuitous. Worked examples and figures anchor the abstractions throughout.
Whether you are a graduate student building your theoretical foundation, a researcher who wants to stop treating black-box models as magic, or a senior practitioner who needs a reliable reference, this is the book you reach for when you want the real explanation.
Establishes the probabilistic language used throughout the book, covering random variables, common distributions, and the rules of probability that underpin every subsequent model.
Extends probability theory to joint, conditional, and marginal distributions, and introduces Gaussian distributions and their properties in detail.
Develops Bayesian inference from Bayes' rule through prior and posterior distributions, conjugate models, and the key ideas of credible intervals and model comparison.
Derives the two workhorses of supervised learning from the probabilistic framework, showing how maximum likelihood and MAP estimation connect to familiar loss functions.
Covers feedforward networks, backpropagation, regularization, and the practical training decisions that determine whether a network learns, all grounded in the probabilistic view.
Surveys convolutional networks, recurrent networks, attention mechanisms, and Transformers, explaining the architectural choices in terms of the structural assumptions they encode.
Introduces mixture models, PCA, and variational autoencoders, showing how latent variables extend the expressiveness of the model class and how the EM algorithm fits them.
Covers variational inference and the evidence lower bound in enough mathematical detail to implement them, and shows how mean-field VI scales to large models.
Explains rejection sampling, importance sampling, and Markov chain Monte Carlo including Metropolis-Hastings and Hamiltonian Monte Carlo, with guidance on diagnosing convergence.
Surveys normalizing flows, energy-based models, score matching, and diffusion models, connecting each to the probabilistic framework developed earlier in the book.
You should be comfortable with multivariate calculus, linear algebra, and basic probability at the undergraduate level. Murphy includes a concise review of key concepts, but the book is not a first introduction to mathematics.
No. This is a substantially new book written for the 2020s, covering deep learning, Transformers, variational autoencoders, diffusion models, and other topics absent from the 2012 text. Treat it as a successor, not an update.
Murphy has made draft PDFs freely available on his website. The published MIT Press edition is the final, polished version with corrected notation and updated figures.
The main text is mathematically focused rather than code-first. Supplementary notebooks and code examples are available through the author's public GitHub repository linked from his website.
A strong undergraduate with solid mathematics and some ML exposure can work through it, but it is designed for graduate-level readers. If you are new to ML entirely, an introductory course first will make the material much more tractable.
This volume covers foundations. The companion 'Advanced Topics' volume extends the treatment into research-level material such as causal inference, reinforcement learning, and more. Reading this book first is the intended path.
New
A practical, project-driven introduction to machine learning and deep learning with Python
New
An Iterative Process for Production-Ready Machine Learning Applications
by Chip Huyen
New
The definitive textbook on intelligent systems, from foundational search and logic to modern machine learning and probabilistic reasoning
by Peter Norvig, Stuart Russell
New
A Programmer's Guide to Building AI and Machine Learning Models with TensorFlow