New
Storytelling with Data
A Practical Guide to Communicating Effectively with Data Visualizations and Charts
Pages
432
Published
2013
Using Data Science to Transform Information into Insight β with Excel
Learn the core algorithms of data science by working through real business problems entirely inside Excel, no programming required.
Data Smart teaches you the practical mechanics of data science without requiring a single line of code. Working entirely in Excel, you build clustering models, regression analyses, forecasting systems, and optimization solutions from scratch. Each chapter tackles a concrete business problem and walks you through the math behind the technique before automating it in a spreadsheet. By the end, you understand not just how to click a button, but why the algorithm works.
Most introductions to data science send you straight to Python or R before you understand what the algorithms are actually doing. Data Smart takes a different approach. John W. Foreman starts with Excel, a tool you almost certainly already have, and uses it to expose the inner mechanics of techniques that data scientists use every day.
The logic is deliberate. When you build a k-means clustering model by hand in a spreadsheet, you can watch each centroid move with each iteration. When you run a regression, you can see the residuals cell by cell. That visibility builds intuition that survives the transition to any tool you pick up later.
The book works through real business scenarios from customer segmentation to sales forecasting to network analysis. Each scenario is self-contained. You can follow a single chapter if that technique is the one you need right now, or work through the book front to back to build a complete mental model of the field.
You do not need a statistics degree or programming background. You need Excel, a willingness to read a formula carefully, and a specific business problem you want to solve. The math is present but explained plainly, and every technique lands in a working spreadsheet before the chapter ends.
Data Smart is the book that converts an analyst who knows Excel into an analyst who understands data science. It earns its place on the desk, not just the shelf.
Establishes what data science actually means in a business context and previews the techniques covered in the book. You learn how each method maps to a class of real business problem.
Introduces probability-based classification through naive Bayes, building the model cell by cell in Excel. You classify a real dataset and see exactly how prior and conditional probabilities combine into a prediction.
Walks through the k-means algorithm iteration by iteration so you can watch centroids converge. You segment a customer dataset and learn how to choose the number of clusters.
Builds linear regression from the ground up, covering how coefficients are fitted and how to measure model quality. You apply the technique to a sales forecasting scenario and interpret the results honestly.
Introduces constrained optimization and shows how Excel Solver navigates the solution space. You formulate a real resource-allocation problem and walk through how the algorithm reaches an answer.
Covers time-series decomposition and exponential smoothing methods for predicting future values. You build a working forecast model and learn how to tune the smoothing parameters for your data.
Explains decision trees and the logic that makes ensemble methods like random forests more robust. You trace how individual trees are built and combined, and understand when to trust an ensemble prediction.
Introduces graph theory concepts and applies them to social-network and relationship data. You identify clusters and influential nodes in a network dataset using spreadsheet-compatible techniques.
Reviews the full toolkit and guides you through choosing the right technique for a given business question. You leave with a decision framework for scoping a data project from raw data to actionable output.
No programming is required. Every technique in the book is implemented entirely in Microsoft Excel. You need to be comfortable reading formulas, but no coding background is assumed.
The book was written for Excel 2010 and later on Windows. Most content works in Excel 2013 and 2016 as well. Some Solver-based chapters may behave differently in Excel for Mac or Excel Online.
The core algorithms covered, k-means clustering, regression, naive Bayes, and exponential smoothing, are foundational and have not changed. The Excel mechanics are stable. Readers use this book as an introduction to build intuition before moving to modern tools.
It is a practical guide. The math is explained clearly but the emphasis is always on applying a technique to a real business problem inside a spreadsheet, not on formal proofs.
If you already work comfortably in Python or R and want advanced machine learning techniques, this book is not aimed at you. It is an entry point, not a reference for experienced practitioners.
Companion files were made available through the publisher's website at the time of publication. Check the Wiley product page for the current download link.
New
A Practical Guide to Communicating Effectively with Data Visualizations and Charts
New
Techniques for Thinking Analytically and Solving Real Data Problems
New
A practical guide to the complete data engineering lifecycle, from ingestion to serving
by Joe Reis, Matt Housley
New
A hands-on guide to scalable data analytics using Python and PySpark