A Debugging Guide

Introduction
Terminology
The Mental Game of Debugging
A Debugging Guide
Conclusion
Other Resources

Introduction

"Debugging is twice as hard as writing a program in the first place."

- Brian Kernighan, The Elements of Programming Style

More than programming, debugging requires the mindset of a problem solver. Code can be written sloppily and in an ad-hoc way; debugging must be done methodically and with attention to detail. This guide is about the concrete steps of debugging, but I highly recommend first reading Ryan Chadwick's problem solving tutorial for an introduction to the appropriate attitude and mindset. This guide is heavily inspired by a similar guide by John Regehr.

Terminology

To make things clearer, I will use the following terms for the rest of this guide:

Symptom - The behavior of the program that is incorrect. This could be a null pointer exception, or a ValueError, or simply some output or return value that is incorrect.
Bug - The underlying code that is incorrect, which made lead to zero or more symptoms.
Diagnostic Code - Code that is written not as part of the program, but to help with debugging.
Input - The data that your program operates on. This could be actual typed input from the user, arguments to a function, numbers from a spreadsheet, or even mouse movements on a webpage.
Test Case - Although this usually means some input that we know the correct behavior for and can check against, here I use this term to mean the input that causes the program to exhibit symptoms.
Minimal Test Case - The smallest test case that will cause the same symptom. Alternately, the smallest test case that will expose the same bug (even if the symptoms are different).

It is important to remember here symptoms are not the same as bugs. Your program may be buggy, but could run fine without symptoms on the input you are using. Sometimes, different symptoms under different inputs may ultimately be caused by the same bug. Other times, a symptom may be due to multiple bugs in your code, all of which must be fixed before your program is correct. On occasion, the symptom may lead to the realization that your current approach is completely infeasible, forcing you to rewrite your code from scratch; in that case, there is no single "bug" that's causing your program to fail.

Debugging is process where, given one or more symptoms, you determine the bug(s) that led to it, and (ideally) removing those bugs from the code. In this sense, debugging is like being a detective trying to figure out what happened from evidence left behind, or like a scientist trying to understand some puzzling phenomenon. Lucky for us as programmers, we have the ability to replay what happened by running the program again, and to ask the computer to tell us more about what is happening by adding diagnostic code.

The Mental Game of Debugging

"Would you tell me, please, which way I ought to go from here?"

"That depends a good deal on where you want to get to," said the Cat.

"I don't much care where -" said Alice.

"Then it doesn't matter which way you go," said the Cat.

- Lewis Carroll, Alice in Wonderland

This guide is written for what's known as "print debugging" - that is, I'm assuming that the only thing you have is the source code and the ability to add print statements. Many languages and IDEs have debuggers, which will do the printing for you or offer other ways of understanding what your program is doing. These are highly useful, and I recommend learning to use them in your programming language/environment of choice. Debuggers may not always be available, however, and even with debuggers, the thinking process of debugging is the same, which is why this guide uses print debugging.

A lot of programmers, when they first start writing code, engage in what I call "trial-and-error debugging". That is, when they see that their program doesn't work, they randomly change their code, then run it to see if that fixed the problem. This habit is counter-productive, and the sooner you get rid of it the better. While this strategy might get you through your first (or even second) programming class, it will not work as your program gets more complicated. There are simply too many possible things to change, and without knowing what you're doing, proceeding with guess-and-check will never fix your program - or worse, introduce additional bugs.

Instead, debugging requires having a clear understanding of what your code should be doing and what your code is actually doing. After all, what is a bug if not a mismatch between expectation and reality? As per the Alice in Wonderland quotation, if you don't know what your code is supposed to do, then it doesn't matter what changes you make. The mental game of the debugging process starts here, and the rest of this guide is about the mechanics of figuring out where this mismatch occurred.