Your First Code Review -- Catching a Bug
In Lesson 1, you learned PRIMM -- predict, run, investigate -- and practiced on short two-to-four line blocks. In Lesson 2, you built trace tables to track variable state across longer code where values change. Now you combine both tools on real project code -- and this time, the code has a bug.
Here is the scenario. You ask Claude Code to generate a statistics module for SmartNotes -- something that calculates how many words a student has written, their reading speed, and whether their final score passes a threshold. The AI produces fifteen lines of code. The variable names look sensible. The arithmetic looks reasonable. Nothing seems obviously broken. But somewhere in those fifteen lines, one variable is the wrong type. The code will crash the moment it runs -- and no amount of staring at it casually will reveal the problem. Your trace table will.
This lesson puts you in the reviewer's seat. Your job is not to understand the code -- you already know how to do that. Your job is to find what is wrong.
What Is a Code Review?
A code review is reading someone else's code to check for problems. In professional teams, every change is reviewed by at least one other person before it reaches the final product. No code ships without a second pair of eyes.
A code review is like proofreading a classmate's essay -- you are looking for mistakes, not just reading. The difference is that with code, mistakes can crash an entire application rather than just confuse a reader.
This is a lightweight version of the code review process you know from pull requests. The principles are identical -- you are checking for correctness, not just readability -- but the scope here is a single code block rather than a multi-file changeset.
A code review is different from reading for understanding. When you read to understand, you ask "what does this do?" When you review, you ask "what does this do wrong?" The shift is subtle but important. Understanding accepts the code at face value. Reviewing challenges it.
Your toolkit for reviewing is already built:
| Tool | What It Does in a Review |
|---|---|
| PRIMM | You predict what each section should produce, then check if it actually does |
| Trace tables | You track every variable line by line, watching for the moment a value stops making sense |
You do not need new techniques. You need to apply the techniques you have with a reviewer's mindset: assume the code has a problem, and your job is to find it.
The SmartNotes Code
Open Claude Code in your SmartNotes project and type this prompt:
Generate a note statistics calculator for SmartNotes that computes
total words, reading speed, and whether a final score passes a
threshold. Use only variables with type annotations, arithmetic,
and print. About 15 lines. Do not include functions, imports, or
control flow.
The AI will generate a statistics module. It might look slightly different each time, but the pattern will be similar: several variables, some arithmetic, and a few print() calls.
Here is what a typical generation looks like. This is the version we will review together -- it contains a deliberate bug:
# SmartNotes v0.1 -- Note Statistics Calculator
# Calculates basic statistics about a student's notes
note_count: int = 12
words_per_note: int = 150
total_words: int = note_count * words_per_note
study_hours: float = 2.5
words_per_hour: float = total_words / study_hours
passing_grade: int = 70
current_score: int = 85
bonus_points: str = "10"
final_score: int = current_score + bonus_points
is_passing: bool = final_score >= passing_grade
print(f"Total words written: {total_words}")
print(f"Reading speed: {words_per_hour} words/hour")
print(f"Final score: {final_score}")
print(f"Passing: {is_passing}")
Copy this code into your SmartNotes main.py file. Do not run it yet. Your job is to find the bug by reading, not by waiting for the crash.
Pause here. Before reading further, try to find the bug. Read each line, check each type annotation, and ask: does this operation make sense for these types?
Step 1: Predict (Build the Trace Table)
Work through the code line by line, building a trace table as you go. The columns are every variable in the code.
Lines 1-2: Comments. No variables created, nothing to trace.
Line 3: note_count: int = 12
| Line | note_count | words_per_note | total_words | study_hours | words_per_hour | passing_grade | current_score | bonus_points | final_score | is_passing | Output |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 12 | -- | -- | -- | -- | -- | -- | -- | -- | -- | |
Line 4: words_per_note: int = 150
| Line | note_count | words_per_note | total_words | ... |
|---|---|---|---|---|
| 3 | 12 | -- | -- | |
| 4 | 12 | 150 | -- | |
Line 5: total_words: int = note_count * words_per_note
Look at the row above: 12 * 150 = 1800.
| Line | note_count | words_per_note | total_words | ... |
|---|---|---|---|---|
| 5 | 12 | 150 | 1800 | |
Lines 7-8: study_hours becomes 2.5, then words_per_hour becomes 1800 / 2.5 = 720.0.
So far, no problems. Every operation matches its types: int * int produces int, int / float produces float. The trace table confirms the values make sense.
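If you want to confirm those type rules directly, here is a quick sketch using Python's built-in type() function. This is a checking aid, not part of the statistics module itself:

```python
# A small check of the type rules used in the trace so far.
# type(x).__name__ gives the name of a value's type as text.
total_words: int = 12 * 150                  # int * int -> int
print(type(total_words).__name__)            # int
words_per_hour: float = total_words / 2.5    # int / float -> float
print(type(words_per_hour).__name__)         # float
print(total_words, words_per_hour)           # 1800 720.0
```

The printed type names match exactly what the trace table predicts for each row.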
Lines 10-11: passing_grade becomes 70, current_score becomes 85. Both are int. Still fine.
Line 12: bonus_points: str = "10"
Stop. Look at the type annotation: str. Look at the value: "10". The quotes make this a string -- text that happens to contain the characters 1 and 0. It is not the number ten. It is the word "10".
This kind of mistake -- putting quotes around a number -- is extremely common. You are not "bad at coding" if you miss it at first. The quotes are small, and "10" looks a lot like 10 at a glance. That is exactly why trace tables matter: they force you to write down the type, making the quotes impossible to ignore.
Line 13: final_score: int = current_score + bonus_points
This line tries to add current_score (an int with value 85) to bonus_points (a str with value "10"). Python cannot add a number and a string. This line will crash.
In statically typed languages such as Java or C++, this would be a compile-time error -- the compiler would refuse to build the program. Python's type annotations give you the same safety when combined with Pyright, but Python itself does not enforce types at runtime. The crash happens when the code actually executes, not when it is loaded.
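A minimal sketch of that difference. The annotation below is deliberately wrong, and Python still runs the line; only a static checker like Pyright, or a later misuse of the value, reveals the problem:

```python
# Annotations are hints for static checkers, not runtime guards.
# Python executes this line without complaint, even though the
# value contradicts the annotation (Pyright would flag it):
mislabeled: int = "ten"
print(type(mislabeled).__name__)  # str -- the annotation changed nothing

# The crash only happens when the mismatched value is actually used:
# mislabeled + 1   # uncommenting this raises TypeError at this line
```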
Here is the complete trace table up to the crash:
| Line | note_count | words_per_note | total_words | study_hours | words_per_hour | passing_grade | current_score | bonus_points | final_score | is_passing | Output |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 12 | -- | -- | -- | -- | -- | -- | -- | -- | -- | |
| 4 | 12 | 150 | -- | -- | -- | -- | -- | -- | -- | -- | |
| 5 | 12 | 150 | 1800 | -- | -- | -- | -- | -- | -- | -- | |
| 7 | 12 | 150 | 1800 | 2.5 | -- | -- | -- | -- | -- | -- | |
| 8 | 12 | 150 | 1800 | 2.5 | 720.0 | -- | -- | -- | -- | -- | |
| 10 | 12 | 150 | 1800 | 2.5 | 720.0 | 70 | -- | -- | -- | -- | |
| 11 | 12 | 150 | 1800 | 2.5 | 720.0 | 70 | 85 | -- | -- | -- | |
| 12 | 12 | 150 | 1800 | 2.5 | 720.0 | 70 | 85 | "10" | -- | -- | |
| 13 | CRASH |
The trace table makes the problem visible. Row 12 shows bonus_points holding a string. Row 13 tries to use that string in arithmetic. The types do not match.
Step 2: Run
Now run the buggy code. Pyright is a static type checker -- it reads your code without running it and checks whether the types make sense. You installed it in Chapter 31 as part of your discipline stack. Run Pyright first, then Python:
$ uv run pyright main.py
Pyright output:
main.py:13:37: error: Operator "+" not supported for types "int" and "str"
Operand types: "int" and "str" (reportOperatorIssue)
1 error, 0 warnings, 0 informations
Pyright found the same bug you found. It points to line 13 and says the + operator does not work between int and str. Pyright found it without running the code -- it analyzed the types and caught the mismatch statically.
Now run the code with Python to see the runtime error:
$ uv run python main.py
Python output:
Traceback (most recent call last):
File "main.py", line 13, in <module>
final_score: int = current_score + bonus_points
~~~~~~~~~~~~~~^~~~~~~~~~~~~~
TypeError: unsupported operand type(s) for +: 'int' and 'str'
Compare both outputs to your trace table prediction. Line 13, exactly where your table predicted the crash. Python tells you the specific problem: it cannot use + between an int and a str. Pyright told you the same thing without even running the code.
If you caught the bug during the Predict step -- before running anything -- that is exactly the skill this chapter teaches. You found the problem by reading, not by waiting for the crash.
Step 3: Investigate
Why did this happen? Three facts combine to create this bug:
- bonus_points has type annotation str -- the code explicitly says this variable holds text
- The value "10" is in quotes -- quotes make it a string, not a number
- Python does not automatically convert types -- adding int + str is an error, not an automatic conversion
The type annotation and the value agree with each other: str = "10" is a valid string. The bug is that the intent was a number, but the code wrote a string. Someone (or some AI) put quotes around 10 by mistake.
The fix: Change the type and remove the quotes.
# Bug:
bonus_points: str = "10"
# Fix:
bonus_points: int = 10
With this fix, line 13 becomes 85 + 10 = 95, final_score is 95, and is_passing becomes True.
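There is a second possible fix worth knowing about. If the value genuinely had to start as a string -- say it came from user input -- Python's built-in int() converts it explicitly. A sketch (int() is a built-in function call, slightly ahead of this chapter's variables-only toolkit):

```python
bonus_points: str = "10"
# int() converts a string of digits into a number explicitly --
# Python never performs this conversion on its own.
final_score: int = 85 + int(bonus_points)
print(final_score)  # 95
```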
Fixed output:
$ uv run python main.py
Total words written: 1800
Reading speed: 720.0 words/hour
Final score: 95
Passing: True
A Logic Bug That Pyright Misses
Ask Claude Code to generate another SmartNotes snippet:
Generate a 5-line Python code block that calculates the average
note length from total_notes and total_words for SmartNotes. Use
only variables with type annotations, arithmetic, and print.
Do not include functions, imports, or control flow.
The AI will generate something that calculates an average. Here is a version that contains a logic bug -- a mistake that Pyright cannot catch:
# SmartNotes v0.1 -- Average Note Length
total_notes: int = 5
total_words: int = 750
average_length: float = total_notes / total_words
print(f"Average words per note: {average_length}")
Run Pyright on this code:
$ uv run pyright main.py
Pyright output:
0 errors, 0 warnings, 0 informations
Pyright sees nothing wrong. Every type checks out: int / int produces float, and the result is stored in a float variable. The types are perfect.
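A quick sketch of why every type here checks out -- in Python 3, dividing two ints always produces a float, even when the division is exact:

```python
# int / int -> float, even for an exact division
result: float = 750 / 5
print(result)                     # 150.0, not 150
print(type(result).__name__)      # float

# The buggy direction has identical types -- only the value differs,
# which is why Pyright reports nothing
backwards: float = 5 / 750
print(type(backwards).__name__)   # float
```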
Now apply PRIMM. Predict the output.
Trace table:
| Line | total_notes | total_words | average_length | Output |
|---|---|---|---|---|
| 2 | 5 | -- | -- | |
| 3 | 5 | 750 | -- | |
| 4 | 5 | 750 | 0.006666... | |
| 5 | 5 | 750 | 0.006666... | Average words per note: 0.006666666666666667 |
Run the code to confirm:
$ uv run python main.py
Average words per note: 0.006666666666666667
The average words per note is 0.0067? That makes no sense. If a student wrote 750 words across 5 notes, the average should be 150 words per note, not a fraction of a word.
The bug: the division is backwards. The code divides total_notes / total_words (5 / 750) when it should divide total_words / total_notes (750 / 5).
# Bug:
average_length: float = total_notes / total_words # 5 / 750 = 0.0067
# Fix:
average_length: float = total_words / total_notes # 750 / 5 = 150.0
This is a logic bug. The types are all correct -- dividing two integers into a float is perfectly valid Python. Pyright has no way to know that you meant to divide in the other direction. Only a human reader, applying PRIMM and checking whether the result makes sense, catches this.
A logic bug means the code runs fine but gives the wrong answer -- like a calculator that adds when it should subtract. The program does not crash, and no tool flags an error. You catch it by asking "does this answer make sense?" and using your knowledge of the real world. In this case, you know that average words per note should be a reasonable number -- maybe 50, 100, or 200. When the code produces 0.0067, your common sense flags it immediately. Experienced developers sometimes miss logic bugs because they focus on syntax and types rather than asking that question.
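One way to make the "does this answer make sense?" question concrete, using only the comparisons and bool variables from this chapter. The bounds 10 and 1000 below are illustrative guesses about plausible note lengths, not fixed rules:

```python
# Corrected division: 750 words across 5 notes
average_length: float = 750 / 5
# A rough real-world bound: average note length should fall
# somewhere between 10 and 1000 words
is_plausible: bool = 10 <= average_length <= 1000
print(is_plausible)  # True

# The backwards division fails the same sanity check
buggy_average: float = 5 / 750
print(10 <= buggy_average <= 1000)  # False
```

Pyright is happy with both divisions; only the plausibility check distinguishes them.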
Two Kinds of Bugs, Two Kinds of Review
This lesson demonstrated two categories of bugs:
| Bug Type | Example | Pyright Catches It? | Human Catches It? |
|---|---|---|---|
| Type bug | Adding int + str | Yes -- type mismatch detected | Yes -- trace table shows mismatched types |
| Logic bug | Dividing backwards (5 / 750 instead of 750 / 5) | No -- types are correct | Yes -- result does not make sense |
Professional verification uses both:
- Automated tools (Pyright, ruff, pytest) catch mechanical errors -- type mismatches, unused variables, formatting issues. They are fast and tireless.
- Human reading (PRIMM, trace tables) catches logical errors -- wrong formulas, backwards operations, values that do not make sense. They require judgment.
This is the same principle as linting plus code review in professional workflows -- automated tools catch mechanical issues, humans catch intent mismatches. The combination is what makes production code reliable. Neither alone is sufficient.
Exercises
Practice your code review skills on these code blocks. Each one has a bug. Your job: build a trace table, find the bug, classify it (type bug or logic bug), and state the fix.
Exercise 1: Grade Calculator
student_name: str = "Zia"
exam_score: int = 78
homework_score: str = "15"
total: int = exam_score + homework_score
grade_message: str = f"{student_name} scored {total}"
print(grade_message)
Build the trace table. What happens on the total line? Is this a type bug or a logic bug? What is the fix?
Exercise 2: Percentage Error
correct_answers: int = 18
total_questions: int = 20
percentage: float = correct_answers / total_questions
print(f"Score: {percentage}%")
Build the trace table. The code runs without crashing -- but is the output what a teacher would expect? Is this a type bug or a logic bug?
Hint: What does a percentage look like? 0.9 or 90?
Exercise 3: Word Counter
title_words: int = 8
body_words: int = 342
footer_words: int = 12
total_words: int = title_words + body_words
average_section: float = total_words / 3
print(f"Total: {total_words}, Average per section: {average_section}")
Build the trace table. The code runs without crashing and the types are correct. But is total_words actually the total of all words? Look at every addition carefully. Is this a type bug or a logic bug?
Try With AI
Open Claude Code in your SmartNotes project. Try these prompts to practice code review with AI-generated code.
Prompt 1: Generate a Type Bug
Generate a 10-line Python code block for a SmartNotes feature
that uses only variables with type annotations (str, int, float,
bool), arithmetic operators, and print(). Include one deliberate
type mismatch bug where a variable has the wrong type for how
it is used. Do NOT reveal which line has the bug.
Build a trace table. Find the bug. Then tell the AI what you found:
I found the bug on line [N]. The variable [name] is declared
as [type] but used in [operation] which requires [other type].
Is my analysis correct?
What you're learning: You are practicing type-bug detection on unfamiliar code -- the exact scenario you face when reviewing AI-generated output. The AI creates the challenge; you apply PRIMM and trace tables to find it. This is the core code review skill: reading code you did not write and catching what is wrong.
Prompt 2: Generate a Logic Bug
Generate a 10-line Python code block that calculates statistics
for a student's notes. Use only variables with type annotations
(str, int, float, bool), arithmetic, and print(). All types
must be correct -- Pyright should show 0 errors. But include
one logic error where the calculation produces a wrong result
(like dividing in the wrong order or forgetting to include a
variable). Do NOT reveal which line has the logic bug.
Build a trace table. Find the bug. Then tell the AI what you found:
I found the logic bug on line [N]. The code calculates [what it
does] but the result is [wrong value] when it should be [correct
value] because [explanation]. Is my analysis correct?
What you're learning: You are practicing the harder kind of code review -- finding bugs that pass all automated checks. Pyright would say "0 errors" for this code, so the only way to find the problem is human reading: predicting what the values should be and comparing that to what the code actually produces.
Prompt 3: The Boundary Between Human and Machine
Show me a 10-line Python code block using only variables with type
annotations, arithmetic, and print(). Run Pyright on it and tell
me what Pyright reports. Then tell me: are there any logic bugs
that Pyright missed? Explain the difference between what Pyright
catches and what only a human reviewer would catch.
What you're learning: You are seeing the boundary between automated and human verification -- the same boundary that defines professional code review practice. Pyright catches type mismatches mechanically. Human reviewers catch intent mismatches through judgment. Understanding where each tool is strong and where it is blind is the foundation of reliable verification.
Key Takeaways
- A code review means reading code to find problems -- not just understanding. The shift from "what does this do?" to "what does this do wrong?" is the reviewer's mindset.
- PRIMM and trace tables are your code review toolkit. Predict what the code should produce, trace each variable, and compare your expectations to the actual behavior.
- Type bugs crash at runtime -- Pyright catches them before you run. When a variable's type does not match its usage (like adding int + str), Pyright flags it statically. The type annotations you write exist for exactly this purpose.
- Logic bugs pass all type checks -- only human reading catches them. A backwards division or a missing variable produces the wrong answer with all the right types. No automated tool knows your intent.
- Professional verification combines human reading and automated tools. Pyright, ruff, and pytest catch mechanical errors. PRIMM and trace tables catch logical errors. You need both.
Looking Ahead
You can read Python. You can predict what code does line by line, trace variable changes through reassignment, and catch bugs -- both the type mismatches that crash programs and the logic errors that produce wrong answers silently. But Chapter 33 will ask you to write a test -- and a test uses two words you have not seen yet: def and assert. In Lesson 4, you will learn to read test code by recognizing those two vocabulary words. You will not write tests yet. You will read them and predict whether they pass or fail -- applying the same PRIMM method you already know to a new kind of code.