Your First Failing Test

James opens his editor. He knows what TDG looks like from Lesson 1: he read the worked example, predicted the output, traced the loop. Now he wants to try it himself. He starts typing the function body.

Emma glances at his screen. "What are you writing?"

James does not look up. "The function. celsius_to_fahrenheit. I know the formula: multiply by nine-fifths, add thirty-two."

Emma raises an eyebrow. "So you are skipping the specification."

"Why would I specify something I already know? That is like writing a purchase order for a pen that is already on my desk." He keeps typing.

Emma does not stop him. "Try it. Run pyright when you are done."

James writes the body, saves the file, and runs pyright. Zero errors. He grins. Then he tries pytest. There are no tests. Pytest has nothing to run.

"Green bar," Emma says. "Except there is no bar. How do you know the formula is correct?"

James pauses. "Because I know the formula."

Emma pulls up three versions on her screen. "You know a formula. Is it celsius * 9 / 5 + 32 or celsius * 9/5 + 32 or (celsius * 9) / 5 + 32? They look the same. Are they?"

James stares at the three versions. Operator precedence: the rules for which math operation Python does first. He is not actually sure they all produce the same result. "Okay, let me think... actually, I think they do in Python because multiplication and division have the same precedence and go left to right. But I am not a hundred percent sure. How long am I going to spend on this instead of actually building something?"

Emma sits back. "And that is why you write the tests first. The tests do not care about your confidence. They check the number."

James deletes the function body. Types the three dots. Opens the test file. "Stub and tests. Then I let the tests prove it."

Emma nods. "Now you are writing the specification, not the implementation."

In Lesson 1, you read a complete TDG cycle and understood the five-step loop. Now you do it yourself, but only the first two steps: Specify and Check types. You will learn three new Python words, write a function stub, write two tests, and see your first failing test. The function body stays empty. That is the point.

Word 1: `return`

In Chapter 45, you learned def (labels a function) and assert (insists something is true). A function is a reusable block of code with a name: you give it an input, it does some work, and it gives you back an output. Think of it like a calculator button. You press "5", it does the math, and shows you "10". The def keyword is how you create one in Python.

Now the third word: return.

return hands a value back from a function. When Python reaches a return statement, it stops the function and sends the value back to wherever the function was called.

Here is the simplest example:

def get_greeting() -> str:
    return "Hello, SmartNotes"

When you call get_greeting(), the function hands back the string "Hello, SmartNotes". You can use that value:

message: str = get_greeting()
print(message)  # prints: Hello, SmartNotes

The function runs, hits return "Hello, SmartNotes", and hands that string back. message (a named container for a value, called a variable) receives it. Then print(message) displays it on screen.

Print is for people. Return is for reuse.

This is the single most common confusion for beginners. print() displays a value on your screen. return hands a value back to the code that called the function. They look similar but do completely different things.

	`print()`	`return`
What it does	Displays text on screen	Hands a value back to the caller
Who sees it	You, the human	The code that called the function
Can a test use it?	No: `assert` cannot check what was printed	Yes: `assert` checks the returned value
Analogy	A cashier reading your total out loud	A cashier handing you the receipt

Why does this matter for TDG? Because assert checks the value a function returns, not what it prints. If your function prints the answer instead of returning it, the test cannot verify anything. Every function you specify with TDG must use return.

If you have never written code before

Think of a function like a calculator button. You push 5, it shows 10. The return is the moment the calculator shows the answer: it hands the result back to you. Without return, the calculator does the math inside but never shows the answer. print is like the calculator reading the number aloud. Helpful for you, but another program cannot hear it.

If you have used other programming languages

return works the same as in JavaScript, Java, C, and most other languages. Python's version has no parentheses required: return x not return(x). If you omit the return statement entirely, Python returns None by default.

Word 2: `-> float`

You saw type annotations in Chapter 45: name: str = "Alice" tells pyright that name is a string. The -> float annotation does the same thing for a function's return value.

def celsius_to_fahrenheit(celsius: float) -> float:

Read this line aloud: "Define a function called celsius_to_fahrenheit that takes a parameter (a value you pass into the function) called celsius of type float and returns a float."

The -> float is the promise. It tells pyright, and anyone reading the code, what type of value this function will hand back. Pyright enforces this: if the function returns a string instead of a float, pyright will report an error.

Annotation	Meaning
`celsius: float`	The input must be a float
`-> float`	The output will be a float
`-> int`	The output will be an integer
`-> str`	The output will be a string
`-> bool`	The output will be a boolean

You already know these four types from Chapter 45. The arrow -> applies them to the function's return value instead of a variable.

Word 3: `...` (the Ellipsis)

The ellipsis (pronounced "eh-LIP-sis") is Python's three-dot symbol: .... It means "intentionally empty." When you write:

def celsius_to_fahrenheit(celsius: float) -> float: ...

You are saying: "This function exists. It takes a float. It returns a float. But I have not written the body yet." The ... is a placeholder. It is the stub.

Why `...` and not `pass`?

Python has another "do nothing" keyword: pass. But for TDG, ... is the right choice. Here is why:

... (ellipsis) = pyright treats the function as a stub. It trusts the type annotations and ignores the missing body. uv run pyright reports 0 errors.
pass = pyright treats the function as real code with no return statement. Since the annotation says -> float but the function returns nothing (None), pyright reports a type error.

The difference matters. You want pyright to pass at Step 2 of the TDG loop so you can focus on one thing at a time: first the types (pyright), then the implementation (Claude Code), then the tests (pytest). Using ... lets you separate those steps cleanly.

Quick Check

You learned three new Python words. Before combining them, make sure each one makes sense:

return hands a value back from a function
-> float promises the function returns a float
... means the body is intentionally empty (a stub)

If any of these is unclear, re-read that section before continuing.

Combining All Three: Your First Stub

Your first time typing code

You are about to type your first Python code. It is normal if this feels slow or uncertain. Copy the code exactly as shown. Every space, every colon, every dot matters. If you prefer, copy-paste the code blocks directly into your files. There is no shame in pasting; professional developers copy-paste constantly. The goal is getting the right code into the right file, not proving you can type it from memory.

Creating new files

In VS Code, right-click the smartnotes folder in the sidebar, select "New File", and type temperature.py. Or ask Claude Code: "Create smartnotes/temperature.py with this content." If you do not see a smartnotes folder, revisit the Chapter 44 project setup.

Your SmartNotes project should have this structure (set up in Chapter 44):

smartnotes-project/
├── smartnotes/
│   ├── __init__.py          ← Already exists from Ch 44 setup
│   └── temperature.py       ← You create this now
├── tests/
│   └── test_temperature.py  ← You create this next
└── pyproject.toml

The __init__.py file is a special marker that tells Python "this folder contains code that other files can load and use." Without it, the from smartnotes.temperature import ... line would fail with a ModuleNotFoundError. You created this file during Chapter 44 setup, so it should already be there.

If you see ModuleNotFoundError

If pytest shows ModuleNotFoundError: No module named 'smartnotes', check two things:

Does smartnotes/__init__.py exist? If not, create an empty file at that path (no content needed; the file itself is what matters).
Are you running pytest from the project root folder (the one containing pyproject.toml)? Run ls pyproject.toml. If it says "no such file," navigate to the correct directory first.

Open your SmartNotes project and create a file called smartnotes/temperature.py:

def celsius_to_fahrenheit(celsius: float) -> float: ...

One line. Read it piece by piece:

Part	Meaning
`def`	Define a function (Chapter 45, Lesson 4)
`celsius_to_fahrenheit`	The function's name: what it does
`(celsius: float)`	It takes one input called `celsius`, which is a float
`-> float`	It promises to return a float
`: ...`	The body is intentionally empty: a stub

Now run the type checker:

$ uv run pyright
0 errors, 0 warnings, 0 informations

Pyright passes. The types are consistent: a float goes in, a float comes out. The stub tells pyright "trust me, the body will be filled in later."

Your First Two Tests

Now create the test file tests/test_temperature.py. In VS Code, right-click the tests folder (not the smartnotes folder) in the sidebar, select "New File", and name it test_temperature.py. Or ask Claude Code: "Create tests/test_temperature.py with this content." Copy the code below exactly into your new file. The first line must match your file path precisely, or Python will not find your function:

from smartnotes.temperature import celsius_to_fahrenheit

def test_freezing_point():
    assert celsius_to_fahrenheit(0.0) == 32.0

def test_boiling_point():
    assert celsius_to_fahrenheit(100.0) == 212.0

These are the two tests that define what your function must do. Let us read them:

Line 1: from smartnotes.temperature import celsius_to_fahrenheit is an import statement. It tells Python: "go to the file smartnotes/temperature.py and bring in the function called celsius_to_fahrenheit so I can use it here." Notice the dots: Python writes smartnotes.temperature (with dots) where your file system uses smartnotes/temperature.py (with slashes). Dots are Python's way of navigating folders. You will use this pattern in every test file.

Test 1: "I insist that converting 0.0 degrees Celsius gives 32.0 degrees Fahrenheit." This is the freezing point of water: 0°C equals 32°F.

Test 2: "I insist that converting 100.0 degrees Celsius gives 212.0 degrees Fahrenheit." This is the boiling point of water: 100°C equals 212°F.

Two tests. Two known facts about temperature conversion. These are your specification.

The RED Moment (Tests Fail, On Purpose)

Now run the tests:

$ uv run pytest tests/test_temperature.py -v

tests/test_temperature.py::test_freezing_point FAILED
tests/test_temperature.py::test_boiling_point FAILED

    def test_freezing_point():
>       assert celsius_to_fahrenheit(0.0) == 32.0
E       assert None == 32.0

2 failed

(The :: in pytest output separates the file name from the test function name. Read test_temperature.py::test_freezing_point as "the test called test_freezing_point inside the file test_temperature.py.")

Both tests fail. The function returned None, Python's word for "nothing." When a function has no return statement (because the body is ...), Python hands back None by default. Think of it like a calculator that does the math inside but shows a blank screen. The assert insists the result should be 32.0, but it got None (nothing). The test fails loudly.

This is correct. The failing test is not a mistake. It is the starting point. In test-driven development, this is called RED: the tests fail because the implementation does not exist yet. Your specification is complete. The tests define what "correct" means. The implementation is AI's job.

Here is where you stand:

Tool	Result	Meaning
`uv run pyright`	0 errors	Types are consistent: the stub is valid
`uv run pytest`	2 FAILED	No implementation yet: tests cannot pass

Two tools, two different jobs. Pyright checks the shape (types are consistent). Pytest checks the behavior (the function does what the tests demand). Right now, the shape is right but the behavior is missing. That is exactly where you want to be before asking AI to generate.

Quick Check

Before continuing, make sure you can answer these:

What value does a function return when its body is ...?
Which tool passes (pyright or pytest) when the body is ...?
What is the RED moment?

Answers: None (Python's word for "nothing"), pyright passes (it trusts the type annotations), and RED means the tests fail because the implementation is missing, the correct starting point for TDG.

Terminal commands in this chapter

You will use these commands throughout this chapter. All start with uv run because uv manages your project's Python environment (Chapter 44):

Command	What it does
`uv run pyright`	Checks types: are the promises consistent?
`uv run pytest tests/test_temperature.py -v`	Runs your tests. `-v` means "verbose," showing each test name and result
`uv run python -c "print(...)"`	Runs a quick Python calculation directly in your terminal

What You Just Wrote

Six lines total. One stub. Two tests. One import. That is your complete specification. In Lesson 3, you will prompt Claude Code to fill in the ... with a real implementation, run the tests again, and see GREEN.

Try With AI

Open Claude Code in your SmartNotes project and try these prompts.

Prompt 1: Explain the Three Words

Explain these three Python concepts in one sentence each,
as if I am a complete beginner:
1. The return keyword
2. The -> float annotation
3. The ... (ellipsis) as a function body

Compare AI's explanations to what you just learned. Are they consistent? If AI uses jargon you do not recognize, ask it to simplify.

What you're learning: You are verifying AI explanations against your own understanding, the same critical reading you will apply to AI-generated code.

Prompt 2: Write a Different Stub

Write a function stub (using ... for the body) for a function
called word_count that takes a text parameter of type str and
returns an int. Include a -> int return type annotation.
Do not write the implementation.

Read the stub AI generates. Does it match the pattern you just learned? Check: does it have def, a parameter with a type, -> int, and ...?

What you're learning: You are applying the stub pattern to a different function, testing whether you understand the structure, not just the specific example.

Prompt 3: Generate Test Cases

I have this function stub:
def word_count(text: str) -> int: ...

Write two test functions using assert that define what
word_count should do. Use simple, obvious test cases.

Read the tests AI generates. Predict: do the test values make sense? For example, if the test says word_count("hello world") == 2, does that match your understanding of "word count"? This is specification review: you are checking whether the tests are correct, not the implementation.

What you're learning: You are reviewing AI-generated specifications (tests), a skill you will need when AI suggests test cases for your functions.

PRIMM-AI+ Practice: Writing Specifications

Predict [AI-FREE]

Press Shift+Tab to enter Plan Mode before predicting.

Read this stub and tests without running them:

def inches_to_cm(inches: float) -> float: ...

def test_one_inch():
    assert inches_to_cm(1.0) == 2.54

def test_one_foot():
    assert inches_to_cm(12.0) == 30.48

Predict: if you run uv run pyright, will it report errors? If you run uv run pytest, will the tests pass or fail? Write both predictions and a confidence score from 1-5.

Check your prediction

Pyright: 0 errors. The stub uses ..., so pyright treats it as a valid stub. The types are consistent: float in, float out.

Pytest: 2 FAILED. The body is ..., so the function returns None. Both assertions check None against a float value. Both fail.

If you predicted both correctly with confidence 4-5, you understand the pyright/pytest split, the key insight of this lesson.

Run

Press Shift+Tab to exit Plan Mode. Create the stub and tests in your SmartNotes project. Run uv run pyright and uv run pytest tests/test_inches.py -v. Compare to your predictions.

Investigate

The error says assert None == 2.54. Explain in one sentence why the function returns None.

Error Taxonomy (from Chapter 43): This is not a type, logic, or specification error. The function body is ..., so Python returns None by default. Chapter 43 introduced five error categories: type, logic, specification, data/edge-case, and orchestration errors. At this stage, the first three are most common in code. This case does not fit neatly into any category because there is no implementation at all. It is the expected state at Step 2 of TDG: the specification is complete, and the implementation is missing.

Modify

Change test_one_foot to check 6 inches instead. Calculate 6.0 x 2.54 by hand, update the assertion. Predict: will pyright still pass? Will pytest still fail?

Make [Mastery Gate]

Write your own stub and two tests for a function of your choice. Some ideas:

kg_to_pounds(kg: float) -> float (1 kg = 2.205 pounds)
minutes_to_hours(minutes: int) -> float (divide by 60)
area_of_rectangle(width: float, height: float) -> float (width x height)

Your specification must:

Have a stub with ... and a return type annotation
Have two tests with known input-output pairs
Pass uv run pyright with 0 errors
Fail uv run pytest (RED, because the body is still ...)

If all four conditions are met, you have written a complete TDG specification. You are ready for Lesson 3.

Common Mistakes

Mistake	What Goes Wrong	How to Avoid It
Forgetting the `import` line in the test file	Pytest reports `NameError: name 'celsius_to_fahrenheit' is not defined`	Every test file starts with `from smartnotes.<module> import <function>`
Writing tests without creating the stub first	Pyright reports errors because the function does not exist yet	Create the stub file first, then write tests that import from it
Panicking at RED	The failing test feels like a mistake, but RED is the correct starting point	RED means your specification is complete and the implementation is missing; exactly right

James stares at the two red failures on his screen. The stub. The tests. The intentionally empty body. He had wanted to write the formula himself ten minutes ago. Now the red feels like progress.

"It is strange," he says. "The failing tests feel like I accomplished something."

Emma's voice comes from across the room. "That is because you did. You wrote a contract that a machine can verify. That is harder than writing the formula."

Word 1: return​

Print is for people. Return is for reuse.​

Word 2: -> float​

Word 3: ... (the Ellipsis)​

Why ... and not pass?​

Combining All Three: Your First Stub​

Your First Two Tests​

The RED Moment (Tests Fail, On Purpose)​

What You Just Wrote​

Try With AI​

Prompt 1: Explain the Three Words​

Prompt 2: Write a Different Stub​

Prompt 3: Generate Test Cases​

PRIMM-AI+ Practice: Writing Specifications​

Predict [AI-FREE]​

Run​

Investigate​

Modify​

Make [Mastery Gate]​

Common Mistakes​