10of tenWorking Notes on Prompting ยท What the Tool Is Actually Doing

"Step by Step" and the Reasoning Question

A working note on when "think step by step" helps, when it doesn't, and why the difference matters.

"Think step by step" is the one technique in this series that genuinely works, for some tasks. That makes it the most interesting and the most misused: students apply it everywhere, including where it does nothing. The skill isn't the phrase. It's knowing which kind of problem you have.

The common move

The familiar prompt Think step by step and reason through this carefully before answering.

It's become a reflex, appended to almost any prompt as a general-purpose upgrade. It does real work in the right place. The question worth asking: where is the right place?

What you think it does, and what it actually does

What you think it does
  • Always helps. Treated as a universal upgrade for any prompt.
  • Makes the model smarter. As if the phrase adds intelligence.
  • Is always needed. Assumes you must request it to get it.
What it actually does
  • Helps task-by-task. Strong for multi-step logic, math, debugging; near-useless for recall, classification, or creative work.
  • Gives room to work. It lets the model externalize intermediate steps instead of jumping to an answer. Useful only when the task has steps worth showing.
  • Often already happening. Reasoning-capable models do this internally now, so the instruction can be redundant.

"Step by step" isn't intelligence; it's working room. On a problem with intermediate steps, showing the work catches errors that a leap to the answer would hide. On a problem with no steps, there's nothing to show, and the phrase is noise.

The more honest move

Match the technique to the shape of the task. Before adding it, ask one question: does this problem have intermediate steps where an error could hide?

Step-by-step with a reason Work through this in steps and show each one, so I can check where the logic might go wrong: [the multi-step problem].

The difference is that you're invoking it because the task has checkable steps, not out of habit. And when the steps are visible, you get a second benefit: you can inspect the reasoning, not just the answer, which is exactly the verification habit from sheet 2.

Try this

Before adding "step by step," ask: does this task actually have steps?

If yes (logic, math, debugging, multi-part analysis), use it, and read the steps to catch where it goes wrong.

If no (a fact, a simple classification, a piece of creative writing), skip it. You're adding instructions the task can't use.

The principle underneath

Step-by-step isn't a spell that makes the model smarter; it's room to work that only helps when the task has steps worth showing. Match the tool to the problem, not to the prompt.

The thread through these sheets

One habit, ten times

Every sheet in this set has made the same move in a different costume. "Expert in X," "please," "don't hallucinate," adjective stacks, personas, length caps, "step by step," and the quiet habits around them (how you verify, how you iterate, what context you supply): each is a place students hope a phrase will do work that only attention can do.

The fix was never a better phrase. It was the same habit each time: stop asking what magic words to type, and start asking what the model is actually doing. Once you ask that, the right move is usually obvious, and it's rarely the incantation.

That question (what is this actually doing?) is the only technique worth memorizing. Everything else in these sheets is just that question, applied ten ways.