How Computers Execute Programs

A beginner-friendly mental model of CPU, memory, and the fetch-decode-execute cycle

To program well in any language, you need a mental picture of what the computer is doing while your program runs. The picture doesn't have to be perfect. It just has to be good enough that you can predict what your code will do before pressing Run.

This chapter builds that picture.

A computer has two main parts (for us)

For our purposes, every computer is made of two cooperating parts:

The CPU (Central Processing Unit) is a small, very fast machine that can perform a fixed list of tiny operations: add two numbers, compare two numbers, move data from one place to another, decide whether to jump to a different instruction.
Memory (RAM) is a giant array of numbered storage cells. Each cell holds 8 bits (one byte). Each cell has an address, which is itself just a number — like a house number on an infinitely long street.

That's it. Everything else — variables, functions, files, web browsers, video games — is built on top of these two ideas.

Memory as a row of mailboxes

It helps to picture RAM as a long row of numbered mailboxes:

Address:   1000  1001  1002  1003  1004  1005  1006  1007  ...
Contents:   42   00    00    00    'H'   'i'   '!'   00    ...

Each mailbox holds one byte (a number between 0 and 255, or a character). Larger values like a 32-bit integer occupy several consecutive mailboxes — typically 4 bytes for an int.

When your C program declares a variable int score = 42;, the compiler arranges for four mailboxes somewhere in RAM to hold the number 42. The variable name score is purely a convenience for you, the human. The CPU only knows addresses.

Variables are nicknames

A variable name is a label you (and the compiler) attach to a memory location. The hardware doesn't know the word score. It only knows "the four bytes starting at address 0x7ffeefbff5ac". The compiler is the translator between your friendly names and the CPU's cold addresses.

The fetch-decode-execute cycle

The CPU works in a tight loop, billions of times per second:

Fetch. The CPU has a special register called the program counter (PC) that holds the address of the next instruction. It reads the instruction at that address from memory.
Decode. It looks at the bits of the instruction and figures out what kind it is: add, jump, load, store, etc.
Execute. It actually performs the operation — perhaps updating a register, perhaps writing to memory, perhaps changing the PC to jump elsewhere.

Then it goes back to step 1. That tiny loop is the entire job of a CPU. Everything your computer does — running your operating system, streaming a movie, rendering a 3D game — is some combination of those three steps repeated trillions of times.

What "running a program" actually means

When you double-click a program, here is the (simplified) sequence:

The operating system finds the executable file on disk.
It allocates a chunk of memory for the program — the process.
It copies the program's machine instructions into the code segment of that memory.
It sets up a place for the program's variables: the data segment (for variables known in advance) and the stack (for function-call bookkeeping).
It loads the address of the program's first instruction (often called _start, which then calls main) into the CPU's program counter.
The CPU begins fetching, decoding, and executing instructions from that address.

Don't worry about memorizing these regions yet — we'll come back to the stack and the heap several times in this course. The important thing is the mental picture: your program is a chunk of instructions and data, sitting in memory, being chewed through by the CPU one step at a time.

A worked example, on paper

Let's say the CPU sees these three instructions in a row:

LOAD  R1, [1000]    ; copy the value at address 1000 into register R1
ADD   R1, R1, 10    ; add 10 to R1
STORE [1000], R1    ; copy R1 back into address 1000

Suppose memory at address 1000 currently contains the number 42. Trace it by hand:

Step	What happens	Memory[1000]	R1
0	start	42	?
1	LOAD R1, [1000]	42	42
2	ADD R1, R1, 10	42	52
3	STORE [1000], R1	52	52

That is exactly what your computer does when you write x = x + 10; in C, where x happens to live at address 1000. The C compiler emits those three instructions for you.

Why this matters for programming

When you understand the fetch-decode-execute cycle, several things that confuse beginners become obvious:

Why is order important? Because the CPU executes one instruction at a time, top to bottom (unless told to jump). The order you write statements is the order they happen.
Why do if and while work? Because under the hood they're just "compare two numbers, then maybe jump to a different instruction". A loop is a conditional jump back to the top of the loop body.
Why are variables fast? Because reading from a register or a nearby memory address is something the CPU can do in a few nanoseconds.
Why does running out of memory crash a program? Because the program can't get more mailboxes to store things in.

Programming, then, is just…

Programming is the art of arranging a sequence of small operations in memory such that, when the CPU walks through them, something useful happens at the end.

Every language we use — Python, JavaScript, Rust, C — ultimately exists to make it easier for humans to describe such arrangements. Some hide the machine almost entirely (Python). Some keep it just out of sight (Java). C keeps it right at your fingertips, which is exactly why it's such a great teacher.

QuestionSelect one

In the fetch-decode-execute cycle, what is the role of the program counter?

It counts how many programs are currently running on the computer.

It holds the address of the next instruction the CPU will fetch.

It records how many instructions a program has executed in total.

It stores the result of the most recent arithmetic operation.

QuestionSelect one

When you write int x = 42; in C, which statement is most accurate about what happens at the machine level?

The CPU creates a brand-new memory chip labeled x.

The compiler stores x as the literal text "x" somewhere in memory.

The compiler reserves a small region of memory (typically 4 bytes) to hold the value 42; the name x only exists in the source code.

The number 42 is sent to the CPU directly, not stored in memory.