Spreadsheets vs. Databases
Why spreadsheets are wonderful — until they aren't — and the specific moments when a relational database becomes the right tool.
Spreadsheets are one of the most successful tools ever made. They are visual, immediate, and forgiving. Almost everyone who works with data starts there, and for good reason. So why move to a database at all?
The honest answer: for small, personal, one-off work, you often shouldn't. Databases earn their keep when a spreadsheet starts to strain. This page is about recognizing those moments.
Where spreadsheets shine
- You can see everything at once and click any cell.
- Formulas update instantly as you type.
- There is nothing to set up — open it and go.
- They are perfect for quick calculations, small lists, and drafts.
If your data fits comfortably on a screen and only you touch it, a spreadsheet is frequently the right choice. Using a database there would be over-engineering.
Where spreadsheets start to crack
The trouble begins as data grows in size, users, and importance. Watch what happens:
Let us name each crack precisely.
1. Scale
Spreadsheets slow to a crawl — or refuse to open — somewhere in the hundreds of thousands to a few million rows. Databases routinely handle billions.
2. Simultaneous editing
When two people edit the same shared spreadsheet, changes collide, overwrite each other, or force everyone into a single-file queue. Databases are built from the ground up for many users at once.
3. Consistency
In a spreadsheet, the same customer's name might be typed slightly differently in twenty rows: "Acme", "ACME", "Acme Inc." Nothing stops it. A relational database lets you store that customer once and point to it everywhere, so there is a single source of truth.
4. Rules and validation
A spreadsheet cell will happily accept a price of "banana" or a
date of "someday". A database column declared as a number or a
date refuses values that don't fit, catching errors at the
moment they happen.
5. Relationships
This is the big one. Real data is full of connections: customers have orders, orders contain products. Spreadsheets have no real notion of a relationship — you end up copying data between sheets and praying it stays in sync. Relational databases are designed around relationships, which is exactly what the rest of this course is about.
A side-by-side summary
| Question | Spreadsheet | Relational database |
|---|---|---|
| How much data? | Thousands of rows | Millions to billions |
| Many editors at once? | Painful, error-prone | Built for it |
| One source of truth? | Hard to enforce | Core feature |
| Rules on values? | Optional, weak | Enforced by the schema |
| Relationships between datasets? | Manual copying | First-class concept |
| Best for | Quick, small, personal work | Shared, large, long-lived data |
It is not a competition
This is not "spreadsheets bad, databases good." They are different tools for different moments. A skilled data professional uses a spreadsheet to sketch an idea and a database to run the business. Knowing when to reach for each is part of the judgment this course builds.
A glimpse of the relationship problem
Here is the crack that databases handle most beautifully. In a spreadsheet you might repeat the customer's full name on every single order line. In a database you store customers once, orders once, and link them. Run this to see two small linked tables:
Do not worry about the JOIN syntax yet — we build up to it
carefully. The point to absorb now is that "Ada" appears once
in storage, and the database stitches the two tables together on
demand. That single idea is why relational databases conquered the
data world.
Check your understanding
Which situation most strongly suggests reaching for a relational database instead of a spreadsheet?
You need to add up a short column of numbers for a one-time report.
You want to sketch a rough budget that only you will ever see.
Many people must edit shared data that contains millions of rows and connected datasets that must stay consistent.
You need to color-code ten cells for a presentation.
In a spreadsheet, the same customer name is typed as "Acme", "ACME", and "Acme Inc." across many rows. How does a relational database help avoid this?
It automatically corrects spelling mistakes in any cell.
It makes the spreadsheet open faster.
It lets you store each customer once and refer to that single record everywhere, creating one source of truth.
It hides the duplicate rows from view.
How Data Is Stored
A gentle, intuition-first look at how a database actually keeps information — rows, columns, and tables — without any scary internals.
How Applications Use Databases
Where the database sits in a real app, what a "query" is, and why almost every piece of software you use is quietly talking to one.