Classification & Categorization
Classification is the act of grouping things by shared properties. It is one of the most fundamental cognitive operations: you cannot think clearly about a large number of things until you organize them into meaningful groups. Categorization turns chaos into structure, making it possible to find things, reason about them, and take appropriate action.
What Classification Is
Classification assigns items to groups based on shared characteristics. You observe properties, decide which properties matter, define categories based on those properties, and place each item in the appropriate category.
This sounds formal, but you do it constantly without thinking about it.
Classifying your email:
- Work emails (from colleagues, about projects)
- Personal emails (from friends, family)
- Newsletters (subscriptions, updates)
- Spam (unsolicited, irrelevant)
Classifying expenses:
- Housing (rent, utilities, insurance)
- Food (groceries, restaurants, coffee)
- Transportation (gas, parking, transit pass)
- Entertainment (movies, books, subscriptions)
Every time you sort, organize, or group things, you are classifying.
Everyday Examples
Sorting Laundry
Sorting laundry is classification in its purest form. You examine each item and assign it to a category based on specific properties.
Classification by color:
Whites -> Hot water, bleach safe
Darks -> Cold water, prevent fading
Colors -> Warm water, color-safe detergent
Classification by fabric:
Delicates -> Gentle cycle, mesh bag
Regular -> Normal cycle
Heavy -> Heavy-duty cycle, longer time
Classification by care label:
Machine wash -> Into the machine
Hand wash -> Separate basin
Dry clean -> Set aside for the cleaners
You might use multiple classification schemes simultaneously:
This item is dark + delicate -> cold water, gentle cycle
This item is white + heavy -> hot water, heavy-duty cycle
The key insight is that classification is not neutral. The categories you choose determine what actions you take. Sorting by color produces different groups than sorting by fabric. Both are valid; which is "right" depends on what problem you are solving.
Organizing Files into Folders
Every filing system, physical or digital, is a classification scheme.
By project:
Project Alpha/
contracts/
invoices/
correspondence/
Project Beta/
contracts/
invoices/
correspondence/
By document type:
Contracts/
project-alpha-contract.pdf
project-beta-contract.pdf
Invoices/
project-alpha-invoice-001.pdf
project-beta-invoice-001.pdf
By date:
2025/
Q1/
Q2/
2026/
Q1/
Q2/
Each scheme makes different tasks easier. Organizing by project makes it easy to find everything about one project. Organizing by type makes it easy to find all contracts. Organizing by date makes it easy to find recent items. There is no universally correct scheme, only schemes that serve specific needs.
Grocery Store Layout
A grocery store is a massive classification system designed for one purpose: helping customers find what they need.
Top-level categories:
Produce | Dairy | Meat | Bakery | Frozen | Canned Goods | Beverages
Within Produce:
Fruits | Vegetables | Herbs | Prepared salads
Within Fruits:
Citrus | Berries | Tropical | Stone fruits | Apples and pears
Classification criteria: Similarity of product, storage requirements,
and shopping behavior (people who buy lemons often buy limes).
Triaging in Emergency Rooms
Emergency room triage is life-or-death classification. Patients are classified by severity so that the most critical cases receive care first.
Triage categories:
Red (Immediate): Life-threatening, needs care now
Yellow (Urgent): Serious, can wait a short time
Green (Delayed): Minor injuries, can wait longer
Black (Expectant): Unlikely to survive, comfort care
Properties used for classification:
- Breathing rate
- Pulse
- Level of consciousness
- Nature of injury
Each category determines a specific action protocol.
This is classification with extremely high stakes, and it follows the same principles as sorting laundry: examine properties, assign to categories, and take category-appropriate action.
Music Genres
Music classification by genre helps listeners find what they want, artists reach their audience, and stores organize their inventory.
Top-level: Rock | Pop | Jazz | Classical | Hip-Hop | Country | Electronic
Sub-genres within Rock:
Classic Rock | Punk | Alternative | Metal | Indie | Progressive
Properties used for classification:
- Instrumentation (electric guitar vs synthesizer vs acoustic)
- Tempo (fast vs slow)
- Vocal style (singing vs rapping vs screaming)
- Era (1960s vs 2020s)
- Cultural origin
Fuzzy boundaries:
Is this song pop-rock or rock-pop?
Is this artist country or folk?
Genre classification is imperfect because music exists on spectrums.
Rating & Reviewing
When you rate a restaurant, you are classifying it along multiple dimensions.
Dimensions of restaurant classification:
Cuisine type: Italian, Japanese, Mexican, American
Price range: $, $$, $$$, $$$$
Occasion: Casual, date night, business, family
Quality: 1 star, 2 stars, 3 stars, 4 stars, 5 stars
Dietary: Vegan-friendly, gluten-free options, allergen-aware
A single restaurant might be classified as:
Italian | $$ | Date night | 4 stars | Vegan-friendly
This multi-dimensional classification helps people find exactly
what they need for a specific occasion.
Properties & Criteria
Effective classification requires choosing the right properties to classify by. The same set of items can be classified in completely different ways depending on which properties you prioritize.
Classifying books:
By genre: Fiction, Non-fiction, Science, History, Biography
By format: Hardcover, Paperback, E-book, Audiobook
By audience: Children, Young Adult, Adult, Academic
By length: Short (under 200 pages), Medium, Long (over 500 pages)
By language: English, Spanish, French, Mandarin
By era: Classic, Modern, Contemporary
A library uses genre and audience.
A bookstore uses genre and format.
A shipping company uses format and weight.
A literature professor uses era and genre.
Same items, different classification schemes, different purposes.
Choosing the Right Properties
The best classification scheme depends on the question you are trying to answer.
Question: "What should I wear today?"
Classify clothes by: weather appropriateness, formality, cleanliness
Question: "What fits in this suitcase?"
Classify clothes by: size, wrinkle resistance, versatility
Question: "What needs to be donated?"
Classify clothes by: condition, fit, last time worn
Question: "How should I organize my closet?"
Classify clothes by: type (shirts, pants, jackets), season, frequency
Classification in Technology
Data Types
Every programming language classifies data into types.
Common data types:
Numbers: integers (42), decimals (3.14)
Text: strings ("hello world")
Boolean: true/false
Collections: lists ([1, 2, 3]), dictionaries (key: value pairs)
Dates: timestamps, date objects
Null: absence of value
Why classification matters:
You can add two numbers (42 + 3.14 = 45.14)
You can concatenate two strings ("hello" + " world" = "hello world")
You cannot add a number and a string (42 + "hello" = error)
The data type classification determines what operations are valid.
Taxonomies & Tagging Systems
Digital content management relies heavily on classification.
E-commerce product taxonomy:
Electronics/
Computers/
Laptops/
Gaming Laptops
Business Laptops
Ultrabooks
Desktops/
Gaming Desktops
Workstations
Phones/
Smartphones
Feature Phones
Audio/
Headphones
Speakers
Tags (non-hierarchical classification):
A gaming laptop might be tagged:
[gaming, portable, high-performance, 2026-model, on-sale]
Tags allow items to belong to multiple categories simultaneously,
which rigid hierarchies do not support well.
Hierarchical taxonomies (folders, categories) force each item into one path. Tags allow multiple classifications simultaneously. Most modern systems use both.
Machine Learning Classification
Machine learning classification automates what humans do manually: examine properties and assign categories.
Email spam classification:
Input properties (features):
- Sender reputation
- Subject line keywords
- Number of links in body
- Presence of attachments
- Sender in contact list (yes/no)
- Language patterns
Output categories:
- Spam
- Not spam
The system learns from thousands of examples:
"Emails with these properties tend to be spam."
"Emails with those properties tend to be legitimate."
Then it classifies new emails automatically.
Image classification:
Input: A photograph
Output: What is in it (cat, dog, car, tree, person)
The system learned patterns from millions of labeled images.
It classifies new images by matching patterns it recognizes.
Medical classification:
Input: Patient symptoms, lab results, medical history
Output: Likely diagnosis category
Same principle: learn from classified examples,
apply to new cases.
HTTP Status Codes
Even error messages follow classification schemes.
HTTP Status Code Categories:
2xx - Success:
200 OK (request succeeded)
201 Created (new resource created)
3xx - Redirection:
301 Moved Permanently
304 Not Modified
4xx - Client Error:
400 Bad Request (malformed input)
401 Unauthorized (not authenticated)
403 Forbidden (not authorized)
404 Not Found
5xx - Server Error:
500 Internal Server Error
502 Bad Gateway
503 Service Unavailable
The first digit classifies the type of response.
This classification tells the client how to react
without reading the specific message.
Hierarchical vs Flat Classification
Classification schemes range from flat (one level) to deeply hierarchical.
Flat classification:
Urgent | Important | Normal | Low Priority
Simple. Fast to assign. No ambiguity about structure.
Hierarchical classification:
Living Things/
Animals/
Vertebrates/
Mammals/
Primates/
Humans
Invertebrates/
Plants/
Flowering/
Non-flowering/
Rich. Captures relationships. Can be complex to navigate.
Choose flat classification when speed matters and categories are few. Choose hierarchical when relationships between categories matter and the domain is complex.
Common Pitfalls
Overlapping categories
If categories are not mutually exclusive, items end up in multiple groups and the system becomes confusing. "Is this expense food or entertainment?" happens when dining out could reasonably be either. Define clear rules for boundary cases, or use tags instead of categories.
Too many categories
A classification scheme with 50 categories is unusable. People will not remember the categories, will assign items inconsistently, and will abandon the system. Start with fewer, broader categories and add specificity only where it creates value.
Too few categories
Lumping everything into three categories loses important distinctions. If "Miscellaneous" becomes your largest category, your scheme is not granular enough.
Classifying by the wrong properties
Organizing a tool shed by color is technically a classification, but it is not useful. Color does not help you find the right tool. Organizing by function (cutting tools, fastening tools, measuring tools) serves the actual need.
Treating classification as permanent
Needs change. A filing system that worked for a 5-person company does not work for a 50-person company. Revisit and evolve your classification schemes as the underlying reality changes.
Forcing items into categories they do not fit
Not everything fits neatly. A platypus is a mammal that lays eggs. Some songs span multiple genres. Some expenses straddle two budget categories. Build in a way to handle exceptions rather than forcing everything into rigid boxes.
Key Takeaways
- Classification groups items by shared properties. It is fundamental to organizing information, making decisions, and taking appropriate action.
- You classify constantly: sorting laundry, organizing files, choosing a restaurant, triaging tasks by priority. The skill is already in your toolkit.
- The choice of properties determines the classification. The same items grouped by different properties produce different, equally valid schemes. Choose properties that serve your specific purpose.
- In technology, classification appears in data types, taxonomies, tagging systems, machine learning, and status codes. The principle is identical: examine properties, assign to categories, take category-appropriate action.
- Hierarchical classification captures relationships but adds complexity. Flat classification is simpler but less expressive. Tags offer flexibility by allowing multiple simultaneous classifications.
- Avoid overlapping categories, excessive granularity, wrong properties, and rigid schemes that cannot evolve. The best classification system is one that people actually use consistently.