Skip to main content

Supervised Classification


Supervised classification is a digital image classification method where the analyst guides the classification process by defining classes of interest and providing representative training samples.
The classifier uses these training samples to learn the spectral signatures of each class and then assigns every pixel in the image to the most appropriate class.

This method relies heavily on prior knowledge of the study area.

How Supervised Classification Works

✔ Step 1: Define Information Classes

These are real-world land-cover classes such as:

  • water

  • forest

  • agriculture

  • urban

  • barren land

✔ Step 2: Select Training Areas

Training areas (also called ROIs—Regions of Interest) are chosen on the image where the analyst is confident about the land-cover type.

✔ Step 3: Extract Spectral Signatures

The classifier calculates:

  • mean

  • variance

  • covariance

  • pixel distribution

for each class across different spectral bands.

✔ Step 4: Apply Decision Rules

The classification algorithm uses statistical rules to assign each pixel to a class.

✔ Step 5: Produce Classified Output

The final output is a thematic map showing land-cover classes.

When to Use Supervised Classification

Use supervised classification when:

  • You have prior knowledge of the landscape.

  • Ground truth or ancillary data is available (GPS points, survey data).

  • You can identify distinct, homogeneous training sites for each class.

  • The objective is to extract specific land-cover categories.

Information Class vs Spectral Class

Understanding the difference between these two is essential:

Information Class

  • Defined by the analyst based on real-world concepts.

  • Examples: village, river, wetland, cropland.

  • Represents semantic categories used for mapping and interpretation.

Spectral Class

  • Group of pixels that are spectrally similar, based on reflectance values.

  • Identified statistically by the software.

  • May not always match real-world categories exactly.

📌 Mapping involves matching spectral classes to information classes.

Supervised Training

Supervised training involves:

  • Manually selecting representative pixel samples

  • Ensuring the samples capture the full spectral variability of each class
    (e.g., different shades of vegetation or soil types)

  • Evaluating spectral signatures using

    • histograms

    • scatter plots

    • spectral profiles

    • separability indices (e.g., Jeffries–Matusita)

✔ Characteristics

  • Analyst-controlled

  • Knowledge-driven

  • Often more accurate

  • Requires skill in selecting high-quality training data

Classification Decision Rules (Supervised)

Decision rules determine how the classifier decides which class a pixel belongs to.

They fall into two broad groups:

Parametric Decision Rules

Parametric classifiers assume pixel values follow a normal (Gaussian) distribution.

These rules rely on statistical measures such as:

  • class mean

  • variance

  • covariance

  • probability density functions

Minimum Distance Classifier

  • Computes Euclidean or Mahalanobis distance between pixel and class mean.

  • Assigns pixel to the closest class mean.

  • Simple and fast but may misclassify overlapping classes.

Maximum Likelihood Classifier (MLC)

  • Most widely used supervised classifier.

  • Considers:

    • class mean

    • variance

    • covariance

    • overall probability distribution

  • Assigns pixel to the class with the highest likelihood of belonging.

  • Requires good training data; performs best when classes are normally distributed.

Nonparametric Decision Rules

Do not assume any specific statistical distribution; useful when pixel distributions are irregular.

Parallelepiped Classifier

  • Creates "boxes" using min–max values for each band.

  • A pixel is assigned to a class if its values fall within the box.

  • Fast, but may leave pixels:

    • unclassified (if no box contains the pixel)

    • ambiguously classified (if pixel falls in more than one box)

Feature Space Classifier

  • Plots pixel values in a multi-dimensional feature space.

  • Uses polygons in the feature space to define classes.

  • More flexible and accurate than parallelepiped.

  • Good for visually evaluating class separability.



Comments

Popular posts from this blog

GIS data continuous discrete ordinal interval ratio

In Geographic Information Systems (GIS) , data is categorized based on its nature (discrete or continuous) and its measurement scale (nominal, ordinal, interval, or ratio). These distinctions influence how the data is collected, analyzed, and visualized. Let's break down these categories with concepts, terminologies, and examples: 1. Discrete Data Discrete data is obtained by counting distinct items or entities. Values are finite and cannot be infinitely subdivided. Characteristics : Represent distinct objects or occurrences. Commonly represented as vector data (points, lines, polygons). Values within a range are whole numbers or categories. Examples : Number of People : Counting individuals on a train or in a hospital. Building Types : Categorizing buildings as residential, commercial, or industrial. Tree Count : Number of trees in a specific area. 2. Continuous Data Continuous data is obtained by measuring phenomena that can take any value within a range...

History of GIS

The history of Geographic Information Systems (GIS) is rooted in early efforts to understand spatial relationships and patterns, long before the advent of digital computers. While modern GIS emerged in the mid-20th century with advances in computing, its conceptual foundations lie in cartography, spatial analysis, and thematic mapping. Early Roots of Spatial Analysis (Pre-1960s) One of the earliest documented applications of spatial analysis dates back to  1832 , when  Charles Picquet , a French geographer and cartographer, produced a cholera mortality map of Paris. In his report  Rapport sur la marche et les effets du cholĂ©ra dans Paris et le dĂ©partement de la Seine , Picquet used graduated color shading to represent cholera deaths per 1,000 inhabitants across 48 districts. This work is widely regarded as an early example of choropleth mapping and thematic cartography applied to epidemiology. A landmark moment in the history of spatial analysis occurred in  1854 , when  John Snow  inv...

Disaster Management

1. Disaster Risk Analysis → Disaster Risk Reduction → Disaster Management Cycle Disaster Risk Analysis is the first step in managing disasters. It involves assessing potential hazards, identifying vulnerable populations, and estimating possible impacts. Once risks are identified, Disaster Risk Reduction (DRR) strategies come into play. DRR aims to reduce risk and enhance resilience through planning, infrastructure development, and policy enforcement. The Disaster Management Cycle then ensures a structured approach by dividing actions into pre-disaster, during-disaster, and post-disaster phases . Example Connection: Imagine a coastal city prone to cyclones: Risk Analysis identifies low-lying areas and weak infrastructure. Risk Reduction includes building seawalls, enforcing strict building codes, and training residents for emergency situations. The Disaster Management Cycle ensures ongoing preparedness, immediate response during a cyclone, and long-term recovery afterw...

Representation of Spatial and Temporal Relationships

Geographical Information System (GIS) is a powerful tool for analyzing and visualizing spatial data. One of the key features of GIS is its ability to represent spatial and temporal relationships between different geographic features. Spatial relationships refer to the physical location of an object or feature in relation to other objects or features, while temporal relationships refer to the sequence or timing of events. Together, these relationships are essential for understanding and analyzing complex spatial and temporal data. Representation of Spatial Relationships in GIS: Spatial relationships in GIS can be represented using a variety of techniques such as distance, proximity, and topology. For example, distance-based relationships can be used to measure the distance between two points, while proximity-based relationships can be used to determine which objects or features are closest to one another. Topology-based relationships can be used to represent the connectivity between dif...

How to find drugs against the Corona. Covid 19

FOR SCIENTISTS (and others interested): How to find drugs against the coronavirus: First clues on how we can beat COVID-19. This shows the many ways we can interfere with its replication cycle by repurposing existing drugs - summarized in today's Science journal. LINK TO ARTICLE:  https://science.sciencemag.org/content/367/6485/1412 .... Vineesh V Assistant Professor of Geography, Directorate of Education, Government of Kerala. https://g.page/vineeshvc