Skip to main content

Supervised Classification

In the context of Remote Sensing (RS) and Digital Image Processing (DIP), supervised classification is the process where an analyst defines "training sites" (Areas of Interest or ROIs) representing known land cover classes (e.g., Water, Forest, Urban). The computer then uses these training samples to teach an algorithm how to classify the rest of the image pixels.

The algorithms used to classify these pixels are generally divided into two broad categories: Parametric and Nonparametric decision rules.


Parametric Decision Rules

These algorithms assume that the pixel values in the training data follow a specific statistical distribution—almost always the Gaussian (Normal) distribution (the "Bell Curve").

  • Key Concept: They model the data using statistical parameters: the Mean vector ($\mu$) and the Covariance matrix ($\Sigma$).

  • Analogy: Imagine trying to fit a smooth hill over your data points. If a new point lands high up on the hill, it belongs to that class.

Nonparametric Decision Rules

These algorithms make no assumptions about the statistical distribution of the data. They do not care if the data fits a bell curve.

  • Key Concept: They classify based on discrete geometric shapes (polygons, boxes) or the relative position of the data points themselves.

  • Analogy: Imagine drawing a literal box or fence around your data points. If a new point falls inside the fence, it belongs to that class.


A. Minimum-Distance-to-Means (MDM)

  • Classification: Generally considered a simple Parametric classifier (as it relies on the mean parameter), though it operates geometrically.

  • How it works:

    1. The algorithm calculates the spectral mean vector (the center point or centroid) for each training class.

    2. For every unclassified pixel in the image, it calculates the Euclidean distance to the mean of every class.

    3. The pixel is assigned to the class with the shortest distance.


  • Pros: Very fast computationally; mathematically simple.

  • Cons: It is insensitive to the variance (spread) of the data.

    • Example: If "Urban" data is very scattered (high variance) and "Water" is very tight (low variance), a pixel far from the Urban center might actually belong to Urban, but MDM might classify it as Water just because the Water mean is slightly closer geometrically.

B. Parallelepiped Classification

  • Classification: Nonparametric.

  • How it works:

    1. The algorithm looks at the training data and finds the minimum and maximum brightness values for each band.

    2. It creates a rectangular box (a parallelepiped in multi-dimensional space) defined by these limits.

    3. If a pixel's value falls within the box, it is assigned to that class.

  • Pros: Extremely fast; easy to understand conceptually.

  • Cons:

    • The Correlation Problem: Real remote sensing data (like vegetation in Red vs. NIR bands) is often correlated (diagonal distribution). A rectangular box cannot fit a diagonal data cloud efficiently, leading to large "empty corners" in the box that capture noise/wrong pixels.

    • Overlapping: Pixels often fall into the overlapping area of two boxes, leaving the computer unable to decide.

C. Gaussian Maximum Likelihood (GML/MLC)

  • Classification: Parametric (The standard industry workhorse).

  • How it works:

    1. It assumes the data for each class is normally distributed.

    2. It uses both the Mean vector AND the Covariance matrix to calculate the probability density function.

    3. It calculates the statistical probability of a pixel belonging to each class.

    4. It constructs ellipsoidal equiprobability contours (rather than circles or boxes).

  • Pros: Highly accurate because it accounts for the variance (spread) and covariance (correlation/direction) of the data. It handles "diagonal" data clouds perfectly.

  • Cons: Computationally expensive (slow on massive images); requires a large number of training pixels per class to compute a stable covariance matrix (usually $10N$ to $100N$ pixels, where $N$ is the number of bands).


FeatureParallelepipedMinimum DistanceMaximum Likelihood
TypeNonparametricParametric (Simple)Parametric (Advanced)
GeometryRectangular BoxesCircles/SpheresEllipsoids
AssumptionsNone (Min/Max thresholds)Mean Center PointGaussian Distribution
SpeedVery FastFastSlow / Intensive
AccuracyLow to ModerateModerateHigh
Best Used ForQuick looks; Uncorrelated dataWell-separated classesComplex, correlated data


Comments

Popular posts from this blog

Hazard Mapping Spatial Planning Evacuation Planning GIS

Geographic Information Systems (GIS) play a pivotal role in disaster management by providing the tools and frameworks necessary for effective hazard mapping, spatial planning, and evacuation planning. These concepts are integral for understanding disaster risks, preparing for potential hazards, and ensuring that resources are efficiently allocated during and after a disaster. 1. Hazard Mapping: Concept: Hazard mapping involves the process of identifying, assessing, and visually representing the geographical areas that are at risk of certain natural or human-made hazards. Hazard maps display the probability, intensity, and potential impact of specific hazards (e.g., floods, earthquakes, hurricanes, landslides) within a given area. Terminologies: Hazard Zone: An area identified as being vulnerable to a particular hazard (e.g., flood zones, seismic zones). Hazard Risk: The likelihood of a disaster occurring in a specific location, influenced by factors like geography, climate, an...

Supervised Classification

Image Classification in Remote Sensing Image classification in remote sensing involves categorizing pixels in an image into thematic classes to produce a map. This process is essential for land use and land cover mapping, environmental studies, and resource management. The two primary methods for classification are Supervised and Unsupervised Classification . Here's a breakdown of these methods and the key stages of image classification. 1. Types of Classification Supervised Classification In supervised classification, the analyst manually defines classes of interest (known as information classes ), such as "water," "urban," or "vegetation," and identifies training areas —sections of the image that are representative of these classes. Using these training areas, the algorithm learns the spectral characteristics of each class and applies them to classify the entire image. When to Use Supervised Classification:   - You have prior knowledge about the c...

Scope of Disaster Management

Disaster management refers to the systematic approach to managing and mitigating the impacts of disasters, encompassing both natural hazards (e.g., earthquakes, floods, hurricanes) and man-made disasters (e.g., industrial accidents, terrorism, nuclear accidents). Its primary objectives are to minimize potential losses, provide timely assistance to those affected, and facilitate swift and effective recovery. The scope of disaster management is multifaceted, encompassing a series of interconnected activities: preparedness, response, recovery, and mitigation. These activities must be strategically implemented before, during, and after a disaster. Key Concepts, Terminologies, and Examples 1. Awareness: Concept: Fostering public understanding of potential hazards and appropriate responses before, during, and after disasters. This involves disseminating information about risks, safety measures, and recommended actions. Terminologies: Hazard Awareness: Recognizing the types of natural...

Role of Geography in Disaster Management

Geography plays a pivotal role in disaster management by facilitating an understanding of the impact of natural disasters, guiding preparedness efforts, and supporting effective response and recovery. By analyzing geographical features, environmental conditions, and historical data, geography empowers disaster management professionals to identify risks, plan for hazards, respond to emergencies, assess damage, and monitor recovery. Geographic Information Systems (GIS) serve as crucial tools, providing critical spatial data for informed decision-making throughout the disaster management cycle. Key Concepts, Terminologies, and Examples 1. Identifying Risk: Concept: Risk identification involves analyzing geographical areas to understand their susceptibility to specific natural disasters. By studying historical events, topography, climate patterns, and environmental factors, disaster management experts can predict which regions are most vulnerable. Terminologies: Hazard Risk: The pr...