Skip to main content

Supervised Classification

In the context of Remote Sensing (RS) and Digital Image Processing (DIP), supervised classification is the process where an analyst defines "training sites" (Areas of Interest or ROIs) representing known land cover classes (e.g., Water, Forest, Urban). The computer then uses these training samples to teach an algorithm how to classify the rest of the image pixels.

The algorithms used to classify these pixels are generally divided into two broad categories: Parametric and Nonparametric decision rules.


Parametric Decision Rules

These algorithms assume that the pixel values in the training data follow a specific statistical distribution—almost always the Gaussian (Normal) distribution (the "Bell Curve").

  • Key Concept: They model the data using statistical parameters: the Mean vector ($\mu$) and the Covariance matrix ($\Sigma$).

  • Analogy: Imagine trying to fit a smooth hill over your data points. If a new point lands high up on the hill, it belongs to that class.

Nonparametric Decision Rules

These algorithms make no assumptions about the statistical distribution of the data. They do not care if the data fits a bell curve.

  • Key Concept: They classify based on discrete geometric shapes (polygons, boxes) or the relative position of the data points themselves.

  • Analogy: Imagine drawing a literal box or fence around your data points. If a new point falls inside the fence, it belongs to that class.


A. Minimum-Distance-to-Means (MDM)

  • Classification: Generally considered a simple Parametric classifier (as it relies on the mean parameter), though it operates geometrically.

  • How it works:

    1. The algorithm calculates the spectral mean vector (the center point or centroid) for each training class.

    2. For every unclassified pixel in the image, it calculates the Euclidean distance to the mean of every class.

    3. The pixel is assigned to the class with the shortest distance.


  • Pros: Very fast computationally; mathematically simple.

  • Cons: It is insensitive to the variance (spread) of the data.

    • Example: If "Urban" data is very scattered (high variance) and "Water" is very tight (low variance), a pixel far from the Urban center might actually belong to Urban, but MDM might classify it as Water just because the Water mean is slightly closer geometrically.

B. Parallelepiped Classification

  • Classification: Nonparametric.

  • How it works:

    1. The algorithm looks at the training data and finds the minimum and maximum brightness values for each band.

    2. It creates a rectangular box (a parallelepiped in multi-dimensional space) defined by these limits.

    3. If a pixel's value falls within the box, it is assigned to that class.

  • Pros: Extremely fast; easy to understand conceptually.

  • Cons:

    • The Correlation Problem: Real remote sensing data (like vegetation in Red vs. NIR bands) is often correlated (diagonal distribution). A rectangular box cannot fit a diagonal data cloud efficiently, leading to large "empty corners" in the box that capture noise/wrong pixels.

    • Overlapping: Pixels often fall into the overlapping area of two boxes, leaving the computer unable to decide.

C. Gaussian Maximum Likelihood (GML/MLC)

  • Classification: Parametric (The standard industry workhorse).

  • How it works:

    1. It assumes the data for each class is normally distributed.

    2. It uses both the Mean vector AND the Covariance matrix to calculate the probability density function.

    3. It calculates the statistical probability of a pixel belonging to each class.

    4. It constructs ellipsoidal equiprobability contours (rather than circles or boxes).

  • Pros: Highly accurate because it accounts for the variance (spread) and covariance (correlation/direction) of the data. It handles "diagonal" data clouds perfectly.

  • Cons: Computationally expensive (slow on massive images); requires a large number of training pixels per class to compute a stable covariance matrix (usually $10N$ to $100N$ pixels, where $N$ is the number of bands).


FeatureParallelepipedMinimum DistanceMaximum Likelihood
TypeNonparametricParametric (Simple)Parametric (Advanced)
GeometryRectangular BoxesCircles/SpheresEllipsoids
AssumptionsNone (Min/Max thresholds)Mean Center PointGaussian Distribution
SpeedVery FastFastSlow / Intensive
AccuracyLow to ModerateModerateHigh
Best Used ForQuick looks; Uncorrelated dataWell-separated classesComplex, correlated data


Comments

Popular posts from this blog

REMOTE SENSING INDICES

Remote sensing indices are band ratios designed to highlight specific surface features (vegetation, soil, water, urban areas, snow, burned areas, etc.) using the spectral reflectance properties of the Earth's surface. They improve classification accuracy and environmental monitoring. 1. Vegetation Indices NDVI – Normalized Difference Vegetation Index Formula: (NIR – RED) / (NIR + RED) Concept: Vegetation reflects strongly in NIR and absorbs in RED due to chlorophyll. Measures: Vegetation greenness & health Uses: Agriculture, drought monitoring, biomass estimation EVI – Enhanced Vegetation Index Formula: G × (NIR – RED) / (NIR + C1×RED – C2×BLUE + L) Concept: Corrects for soil and atmospheric noise. Measures: Vegetation vigor in dense canopies Uses: Tropical rainforest mapping, high biomass regions GNDVI – Green Normalized Difference Vegetation Index Formula: (NIR – GREEN) / (NIR + GREEN) Concept: Uses Green instead of Red ...

Landsat band composition

Short-Wave Infrared (7, 6 4) The short-wave infrared band combination uses SWIR-2 (7), SWIR-1 (6), and red (4). This composite displays vegetation in shades of green. While darker shades of green indicate denser vegetation, sparse vegetation has lighter shades. Urban areas are blue and soils have various shades of brown. Agriculture (6, 5, 2) This band combination uses SWIR-1 (6), near-infrared (5), and blue (2). It's commonly used for crop monitoring because of the use of short-wave and near-infrared. Healthy vegetation appears dark green. But bare earth has a magenta hue. Geology (7, 6, 2) The geology band combination uses SWIR-2 (7), SWIR-1 (6), and blue (2). This band combination is particularly useful for identifying geological formations, lithology features, and faults. Bathymetric (4, 3, 1) The bathymetric band combination (4,3,1) uses the red (4), green (3), and coastal bands to peak into water. The coastal band is useful in coastal, bathymetric, and aerosol studies because...

Atmospheric Window

The atmospheric window in remote sensing refers to specific wavelength ranges within the electromagnetic spectrum that can pass through the Earth's atmosphere relatively unimpeded. These windows are crucial for remote sensing applications because they allow us to observe the Earth's surface and atmosphere without significant interference from the atmosphere's constituents. Key facts and concepts about atmospheric windows: Visible and Near-Infrared (VNIR) window: This window encompasses wavelengths from approximately 0. 4 to 1. 0 micrometers. It is ideal for observing vegetation, water bodies, and land cover types. Shortwave Infrared (SWIR) window: This window covers wavelengths from approximately 1. 0 to 3. 0 micrometers. It is particularly useful for detecting minerals, water content, and vegetation health. Mid-Infrared (MIR) window: This window spans wavelengths from approximately 3. 0 to 8. 0 micrometers. It is valuable for identifying various materials, incl...

Landsat 8 Band designation and Band Combination.

Landsat 8 Band designation and Band Combination.  Landsat 8-9 Operational Land Imager (OLI) and Thermal Infrared Sensor (TIRS) Bands Wavelength (micrometers) Resolution (meters) Band 1 - Coastal aerosol 0.43-0.45 30 Band 2 - Blue 0.45-0.51 30 Band 3 - Green 0.53-0.59 30 Band 4 - Red 0.64-0.67 30 Band 5 - Near Infrared (NIR) 0.85-0.88 30 Band 6 - SWIR 1 1.57-1.65 30 Band 7 - SWIR 2 2.11-2.29 30 Band 8 - Panchromatic 0.50-0.68 15 Band 9 - Cirrus 1.36-1.38 30 Band 10 - Thermal Infrared (TIRS) 1 10.6-11.19 100 Band 11 - Thermal Infrared (TIRS) 2 11.50-12.51 100 Vineesh V Assistant Professor of Geography, Directorate of Education, Government of Kerala. https://www.facebook.com/Applied.Geography http://geogisgeo.blogspot.com

Energy Interaction with Atmosphere and Earth Surface

In Remote Sensing , satellites record electromagnetic radiation (EMR) that is reflected or emitted from the Earth. Before reaching the sensor, radiation interacts with: The Atmosphere The Earth's Surface These interactions control how satellite images look and how we interpret them. I. Interaction of EMR with the Atmosphere When solar radiation travels from the Sun to the Earth, four main processes occur: 1. Absorption Definition: Absorption occurs when atmospheric gases absorb radiation at specific wavelengths and convert it into heat. Main absorbing gases: Ozone (O₃) → absorbs Ultraviolet (UV) Carbon dioxide (CO₂) → absorbs Thermal Infrared Water vapour (H₂O) → absorbs Infrared Concept: Atmospheric Windows These are wavelength regions where absorption is very low, allowing radiation to pass through the atmosphere. Remote sensing depends on these windows. For example, satellites like Landsat 8 use visible, near-infrared, and thermal bands located in atmospheric windows. 2. Trans...