Skip to main content

Supervised Classification

In the context of Remote Sensing (RS) and Digital Image Processing (DIP), supervised classification is the process where an analyst defines "training sites" (Areas of Interest or ROIs) representing known land cover classes (e.g., Water, Forest, Urban). The computer then uses these training samples to teach an algorithm how to classify the rest of the image pixels.

The algorithms used to classify these pixels are generally divided into two broad categories: Parametric and Nonparametric decision rules.


Parametric Decision Rules

These algorithms assume that the pixel values in the training data follow a specific statistical distribution—almost always the Gaussian (Normal) distribution (the "Bell Curve").

  • Key Concept: They model the data using statistical parameters: the Mean vector ($\mu$) and the Covariance matrix ($\Sigma$).

  • Analogy: Imagine trying to fit a smooth hill over your data points. If a new point lands high up on the hill, it belongs to that class.

Nonparametric Decision Rules

These algorithms make no assumptions about the statistical distribution of the data. They do not care if the data fits a bell curve.

  • Key Concept: They classify based on discrete geometric shapes (polygons, boxes) or the relative position of the data points themselves.

  • Analogy: Imagine drawing a literal box or fence around your data points. If a new point falls inside the fence, it belongs to that class.


A. Minimum-Distance-to-Means (MDM)

  • Classification: Generally considered a simple Parametric classifier (as it relies on the mean parameter), though it operates geometrically.

  • How it works:

    1. The algorithm calculates the spectral mean vector (the center point or centroid) for each training class.

    2. For every unclassified pixel in the image, it calculates the Euclidean distance to the mean of every class.

    3. The pixel is assigned to the class with the shortest distance.


  • Pros: Very fast computationally; mathematically simple.

  • Cons: It is insensitive to the variance (spread) of the data.

    • Example: If "Urban" data is very scattered (high variance) and "Water" is very tight (low variance), a pixel far from the Urban center might actually belong to Urban, but MDM might classify it as Water just because the Water mean is slightly closer geometrically.

B. Parallelepiped Classification

  • Classification: Nonparametric.

  • How it works:

    1. The algorithm looks at the training data and finds the minimum and maximum brightness values for each band.

    2. It creates a rectangular box (a parallelepiped in multi-dimensional space) defined by these limits.

    3. If a pixel's value falls within the box, it is assigned to that class.

  • Pros: Extremely fast; easy to understand conceptually.

  • Cons:

    • The Correlation Problem: Real remote sensing data (like vegetation in Red vs. NIR bands) is often correlated (diagonal distribution). A rectangular box cannot fit a diagonal data cloud efficiently, leading to large "empty corners" in the box that capture noise/wrong pixels.

    • Overlapping: Pixels often fall into the overlapping area of two boxes, leaving the computer unable to decide.

C. Gaussian Maximum Likelihood (GML/MLC)

  • Classification: Parametric (The standard industry workhorse).

  • How it works:

    1. It assumes the data for each class is normally distributed.

    2. It uses both the Mean vector AND the Covariance matrix to calculate the probability density function.

    3. It calculates the statistical probability of a pixel belonging to each class.

    4. It constructs ellipsoidal equiprobability contours (rather than circles or boxes).

  • Pros: Highly accurate because it accounts for the variance (spread) and covariance (correlation/direction) of the data. It handles "diagonal" data clouds perfectly.

  • Cons: Computationally expensive (slow on massive images); requires a large number of training pixels per class to compute a stable covariance matrix (usually $10N$ to $100N$ pixels, where $N$ is the number of bands).


FeatureParallelepipedMinimum DistanceMaximum Likelihood
TypeNonparametricParametric (Simple)Parametric (Advanced)
GeometryRectangular BoxesCircles/SpheresEllipsoids
AssumptionsNone (Min/Max thresholds)Mean Center PointGaussian Distribution
SpeedVery FastFastSlow / Intensive
AccuracyLow to ModerateModerateHigh
Best Used ForQuick looks; Uncorrelated dataWell-separated classesComplex, correlated data


Comments

Popular posts from this blog

Accuracy Assessment

Accuracy assessment is the process of checking how correct your classified satellite image is . 👉 After supervised classification, the satellite image is divided into classes like: Water Forest Agriculture Built-up land Barren land But classification is done using computer algorithms, so some areas may be wrongly classified . 👉 Accuracy assessment helps to answer this question: ✔ "How much of my classified map is correct compared to real ground conditions?"  Goal The main goal is to: Measure reliability of classified maps Identify classification errors Improve classification results Provide scientific validity to research 👉 Without accuracy assessment, a classified map is not considered scientifically reliable . Reference Data (Ground Truth Data) Reference data is real-world information used to check classification accuracy. It can be collected from: ✔ Field survey using GPS ✔ High-resolution satellite images (Google Earth etc.) ✔ Existing maps or survey reports 🧭 Exampl...

Landsat 8 Band designation and Band Combination.

Landsat 8 Band designation and Band Combination.  Landsat 8-9 Operational Land Imager (OLI) and Thermal Infrared Sensor (TIRS) Bands Wavelength (micrometers) Resolution (meters) Band 1 - Coastal aerosol 0.43-0.45 30 Band 2 - Blue 0.45-0.51 30 Band 3 - Green 0.53-0.59 30 Band 4 - Red 0.64-0.67 30 Band 5 - Near Infrared (NIR) 0.85-0.88 30 Band 6 - SWIR 1 1.57-1.65 30 Band 7 - SWIR 2 2.11-2.29 30 Band 8 - Panchromatic 0.50-0.68 15 Band 9 - Cirrus 1.36-1.38 30 Band 10 - Thermal Infrared (TIRS) 1 10.6-11.19 100 Band 11 - Thermal Infrared (TIRS) 2 11.50-12.51 100 Vineesh V Assistant Professor of Geography, Directorate of Education, Government of Kerala. https://www.facebook.com/Applied.Geography http://geogisgeo.blogspot.com

Change Detection

Change detection is the process of finding differences on the Earth's surface over time by comparing satellite images of the same area taken on different dates . After supervised classification , two classified maps (e.g., Year-1 and Year-2) are compared to identify land use / land cover changes .  Goal To detect where , what , and how much change has occurred To monitor urban growth, deforestation, floods, agriculture, etc.  Basic Concept Forest → Forest = No change Forest → Urban = Change detected Key Terminologies Multi-temporal images : Images of the same area at different times Post-classification comparison : Comparing two classified maps Change matrix : Table showing class-to-class change Change / No-change : Whether land cover remains same or different Main Methods Post-classification comparison – Most common and easy Image differencing – Subtract pixel values Image ratioing – Divide pixel values Deep learning methods – Advanced AI-based detection Examples Agricult...

Landsat band composition

Short-Wave Infrared (7, 6 4) The short-wave infrared band combination uses SWIR-2 (7), SWIR-1 (6), and red (4). This composite displays vegetation in shades of green. While darker shades of green indicate denser vegetation, sparse vegetation has lighter shades. Urban areas are blue and soils have various shades of brown. Agriculture (6, 5, 2) This band combination uses SWIR-1 (6), near-infrared (5), and blue (2). It's commonly used for crop monitoring because of the use of short-wave and near-infrared. Healthy vegetation appears dark green. But bare earth has a magenta hue. Geology (7, 6, 2) The geology band combination uses SWIR-2 (7), SWIR-1 (6), and blue (2). This band combination is particularly useful for identifying geological formations, lithology features, and faults. Bathymetric (4, 3, 1) The bathymetric band combination (4,3,1) uses the red (4), green (3), and coastal bands to peak into water. The coastal band is useful in coastal, bathymetric, and aerosol studies because...

Development and scope of Environmental Geography and Recent concepts in environmental Geography

Environmental Geography studies the relationship between humans and nature in a spatial (place-based) way. It combines Physical Geography (natural processes) and Human Geography (human activities). A. Early Stage 🔹 Environmental Determinism Concept: Nature controls human life. Meaning: Climate, landforms, and soil decide how people live. Example: People in deserts (like Sahara Desert) live differently from people in fertile river valleys. 🔹 Possibilism Concept: Humans can modify nature. Meaning: Environment gives options, but humans make choices. Example: In dry areas like Rajasthan, people use irrigation to grow crops. 👉 In this stage, geography was mostly descriptive (explaining what exists). B. Evolution Stage (Mid-20th Century) Environmental problems increased due to: Industrialization Urbanization Deforestation Pollution Geographers started studying: Environmental degradation Resource management Human impact on ecosystems The field became analytical and problem-solving...