Skip to main content

WHEN TO USE WHAT STATISTICAL TEST IN RESEARCH

There are several statistical test types for analyzing Research Data. When to use what is often the challenge. This piece provides a simplification 

1️⃣t-test:

- Use when: You want to compare the means of two groups to determine if there's a significant difference.
- Example: You want to compare the average score of students who received traditional teaching vs. those who received innovative teaching.

2️⃣ANOVA (Analysis of Variance):

- Use when: You want to compare the means of three or more groups to determine if there are significant differences.
- Example: You want to compare the average score of students from different schools to determine if there are significant differences in their performance.

3️⃣Regression (Simple and Multiple):

- Use when: You want to examine the relationship between a dependent variable and one or more independent variables.
- Example: You want to examine the relationship between hours studied and exam scores (simple regression), or the relationship between hours studied, exam scores, and student motivation (multiple regression).

4️⃣Chi-squared test:

- Use when: You want to determine if there's a significant association between two categorical variables.
- Example: You want to determine if there's a significant association between smoking and lung cancer.

5️⃣Wilcoxon rank-sum test (Mann-Whitney U test):

- Use when: You want to compare the distributions of two independent groups.
- Example: You want to compare the distribution of scores between students who received traditional teaching and those who received innovative teaching.

6️⃣Kruskal-Wallis H test:

- Use when: You want to compare the distributions of three or more independent groups.
- Example: You want to compare the distribution of scores among students from different schools.

7️⃣Friedman test:

- Use when: You want to compare the distributions of three or more related groups.
- Example: You want to compare the distribution of scores among students at different time points.

8️⃣Pearson correlation coefficient:

- Use when: You want to examine the linear relationship between two continuous variables.
- Example: You want to examine the relationship between hours studied and exam scores.

9️⃣Spearman rank correlation coefficient:

- Use when: You want to examine the relationship between two variables when data is not normally distributed.
- Example: You want to examine the relationship between ranking of favorite foods and ranking of nutritional value.

๐Ÿ”ŸKendall's tau correlation coefficient:

- Use when: You want to examine the relationship between two variables when data is ordinal or categorical.
- Example: You want to examine the relationship between socioeconomic status and education level.

1️⃣1️⃣ARIMA models:

- Use when: You want to forecast future values in a time series data.
- Example: You want to predict stock prices based on past trends.

1️⃣2️⃣Exponential smoothing (ES):

- Use when: You want to forecast future values in a time series data with a simple exponential smoothing method.
- Example: You want to predict sales based on past trends.

1️⃣3️⃣Seasonal decomposition:

- Use when: You want to decompose time series data into trend, seasonality, and residuals.
- Example: You want to analyze website traffic data to identify seasonal patterns.

1️⃣4️⃣Kaplan-Meier estimator:

- Use when: You want to estimate the survival function of a population.
- Example: You want to analyze the survival rate of patients with a specific disease.

1️⃣5️⃣Cox proportional hazards model:

- Use when: You want to examine the relationship between covariates and survival time.
- Example: You want to investigate the effect of treatment on survival time.

1️⃣6️⃣Log-rank test:

- Use when: You want to compare the survival curves of two or more groups.
- Example: You want to compare the survival rates of patients with different treatments.

1️⃣7️⃣K-means clustering:

- Use when: You want to group similar observations into clusters based on features.
- Example: You want to segment customers based on buying behavior.

1️⃣8️⃣Hierarchical clustering:

- Use when: You want to group similar observations into clusters based on features, with a hierarchical structure.
- Example: You want to analyze gene expression data to identify clusters of genes.

1️⃣9️⃣DBSCAN (density-based spatial clustering of applications with noise):

- Use when: You want to group similar observations into clusters based on features, with noise handling.
- Example: You want to analyze spatial data to identify clusters of high density.

2️⃣0️⃣Principal component analysis (PCA):

- Use when: You want to reduce the dimensionality of a dataset by identifying principal components.
- Example: You want to analyze stock prices to identify principal components of variation.

2️⃣1️⃣Discriminant analysis:

- Use when: You want to predict group membership based on multivariate data.
- Example: You want to predict customer churn based on usage patterns.

2️⃣2️⃣Canonical correlation analysis:

- Use when: You want to examine the relationship between two sets of multivariate data.
- Example: You want to investigate the relationship between personality traits and behavior.

2️⃣3️⃣Bayesian inference:

- Use when: You want to update probabilities based on new data.
- Example: You want to update the probability of a hypothesis based on new evidence.

2️⃣4️⃣Bayesian regression:

- Use when: You want to model the relationship between variables using Bayesian methods.
- Example:

2️⃣5️⃣Bayesian networks:

- Use when: You want to model complex relationships between variables using Bayesian methods.
- Example: You want to model the relationship between genes and diseases.

2️⃣6️⃣Decision trees:

- Use when: You want to classify observations based on a tree-like model.
- Example: You want to predict customer churn based on usage patterns.

2️⃣7️⃣Random forests:

- Use when: You want to classify observations based on an ensemble of decision trees.
- Example: You want to predict disease diagnosis based on symptoms.

2️⃣8️⃣Support vector machines (SVMs):

- Use when: You want to classify observations based on a hyperplane.
- Example: You want to predict customer churn based on usage patterns.

2️⃣9️⃣Cluster analysis:

- Use when: You want to group similar observations into clusters based on features.
- Example: You want to segment customers based on buying behavior.

3️⃣0️⃣Factor analysis:

- Use when: You want to reduce the dimensionality of a dataset by identifying underlying factors.
- Example: You want to analyze survey data to identify underlying factors of satisfaction.

3️⃣1️⃣Survival analysis:

- Use when: You want to analyze the time-to-event data.
- Example: You want to analyze the survival rate of patients with a specific disease.

3️⃣2️⃣Time-series analysis:

- Use when: You want to analyze data that is ordered in time.
- Example: You want to analyze stock prices to identify patterns and trends.

3️⃣3️⃣Non-parametric tests:

- Use when: You want to analyze data without assuming a specific distribution.
- Example: You want to compare the median scores of students who received traditional teaching vs. those who received innovative teaching.

3️⃣4️⃣Machine learning algorithms:

- Use when: You want to predict outcomes or classify observations based on large datasets.
- Example: You want to predict customer churn based on usage patterns.

The specific test or technique used depends on the research question, data type, and study design.




Comments

Popular posts from this blog

Platforms in Remote Sensing

In remote sensing, a platform is the physical structure or vehicle that carries a sensor (camera, scanner, radar, etc.) to observe and collect information about the Earth's surface. Platforms are classified mainly by their altitude and mobility : Ground-Based Platforms Definition : Sensors mounted on the Earth's surface or very close to it. Examples : Tripods, towers, ground vehicles, handheld instruments. Applications : Calibration and validation of satellite data Detailed local studies (e.g., soil properties, vegetation health, air quality) Strength : High spatial detail but limited coverage. Airborne Platforms Definition : Sensors carried by aircraft, balloons, or drones (UAVs). Altitude : A few hundred meters to ~20 km. Examples : Airplanes with multispectral scanners UAVs with high-resolution cameras or LiDAR High-altitude balloons (stratospheric platforms) Applications : Local-to-regional mapping ...

Types of Remote Sensing

Remote Sensing means collecting information about the Earth's surface without touching it , usually using satellites, aircraft, or drones . There are different types of remote sensing based on the energy source and the wavelength region used. ๐Ÿ›ฐ️ 1. Active Remote Sensing ๐Ÿ“˜ Concept: In active remote sensing , the sensor sends out its own energy (like a signal or pulse) to the Earth's surface. The sensor then records the reflected or backscattered energy that comes back from the surface. ⚙️ Key Terminology: Transmitter: sends energy (like a radar pulse or laser beam). Receiver: detects the energy that bounces back. Backscatter: energy that is reflected back to the sensor. ๐Ÿ“Š Examples of Active Sensors: RADAR (Radio Detection and Ranging): Uses microwave signals to detect surface roughness, soil moisture, or ocean waves. LiDAR (Light Detection and Ranging): Uses laser light (near-infrared) to measure elevation, vegetation...

Optical Sensors in Remote Sensing

1. What Are Optical Sensors? Optical sensors are remote sensing instruments that detect solar radiation reflected or emitted from the Earth's surface in specific portions of the electromagnetic spectrum (EMS) . They mainly work in: Visible region (0.4–0.7 ยตm) Near-Infrared – NIR (0.7–1.3 ยตm) Shortwave Infrared – SWIR (1.3–3.0 ยตm) Thermal Infrared – TIR (8–14 ยตm) — emitted energy, not reflected Optical sensors capture spectral signatures of surface features. Each object reflects/absorbs energy differently, creating a unique spectral response pattern . a) Electromagnetic Spectrum (EMS) The continuous range of wavelengths. Optical sensing uses solar reflective bands and sometimes thermal bands . b) Spectral Signature The unique pattern of reflectance or absorbance of an object across wavelengths. Example: Vegetation reflects strongly in NIR Water absorbs strongly in NIR and SWIR (appears dark) c) Radiance and Reflectance Radi...

Resolution of Sensors in Remote Sensing

Spatial Resolution ๐Ÿ—บ️ Definition : The smallest size of an object on the ground that a sensor can detect. Measured as : The size of a pixel on the ground (in meters). Example : Landsat → 30 m (each pixel = 30 × 30 m on Earth). WorldView-3 → 0.31 m (very detailed, you can see cars). Fact : Higher spatial resolution = finer details, but smaller coverage. Spectral Resolution ๐ŸŒˆ Definition : The ability of a sensor to capture information in different parts (bands) of the electromagnetic spectrum . Measured as : The number and width of spectral bands. Types : Panchromatic (1 broad band, e.g., black & white image). Multispectral (several broad bands, e.g., Landsat with 7–13 bands). Hyperspectral (hundreds of very narrow bands, e.g., AVIRIS). Fact : Higher spectral resolution = better identification of materials (e.g., minerals, vegetation types). Radiometric Resolution ๐Ÿ“Š Definition : The ability of a sensor to ...

Radar Sensors in Remote Sensing

Radar sensors are active remote sensing instruments that use microwave radiation to detect and measure Earth's surface features. They transmit their own energy (radio waves) toward the Earth and record the backscattered signal that returns to the sensor. Since they do not depend on sunlight, radar systems can collect data: day or night through clouds, fog, smoke, and rain in all weather conditions This makes radar extremely useful for Earth observation. 1. Active Sensor A radar sensor produces and transmits its own microwaves. This is different from optical and thermal sensors, which depend on sunlight or emitted heat. 2. Microwave Region Radar operates in the microwave region of the electromagnetic spectrum , typically from 1 mm to 1 m wavelength. Common radar frequency bands: P-band (70 cm) L-band (23 cm) S-band (9 cm) C-band (5.6 cm) X-band (3 cm) Each band penetrates and interacts with surfaces differently: Lo...