Skip to main content

WHEN TO USE WHAT STATISTICAL TEST IN RESEARCH

There are several statistical test types for analyzing Research Data. When to use what is often the challenge. This piece provides a simplification 

1️⃣t-test:

- Use when: You want to compare the means of two groups to determine if there's a significant difference.
- Example: You want to compare the average score of students who received traditional teaching vs. those who received innovative teaching.

2️⃣ANOVA (Analysis of Variance):

- Use when: You want to compare the means of three or more groups to determine if there are significant differences.
- Example: You want to compare the average score of students from different schools to determine if there are significant differences in their performance.

3️⃣Regression (Simple and Multiple):

- Use when: You want to examine the relationship between a dependent variable and one or more independent variables.
- Example: You want to examine the relationship between hours studied and exam scores (simple regression), or the relationship between hours studied, exam scores, and student motivation (multiple regression).

4️⃣Chi-squared test:

- Use when: You want to determine if there's a significant association between two categorical variables.
- Example: You want to determine if there's a significant association between smoking and lung cancer.

5️⃣Wilcoxon rank-sum test (Mann-Whitney U test):

- Use when: You want to compare the distributions of two independent groups.
- Example: You want to compare the distribution of scores between students who received traditional teaching and those who received innovative teaching.

6️⃣Kruskal-Wallis H test:

- Use when: You want to compare the distributions of three or more independent groups.
- Example: You want to compare the distribution of scores among students from different schools.

7️⃣Friedman test:

- Use when: You want to compare the distributions of three or more related groups.
- Example: You want to compare the distribution of scores among students at different time points.

8️⃣Pearson correlation coefficient:

- Use when: You want to examine the linear relationship between two continuous variables.
- Example: You want to examine the relationship between hours studied and exam scores.

9️⃣Spearman rank correlation coefficient:

- Use when: You want to examine the relationship between two variables when data is not normally distributed.
- Example: You want to examine the relationship between ranking of favorite foods and ranking of nutritional value.

🔟Kendall's tau correlation coefficient:

- Use when: You want to examine the relationship between two variables when data is ordinal or categorical.
- Example: You want to examine the relationship between socioeconomic status and education level.

1️⃣1️⃣ARIMA models:

- Use when: You want to forecast future values in a time series data.
- Example: You want to predict stock prices based on past trends.

1️⃣2️⃣Exponential smoothing (ES):

- Use when: You want to forecast future values in a time series data with a simple exponential smoothing method.
- Example: You want to predict sales based on past trends.

1️⃣3️⃣Seasonal decomposition:

- Use when: You want to decompose time series data into trend, seasonality, and residuals.
- Example: You want to analyze website traffic data to identify seasonal patterns.

1️⃣4️⃣Kaplan-Meier estimator:

- Use when: You want to estimate the survival function of a population.
- Example: You want to analyze the survival rate of patients with a specific disease.

1️⃣5️⃣Cox proportional hazards model:

- Use when: You want to examine the relationship between covariates and survival time.
- Example: You want to investigate the effect of treatment on survival time.

1️⃣6️⃣Log-rank test:

- Use when: You want to compare the survival curves of two or more groups.
- Example: You want to compare the survival rates of patients with different treatments.

1️⃣7️⃣K-means clustering:

- Use when: You want to group similar observations into clusters based on features.
- Example: You want to segment customers based on buying behavior.

1️⃣8️⃣Hierarchical clustering:

- Use when: You want to group similar observations into clusters based on features, with a hierarchical structure.
- Example: You want to analyze gene expression data to identify clusters of genes.

1️⃣9️⃣DBSCAN (density-based spatial clustering of applications with noise):

- Use when: You want to group similar observations into clusters based on features, with noise handling.
- Example: You want to analyze spatial data to identify clusters of high density.

2️⃣0️⃣Principal component analysis (PCA):

- Use when: You want to reduce the dimensionality of a dataset by identifying principal components.
- Example: You want to analyze stock prices to identify principal components of variation.

2️⃣1️⃣Discriminant analysis:

- Use when: You want to predict group membership based on multivariate data.
- Example: You want to predict customer churn based on usage patterns.

2️⃣2️⃣Canonical correlation analysis:

- Use when: You want to examine the relationship between two sets of multivariate data.
- Example: You want to investigate the relationship between personality traits and behavior.

2️⃣3️⃣Bayesian inference:

- Use when: You want to update probabilities based on new data.
- Example: You want to update the probability of a hypothesis based on new evidence.

2️⃣4️⃣Bayesian regression:

- Use when: You want to model the relationship between variables using Bayesian methods.
- Example:

2️⃣5️⃣Bayesian networks:

- Use when: You want to model complex relationships between variables using Bayesian methods.
- Example: You want to model the relationship between genes and diseases.

2️⃣6️⃣Decision trees:

- Use when: You want to classify observations based on a tree-like model.
- Example: You want to predict customer churn based on usage patterns.

2️⃣7️⃣Random forests:

- Use when: You want to classify observations based on an ensemble of decision trees.
- Example: You want to predict disease diagnosis based on symptoms.

2️⃣8️⃣Support vector machines (SVMs):

- Use when: You want to classify observations based on a hyperplane.
- Example: You want to predict customer churn based on usage patterns.

2️⃣9️⃣Cluster analysis:

- Use when: You want to group similar observations into clusters based on features.
- Example: You want to segment customers based on buying behavior.

3️⃣0️⃣Factor analysis:

- Use when: You want to reduce the dimensionality of a dataset by identifying underlying factors.
- Example: You want to analyze survey data to identify underlying factors of satisfaction.

3️⃣1️⃣Survival analysis:

- Use when: You want to analyze the time-to-event data.
- Example: You want to analyze the survival rate of patients with a specific disease.

3️⃣2️⃣Time-series analysis:

- Use when: You want to analyze data that is ordered in time.
- Example: You want to analyze stock prices to identify patterns and trends.

3️⃣3️⃣Non-parametric tests:

- Use when: You want to analyze data without assuming a specific distribution.
- Example: You want to compare the median scores of students who received traditional teaching vs. those who received innovative teaching.

3️⃣4️⃣Machine learning algorithms:

- Use when: You want to predict outcomes or classify observations based on large datasets.
- Example: You want to predict customer churn based on usage patterns.

The specific test or technique used depends on the research question, data type, and study design.




Comments

Popular posts from this blog

Groundwater – Porosity and Permeability

Groundwater refers to the water that resides beneath the Earth's surface in the pores and crevices of rock, sediment, and soil. Two key properties that influence the movement and storage of groundwater are porosity and permeability: 1. Porosity:    - Definition: Porosity refers to the volume percentage of void spaces (pores or openings) in a geological material, such as soil or rock.    - Role: Porosity determines how much water a subsurface material can hold. It is a measure of the material's capacity to store water.    - Factors: Porosity is influenced by the size and arrangement of particles within the material. Highly porous materials have more void spaces, while less porous materials have fewer.    - Units: Porosity is expressed as a percentage, with 0% indicating complete solidity (no pore spaces) and 100% indicating complete void space. 2. Permeability:    - Definition: Permeability refers to the ability of a geological material to transmit fluids, such as water. It meas

Ground Water

Groundwater Terminology, Concepts, and Facts Key Terms Aquifer: A geological formation that can store and transmit significant quantities of water. Water Table: The upper surface of the saturated zone in an aquifer. Recharge: The process of replenishing groundwater through precipitation or other sources. Discharge: The process of groundwater flowing out of an aquifer, typically into surface water bodies or through wells. Hydraulic Gradient: The slope of the water table. Darcy's Law: A law that describes the flow of groundwater through porous media. Permeability: The ability of a material to transmit water. Porosity: The amount of void space in a material. Concepts Groundwater Flow: Groundwater moves from areas of higher hydraulic head to areas of lower hydraulic head. Groundwater Contamination: The introduction of pollutants into groundwater. Groundwater Depletion: The excessive extraction of groundwater, leading to a decline in water table levels. Saltwater Intrusion:

Artisan Wells Basins

Artisan Wells and Basins Artesian wells are a type of well that harnesses the natural pressure of water trapped underground to force water to the surface without pumping. This phenomenon occurs in specific geological formations known as artesian basins . Key Terminologies and Concepts Aquifer: A geological formation that can store and transmit water. It is typically made up of porous rocks or sediments like sandstone or gravel. Confined Aquifer: An aquifer that is sandwiched between two impermeable layers (like clay or shale) that prevent water from escaping. Artesian Basin: A geological structure where a confined aquifer is tilted and has a recharge area at a higher elevation than the discharge area. This creates a pressure gradient that forces water to flow upwards. Potentiometric Surface: The theoretical level to which water would rise in a well drilled into an artesian aquifer if there were no restrictions. It is determined by the pressure head in the aquifer. Flowing Artesian

Discrete Detectors and Scanning mirrors Across the track scanner Whisk broom scanner.

Multispectral Imaging Using Discrete Detectors and Scanning Mirrors (Across-Track Scanner or Whisk Broom Scanner) Multispectral Imaging:  This technique involves capturing images of the Earth's surface using multiple sensors that are sensitive to different wavelengths of electromagnetic radiation.  This allows for the identification of various features and materials based on their spectral signatures. Discrete Detectors:  These are individual sensors that are arranged in a linear or array configuration.  Each detector is responsible for measuring the radiation within a specific wavelength band. Scanning Mirrors:  These are optical components that are used to deflect the incoming radiation onto the discrete detectors.  By moving the mirrors,  the sensor can scan across the scene,  capturing data from different points. Across-Track Scanner or Whisk Broom Scanner:  This refers to the scanning mechanism where the mirror moves perpendicular to the direction of flight.  This allows for t

Watershed. Catchment. Basin

A watershed, also known as a river basin or drainage basin, is a fundamental concept in geohydrology and hydrology. It refers to a specific geographic area or region of land where all the surface water, including rainfall, snowmelt, and runoff, drains into a common outlet, such as a river, lake, or ocean. Here's an explanation of each term: 1. Watershed: A watershed is essentially a natural hydrological unit defined by the topography of the land. It represents the entire area from which all precipitation and surface water flow eventually gathers at a single point. This point is typically where the main river or stream within the watershed exits into a larger body of water, such as an ocean. Watersheds come in various sizes, from small ones that encompass a few square miles to enormous ones that cover entire continents. 2. River/Drainage Basin: A river basin or drainage basin is another way to describe a watershed. It's the land area that collects and channels water into a river