Quartiles divide data into four equal parts to show distribution.

Descriptive Statistics Calculator – Mean, Median, Mode, Std Dev & Variance

What is Descriptive Statistics?

Descriptive statistics are numerical and graphical methods used to summarize and describe the main features of a dataset. They provide simple summaries about the sample and the measures, forming the basis of quantitative data analysis.

Key Components:

Measures of Central Tendency: Mean, median, mode - describe the center of the data
Measures of Dispersion: Range, variance, standard deviation - describe the spread of data
Measures of Distribution: Skewness, kurtosis, quartiles - describe the shape of data distribution
Graphical Representations: Histograms, box plots, scatter plots - visualize data patterns

Population vs Sample

Population statistics describe entire groups, while sample statistics describe subsets. Different formulas are used for variance and standard deviation.

Population: σ² = Σ(x - μ)² / N
Sample: s² = Σ(x - x̄)² / (n-1)

Data Types

Different statistical measures are appropriate for different types of data: nominal, ordinal, interval, and ratio scales.

Nominal: Mode only
Ordinal: Median, mode
Interval/Ratio: All measures

Statistical Significance

Descriptive statistics form the foundation for inferential statistics, hypothesis testing, and making predictions about populations.

Mean → Confidence intervals
Variance → Hypothesis tests
Distribution → Model selection

Measures of Central Tendency

These measures describe the center or typical value of a dataset.

Mean (Average)

The arithmetic average of all values. Most common measure but sensitive to outliers.

x̄ = Σx / n

Data: [2, 4, 6, 8, 10]
Mean = (2+4+6+8+10)/5 = 30/5 = 6

Median (Middle Value)

The middle value when data is sorted. Robust to outliers and skewed distributions.

Median = middle value of sorted data

Data: [3, 1, 7, 5, 9]
Sorted: [1, 3, 5, 7, 9]
Median = 5

Mode (Most Frequent)

The value that appears most frequently. Can have multiple modes or no mode.

Mode = most frequent value(s)

Data: [2, 3, 3, 4, 5, 3, 6]
Mode = 3 (appears 3 times)

When to Use Each

Mean for symmetric data, median for skewed data or outliers, mode for categorical data or identifying peaks.

Symmetric: Use mean
Skewed: Use median
Categorical: Use mode

Relationship between mean, median, and mode: In a perfectly normal distribution, mean = median = mode. In right-skewed distributions, mean > median > mode. In left-skewed distributions, mean < median < mode.

Measures of Dispersion

These measures describe how spread out or variable the data is.

Range

The difference between maximum and minimum values. Simple but sensitive to outliers.

Range = Max - Min

Data: [10, 20, 30, 40, 50]
Range = 50 - 10 = 40

Variance

Average of squared deviations from the mean. Measures average squared distance from mean.

σ² = Σ(x - μ)² / N (population)
s² = Σ(x - x̄)² / (n-1) (sample)

Standard Deviation

Square root of variance. In same units as data. Most common measure of spread.

σ = √σ² (population)
s = √s² (sample)

Variance = 25
Std Dev = √25 = 5

Interquartile Range (IQR)

Range of middle 50% of data. Robust to outliers. Q3 - Q1.

IQR = Q3 - Q1

Q1 = 25, Q3 = 75
IQR = 75 - 25 = 50

Empirical Rule (Normal Distribution):
• 68% of data within ±1 standard deviation of mean
• 95% of data within ±2 standard deviations of mean
• 99.7% of data within ±3 standard deviations of mean

Distribution Measures

These measures describe the shape and characteristics of data distribution.

Quartiles

Divide data into four equal parts. Q1 (25th percentile), Q2 (median, 50th), Q3 (75th).

Data divided into:
25% below Q1
25% between Q1-Q2
25% between Q2-Q3
25% above Q3

Skewness

Measures asymmetry of distribution. Positive = right skew, negative = left skew, zero = symmetric.

Skewness = 3(Mean - Median)/Std Dev

Right skew: Mean > Median
Left skew: Mean < Median
Symmetric: Mean = Median

Kurtosis

Measures "tailedness" or peakiness. High kurtosis = heavy tails, low kurtosis = light tails.

Leptokurtic: High peak, heavy tails
Mesokurtic: Normal distribution
Platykurtic: Low peak, light tails

Percentiles

Values below which a given percentage of observations fall. P50 = median, P25 = Q1, P75 = Q3.

Pₖ = value at k% position

90th percentile means
90% of values are below
10% of values are above

Box Plot Elements: Visual representation showing minimum, Q1, median, Q3, maximum, and potential outliers. The box shows IQR, whiskers show range, and points show outliers beyond 1.5×IQR.

Real-World Applications

Descriptive statistics are used across numerous fields for data analysis and decision-making:

Business & Finance

Sales analysis and forecasting
Financial risk assessment
Market research analysis
Quality control metrics
Customer behavior analysis

Healthcare & Medicine

Clinical trial results analysis
Patient vital statistics
Epidemiological studies
Treatment effectiveness
Medical research data

Education & Research

Test score analysis
Academic performance tracking
Survey data summarization
Research data analysis
Educational assessment

Science & Engineering

Experimental data analysis
Quality assurance testing
Process control monitoring
Measurement error analysis
Scientific research data

Social Sciences

Demographic data analysis
Survey research statistics
Psychological test scoring
Sociological research data
Political polling analysis

Sports Analytics

Player performance statistics
Team performance analysis
Game strategy optimization
Talent scouting metrics
Injury prevention analysis

Solved Examples

Step-by-step solutions to common descriptive statistics problems:

Example 1: Basic Statistics

Dataset: [12, 15, 18, 22, 25, 28, 32]

1. Mean = (12+15+18+22+25+28+32)/7 = 152/7 ≈ 21.71

2. Median = middle value = 22

3. Range = 32 - 12 = 20

4. Variance = Σ(x - mean)² / n ≈ 44.24

5. Std Dev = √44.24 ≈ 6.65

Mean: 21.71, Median: 22, Std Dev: 6.65

The data is slightly left-skewed (mean < median). Standard deviation shows moderate spread.

Example 2: With Outliers

Dataset: [10, 12, 14, 16, 18, 20, 100]

1. Mean = (10+12+14+16+18+20+100)/7 ≈ 27.14

2. Median = middle value = 16

3. Notice: Mean (27.14) > Median (16)

4. This indicates right skew due to outlier (100)

5. Without outlier: Mean = 15, Median = 15

Mean: 27.14, Median: 16, Skewed: Yes

The outlier (100) dramatically affects the mean but not the median, demonstrating median's robustness.

Example 3: Quartiles & IQR

Dataset: [5, 8, 12, 15, 18, 22, 25, 30, 35, 40]

1. Q1 (25th percentile) = between 12 and 15 = 13.5

2. Median (Q2) = (18+22)/2 = 20

3. Q3 (75th percentile) = between 25 and 30 = 27.5

4. IQR = Q3 - Q1 = 27.5 - 13.5 = 14

5. Outlier boundaries: Q1-1.5×IQR and Q3+1.5×IQR

Q1: 13.5, Q2: 20, Q3: 27.5, IQR: 14

IQR shows middle 50% of data spans 14 units. No values outside outlier boundaries.

Example 4: Sample vs Population

Sample: [22, 25, 28, 30, 32] from larger population

1. Sample mean = (22+25+28+30+32)/5 = 27.4

2. Sample variance = Σ(x - mean)² / (n-1)

3. Calculations: (22-27.4)² + ... + (32-27.4)² = 88.8

4. Sample variance = 88.8 / 4 = 22.2

5. Sample std dev = √22.2 ≈ 4.71

Sample: Variance 22.2, Std Dev 4.71

Using n-1 (4) in denominator corrects bias when estimating population variance from sample.

Example 5: Mode & Frequency

Dataset: [5, 7, 7, 8, 8, 8, 9, 9, 10]

1. Frequency count: 5(1), 7(2), 8(3), 9(2), 10(1)

2. Mode = 8 (appears 3 times, most frequent)

3. Mean = (5+7+7+8+8+8+9+9+10)/9 ≈ 7.89

4. Median = middle value (5th) = 8

5. Mean ≈ Median ≈ Mode ≈ 8

Mode: 8, Mean: 7.89, Median: 8

The data is approximately symmetric with mean, median, and mode close together.

Example 6: Comprehensive Analysis

Test scores: [65, 70, 75, 80, 85, 90, 95, 100]

1. Mean = 82.5, Median = 82.5

2. Range = 35, Variance = 125, Std Dev = 11.18

3. Q1 = 72.5, Q3 = 92.5, IQR = 20

4. Skewness = 0 (symmetric)

5. 68% within 71.32-93.68 (mean ± 1 std dev)

Complete analysis with all measures

Perfectly symmetric distribution with mean = median. Standard deviation shows moderate spread.

Practice Problems

Test your understanding with these practice problems:

Problem 1: Calculate mean, median, and mode for: [15, 18, 22, 22, 25, 28, 30]

Solution:

Mean = (15+18+22+22+25+28+30)/7 = 160/7 ≈ 22.86

Median = middle value (4th) = 22

Mode = 22 (appears twice, most frequent)

Mean > Median = Mode, indicating slight right skew.

Problem 2: Find range, variance, and standard deviation for sample: [10, 15, 20, 25, 30]

Solution:

Range = 30 - 10 = 20

Mean = (10+15+20+25+30)/5 = 100/5 = 20

Variance = [(10-20)²+(15-20)²+(20-20)²+(25-20)²+(30-20)²] / (5-1)

= (100+25+0+25+100)/4 = 250/4 = 62.5

Standard Deviation = √62.5 ≈ 7.91

Problem 3: Calculate Q1, Q3, and IQR for: [12, 15, 18, 20, 22, 25, 28, 30, 35, 40]

Solution:

10 values: Positions for quartiles at 2.75 and 7.25

Q1 = 15 + 0.75×(18-15) = 15 + 2.25 = 17.25

Q3 = 28 + 0.25×(30-28) = 28 + 0.5 = 28.5

IQR = Q3 - Q1 = 28.5 - 17.25 = 11.25

Outlier boundaries: 17.25 - 1.5×11.25 = 0.375 and 28.5 + 1.5×11.25 = 45.375

No outliers in this dataset.

Problem 4: A dataset has mean = 50, median = 48, mode = 45. What can you say about its skewness?

Solution:

Mean (50) > Median (48) > Mode (45)

This is the pattern for right-skewed (positively skewed) distributions.

The tail on the right side is longer, pulling the mean above the median.

Skewness coefficient would be positive.

Problem 5: Dataset has variance = 64, mean = 100. What percentage of data falls between 84 and 116 assuming normal distribution?

Solution:

Standard Deviation = √64 = 8

84 is (100-84)/8 = 16/8 = 2 standard deviations below mean

116 is (116-100)/8 = 16/8 = 2 standard deviations above mean

Using Empirical Rule: 95% of data falls within ±2 standard deviations

Therefore, approximately 95% of data falls between 84 and 116.

How to Calculate Descriptive Statistics Step-by-Step

Follow this systematic approach to perform comprehensive descriptive statistics analysis:

1

Organize Your Data

Collect and organize your dataset. Sort values in ascending order for easier calculation of median and quartiles.

Original: [15, 8, 22, 17, 10]
Sorted: [8, 10, 15, 17, 22]

2

Calculate Central Tendency

Compute mean (average), median (middle value), and mode (most frequent value).

Mean = Σx / n
Median = middle value
Mode = most frequent

3

Calculate Measures of Spread

Compute range, variance, and standard deviation to understand data dispersion.

Range = Max - Min
Variance = Σ(x - mean)² / (n or n-1)
Std Dev = √Variance

4

Determine Quartiles

Calculate Q1 (25th percentile), Q2 (median, 50th), Q3 (75th percentile), and IQR.

Q1 = value at 25% position
Q3 = value at 75% position
IQR = Q3 - Q1

5

Check for Outliers

Identify outliers using IQR method: values below Q1-1.5×IQR or above Q3+1.5×IQR.

Lower bound = Q1 - 1.5×IQR
Upper bound = Q3 + 1.5×IQR

6

Interpret Results

Analyze what the statistics tell you about your data's center, spread, and distribution shape.

Compare mean vs median for skewness
Use std dev for spread interpretation
Check IQR for middle 50% spread

Pro Tips for Descriptive Statistics

Always visualize: Create histograms or box plots to complement numerical statistics
Check assumptions: Different statistics assume different data characteristics
Consider outliers: Decide whether to include, exclude, or transform outliers based on context
Use appropriate measures: Mean for symmetric data, median for skewed data
Report with context: Always report sample size and whether statistics are for sample or population
Check distribution: Assess normality for parametric tests and confidence intervals

Descriptive Statistics FAQs (Mean, Median, Standard Deviation & More)

Common questions about descriptive statistics, data analysis, and statistical measures.

What are descriptive statistics?

Descriptive statistics summarize and describe data using measures like mean, median, mode, standard deviation, variance, and range.

What's the difference between descriptive and inferential statistics?

Descriptive statistics summarize data, while inferential statistics use sample data to make predictions or draw conclusions about a population.

When should I use mean vs median?

Use mean for symmetric data without outliers and median for skewed data or when outliers are present.

What is the difference between mean, median, and mode?

Mean is the average, median is the middle value, and mode is the most frequent value in a dataset.

What does standard deviation tell us?

It measures how spread out data values are from the mean. A higher value indicates more variability.

What is variance in statistics?

Variance is the average of squared differences from the mean and shows how data is distributed.

What is range in statistics?

Range is the difference between the highest and lowest values in a dataset.

What are quartiles?

Quartiles divide data into four equal parts, helping identify distribution and outliers.

How do I know if my data is normally distributed?

If mean, median, and mode are similar and data follows the 68-95-99.7 rule, it is likely normally distributed.

What are outliers in data?

Outliers are values that differ significantly from other data points and can affect statistical results.

What are the limitations of descriptive statistics?

They summarize data but cannot make predictions or determine cause-and-effect relationships.

When should I use sample vs population statistics?

Use population statistics when analyzing all data and sample statistics when working with a subset.

How many decimal places should I report?

Report one more decimal place than the original data for accuracy and consistency.

Why are descriptive statistics important?

They help understand data, identify patterns, and prepare for deeper statistical analysis.

Related Statistical Calculators

Explore our collection of statistics and hypothesis testing tools:

Related Statistics Learning Guides

Explore essential statistics concepts with clear explanations, real-world applications, and step-by-step analytical methods.

Descriptive Statistics Calculator – Analyze Data with Mean, Median, Mode & Standard Deviation

Data Input

Statistical Results

Central Tendency Visualization

Data Spread Visualization

Distribution Analysis

Dataset Information

Histogram

Recent Calculations

Table of Contents

What is Descriptive Statistics?

Population vs Sample

Data Types

Statistical Significance

Measures of Central Tendency

Mean (Average)

Median (Middle Value)

Mode (Most Frequent)

When to Use Each

Measures of Dispersion

Range

Variance

Standard Deviation

Interquartile Range (IQR)

Distribution Measures

Quartiles

Skewness

Kurtosis

Percentiles

Real-World Applications

Business & Finance

Healthcare & Medicine

Education & Research

Science & Engineering

Social Sciences

Sports Analytics

Solved Examples

Practice Problems

How to Calculate Descriptive Statistics Step-by-Step

Organize Your Data

Calculate Central Tendency

Calculate Measures of Spread

Determine Quartiles

Check for Outliers

Interpret Results

Pro Tips for Descriptive Statistics

Descriptive Statistics FAQs (Mean, Median, Standard Deviation & More)

Related Statistical Calculators

T-Test Calculator

Chi-Square Calculator

Correlation Calculator

Regression Calculator

Power Analysis Calculator