Descriptive Statistics Calculator – Analyze Data with Mean, Median, Mode & Standard Deviation

Analyze any dataset with comprehensive descriptive statistics and detailed solutions.

Data Input

Enter your dataset and select calculation options

⌨️ Manual Entry
📁 Upload File
Comma Separated
Space Separated
New Line
Data Preview:

Population: Use N in denominator. Sample: Use N-1 in denominator for variance.

Statistical Results

TXT
CSV
JSON
Print
Summary
Central Tendency
Dispersion
Distribution
Details
Statistic Value Description
-
Mean
Average value
-
Median
Middle value
-
Mode
Most frequent
-
Midrange
(Min + Max)/2

Central Tendency Visualization

-
Range
Max - Min
-
Variance
σ²
-
Std Dev
√Variance
-
IQR
Q3 - Q1

Data Spread Visualization

-
Q1 (25%)
First Quartile
-
Q2 (50%)
Median
-
Q3 (75%)
Third Quartile
-
Skewness
Distribution shape

Distribution Analysis

Dataset Information

Histogram

Recent Calculations

What is Descriptive Statistics?

Descriptive statistics are numerical and graphical methods used to summarize and describe the main features of a dataset. They provide simple summaries about the sample and the measures, forming the basis of quantitative data analysis.

Key Components:

  • Measures of Central Tendency: Mean, median, mode - describe the center of the data
  • Measures of Dispersion: Range, variance, standard deviation - describe the spread of data
  • Measures of Distribution: Skewness, kurtosis, quartiles - describe the shape of data distribution
  • Graphical Representations: Histograms, box plots, scatter plots - visualize data patterns

Population vs Sample

Population statistics describe entire groups, while sample statistics describe subsets. Different formulas are used for variance and standard deviation.

Population: σ² = Σ(x - μ)² / N
Sample: s² = Σ(x - x̄)² / (n-1)

Data Types

Different statistical measures are appropriate for different types of data: nominal, ordinal, interval, and ratio scales.

Nominal: Mode only
Ordinal: Median, mode
Interval/Ratio: All measures

Statistical Significance

Descriptive statistics form the foundation for inferential statistics, hypothesis testing, and making predictions about populations.

Mean → Confidence intervals
Variance → Hypothesis tests
Distribution → Model selection

Measures of Central Tendency

These measures describe the center or typical value of a dataset.

Mean (Average)

The arithmetic average of all values. Most common measure but sensitive to outliers.

x̄ = Σx / n
Data: [2, 4, 6, 8, 10]
Mean = (2+4+6+8+10)/5 = 30/5 = 6

Median (Middle Value)

The middle value when data is sorted. Robust to outliers and skewed distributions.

Median = middle value of sorted data
Data: [3, 1, 7, 5, 9]
Sorted: [1, 3, 5, 7, 9]
Median = 5

Mode (Most Frequent)

The value that appears most frequently. Can have multiple modes or no mode.

Mode = most frequent value(s)
Data: [2, 3, 3, 4, 5, 3, 6]
Mode = 3 (appears 3 times)

When to Use Each

Mean for symmetric data, median for skewed data or outliers, mode for categorical data or identifying peaks.

Symmetric: Use mean
Skewed: Use median
Categorical: Use mode

Relationship between mean, median, and mode: In a perfectly normal distribution, mean = median = mode. In right-skewed distributions, mean > median > mode. In left-skewed distributions, mean < median < mode.

Measures of Dispersion

These measures describe how spread out or variable the data is.

Range

The difference between maximum and minimum values. Simple but sensitive to outliers.

Range = Max - Min
Data: [10, 20, 30, 40, 50]
Range = 50 - 10 = 40

Variance

Average of squared deviations from the mean. Measures average squared distance from mean.

σ² = Σ(x - μ)² / N (population)
s² = Σ(x - x̄)² / (n-1) (sample)

Standard Deviation

Square root of variance. In same units as data. Most common measure of spread.

σ = √σ² (population)
s = √s² (sample)
Variance = 25
Std Dev = √25 = 5

Interquartile Range (IQR)

Range of middle 50% of data. Robust to outliers. Q3 - Q1.

IQR = Q3 - Q1
Q1 = 25, Q3 = 75
IQR = 75 - 25 = 50
Empirical Rule (Normal Distribution):
• 68% of data within ±1 standard deviation of mean
• 95% of data within ±2 standard deviations of mean
• 99.7% of data within ±3 standard deviations of mean

Distribution Measures

These measures describe the shape and characteristics of data distribution.

Quartiles

Divide data into four equal parts. Q1 (25th percentile), Q2 (median, 50th), Q3 (75th).

Data divided into:
25% below Q1
25% between Q1-Q2
25% between Q2-Q3
25% above Q3

Skewness

Measures asymmetry of distribution. Positive = right skew, negative = left skew, zero = symmetric.

Skewness = 3(Mean - Median)/Std Dev
Right skew: Mean > Median
Left skew: Mean < Median
Symmetric: Mean = Median

Kurtosis

Measures "tailedness" or peakiness. High kurtosis = heavy tails, low kurtosis = light tails.

Leptokurtic: High peak, heavy tails
Mesokurtic: Normal distribution
Platykurtic: Low peak, light tails

Percentiles

Values below which a given percentage of observations fall. P50 = median, P25 = Q1, P75 = Q3.

Pₖ = value at k% position
90th percentile means
90% of values are below
10% of values are above

Box Plot Elements: Visual representation showing minimum, Q1, median, Q3, maximum, and potential outliers. The box shows IQR, whiskers show range, and points show outliers beyond 1.5×IQR.

Real-World Applications

Descriptive statistics are used across numerous fields for data analysis and decision-making:

Business & Finance

  • Sales analysis and forecasting
  • Financial risk assessment
  • Market research analysis
  • Quality control metrics
  • Customer behavior analysis

Healthcare & Medicine

  • Clinical trial results analysis
  • Patient vital statistics
  • Epidemiological studies
  • Treatment effectiveness
  • Medical research data

Education & Research

  • Test score analysis
  • Academic performance tracking
  • Survey data summarization
  • Research data analysis
  • Educational assessment

Science & Engineering

  • Experimental data analysis
  • Quality assurance testing
  • Process control monitoring
  • Measurement error analysis
  • Scientific research data

Social Sciences

  • Demographic data analysis
  • Survey research statistics
  • Psychological test scoring
  • Sociological research data
  • Political polling analysis

Sports Analytics

  • Player performance statistics
  • Team performance analysis
  • Game strategy optimization
  • Talent scouting metrics
  • Injury prevention analysis

Solved Examples

Step-by-step solutions to common descriptive statistics problems:

Example 1: Basic Statistics
Dataset: [12, 15, 18, 22, 25, 28, 32]
1. Mean = (12+15+18+22+25+28+32)/7 = 152/7 ≈ 21.71
2. Median = middle value = 22
3. Range = 32 - 12 = 20
4. Variance = Σ(x - mean)² / n ≈ 44.24
5. Std Dev = √44.24 ≈ 6.65
Mean: 21.71, Median: 22, Std Dev: 6.65
The data is slightly left-skewed (mean < median). Standard deviation shows moderate spread.
Example 2: With Outliers
Dataset: [10, 12, 14, 16, 18, 20, 100]
1. Mean = (10+12+14+16+18+20+100)/7 ≈ 27.14
2. Median = middle value = 16
3. Notice: Mean (27.14) > Median (16)
4. This indicates right skew due to outlier (100)
5. Without outlier: Mean = 15, Median = 15
Mean: 27.14, Median: 16, Skewed: Yes
The outlier (100) dramatically affects the mean but not the median, demonstrating median's robustness.
Example 3: Quartiles & IQR
Dataset: [5, 8, 12, 15, 18, 22, 25, 30, 35, 40]
1. Q1 (25th percentile) = between 12 and 15 = 13.5
2. Median (Q2) = (18+22)/2 = 20
3. Q3 (75th percentile) = between 25 and 30 = 27.5
4. IQR = Q3 - Q1 = 27.5 - 13.5 = 14
5. Outlier boundaries: Q1-1.5×IQR and Q3+1.5×IQR
Q1: 13.5, Q2: 20, Q3: 27.5, IQR: 14
IQR shows middle 50% of data spans 14 units. No values outside outlier boundaries.
Example 4: Sample vs Population
Sample: [22, 25, 28, 30, 32] from larger population
1. Sample mean = (22+25+28+30+32)/5 = 27.4
2. Sample variance = Σ(x - mean)² / (n-1)
3. Calculations: (22-27.4)² + ... + (32-27.4)² = 88.8
4. Sample variance = 88.8 / 4 = 22.2
5. Sample std dev = √22.2 ≈ 4.71
Sample: Variance 22.2, Std Dev 4.71
Using n-1 (4) in denominator corrects bias when estimating population variance from sample.
Example 5: Mode & Frequency
Dataset: [5, 7, 7, 8, 8, 8, 9, 9, 10]
1. Frequency count: 5(1), 7(2), 8(3), 9(2), 10(1)
2. Mode = 8 (appears 3 times, most frequent)
3. Mean = (5+7+7+8+8+8+9+9+10)/9 ≈ 7.89
4. Median = middle value (5th) = 8
5. Mean ≈ Median ≈ Mode ≈ 8
Mode: 8, Mean: 7.89, Median: 8
The data is approximately symmetric with mean, median, and mode close together.
Example 6: Comprehensive Analysis
Test scores: [65, 70, 75, 80, 85, 90, 95, 100]
1. Mean = 82.5, Median = 82.5
2. Range = 35, Variance = 125, Std Dev = 11.18
3. Q1 = 72.5, Q3 = 92.5, IQR = 20
4. Skewness = 0 (symmetric)
5. 68% within 71.32-93.68 (mean ± 1 std dev)
Complete analysis with all measures
Perfectly symmetric distribution with mean = median. Standard deviation shows moderate spread.

Practice Problems

Test your understanding with these practice problems:

Problem 1: Calculate mean, median, and mode for: [15, 18, 22, 22, 25, 28, 30]

Solution:

Mean = (15+18+22+22+25+28+30)/7 = 160/7 ≈ 22.86

Median = middle value (4th) = 22

Mode = 22 (appears twice, most frequent)

Mean > Median = Mode, indicating slight right skew.

Problem 2: Find range, variance, and standard deviation for sample: [10, 15, 20, 25, 30]

Solution:

Range = 30 - 10 = 20

Mean = (10+15+20+25+30)/5 = 100/5 = 20

Variance = [(10-20)²+(15-20)²+(20-20)²+(25-20)²+(30-20)²] / (5-1)

= (100+25+0+25+100)/4 = 250/4 = 62.5

Standard Deviation = √62.5 ≈ 7.91

Problem 3: Calculate Q1, Q3, and IQR for: [12, 15, 18, 20, 22, 25, 28, 30, 35, 40]

Solution:

10 values: Positions for quartiles at 2.75 and 7.25

Q1 = 15 + 0.75×(18-15) = 15 + 2.25 = 17.25

Q3 = 28 + 0.25×(30-28) = 28 + 0.5 = 28.5

IQR = Q3 - Q1 = 28.5 - 17.25 = 11.25

Outlier boundaries: 17.25 - 1.5×11.25 = 0.375 and 28.5 + 1.5×11.25 = 45.375

No outliers in this dataset.

Problem 4: A dataset has mean = 50, median = 48, mode = 45. What can you say about its skewness?

Solution:

Mean (50) > Median (48) > Mode (45)

This is the pattern for right-skewed (positively skewed) distributions.

The tail on the right side is longer, pulling the mean above the median.

Skewness coefficient would be positive.

Problem 5: Dataset has variance = 64, mean = 100. What percentage of data falls between 84 and 116 assuming normal distribution?

Solution:

Standard Deviation = √64 = 8

84 is (100-84)/8 = 16/8 = 2 standard deviations below mean

116 is (116-100)/8 = 16/8 = 2 standard deviations above mean

Using Empirical Rule: 95% of data falls within ±2 standard deviations

Therefore, approximately 95% of data falls between 84 and 116.

How to Calculate Descriptive Statistics Step-by-Step

Follow this systematic approach to perform comprehensive descriptive statistics analysis:

1

Organize Your Data

Collect and organize your dataset. Sort values in ascending order for easier calculation of median and quartiles.

Original: [15, 8, 22, 17, 10]
Sorted: [8, 10, 15, 17, 22]
2

Calculate Central Tendency

Compute mean (average), median (middle value), and mode (most frequent value).

Mean = Σx / n
Median = middle value
Mode = most frequent
3

Calculate Measures of Spread

Compute range, variance, and standard deviation to understand data dispersion.

Range = Max - Min
Variance = Σ(x - mean)² / (n or n-1)
Std Dev = √Variance
4

Determine Quartiles

Calculate Q1 (25th percentile), Q2 (median, 50th), Q3 (75th percentile), and IQR.

Q1 = value at 25% position
Q3 = value at 75% position
IQR = Q3 - Q1
5

Check for Outliers

Identify outliers using IQR method: values below Q1-1.5×IQR or above Q3+1.5×IQR.

Lower bound = Q1 - 1.5×IQR
Upper bound = Q3 + 1.5×IQR
6

Interpret Results

Analyze what the statistics tell you about your data's center, spread, and distribution shape.

Compare mean vs median for skewness
Use std dev for spread interpretation
Check IQR for middle 50% spread

Pro Tips for Descriptive Statistics

  • Always visualize: Create histograms or box plots to complement numerical statistics
  • Check assumptions: Different statistics assume different data characteristics
  • Consider outliers: Decide whether to include, exclude, or transform outliers based on context
  • Use appropriate measures: Mean for symmetric data, median for skewed data
  • Report with context: Always report sample size and whether statistics are for sample or population
  • Check distribution: Assess normality for parametric tests and confidence intervals

Descriptive Statistics FAQs (Mean, Median, Standard Deviation & More)

Common questions about descriptive statistics, data analysis, and statistical measures.

What are descriptive statistics?
Descriptive statistics summarize and describe data using measures like mean, median, mode, standard deviation, variance, and range.
What's the difference between descriptive and inferential statistics?
Descriptive statistics summarize data, while inferential statistics use sample data to make predictions or draw conclusions about a population.
When should I use mean vs median?
Use mean for symmetric data without outliers and median for skewed data or when outliers are present.
What is the difference between mean, median, and mode?
Mean is the average, median is the middle value, and mode is the most frequent value in a dataset.
What does standard deviation tell us?
It measures how spread out data values are from the mean. A higher value indicates more variability.
What is variance in statistics?
Variance is the average of squared differences from the mean and shows how data is distributed.
What is range in statistics?
Range is the difference between the highest and lowest values in a dataset.
What are quartiles?
Quartiles divide data into four equal parts, helping identify distribution and outliers.
How do I know if my data is normally distributed?
If mean, median, and mode are similar and data follows the 68-95-99.7 rule, it is likely normally distributed.
What are outliers in data?
Outliers are values that differ significantly from other data points and can affect statistical results.
What are the limitations of descriptive statistics?
They summarize data but cannot make predictions or determine cause-and-effect relationships.
When should I use sample vs population statistics?
Use population statistics when analyzing all data and sample statistics when working with a subset.
How many decimal places should I report?
Report one more decimal place than the original data for accuracy and consistency.
Why are descriptive statistics important?
They help understand data, identify patterns, and prepare for deeper statistical analysis.