Descriptive statistics in one pass

Statistics Calculator

Paste a dataset and instantly compute core descriptive statistics: count, sum, mean, median, mode (including ties), min/max, range, variance, standard deviation, quartiles, and IQR. Everything is calculated locally in your browser for privacy and speed.

This Statistics Calculator is built for quick, accurate summaries when you have raw numbers and want immediate insight into typical value (mean/median/mode) and spread (variance, standard deviation, quartiles, IQR). You can separate values with commas, spaces, tabs, or new lines—ideal for copying from spreadsheets, lab notes, or logs.

Use it to sanity-check experiments, compare groups, or prepare a report without manual formulas. If you’re exploring more tools, you can browse All Calculators or jump to the Math Calculators hub for related math utilities.

For physics-style datasets (like repeated force measurements), you might also reference the Force Calculator while keeping your descriptive stats here consistent and reproducible.

Calculator

Variance & SD basis
Sample uses n − 1 (unbiased estimate). Population uses n.

Tip: For clean parsing, prefer plain numeric tokens like -3, 4.25, or .5. Thousands-style formatting like 1,234 is treated as separated tokens.

Results

How it works

The calculator first parses your tokens, filters out invalid entries (or blocks them if you prefer strict input), and sorts the valid values numerically. From there it computes standard descriptive statistics that summarize your dataset without changing it.

Key definitions: n is the number of valid values; Σxi is the sum of all values; is the mean; Q1, Q2, Q3 are quartiles using the Tukey median-of-halves method; and IQR is the interquartile range. If you want to explore more statistical tools later, the All Calculators page is a convenient index.

Mean (average): x̄ = (Σxi) / n — sensitive to outliers, so it moves when extreme values appear.
Median: the middle of sorted values (or the average of the two middle values for even n) — robust to outliers.
Mode: the most frequent value(s) — can be none (all unique), one (unimodal), or multiple (multimodal).
Range: max − min — a quick spread measure that can be dominated by one extreme observation.
Variance: Population: σ² = Σ(xi − μ)² / n; Sample: s² = Σ(xi − x̄)² / (n − 1).
Standard deviation: √(variance) — spread in the same units as your data, often easier to interpret than variance.
Quartiles & IQR: IQR = Q3 − Q1 — focuses on the “middle 50%” of the data and is resilient to outliers.

Use cases

Quality control: summarize batch measurements (e.g., thickness, length) and spot unusual spread with SD and IQR.
Experiment repeatability: compare repeated trials—median stability suggests consistency even when a few trials drift.
Survey scoring: quickly compute mean and median on rating scales to see typical sentiment and dispersion.
Fitness tracking: analyze weekly metrics (pace, heart rate) where outliers happen; use median and IQR for robust baselines.
Classroom grading: check overall performance and variability; CV% helps compare spread across different score ranges.
Sensor logs: summarize a time window, then track SD changes as an early indicator of drift or noise growth.

Examples

Example 1: Small repeated values Unimodal with moderate spread
Dataset: 12, 15, 15, 18, 21
Expected results (example): n = 5, mean ≈ 16.20, median = 15, mode = 15, range = 9, sample SD ≈ 3.42, Q1 = 13.5, Q3 = 19.5, IQR = 6.
Interpretation: The median and mode both land at 15, suggesting the “typical” value sits near the repeated measurements.
Example 2: Symmetric-ish data Mean and median close together
Dataset: 8 9 10 10 11 12
Expected results (example): n = 6, mean ≈ 10.00, median = 10, mode = 10, range = 4, sample SD ≈ 1.41, Q1 = 9, Q3 = 11, IQR = 2.
Interpretation: When mean and median align closely, the distribution is often balanced without extreme skew.
Example 3: Mixed signs and decimals Demonstrates negative/decimal support
Dataset: -2.5, -1, 0, 1.5, 2.0, 2.0
Expected results (example): n = 6, mean ≈ 0.33, median ≈ 0.75, mode = 2, range = 4.5, sample SD ≈ 2.00, Q1 = -1, Q3 = 2.0, IQR = 3.0.
Interpretation: A small positive mean here can happen even with negatives, because the larger positive values offset them.

Common Mistakes

  • Mixing separators like 1,234 expecting it to mean “one thousand two hundred thirty-four”; it will be parsed as two tokens: 1 and 234.
  • Switching sample vs population variance without realizing it changes the denominator (n − 1 vs n).
  • Interpreting range as “typical spread” when one extreme value can dominate it; check SD and IQR too.
  • Assuming a mode always exists; if all values occur once, there is no mode in the usual sense.
  • Rounding too aggressively; a small dataset can shift mean/SD noticeably when you hide decimals—adjust precision if needed.
  • Comparing SD across different scales without context; consider CV% when mean is non-zero and scale differs.

Quick Tips

  • Use median and IQR for outlier-resistant summaries; use mean and SD when data is roughly symmetric.
  • If you collected a sample to estimate a larger process, keep Treat as sample (default) for variance/SD.
  • For long datasets, scan the step-by-step section to confirm sorting and the method used for quartiles.
  • Use precision to control report-ready rounding, but keep enough decimals for comparisons between groups.
  • If your input includes notes like “n/a” or labels, enable Ignore non-numeric tokens to skip them safely.

FAQ

What’s the difference between mean and median, and when should I trust each?

The mean is the arithmetic average, so it uses every value and can move sharply when a few points are extreme. The median is the middle of the sorted list (or the average of the two middle values), so it resists outliers and better reflects a “typical” value when data is skewed. If your dataset has occasional spikes, heavy tails, or measurement glitches, the median is often more stable. If your data is roughly symmetric without major outliers, the mean is efficient and pairs naturally with standard deviation for spread.

Should I use sample or population standard deviation?

Use population standard deviation when your numbers represent the entire group you care about (for example, all items produced in a specific batch). Use sample standard deviation when your values are a subset used to estimate a broader process (for example, a small inspection sample from a continuous production line). Sample variance divides by n − 1 to reduce bias in the estimate. If you are unsure, sample is a safer default for inference and comparison, while population is appropriate for complete enumerations of a fixed group.

How does mode work if multiple values tie for most frequent?

Mode is defined as the value (or values) that occur most frequently. If two or more values share the same highest frequency, the dataset is multimodal. This calculator reports all tied modes as a list rather than forcing a single answer. If every value appears exactly once, there is no mode in the usual descriptive sense, so the calculator shows “No mode.” For measurement data, multimodality can hint at mixed populations (for example, two operating states), so it’s often worth checking the raw values or grouping by conditions.

Which quartile method is used here, and why does it matter?

This calculator uses the Tukey “median-of-halves” method. First it sorts the data and finds the median (Q2). For odd n, the median is excluded from both halves; for even n, the data splits evenly. Q1 is the median of the lower half, and Q3 is the median of the upper half. Quartiles can vary slightly across software because there are multiple accepted conventions, especially for small datasets. Stating the method makes your results reproducible and easier to compare across reports and tools.

Can I enter decimals and negative numbers, and how are they validated?

Yes—decimals and negative values are supported, including formats like -3, 4.25, and .5. The parser uses strict numeric validation to avoid accidental misreads, and it rejects NaN and Infinity. Tokens are split on commas, spaces, tabs, new lines, and semicolons, then trimmed. If “Ignore non-numeric tokens” is off, the calculator blocks the run and shows a short list of invalid examples so you can clean your input. If it’s on, invalid tokens are skipped and the stats use only valid numbers.

What’s a good approach to cleaning data before calculating statistics?

Start by deciding what counts as a valid measurement: remove units, labels, and placeholders like “n/a.” Check for impossible values (for example, negative lengths) and confirm whether those represent real cases or entry errors. If values were recorded with different rounding, keep enough precision to avoid artificial ties. For outlier handling, don’t delete extremes by default—compare mean/SD with median/IQR to understand their influence, then document any rule you apply (such as winsorizing or excluding values beyond a known physical limit).

How should I interpret standard deviation and IQR in plain language?

Standard deviation describes how far values typically deviate from the mean, in the same units as the data. When data is roughly bell-shaped, many observations fall within about one SD of the mean, and larger SD means more variability. IQR focuses on the middle half of the data: it is Q3 − Q1, so it ignores the lowest 25% and highest 25%. That makes IQR robust when outliers exist. If SD is high but IQR is modest, you likely have a few extremes; if both are high, the spread is broad throughout.

Is my data sent anywhere, and how does rounding affect results?

Your dataset is processed locally in your browser; the calculator does not transmit values to a server. The precision selector changes how numbers are displayed (rounded to your chosen decimals), while the underlying calculations keep higher internal precision to reduce rounding error. For reporting, choose a precision that matches your measurement resolution and audience. If you need to compare close groups, increase precision to avoid hiding small differences in mean or SD. Copy outputs are plain text so you can paste them directly into notes, emails, or documents.

Accuracy, Privacy & References

Accuracy & Method: All calculations run locally in your browser; no dataset values are sent to servers.

Rounding policy: Displayed results are rounded to the selected precision. Internally, computations use full JavaScript floating-point precision and apply rounding only for presentation.

Privacy-first: Paste sensitive data confidently—results stay on your device.

Last Updated: January 21, 2026

Sources & References:

  • NIST/SEMATECH e-Handbook of Statistical Methods (concept reference)
  • Standard deviation and variance definitions (concept reference)
  • Quartiles and IQR definitions (concept reference)
🚧
Coming Soon

This calculator is being built.

×