Statistics is the study of data collection, analysis, interpretation, and presentation to draw meaningful conclusions.

Etymology: Statistik (German) - description of a state/country; used to be called political arithmetic

Methods:

  • descriptive statistics (location, scale, shape, count, dependence)
  • inferential statistics

branches

criteria

sufficiency

A statistic is sufficient for the parameter if:

  • does not depend on .

  • OR if the density can be written as:

    where is a function of the data only, is a function of and . Both and are non-negative functions.

With a sufficient statistic, we no longer need the original data to make inferences about the parameter.

E.g. the sample mean is a sufficient statistic for the population mean .

Properties

  • Sufficient statistics provide data reduction.
  • Sufficient statistics are not unique.
  • Maximizing is equivalent to maximizing .
    • If the MLE of is unique, then it is a function of .
  • One-to-one functions of a sufficient statistic are also sufficient.

Minimal Sufficient

A statistic is minimal sufficient if any of the following is true:

  • The value of can be calculated from the likelihood function .
  • OR is a function of a sufficient statistic.
  • OR is a sufficient statistic that is also a function of the MLE.

completeness

A statistic is complete if there is no non-trivial (i.e., not always zero) function of the statistic that has an expected value of zero for all values of the parameter. Completeness ensures that the statistic captures all the information about the parameter.

A statistic is complete if for every measurable function , the condition:

implies that almost surely.

If a statistic is both sufficient and complete, it’s typically the best (most informative) statistic you can use for estimation. MVUEs (Minimum Variance Unbiased Estimators) are derived from complete, sufficient statistics.