WordNet

not involving an estimation of the parameters of a statistic
a statistic computed without knowledge of the form or the parameters of the distribution from which observations are drawn (synonym: distribution-free statistic)
the branch of statistics dealing with variables without making assumptions about the form or the parameters of their distribution

Wikipedia preview

Source (authority): Wikipedia, the free encyclopedia, "2014/03/14 20:09:57" (JST)

wiki en

In statistics, the term non-parametric statistics refers to statistics that do not assume the data or population have any characteristic structure or parameters.

Definitions

In statistics, the term non-parametric statistics has at least two different meanings:

The first meaning of non-parametric covers techniques that do not rely on data belonging to any particular distribution. These include, among others:
- distribution-free methods, which do not rely on assumptions that the data are drawn from a given probability distribution. As such, they are the opposite of parametric statistics. They include non-parametric descriptive statistics, statistical models, inference and statistical tests.
- non-parametric statistics (in the sense of a statistic over data, which is defined to be a function on a sample that has no dependency on a parameter), whose interpretation does not depend on the population fitting any parameterised distributions. Order statistics, which are based on the ranks of observations, are one example of such statistics and these play a central role in many non-parametric approaches.
The following discussion is taken from Kendall's Advanced Theory of Statistics.[1]

Statistical hypotheses concern the behavior of observable random variables.... For example, the hypothesis (a) that a normal distribution has a specified mean and variance is statistical; so is the hypothesis (b) that it has a given mean but unspecified variance; so is the hypothesis (c) that a distribution is of normal form with both mean and variance unspecified; finally, so is the hypothesis (d) that two unspecified continuous distributions are identical.

It will have been noticed that in the examples (a) and (b) the distribution underlying the observations was taken to be of a certain form (the normal) and the hypothesis was concerned entirely with the value of one or both of its parameters. Such a hypothesis, for obvious reasons, is called parametric.

Hypothesis (c) was of a different nature, as no parameter values are specified in the statement of the hypothesis; we might reasonably call such a hypothesis non-parametric. Hypothesis (d) is also non-parametric but, in addition, it does not even specify the underlying form of the distribution and may now be reasonably termed distribution-free. Notwithstanding these distinctions, the statistical literature now commonly applies the label "non-parametric" to test procedures that we have just termed "distribution-free", thereby losing a useful classification.
The second meaning of non-parametric covers techniques that do not assume that the structure of a model is fixed. Typically, the model grows in size to accommodate the complexity of the data. In these techniques, individual variables are typically assumed to belong to parametric distributions, and assumptions about the types of connections among variables are also made. These techniques include, among others:
- non-parametric regression, which refers to modeling where the structure of the relationship between variables is treated non-parametrically, but where nevertheless there may be parametric assumptions about the distribution of model residuals.
- non-parametric hierarchical Bayesian models, such as models based on the Dirichlet process, which allow the number of latent variables to grow as necessary to fit the data, but where individual variables still follow parametric distributions and even the process controlling the rate of growth of latent variables follows a parametric distribution.

Applications and purpose

Non-parametric methods are widely used for studying populations that take on a ranked order (such as movie reviews receiving one to four stars). The use of non-parametric methods may be necessary when data have a ranking but no clear numerical interpretation, such as when assessing preferences. In terms of levels of measurement, non-parametric methods are suited to "ordinal" data.
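One way to see why rank-based methods suit ordinal data is that ranks depend only on the ordering of the observations, not on the numbers attached to them. The sketch below is illustrative only: the helper name `midranks` and the star ratings are made up for this example, and ties are given the average of the positions they occupy, which is a common convention rather than the only one.

```python
def midranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        average_rank = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = average_rank
        start = end + 1
    return ranks


# Hypothetical star ratings (1 to 4) for six films.
stars = [3, 1, 4, 1, 2, 4]
# A strictly increasing relabelling of the star scale (assumed for illustration).
relabelled = [10 ** s for s in stars]

ranks_original = midranks(stars)
ranks_relabelled = midranks(relabelled)
```

Both calls return the same ranks, so any statistic computed from ranks alone gives the same answer whichever numerical labels are attached to the ordered categories.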

As non-parametric methods make fewer assumptions, their applicability is much wider than the corresponding parametric methods. In particular, they may be applied in situations where less is known about the application in question. Also, due to the reliance on fewer assumptions, non-parametric methods are more robust.

Another justification for the use of non-parametric methods is simplicity. In certain cases, even when the use of parametric methods is justified, non-parametric methods may be easier to use. Due both to this simplicity and to their greater robustness, non-parametric methods are seen by some statisticians as leaving less room for improper use and misunderstanding.

The wider applicability and increased robustness of non-parametric tests come at a cost: in cases where a parametric test would be appropriate, non-parametric tests have less power. In other words, a larger sample size can be required to draw conclusions with the same degree of confidence.
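The size of this cost can be roughed out with asymptotic relative efficiency (ARE). The values used below are standard textbook figures for normally distributed data, not numbers taken from the text above: about 3/π for the Wilcoxon tests and 2/π for the sign test, each relative to the t-test. The function name is hypothetical, and the result is a large-sample approximation only.

```python
import math

# ARE relative to the t-test when the data really are normal (textbook values,
# assumed here for illustration; they differ for other distributions).
ARE_VS_T_UNDER_NORMALITY = {
    "wilcoxon": 3 / math.pi,  # about 0.955
    "sign": 2 / math.pi,      # about 0.637
}


def equivalent_sample_size(n_t_test, test_name):
    """Approximate sample size a non-parametric test needs to match
    the power of a t-test run on n_t_test observations."""
    return math.ceil(n_t_test / ARE_VS_T_UNDER_NORMALITY[test_name])


n_wilcoxon = equivalent_sample_size(100, "wilcoxon")
n_sign = equivalent_sample_size(100, "sign")
```

Under these assumptions, a Wilcoxon test needs roughly 105 observations and a sign test roughly 158 to match a t-test on 100, so the penalty ranges from slight to substantial depending on the test chosen.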

Non-parametric models

Non-parametric models differ from parametric models in that the model structure is not specified a priori but is instead determined from data. The term non-parametric is not meant to imply that such models completely lack parameters but that the number and nature of the parameters are flexible and not fixed in advance.

- A histogram is a simple non-parametric estimate of a probability distribution.
- Kernel density estimation typically provides smoother estimates of the density than histograms.
- Non-parametric regression and semiparametric regression methods have been developed based on kernels, splines, and wavelets.
- Data envelopment analysis provides efficiency coefficients similar to those obtained by multivariate analysis without any distributional assumption.
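The first two entries can be sketched in a few lines. The code below is a minimal illustration, not a reference implementation: the function names, the Gaussian kernel, the bin origin, the bin width, the bandwidth and the sample values are all assumptions chosen for the example.

```python
import math


def histogram_density(data, x, origin, bin_width):
    """Histogram estimate at x: share of points in x's bin, divided by bin width."""
    bin_index = math.floor((x - origin) / bin_width)
    count = sum(1 for d in data if math.floor((d - origin) / bin_width) == bin_index)
    return count / (len(data) * bin_width)


def gaussian_kde(data, x, bandwidth):
    """Kernel estimate at x: average of Gaussian bumps centred on each data point."""
    norm = 1.0 / (len(data) * bandwidth * math.sqrt(2 * math.pi))
    return norm * sum(math.exp(-0.5 * ((x - d) / bandwidth) ** 2) for d in data)


sample = [1.1, 1.9, 2.3, 2.8, 3.0, 3.4, 4.7]  # made-up observations
hist_at_2_5 = histogram_density(sample, 2.5, origin=0.0, bin_width=1.0)
kde_at_2_5 = gaussian_kde(sample, 2.5, bandwidth=0.5)
```

Neither estimator fixes a distributional form in advance. The kernel estimate carries one bump per observation, so its effective number of parameters grows with the sample, which is the sense of "non-parametric" described above.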

Methods

Non-parametric (or distribution-free) inferential statistical methods are mathematical procedures for statistical hypothesis testing which, unlike parametric statistics, make no assumptions about the probability distributions of the variables being assessed. Frequently used methods include:

- Anderson–Darling test: tests whether a sample is drawn from a given distribution
- Statistical bootstrap methods: estimate the accuracy or sampling distribution of a statistic
- Cochran's Q: tests whether k treatments in randomized block designs with 0/1 outcomes have identical effects
- Cohen's kappa: measures inter-rater agreement for categorical items
- Friedman two-way analysis of variance by ranks: tests whether k treatments in randomized block designs have identical effects
- Kaplan–Meier: estimates the survival function from lifetime data, modeling censoring
- Kendall's tau: measures statistical dependence between two variables
- Kendall's W: a measure between 0 and 1 of inter-rater agreement
- Kolmogorov–Smirnov test: tests whether a sample is drawn from a given distribution, or whether two samples are drawn from the same distribution
- Kruskal–Wallis one-way analysis of variance by ranks: tests whether more than two independent samples are drawn from the same distribution
- Kuiper's test: tests whether a sample is drawn from a given distribution, sensitive to cyclic variations such as day of the week
- Logrank test: compares survival distributions of two right-skewed, censored samples
- Mann–Whitney U or Wilcoxon rank-sum test: tests whether two samples are drawn from the same distribution, as compared to a given alternative hypothesis. Strictly speaking, this is a "semi" non-parametric test, since it assumes that the two samples have the same scale parameter value.
- McNemar's test: tests whether, in 2 × 2 contingency tables with a dichotomous trait and matched pairs of subjects, row and column marginal frequencies are equal
- Median test: tests whether two samples are drawn from distributions with equal medians
- Pitman's permutation test: a statistical significance test that yields exact p-values by examining all possible rearrangements of labels
- Rank products: detects differentially expressed genes in replicated microarray experiments
- Siegel–Tukey test: tests for differences in scale between two groups
- Sign test: tests whether matched pair samples are drawn from distributions with equal medians
- Spearman's rank correlation coefficient: measures statistical dependence between two variables using a monotonic function
- Squared ranks test: tests equality of variances in two or more samples
- Wald–Wolfowitz runs test: tests whether the elements of a sequence are mutually independent (random)
- Wilcoxon signed-rank test: tests whether matched pair samples are drawn from populations with different mean ranks
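Two of the methods listed above are simple enough to sketch directly. The code below is a hedged illustration rather than a definitive implementation: the function names and data are invented, the sign test drops zero differences and doubles the smaller binomial tail, and the permutation test uses the absolute difference of group means as its statistic. Other conventions exist for each of these choices.

```python
from itertools import combinations
from math import comb


def sign_test_p_value(pairs):
    """Two-sided exact sign test for matched pairs; zero differences are dropped."""
    diffs = [a - b for a, b in pairs if a != b]
    n = len(diffs)
    positives = sum(1 for d in diffs if d > 0)
    k = min(positives, n - positives)
    # Under the null hypothesis the number of positive signs is Binomial(n, 1/2).
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def permutation_test_p_value(x, y):
    """Two-sided exact permutation test on the absolute difference of group means."""
    pooled = list(x) + list(y)
    n_x, n_y = len(x), len(y)
    total = sum(pooled)
    observed = abs(sum(x) / n_x - sum(y) / n_y)
    extreme = 0
    count = 0
    # Enumerate every way of relabelling n_x of the pooled values as group x.
    for idx in combinations(range(len(pooled)), n_x):
        sum_x = sum(pooled[i] for i in idx)
        diff = abs(sum_x / n_x - (total - sum_x) / n_y)
        if diff >= observed - 1e-12:  # small tolerance for float rounding
            extreme += 1
        count += 1
    return extreme / count


# Made-up matched pairs (before, after) and two made-up independent groups.
pairs = [(5, 3), (6, 4), (7, 7), (8, 5), (4, 6), (9, 6), (7, 5), (6, 5)]
p_sign = sign_test_p_value(pairs)
p_perm = permutation_test_p_value([1, 2, 3], [4, 5, 6])
```

For these inputs the sign test gives p = 0.125 (six positive signs out of seven non-zero differences) and the permutation test gives p = 0.1 (2 of the 20 possible relabellings are at least as extreme as the observed one). Full enumeration is only practical for small samples. For larger ones a random subset of relabellings is commonly used instead.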

Notes

[1] Stuart, A., Ord, J.K., Arnold, S. (1999), Kendall's Advanced Theory of Statistics, Volume 2A: Classical Inference and the Linear Model, sixth edition, §20.2–20.3 (Arnold).

English Journal

Reference intervals for serum creatinine with enzymatic assay and evaluation of four equations to estimate glomerular filtration rate in a healthy Chinese adult population.

Wang X, Xu G, Li H, Liu Y, Wang F. Source: Department of Clinical Laboratory, Peking University First Hospital, Beijing, China.
Clinica chimica acta; international journal of clinical chemistry. Clin Chim Acta. 2011 Sep 18;412(19-20):1793-7. Epub 2011 Jun 7.
BACKGROUND: With the wide usage of enzymatic assays to determine serum creatinine (Scr) in China, reference interval (RI) needs to be established. At the same time, the performance of Scr based equations to calculate estimated glomerular filtration rate (eGFR) in healthy Chinese adults has not been
PMID 21672534

Dimension reduced kernel estimation for distribution function with incomplete data.

Hu Z, Follmann DA, Qin J. Source: Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892-7609, USA.
Journal of statistical planning and inference. J Stat Plan Inference. 2011 Sep;141(9):3084-3093.
This work focuses on the estimation of distribution functions with incomplete data, where the variable of interest Y has ignorable missingness but the covariate X is always observed. When X is high dimensional, parametric approaches to incorporate X - information is encumbered by the risk of model m
PMID 21731174

Japanese Journal

Duration Analysis of Interest Rate Spells : Cross-National Study of Interest Rate Policy

Guo Yingwen,Zhou Z.F. Sherry
Hitotsubashi Journal of Economics 52(1), 1-11, 2011-06
… Both parametric and nonparametric methods are employed for the analysis. …
NAID 120003204224
