WordNet

raise to the second power
(geometry) a plane rectangle with four equal sides and four right angles; a four-sided regular polygon; "you can compute the area of a square if you know the length of its sides" (同)foursquare
the product of two equal terms; "nine is the second power of three"; "gravity is inversely proportional to the square of the distance" (同)second power
make square; "Square the circle"; "square the wood with a file" (同)square_up
something approximating the shape of a square
a hand tool consisting of two straight arms at right angles; used to construct or test right angles; "the carpenter who built this room must have lost his square"
any artifact having a shape similar to a plane geometric figure with four equal sides and four right angles; "a checkerboard has 64 squares"
someone who doesnt understand what is going on (同)lame
a formal and conservative person with old-fashioned views (同)square toes
rigidly conventional or old-fashioned (同)straight
without evasion or compromise; "a square contradiction"; "he is not being as straightforward as it appears" (同)straightforward, straight
be compatible with; "one idea squares with another"
cause to match, as of ideas or acts
having four equal sides and four right angles or forming a right angle; "a square peg in a round hole"; "a square corner"
leaving no balance; "my account with you is now all square"
pay someone and settle a debt; "I squared with him"
position so as to be square; "He squared his shoulders"
undergo a test; "She doesnt test well"
any standardized procedure for measuring sensitivity or memory or intelligence or aptitude or personality etc; "the test was standardized on a large sample of students" (同)mental test, mental testing, psychometric test
the act of undergoing testing; "he survived the great test of battle"; "candidates must compete in a trial of skill" (同)trial
the act of testing something; "in the experimental trials the amount of carbon was measured separately"; "he called each flip of the coin a new trial" (同)trial, run
a hard outer covering as of some amoebas and sea urchins
put to the test, as for its quality, or give experimental use to; "This approach has been tried with good results"; "Test this recipe" (同)prove, try, try out, examine, essay
achieve a certain score or rating on a test; "She tested high on the LSAT and was admitted to all the good law schools"
determine the presence or properties of (a substance)
show a certain characteristic when tested; "He tested positive for HIV"
an examination of the characteristics of something; "there are laboratories for commercial testing"; "it involved testing thousands of children for smallpox"
the act of subjecting to experimental test in order to determine how well something works; "they agreed to end the testing of atomic weapons"
tested and proved useful or correct; "a tested method" (同)tried, well-tried
tested and proved to be reliable (同)time-tested, tried, tried and true
the 22nd letter of the Greek alphabet (同)khi

PrepTutorEJDIC

『正方形』;四角な物;(チェス・チェッカーなどの盤の,正方形の)目,ます目 / (四角い)『広場』(街路の交差点にあって,しばしば中央に植木や芝などが植えてあり,小公園になっている);《おもに英》広場の回りの建物(街路)(《略》Sq.) / (四方を街路で囲まれた方形の)一区画,ブロック / 直角定規,かね尺 / (数の)『2乗』,平方(《略》sq.) / 《俗》旧式な人 / 『正方形の』,四角な,直角の,直角をなす / 角ばった,がっかりした / 『平方の』,2乗の(《略》『sq.』) / 《補語にのみ用いて》対等の,五分五分の(even);貸し借りにない / 正直な(honest),公正な(fair),正しい(just) / 率直な,はっきりした,きっぱりした(direct) / 実質のある,十分な / 《俗》しゃちほこばった / =squarely / …‘を'正方形(四角)にする;直角にする / …‘を'正方形(四角)に区切る《+『off』+『名』,+『名』+『off』》 / 〈肩・ひじなど〉‘を'張る / (人と)〈勘定〉‘を'決済する,清算する《+『名』〈勘定〉+『with』+『名』〈人〉》 / (…と)…‘を'一致させる,適合させる《+『名』+『with』+『名』》 / 《受動態で》〈数〉‘を'2乗する;〈ある形・図形など〉‘の'平方積(面積)を求める / 〈人〉‘を'買収する,抱き込む,…‘に'わいろを使う / 〈試合の得点〉‘を'同点にする / (…と)一致する,適合する《+『with』+『名』》
(人の能力などの)『試験』,考査,テスト / (物事の)『試験』,検済,試錬,実験《+of+名》 / 化学分析;試薬 / =test match / …‘を'『試験する』,検査する / …‘を'化学分析する / (…の)試験を受ける,試験をする《+for+名》
キー(ギリシア語アルファベットの第22字X,x;英語のchに相当)

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2019/05/25 16:47:16」(JST)

wiki en

Chi-squared distribution, showing

χ 2

on the x-axis and p-value on the y-axis.

A chi-squared test, also written as $χ 2$ test, is any statistical hypothesis test where the sampling distribution of the test statistic is a chi-squared distribution when the null hypothesis is true. Without other qualification, 'chi-squared test' often is used as short for Pearson's chi-squared test. The chi-squared test is used to determine whether there is a significant difference between the expected frequencies and the observed frequencies in one or more categories.

In the standard applications of this test, the observations are classified into mutually exclusive classes, and there is some theory, or say null hypothesis, which gives the probability that any observation falls into the corresponding class. The purpose of the test is to evaluate how likely the observations that are made would be, assuming the null hypothesis is true.

Chi-squared tests are often constructed from a sum of squared errors, or through the sample variance. Test statistics that follow a chi-squared distribution arise from an assumption of independent normally distributed data, which is valid in many cases due to the central limit theorem. A chi-squared test can be used to attempt rejection of the null hypothesis that the data are independent.

Also considered a chi-squared test is a test in which this is asymptotically true, meaning that the sampling distribution (if the null hypothesis is true) can be made to approximate a chi-squared distribution as closely as desired by making the sample size large enough.

1 History
- 1.1 Pearson's chi-squared test
2 Other examples of chi-squared tests
- 2.1 Fisher's exact test
- 2.2 Binomial test
- 2.3 Other chi-squared tests
3 Yates's correction for continuity
4 Chi-squared test for variance in a normal population
5 Example chi-squared test for categorical data
6 Applications
7 See also
8 References
9 Further reading

History

In the 19th century, statistical analytical methods were mainly applied in biological data analysis and it was customary for researchers to assume that observations followed a normal distribution, such as Sir George Airy and Professor Merriman, whose works were criticized by Karl Pearson in his 1900 paper.^[1]

Until the end of 19th century, Pearson noticed the existence of significant skewness within some biological observations. In order to model the observations regardless of being normal or skewed, Pearson, in a series of articles published from 1893 to 1916,^[2]^[3]^[4]^[5] devised the Pearson distribution, a family of continuous probability distributions, which includes the normal distribution and many skewed distributions, and proposed a method of statistical analysis consisting of using the Pearson distribution to model the observation and performing the test of goodness of fit to determine how well the model and the observation really fit.

Pearson's chi-squared test

In 1900, Pearson published a paper^[1] on the $χ 2$ test which is considered to be one of the foundations of modern statistics.^[6] In this paper, Pearson investigated the test of goodness of fit.

Suppose that $n$ observations in a random sample from a population are classified into $k$ mutually exclusive classes with respective observed numbers $x i$ (for $i = 1,2,\dots, k$ ), and a null hypothesis gives the probability $p i$ that an observation falls into the $i$ th class. So we have the expected numbers $m i = np i$ for all $i$ , where

{\begin{aligned}\sum _{i=1}^{k}{p_{i}}&=1\\[8pt]\sum _{i=1}^{k}{m_{i}}&=n\sum _{i=1}^{k}{p_{i}}=\sum _{i=1}^{k}x_{i}\end{aligned}}

Pearson proposed that, under the circumstance of the null hypothesis being correct, as $n \to \infty$ the limiting distribution of the quantity given below is the $χ 2$ distribution.

X^{2}=\sum _{i=1}^{k}{\frac {(x_{i}-m_{i})^{2}}{m_{i}}}=\sum _{i=1}^{k}{{\frac {x_{i}^{2}}{m_{i}}}-n}

Pearson dealt first with the case in which the expected numbers $m i$ are large enough known numbers in all cells assuming every $x i$ may be taken as normally distributed, and reached the result that, in the limit as $n$ becomes large, $X 2$ follows the $χ 2$ distribution with $k - 1$ degrees of freedom.

However, Pearson next considered the case in which the expected numbers depended on the parameters that had to be estimated from the sample, and suggested that, with the notation of $m i$ being the true expected numbers and $m' i$ being the estimated expected numbers, the difference

X^{2}-{X'}^{2}=\sum _{i=1}^{k}{\frac {x_{i}^{2}}{m_{i}}}-\sum _{i=1}^{k}{\frac {x_{i}^{2}}{m'_{i}}}

will usually be positive and small enough to be omitted. In a conclusion, Pearson argued that if we regarded $X' 2$ as also distributed as $χ 2$ distribution with $k - 1$ degrees of freedom, the error in this approximation would not affect practical decisions. This conclusion caused some controversy in practical applications and was not settled for 20 years until Fisher's 1922 and 1924 papers.^[7]^[8]

Other examples of chi-squared tests

One test statistic that follows a chi-squared distribution exactly is the test that the variance of a normally distributed population has a given value based on a sample variance. Such tests are uncommon in practice because the true variance of the population is usually unknown. However, there are several statistical tests where the chi-squared distribution is approximately valid:

Fisher's exact test

For an exact test used in place of the 2 x 2 chi-squared test for independence, see Fisher's exact test.

Binomial test

For an exact test used in place of the 2 x 1 chi-squared test for goodness of fit, see Binomial test.

Other chi-squared tests

Cochran–Mantel–Haenszel chi-squared test.
McNemar's test, used in certain 2 × 2 tables with pairing
Tukey's test of additivity
The portmanteau test in time-series analysis, testing for the presence of autocorrelation
Likelihood-ratio tests in general statistical modelling, for testing whether there is evidence of the need to move from a simple model to a more complicated one (where the simple model is nested within the complicated one).

Yates's correction for continuity

Using the chi-squared distribution to interpret Pearson's chi-squared statistic requires one to assume that the discrete probability of observed binomial frequencies in the table can be approximated by the continuous chi-squared distribution. This assumption is not quite correct and introduces some error.

To reduce the error in approximation, Frank Yates suggested a correction for continuity that adjusts the formula for Pearson's chi-squared test by subtracting 0.5 from the absolute difference between each observed value and its expected value in a 2 × 2 contingency table.^[9] This reduces the chi-squared value obtained and thus increases its p-value.

Chi-squared test for variance in a normal population

If a sample of size $n$ is taken from a population having a normal distribution, then there is a result (see distribution of the sample variance) which allows a test to be made of whether the variance of the population has a pre-determined value. For example, a manufacturing process might have been in stable condition for a long period, allowing a value for the variance to be determined essentially without error. Suppose that a variant of the process is being tested, giving rise to a small sample of $n$ product items whose variation is to be tested. The test statistic $T$ in this instance could be set to be the sum of squares about the sample mean, divided by the nominal value for the variance (i.e. the value to be tested as holding). Then $T$ has a chi-squared distribution with $n - 1$ degrees of freedom. For example, if the sample size is 21, the acceptance region for $T$ with a significance level of 5% is between 9.59 and 34.17.

Example chi-squared test for categorical data

Suppose there is a city of 1,000,000 residents with four neighborhoods: $A$ , $B$ , $C$ , and $D$ . A random sample of 650 residents of the city is taken and their occupation is recorded as "white collar", "blue collar", or "no collar". The null hypothesis is that each person's neighborhood of residence is independent of the person's occupational classification. The data are tabulated as:

	$A$	$B$	$C$	$D$	total
White collar	90	60	104	95	349
Blue collar	30	50	51	20	151
No collar	30	40	45	35	150
Total	150	150	200	150	650

Let us take the sample living in neighborhood $A$ , 150, to estimate what proportion of the whole 1,000,000 live in neighborhood $A$ . Similarly we take 349/650 to estimate what proportion of the 1,000,000 are white-collar workers. By the assumption of independence under the hypothesis we should "expect" the number of white-collar workers in neighborhood $A$ to be

150\times {\frac {349}{650}}\approx 80.54

Then in that "cell" of the table, we have

{\frac {\left({\text{observed}}-{\text{expected}}\right)^{2}}{\text{expected}}}={\frac {\left(90-80.54\right)^{2}}{80.54}}\approx 1.11

The sum of these quantities over all of the cells is the test statistic. Under the null hypothesis, it has approximately a chi-squared distribution whose number of degrees of freedom are

({\text{number of rows}}-1)({\text{number of columns}}-1)=(3-1)(4-1)=6

If the test statistic is improbably large according to that chi-squared distribution, then one rejects the null hypothesis of independence.

A related issue is a test of homogeneity. Suppose that instead of giving every resident of each of the four neighborhoods an equal chance of inclusion in the sample, we decide in advance how many residents of each neighborhood to include. Then each resident has the same chance of being chosen as do all residents of the same neighborhood, but residents of different neighborhoods would have different probabilities of being chosen if the four sample sizes are not proportional to the populations of the four neighborhoods. In such a case, we would be testing "homogeneity" rather than "independence". The question is whether the proportions of blue-collar, white-collar, and no-collar workers in the four neighborhoods are the same. However, the test is done in the same way.

Applications

In cryptanalysis, chi-squared test is used to compare the distribution of plaintext and (possibly) decrypted ciphertext. The lowest value of the test means that the decryption was successful with high probability.^[10]^[11] This method can be generalized for solving modern cryptographic problems.^[12]

In bioinformatics, chi-squared test is used to compare the distribution of certain properties of genes (e.g, genomic content, mutation rate, interaction network clustering, etc.) belonging to different categories (e.g., disease genes, essential genes, genes on a certain chromosome etc.).^[13]^[14]

References

^ ^a ^b Pearson, Karl (1900). "On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling" (PDF). Philosophical Magazine. Series 5. 50: 157–175. doi:10.1080/14786440009463897.
^ Pearson, Karl (1893). "Contributions to the mathematical theory of evolution [abstract]". Proceedings of the Royal Society. 54: 329–333. doi:10.1098/rspl.1893.0079. JSTOR 115538.
^ Pearson, Karl (1895). "Contributions to the mathematical theory of evolution, II: Skew variation in homogeneous material". Philosophical Transactions of the Royal Society. 186: 343–414. Bibcode:1895RSPTA.186..343P. doi:10.1098/rsta.1895.0010. JSTOR 90649.
^ Pearson, Karl (1901). "Mathematical contributions to the theory of evolution, X: Supplement to a memoir on skew variation". Philosophical Transactions of the Royal Society A. 197: 443–459. Bibcode:1901RSPTA.197..443P. doi:10.1098/rsta.1901.0023. JSTOR 90841.
^ Pearson, Karl (1916). "Mathematical contributions to the theory of evolution, XIX: Second supplement to a memoir on skew variation". Philosophical Transactions of the Royal Society A. 216: 429–457. Bibcode:1916RSPTA.216..429P. doi:10.1098/rsta.1916.0009. JSTOR 91092.
^ Cochran, William G. (1952). "The Chi-square Test of Goodness of Fit". The Annals of Mathematical Statistics. 23: 315–345. doi:10.1214/aoms/1177729380. JSTOR 2236678.
^ Fisher, Ronald A. (1922). "On the Interpretation of chi-squared from Contingency Tables, and the Calculation of P". Journal of the Royal Statistical Society. 85: 87–94. doi:10.2307/2340521. JSTOR 2340521.
^ Fisher, Ronald A. (1924). "The Conditions Under Which chi-squared Measures the Discrepancey Between Observation and Hypothesis". Journal of the Royal Statistical Society. 87: 442–450. JSTOR 2341149.
^ Yates, Frank (1934). "Contingency table involving small numbers and the $χ 2$ test". Supplement to the Journal of the Royal Statistical Society. 1 (2): 217–235. JSTOR 2983604.
^ "Chi-squared Statistic". Practical Cryptography. Retrieved 18 February 2015.
^ "Using Chi Squared to Crack Codes". IB Maths Resources. British International School Phuket.
^ Ryabko, B. Ya.; Stognienko, V. S.; Shokin, Yu. I. (2004). "A new test for randomness and its application to some cryptographic problems" (PDF). Journal of Statistical Planning and Inference. 123: 365–376. doi:10.1016/s0378-3758(03)00149-6. Retrieved 18 February 2015.
^ Feldman, I.; Rzhetsky, A.; Vitkup, D. (2008). "Network properties of genes harboring inherited disease mutations". PNAS. 105 (11): 4323–432. Bibcode:2008PNAS..105.4323F. doi:10.1073/pnas.0701722105. PMC 2393821. Retrieved 29 June 2018.
^ "chi-square-tests" (PDF). Retrieved 29 June 2018.

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. Complementary and alternative treatments for anxiety symptoms and disorders: Physical, cognitive, and spiritual interventions
2. 高齢者の健康維持 geriatric health maintenance
3. 成人における初期治療の効果がない線維筋痛症の治療 treatment of fibromyalgia in adults not responsive to initial therapies
4. 転倒：一般社会で生活する高齢者における防止 falls prevention in community dwelling older persons
5. 転倒：介護施設および病院内での予防 falls prevention in nursing care facilities and the hospital setting

English Journal

Contact sensitivity in Behçet's disease.

Demirsoy EO, Kiran R, Oztürk B, Sikar Aktürk A, Etiler N.SourceKocaeli University Medical Faculty, Dermatology , Kocaeli , Turkey.
Cutaneous and ocular toxicology.Cutan Ocul Toxicol.2013 Jun;32(2):112-4. doi: 10.3109/15569527.2012.716886. Epub 2012 Sep 6.
Context: Behçet's disease (BD) is a multisystemic inflammatory disorder with unknown etiology. Many immunological changes were reported in BD previously and these changes may affect the frequency of contact sensitivity in these patients. Objective: We aimed to identify whether there is an interac
PMID 22950639

Japanese Journal

Month of birth in multiple sclerosis with and without longitudinally extensive spinal cord lesions: A study of a Japanese national survey.

Araki Yasukiyo,Kinoshita Masako,Motoyama Rie,Matsushita Takuya,Nakagawa Masanori,Kira Jun-Ichi,Tanaka Masami
Journal of the neurological sciences 330(1-2), 67-70, 2013-07-15
… Differences in the month-of-birth distributions between the patients and the general population were assessed using the chi-square test. …
NAID 120005289850