WordNet

sort according to size
the actual state of affairs; "that's the size of the situation"; "she hates me, that's about the size of it" (syn: size of it)
any glutinous material used to fill pores in surfaces or to stiffen fabrics; "size gives body to a fabric" (syn: sizing)
make to a size; bring to a suitable size
a large magnitude; "he blanched when he saw the size of the bill"; "the only city of any size in that area"
the physical magnitude of something (how big it is); "a wolf is about the size of a large dog"
the property resulting from being one of a series of graduated measurements (as of clothing); "he wears a size 13 shoe"
(used in combination) sized; "the economy-size package"; "average-size house"
cover or stiffen or glaze a porous material with size or sizing (a glutinous substance)
take a sample of; "Try these new crackers"; "Sample the regional dishes" (syn: try, try out, taste)
a small part of something intended as representative of the whole
all or part of a natural object that is collected and preserved as an example of its class
(statistics) the selection of a suitable sample for study
measurement at regular intervals of the amplitude of a varying waveform (in order to convert it to digital form)
having the surface treated or coated with sizing
having a specified size

PrepTutorEJDIC

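〈U〉〈C〉 the size (of a person or thing) / 〈U〉 largeness / 〈U〉 quantity, scale / 〈C〉 a size, measurement, or type (of hats, shoes, shirts, etc.) / 〈U〉 (colloquial) the actual situation, the truth / to sort ... by size (measurement) / to make ... to a certain size
size, sizing (a material used to fill the pores of paper or fabric) / to apply size to ...
a sample, specimen (of ...) (+ of + noun) / an example (of ...) (+ of + noun) / a free sample, trial product / sample (attributive), specimen (attributive) / to take a sample of ...; to test (examine, judge) ... by taking a sample / to actually try ...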

Wikipedia preview

Source (authority): the free encyclopedia Wikipedia, "2013/01/16 23:46:08" (JST)

wiki en

Sample size determination is the act of choosing the number of observations or replicates to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make inferences about a population from a sample. In practice, the sample size used in a study is determined by the expense of data collection and the need to have sufficient statistical power. In complicated studies there may be several different sample sizes involved: for example, in a survey that uses stratified sampling there would be a different sample size for each stratum. In a census, data are collected on the entire population, hence the sample size is equal to the population size. In experimental design, where a study may be divided into different treatment groups, there may be different sample sizes for each group.

Sample sizes may be chosen in several different ways:

expedience: for example, including those items that are readily available or convenient to collect. A choice of small sample sizes, though sometimes necessary, can result in wide confidence intervals or risks of errors in statistical hypothesis testing.
using a target variance for an estimate to be derived from the sample eventually obtained.
using a target for the power of a statistical test to be applied once the sample is collected.

How samples are collected is discussed in sampling (statistics) and survey data collection.

Introduction

Larger sample sizes generally lead to increased precision when estimating unknown parameters. For example, if we wish to know the proportion of a certain species of fish that is infected with a pathogen, we would generally have a more accurate estimate of this proportion if we sampled and examined 200, rather than 100 fish. Several fundamental facts of mathematical statistics describe this phenomenon, including the law of large numbers and the central limit theorem.

In some situations, the increase in accuracy for larger sample sizes is minimal, or even non-existent. This can result from the presence of systematic errors or strong dependence in the data, or if the data follow a heavy-tailed distribution.

Sample sizes are judged based on the quality of the resulting estimates. For example, if a proportion is being estimated, one may wish to have the 95% confidence interval be less than 0.06 units wide. Alternatively, sample size may be assessed based on the power of a hypothesis test. For example, if we are comparing the support for a certain political candidate among women with the support for that candidate among men, we may wish to have 80% power to detect a difference in the support levels of 0.04 units.

Estimating proportions and means

A relatively simple situation is estimation of a proportion. For example, we may wish to estimate the proportion of residents in a community who are at least 65 years old.

The estimator of a proportion is $\hat{p} = X/n$, where X is the number of 'positive' observations (e.g. the number of people out of the n sampled people who are at least 65 years old). When the observations are independent, this estimator has a (scaled) binomial distribution (and is also the sample mean of data from a Bernoulli distribution). The maximum variance of this distribution is 0.25/n, which occurs when the true parameter is p = 0.5. In practice, since p is unknown, the maximum variance is often used for sample size assessments.

For sufficiently large n, the distribution of the estimator $\hat{p}$ will be closely approximated by a normal distribution with the same mean and variance.^[1] Using this approximation, it can be shown that around 95% of this distribution's probability lies within 2 standard deviations of the mean. Because of this, an interval of the form

$(\hat{p} - 2\sqrt{0.25/n},\ \hat{p} + 2\sqrt{0.25/n})$

will form a 95% confidence interval for the true proportion. If this interval needs to be no more than W units wide, the equation

$4\sqrt{0.25/n} = W$

can be solved for n, yielding^[2]^[3] n = 4/W² = 1/B², where B = W/2 is the error bound on the estimate, i.e., the estimate is usually given as within ± B. So, for B = 10% one requires n = 100, for B = 5% one needs n = 400, for B = 3% the requirement is n ≈ 1111 (often rounded to about 1000), while for B = 1% a sample size of n = 10000 is required. These numbers are quoted often in news reports of opinion polls and other sample surveys.
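The rule n = 1/B² can be sketched in a few lines of Python. This is a minimal illustration under the same worst-case assumption (p = 0.5) and the same "2 standard deviations" approximation used above; the function name `sample_size_for_proportion` is hypothetical and not taken from any library.

```python
import math


def sample_size_for_proportion(error_bound: float) -> int:
    """Worst-case (p = 0.5) sample size so that an approximate 95%
    confidence interval for a proportion is within +/- error_bound.

    Implements n = 1 / B**2, rounded up to a whole observation.
    """
    if not 0.0 < error_bound < 1.0:
        raise ValueError("error_bound must be strictly between 0 and 1")
    # A tiny tolerance guards against floating-point noise pushing an
    # exact result such as 100.00000000000001 up to 101.
    return math.ceil(1.0 / error_bound ** 2 - 1e-9)


if __name__ == "__main__":
    for b in (0.10, 0.05, 0.03, 0.01):
        print(f"B = {b:.0%}: n = {sample_size_for_proportion(b)}")
```

Rounding up rather than to the nearest integer is a deliberate choice here: a fractional observation cannot be collected, and rounding down would leave the interval slightly wider than requested.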

Estimation of means

A proportion is a special case of a mean. When estimating the population mean using an independent and identically distributed (iid) sample of size n, where each data value has variance σ², the standard error of the sample mean is:

$\sigma/\sqrt{n}$

This expression describes quantitatively how the estimate becomes more precise as the sample size increases. Using the central limit theorem to justify approximating the sample mean with a normal distribution yields an approximate 95% confidence interval of the form

$(\bar{x} - 2\sigma/\sqrt{n},\ \bar{x} + 2\sigma/\sqrt{n})$

If we wish to have a confidence interval that is W units in width, we would solve

$4\sigma/\sqrt{n} = W$

for n, yielding the sample size n = 16σ²/W².

For example, if we are interested in estimating the amount by which a drug lowers a subject's blood pressure with a confidence interval that is six units wide, and we know that the standard deviation of blood pressure in the population is 15, then the required sample size is 100.
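The blood pressure example can be checked with a short Python sketch of n = 16σ²/W². It assumes a known population standard deviation and the approximate 95% interval of ± 2 standard errors described above; the function name `sample_size_for_mean` is illustrative only.

```python
import math


def sample_size_for_mean(sigma: float, width: float) -> int:
    """Sample size giving an approximate 95% confidence interval for a
    mean that is `width` units wide, for known standard deviation `sigma`.

    Implements n = 16 * sigma**2 / width**2, rounded up.
    """
    if sigma <= 0 or width <= 0:
        raise ValueError("sigma and width must be positive")
    # Small tolerance so that an exact answer is not bumped up by
    # floating-point rounding.
    return math.ceil(16.0 * sigma ** 2 / width ** 2 - 1e-9)


if __name__ == "__main__":
    # Standard deviation 15, interval six units wide.
    print(sample_size_for_mean(sigma=15, width=6))
```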

Required sample sizes for hypothesis tests

A common problem faced by statisticians is calculating the sample size required to yield a certain power for a test, given a predetermined Type I error rate α. This can be estimated from pre-determined tables for certain values, by Mead's resource equation, or, more generally, by the cumulative distribution function, as follows:

By tables

Sample size per group, by power and Cohen's d ^[4]
Power	d = 0.2	d = 0.5	d = 0.8
0.25	84	14	6
0.50	193	32	13
0.60	246	40	16
0.70	310	50	20
0.80	393	64	26
0.90	526	85	34
0.95	651	105	42
0.99	920	148	58

The table above can be used in a two-sample t-test to estimate the sample sizes of an experimental group and a control group of equal size at a significance level of 0.05.^[4] Each entry is the size of one group, so the total number of individuals in the trial is twice the number given. The parameters used are:

The desired statistical power of the trial, shown in the leftmost column.
Cohen's d (the effect size), which is the expected difference between the means of the target values of the experimental group and the control group, divided by the expected standard deviation.
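For combinations of power and effect size that are not in the table, a rough per-group figure can be obtained from the usual normal approximation n ≈ 2((z_{α/2} + z_{power})/d)². The sketch below is not the method used to build the table (which is not stated here); it assumes a two-sided test and replaces the t distribution with the normal distribution, so it tends to land on or slightly below the tabulated values. The function name is hypothetical.

```python
import math
from statistics import NormalDist


def per_group_size_normal_approx(d: float, power: float,
                                 alpha: float = 0.05) -> int:
    """Approximate size of each of two equal groups for a two-sided,
    two-sample comparison of means with standardized effect size d.

    Uses the normal approximation n = 2 * ((z_{alpha/2} + z_power) / d)**2.
    """
    if d <= 0:
        raise ValueError("d must be positive")
    if not (0.0 < power < 1.0 and 0.0 < alpha < 1.0):
        raise ValueError("power and alpha must be strictly between 0 and 1")
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1.0 - alpha / 2.0)  # two-sided critical value
    z_power = std_normal.inv_cdf(power)
    return math.ceil(2.0 * ((z_alpha + z_power) / d) ** 2)


if __name__ == "__main__":
    for d in (0.2, 0.5, 0.8):
        print(d, per_group_size_normal_approx(d, power=0.80))
```

At power 0.80 this gives 393 for d = 0.2, in agreement with the table, and 63 and 25 for d = 0.5 and d = 0.8, one below the tabulated 64 and 26. A gap of that size is what one would expect from ignoring the heavier tails of the t distribution at small sample sizes.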

Mead's resource equation

Mead's resource equation is often used for estimating sample sizes of laboratory animals, as well as in many other laboratory experiments. It may not be as accurate as using other methods in estimating sample size, but gives a hint of what is the appropriate sample size where parameters such as expected standard deviations or expected differences in values between groups are unknown or very hard to estimate.^[5]

Each parameter in the equation is a number of degrees of freedom, so 1 is subtracted from the corresponding count before it is inserted into the equation.

The equation is:^[5]

$E = N - B - T$

where:

N is the total number of individuals or units in the study (minus 1)
B is the blocking component, representing environmental effects allowed for in the design (minus 1)
T is the treatment component, corresponding to the number of treatment groups (including control group) being used, or the number of questions being asked (minus 1)
E is the degrees of freedom of the error component, and should be somewhere between 10 and 20.

For example, if a study using laboratory animals is planned with four treatment groups (T=3), with eight animals per group, making 32 animals total (N=31), without any further stratification (B=0), then E would equal 28, which is above the cutoff of 20, indicating that sample size may be a bit too large, and six animals per group might be more appropriate.^[6]
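The arithmetic of this example can be reproduced with a small Python sketch of E = N − B − T. The function name and the convention of passing raw counts (and subtracting 1 inside the function) are assumptions made for illustration; a design without blocking is treated as a single block, so that B = 0.

```python
def mead_error_df(total_units: int, treatment_groups: int,
                  blocks: int = 1) -> int:
    """Error degrees of freedom E from Mead's resource equation.

    Each argument is a raw count; 1 is subtracted from each to obtain
    the degrees of freedom N, T and B used in E = N - B - T.
    """
    if total_units < 1 or treatment_groups < 1 or blocks < 1:
        raise ValueError("all counts must be at least 1")
    n_df = total_units - 1        # N
    t_df = treatment_groups - 1   # T
    b_df = blocks - 1             # B (0 when there is no blocking)
    return n_df - b_df - t_df


if __name__ == "__main__":
    # Four groups of eight animals, then four groups of six.
    for per_group in (8, 6):
        e = mead_error_df(total_units=4 * per_group, treatment_groups=4)
        verdict = "within 10-20" if 10 <= e <= 20 else "outside 10-20"
        print(f"{per_group} per group: E = {e} ({verdict})")
```

With eight animals per group this gives E = 28, and with six per group E = 20, which sits at the upper end of the suggested range.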

By cumulative distribution function

Let X_i, i = 1, 2, ..., n be independent observations taken from a normal distribution with unknown mean μ and known variance σ². Let us consider two hypotheses, a null hypothesis:

$H_0: \mu = 0$

and an alternative hypothesis:

$H_a: \mu = \mu^*$

for some 'smallest significant difference' μ^* > 0. This is the smallest value for which we care about observing a difference. Now, if we wish to (1) reject H₀ with a probability of at least 1-β when H_a is true (i.e. a power of 1-β), and (2) reject H₀ with probability α when H₀ is true, then we need the following:

If z_α is the upper α percentage point of the standard normal distribution, then

$\Pr(\bar{x} > z_\alpha \sigma/\sqrt{n} \mid H_0 \text{ true}) = \alpha$

and so

'Reject H₀ if our sample average ($\bar{x}$) is more than $z_\alpha \sigma/\sqrt{n}$'

is a decision rule which satisfies (2). (Note, this is a 1-tailed test.)

Now we wish for this to happen with a probability at least 1-β when H_a is true. In this case, our sample average will come from a Normal distribution with mean μ^*. Therefore we require

$\Pr(\bar{x} > z_\alpha \sigma/\sqrt{n} \mid H_a \text{ true}) \ge 1 - \beta$

Through careful manipulation, this can be shown to happen when

$n \ge \left( \frac{z_\alpha + \Phi^{-1}(1-\beta)}{\mu^*/\sigma} \right)^2$

where Φ is the standard normal cumulative distribution function.
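As a numerical sketch of this one-tailed calculation, the Python code below evaluates n ≥ ((z_α + Φ⁻¹(1−β)) / (μ*/σ))² and then checks the power actually achieved, 1 − Φ(z_α − μ*√n/σ). It uses `statistics.NormalDist` from the standard library; the function names and the example values (μ* = 0.5, σ = 1) are hypothetical.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def sample_size_one_sided_z(mu_star: float, sigma: float,
                            alpha: float = 0.05, power: float = 0.80) -> int:
    """Smallest n for a one-tailed z-test of H0: mu = 0 against
    Ha: mu = mu_star (> 0) with known sigma, level alpha and the
    requested power (1 - beta)."""
    if mu_star <= 0 or sigma <= 0:
        raise ValueError("mu_star and sigma must be positive")
    z_alpha = STD_NORMAL.inv_cdf(1.0 - alpha)   # upper alpha point
    z_power = STD_NORMAL.inv_cdf(power)         # Phi^{-1}(1 - beta)
    return math.ceil(((z_alpha + z_power) / (mu_star / sigma)) ** 2)


def achieved_power(n: int, mu_star: float, sigma: float,
                   alpha: float = 0.05) -> float:
    """Probability of rejecting H0 when the true mean is mu_star."""
    z_alpha = STD_NORMAL.inv_cdf(1.0 - alpha)
    return 1.0 - STD_NORMAL.cdf(z_alpha - mu_star * math.sqrt(n) / sigma)


if __name__ == "__main__":
    n = sample_size_one_sided_z(mu_star=0.5, sigma=1.0)
    print(n, round(achieved_power(n, 0.5, 1.0), 3))
```

Checking the achieved power for n and for n − 1 is a simple way to confirm that the rounded-up sample size is the smallest one that meets the target.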

Stratified sample size

With more complicated sampling techniques, such as stratified sampling, the sample can often be split up into sub-samples. Typically, if there are k such sub-samples (from k different strata) then each of them will have a sample size n_i, i = 1, 2, ..., k. These n_i must conform to the rule that n₁ + n₂ + ... + n_k = n (i.e. that the total sample size is given by the sum of the sub-sample sizes). Selecting these n_i optimally can be done in various ways, using (for example) Neyman's optimal allocation.

There are many reasons to use stratified sampling:^[7] to decrease variances of sample estimates, to use partly non-random methods, or to study strata individually. A useful, partly non-random method would be to sample individuals where easily accessible, but, where not, sample clusters to save travel costs.^{[citation needed]}

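In general, for H strata, a weighted sample mean is

$\bar{x}_w = \sum_{h=1}^{H} W_h \bar{x}_h$

with

$\operatorname{Var}(\bar{x}_w) = \sum_{h=1}^{H} W_h^2 \operatorname{Var}(\bar{x}_h)$ ^[8]

The weights, $W_h$, frequently, but not always, represent the proportions of the population elements in the strata, and $W_h = N_h/N$. For a fixed sample size, that is $n = \sum_{h} n_h$,

$\operatorname{Var}(\bar{x}_w) = \sum_{h=1}^{H} W_h^2 S_h^2 \left( \frac{1}{n_h} - \frac{1}{N_h} \right)$ ^[9]

which can be made a minimum if the sampling rate within each stratum is made proportional to the standard deviation within each stratum: $n_h/N_h = k S_h$, where $S_h$ is the standard deviation within stratum h and k is a constant such that $\sum_{h} n_h = n$.

An "optimum allocation" is reached when the sampling rates within the strata are made directly proportional to the standard deviations within the strata and inversely proportional to the square roots of the costs per element within the strata, $C_h$:

$\frac{n_h}{N_h} = \frac{K S_h}{\sqrt{C_h}}$ ^[10]

where K is a constant such that $\sum_{h} n_h = n$, or, more generally, when

$n_h = \frac{K' W_h S_h}{\sqrt{C_h}}$ ^[11]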
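The allocation rule described above, with each stratum's share of the sample proportional to its population size times its standard deviation divided by the square root of its cost per element, can be sketched in Python as follows. The function name, argument names and example figures are hypothetical; the allocations are returned as fractions, and rounding them to whole units is left to the caller.

```python
import math
from typing import List, Optional, Sequence


def optimum_allocation(n: float,
                       stratum_sizes: Sequence[float],
                       stratum_sds: Sequence[float],
                       unit_costs: Optional[Sequence[float]] = None
                       ) -> List[float]:
    """Split a total sample size n across strata so that
    n_h is proportional to N_h * S_h / sqrt(C_h).

    With equal unit costs (the default) this reduces to allocation
    proportional to N_h * S_h.
    """
    if unit_costs is None:
        unit_costs = [1.0] * len(stratum_sizes)
    if not (len(stratum_sizes) == len(stratum_sds) == len(unit_costs)):
        raise ValueError("all inputs must have one entry per stratum")
    if any(c <= 0 for c in unit_costs):
        raise ValueError("unit costs must be positive")
    scores = [size * sd / math.sqrt(cost)
              for size, sd, cost in zip(stratum_sizes, stratum_sds, unit_costs)]
    total = sum(scores)
    if total <= 0:
        raise ValueError("at least one stratum needs a positive size and sd")
    return [n * score / total for score in scores]


if __name__ == "__main__":
    sizes, sds = [600, 400], [10.0, 20.0]
    print(optimum_allocation(100, sizes, sds))                # equal costs
    print(optimum_allocation(100, sizes, sds, [1.0, 4.0]))    # unequal costs
```

In this made-up example the smaller but more variable stratum receives the larger share when costs are equal (about 43 versus 57 units), and the split moves to 60 versus 40 once sampling in the second stratum is four times as expensive.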

Notes

^[1] NIST/SEMATECH, "7.2.4.2. Sample sizes required", e-Handbook of Statistical Methods.
^[2] "Large Sample Estimation of a Population Proportion"
^[3] "Confidence Interval for a Proportion"
^[4] Kenny, David A. (1987). Statistics for the social and behavioral sciences. Boston: Little, Brown. Chapter 13, p. 215. ISBN 0-316-48915-8.
^[5] Kirkwood, James; Robert Hubrecht (2010). The UFAW Handbook on the Care and Management of Laboratory and Other Research Animals. Wiley-Blackwell. p. 29. ISBN 1-4051-7523-0.
^[6] Isogenic.info > Resource equation by Michael FW Festing. Updated Sept. 2006
^[7] Kish (1965), Section 3.1.
^[8] Kish (1965), p. 78.
^[9] Kish (1965), p. 81.
^[10] Kish (1965), p. 93.
^[11] Kish (1965), p. 94.

References

Bartlett, J. E., II, Kotrlik, J. W., & Higgins, C. (2001). "Organizational research: Determining appropriate sample size for survey research", Information Technology, Learning, and Performance Journal, 19(1) 43-50.
Kish, L. (1965), Survey Sampling, Wiley. ISBN 0-471-48900-X

External links

Sample Size Calculator by Survey Systems

Sample Size Calculator by Raosoft, Inc.

UpToDate Contents

1. Proof, p values, and hypothesis testing
2. Selection of modality for diagnosis and staging of patients with suspected non-small cell lung cancer
3. Procedures for tissue biopsy in patients with suspected non-small cell lung cancer
4. Peripartum cardiomyopathy: Treatment and prognosis
5. Fetal blood sampling

English Journal

The Relationship of Health Literacy With Use of Digital Technology for Health Information: Implications for Public Health Practice.

Manganello J, Gerstner G, Pergolino K, Graham Y, Falisi A, Strogatz D.
Journal of Public Health Management and Practice (J Public Health Manag Pract). 2016 Dec 15. [Epub ahead of print]
OBJECTIVE: An understanding of the association of health literacy with patterns related to access and usage of digital technologies and preferences for sources of health information is necessary for public health agencies and organizations to appropriately target channels for health information diss
PMID 26672402

Toward a high-performance management system in health care, part 4: Using high-performance work practices to prevent central line-associated blood stream infections-a comparative case study.

McAlearney AS, Hefner J, Robbins J, Garman AN.
Health Care Management Review (Health Care Manage Rev). 2016 May 21. [Epub ahead of print]
BACKGROUND: Central line-associated bloodstream infections (CLABSIs) are among the most harmful health care-associated infections and a major patient safety concern. Nationally, CLABSI rates have been reduced through the implementation of evidence-based interventions; thus far, however, hospitals st
PMID 26002415

Label-free surface-enhanced Raman scattering strategy for rapid detection of penicilloic acid in milk products.

Qi M, Huang X, Zhou Y, Zhang L, Jin Y, Peng Y, Jiang H, Du S.
Food Chemistry (Food Chem). 2016 Apr 15;197(Pt A):723-9. doi: 10.1016/j.foodchem.2015.11.014. Epub 2015 Nov 9.
A label-free surface-enhanced Raman scattering (SERS) strategy based on silver-coated gold nanoparticles (Au@Ag NPs) was developed for rapid detection of penicilloic acid (PA) in milk products. It has been demonstrated that core size and shell thickness of Au@Ag NPs are two critical variants affecti
PMID 26617009

Japanese Journal

Morphology-dependent photocatalytic activity of octahedral anatase particles prepared by ultrasonication-hydrothermal reaction of titanates

Wei Zhishun,Kowalska Ewa,Verrett Jonathan,Colbeau-Justin Christophe,Remita Hynd,Ohtani Bunsho
Nanoscale 7(29), 12392-12404, 2015-08-07
… The structural/physical properties of OAP-containing samples, including specific surface area, crystallinity, crystallite size, particle aspect ratio, composition and total OAP content, were analyzed. … The sample prepared with 1 h US duration and 6 h HT duration at 433 K using 267 mg of TNWs in 80 mL of Milli-Q water exhibited the highest photocatalytic activity. …
NAID 120005649138

Testing for Linearity in Regressions with I(1) Processes

ARAI Yoichi
GRIPS Discussion Papers 15-11, 2015-08
… Finite-sample simulations show that the empirical size is close to the nominal one and the test succeeds in detecting both nonlinearity and no cointegration. JEL Classification Codes: C22, C32 …
NAID 120005648870

Optimal Colored Noise for Estimating Phase Response Curves

Morinaga Kazuhiko,Miyata Ryota,Aonishi Toru
Journal of the Physical Society of Japan 84(9), 2015-07-31
NAID 160000000820

「n」

n.

number of cases (例数)

Related: number of experiment, sample size

An n that comes before a p is written as m (e.g. synptom → symptom).

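「例数」 (number of cases)

English: number of experiment, sample size, n
Related: 症例数 (number of cases), 標本サイズ (sample size)

「症例数」 (number of cases)

English: number of cases, sample size
Related: 例数, 標本サイズ

「標本サイズ」 (sample size)

English: sample size
Related: 症例数, 例数

「number of experiment」

number of cases (例数)

Related: n, sample size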

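「SAMPLE」

Related: secondary assessment, PALS

Focused history taking

S	Signs and symptoms	Subjective and objective symptoms at onset
A	Allergies	Drugs, foods, and other substances
M	Medications	Regular medications; type and dose of the last drug given or taken
P	Past medical history	Condition at birth, significant underlying diseases, surgical history, immunizations
L	Last meal	Time of intake and what was eaten
E	Events	Events related to the current condition; events leading up to contact

Reference

http://kensyui.com/sample.pdf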

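「sample」

n.

sample (test material), clinical specimen, specimen, example

v.

to draw a sample, to collect a specimen

Related: instance, preparation, sampling, specimen

「sampling」

n.

sampling, specimen collection, sample collection, (statistics) drawing a sample

Related: sample

「size」

size, magnitude

Related: magnitude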

