実験デザイン、実験計画

関: data quality、matched group、measurement、research design、scoring method、testing

WordNet

the act or process of assigning numbers to phenomena according to a rule; "the measurements were carefully done"; "his mental measurings proved remarkably accurate" (同)measuring, measure, mensuration
an examination of the characteristics of something; "there are laboratories for commercial testing"; "it involved testing thousands of children for smallpox"
the act of subjecting to experimental test in order to determine how well something works; "they agreed to end the testing of atomic weapons"
plan something for a specific role or purpose or effect; "This room is not designed for work"
the act of working out the form of something (as by making a sketch or outline or plan); "he contributed to the design of a new instrument" (同)designing
a decorative or artistic work; "the coach had a design on the doors" (同)pattern, figure
a preliminary sketch indicating the plan for something; "the design of a building"
an arrangement scheme; "the awkward design of the keyboard made operation difficult"; "it was an excellent design for living"; "a plan for seating guests" (同)plan
make a design of; plan out in systematic, often graphic form; "design a better mousetrap"; "plan the new wing of the museum" (同)plan
conceive or fashion in the mind; invent; "She designed a good excuse for not attending classes that day"
create designs; "Dupont designs for the house of Chanel"
create the design for; create or execute in an artistic or highly skilled manner; "Chanel designed the famous suit"
intend or have as a purpose; "She designed to go far in the world of business"
to conduct a test or investigation; "We are experimenting with the new drug in order to fight this disease"
the act of conducting a controlled test or investigation (同)experimentation
a venture at something new or different; "as an experiment he decided to grow a beard"
the testing of an idea; "it was an experiment in living"; "not all experimentation is done in laboratories" (同)experimentation
try something new, as in order to gain experience; "Students experiment sexually"; "The composer experimented with a new style" (同)try out
relating to or based on experiment; "experimental physics"
relying on observation or experiment; "experimental results that supported the hypothesis" (同)data-based, observational
of the nature of or undergoing an experiment; "an experimental drug"
concealing crafty designs for advancing your own interest; "a selfish and designing nation obsessed with the dark schemes of European intrigue"- W.Churchill; "a scheming wife"; "a scheming gold digger" (同)scheming
done or made or performed with purpose and intent; "style...is more than the deliberate and designed creation"- Havelock Ellis; "games designed for all ages"; "well-designed houses" (同)intentional

PrepTutorEJDIC

〈U〉『測定』,測量 / 〈C〉《複数形で》『寸法』 / 〈U〉測定法,測量法
{C}(形式・構造などの)略図,見取り図,設計図 / 〈C〉『図案』,意匠,模様 / 〈U〉図案法,意匠術 / 〈C〉(…の)『計画』,企画,目的《+『for』+『名』》 / 《複数形で》(…への)たくらみ,下心《+『against』(『on』,『upon』)+『名』》 / 〈構造など〉‘を'『設計する』 / …‘の'『下絵を描く』,図案を描く / …‘を'『計画する』,頭の中で考える(plan) / …‘を'『予定する』
(…の)『実験』,試み《+『in』(『on, with』)+『名』》 / (…の)実験をする《+『on(upon, with)』+『名』》
実験の,実験に基づく: / 実験に使われる,実験用の / 実験的な,試験的な
(人が)たくらみのある,下心のある,陰険な,腹黒い / 設計,図案作製,(服飾の)デザイン

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2017/11/16 17:17:54」(JST)

wiki en

[Wiki en表示]

Design of experiments with full factorial design (left), response surface with second-degree polynomial (right)

The design of experiments (DOE, DOX, or experimental design) is the design of any task that aims to describe or explain the variation of information under conditions that are hypothesized to reflect the variation. The term is generally associated with true experiments in which the design introduces conditions that directly affect the variation, but may also refer to the design of quasi-experiments, in which natural conditions that influence the variation are selected for observation.

In its simplest form, an experiment aims at predicting the outcome by introducing a change of the preconditions, which is reflected in a variable called the predictor (independent). The change in the predictor is generally hypothesized to result in a change in the second variable, hence called the outcome (dependent) variable. Experimental design involves not only the selection of suitable predictors and outcomes, but planning the delivery of the experiment under statistically optimal conditions given the constraints of available resources.

Main concerns in experimental design include the establishment of validity, reliability, and replicability. For example, these concerns can be partially addressed by carefully choosing the predictor, reducing the risk of measurement error, and ensuring that the documentation of the method is sufficiently detailed. Related concerns include achieving appropriate levels of statistical power and sensitivity.

Correctly designed experiments advance knowledge in the natural and social sciences and engineering. Other applications include marketing and policy making.

1 History
- 1.1 Systematic clinical trials
- 1.2 Statistical experiments, following Charles S. Peirce
  - 1.2.1 Randomized experiments
  - 1.2.2 Optimal designs for regression models
- 1.3 Sequences of experiments
2 Fisher's principles
3 Example
4 Avoiding false positives
5 Discussion topics when setting up an experimental design
6 Causal attributions
7 Statistical control
8 Experimental designs after Fisher
9 Human participant constraints
10 See also
11 Notes
12 References
13 External links

History

Systematic clinical trials

In 1747, while serving as surgeon on HMS Salisbury, James Lind carried out a systematic clinical trial to compare remedies for scurvy.^[1] This systematic clinical trial constitutes a type of DOE.^{[citation needed]}^{[dubious – discuss]}

Lind selected 12 men from the ship, all suffering from scurvy. Lind limited his subjects to men who "were as similar as I could have them," that is, he provided strict entry requirements to reduce extraneous variation. He divided them into six pairs, giving each pair different supplements to their basic diet for two weeks. The treatments were all remedies that had been proposed:

A quart of cider every day.
Twenty five gutts (drops) of vitriol (sulphuric acid) three times a day upon an empty stomach.
One half-pint of seawater every day.
A mixture of garlic, mustard, and horseradish in a lump the size of a nutmeg.
Two spoonfuls of vinegar three times a day.
Two oranges and one lemon every day.

The citrus treatment stopped after six days when they ran out of fruit, but by that time one sailor was fit for duty while the other had almost recovered. Apart from that, only group one (cider) showed some effect of its treatment. The remainder of the crew presumably served as a control, but Lind did not report results from any control (untreated) group.

Statistical experiments, following Charles S. Peirce

A theory of statistical inference was developed by Charles S. Peirce in "Illustrations of the Logic of Science" (1877–1878) and "A Theory of Probable Inference" (1883), two publications that emphasized the importance of randomization-based inference in statistics.

Randomized experiments

Charles S. Peirce randomly assigned volunteers to a blinded, repeated-measures design to evaluate their ability to discriminate weights.^[2]^[3]^[4]^[5] Peirce's experiment inspired other researchers in psychology and education, which developed a research tradition of randomized experiments in laboratories and specialized textbooks in the 1800s.^[2]^[3]^[4]^[5]

Optimal designs for regression models

Charles S. Peirce also contributed the first English-language publication on an optimal design for regression models in 1876.^[6] A pioneering optimal design for polynomial regression was suggested by Gergonne in 1815. In 1918 Kirstine Smith published optimal designs for polynomials of degree six (and less).

Sequences of experiments

The use of a sequence of experiments, where the design of each may depend on the results of previous experiments, including the possible decision to stop experimenting, is within the scope of Sequential analysis, a field that was pioneered^[7] by Abraham Wald in the context of sequential tests of statistical hypotheses.^[8] Herman Chernoff wrote an overview of optimal sequential designs,^[9] while adaptive designs have been surveyed by S. Zacks.^[10] One specific type of sequential design is the "two-armed bandit", generalized to the multi-armed bandit, on which early work was done by Herbert Robbins in 1952.^[11]

Fisher's principles

A methodology for designing experiments was proposed by Ronald Fisher, in his innovative books: The Arrangement of Field Experiments (1926) and The Design of Experiments (1935). Much of his pioneering work dealt with agricultural applications of statistical methods. As a mundane example, he described how to test the lady tasting tea hypothesis, that a certain lady could distinguish by flavour alone whether the milk or the tea was first placed in the cup. These methods have been broadly adapted in the physical and social sciences, are still used in agricultural engineering and differ from the design and analysis of computer experiments.

Comparison: In some fields of study it is not possible to have independent measurements to a traceable metrology standard. Comparisons between treatments are much more valuable and are usually preferable, and often compared against a scientific control or traditional treatment that acts as baseline.

Randomization: Random assignment is the process of assigning individuals at random to groups or to different groups in an experiment, so that each individual of the population has the same chance of becoming a participant in the study. The random assignment of individuals to groups (or conditions within a group) distinguishes a rigorous, "true" experiment from an observational study or "quasi-experiment".^[12] There is an extensive body of mathematical theory that explores the consequences of making the allocation of units to treatments by means of some random mechanism (such as tables of random numbers, or the use of randomization devices such as playing cards or dice). Assigning units to treatments at random tends to mitigate confounding, which makes effects due to factors other than the treatment to appear to result from the treatment.

The risks associated with random allocation (such as having a serious imbalance in a key characteristic between a treatment group and a control group) are calculable and hence can be managed down to an acceptable level by using enough experimental units. However, if the population is divided into several subpopulations that somehow differ, and the research requires each subpopulation to be equal in size, stratified sampling can be used. In that way, the units in each subpopulation are randomized, but not the whole sample. The results of an experiment can be generalized reliably from the experimental units to a larger statistical population of units only if the experimental units are a random sample from the larger population; the probable error of such an extrapolation depends on the sample size, among other things.

Statistical replication: Measurements are usually subject to variation and measurement uncertainty; thus they are repeated and full experiments are replicated to help identify the sources of variation, to better estimate the true effects of treatments, to further strengthen the experiment's reliability and validity, and to add to the existing knowledge of the topic.^[13] However, certain conditions must be met before the replication of the experiment is commenced: the original research question has been published in a peer-reviewed journal or widely cited, the researcher is independent of the original experiment, the researcher must first try to replicate the original findings using the original data, and the write-up should state that the study conducted is a replication study that tried to follow the original study as strictly as possible.^[14]

Blocking: Blocking is the arrangement of experimental units into groups (blocks/lots) consisting of units that are similar to one another. Blocking reduces known but irrelevant sources of variation between units and thus allows greater precision in the estimation of the source of variation under study.

Orthogonality

Example of orthogonal factorial design

Orthogonality concerns the forms of comparison (contrasts) that can be legitimately and efficiently carried out. Contrasts can be represented by vectors and sets of orthogonal contrasts are uncorrelated and independently distributed if the data are normal. Because of this independence, each orthogonal treatment provides different information to the others. If there are T treatments and T – 1 orthogonal contrasts, all the information that can be captured from the experiment is obtainable from the set of contrasts.

Factorial experiments: Use of factorial experiments instead of the one-factor-at-a-time method. These are efficient at evaluating the effects and possible interactions of several factors (independent variables). Analysis of experiment design is built on the foundation of the analysis of variance, a collection of models that partition the observed variance into components, according to what factors the experiment must estimate or test.

Example

This example is attributed to Harold Hotelling.^[9] It conveys some of the flavor of those aspects of the subject that involve combinatorial designs.

Weights of eight objects are measured using a pan balance and set of standard weights. Each weighing measures the weight difference between objects in the left pan vs. any objects in the right pan by adding calibrated weights to the lighter pan until the balance is in equilibrium. Each measurement has a random error. The average error is zero; the standard deviations of the probability distribution of the errors is the same number σ on different weighings; errors on different weighings are independent. Denote the true weights by

\theta _{1},\dots ,\theta _{8}.\,

We consider two different experiments:

Weigh each object in one pan, with the other pan empty. Let X_i be the measured weight of the object, for i = 1, ..., 8.
Do the eight weighings according to the following schedule and let Y_i be the measured difference for i = 1, ..., 8:

{\begin{array}{lcc}&{\text{left pan}}&{\text{right pan}}\\\hline {\text{1st weighing:}}&1\ 2\ 3\ 4\ 5\ 6\ 7\ 8&{\text{(empty)}}\\{\text{2nd:}}&1\ 2\ 3\ 8\ &4\ 5\ 6\ 7\\{\text{3rd:}}&1\ 4\ 5\ 8\ &2\ 3\ 6\ 7\\{\text{4th:}}&1\ 6\ 7\ 8\ &2\ 3\ 4\ 5\\{\text{5th:}}&2\ 4\ 6\ 8\ &1\ 3\ 5\ 7\\{\text{6th:}}&2\ 5\ 7\ 8\ &1\ 3\ 4\ 6\\{\text{7th:}}&3\ 4\ 7\ 8\ &1\ 2\ 5\ 6\\{\text{8th:}}&3\ 5\ 6\ 8\ &1\ 2\ 4\ 7\end{array}}

Then the estimated value of the weight θ₁ is

{\widehat {\theta }}_{1}={\frac {Y_{1}+Y_{2}+Y_{3}+Y_{4}-Y_{5}-Y_{6}-Y_{7}-Y_{8}}{8}}.

Similar estimates can be found for the weights of the other items. For example

{\begin{aligned}{\widehat {\theta }}_{2}&={\frac {Y_{1}+Y_{2}-Y_{3}-Y_{4}+Y_{5}+Y_{6}-Y_{7}-Y_{8}}{8}}.\\[5pt]{\widehat {\theta }}_{3}&={\frac {Y_{1}+Y_{2}-Y_{3}-Y_{4}-Y_{5}-Y_{6}+Y_{7}+Y_{8}}{8}}.\\[5pt]{\widehat {\theta }}_{4}&={\frac {Y_{1}-Y_{2}+Y_{3}-Y_{4}+Y_{5}-Y_{6}+Y_{7}-Y_{8}}{8}}.\\[5pt]{\widehat {\theta }}_{5}&={\frac {Y_{1}-Y_{2}+Y_{3}-Y_{4}-Y_{5}+Y_{6}-Y_{7}+Y_{8}}{8}}.\\[5pt]{\widehat {\theta }}_{6}&={\frac {Y_{1}-Y_{2}-Y_{3}+Y_{4}+Y_{5}-Y_{6}-Y_{7}+Y_{8}}{8}}.\\[5pt]{\widehat {\theta }}_{7}&={\frac {Y_{1}-Y_{2}-Y_{3}+Y_{4}-Y_{5}+Y_{6}+Y_{7}-Y_{8}}{8}}.\\[5pt]{\widehat {\theta }}_{8}&={\frac {Y_{1}+Y_{2}+Y_{3}+Y_{4}+Y_{5}+Y_{6}+Y_{7}+Y_{8}}{8}}.\end{aligned}}

The question of design of experiments is: which experiment is better?

The variance of the estimate X₁ of θ₁ is σ² if we use the first experiment. But if we use the second experiment, the variance of the estimate given above is σ²/8. Thus the second experiment gives us 8 times as much precision for the estimate of a single item, and estimates all items simultaneously, with the same precision. What the second experiment achieves with eight would require 64 weighings if the items are weighed separately. However, note that the estimates for the items obtained in the second experiment have errors that correlate with each other.

Many problems of the design of experiments involve combinatorial designs, as in this example and others.^[15]

Avoiding false positives

False positive conclusions, often resulting from the pressure to publish or the author's own confirmation bias, are an inherent hazard in many fields. A good way to prevent biases potentially leading to false positives in the data collection phase is to use a double-blind design. When a double-blind design is used, participants are randomly assigned to experimental groups but the researcher is unaware of what participants belong to which group. Therefore, the researcher can not affect the participants' response to the intervention. Experimental designs with undisclosed degrees of freedom are a problem.^[16] This can lead to conscious or unconscious "p-hacking": trying multiple things until you get the desired result. It typically involves the manipulation - perhaps unconsciously - of the process of statistical analysis and the degrees of freedom until they return a figure below the p<.05 level of statistical significance.^[17]^[18] So the design of the experiment should include a clear statement proposing the analyses to be undertaken. P-hacking can be prevented by preregistering researches, in which researchers have to send their data analysis plan to the journal they wish to publish their paper in before they even start their data collection, so no data manipulation is possible (https://osf.io). Another way to prevent this is taking the double-blind design to the data-analysis phase, where the data are sent to a data-analyst unrelated to the research who scrambles up the data so there is no way to know which participants belong to before they are potentially taken away as outliers.

Clear and complete documentation of the experimental methodology is also important in order to support replication of results.^[19]

Discussion topics when setting up an experimental design

An experimental design or randomized clinical trial requires careful consideration of several factors before actually doing the experiment.^[20] An experimental design is the laying out of a detailed experimental plan in advance of doing the experiment. Some of the following topics have already been discussed in the principles of experimental design section:

How many factors does the design have, and are the levels of these factors fixed or random?
Are control conditions needed, and what should they be?
Manipulation checks; did the manipulation really work?
What are the background variables?
What is the sample size. How many units must be collected for the experiment to be generalisable and have enough power?
What is the relevance of interactions between factors?
What is the influence of delayed effects of substantive factors on outcomes?
How do response shifts affect self-report measures?
How feasible is repeated administration of the same measurement instruments to the same units at different occasions, with a post-test and follow-up tests?
What about using a proxy pretest?
Are there lurking variables?
Should the client/patient, researcher or even the analyst of the data be blind to conditions?
What is the feasibility of subsequent application of different conditions to the same units?
How many of each control and noise factors should be taken into account?

The independent variable of a study often has many levels or different groups. In a true experiment, researchers can have an experimental group, which is where their intervention testing the hypothesis is implemented, and a control group, which has all the same element as the experimental group, without the interventional element. Thus, when everything else except for one intervention is held constant, researchers can certify with some certainty that this one element is what caused the observed change. In some instances, having a control group is not ethical. This is sometimes solved using two different experimental groups. In some cases, independent variables cannot be manipulated, for example when testing the difference between two groups who have a different disease, or testing the difference between genders (obviously variables that would be hard or unethical to assign participants to). In these cases, a quasi-experimental design may be used.

Causal attributions

In the pure experimental design, the independent (predictor) variable is manipulated by the researcher - that is - every participant of the research is chosen randomly from the population, and each participant chosen is assigned randomly to conditions of the independent variable. Only when this is done is it possible to certify with high probability that the reason for the differences in the outcome variables are caused by the different conditions. Therefore, researchers should choose the experimental design over other design types whenever possible. However, the nature of the independent variable does not always allow for manipulation. In those cases, researchers must be aware of not certifying about causal attribution when their design doesn't allow for it. For example, in observational designs, participants are not assigned randomly to conditions, and so if there are differences found in outcome variables between conditions, it is likely that there is something other than the differences between the conditions that causes the differences in outcomes, that is - a third variable. The same goes for studies with correlational design. (Adér & Mellenbergh, 2008).

Statistical control

It is best that a process be in reasonable statistical control prior to conducting designed experiments. When this is not possible, proper blocking, replication, and randomization allow for the careful conduct of designed experiments.^[21] To control for nuisance variables, researchers institute control checks as additional measures. Investigators should ensure that uncontrolled influences (e.g., source credibility perception) do not skew the findings of the study. A manipulation check is one example of a control check. Manipulation checks allow investigators to isolate the chief variables to strengthen support that these variables are operating as planned.

One of the most important requirements of experimental research designs is the necessity of eliminating the effects of spurious, intervening, and antecedent variables. In the most basic model, cause (X) leads to effect (Y). But there could be a third variable (Z) that influences (Y), and X might not be the true cause at all. Z is said to be a spurious variable and must be controlled for. The same is true for intervening variables (a variable in between the supposed cause (X) and the effect (Y)), and anteceding variables (a variable prior to the supposed cause (X) that is the true cause). When a third variable is involved and has not been controlled for, the relation is said to be a zero order relationship. In most practical applications of experimental research designs there are several causes (X1, X2, X3). In most designs, only one of these causes is manipulated at a time.

Experimental designs after Fisher

Some efficient designs for estimating several main effects were found independently and in near succession by Raj Chandra Bose and K. Kishen in 1940 at the Indian Statistical Institute, but remained little known until the Plackett–Burman designs were published in Biometrika in 1946. About the same time, C. R. Rao introduced the concepts of orthogonal arrays as experimental designs. This concept played a central role in the development of Taguchi methods by Genichi Taguchi, which took place during his visit to Indian Statistical Institute in early 1950s. His methods were successfully applied and adopted by Japanese and Indian industries and subsequently were also embraced by US industry albeit with some reservations.

In 1950, Gertrude Mary Cox and William Gemmell Cochran published the book Experimental Designs, which became the major reference work on the design of experiments for statisticians for years afterwards.

Developments of the theory of linear models have encompassed and surpassed the cases that concerned early writers. Today, the theory rests on advanced topics in linear algebra, algebra and combinatorics.

As with other branches of statistics, experimental design is pursued using both frequentist and Bayesian approaches: In evaluating statistical procedures like experimental designs, frequentist statistics studies the sampling distribution while Bayesian statistics updates a probability distribution on the parameter space.

Some important contributors to the field of experimental designs are C. S. Peirce, R. A. Fisher, F. Yates, C. R. Rao, R. C. Bose, J. N. Srivastava, Shrikhande S. S., D. Raghavarao, W. G. Cochran, O. Kempthorne, W. T. Federer, V. V. Fedorov, A. S. Hedayat, J. A. Nelder, R. A. Bailey, J. Kiefer, W. J. Studden, A. Pázman, F. Pukelsheim, D. R. Cox, H. P. Wynn, A. C. Atkinson, G. E. P. Box and G. Taguchi.^{[citation needed]} The textbooks of D. Montgomery, R. Myers, and G. Box/W. Hunter/J.S. Hunter have reached generations of students and practitioners. ^[22] ^[23] ^[24] ^[25] ^[26]

Some discussion of experimental design in the context of system identification (model building for static or dynamic models) is given in ^[27] and.^[28]

Human participant constraints

Laws and ethical considerations preclude some carefully designed experiments with human subjects. Legal constraints are dependent on jurisdiction. Constraints may involve institutional review boards, informed consent and confidentiality affecting both clinical (medical) trials and behavioral and social science experiments.^[29] In the field of toxicology, for example, experimentation is performed on laboratory animals with the goal of defining safe exposure limits for humans.^[30] Balancing the constraints are views from the medical field.^[31] Regarding the randomization of patients, "... if no one knows which therapy is better, there is no ethical imperative to use one therapy or another." (p 380) Regarding experimental design, "...it is clearly not ethical to place subjects at risk to collect data in a poorly designed study when this situation can be easily avoided...". (p 393)

Notes

^ Dunn, Peter (January 1997). "James Lind (1716-94) of Edinburgh and the treatment of scurvy". Archives of Disease in Childhood Fetal and Neonatal Edition. United Kingdom: British Medical Journal Publishing Group. 76 (1): 64–65. doi:10.1136/fn.76.1.F64. PMC 1720613 . PMID 9059193. Retrieved 2009-01-17.
^ ^a ^b Peirce, Charles Sanders; Jastrow, Joseph (1885). "On Small Differences in Sensation". Memoirs of the National Academy of Sciences. 3: 73–83.
^ ^a ^b Hacking, Ian (September 1988). "Telepathy: Origins of Randomization in Experimental Design". Isis. 79 (3): 427–451. doi:10.1086/354775. JSTOR 234674. MR 1013489.
^ ^a ^b Stephen M. Stigler (November 1992). "A Historical View of Statistical Concepts in Psychology and Educational Research". American Journal of Education. 101 (1): 60–70. doi:10.1086/444032. JSTOR 1085417.
^ ^a ^b Trudy Dehue (December 1997). "Deception, Efficiency, and Random Groups: Psychology and the Gradual Origination of the Random Group Design". Isis. 88 (4): 653–673. doi:10.1086/383850. PMID 9519574.
^ Peirce, C. S. (1876). "Note on the Theory of the Economy of Research". Coast Survey Report: 197–201. , actually published 1879, NOAA PDF Eprint.
Reprinted in Collected Papers 7, paragraphs 139–157, also in Writings 4, pp. 72–78, and in Peirce, C. S. (July–August 1967). "Note on the Theory of the Economy of Research". Operations Research. 15 (4): 643–648. doi:10.1287/opre.15.4.643. JSTOR 168276.
^ Johnson, N.L. (1961). "Sequential analysis: a survey." Journal of the Royal Statistical Society, Series A. Vol. 124 (3), 372–411. (pages 375–376)
^ Wald, A. (1945) "Sequential Tests of Statistical Hypotheses", Annals of Mathematical Statistics, 16 (2), 117–186.
^ ^a ^b Herman Chernoff, Sequential Analysis and Optimal Design, SIAM Monograph, 1972.
^ Zacks, S. (1996) "Adaptive Designs for Parametric Models". In: Ghosh, S. and Rao, C. R., (Eds) (1996). "Design and Analysis of Experiments," Handbook of Statistics, Volume 13. North-Holland. ISBN 0-444-82061-2. (pages 151–180)
^ Robbins, H. (1952). "Some Aspects of the Sequential Design of Experiments". Bulletin of the American Mathematical Society. 58 (5): 527–535. doi:10.1090/S0002-9904-1952-09620-8.
^ Creswell, J.W. (2008), Educational research: Planning, conducting, and evaluating quantitative and qualitative research (3rd edition), Upper Saddle River, NJ: Prentice Hall. 2008, p. 300. ISBN 0-13-613550-1
^ Dr. Hani (2009). "Replication study". Retrieved 27 October 2011.
^ Burman, Leonard E.; Robert W. Reed; James Alm (2010), "A call for replication studies", Public Finance Review, 38: 787–793, doi:10.1177/1091142110385210, retrieved 27 October 2011
^ Jack Sifri (8 December 2014). "How to Use Design of Experiments to Create Robust Designs With High Yield". youtube.com. Retrieved 2015-02-11.
^ Simmons, Joseph; Leif Nelson; Uri Simonsohn (November 2011). "False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant". Psychological Science. Washington DC: Association for Psychological Science. 22 (11): 1359–1366. doi:10.1177/0956797611417632. ISSN 0956-7976. PMID 22006061. Retrieved 29 January 2012.
^ "Science, Trust And Psychology In Crisis". KPLU. 2014-06-02. Retrieved 2014-06-12.
^ "Why Statistically Significant Studies Can Be Insignificant". Pacific Standard. 2014-06-04. Retrieved 2014-06-12.
^ Chris Chambers (2014-06-10). "Physics envy: Do 'hard' sciences hold the solution to the replication crisis in psychology?". theguardian.com. Retrieved 2014-06-12.
^ Ader, Mellenberg & Hand (2008) "Advising on Research Methods: A consultant's companion"
^ Bisgaard, S (2008) "Must a Process be in Statistical Control before Conducting Designed Experiments?", Quality Engineering, ASQ, 20 (2), pp 143 - 176
^ Montgomery, Douglas (2013). Design and analysis of experiments (8th ed.). Hoboken, NJ: John Wiley & Sons, Inc. ISBN 9781118146927.
^ Walpole, Ronald E.; Myers, Raymond H.; Myers, Sharon L.; Ye, Keying (2007). Probability & statistics for engineers & scientists (8 ed.). Upper Saddle River, NJ: Pearson Prentice Hall. ISBN 978-0131877115.
^ Myers, Raymond H.; Montgomery, Douglas C.; Vining, G. Geoffrey; Robinson, Timothy J. (2010). Generalized linear models : with applications in engineering and the sciences (2 ed.). Hoboken, N.J.: Wiley. ISBN 978-0470454633.
^ Box, George E.P.; Hunter, William G.; Hunter, J. Stuart (1978). Statistics for Experimenters : An Introduction to Design, Data Analysis, and Model Building. New York: Wiley. ISBN 0-471-09315-7.
^ Box, George E.P.; Hunter, William G.; Hunter, J. Stuart (2005). Statistics for Experimenters : Design, Innovation, and Discovery (2 ed.). Hoboken, N.J.: Wiley. ISBN 978-0471718130.
^ Spall, J. C. (2010), “Factorial Design for Efficient Experimentation: Generating Informative Data for System Identification,” IEEE Control Systems Magazine, vol. 30(5), pp. 38–53. https://dx.doi.org/10.1109/MCS.2010.937677
^ Pronzato, L. (2008), “Optimal experimental design and some related control problems,” Automatica, vol. 44, pp. 303–325.
^ Moore, David S.; Notz, William I. (2006). Statistics : concepts and controversies (6th ed.). New York: W.H. Freeman. pp. Chapter 7: Data ethics. ISBN 9780716786368.
^ Ottoboni, M. Alice (1991). The dose makes the poison : a plain-language guide to toxicology (2nd ed.). New York, N.Y: Van Nostrand Reinhold. ISBN 0442006608.
^ Glantz, Stanton A. (1992). Primer of biostatistics (3rd ed.). ISBN 0-07-023511-2.

References

Peirce, C. S. (1877–1878), "Illustrations of the Logic of Science" (series), Popular Science Monthly, vols. 12-13. Relevant individual papers:
- (1878 March), "The Doctrine of Chances", Popular Science Monthly, v. 12, March issue, pp. 604–615. Internet Archive Eprint.
- (1878 April), "The Probability of Induction", Popular Science Monthly, v. 12, pp. 705–718. Internet Archive Eprint.
- (1878 June), "The Order of Nature", Popular Science Monthly, v. 13, pp. 203–217.Internet Archive Eprint.
- (1878 August), "Deduction, Induction, and Hypothesis", Popular Science Monthly, v. 13, pp. 470–482. Internet Archive Eprint.
- Peirce, C. S. (1883), "A Theory of Probable Inference", Studies in Logic, pp. 126-181, Little, Brown, and Company. (Reprinted 1983, John Benjamins Publishing Company, ISBN 90-272-3271-7)

External links

A chapter from a "NIST/SEMATECH Handbook on Engineering Statistics" at NIST
Box–Behnken designs from a "NIST/SEMATECH Handbook on Engineering Statistics" at NIST
Detailed mathematical developments of most common DoE in the Opera Magistris v3.6 online reference Chapter 15, section 7.4, ISBN 978-2-8399-0932-7.

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. 尿管ステントの留置およびマネージメント placement and management of indwelling ureteral stents
2. 生物統計学および疫学に関する一般用語集 glossary of common biostatistical and epidemiological terms
3. 輸血合併症の予防のための白血球除去 leukoreduction to prevent complications of blood transfusion
4. 鎌状赤血球症における血管閉塞症 vasoocclusion in sickle cell disease
5. 小児における特異的学習障害：教育的マネージメント specific learning disabilities in children educational management

English Journal

Menu-engineering in restaurants - adapting portion sizes on plates to enhance vegetable consumption: a real-life experiment.

Reinders MJ1, Huitink M2, Dijkstra SC2, Maaskant AJ3, Heijnen J4.
The international journal of behavioral nutrition and physical activity.Int J Behav Nutr Phys Act.2017 Dec 25;14(1):41. doi: 10.1186/s12966-017-0496-9.
PMID 28424081

Novel electrochemiluminescence of silver nanoclusters fabricated on triplex DNA scaffolds for label-free detection of biothiols.

Feng L1, Wu L2, Xing F3, Hu L4, Ren J2, Qu X5.
Biosensors & bioelectronics.Biosens Bioelectron.2017 Dec 15;98:378-385. doi: 10.1016/j.bios.2017.07.016. Epub 2017 Jul 6.
PMID 28709087

The hydrological functioning of a constructed fen wetland watershed.

Ketcheson SJ1, Price JS2, Sutton O3, Sutherland G4, Kessel E5, Petrone RM6.
The Science of the total environment.Sci Total Environ.2017 Dec 15;603-604:593-605. doi: 10.1016/j.scitotenv.2017.06.101. Epub 2017 Jun 22.
PMID 28646778

Japanese Journal

A high transmittance optical recording material with long-term reliability for super-multilayer discs

Shimomai Kenichi,Asano Sho,Oshita Junji,Matsuda Isao,Kojo Shinichi,Murai Wakaaki,Hattori Masashi,Shimizu Atsuo,Fujii Toru
Jpn. J. Appl. Phys. 54(9S), 09MB02, 2015-07-30
… This paper clarifies the recording mechanism of GeBi oxide material and proposes a suitable material design that satisfies the abovementioned characteristics. … Furthermore, experimental results of recording on super-multilayer discs based on GeBi oxide recording material are presented. …
NAID 150000111217

A 0.1 G-to-20 G integrated MEMS inertial sensor

Yamane Daisuke,Konishi Toshifumi,Matsushima Takaaki,Toshiyoshi Hiroshi,Masu Kazuya,Machida Katsuyuki
Jpn. J. Appl. Phys. 54(8), 087202, 2015-07-07
… This paper presents a novel integrated MEMS inertial sensor with a wide range of acceleration from 0.1 G to 20 G (G = 9.8 m/s<sup>2</sup>) and with a design suitable for CMOS integration. … The experimental results show that the MEMS sensor has higher sensing resolution than those of conventional MEMS accelerometers. …
NAID 150000111162

Airborne ultrasonic transducer using polymer-based elastomer with high output-to-weight ratio

Wu Jiang,Mizuno Yosuke,Tabaru Marie,Nakamura Kentaro
Jpn. J. Appl. Phys. 54(8), 087201, 2015-07-02
… In this study, we design and fabricate a transducer composed of a PPS-based longitudinal vibrator and a PPS-based disk of 0.3 mm thickness to obtain high-intensity ultrasound. … The experimental results indicate that PPS is a good substitute for metal as the elastomer for manufacturing airborne ultrasonic transducers with a high output-to-weight ratio. …
NAID 150000111190

「testing」

	Library resources about; Experimental design
	Resources in your library;

　　[★]

n.

試験

関: assessment、data quality、exam、examination、examine、experimental design、matched group、measurement、research design、scoring method、test、trial

「measurement」

　　[★]

測定、計測、測定値

関: data quality、determine、estimate、experimental design、fathom、matched group、measure、research design、scoring method、testing

「scoring method」

　　[★]

スコアリング法、スコア化法、点数化法

関: data quality、experimental design、matched group、measurement、research design、testing

「実験デザイン」

　　[★]

英: experimental design
関: 計測、試験、実験計画、研究デザイン、データ品質、対応群、スコアリング法

「data quality」

　　[★]

データ品質

関: experimental design、matched group、measurement、research design、scoring method、testing

「design」

　　[★]

デザイン、計画、設計、ザインする

関: engineer、enterprise、plan、planning、program、programme、project、schema、scheme

「experimental」

　　[★]

adj.

実験の、実験的な、実験用の、実験上の、経験的な

関: a priori、empiric、empirical、empirically、experiment、experimentally

「experiment」

　　[★]

実験

関: experimental

[ADC1997-1] Dunn, Peter (January 1997). "James Lind (1716-94) of Edinburgh and the treatment of scurvy". Archives of Disease in Childhood Fetal and Neonatal Edition. United Kingdom: British Medical Journal Publishing Group. 76 (1): 64–65. doi:10.1136/fn.76.1.F64. PMC 1720613 . PMID 9059193. Retrieved 2009-01-17.

[smalldiff-2] Peirce, Charles Sanders; Jastrow, Joseph (1885). "On Small Differences in Sensation". Memoirs of the National Academy of Sciences. 3: 73–83.

[telepathy-3] Hacking, Ian (September 1988). "Telepathy: Origins of Randomization in Experimental Design". Isis. 79 (3): 427–451. doi:10.1086/354775. JSTOR 234674. MR 1013489.

[stigler-4] Stephen M. Stigler (November 1992). "A Historical View of Statistical Concepts in Psychology and Educational Research". American Journal of Education. 101 (1): 60–70. doi:10.1086/444032. JSTOR 1085417.

[dehue-5] Trudy Dehue (December 1997). "Deception, Efficiency, and Random Groups: Psychology and the Gradual Origination of the Random Group Design". Isis. 88 (4): 653–673. doi:10.1086/383850. PMID 9519574.

[6] Peirce, C. S. (1876). "Note on the Theory of the Economy of Research". Coast Survey Report: 197–201. , actually published 1879, NOAA PDF Eprint.
Reprinted in Collected Papers 7, paragraphs 139–157, also in Writings 4, pp. 72–78, and in Peirce, C. S. (July–August 1967). "Note on the Theory of the Economy of Research". Operations Research. 15 (4): 643–648. doi:10.1287/opre.15.4.643. JSTOR 168276.

[7] Johnson, N.L. (1961). "Sequential analysis: a survey." Journal of the Royal Statistical Society, Series A. Vol. 124 (3), 372–411. (pages 375–376)

[8] Wald, A. (1945) "Sequential Tests of Statistical Hypotheses", Annals of Mathematical Statistics, 16 (2), 117–186.

[ref3-9] Herman Chernoff, Sequential Analysis and Optimal Design, SIAM Monograph, 1972.

[10] Zacks, S. (1996) "Adaptive Designs for Parametric Models". In: Ghosh, S. and Rao, C. R., (Eds) (1996). "Design and Analysis of Experiments," Handbook of Statistics, Volume 13. North-Holland. ISBN 0-444-82061-2. (pages 151–180)

[11] Robbins, H. (1952). "Some Aspects of the Sequential Design of Experiments". Bulletin of the American Mathematical Society. 58 (5): 527–535. doi:10.1090/S0002-9904-1952-09620-8.

[12] Creswell, J.W. (2008), Educational research: Planning, conducting, and evaluating quantitative and qualitative research (3rd edition), Upper Saddle River, NJ: Prentice Hall. 2008, p. 300. ISBN 0-13-613550-1

[13] Dr. Hani (2009). "Replication study". Retrieved 27 October 2011.

[14] Burman, Leonard E.; Robert W. Reed; James Alm (2010), "A call for replication studies", Public Finance Review, 38: 787–793, doi:10.1177/1091142110385210, retrieved 27 October 2011

[yout_Howt-15] Jack Sifri (8 December 2014). "How to Use Design of Experiments to Create Robust Designs With High Yield". youtube.com. Retrieved 2015-02-11.

[16] Simmons, Joseph; Leif Nelson; Uri Simonsohn (November 2011). "False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant". Psychological Science. Washington DC: Association for Psychological Science. 22 (11): 1359–1366. doi:10.1177/0956797611417632. ISSN 0956-7976. PMID 22006061. Retrieved 29 January 2012.

[17] "Science, Trust And Psychology In Crisis". KPLU. 2014-06-02. Retrieved 2014-06-12.

[18] "Why Statistically Significant Studies Can Be Insignificant". Pacific Standard. 2014-06-04. Retrieved 2014-06-12.

[19] Chris Chambers (2014-06-10). "Physics envy: Do 'hard' sciences hold the solution to the replication crisis in psychology?". theguardian.com. Retrieved 2014-06-12.

[20] Ader, Mellenberg & Hand (2008) "Advising on Research Methods: A consultant's companion"

[21] Bisgaard, S (2008) "Must a Process be in Statistical Control before Conducting Designed Experiments?", Quality Engineering, ASQ, 20 (2), pp 143 - 176

[22] Montgomery, Douglas (2013). Design and analysis of experiments (8th ed.). Hoboken, NJ: John Wiley & Sons, Inc. ISBN 9781118146927.

[23] Walpole, Ronald E.; Myers, Raymond H.; Myers, Sharon L.; Ye, Keying (2007). Probability & statistics for engineers & scientists (8 ed.). Upper Saddle River, NJ: Pearson Prentice Hall. ISBN 978-0131877115.

[24] Myers, Raymond H.; Montgomery, Douglas C.; Vining, G. Geoffrey; Robinson, Timothy J. (2010). Generalized linear models : with applications in engineering and the sciences (2 ed.). Hoboken, N.J.: Wiley. ISBN 978-0470454633.

[25] Box, George E.P.; Hunter, William G.; Hunter, J. Stuart (1978). Statistics for Experimenters : An Introduction to Design, Data Analysis, and Model Building. New York: Wiley. ISBN 0-471-09315-7.

[26] Box, George E.P.; Hunter, William G.; Hunter, J. Stuart (2005). Statistics for Experimenters : Design, Innovation, and Discovery (2 ed.). Hoboken, N.J.: Wiley. ISBN 978-0471718130.

[27] Spall, J. C. (2010), “Factorial Design for Efficient Experimentation: Generating Informative Data for System Identification,” IEEE Control Systems Magazine, vol. 30(5), pp. 38–53. https://dx.doi.org/10.1109/MCS.2010.937677

[28] Pronzato, L. (2008), “Optimal experimental design and some related control problems,” Automatica, vol. 44, pp. 303–325.

[29] Moore, David S.; Notz, William I. (2006). Statistics : concepts and controversies (6th ed.). New York: W.H. Freeman. pp. Chapter 7: Data ethics. ISBN 9780716786368.

[30] Ottoboni, M. Alice (1991). The dose makes the poison : a plain-language guide to toxicology (2nd ed.). New York, N.Y: Van Nostrand Reinhold. ISBN 0442006608.

[31] Glantz, Stanton A. (1992). Primer of biostatistics (3rd ed.). ISBN 0-07-023511-2.

v t e Design of experiments
Scientific method	Scientific experiment Statistical design Control Internal and external validity Experimental unit Blinding Optimal design: Bayesian Random assignment Randomization Restricted randomization Replication versus subsampling Sample size
Treatment and blocking	Treatment Effect size Contrast Interaction Confounding Orthogonality Blocking Covariate Nuisance variable
Models and inference	Linear regression Ordinary least squares Bayesian Random effect Mixed model Hierarchical model: Bayesian Analysis of variance (Anova) Cochran's theorem Manova (multivariate) Ancova (covariance) Compare means Multiple comparison
Designs Completely randomized	Factorial Fractional factorial Plackett-Burman Taguchi Response surface methodology Polynomial and rational modeling Box-Behnken Central composite Block Generalized randomized block design (GRBD) Latin square Graeco-Latin square Orthogonal array Latin hypercube Repeated measures design Crossover study Randomized controlled trial Sequential analysis Sequential probability ratio test
Glossary Category Statistics portal Statistical outline Statistical topics

v t e Clinical research and experimental design
Overview	Clinical trial Trial protocols Adaptive clinical trial Academic clinical trials Clinical study design
Controlled study (EBM I to II-1; A to B)	Randomized controlled trial Scientific experiment Blind experiment Open-label trial
Observational study (EBM II-2 to II-3; B to C)	Cross-sectional study vs. Longitudinal study, Ecological study Cohort study Retrospective Prospective Case-control study (Nested case-control study) Case series Case study Case report
Epidemiology/ methods	occurrence: Incidence (Cumulative incidence) Prevalence Point Period association: absolute (Absolute risk reduction, Attributable risk, Attributable risk percent) relative (Relative risk, Odds ratio, Hazard ratio) other: Clinical endpoint Virulence Infectivity Mortality rate Morbidity Case fatality rate Specificity and sensitivity Likelihood-ratios Pre/post-test probability
Trial/test types	In vitro In vivo Animal testing Animal testing on non-human primates First-in-man study Multicenter trial Seeding trial Vaccine trial
Analysis of clinical trials	Risk–benefit ratio Systematic review Replication Meta-analysis Intention-to-treat analysis
Interpretation of results	Selection bias Survivorship bias Correlation does not imply causation Null result
Category Glossary List of topics

v t e Six Sigma tools
Define phase	Project charter Voice of the customer Value stream mapping
Measure phase	Business process mapping Process capability Pareto chart
Analyse phase	Root cause analysis Failure mode and effects analysis Multi-vari chart
Improve phase	Design of experiments Kaizen
Control phase	Control plan Statistical process control 5S Poka-yoke
DMAIC

リンク元	「testing」「measurement」「scoring method」「実験デザイン」「data quality」
関連記事	「design」「experimental」「experiment」

匿名

検索

案内

experimental design