生存率分析、生存分析

関: survival analyses

WordNet

an investigation of the component parts of a whole and their relations in making up the whole
the abstract separation of a whole into its constituent parts in order to study the parts and their relations (同)analytic thinking
a branch of mathematics involving calculus and the theory of limits; sequences and series and integration and differentiation
a form of literary criticism in which the structure of a piece of writing is analyzed
the use of closed-class words instead of inflections: e.g., `the father of the bride instead of `the brides father
a state of surviving; remaining alive (同)endurance
a natural process resulting in the evolution of organisms best adapted to the environment (同)survival of the fittest, natural_selection, selection
something that survives

PrepTutorEJDIC

(内容・状況などの)『分析』,分解;(詳細な)検討 / (化学・物理で)分析;《米》(心理学で)[精神]分析;(数学で)解析
〈U〉(…が)生き残ること,(…の)生存,残存《+of+名》 / 〈C〉生存者,残存者;残存物,遺物

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2015/09/24 09:08:24」(JST)

wiki en

[Wiki en表示]

Survival analysis is a branch of statistics that deals with analysis of time duration until one or more events happen, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory or reliability analysis in engineering, duration analysis or duration modelling in economics, and event history analysis in sociology. Survival analysis attempts to answer questions such as: what is the proportion of a population which will survive past a certain time? Of those that survive, at what rate will they die or fail? Can multiple causes of death or failure be taken into account? How do particular circumstances or characteristics increase or decrease the probability of survival?

To answer such questions, it is necessary to define "lifetime". In the case of biological survival, death is unambiguous, but for mechanical reliability, failure may not be well-defined, for there may well be mechanical systems in which failure is partial, a matter of degree, or not otherwise localized in time. Even in biological problems, some events (for example, heart attack or other organ failure) may have the same ambiguity. The theory outlined below assumes well-defined events at specific times; other cases may be better treated by models which explicitly account for ambiguous events.

More generally, survival analysis involves the modelling of time to event data; in this context, death or failure is considered an "event" in the survival analysis literature – traditionally only a single event occurs for each subject, after which the organism or mechanism is dead or broken. Recurring event or repeated event models relax that assumption. The study of recurring events is relevant in systems reliability, and in many areas of social sciences and medical research.

1 General formulation
- 1.1 Survival function
- 1.2 Lifetime distribution function and event density
- 1.3 Hazard function and cumulative hazard function
- 1.4 Quantities derived from the survival distribution
2 Censoring
3 Fitting parameters to data
4 Non-parametric estimation
5 Distributions used in survival analysis
6 See also
7 References
8 Further reading
9 External links

General formulation

Survival function

The object of primary interest is the survival function, conventionally denoted S, which is defined as

where t is some time, T is a random variable denoting the time of death, and "Pr" stands for probability. That is, the survival function is the probability that the time of death is later than some specified time t. The survival function is also called the survivor function or survivorship function in problems of biological survival, and the reliability function in mechanical survival problems. In the latter case, the reliability function is denoted R(t).

Usually one assumes S(0) = 1, although it could be less than 1 if there is the possibility of immediate death or failure.

The survival function must be non-increasing: S(u) ≤ S(t) if u ≥ t. This property follows directly because T>u implies T>t. This reflects the notion that survival to a later age is only possible if all younger ages are attained. Given this property, the lifetime distribution function and event density (F and f below) are well-defined.

The survival function is usually assumed to approach zero as age increases without bound, i.e., S(t) → 0 as t → ∞, although the limit could be greater than zero if eternal life is possible. For instance, we could apply survival analysis to a mixture of stable and unstable carbon isotopes; unstable isotopes would decay sooner or later, but the stable isotopes would last indefinitely.

Lifetime distribution function and event density

Related quantities are defined in terms of the survival function.

The lifetime distribution function, conventionally denoted F, is defined as the complement of the survival function,

If F is differentiable then the derivative, which is the density function of the lifetime distribution, is conventionally denoted f,

The function f is sometimes called the event density; it is the rate of death or failure events per unit time.

The survival function can be expressed in terms of probability distribution and probability density functions

Similarly, a survival event density function can be defined as

Hazard function and cumulative hazard function

The hazard function, conventionally denoted , is defined as the event rate at time t conditional on survival until time t or later (that is, T ≥ t),

Force of mortality is a synonym of hazard function which is used particularly in demography and actuarial science, where it is denoted by . The term hazard rate is another synonym.

The hazard function must be non-negative, λ(t) ≥ 0, and its integral over must be infinite, but is not otherwise constrained; it may be increasing or decreasing, non-monotonic, or discontinuous. An example is the bathtub curve hazard function, which is large for small values of t, decreasing to some minimum, and thereafter increasing again; this can model the property of some mechanical systems to either fail soon after operation, or much later, as the system ages.

The hazard function can alternatively be represented in terms of the cumulative hazard function, conventionally denoted :

so transposing signs and exponentiating

or differentiating (with the chain rule)

The name "cumulative hazard function" is derived from the fact that

which is the "accumulation" of the hazard over time.

From the definition of , we see that it increases without bound as t tends to infinity (assuming that S(t) tends to zero). This implies that must not decrease too quickly, since, by definition, the cumulative hazard has to diverge. For example, is not the hazard function of any survival distribution, because its integral converges to 1.

Quantities derived from the survival distribution

Future lifetime at a given time is the time remaining until death, given survival to age . Thus, it is in the present notation. The expected future lifetime is the expected value of future lifetime. The probability of death at or before age , given survival until age , is just

Therefore the probability density of future lifetime is

and the expected future lifetime is

where the second expression is obtained using integration by parts.

For , that is, at birth, this reduces to the expected lifetime.

In reliability problems, the expected lifetime is called the mean time to failure, and the expected future lifetime is called the mean residual lifetime.

As the probability of an individual surviving until age t or later is S(t), by definition, the expected number of survivors at age t out of an initial population of n newborns is n × S(t), assuming the same survival function for all individuals. Thus the expected proportion of survivors is S(t). If the survival of different individuals is independent, the number of survivors at age t has a binomial distribution with parameters n and S(t), and the variance of the proportion of survivors is S(t) × (1-S(t))/n.

The age at which a specified proportion of survivors remain can be found by solving the equation S(t) = q for t, where q is the quantile in question. Typically one is interested in the median lifetime, for which q = 1/2, or other quantiles such as q = 0.90 or q = 0.99.

One can also make more complex inferences from the survival distribution. In mechanical reliability problems, one can bring cost (or, more generally, utility) into consideration, and thus solve problems concerning repair or replacement. This leads to the study of renewal theory and reliability theory of ageing and longevity.

Censoring

Censoring is a form of missing data problem which is common in survival analysis. Ideally, both the birth and death dates of a subject are known, in which case the lifetime is known.

If it is known only that the date of death is after some date, this is called right censoring. Right censoring will occur for those subjects whose birth date is known but who are still alive when they are lost to follow-up or when the study ends.

If a subject's lifetime is known to be less than a certain duration, the lifetime is said to be left-censored.

It may also happen that subjects with a lifetime less than some threshold may not be observed at all: this is called truncation. Note that truncation is different from left censoring, since for a left censored datum, we know the subject exists, but for a truncated datum, we may be completely unaware of the subject. Truncation is also common. In a so-called delayed entry study, subjects are not observed at all until they have reached a certain age. For example, people may not be observed until they have reached the age to enter school. Any deceased subjects in the pre-school age group would be unknown. Left-truncated data are common in actuarial work for life insurance and pensions.^[1]

We generally encounter right-censored data. Left-censored data can occur when a person's survival time becomes incomplete on the left side of the follow-up period for the person. As an example, we may follow up a patient for any infectious disorder from the time of his or her being tested positive for the infection. We may never know the exact time of exposure to the infectious agent.^[2]

Fitting parameters to data

Survival models can be usefully viewed as ordinary regression models in which the response variable is time. However, computing the likelihood function (needed for fitting parameters or making other kinds of inferences) is complicated by the censoring. The likelihood function for a survival model, in the presence of censored data, is formulated as follows. By definition the likelihood function is the conditional probability of the data given the parameters of the model. It is customary to assume that the data are independent given the parameters. Then the likelihood function is the product of the likelihood of each datum. It is convenient to partition the data into four categories: uncensored, left censored, right censored, and interval censored. These are denoted "unc.", "l.c.", "r.c.", and "i.c." in the equation below.

L(\theta) = \prod_{T_i\in unc.} \Pr(T = T_i\mid\theta) \prod_{i\in l.c.} \Pr(T < T_i\mid\theta) \prod_{i\in r.c.} \Pr(T > T_i\mid\theta) \prod_{i\in i.c.} \Pr(T_{i,l} < T < T_{i,r}\mid\theta) .

For uncensored data, with equal to the age at death, we have

For left-censored data, such that the age at death is known to be less than , we have

For right-censored data, such that the age at death is known to be greater than , we have

For an interval censored datum, such that the age at death is known to be less than and greater than , we have

\Pr(T_{i,l} < T < T_{i,r}\mid\theta) = S(T_{i,l}\mid\theta) - S(T_{i,r}\mid\theta) .

An important application where interval-censored data arises is current status data, where an event is known not to have occurred before an observation time and to have occurred before the next observation time.

Non-parametric estimation

The Kaplan-Meier estimator can be used to estimate the survival function. The Nelson–Aalen estimator can be used to provide a non-parametric estimate of the cumulative hazard rate function.

Distributions used in survival analysis

Exponential distribution
Weibull distribution
Log-logistic distribution
Gamma distribution
Exponential-logarithmic distribution

References

^ Richards, S. J. (2012). "A handbook of parametric survival models for actuarial use". Scandinavian Actuarial Journal 2012 (4): 233–257. doi:10.1080/03461238.2010.506688.
^ Singh, R.; Mukhopadhyay, K. (2011). "Survival analysis in clinical trials: Basics and must know areas". Perspect Clin Res 2 (4): 145–148. doi:10.4103/2229-3485.86872.

External links

Therneau, Terry. "A Package for Survival Analysis in S". At: http://mayoresearch.mayo.edu/mayo/research/biostat/therneau.cfm
"Engineering Statistics Handbook". NIST/SEMATEK.
SOCR, Survival analysis applet and interactive learning activity.
Survival/Failure Time Analysis @ Statistics' Textbook Page
Survival Analysis in R
Lifelines, a Python package for survival analysis
Survival Analysis in NAG Fortran Library

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. 生物統計学および疫学に関する一般用語集 glossary of common biostatistical and epidemiological terms
2. 成人における抗癌剤の投与 dosing of anticancer agents in adults
3. 嚢胞性線維症：肺疾患治療の概要 cystic fibrosis overview of the treatment of lung disease
4. 乳癌Ⅳ期に対する乳房外科手術の役割 role of breast surgery for stage iv breast cancer
5. 去勢抵抗性前立腺癌：アンドロゲンの作用経路を標的する療法 castration resistant prostate cancer treatments targeting the androgen pathway

English Journal

Prognostic value of PCT in septic emergency patients.

Peschanski N1, Chenevier-Gobeaux C2, Mzabi L3, Lucas R1, Ouahabi S4, Aquilina V1, Brunel V5, Lefevre G4, Ray P3,6.
Annals of intensive care.Ann Intensive Care.2016 Dec;6(1):47. doi: 10.1186/s13613-016-0146-4. Epub 2016 May 21.
BACKGROUND: An accurate assessment of septic patients at risk for poor clinical outcomes is challenging for clinicians in the emergency department (ED).OBJECTIVES: We aimed to evaluate the prognostic value of procalcitonin (PCT) in septic patients in the ED for predicting death.RESULTS: In a retrosp
PMID 27207179

Multiple organ dysfunction syndrome in critically ill children: clinical value of two lists of diagnostic criteria.

Villeneuve A1, Joyal JS1, Proulx F1, Ducruet T1, Poitras N1, Lacroix J2.
Annals of intensive care.Ann Intensive Care.2016 Dec;6(1):40. doi: 10.1186/s13613-016-0144-6. Epub 2016 Apr 29.
BACKGROUND: Two sets of diagnostic criteria of paediatric multiple organ dysfunction syndrome (MODS) were published by Proulx in 1996 and by Goldstein in 2005. We hypothesized that this changes the epidemiology of MODS. Thus, we determined the epidemiology of MODS, according to these two sets of dia
PMID 27130424

Measuring total liver function on sulfur colloid SPECT/CT for improved risk stratification and outcome prediction of hepatocellular carcinoma patients.

Bowen SR1,2, Chapman TR3, Borgman J4, Miyaoka RS5, Kinahan PE5, Liou IW6, Sandison GA3, Vesselle HJ5, Nyflot MJ3, Apisarnthanarax S3.
EJNMMI research.EJNMMI Res.2016 Dec;6(1):57. doi: 10.1186/s13550-016-0212-9. Epub 2016 Jun 27.
BACKGROUND: Assessment of liver function is critical in hepatocellular carcinoma (HCC) patient management. We evaluated parameters of [(99m)Tc] sulfur colloid (SC) SPECT/CT liver uptake for association with clinical measures of liver function and outcome in HCC patients.METHODS: Thirty patients with
PMID 27349530

Japanese Journal

Beyond the hazard ratio : がん生存時間解析の新しい方法についての提言

堀口みき,朴慶純,三上剛史 [他]
腫瘍内科 = Clinical oncology 15(4), 402-406, 2015-04
NAID 40020458447

Esophageal malignant melanoma : analysis of 134 cases collected by the Japan Esophageal Society

Makuuchi Hiroyasu,Takubo Kaiyo,Yanagisawa Akio [他]
Esophagus : official journal of the Japan Esophageal Society 12(2), 158-169, 2015-04
NAID 40020440597

Inflammation-based scoring is a useful prognostic predictor of pulmonary resection for elderly patients with clinical stage I non-small-cell lung cancer

Miyazaki Takuro,Yamasaki Naoya,Tsuchiya Tomoshi,Matsumoto Keitaro,Kunizaki Masaki,Taniguchi Daisuke,Nagayasu Takeshi
European Journal of Cardio-Thoracic Surgery 47(4), e140-e145, 2015-04
… Overall survival and disease-specific 5-year survival rates were 55.5 and 70.0%, respectively. … Diseasespecific 5-year survival was significantly longer with GPS 0 than with GPS 1-2. … Overall 5-year survival was significantly longer with GPS 0 than with GPS 1-2. … Both the NLR (median value = 1.9) and the PLR (median value = 117) were not correlated with disease-specific and overall 5-year survival. …
NAID 120005603875

「生存分析」

　　[★]

英: survival analysis、survival analyses
関: 生存率分析

「生存率分析」

　　[★]

英: survival analysis、survival analyses
関: 生存分析

「survival analyses」

　　[★]

生存率分析、生存分析

関: survival analysis

「Kaplan-Meier survival analysis」

　　[★]

カプラン・マイヤー生存分析、Kaplan-Meier生存分析

「analysis」

　　[★]

n.

解析、分析、解析法、分析法

関: anal、analyse、analyses、analytical、analyze、assay、dissect、-metry、solve

「survival」

　　[★]

n.

生存、生存時間、残存、(移植片などの)生着

関: engraft、engraftment、exist、living、relic、remain、survive

[1] Richards, S. J. (2012). "A handbook of parametric survival models for actuarial use". Scandinavian Actuarial Journal 2012 (4): 233–257. doi:10.1080/03461238.2010.506688.

[2] Singh, R.; Mukhopadhyay, K. (2011). "Survival analysis in clinical trials: Basics and must know areas". Perspect Clin Res 2 (4): 145–148. doi:10.4103/2229-3485.86872.

リンク元	「生存分析」「生存率分析」「survival analyses」
拡張検索	「Kaplan-Meier survival analysis」
関連記事	「analysis」「survival」

匿名

検索

案内

案内

survival analysis