同定可能な、同定できる

WordNet

capable of being identified
impossible to identify

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2015/10/20 17:04:44」(JST)

wiki en

In statistics, identifiability is a property which a model must satisfy in order for precise inference to be possible. We say that the model is identifiable if it is theoretically possible to learn the true value of this model’s underlying parameter after obtaining an infinite number of observations from it. Mathematically, this is equivalent to saying that different values of the parameter must generate different probability distributions of the observable variables. Usually the model is identifiable only under certain technical restrictions, in which case the set of these requirements is called the identification conditions.

A model that fails to be identifiable is said to be non-identifiable or unidentifiable; two or more parametrizations are observationally equivalent. In some cases, even though a model is non-identifiable, it is still possible to learn the true values of a certain subset of the model parameters. In this case we say that the model is partially identifiable. In other cases it may be possible to learn the location of the true parameter up to a certain finite region of the parameter space, in which case the model is set identifiable.

Aside from strictly theoretical exploration of the model properties, Identifiability can be referred to in a wider scope when a model is tested with relation of experimental data sets. Usually these tests of Identifiability Analysis are applied when the model fitting of experimental data obtained and serve the detection of non-identifiable and sloppy parameters.^[1]

Definition

Let ℘ = {P_θ: θ∈Θ} be a statistical model where the parameter space Θ is either finite- or infinite-dimensional. We say that ℘ is identifiable if the mapping θ ↦ P_θ is one-to-one:^[2]

P_{\theta_1}=P_{\theta_2} \quad\Rightarrow\quad \theta_1=\theta_2 \quad\ \text{for all } \theta_1,\theta_2\in\Theta.

This definition means that distinct values of θ should correspond to distinct probability distributions: if θ₁≠θ₂, then also P_θ₁≠P_θ₂.^[3] If the distributions are defined in terms of the probability density functions (pdfs), then two pdfs should be considered distinct only if they differ on a set of non-zero measure (for example two functions ƒ₁(x)=1_0≤x<1 and ƒ₂(x)=1_0≤x≤1 differ only at a single point x=1 — a set of measure zero — and thus cannot be considered as distinct pdfs).

Identifiability of the model in the sense of invertibility of the map θ ↦ P_θ is equivalent to being able to learn the model’s true parameter if the model can be observed indefinitely long. Indeed, if {X_t}⊆S is the sequence of observations from the model, then by the strong law of large numbers,

\frac{1}{T} \sum_{t=1}^T \mathbf{1}_{\{X_t\in A\}} \ \xrightarrow{a.s.}\ \operatorname{Pr}[X_t\in A],

for every measurable set A⊆S (here 1_{…} is the indicator function). Thus with an infinite number of observations we will be able to find the true probability distribution P₀ in the model, and since the identifiability condition above requires that the map θ ↦ P_θ be invertible, we will also be able to find the true value of the parameter which generated given distribution P₀.

Examples

Example 1

Let ℘ be the normal location-scale family:

\mathcal{P} = \Big\{\ f_\theta(x) = \tfrac{1}{\sqrt{2\pi}\sigma} e^{ -\frac{1}{2\sigma^2}(x-\mu)^2 }\ \Big|\ \theta=(\mu,\sigma): \mu\in\mathbb{R}, \,\sigma\!>0 \ \Big\}.

Then

\begin{align} f_{\theta_1}=f_{\theta_2}\\ &\Leftrightarrow\ \tfrac{1}{\sqrt{2\pi}\sigma_1}e^{ -\frac{1}{2\sigma_1^2}(x-\mu_1)^2 } = \tfrac{1}{\sqrt{2\pi}\sigma_2}e^{ -\frac{1}{2\sigma_2^2}(x-\mu_2)^2 } \\ &\Leftrightarrow\ \tfrac{1}{\sigma_1^2}(x-\mu_1)^2 + \ln \sigma_1 = \tfrac{1}{\sigma_2^2}(x-\mu_2)^2 + \ln \sigma_2 \\ &\Leftrightarrow\ x^2\big(\tfrac{1}{\sigma_1^2}-\tfrac{1}{\sigma_2^2}\big) - 2x\big(\tfrac{\mu_1}{\sigma_1^2}-\tfrac{\mu_2}{\sigma_2^2}\big) + \big(\tfrac{\mu_1^2}{\sigma_1^2}-\tfrac{\mu_2^2}{\sigma_2^2}+\ln\sigma_1-\ln\sigma_2\big) = 0 \\ \end{align}

This expression is equal to zero for almost all x only when all its coefficients are equal to zero, which is only possible when |σ₁| = |σ₂| and μ₁ = μ₂. Since in the scale parameter σ is restricted to be greater than zero, we conclude that the model is identifiable: ƒ_θ₁=ƒ_θ₂ ⇔ θ₁=θ₂.

Example 2

Let ℘ be the standard linear regression model:

y = \beta'x + \varepsilon, \quad \operatorname{E}[\,\varepsilon|x\,]=0

(where ′ denotes matrix transpose). Then the parameter β is identifiable if and only if the matrix E[xx′] is invertible. Thus, this is the identification condition in the model.

Example 3

Suppose ℘ is the classical errors-in-variables linear model:

\begin{cases} y = \beta x^* + \varepsilon, \\ x = x^* + \eta, \end{cases}

where (ε,η,x*) are jointly normal independent random variables with zero expected value and unknown variances, and only the variables (x,y) are observed. Then this model is not identifiable,^[4] only the product βσ²_∗ is (where σ²_∗ is the variance of the latent regressor x*). This is also an example of set identifiable model: although the exact value of β cannot be learned, we can guarantee that it must lie somewhere in the interval (β_yx, 1÷β_xy), where β_yx is the coefficient in OLS regression of y on x, and β_xy is the coefficient in OLS regression of x on y.^[5]

If we abandon the normality assumption and require that x* were not normally distributed, retaining only the independence condition ε⊥η⊥x*, then the model becomes identifiable.^[4]

Software

In the case of parameter estimation in partially observed dynamical systems, the profile likelihood can be also used for structural and practical identifiability analysis.^[6] An implementation of the Profile Likelihood Approach is available in the MATLAB Toolbox PottersWheel.

Notes

^ Raue, A.; Kreutz, C.; Maiwald, T.; Bachmann, J.; Schilling, M.; Klingmuller, U.; Timmer, J. (2009-08-01). "Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood". Bioinformatics 25 (15): 1923–1929. doi:10.1093/bioinformatics/btp358.
^ Lehmann & Casella 1998, Definition 1.5.2
^ van der Vaart 1998, p. 62
^ ^a ^b Reiersøl 1950
^ Casella & Berger 2001, p. 583
^ Raue, A; Kreutz, C; Maiwald, T; Bachmann, J; Schilling, M; Klingmüller, U; Timmer, J (2009), "Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood", Bioinformatics 25 (15): 1923–9, doi:10.1093/bioinformatics/btp358, PMID 19505944.

References

Casella, George; Berger, Roger L. (2002), Statistical Inference (2nd ed.), ISBN 0-534-24312-6, LCCN 2001025794
Hsiao, Cheng (1983), Identification, Handbook of Econometrics, Vol. 1, Ch.4, North-Holland Publishing Company
Lehmann, E. L.; Casella, G. (1998), Theory of point estimation (2nd ed.), Springer, ISBN 0-387-98502-6
Reiersøl, Olav (1950), "Identifiability of a linear relation between variables which are subject to error", Econometrica (The Econometric Society) 18 (4): 375–389, doi:10.2307/1907835, JSTOR 1907835
van der Vaart, A.W. (1998), Asymptotic Statistics, Cambridge University Press, ISBN 978-0-521-49603-2

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. エボラウイルス病およびマールブルグウイルス病の臨床的特徴および診断 clinical manifestations and diagnosis of ebola virus disease
2. アナフィラキシー：診断の確定および誘発因子の判定 anaphylaxis confirming the diagnosis and determining the triggers
3. 破傷風 tetanus
4. 成人における高血圧の初期評価 initial evaluation of the hypertensive adult
5. 小児における過敏性血管炎 hypersensitivity vasculitis in children

English Journal

PMID 28510194

A confidence building exercise in data and identifiability: Modeling cancer chemotherapy as a case study.

Eisenberg MC1, Jain HV2.
Journal of theoretical biology.J Theor Biol.2017 Oct 27;431:63-78. doi: 10.1016/j.jtbi.2017.07.018. Epub 2017 Jul 19.
PMID 28733187

Patient physiological status during emergency care and rapid response team or cardiac arrest team activation during early hospital admission.

Considine J1, Jones D, Pilcher D, Currey J.
European journal of emergency medicine : official journal of the European Society for Emergency Medicine.Eur J Emerg Med.2017 Oct;24(5):359-365. doi: 10.1097/MEJ.0000000000000375.
PMID 26836783

Japanese Journal

The Complex Genetic Basis of Congenital Heart Defects

Circulation Journal, 2017
NAID 130005530323

Direction finding of multiple targets using coprime array in MIMO radar

IEICE Communications Express 6(3), 115-119, 2017
NAID 130005402001

GC－MS/MS を用いたカチノン類の包括的検出と構造推定

日本法科学技術学会誌, 2017
NAID 130005330530

Related Pictures

Personally Identifiable Information FAQ’s | Vision Payment Solutions | Vision Serious consequences for mishandling personal ID info > Scott Air Force Base > Article Who Is the Identifiable Victim? Caste and Charitable Giving in Modern India | r.i.c.e. The Stark Reality of Protecting Personally Identifiable Information PPT - Personally Identifiable Information (PII) PowerPoint Presentation - ID:303148 List of Personally Identifiable Information (PII) Challenges of Managing Personally Identifiable Information