n.

困惑

関: confound、embarrass、embarrassment

WordNet

trouble or confusion resulting from complexity
cause to be embarrassed; cause to feel self-conscious (同)abash
the state of being embarrassed (usually by some financial inadequacy); "he is currently suffering financial embarrassments"
some event that causes someone to be embarrassed; "the outcome of the vote was an embarrassment for the liberals"
the shame you feel when your inadequacy or guilt is made public

PrepTutorEJDIC

〈U〉困惑,当惑;〈C〉困ったこと
〈情況など〉'を'『混乱させる』,〈人〉'を'困惑させる,面くらわせる / (…と)…'を'『混同する』《+『名』+『with』+『名』》 / 〈計画・希望・敵など〉'を'破る,くじく(defeat) / 《遠回しに腹立たしさなどを表して》〈神〉が…'を'地獄に落とす
〈人〉‘の'『まごつかせる』,困惑させる,きまり悪がらせる / …‘を'『じゃまする』,妨げる / 〈人・会社など〉‘を'『財政困難にする』
〈U〉困惑,当惑;きまりの悪さ / 〈U〉〈C〉財政困難 / 〈C〉じゃま者,妨害物

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2017/10/23 16:26:05」(JST)

wiki en

[Wiki en表示]

In information theory, perplexity is a measurement of how well a probability distribution or probability model predicts a sample. It may be used to compare probability models. A low perplexity indicates the probability distribution is good at predicting the sample.

Perplexity of a probability distribution

The perplexity of a discrete probability distribution p is defined as

2^{H(p)}=2^{-\sum _{x}p(x)\log _{2}p(x)}

where H(p) is the entropy (in bits) of the distribution and x ranges over events.

Perplexity of a random variable X may be defined as the perplexity of the distribution over its possible values x.

In the special case where p models a fair k-sided die (a uniform distribution over k discrete events), its perplexity is k. A random variable with perplexity k has the same uncertainty as a fair k-sided die, and one is said to be "k-ways perplexed" about the value of the random variable. (Unless it is a fair k-sided die, more than k values will be possible, but the overall uncertainty is no greater because some of these values will have probability greater than 1/k, decreasing the overall value while summing.)

Perplexity is sometimes used as a measure of how hard a prediction problem is. This is not always accurate. If you have two choices, one with probability 0.9, then your chances of a correct guess are 90 percent using the optimal strategy. The perplexity is 2^{−0.9 log₂ 0.9 - 0.1 log₂ 0.1}= 1.38. The inverse of the perplexity (which, in the case of the fair k-sided die, represents the probability of guessing correctly), is 1/1.38 = 0.72, not 0.9.

The perplexity is the exponentiation of the entropy, which is a more clearcut quantity. The entropy is a measure of the expected, or "average", number of bits required to encode the outcome of the random variable, using a theoretical optimal variable-length code, cf. the next section. It can equivalently be regarded as the expected information gain from learning the outcome of the random variable, where information is measured in bits.

Perplexity of a probability model

A model of an unknown probability distribution p, may be proposed based on a training sample that was drawn from p. Given a proposed probability model q, one may evaluate q by asking how well it predicts a separate test sample x₁, x₂, ..., x_N also drawn from p. The perplexity of the model q is defined as

b^{-{\frac {1}{N}}\sum _{i=1}^{N}\log _{b}q(x_{i})}

where $b$ is customarily 2. Better models q of the unknown distribution p will tend to assign higher probabilities q(x_i) to the test events. Thus, they have lower perplexity: they are less surprised by the test sample.

The exponent above may be regarded as the average number of bits needed to represent a test event x_i if one uses an optimal code based on q. Low-perplexity models do a better job of compressing the test sample, requiring few bits per test element on average because q(x_i) tends to be high.

The exponent may also be regarded as a cross-entropy,

H({\tilde {p}},q)=-\sum _{x}{\tilde {p}}(x)\log _{2}q(x)

where ${\tilde {p}}$ denotes the empirical distribution of the test sample (i.e., ${\tilde {p}}(x)=n/N$ if x appeared n times in the test sample of size N).

Perplexity per word

In natural language processing, perplexity is a way of evaluating language models. A language model is a probability distribution over entire sentences or texts.

Using the definition of perplexity for a probability model, one might find, for example, that the average sentence x_i in the test sample could be coded in 190 bits (i.e., the test sentences had an average log-probability of -190). This would give an enormous model perplexity of 2¹⁹⁰ per sentence. However, it is more common to normalize for sentence length and consider only the number of bits per word. Thus, if the test sample's sentences comprised a total of 1,000 words, and could be coded using a total of 7.95 bits per word, one could report a model perplexity of 2^7.95 = 247 per word. In other words, the model is as confused on test data as if it had to choose uniformly and independently among 247 possibilities for each word.

The lowest perplexity that has been published on the Brown Corpus (1 million words of American English of varying topics and genres) as of 1992 is indeed about 247 per word, corresponding to a cross-entropy of log₂247 = 7.95 bits per word or 1.75 bits per letter ^[1] using a trigram model. It is often possible to achieve lower perplexity on more specialized corpora, as they are more predictable.

Again, simply guessing that the next word in the Brown corpus is the word "the" will have an accuracy of 7 percent, not 1/247 = 0.4 percent, as a naive use of perplexity as a measure of predictiveness might lead one to believe. This guess is based on the unigram statistics of the Brown corpus, not on the trigram statistics, which yielded the word perplexity 247. Using trigram statistics would further improve the chances of a correct guess.

References

^ Brown, Peter F.; et al. (March 1992). "An Estimate of an Upper Bound for the Entropy of English" (PDF). Computational Linguistics. 18 (1). Retrieved 2007-02-07.

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. せん妄および錯乱状態の診断 diagnosis of delirium and confusional states
2. 緩和ケアにおける症状の評価へのアプローチ approach to symptom assessment in palliative care

English Journal

Endothelial progenitor cells: current issues on characterization and challenging clinical applications.

Resch T, Pircher A, Kähler CM, Pratschke J, Hilbe W.SourceCenter of Operative Medicine, Department of Visceral, Transplant, and Thoracic Surgery, Medical University Innsbruck, Anichstrasse 35, 6020, Innsbruck, Austria, t.resch@uki.at.
Stem cell reviews.Stem Cell Rev.2012 Sep;8(3):926-39.
Since their discovery about a decade ago, endothelial precursor cells (EPC) have been subjected to intensive investigation. The vision to stimulate respectively suppress a key player of vasculogenesis opened a plethora of clinical applications. However, as research opened deeper insights into EPC bi
PMID 22095429

A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment.

Hawley M, Cunningham S, Green P, Enderby P, Palmer R, Sehgal S, O'Neill P.AbstractA new form of augmentative and alternative communication (AAC) device for people with severe speech impairment the voice-input voice-output communication aid (VIVOCA) is described. The VIVOCA recognizes the disordered speech of the user and builds messages, which are converted into synthetic speech. System development was carried out employing user-centered design and development methods, which identified and refined key requirements for the device. A novel methodology for building small vocabulary, speaker-dependent automatic speech recognizers with reduced amounts of training data, was applied. Experiments showed that this method is successful in generating good recognition performance (mean accuracy 96%) on highly disordered speech, even when recognition perplexity is increased. The selected message-building technique traded off various factors including speed of message construction and range of available message outputs. The VIVOCA was evaluated in a field trial by individuals with moderate to severe dysarthria and confirmed that they can make use of the device to produce intelligible speech output from disordered speech input. The trial highlighted some issues which limit the performance and usability of the device when applied in real usage situations, with mean recognition accuracy of 67% in these circumstances. These limitations will be addressed in future work.
IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.IEEE Trans Neural Syst Rehabil Eng.2012 Aug 3. [Epub ahead of print]
A new form of augmentative and alternative communication (AAC) device for people with severe speech impairment the voice-input voice-output communication aid (VIVOCA) is described. The VIVOCA recognizes the disordered speech of the user and builds messages, which are converted into synthetic speech.
PMID 22875259

Japanese Journal

評点付きレビュー文書を対象としたトピックモデルの構築に関する検討

田村一樹,吉川大弘,古橋武
情報処理学会論文誌 56(3), 1013-1027, 2015-03-15
多くの企業にとって,商品(アイテム)に対するユーザレビューの解析は,商品開発やマーケティングの面で重要な役割を占めている.ユーザが自由に感想を記述することのできるレビューには,商品の長所や魅力だけでなく,不満や改善点などの声が豊富に含まれている.それらは商品開発における重要な情報源であり,大量のレビューに対し,自動で解析を行うことができれば,従来見逃していた有益な知見を得られるようになると期待でき …
NAID 110009884096

相対的和音進行に基づく和音進行解析のための語彙フリー無限Gramモデル

田中一生,井上真郷
情報処理学会研究報告. MPS, 数理モデル化と問題解決研究報告 2015-MPS-102(7), 1-6, 2015-02-24
… ない曲のみを対象とする,2) ハ長調/イ短調になるようにあらかじめ移調させておく,という方法でこの問題を解決している.しかし,この方法には「転調する曲」や「調が未知の曲」に対応できないという問題がある.これらを解決するために「相対的和音進行」という,調を明示的に考慮しないモデルを提案する.実験の結果,頻繁に転調するジャズの和音進行データに対して,従来のモデルよりも低い perplexity を達成した. …
NAID 110009877759

Modifying Existing Analogy-based N-gram Language Model

Meng Tian,Yves Lepage
研究報告自然言語処理（NL） 2014-NL-215(2), 1-4, 2014-01-30
… By investigating the occurrence of different proportional analogies in corpora, this paper describes an approach to increase the performance of existing analogy-based N-gram language models evaluated by perplexity. … The use of suffix arrays for data searching leads to a lesser computation time on text scoring tasks.By investigating the occurrence of different proportional analogies in corpora, this paper describes an approach to increase the performance of existing analogy-based N-gram language models evaluated by perplexity. …
NAID 170000080730

Related Pictures

Perplexity Concept Stock Images - Image: 15995114 Perplexity definition/meaning Perplexity Painting by Claudia Goodell Perplexity - Perplexity [EP] (2016) » CORE RADIO! Perplexity Free Stock Photo - Public Domain Pictures Perplexity by JulieCouronne on DeviantArt Maddthelin - Perplexity.mp3 (7.31 MB) mp3 Download Young Man Model On White. Doubt And Perplexity. Free Empty Side Space Stock Photo

★リンクテーブル★

リンク元	「confound」「embarrass」「困惑」「embarrassment」

「confound」

　　[★]

ct.

(やや古い)(人・事・物)を(間違って)(～と)混同する(confuse)(with,and)
(物・事が)(人)を当惑させる(puzzle)。(be ～ed)(物・事に/～ということに)まごつく、うろたえる、泡を食う(at,by,that節)
～を打ち負かす。～を妨げる
～に反駁する、反論させる
～に恥をかかせる、赤面させる
～を悪化させる
(予想などに)反する

関: confuse、confusion、derange、derangement、disarray、disorganized、disorient、disrupt、disruption、embarrass、embarrassment、perplexity、perturbation、upset

「embarrass」

　　[★]

em-(中に) + -barass(障害物、横木=bar) = 中に障害物を置く

vt.

(人)を恥ずかしがらせる/当惑させる。(人)を辱めて(～)させる(shame)(into doing)。(be ～ed)(人が)(人前で)恥ずかしい思いをする、(～で)まごつく(at, with, by, about, that節)
(正式)(be ～ed)(人が)財政困難になる、借金を負う
(問題など)を複雑にする、こじらせる。(政府・行動など)を妨害する
(消化・肺など)の機能を損なう、～に障害を来す。