collocation

WordNet

a grouping of words in a sentence

PrepTutorEJDIC

〈C〉〈U〉配置,配列 / 〈U〉語の配置,連語法;〈C〉連語(ある語の他の語との結びつき方)

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2018/01/24 05:07:24」(JST)

wiki en

[Wiki en表示]

In corpus linguistics, a collocation is a sequence of words or terms that co-occur more often than would be expected by chance. In phraseology, collocation is a sub-type of phraseme. An example of a phraseological collocation, as propounded by Michael Halliday,^[1] is the expression strong tea. While the same meaning could be conveyed by the roughly equivalent powerful tea, this expression is considered excessive and awkward by English speakers. Conversely, the corresponding expression in technology, powerful computer is preferred over strong computer. Phraseological collocations should not be confused with idioms, where an idiom's meaning is derived from its convention as a stand-in for something else while collocation is a mere popular composition.

There are about six main types of collocations: adjective+noun, noun+noun (such as collective nouns), verb+noun, adverb+adjective, verbs+prepositional phrase (phrasal verbs), and verb+adverb.

Collocation extraction is a computational technique that finds collocations in a document or corpus, using various computational linguistics elements resembling data mining.

Expanded definition

Collocations are partly or fully fixed expressions that become established through repeated context-dependent use. Such terms as 'crystal clear', 'middle management', 'nuclear family', and 'cosmetic surgery' are examples of collocated pairs of words.

Collocations can be in a syntactic relation (such as verb–object: 'make' and 'decision'), lexical relation (such as antonymy), or they can be in no linguistically defined relation. Knowledge of collocations is vital for the competent use of a language: a grammatically correct sentence will stand out as awkward if collocational preferences are violated. This makes collocation an interesting area for language teaching. Recently, a mobile version of Collocation Dictionary was published on Google Play.^[2]

Corpus linguists specify a key word in context (KWIC) and identify the words immediately surrounding them. This gives an idea of the way words are used.

The processing of collocations involves a number of parameters, the most important of which is the measure of association, which evaluates whether the co-occurrence is purely by chance or statistically significant. Due to the non-random nature of language, most collocations are classed as significant, and the association scores are simply used to rank the results. Commonly used measures of association include mutual information, t scores, and log-likelihood.^[3]^[4]

Rather than select a single definition, Gledhill^[5] proposes that collocation involves at least three different perspectives: (i) cooccurrence, a statistical view, which sees collocation as the recurrent appearance in a text of a node and its collocates,^[6]^[7]^[8] (ii) construction, which sees collocation either as a correlation between a lexeme and a lexical-grammatical pattern,^[9] or as a relation between a base and its collocative partners^[10] and (iii) expression, a pragmatic view of collocation as a conventional unit of expression, regardless of form.^[11]^[12] It should be pointed out here that these different perspectives contrast with the usual way of presenting collocation in phraseological studies. Traditionally speaking, collocation is explained in terms of all three perspectives at once, in a continuum:

'Free Combination' ↔ 'Bound Collocation' ↔ 'Frozen Idiom'

In dictionaries

In 1933, Harold Palmer's Second Interim Report on English Collocations highlighted the importance of collocation as a key to producing natural-sounding language, for anyone learning a foreign language.^[13] Thus from the 1940s onwards, information about recurrent word combinations became a standard feature of monolingual learner's dictionaries. As these dictionaries became 'less word-centred and more phrase-centred',^[14] more attention was paid to collocation. This trend was supported, from the beginning of the 21st century, by the availability of large text corpora and intelligent corpus-querying software such as Sketch Engine, making possible to provide a more systematic account of collocation in dictionaries. Using these tools, dictionaries such as the Macmillan English Dictionary and the Longman Dictionary of Contemporary English included boxes or panels with lists of frequent collocations.^[15]

There are also a number of specialized dictionaries devoted to describing the frequent collocations in a language.^[16] These include (for Spanish) Redes: Diccionario combinatorio del español contemporaneo (2004), (for French) Le Robert: Dictionnaire des combinaisons de mots (2007), and (for English) the LTP Dictionary of Selected Collocations (1997) and the Macmillan Collocations Dictionary (2010).^[17]

Statistically significant collocation

Student's t-test can be used to determine whether the occurrence of a collocation in a corpus is statistically significant.^[18] For a bigram $w_{1}w_{2}$ , let $P(w_{1})={\frac {\#w_{1}}{N}}$ be the unconditional probability of occurrence of $w_{1}$ in a corpus with size $N$ , and let $P(w_{2})={\frac {\#w_{2}}{N}}$ be the unconditional probability of occurrence of $w_{2}$ in the corpus. Then the t-score for the bigram $w_{1}w_{2}$ is calculated as:

$t={\frac {{\bar {x}}-\mu }{\sqrt {\frac {s^{2}}{N}}}},$

where ${\bar {x}}={\frac {\#w_{i}w_{j}}{N}}$ is the sample mean of the occurrence of $w_{1}w_{2}$ , $\#w_{1}w_{2}$ is the number of occurrences of $w_{1}w_{2}$ , $\mu =P(w_{i})P(w_{j})$ is the probability of $w_{1}w_{2}$ under the null-hypothesis that $w_{1}$ and $w_{2}$ appear independently in the text, and $s^{2}={\bar {x}}(1-{\bar {x}})\approx {\bar {x}}$ is the sample variance. With a large $N$ , the t-test is equivalent to a z-test.

References

^ Halliday, M.A.K., 'Lexis as a Linguistic Level', Journal of Linguistics 2(1) 1966: 57-67
^ [1]
^ Dunning, Ted (1993): "Accurate methods for the statistics of surprise and coincidence". Computational Linguistics 19, 1 (Mar. 1993), 61-74.
^ Dunning, Ted (2008-03-21). "Surprise and Coincidence". blogspot.com. Retrieved 2012-04-09.
^ Gledhill C. (2000): Collocations in Science Writing, Narr, Tübingen
^ Firth J.R. (1957): Papers in Linguistics 1934–1951. Oxford: Oxford University Press.
^ Sinclair J. (1996): "The Search for Units of Meaning", in Textus, IX, 75–106.
^ Smadja F. A & McKeown, K. R. (1990): "Automatically extracting and representing collocations for language generation", Proceedings of ACL'90, 252–259, Pittsburgh, Pennsylvania.
^ Hunston S. & Francis G. (2000): Pattern Grammar — A Corpus-Driven Approach to the Lexical Grammar of English, Amsterdam, John Benjamins
^ Hausmann F. J. (1989): Le dictionnaire de collocations. In Hausmann F.J., Reichmann O., Wiegand H.E., Zgusta L.(eds), Wörterbücher : ein internationales Handbuch zur Lexikographie. Dictionaries. Dictionnaires. Berlin/New-York : De Gruyter. 1010-1019.
^ Moon R. (1998): Fixed Expressions and Idioms, a Corpus-Based Approach. Oxford, Oxford University Press.
^ Frath P. & Gledhill C. (2005): "Free-Range Clusters or Frozen Chunks? Reference as a Defining Criterion for Linguistic Units," in Recherches anglaises et Nord-américaines, vol. 38 :25–43
^ Cowie, A.P., English Dictionaries for Foreign Learners, Oxford University Press 1999:54-56
^ Bejoint, H., The Lexicography of English, Oxford University Press 2010: 318
^ "MED Second Edition - Key features - Macmillan". macmillandictionaries.com.
^ Herbst, T. and Klotz, M. 'Syntagmatic and Phraseological Dictionaries' in Cowie, A.P. (Ed.) The Oxford History of English Lexicography, 2009: part 2, 234-243
^ "Macmillan Collocation Dictionary - How it was written - Macmillan". macmillandictionaries.com.
^ Manning, Chris; Schütze, Hinrich (1999). Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press. pp. 163–166. ISBN 0262133601.

External links

[2] Ozdic Collocation Dictionary
A Small System Storing Spanish Collocations (Igor A. Bolshakov & Sabino Miranda-Jiménez)
Morphological characterization of collocations and semantic relationships in Spanish (Sabino Miranda-Jiménez & Igor A. Bolshakov)
Example of collocations for the word "Surgery"

English Journal

Effect of traditional Chinese cooking methods on fatty acid profiles of vegetable oils.

Cui Y1, Hao P2, Liu B1, Meng X3.
Food chemistry.Food Chem.2017 Oct 15;233:77-84. doi: 10.1016/j.foodchem.2017.04.084. Epub 2017 Apr 19.
PMID 28530614
PMID 28421618

Inconsistencies of interannual variability and trends in long-term satellite leaf area index products.

Jiang C1, Ryu Y1,2,3,4, Fang H5,6, Myneni R7, Claverie M8,9, Zhu Z10.
Global change biology.Glob Chang Biol.2017 Oct;23(10):4133-4146. doi: 10.1111/gcb.13787. Epub 2017 Jul 6.
PMID 28614593

Japanese Journal

異領域のくみあわせを表す「動詞+"上"」と客体との関係について

大東文化大学紀要. 人文科学 (54), 137-155, 2016
NAID 40020763663

Modeling of landslide generated waves in Three Gorges Reservoir, China using SPH method

Japanese Geotechnical Society Special Publication 2(32), 1183-1188, 2016
NAID 130005124014

日本語を母語とする韓国語学習者のコロケーション学習に及ぼす指導の効果 : 文脈指導と非文脈指導の比較から

言語文化と日本語教育 (50), 71-80, 2015-12
NAID 40020703613

Related Pictures

A Phrasal What? – Liv Hambrett Adjective + Noun Collocations | eezeeenglishzone Collocation List – Collocations with 'Get' and 'Take' - English Learn Site Collocation Examples: Types of English Collocations - 7 E S L Collocation lists