haplotype

WordNet

(genetics) a combination of alleles (for different genes) that are located closely together on the same chromosome and that tend to be inherited together

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2013/02/28 17:18:51」(JST)

wiki en

A haplotype (from the Greek: ἁπλοῦς, haploûs, "onefold, single, simple") in genetics is a combination of alleles (DNA sequences) at adjacent locations (loci) on a chromosome that are transmitted together. A haplotype may be one locus, several loci, or an entire chromosome depending on the number of recombination events that have occurred between a given set of loci.

A second meaning of the term haplotype is a set of single-nucleotide polymorphisms (SNPs) on a single chromosome of a chromosome pair that are associated statistically. It is thought that these associations, and the identification of a few alleles of a haplotype sequence, can unambiguously identify all other polymorphic sites in its region. Such information is very valuable for investigating the genetics of common diseases, and has been investigated for the human species by the International HapMap Project.^[1]^[2]

Many genetic testing companies use the term 'haplotype' to refer to an individual collection of short tandem repeat (STR) allele mutations within a genetic segment, while using the term 'haplogroup' to refer to the SNP/unique-event polymorphism (UEP) mutations which represents the clade to which a collection of potential haplotypes belong (the term clade referring to a set of people sharing a common ancestor).^[3]

Haplotype resolution

An organism's genotype may not define its haplotype uniquely. For example, consider a diploid organism and two bi-allelic loci (such as SNPs) on the same chromosome. Assume the first locus has alleles A or T and the second locus G or C. Both loci, then, have three possible genotypes: (AA, AT, and TT) and (GG, GC, and CC), respectively. For a given individual, there are nine possible configurations (haplotypes) at these two loci (shown in the Punnett square below). For individuals who are homozygous at one or both loci, the haplotypes are unambiguous - meaning that there is not any differentiation of haplotype T1T2 vs haplotype T2T1; where T1 and T2 are labeled to show that they are the same locus, but labeled as such to show it doesn't matter which order you consider them in, the end result is two T loci. For individuals heterozygous at both loci, the gametic phase is ambiguous - in these cases, you don't know which haplotype you have, e.g., TA vs AT.

	AA	AT	TT
GG	AG AG	AG TG	TG TG
GC	AG AC	AG TC or AC TG	TG TC
CC	AC AC	AC TC	TC TC

The only unequivocal method of resolving phase ambiguity is by sequencing. However, it is possible to estimate the probability of a particular haplotype when phase is ambiguous using a sample of individuals.

Given the genotypes for a number of individuals, the haplotypes can be inferred by haplotype resolution or haplotype phasing techniques. These methods work by applying the observation that certain haplotypes are common in certain genomic regions. Therefore, given a set of possible haplotype resolutions, these methods choose those that use fewer different haplotypes overall. The specifics of these methods vary - some are based on combinatorial approaches (e.g., parsimony), whereas others use likelihood functions based on different models and assumptions such as the Hardy-Weinberg principle, the coalescent theory model, or perfect phylogeny. These models are combined with optimization algorithms such as expectation-maximization algorithm (EM), Markov chain Monte Carlo (MCMC), or hidden Markov models (HMM).

Microfluidic whole genome haplotyping is a technique for the physical separation of individual chromosomes from a metaphase cell followed by direct resolution of the haplotype for each allele.

Y-DNA haplotypes from genealogical DNA tests

Main article: Genealogical DNA test

Unlike other chromosomes, Y chromosomes do not come in pairs. Every human male has only one copy of that chromosome. This means that there is not any chance variation of which copy is inherited, and also (for most of the chromosome) not any shuffling between copies by recombination; so, unlike autosomal haplotypes, there is effectively not any randomisation of the Y-chromosome haplotype between generations. A human male should largely share the same Y chromosome as his father, give or take a few mutations; thus Y chromosomes tend to passed largely intact from father to son, with a small but accumulating number of mutations that can serve to differentiate male lineages. In particular, the Y-DNA represented as the numbered results of a Y-DNA genealogical DNA test should match, except for mutations.

UEP results (SNP results)

Unique-event polymorphisms (UEPs) such as SNPs represent haplogroups. STRs represent haplotypes. The results that comprise the full Y-DNA haplotype from the Y chromosome DNA test can be divided into two parts: the results for UEPs, sometimes loosely called the SNP results as most UEPs are single-nucleotide polymorphisms, and the results for microsatellite short tandem repeat sequences (Y-STRs).

The UEP results represent the inheritance of events it is believed can be assumed to have happened only once in all human history. These can be used to identify the individual's Y-DNA haplogroup, his place in the "family tree" of the whole of humanity. Different Y-DNA haplogroups identify genetic populations that are often distinctly associated with particular geographic regions; their appearance in more recent populations located in different regions represents the migrations tens of thousands of years ago of the direct patrilineal ancestors of current individuals.

Y-STR haplotypes

Genetic results also include the Y-STR haplotype, the set of results from the Y-STR markers tested.

Unlike the UEPs, the Y-STRs mutate much more easily, which allows them to be used to distinguish recent genealogy. But it also means that, rather than the population of descendants of a genetic event all sharing the same result, the Y-STR haplotypes are likely to have spread apart, to form a cluster of more or less similar results. Typically, this cluster will have a definite most probable center, the modal haplotype (presumably similar to the haplotype of the original founding event), and also a haplotype diversity — the degree to which it has become spread out. The further in the past the defining event occurred, and the more that subsequent population growth occurred early, the greater the haplotype diversity will be for a particular number of descendants. However, if the haplotype diversity is smaller for a particular number of descendants, this may indicate a more recent common ancestor, or a recent population expansion.

It is important to note that, unlike for UEPs, two individuals with a similar Y-STR haplotype may not necessarily share a similar ancestry. Y-STR events are not unique. Instead, the clusters of Y-STR haplotype results inherited from different events and different histories tend to overlap.

In most cases, it is a long time since the haplogroups' defining events, so typically the cluster of Y-STR haplotype results associated with descendents of that event has become rather broad. These results will tend to significantly overlap the (similarly broad) clusters of Y-STR haplotypes associated with other haplogroups. This makes it impossible for researchers to predict with absolute certainty to which Y-DNA haplogroup a Y-STR haplotype would point. If the UEPs are not tested, the Y-STRs may be used only to predict probabilities for haplogroup ancestry, but not certainties.

A similar scenario exists in trying to evaluate whether shared surnames indicate shared genetic ancestry. A cluster of similar Y-STR haplotypes may indicate a shared common ancestor, with an identifiable modal haplotype, but only if the cluster is sufficiently distinct from what may have happened by chance from different individuals who historically adopted the same name independently. Many names were adopted from common occupations, for instance, or were associated with habitation of particular sites. More extensive haplotype typing is needed to establish genetic genealogy. Commercial DNA-testing companies now offer their customers testing of more numerous sets of markers to improve definition of their genetic ancestry. The number of sets of markers tested has increased from 12 during the early years to 111 more recently.

Establishing plausible relatedness between different surnames data-mined from a database is significantly more difficult. The researcher must establish that the very nearest member of the population in question, chosen purposely from the population for that reason, would be unlikely to match by accident. This is more than establishing that a randomly selected member of the population is unlikely to have such a close match by accident. Because of the difficulty, establishing relatedness between different surnames as in such a scenario is likely to be impossible, except in special cases where there is specific information to drastically limit the size of the population of candidates under consideration.

Diversity

Haplotype diversity is a measure of the uniqueness of a particular haplotype in a given population. The haplotype diversity (H) is computed as:^[4]

where is the (relative) haplotype frequency of each haplotype in the sample and is the sample size. Haplotype diversity is given for each sample.

Software

FAMHAP^[5] — FAMHAP is a software for single-marker analysis and, in particular, joint analysis of unphased genotype data from tightly linked markers (haplotype analysis).
Fugue — EM based haplotype estimation and association tests in unrelated and nuclear families.
HPlus^[6] — A software package for imputation and testing of haplotypes in association studies using a modified method that incorporates the expectation-maximization algorithm and a Bayesian method known as progressive ligation.
HaploBlockFinder — A software package for analyses of haplotype block structure.

Haploscribe^[7] — Reconstruction of whole-chromosome haplotypes based on all genotyped positions in a nuclear family, including rare variants.

Haploview^[8] — Visualisation of linkage disequilibrium, haplotype estimation and haplotype tagging (Homepage).
HelixTree — Haplotype analysis software - Haplotype Trend Regression (HTR), haplotypic association tests, and haplotype frequency estimation using both the expectation-maximization (EM) algorithm and composite haplotype method (CHM).
PHASE — A software for haplotype reconstruction, and recombination rate estimation from population data.
SNPHAP — EM based software for estimating haplotype frequencies from unphased genotypes.
WHAP^[9] — haplotype based association analysis.

References

^ The International HapMap Consortium (2003). "The International HapMap Project". Nature 426 (6968): 789–796. doi:10.1038/nature02168. PMID 14685227. http://www.nature.com/nature/journal/v426/n6968/pdf/nature02168.pdf.
^ The International HapMap Consortium (2005). "A haplotype map of the human genome". Nature 437 (7063): 1299–1320. doi:10.1038/nature04226. PMC 1880871. PMID 16255080. http://www.nature.com/nature/journal/v437/n7063/pdf/nature04226.pdf.
^ Facts & Genes. Volume 7, Issue 3
^ Masatoshi Nei and Fumio Tajima, "DNA polymorphism detectable by restriction endonucleases", Genetics 97:145 (1981)
^ Becker T., Knapp M. (2004). "Maximum-likelihood estimation of haplotype frequencies in nuclear families". Genetic Epidemiology 27 (1): 21–32. doi:10.1002/gepi.10323. PMID 15185400.
^ Li S.S., Khalid N., Carlson C., Zhao L.P. (2003). "Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms". Biostatistics 4 (4): 513–522. doi:10.1093/biostatistics/4.4.513. PMID 14557108. http://biostatistics.oxfordjournals.org/cgi/content/abstract/4/4/513.
^ Roach J.C., Glusman G., Hubley R., Montsaroff S.Z., Holloway A.K., Mauldin D.E., Srivastava D., Garg V., Pollard K.S., Galas D.J., Hood L., Smit A.F.A. (2011). "Chromosomal Haplotypes by Genetic Phasing of Human Families". American Journal of Human Genetics 89 (3): 382–397. doi:10.1016/j.ajhg.2011.07.023. PMID 21855840. http://www.cell.com/AJHG/abstract/S0002-9297%2811%2900318-1.
^ Barrett J.C., Fry B., Maller J., Daly M.J. (2005). "Haploview: analysis and visualization of LD and haplotype maps". Bioinformatics 21 (2): 263–265. doi:10.1093/bioinformatics/bth457. PMID 15297300. http://bioinformatics.oxfordjournals.org/cgi/reprint/21/2/263.
^ Purcell S., Daly M. J., Sham P. C. (2007). "WHAP: haplotype-based association analysis". Bioinformatics 23 (2): 255–256. doi:10.1093/bioinformatics/btl580. PMID 17118959. http://bioinformatics.oxfordjournals.org/cgi/reprint/23/2/255.

External links

HaploGroups.com — Comprehensive resource for DNA testing.
HapMap — homepage for the International HapMap Project.
Haplotype versus Haplogroup — the difference between haplogroup & haplotype explained.

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. 遺伝学用語集 glossary of genetic terms
2. 臨床的腎移植における研究中の免疫抑制剤およびそのアプローチ investigational immunosuppressive drugs and approaches in clinical kidney transplantation
3. 大脳皮質基底核変性症 corticobasal degeneration
4. 進行性核上性麻痺（PSP） progressive supranuclear palsy psp
5. 顔面肩甲上腕型筋ジストロフィー facioscapulohumeral muscular dystrophy

English Journal

EGLN1 variants influence expression and SaO2 levels to associate with high-altitude pulmonary oedema and adaptation.

Mishra A, Mohammad G, Thinlas T, Pasha MA.Source‡Department of Medicine, Sonam Norboo Memorial Hospital, Leh, Ladakh, Jammu & Kashmir, India.
Clinical science (London, England : 1979).Clin Sci (Lond).2013 Apr 1;124(7):479-89. doi: 10.1042/CS20120371.
EGLN1 [encoding HIF (hypoxia-inducible factor)-prolyl hydroxylase 2] plays a pivotal role in the HIF pathway and has emerged as one of the most intriguing genes with respect to physiology at HA (high altitude). EGLN1, being an actual oxygen sensor, appears to have a potential role in the functional
PMID 23130672

Juvenile rheumatoid arthritis and asthma, but not childhood-onset systemic lupus erythematosus are associated with FCRL3 polymorphisms in Mexicans.

Ramírez-Bello J, Jiménez-Morales S, Espinosa-Rosales F, Gómez-Vera J, Gutiérrez A, Velázquez Cruz R, Baca V, Orozco L.SourceImmunogenomics laboratory, Instituto Nacional de Medicina Genómica, SS, Mexico City, Mexico; Genomics Sciences Program, Universidad Autónoma de la Ciudad de México, Mexico City, Mexico.
Molecular immunology.Mol Immunol.2013 Apr;53(4):374-8. doi: 10.1016/j.molimm.2012.09.004. Epub 2012 Oct 13.
A regulatory single nucleotide polymorphism located in the 5' region (-169T/C) of the Fc receptor-like 3 (FCRL3_3) gene has been associated with both susceptibility and protection in immune diseases. This case-control study aimed to evaluate the association between FCRL3 polymorphisms and juvenile r
PMID 23070121

Genome-wide association study identified UQCC locus for spine bone size in humans.

Deng FY, Dong SS, Xu XH, Liu YJ, Liu YZ, Shen H, Tian Q, Li J, Deng HW.SourceCenter of Genetic Epidemiology and Genomics, School of Public Health, Soochow University, Suzhou, Jiangsu 215123, PR China; Center for Bioinformatics and Genomics, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA 70112, USA; Department of Biostatistics and Bioinformatics, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA 70112, USA.
Bone.Bone.2013 Mar;53(1):129-33. doi: 10.1016/j.bone.2012.11.028. Epub 2012 Nov 30.
Bone size (BS) contributes significantly to the risk of osteoporotic fracture. Osteoporotic spine fracture is one of the most disabling outcomes of osteoporosis. This study aims to identify genomic loci underlying spine BS variation in humans. We performed a genome-wide association scan in 2286 unre
PMID 23207799

Japanese Journal

ミトコンドリアDNAハプロタイプ分析による群馬県ツキノワグマ集団の遺伝的多様性

佐々木剛,和久井諒,和久大介,米澤隆弘,姉崎智子
東京農業大学農学集報 58(2), 49-56, 2013-09-20
日本国内でツキノワグマ(Ursus thibetanus)は本州,四国に生息し,現在5地域の個体群が絶滅の恐れのある地域集団とされている。群馬県でもツキノワグマが生息しているが,その捕獲頭数を定めた群馬県ツキノワグマ適正管理計画は,地域集団の構成を考慮しないまま実施されており,このままでは絶滅を招く危険性をはらんでいる。このことから,ツキノワグマの適切な保全を考慮した農林業被害等の防止対策を実施す …
NAID 110009613194

21-Hydroxylase gene mutant allele CYP21A2∗15 strongly linked to the resistant HLA haplotype B∗14:02-DRB1∗01:02 in chronic Chagas disease

del Puerto Florencia,Kikuchi Mihoko,Nishizawa Juan Eiki,Roca Yelin,Avila Cinthia,Gianella Alberto,Lora Javier,Gutierrez Velarde Freddy Udalrico,Hirayama Kenji
Human Immunology 74(6), 783-786, 2013-06-00
… We previously reported protective haplotype HLA-B*14:02-DRB1*01:02 against chronic Chagas disease in Bolivia. … The V281L mutant allele of the 21-Hydroxylase gene, CYP21A2*15, is reported to be located in the Class III region of the Human leukocyte antigen region and linked to the haplotype HLA-B*14:02-DRB1*01:02. …
NAID 120005298095

DRD2 haplotype associated with negative symptoms and sustained attention deficits in Han Chinese with schizophrenia in Taiwan

Chien Yi-Ling,Hwu Hai-Gwo,Fann Cathy S-J [他]
Journal of human genetics 58(4), 229-232, 2013-04-00
NAID 40019636847

Related Pictures

Haplotype Analysis based on Markov Chain Monte Carlo - ppt video online download Haplotype Phasing from Sequence Data – ZarLab Manish Anand Nihar Sheth Jim Costello Univ. of Indiana - ppt video online download Example of haplotypes and genotypes. (a). The haplotype and genotype | Download Haplotype - YouTube MixSIH: a mixture model for single individual haplotyping Topic #3 Linkage Disequilibrium, Haplotypes & Tagging - ppt video online download Haplotype; Haplotype

[1] The International HapMap Consortium (2003). "The International HapMap Project". Nature 426 (6968): 789–796. doi:10.1038/nature02168. PMID 14685227. http://www.nature.com/nature/journal/v426/n6968/pdf/nature02168.pdf.

[2] The International HapMap Consortium (2005). "A haplotype map of the human genome". Nature 437 (7063): 1299–1320. doi:10.1038/nature04226. PMC 1880871. PMID 16255080. http://www.nature.com/nature/journal/v437/n7063/pdf/nature04226.pdf.

[3] Facts & Genes. Volume 7, Issue 3

[4] Masatoshi Nei and Fumio Tajima, "DNA polymorphism detectable by restriction endonucleases", Genetics 97:145 (1981)

[5] Becker T., Knapp M. (2004). "Maximum-likelihood estimation of haplotype frequencies in nuclear families". Genetic Epidemiology 27 (1): 21–32. doi:10.1002/gepi.10323. PMID 15185400.

[6] Li S.S., Khalid N., Carlson C., Zhao L.P. (2003). "Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms". Biostatistics 4 (4): 513–522. doi:10.1093/biostatistics/4.4.513. PMID 14557108. http://biostatistics.oxfordjournals.org/cgi/content/abstract/4/4/513.

[7] Roach J.C., Glusman G., Hubley R., Montsaroff S.Z., Holloway A.K., Mauldin D.E., Srivastava D., Garg V., Pollard K.S., Galas D.J., Hood L., Smit A.F.A. (2011). "Chromosomal Haplotypes by Genetic Phasing of Human Families". American Journal of Human Genetics 89 (3): 382–397. doi:10.1016/j.ajhg.2011.07.023. PMID 21855840. http://www.cell.com/AJHG/abstract/S0002-9297%2811%2900318-1.

[8] Barrett J.C., Fry B., Maller J., Daly M.J. (2005). "Haploview: analysis and visualization of LD and haplotype maps". Bioinformatics 21 (2): 263–265. doi:10.1093/bioinformatics/bth457. PMID 15297300. http://bioinformatics.oxfordjournals.org/cgi/reprint/21/2/263.

[9] Purcell S., Daly M. J., Sham P. C. (2007). "WHAP: haplotype-based association analysis". Bioinformatics 23 (2): 255–256. doi:10.1093/bioinformatics/btl580. PMID 17118959. http://bioinformatics.oxfordjournals.org/cgi/reprint/23/2/255.

匿名

検索

案内

案内