Protein structure; higher-order protein structure (Japanese headwords: タンパク質構造, 蛋白質構造, タンパク質高次構造)

WordNet

a symmetrical arrangement of the parts of a thing
any of a large group of nitrogenous organic compounds that are essential constituents of living cells; consist of polymers of amino acids; essential in the diet of animals for growth and for repair of tissues; can be obtained from meat and eggs and milk and legumes; "a diet high in protein"

PrepTutorEJDIC

〈C〉 form, structure; a harmonious arrangement (of parts) / 〈U〉 conformity, agreement, or adaptation (to a type, character, etc.) 《+to+noun》
protein

Wikipedia preview

Source (authority): Wikipedia, the free encyclopedia, 2013/08/07 05:59:27 (JST)

wiki en

Protein structure is the biomolecular structure of a protein molecule. Each protein is a polymer – specifically a polypeptide – that is a sequence formed from various L-α-amino acids (also referred to as residues). By convention, a chain under 40 residues is often identified as a peptide, rather than a protein. To be able to perform their biological function, proteins fold into one or more specific spatial conformations, driven by a number of non-covalent interactions such as hydrogen bonding, ionic interactions, van der Waals forces, and hydrophobic packing. To understand the functions of proteins at a molecular level, it is often necessary to determine their three-dimensional structure. This is the topic of the scientific field of structural biology, which employs techniques such as X-ray crystallography, NMR spectroscopy, and dual polarisation interferometry to determine the structure of proteins.

Protein structures range in size from tens to several thousand residues.^[1] By physical size, proteins are classified as nanoparticles (defined as 1–100 nm). Very large aggregates can be formed from protein subunits: for example, many thousand actin molecules assemble into a microfilament.

A protein may undergo reversible structural changes in performing its biological function. The alternative structures of the same protein are referred to as different conformations, and transitions between them are called conformational changes.

1 Levels of protein structure
- 1.1 Primary structure
- 1.2 Amino acid residues
- 1.3 Secondary structure
- 1.4 Tertiary structure
- 1.5 Quaternary structure
2 Domains, motifs, and folds in protein structure
3 Protein folding
4 Protein structure determination
5 Structure classification
6 Computational prediction of protein structure
7 References
8 Further reading
9 External links
- 9.1 Wikis
- 9.2 Servers

Levels of protein structure

Protein structure, from primary to quaternary structure.

There are four distinct levels of protein structure.

Primary structure

Main article: Protein primary structure

The primary structure refers to the linear sequence of amino acids in the polypeptide chain. It is held together by covalent bonds, namely the peptide bonds formed during protein biosynthesis (translation). The two ends of the polypeptide chain are referred to as the carboxyl terminus (C-terminus) and the amino terminus (N-terminus), based on the nature of the free group on each extremity. Counting of residues always starts at the N-terminal end (NH₂-group), which is the end where the amino group is not involved in a peptide bond.

The primary structure of a protein is determined by the gene corresponding to the protein. A specific sequence of nucleotides in DNA is transcribed into mRNA, which is read by the ribosome in a process called translation. The first amino acid sequence of a protein, that of insulin, was determined by Frederick Sanger. The sequence of a protein is unique to that protein, and defines the structure and function of the protein. The sequence can be determined by methods such as Edman degradation or tandem mass spectrometry. Often, however, it is read directly from the sequence of the gene using the genetic code.

The human body contains more than 10,000 different proteins, all built from different arrangements of 20 types of amino acid residues. The term "amino acid residue" is preferred because a water molecule is lost each time a peptide bond forms, so a protein is made up of residues rather than intact amino acids. Post-translational modifications such as disulfide bond formation, phosphorylations and glycosylations are usually also considered a part of the primary structure, and cannot be read from the gene. Example: insulin is composed of 51 amino acids in 2 chains. One chain (the A chain) has 21 amino acids and the other (the B chain) has 30 amino acids.
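As an illustration of reading a primary structure from a gene sequence with the genetic code, here is a minimal Python sketch. It assumes the standard genetic code and a message that starts in frame at its first base; the function name `translate` and the example message are hypothetical, and real genes involve further steps (start-codon selection, splicing, post-translational modification) that are not modelled.

```python
from itertools import product

# Standard genetic code. Codons are enumerated with the bases in the order
# T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna: str) -> str:
    """Translate an mRNA (or coding DNA) string into one-letter residues.

    Assumption: reading starts at the first base and stops at the first
    stop codon. The result is written N-terminus first, which matches the
    convention that residue numbering starts at the N-terminal end.
    """
    dna = mrna.upper().replace("U", "T")
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # a stop codon ends the chain
            break
        residues.append(aa)
    return "".join(residues)


# Hypothetical 12-base message: AUG GCC UGG UAA
print(translate("AUGGCCUGGUAA"))
```

The example message encodes methionine, alanine and tryptophan followed by a stop codon, so the printed sequence is `MAW`.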

Amino acid residues

Main article: Amino acid

Main article: Proteinogenic amino acid

Each α-amino acid consists of a backbone part that is present in all the amino acid types, and a side chain that is unique to each type of residue. An exception to this rule is proline. Because the α-carbon atom is bound to four different groups, it is chiral; however, only one of the isomers (the L-form) occurs in biological proteins. Glycine, however, is not chiral, since its side chain is a hydrogen atom. A simple mnemonic for the correct L-form is "CORN": when the Cα atom is viewed with the H in front, the groups read "CO-R-N" in a clockwise direction.

Secondary structure

An alpha-helix with hydrogen bonds (yellow dots)

Main article: Protein secondary structure

Secondary structure refers to highly regular local sub-structures. Two main types of secondary structure, the alpha helix and the beta strand or beta sheets, were suggested in 1951 by Linus Pauling and coworkers.^[2] These secondary structures are defined by patterns of hydrogen bonds between the main-chain peptide groups. They have a regular geometry, being constrained to specific values of the dihedral angles ψ and φ on the Ramachandran plot. Both the alpha helix and the beta-sheet represent a way of saturating all the hydrogen bond donors and acceptors in the peptide backbone. Some parts of the protein are ordered but do not form any regular structures. They should not be confused with random coil, an unfolded polypeptide chain lacking any fixed three-dimensional structure. Several sequential secondary structures may form a "supersecondary unit".^[3]
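The dihedral angles φ and ψ mentioned above can be computed from atomic coordinates. The following Python sketch uses a widely used atan2 formulation for the torsion angle defined by four points; the function name `dihedral` and the test coordinates are illustrative assumptions, not taken from any particular structure or library. For a residue i, φ is conventionally measured over the atoms C(i-1), N(i), Cα(i), C(i) and ψ over N(i), Cα(i), C(i), N(i+1).

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dihedral(p0, p1, p2, p3):
    """Return the torsion angle (degrees, in [-180, 180]) about the p1-p2 bond."""
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)  # normal of the plane through p0, p1, p2
    n2 = _cross(b2, b3)  # normal of the plane through p1, p2, p3
    x = _dot(n1, n2)
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    return math.degrees(math.atan2(y, x))


# Hypothetical coordinates: four points forming a right-angle twist.
print(dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1)))
```

Applying such a function to every residue of a structure gives the (φ, ψ) pairs that are plotted on a Ramachandran plot.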

Tertiary structure

Main article: Protein tertiary structure

Tertiary structure refers to the three-dimensional structure of a single protein molecule. The alpha-helices and beta-sheets are folded into a compact globule. The folding is driven by non-specific hydrophobic interactions (the burial of hydrophobic residues away from water), but the structure is stable only when the parts of a protein domain are locked into place by specific tertiary interactions, such as salt bridges, hydrogen bonds, disulfide bonds, and the tight packing of side chains. Disulfide bonds are extremely rare in cytosolic proteins, since the cytosol is generally a reducing environment.

Quaternary structure

Main article: Protein quaternary structure

Quaternary structure is the three-dimensional structure of a multi-subunit protein and how the subunits fit together. The quaternary structure is stabilized by the same non-covalent interactions and disulfide bonds as the tertiary structure. Complexes of two or more polypeptides (i.e. multiple subunits) are called multimers. Specifically, a multimer is called a dimer if it contains two subunits, a trimer if it contains three subunits, and a tetramer if it contains four subunits. The subunits are frequently related to one another by symmetry operations, such as a 2-fold axis in a dimer. Multimers made up of identical subunits are referred to with a prefix of "homo-" (e.g. a homotetramer) and those made up of different subunits are referred to with a prefix of "hetero-" (e.g. a heterotetramer, such as the two alpha and two beta chains of hemoglobin).
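The naming rules above can be expressed as a short Python sketch. The function name `describe_multimer` and the chain labels are hypothetical; only the size names given in the text (plus "monomer" for a single chain) are covered, and larger complexes fall back to a generic description.

```python
SIZE_NAMES = {2: "dimer", 3: "trimer", 4: "tetramer"}


def describe_multimer(subunits):
    """Name a complex from a list of subunit labels, one label per chain.

    Identical labels mean identical subunits ("homo-"); any difference
    between labels gives the "hetero-" prefix.
    """
    n = len(subunits)
    if n == 0:
        raise ValueError("a complex needs at least one subunit")
    if n == 1:
        return "monomer"
    prefix = "homo" if len(set(subunits)) == 1 else "hetero"
    size = SIZE_NAMES.get(n)
    if size is None:
        # Sizes beyond those named in the text: generic fallback wording.
        return f"{prefix}-oligomer of {n} subunits"
    return prefix + size


# Hemoglobin: two alpha chains and two beta chains.
print(describe_multimer(["alpha", "alpha", "beta", "beta"]))
```

For the hemoglobin example this prints `heterotetramer`, matching the description in the text.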

Domains, motifs, and folds in protein structure

Protein domains. The two protein structures shown share a common domain (maroon), the PH domain, which is involved in phosphatidylinositol (3,4,5)-trisphosphate binding.

Proteins are frequently described as consisting of several structural units.

A structural domain is an element of the protein's overall structure that is self-stabilizing and often folds independently of the rest of the protein chain. Many domains are not unique to the protein products of one gene or one gene family but instead appear in a variety of proteins. Domains often are named and singled out because they figure prominently in the biological function of the protein they belong to; for example, the "calcium-binding domain of calmodulin". Because they are independently stable, domains can be "swapped" by genetic engineering between one protein and another to make chimeras.

Structural and sequence motifs are short segments of protein three-dimensional structure or amino acid sequence that are found in a large number of different proteins.

Supersecondary structure refers to a specific combination of secondary structure elements, such as beta-alpha-beta units or the helix-turn-helix motif. Some of these may also be referred to as structural motifs.

Protein fold refers to the general protein architecture, such as a helix bundle, beta-barrel, Rossmann fold, or one of the other "folds" catalogued in the Structural Classification of Proteins database.^[4]

Despite the fact that there are about 100,000 different proteins expressed in eukaryotic systems, there are many fewer different domains, structural motifs and folds.

Protein folding

Main article: Protein folding

An unfolded polypeptide folds into its characteristic three-dimensional structure from a random coil.

Protein structure determination

Around 90% of the protein structures available in the Protein Data Bank have been determined by X-ray crystallography. This method allows one to measure the 3D density distribution of electrons in the protein (in the crystallized state) and thereby infer the 3D coordinates of all the atoms to a certain resolution. Roughly 9% of the known protein structures have been obtained by nuclear magnetic resonance (NMR) techniques. The secondary structure composition can be determined via circular dichroism. Cryo-electron microscopy has recently become a means of determining protein structures to high resolution (better than 5 angstroms, or 0.5 nanometers) and is anticipated to increase in power as a tool for high-resolution work in the next decade. The technique is also a valuable resource for researchers working with very large protein complexes such as virus coat proteins and amyloid fibers. A more qualitative picture of protein structure is often obtained by proteolysis, which is also useful for screening for more crystallisable protein samples. Novel implementations of this approach, including fast parallel proteolysis (FASTpp), can probe the structured fraction and its stability without the need for purification.^[5]

Structure classification

Protein structures can be grouped based on their similarity or a common evolutionary origin. The SCOP^[6] and CATH^[7] databases provide two different structural classifications of proteins.

Computational prediction of protein structure

Main article: Protein structure prediction

Determining the sequence of a protein is much easier than determining its structure. However, the structure of a protein gives much more insight into its function than the sequence does. Therefore, a number of methods for the computational prediction of protein structure from sequence have been developed.^[8] Ab initio prediction methods use just the sequence of the protein. Threading and homology modeling methods can build a 3D model for a protein of unknown structure from experimental structures of evolutionarily related proteins.
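Methods that rely on related proteins need some measure of how similar two sequences are. One simple measure is percent identity over an existing pairwise alignment, sketched below in Python. This is an illustrative assumption rather than the procedure of any specific modeling tool: the function name `percent_identity` and the example sequences are hypothetical, the alignment is taken as given, and the denominator chosen here (columns without gaps) is only one of several definitions in use.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues over the gap-free alignment columns.

    Both inputs must be rows of the same pairwise alignment, so they have
    equal length; `gap` is the character marking an alignment gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Hypothetical aligned fragments of a target and a template sequence.
print(percent_identity("MKV-LA", "MKVQLG"))
```

In this example five columns are compared and four match, giving 80.0.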

References

^ Brocchieri L, Karlin S (2005-06-10). "Protein length in eukaryotic and prokaryotic proteomes". Nucleic Acids Research 33 (10): 3390–3400. doi:10.1093/nar/gki615. PMC 1150220. PMID 15951512.
^ Pauling L, Corey RB, Branson HR (1951). "The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain". Proc Natl Acad Sci USA 37 (4): 205–211. doi:10.1073/pnas.37.4.205. PMC 1063337. PMID 14816373.
^ Chiang YS, Gelfand TI, Kister AE, Gelfand IM (2007). "New classification of supersecondary structures of sandwich-like proteins uncovers strict patterns of strand assemblage". Proteins 68 (4): 915–921. doi:10.1002/prot.21473. PMID 17557333.
^ Govindarajan S, Recabarren R, Goldstein RA (1999). "Estimating the total number of protein folds". Proteins 35 (4): 408–414. doi:10.1002/(SICI)1097-0134(19990601)35:4<408::AID-PROT4>3.0.CO;2-A. PMID 10382668.
^ PMID 23056252.
^ Murzin AG, Brenner S, Hubbard T, Chothia C (1995). "SCOP: A structural classification of proteins database for the investigation of sequences and structures". Journal of Molecular Biology 247 (4): 536–540. doi:10.1016/S0022-2836(05)80134-2. PMID 7723011.
^ Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM (1997). "CATH: a hierarchic classification of protein domain structures". Structure 5 (8): 1093–1108. doi:10.1016/S0969-2126(97)00260-8. PMID 9309224.
^ Zhang Y (2008). "Progress and challenges in protein structure prediction". Curr Opin Struct Biol 18 (3): 342–348. doi:10.1016/j.sbi.2008.02.004. PMC 2680823. PMID 18436442.

External links

Wikis

PDBWiki — A discussion forum for macromolecular structures (see PDBWiki)
Proteopedia — Annotation of protein structures and other biomolecules
TOPSAN — Annotation of protein structures in Structural genomics

Servers

SSS Database — super-secondary structure protein database
SPROUTS (Structural Prediction for pRotein fOlding UTility System)
SMIR (Most Interacting Residues)

UpToDate Contents

To read the full text you will need to subscribe.

1. Protein S deficiency
2. Hypertrophic cardiomyopathy: Gene mutations and clinical genetic testing
3. Protein C deficiency
4. Biology and genetics of prions
5. Factor V Leiden and activated protein C resistance

English Journal

NMR interaction studies of Neu5Ac-α-(2,6)-Gal-β-(1-4)-GlcNAc with influenza-virus hemagglutinin expressed in transfected human cells.

Vasile F, Gubinelli F, Panigada M, Soprana E, Siccardi A, Potenza D.
Glycobiology. 2018 Dec 1;28(1):42-49. doi: 10.1093/glycob/cwx092.
PMID 29087468

Macrocarpal C isolated from Eucalyptus globulus inhibits dipeptidyl peptidase 4 in an aggregated form.

Kato E, Kawakami K, Kawabata J.
J Enzyme Inhib Med Chem. 2018 Dec;33(1):106-109. doi: 10.1080/14756366.2017.1396458.
PMID 29148282

Impact of oligomeric procyanidins on wheat gluten microstructure and physicochemical properties.

Liu R, Shi C, Song Y, Wu T, Zhang M.
Food Chem. 2018 Sep 15;260:37-43. doi: 10.1016/j.foodchem.2018.03.103. Epub 2018 Mar 24.
PMID 29699679

Japanese Journal

Principal component analysis of pressure-dependent chemical shift data for various proteins

Memoirs of Institute of Advanced Technology, Kinki University = 近畿大学先端技術総合研究所紀要 (23), 1-9, 2018-03
NAID 120006469826

Structure optimization calculations based on Bayesian inference, toward NMR analysis of dynamical ordering in biomacromolecules

J. Comput. Chem. Jpn. 17(1), 65-75, 2018
NAID 130006555815

pH Dependence of the Number of Discrete Conformers of Carbonic Anhydrase 2, as Evaluated from Collision Cross-Section Using Ion Mobility Coupled with Electrospray Ionization