ゲノム配列決定、ゲノムシークエンシング

WordNet

the branch of genetics that studies organisms in terms of their genomes (their full DNA sequences)

PrepTutorEJDIC

配列,接続;(特に時間の)調整

Wikipedia preview

出典(authority):フリー百科事典『ウィキペディア（Wikipedia）』「2015/02/01 12:33:38」(JST)

wiki en

DNA sequencing is the process of determining the precise order of nucleotides within a DNA molecule. It includes any method or technology that is used to determine the order of the four bases—adenine, guanine, cytosine, and thymine—in a strand of DNA. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery.

Knowledge of DNA sequences has become indispensable for basic biological research, and in numerous applied fields such as diagnostic, biotechnology, forensic biology, virology and biological systematics. The rapid speed of sequencing attained with modern DNA sequencing technology has been instrumental in the sequencing of complete DNA sequences, or genomes of numerous types and species of life, including the human genome and other complete DNA sequences of many animal, plant, and microbial species.

An example of the results of automated chain-termination DNA sequencing.

The first DNA sequences were obtained in the early 1970s by academic researchers using laborious methods based on two-dimensional chromatography. Following the development of fluorescence-based sequencing methods with automated analysis,^[1] DNA sequencing has become easier and orders of magnitude faster.^[2]

1 Use of sequencing
2 The four canonical bases
3 History
4 Basic methods
- 4.1 Maxam-Gilbert sequencing
- 4.2 Chain-termination methods
5 Advanced methods and de novo sequencing
- 5.1 Shotgun sequencing
- 5.2 Bridge PCR
6 Next-generation methods
- 6.1 Massively parallel signature sequencing (MPSS)
- 6.2 Polony sequencing
- 6.3 454 pyrosequencing
- 6.4 Illumina (Solexa) sequencing
- 6.5 SOLiD sequencing
- 6.6 Ion Torrent semiconductor sequencing
- 6.7 DNA nanoball sequencing
- 6.8 Heliscope single molecule sequencing
- 6.9 Single molecule real time (SMRT) sequencing
7 Methods in development
- 7.1 Nanopore DNA sequencing
- 7.2 Tunnelling currents DNA sequencing
- 7.3 Sequencing by hybridization
- 7.4 Sequencing with mass spectrometry
- 7.5 Microfluidic Sanger sequencing
- 7.6 Microscopy-based techniques
- 7.7 RNAP sequencing
- 7.8 In vitro virus high-throughput sequencing
8 Development initiatives
9 Computational challenges
- 9.1 Read Trimming
10 See also
11 References
12 External links

Use of sequencing

DNA sequencing may be used to determine the sequence of individual genes, larger genetic regions (i.e. clusters of genes or operons), full chromosomes or entire genomes. Sequencing provides the order of individual nucleotides in DNA or RNA (commonly represented as A, C, G, T, and U) isolated from cells of animals, plants, bacteria, archaea, or virtually any other source of genetic information. This is useful for:

Molecular biology – studying the genome itself, how proteins are made, what proteins are made, identifying new genes and associations with diseases and phenotypes, and identifying potential drug targets
Evolutionary biology – studying how different organisms are related and how they evolved
Metagenomics – Identifying species present in a body of water, sewage, dirt, debris filtred from the air, or swab samples of organisms. Helpful in ecology, epidemiology, microbiome research, and other fields.

Less-precise information is produced by non-sequencing techniques like DNA fingerprinting. This information may be easier to obtain and is useful for:

Detecting the presence of known genes for medical purposes (see genetic testing)
Forensic identification
Parental testing

The four canonical bases

The canonical structure of DNA has four bases: Thymine (T), Adenine (A), Cytosine (C), and Guanine (G). DNA sequencing is the determination of the physical order of these bases in a molecule of DNA. However, there are many other bases that may be present in a molecule. In some viruses (specifically, bacteriophage), cytosine is replaced by hydroxy methyl or hydroxy methyl glucose cytosine.^{[citation needed]} In mammalian DNA, variant bases with methyl groups or phosphosulfate may be found.^{[citation needed]} Depending on the sequencing technique, a particular modification may or may not be detected, e.g., the 5mC (5 methyl cytosine) common in humans may or may not be detected.

History

Though the structure of DNA was established as a double helix in 1953,^[3] several decades would pass before fragments of DNA could be reliably analyzed for their sequence in the laboratory. RNA sequencing was one of the earliest forms of nucleotide sequencing. The major landmark of RNA sequencing is the sequence of the first complete gene and the complete genome of Bacteriophage MS2, identified and published by Walter Fiers and his coworkers at the University of Ghent (Ghent, Belgium), in 1972^[4] and 1976.^[5]

The first method for determining DNA sequences involved a location-specific primer extension strategy established by Ray Wu at Cornell University in 1970.^[6] DNA polymerase catalysis and specific nucleotide labeling, both of which figure prominently in current sequencing schemes, were used to sequence the cohesive ends of lambda phage DNA^[7]^[8]^[9] Between 1970 and 1973, Wu, R Padmanabhan and colleagues demonstrated that this method can be employed to determine any DNA sequence using synthetic location-specific primers.^[10]^[11]^[12] Frederick Sanger then adopted this primer-extension strategy to develop more rapid DNA sequencing methods at the MRC Centre, Cambridge, UK and published a method for "DNA sequencing with chain-terminating inhibitors" in 1977.^[13] Walter Gilbert and Allan Maxam at Harvard also developed sequencing methods, including one for "DNA sequencing by chemical degradation".^[14]^[15] In 1973, Gilbert and Maxam reported the sequence of 24 basepairs using a method known as wandering-spot analysis.^[16] Advancements in sequencing were aided by the concurrent development of recombinant DNA technology, allowing DNA samples to be isolated from sources other than viruses.

The first full DNA genome to be sequenced was that of bacteriophage φX174 in 1977.^[17] Medical Research Council scientists deciphered the complete DNA sequence of the Epstein-Barr virus in 1984, finding it to be 170 thousand base-pairs long.

A non-radioactive method for transferring the DNA molecules of sequencing reaction mixtures onto an immobilizing matrix during electrophoresis was developed by Pohl and co-workers in the early 80’s.^[18]^[19] Followed by the commercialization of the DNA sequencer "Direct-Blotting-Electrophoresis-System GATC 1500" by GATC Biotech, which was intensively used in the framework of the EU genome-sequencing programme, the complete DNA sequence of the yeast Saccharomyces cerevisiae chromosome II.^[20] Leroy E. Hood's laboratory at the California Institute of Technology announced the first semi-automated DNA sequencing machine in 1986.^[21] This was followed by Applied Biosystems' marketing of the first fully automated sequencing machine, the ABI 370, in 1987 and by Dupont's Genesis 2000^[22] which used a novel fluorescent labeling technique enabling all four dideoxynucleotides to be identified in a single lane. By 1990, the U.S. National Institutes of Health (NIH) had begun large-scale sequencing trials on Mycoplasma capricolum, Escherichia coli, Caenorhabditis elegans, and Saccharomyces cerevisiae at a cost of US$0.75 per base. Meanwhile, sequencing of human cDNA sequences called expressed sequence tags began in Craig Venter's lab, an attempt to capture the coding fraction of the human genome.^[23] In 1995, Venter, Hamilton Smith, and colleagues at The Institute for Genomic Research (TIGR) published the first complete genome of a free-living organism, the bacterium Haemophilus influenzae. The circular chromosome contains 1,830,137 bases and its publication in the journal Science^[24] marked the first published use of whole-genome shotgun sequencing, eliminating the need for initial mapping efforts. By 2001, shotgun sequencing methods had been used to produce a draft sequence of the human genome.^[25]^[26]

Several new methods for DNA sequencing were developed in the mid to late 1990s. These techniques comprise the first of the "next-generation" sequencing methods. In 1996, Pål Nyrén and his student Mostafa Ronaghi at the Royal Institute of Technology in Stockholm published their method of pyrosequencing.^[27] A year later, Pascal Mayer and Laurent Farinelli submitted patents to the World Intellectual Property Organization describing DNA colony sequencing.^[28] Lynx Therapeutics published and marketed "Massively parallel signature sequencing", or MPSS, in 2000. This method incorporated a parallelized, adapter/ligation-mediated, bead-based sequencing technology and served as the first commercially available "next-generation" sequencing method, though no DNA sequencers were sold to independent laboratories.^[29] In 2004, 454 Life Sciences marketed a parallelized version of pyrosequencing.^[30] The first version of their machine reduced sequencing costs 6-fold compared to automated Sanger sequencing, and was the second of the new generation of sequencing technologies, after MPSS.^[31]

The large quantities of data produced by DNA sequencing have also required development of new methods and programs for sequence analysis. Phil Green and Brent Ewing of the University of Washington described their phred quality score for sequencer data analysis in 1998.^[32]

Basic methods

Maxam-Gilbert sequencing

Allan Maxam and Walter Gilbert published a DNA sequencing method in 1977 based on chemical modification of DNA and subsequent cleavage at specific bases.^[14] Also known as chemical sequencing, this method allowed purified samples of double-stranded DNA to be used without further cloning. This method's use of radioactive labeling and its technical complexity discouraged extensive use after refinements in the Sanger methods had been made.

Maxam-Gilbert sequencing requires radioactive labeling at one 5' end of the DNA and purification of the DNA fragment to be sequenced. Chemical treatment then generates breaks at a small proportion of one or two of the four nucleotide bases in each of four reactions (G, A+G, C, C+T). The concentration of the modifying chemicals is controlled to introduce on average one modification per DNA molecule. Thus a series of labeled fragments is generated, from the radiolabeled end to the first "cut" site in each molecule. The fragments in the four reactions are electrophoresed side by side in denaturing acrylamide gels for size separation. To visualize the fragments, the gel is exposed to X-ray film for autoradiography, yielding a series of dark bands each corresponding to a radiolabeled DNA fragment, from which the sequence may be inferred.^[14]

Chain-termination methods

The chain-termination method developed by Frederick Sanger and coworkers in 1977 soon became the method of choice, owing to its relative ease and reliability.^[13]^[33] When invented, the chain-terminator method used fewer toxic chemicals and lower amounts of radioactivity than the Maxam and Gilbert method. Because of its comparative ease, the Sanger method was soon automated and was the method used in the first generation of DNA sequencers.

Sanger sequencing is the method which prevailed from the 80's until the mid-2000s. Over that period, great advances were made in the technique, such as fluorescent labelling, capillary electrophoresis, and general automation. These developments allowed much more efficient sequencing, leading to lower costs. The Sanger method, in mass production form, is the technology which produced the first human genome in 2001, ushering in the age of genomics. However, later in the decade, radically different approaches reached the market, bringing the cost per genome down from $100 million in 2001 to $10,000 in 2011.^[34]

Advanced methods and de novo sequencing

Genomic DNA is fragmented into random pieces and cloned as a bacterial library. DNA from individual bacterial clones is sequenced and the sequence is assembled by using overlapping DNA regions.(click to expand)

Large-scale sequencing often aims at sequencing very long DNA pieces, such as whole chromosomes, although large-scale sequencing can also be used to generate very large numbers of short sequences, such as found in phage display. For longer targets such as chromosomes, common approaches consist of cutting (with restriction enzymes) or shearing (with mechanical forces) large DNA fragments into shorter DNA fragments. The fragmented DNA may then be cloned into a DNA vector and amplified in a bacterial host such as Escherichia coli. Short DNA fragments purified from individual bacterial colonies are individually sequenced and assembled electronically into one long, contiguous sequence. Studies have shown that adding a size selection step to collect DNA fragments of uniform size can improve sequencing efficiency and accuracy of the genome assembly. In these studies, automated sizing has proven to be more reproducible and precise than manual gel sizing.^[35]^[36]^[37]

The term "de novo sequencing" specifically refers to methods used to determine the sequence of DNA with no previously known sequence. De novo translates from Latin as "from the beginning". Gaps in the assembled sequence may be filled by primer walking. The different strategies have different tradeoffs in speed and accuracy; shotgun methods are often used for sequencing large genomes, but its assembly is complex and difficult, particularly with sequence repeats often causing gaps in genome assembly.

Most sequencing approaches use an in vitro cloning step to amplify individual DNA molecules, because their molecular detection methods are not sensitive enough for single molecule sequencing. Emulsion PCR^[38] isolates individual DNA molecules along with primer-coated beads in aqueous droplets within an oil phase. A polymerase chain reaction (PCR) then coats each bead with clonal copies of the DNA molecule followed by immobilization for later sequencing. Emulsion PCR is used in the methods developed by Marguilis et al. (commercialized by 454 Life Sciences), Shendure and Porreca et al. (also known as "Polony sequencing") and SOLiD sequencing, (developed by Agencourt, later Applied Biosystems, now Life Technologies).^[39]^[40]^[41]

Shotgun sequencing

Shotgun sequencing is a sequencing method designed for analysis of DNA sequences longer than 1000 base pairs, up to and including entire chromosomes. This method requires the target DNA to be broken into random fragments. After sequencing individual fragments, the sequences can be reassembled on the basis of their overlapping regions.^[42]

Bridge PCR

Another method for in vitro clonal amplification is bridge PCR, in which fragments are amplified upon primers attached to a solid surface^[28]^[43]^[44] and form "DNA colonies" or "DNA clusters". This method is used in the Illumina Genome Analyzer sequencers. Single-molecule methods, such as that developed by Stephen Quake's laboratory (later commercialized by Helicos) are an exception: they use bright fluorophores and laser excitation to detect base addition events from individual DNA molecules fixed to a surface, eliminating the need for molecular amplification.^[45]

Next-generation methods

Multiple, fragmented sequence reads must be assembled together on the basis of their overlapping areas.

Next-generation sequencing applies to genome sequencing, genome resequencing, transcriptome profiling (RNA-Seq), DNA-protein interactions (ChIP-sequencing), and epigenome characterization.^[46] Resequencing is necessary, because the genome of a single individual of a species will not indicate all of the genome variations among other individuals of the same species.

The high demand for low-cost sequencing has driven the development of high-throughput sequencing (or next-generation sequencing) technologies that parallelize the sequencing process, producing thousands or millions of sequences concurrently.^[47]^[48] High-throughput sequencing technologies are intended to lower the cost of DNA sequencing beyond what is possible with standard dye-terminator methods.^[31] In ultra-high-throughput sequencing as many as 500,000 sequencing-by-synthesis operations may be run in parallel.^[49]^[50]^[51]

Comparison of next-generation sequencing methods^[52]^[53]
Method	Read length	Accuracy	Reads per run	Time per run	Cost per 1 million bases (in US$)	Advantages	Disadvantages
Single-molecule real-time sequencing (Pacific Bio)	10,000 bp to 15,000 bp avg (14,000 bp N50); maximum read length >40,000 bases^[54]^[55]^[56]	99.9999% consensus accuracy; 87% single-read accuracy^[57]	50,000 per SMRT cell, or 500–1000 megabases^[58]^[59]	30 minutes to 4 hours^[60]	$0.13–$0.60	Longest read length. Fast. Detects 4mC, 5mC, 6mA.^[61]	Moderate throughput. Equipment can be very expensive.
Ion semiconductor (Ion Torrent sequencing)	up to 400 bp	98%	up to 80 million	2 hours	$1	Less expensive equipment. Fast.	Homopolymer errors.
Pyrosequencing (454)	700 bp	99.9%	1 million	24 hours	$10	Long read size. Fast.	Runs are expensive. Homopolymer errors.
Sequencing by synthesis (Illumina)	50 to 300 bp	98%	up to 3 billion	1 to 10 days, depending upon sequencer and specified read length^[62]	$0.05 to $0.15	Potential for high sequence yield, depending upon sequencer model and desired application.	Equipment can be very expensive. Requires high concentrations of DNA.
Sequencing by ligation (SOLiD sequencing)	50+35 or 50+50 bp	99.9%	1.2 to 1.4 billion	1 to 2 weeks	$0.13	Low cost per base.	Slower than other methods. Have issue sequencing palindromic sequence.^[63]
Chain termination (Sanger sequencing)	400 to 900 bp	99.9%	N/A	20 minutes to 3 hours	$2400	Long individual reads. Useful for many applications.	More expensive and impractical for larger sequencing projects.

Massively parallel signature sequencing (MPSS)

The first of the next-generation sequencing technologies, massively parallel signature sequencing (or MPSS), was developed in the 1990s at Lynx Therapeutics, a company founded in 1992 by Sydney Brenner and Sam Eletr. MPSS was a bead-based method that used a complex approach of adapter ligation followed by adapter decoding, reading the sequence in increments of four nucleotides. This method made it susceptible to sequence-specific bias or loss of specific sequences. Because the technology was so complex, MPSS was only performed 'in-house' by Lynx Therapeutics and no DNA sequencing machines were sold to independent laboratories. Lynx Therapeutics merged with Solexa (later acquired by Illumina) in 2004, leading to the development of sequencing-by-synthesis, a simpler approach acquired from Manteia Predictive Medicine, which rendered MPSS obsolete. However, the essential properties of the MPSS output were typical of later "next-generation" data types, including hundreds of thousands of short DNA sequences. In the case of MPSS, these were typically used for sequencing cDNA for measurements of gene expression levels.^[29]

Polony sequencing

The Polony sequencing method, developed in the laboratory of George M. Church at Harvard, was among the first next-generation sequencing systems and was used to sequence a full genome in 2005. It combined an in vitro paired-tag library with emulsion PCR, an automated microscope, and ligation-based sequencing chemistry to sequence an E. coli genome at an accuracy of >99.9999% and a cost approximately 1/9 that of Sanger sequencing.^[64] The technology was licensed to Agencourt Biosciences, subsequently spun out into Agencourt Personal Genomics, and eventually incorporated into the Applied Biosystems SOLiD platform, which is now owned by Life Technologies, which was recently bought by Thermo Fisher Scientific.

454 pyrosequencing

A parallelized version of pyrosequencing was developed by 454 Life Sciences, which has since been acquired by Roche Diagnostics. The method amplifies DNA inside water droplets in an oil solution (emulsion PCR), with each droplet containing a single DNA template attached to a single primer-coated bead that then forms a clonal colony. The sequencing machine contains many picoliter-volume wells each containing a single bead and sequencing enzymes. Pyrosequencing uses luciferase to generate light for detection of the individual nucleotides added to the nascent DNA, and the combined data are used to generate sequence read-outs.^[39] This technology provides intermediate read length and price per base compared to Sanger sequencing on one end and Solexa and SOLiD on the other.^[31]

Illumina (Solexa) sequencing

Solexa, now part of Illumina, was founded by Shankar Balasubramanian and David Klenerman in 1998, and developed a sequencing method based on reversible dye-terminators technology, and engineered polymerases.^[65] The terminated chemistry was developed internally at Solexa and the concept of the Solexa system was invented by Balasubramanian and Klenerman from Cambridge University's chemistry department. In 2004, Solexa acquired the company Manteia Predictive Medicine in order to gain a massivelly parallel sequencing technology based on "DNA Clusters", which involves the clonal amplification of DNA on a surface. The cluster technology was co-acquired with Lynx Therapeutics of California. Solexa Ltd. later merged with Lynx to form Solexa Inc.

An Illumina HiSeq 2500 sequencer

In this method, DNA molecules and primers are first attached on a slide and amplified with polymerase so that local clonal DNA colonies, later coined "DNA clusters", are formed. To determine the sequence, four types of reversible terminator bases (RT-bases) are added and non-incorporated nucleotides are washed away. A camera takes images of the fluorescently labeled nucleotides. Then the dye, along with the terminal 3' blocker, is chemically removed from the DNA, allowing for the next cycle to begin. Unlike pyrosequencing, the DNA chains are extended one nucleotide at a time and image acquisition can be performed at a delayed moment, allowing for very large arrays of DNA colonies to be captured by sequential images taken from a single camera.

An Illumina MiSeq sequencer

Decoupling the enzymatic reaction and the image capture allows for optimal throughput and theoretically unlimited sequencing capacity. With an optimal configuration, the ultimately reachable instrument throughput is thus dictated solely by the analog-to-digital conversion rate of the camera, multiplied by the number of cameras and divided by the number of pixels per DNA colony required for visualizing them optimally (approximately 10 pixels/colony). In 2012, with cameras operating at more than 10 MHz A/D conversion rates and available optics, fluidics and enzymatics, throughput can be multiples of 1 million nucleotides/second, corresponding roughly to 1 human genome equivalent at 1x coverage per hour per instrument, and 1 human genome re-sequenced (at approx. 30x) per day per instrument (equipped with a single camera).^[66]

SOLiD sequencing

Library preparation for the SOLiD platform

Applied Biosystems' (now a Life Technologies brand) SOLiD technology employs sequencing by ligation. Here, a pool of all possible oligonucleotides of a fixed length are labeled according to the sequenced position. Oligonucleotides are annealed and ligated; the preferential ligation by DNA ligase for matching sequences results in a signal informative of the nucleotide at that position. Before sequencing, the DNA is amplified by emulsion PCR. The resulting beads, each containing single copies of the same DNA molecule, are deposited on a glass slide.^[67] The result is sequences of quantities and lengths comparable to Illumina sequencing.^[31] This sequencing by ligation method has been reported to have some issue sequencing palindromic sequences.^[63]

Ion Torrent semiconductor sequencing

Ion Torrent Systems Inc. (now owned by Life Technologies) developed a system based on using standard sequencing chemistry, but with a novel, semiconductor based detection system. This method of sequencing is based on the detection of hydrogen ions that are released during the polymerisation of DNA, as opposed to the optical methods used in other sequencing systems. A microwell containing a template DNA strand to be sequenced is flooded with a single type of nucleotide. If the introduced nucleotide is complementary to the leading template nucleotide it is incorporated into the growing complementary strand. This causes the release of a hydrogen ion that triggers a hypersensitive ion sensor, which indicates that a reaction has occurred. If homopolymer repeats are present in the template sequence multiple nucleotides will be incorporated in a single cycle. This leads to a corresponding number of released hydrogens and a proportionally higher electronic signal.^[68]

Sequencing of the TAGGCT template with IonTorrent, PacBioRS and GridION

DNA nanoball sequencing

DNA nanoball sequencing is a type of high throughput sequencing technology used to determine the entire genomic sequence of an organism. The company Complete Genomics uses this technology to sequence samples submitted by independent researchers. The method uses rolling circle replication to amplify small fragments of genomic DNA into DNA nanoballs. Unchained sequencing by ligation is then used to determine the nucleotide sequence.^[69] This method of DNA sequencing allows large numbers of DNA nanoballs to be sequenced per run and at low reagent costs compared to other next generation sequencing platforms.^[70] However, only short sequences of DNA are determined from each DNA nanoball which makes mapping the short reads to a reference genome difficult.^[69] This technology has been used for multiple genome sequencing projects and is scheduled to be used for more.^[71]

Heliscope single molecule sequencing

Heliscope sequencing is a method of single-molecule sequencing developed by Helicos Biosciences. It uses DNA fragments with added poly-A tail adapters which are attached to the flow cell surface. The next steps involve extension-based sequencing with cyclic washes of the flow cell with fluorescently labeled nucleotides (one nucleotide type at a time, as with the Sanger method). The reads are performed by the Heliscope sequencer. The reads are short, up to 55 bases per run, but recent improvements allow for more accurate reads of stretches of one type of nucleotides.^[72]^[73]

This sequencing method and equipment were used to sequence the genome of the M13 bacteriophage.^[74]

Single molecule real time (SMRT) sequencing

SMRT sequencing is based on the sequencing by synthesis approach. The DNA is synthesized in zero-mode wave-guides (ZMWs) – small well-like containers with the capturing tools located at the bottom of the well. The sequencing is performed with use of unmodified polymerase (attached to the ZMW bottom) and fluorescently labelled nucleotides flowing freely in the solution. The wells are constructed in a way that only the fluorescence occurring by the bottom of the well is detected. The fluorescent label is detached from the nucleotide upon its incorporation into the DNA strand, leaving an unmodified DNA strand. According to Pacific Biosciences, the SMRT technology developer, this methodology allows detection of nucleotide modifications (such as cytosine methylation). This happens through the observation of polymerase kinetics. This approach allows reads of 20,000 nucleotides or more, with average read lengths of 5 kilobases.^[58]^[75]

Methods in development

DNA sequencing methods currently under development include labeling the DNA polymerase,^[76] reading the sequence as a DNA strand transits through nanopores,^[77]^[78] and microscopy-based techniques, such as atomic force microscopy or transmission electron microscopy that are used to identify the positions of individual nucleotides within long DNA fragments (>5,000 bp) by nucleotide labeling with heavier elements (e.g., halogens) for visual detection and recording.^[79]^[80] Third generation technologies aim to increase throughput and decrease the time to result and cost by eliminating the need for excessive reagents and harnessing the processivity of DNA polymerase.^[81]

Nanopore DNA sequencing

This method is based on the readout of electrical signals occurring at nucleotides passing by alpha-hemolysin pores covalently bound with cyclodextrin. The DNA passing through the nanopore changes its ion current. This change is dependent on the shape, size and length of the DNA sequence. Each type of the nucleotide blocks the ion flow through the pore for a different period of time. The method has a potential of development as it does not require modified nucleotides, however single nucleotide resolution is not yet available.^[82]

Two main areas of nanopore sequencing in development are solid state nanopore sequencing, and protein based nanopore sequencing. Protein nanopore sequencing utilizes membrane protein complexes ∝-Hemolysin and MspA (Mycobacterium Smegmatis Porin A), which show great promise given their ability to distinguish between individual and groups of nucleotides.^[83] Whereas, solid-state nanopore sequencing utilizes synthetic materials such as silicon nitride and aluminum oxide and it is preferred for its superior mechanical ability and thermal and chemical stability.^[84] The fabrication method is essential for this type of sequencing given that the nanopore array can contain hundreds of pores with diameters smaller than eight nanometers.^[83]

The concept originated from the idea that single stranded DNA or RNA molecules can be electrophoretically driven in a strict linear sequence through a biological pore that can be less than eight nanometers, and can be detected given that the molecules release an ionic current while moving through the pore. The pore contains a detection region capable of recognizing different bases, with each base generating various time specific signals corresponding to the sequence of bases as they cross the pore which are then evaluated.^[84] When implementing this process it is important to note that precise control over the DNA transport through the pore is crucial for success. Various enzymes such as exonucleases and polymerases have been used to moderate this process by positioning them near the pore’s entrance.^[85]

Tunnelling currents DNA sequencing

Another approach uses measurements of the electrical tunnelling currents across single-strand DNA as it moves through a channel. Depending on its electronic structure each base affects the tunnelling current differently, allowing differentiation between different bases.^[86]

The use of tunnelling currents has the potential to sequence orders of magnitude faster than ionic current methods and the sequencing of several DNA oligomers and micro-RNA has already been achieved.^[87]

Sequencing by hybridization

Sequencing by hybridization is a non-enzymatic method that uses a DNA microarray. A single pool of DNA whose sequence is to be determined is fluorescently labeled and hybridized to an array containing known sequences. Strong hybridization signals from a given spot on the array identifies its sequence in the DNA being sequenced.^[88]

This method of sequencing utilizes binding characteristics of a library of short single stranded DNA molecules (oligonucleotides) also called DNA probes to reconstruct a target DNA sequence. Non-specific hybrids are removed by washing and the target DNA is eluted.^[89] Hybrids are re-arranged such that the DNA sequence can be reconstructed. The benefit of this sequencing type is its ability to capture a large number of targets with a homogenous coverage.^[90] Although a large number of chemicals and starting DNA is usually required. But, with the advent of solution-based hybridization much less equipment and chemicals are necessary.^[89]

Sequencing with mass spectrometry

Mass spectrometry may be used to determine DNA sequences. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry, or MALDI-TOF MS, has specifically been investigated as an alternative method to gel electrophoresis for visualizing DNA fragments. With this method, DNA fragments generated by chain-termination sequencing reactions are compared by mass rather than by size. The mass of each nucleotide is different from the others and this difference is detectable by mass spectrometry. Single-nucleotide mutations in a fragment can be more easily detected with MS than by gel electrophoresis alone. MALDI-TOF MS can more easily detect differences between RNA fragments, so researchers may indirectly sequence DNA with MS-based methods by converting it to RNA first.^[91]

The higher resolution of DNA fragments permitted by MS-based methods is of special interest to researchers in forensic science, as they may wish to find single-nucleotide polymorphisms in human DNA samples to identify individuals. These samples may be highly degraded so forensic researchers often prefer mitochondrial DNA for its higher stability and applications for lineage studies. MS-based sequencing methods have been used to compare the sequences of human mitochondrial DNA from samples in a Federal Bureau of Investigation database^[92] and from bones found in mass graves of World War I soldiers.^[93]

Early chain-termination and TOF MS methods demonstrated read lengths of up to 100 base pairs.^[94] Researchers have been unable to exceed this average read size; like chain-termination sequencing alone, MS-based DNA sequencing may not be suitable for large de novo sequencing projects. Even so, a recent study did use the short sequence reads and mass spectroscopy to compare single-nucleotide polymorphisms in pathogenic Streptococcus strains.^[95]

Microfluidic Sanger sequencing

In microfluidic Sanger sequencing the entire thermocycling amplification of DNA fragments as well as their separation by electrophoresis is done on a single glass wafer (approximately 10 cm in diameter) thus reducing the reagent usage as well as cost.^[96] In some instances researchers have shown that they can increase the throughput of conventional sequencing through the use of microchips.^[97] Research will still need to be done in order to make this use of technology effective.

Microscopy-based techniques

This approach directly visualizes the sequence of DNA molecules using electron microscopy. The first identification of DNA base pairs within intact DNA molecules by enzymatically incorporating modified bases, which contain atoms of increased atomic number, direct visualization and identification of individually labeled bases within a synthetic 3,272 base-pair DNA molecule and a 7,249 base-pair viral genome has been demonstrated.^[98]

RNAP sequencing

This method is based on use of RNA polymerase (RNAP), which is attached to a polystyrene bead. One end of DNA to be sequenced is attached to another bead, with both beads being placed in optical traps. RNAP motion during transcription brings the beads in closer and their relative distance changes, which can then be recorded at a single nucleotide resolution. The sequence is deduced based on the four readouts with lowered concentrations of each of the four nucleotide types, similarly to the Sanger method.^[99]

RNA polymerase is attached to one end of a polystyrene bead and the other end is attached to the distal end of a DNA fragment. Each bead is then stuck in to an optical trap that levitates the beads. The interactions between the RNAP and the DNA result in a change in the length of the DNA between the two beads. This change is the measured with precision resulting in a single base resolution on a single DNA molecule. This is then repeated four times where each time there is a lower concentration of one of the four nucleotides, this shares some similarity with the primers used in the Sanger Sequencing method. A comparison is made between regions and sequence information is deduced by comparing the known sequence regions to the unknown sequence regions.^[100]

In vitro virus high-throughput sequencing

A method has been developed to analyze full sets of protein interactions using a combination of 454 pyrosequencing and an in vitro virus mRNA display method. Specifically, this method covalently links proteins of interest to the mRNAs encoding them, then detects the mRNA pieces using reverse transcription PCRs. The mRNA may then be amplified and sequenced. The combined method was titled IVV-HiTSeq and can be performed under cell-free conditions, though its results may not be representative of in vivo conditions.^[101]

Development initiatives

Total cost of sequencing a human genome over time as calculated by the NHGRI.

In October 2006, the X Prize Foundation established an initiative to promote the development of full genome sequencing technologies, called the Archon X Prize, intending to award $10 million to "the first Team that can build a device and use it to sequence 100 human genomes within 10 days or less, with an accuracy of no more than one error in every 100,000 bases sequenced, with sequences accurately covering at least 98% of the genome, and at a recurring cost of no more than $10,000 (US) per genome."^[102]

Each year the National Human Genome Research Institute, or NHGRI, promotes grants for new research and developments in genomics. 2010 grants and 2011 candidates include continuing work in microfluidic, polony and base-heavy sequencing methodologies.^[103]

Computational challenges

The sequencing technologies described here produce raw data that needs to be assembled into longer sequences such as complete genomes (sequence assembly). There are many computational challenges to achieve this, such as the evaluation of the raw sequence data which is done by programs and algorithms such as Phred and Phrap. Other challenges have to deal with repetitive sequences that often prevent complete genome assemblies because they occur in many places of the genome. As a consequence, many sequences may not be assigned to particular chromosomes. The production of raw sequence data is only the beginning of its detailed bioinformatical analysis.^[104] Yet new methods for sequencing and correcting sequencing errors were developed.^[105]

Read Trimming

Sometimes, the raw reads produced by the sequencer are correct and precise only in a fraction of their length. Using the entire read may introduce artifacts in the downstream analyses like genome assembly, snp calling, or gene expression estimation. Two classes of trimming programs have been introduced, based on the window-based or the running-sum classes of algorithms.^[106] This is a partial list of the trimming algorithms currently available, specifying the algorithm class they belong to:

Cutadapt Running sum
ConDeTri Window based
ERNE-FILTER Running sum
FASTX quality trimmer Window based
PRINSEQ Window based
Trimmomatic Window based
SolexaQA Window based
SolexaQA-BWA Running sum
Sickle Window based

References

^ Olsvik O, Wahlberg J, Petterson B, Uhlén M, Popovic T, Wachsmuth IK et al. (January 1993). "Use of automated sequencing of polymerase chain reaction-generated amplicons to identify three types of cholera toxin subunit B in Vibrio cholerae O1 strains". J. Clin. Microbiol. 31 (1): 22–25. PMC 262614. PMID 7678018.
^ Pettersson E, Lundeberg J, Ahmadian A (February 2009). "Generations of sequencing technologies". Genomics 93 (2): 105–11. doi:10.1016/j.ygeno.2008.10.003. PMID 18992322.
^ Watson JD, Crick FH (1953). "The structure of DNA". Cold Spring Harb. Symp. Quant. Biol. 18: 123–31. doi:10.1101/SQB.1953.018.01.020. PMID 13168976.
^ Min Jou W, Haegeman G, Ysebaert M, Fiers W (May 1972). "Nucleotide sequence of the gene coding for the bacteriophage MS2 coat protein". Nature 237 (5350): 82–8. Bibcode:1972Natur.237...82J. doi:10.1038/237082a0. PMID 4555447.
^ Fiers W, Contreras R, Duerinck F, Haegeman G, Iserentant D, Merregaert J et al. (April 1976). "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene". Nature 260 (5551): 500–7. Bibcode:1976Natur.260..500F. doi:10.1038/260500a0. PMID 1264203.
^ . Cornell University http://mbg.cornell.edu/faculty-staff/faculty/wu.cfm. Missing or empty |title= (help)
^ PADMANABHAN, R; Ray Wu; Ernest Jay (June 1974). "Chemical Synthesis of a Primer and Its Use in the Sequence Analysis of the Lysozyme Gene of Bacteriophage T4". Proceedings of the National Academy of Sciences 71 (6): 2510–2514. Bibcode:1974PNAS...71.2510P. doi:10.1073/pnas.71.6.2510.
^ Onaga LA (June 2014). "Ray Wu as Fifth Business: Demonstrating Collective Memory in the History of DNA Sequencing". Studies in the History and Philosophy of Science. Part C 46: 1–14. doi:10.1016/j.shpsc.2013.12.006. PMID 24565976.
^ Wu R (1972). "Nucleotide sequence analysis of DNA". Nature New Biol. 236 (68): 198–200. doi:10.1038/newbio236198a0. PMID 4553110.
^ Padmanabhan R, Wu R (1972). "Nucleotide sequence analysis of DNA. IX. Use of oligonucleotides of defined sequence as primers in DNA sequence analysis". Biochem. Biophys. Res. Commun. 48 (5): 1295–302. PMID 4560009.
^ Wu R, Tu CD, Padmanabhan R (1973). "Nucleotide sequence analysis of DNA. XII. The chemical synthesis and sequence analysis of a dodecadeoxynucleotide which binds to the endolysin gene of bacteriophage lambda". Biochem. Biophys. Res. Commun. 55 (4): 1092–9. PMID 4358929.
^ Jay E, Bambara R, Padmanabhan R, Wu R (March 1974). "DNA sequence analysis: a general, simple and rapid method for sequencing large oligodeoxyribonucleotide fragments by mapping". Nucleic Acids Research 1 (3): 331–353. doi:10.1093/nar/1.3.331. PMC 344020. PMID 10793670.
^ ^a ^b Sanger F, Nicklen S, Coulson AR (December 1977). "DNA sequencing with chain-terminating inhibitors". Proc. Natl. Acad. Sci. U.S.A. 74 (12): 5463–7. Bibcode:1977PNAS...74.5463S. doi:10.1073/pnas.74.12.5463. PMC 431765. PMID 271968.
^ ^a ^b ^c Maxam AM, Gilbert W (February 1977). "A new method for sequencing DNA". Proc. Natl. Acad. Sci. U.S.A. 74 (2): 560–4. Bibcode:1977PNAS...74..560M. doi:10.1073/pnas.74.2.560. PMC 392330. PMID 265521.
^ Gilbert, W. DNA sequencing and gene structure. Nobel lecture, 8 December 1980.
^ Gilbert W, Maxam A (December 1973). "The Nucleotide Sequence of the lac Operator". Proc. Natl. Acad. Sci. U.S.A. 70 (12): 3581–4. Bibcode:1973PNAS...70.3581G. doi:10.1073/pnas.70.12.3581. PMC 427284. PMID 4587255.
^ Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA et al. (February 1977). "Nucleotide sequence of bacteriophage phi X174 DNA". Nature 265 (5596): 687–95. Bibcode:1977Natur.265..687S. doi:10.1038/265687a0. PMID 870828.
^ Beck S, Pohl FM (1984). "DNA sequencing with direct blotting electrophoresis". EMBO J 3 (12): 2905–2909. PMC 557787. PMID 6396083.
^ United States Patent 4,631,122 (1986)
^ Feldmann H, Aigle M, Aljinovic G, André B, Baclet MC, Barthe C et al. (1994). "Complete DNA sequence of yeast chromosome II". EMBO J. 13 (24): 5795–809. PMC 395553. PMID 7813418.
^ Smith LM, Sanders JZ, Kaiser RJ, Hughes P, Dodd C, Connell CR et al. (12 June 1986). "Fluorescence Detection in Automated DNA Sequence Analysis". Nature 321 (6071): 674–79. Bibcode:1986Natur.321..674S. doi:10.1038/321674a0. PMID 3713851.
^ Prober JM, Trainor GL, Dam RJ, Hobbs FW, Robertson CW, Zagursky RJ et al. (16 Oct 1987). "A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides". Science 238 (4825): 336–41. Bibcode:1987Sci...238..336P. doi:10.1126/science.2443975. PMID 2443975.
^ Adams MD, Kelley JM, Gocayne JD, Dubnick M, Polymeropoulos MH, Xiao H et al. (June 1991). "Complementary DNA sequencing: expressed sequence tags and human genome project". Science 252 (5013): 1651–6. Bibcode:1991Sci...252.1651A. doi:10.1126/science.2047873. PMID 2047873.
^ Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR et al. (July 1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science 269 (5223): 496–512. Bibcode:1995Sci...269..496F. doi:10.1126/science.7542800. PMID 7542800.
^ Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J et al. (February 2001). "Initial sequencing and analysis of the human genome". Nature 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. PMID 11237011.
^ Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG et al. (February 2001). "The sequence of the human genome". Science 291 (5507): 1304–51. Bibcode:2001Sci...291.1304V. doi:10.1126/science.1058040. PMID 11181995.
^ Ronaghi M, Karamohamed S, Pettersson B, Uhlén M, Nyrén P (1996). "Real-time DNA sequencing using detection of pyrophosphate release". Analytical Biochemistry 242 (1): 84–9. doi:10.1006/abio.1996.0432. PMID 8923969.
^ ^a ^b Kawashima, Eric H.; Laurent Farinelli; Pascal Mayer (2005-05-12). "Patent: Method of nucleic acid amplification". Retrieved 2012-12-22{{inconsistent citations}}
^ ^a ^b Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D et al. (2000). "Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays". Nature Biotechnology (Nature Biotechnology) 18 (6): 630–634. doi:10.1038/76469. PMID 10835600.
^ Stein RA (1 September 2008). "Next-Generation Sequencing Update". Genetic Engineering & Biotechnology News 28 (15).
^ ^a ^b ^c ^d Schuster SC (January 2008). "Next-generation sequencing transforms today's biology". Nat. Methods 5 (1): 16–8. doi:10.1038/nmeth1156. PMID 18165802.
^ Ewing B, Green P (March 1998). "Base-calling of automated sequencer traces using phred. II. Error probabilities". Genome Res. 8 (3): 186–94. doi:10.1101/gr.8.3.186 (inactive 2015-01-01). PMID 9521922.
^ Sanger F, Coulson AR (May 1975). "A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase". J. Mol. Biol. 94 (3): 441–8. doi:10.1016/0022-2836(75)90213-2. PMID 1100841.
^ Wetterstrand, Kris. "DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP)". National Human Genome Research Institute. Retrieved 30 May 2013.
^ Quail MA, Gu Y, Swerdlow H, Mayho M (2012). "Evaluation and optimisation of preparative semi-automated electrophoresis systems for Illumina library preparation". Electrophoresis 33 (23): 3521–8. doi:10.1002/elps.201200128. PMID 23147856.
^ Duhaime MB, Deng L, Poulos BT, Sullivan MB (2012). "Towards quantitative metagenomics of wild viruses and other ultra-low concentration DNA samples: a rigorous assessment and optimization of the linker amplification method". Environ. Microbiol. 14 (9): 2526–37. doi:10.1111/j.1462-2920.2012.02791.x. PMC 3466414. PMID 22713159.
^ Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE (2012). "Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species". PLoS ONE 7 (5): e37135. doi:10.1371/journal.pone.0037135. PMC 3365034. PMID 22675423.
^ Williams R, Peisajovich SG, Miller OJ, Magdassi S, Tawfik DS, Griffiths AD (2006). "Amplification of complex gene libraries by emulsion PCR". Nature methods 3 (7): 545–550. doi:10.1038/nmeth896. PMID 16791213.
^ ^a ^b Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA et al. (September 2005). "Genome Sequencing in Open Microfabricated High Density Picoliter Reactors". Nature 437 (7057): 376–80. Bibcode:2005Natur.437..376M. doi:10.1038/nature03959. PMC 1464427. PMID 16056220.
^ Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM et al. (2005). "Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome". Science 309 (5741): 1728–32. Bibcode:2005Sci...309.1728S. doi:10.1126/science.1117389. PMID 16081699.
^ Applied Biosystems' SOLiD technology
^ Staden R (11 Jun 1979). "A strategy of DNA sequencing employing computer programs.". Nucleic Acids Research 6 (7): 2601–10. doi:10.1093/nar/6.7.2601. PMC 327874. PMID 461197.
^ P. Mayer,L. Farinelli, G. Matton, C. Adessi, G. Turcatti, J. J. Mermod, E. Kawashima.DNA colony massively parallel sequencing ams98 presentation
^ U.S. Patent 5,641,658
^ Braslavsky I, Hebert B, Kartalov E, Quake SR (April 2003). "Sequence information can be obtained from single DNA molecules". Proc. Natl. Acad. Sci. U.S.A. 100 (7): 3960–4. Bibcode:2003PNAS..100.3960B. doi:10.1073/pnas.0230489100. PMC 153030. PMID 12651960.
^ de Magalhães JP, Finch CE, Janssens G (2010). "Next-generation sequencing in aging research: emerging applications, problems, pitfalls and possible solutions". Ageing Research Reviews 9 (3): 315–323. doi:10.1016/j.arr.2009.10.006. PMC 2878865. PMID 19900591.
^ Hall N (May 2007). "Advanced sequencing technologies and their wider impact in microbiology". J. Exp. Biol. 209 (Pt 9): 1518–1525. doi:10.1242/jeb.001370. PMID 17449817.
^ Church GM (January 2006). "Genomes for all". Sci. Am. 294 (1): 46–54. doi:10.1038/scientificamerican0106-46. PMID 16468433. (subscription required)
^ Kalb, Gilbert; Moxley, Robert (1992). Massively Parallel, Optical, and Neural Computing in the United States. IOS Press. ISBN 90-5199-097-9. ^{[page needed]}
^ ten Bosch JR, Grody WW (2008). "Keeping Up with the Next Generation". The Journal of Molecular Diagnostics 10 (6): 484–492. doi:10.2353/jmoldx.2008.080027. PMC 2570630. PMID 18832462.
^ Tucker T, Marra M, Friedman JM (2009). "Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine". The American Journal of Human Genetics 85 (2): 142–154. doi:10.1016/j.ajhg.2009.06.022. PMC 2725244. PMID 19679224.
^ Quail MA, Smith M, Coupland P, Otto TD, Harris SR, Connor TR et al. (1 January 2012). "A tale of three next generation sequencing platforms: comparison of Ion torrent, pacific biosciences and illumina MiSeq sequencers". BMC Genomics 13 (1): 341. doi:10.1186/1471-2164-13-341. PMC 3431227. PMID 22827831.
^ Liu L, Li Y, Li S, Hu N, He Y, Pong R et al. (1 January 2012). "Comparison of Next-Generation Sequencing Systems". Journal of Biomedicine and Biotechnology (Hindawi Publishing Corporation) 2012: 1–11. doi:10.1155/2012/251364. PMID 22829749.
^ New Products: PacBio's RS II; Cufflinks | In Sequence | Sequencing | GenomeWeb
^ "After a Year of Testing, Two Early PacBio Customers Expect More Routine Use of RS Sequencer in 2012". GenomeWeb. 10 January 2012. (registration required)
^ Pacific Biosciences Introduces New Chemistry With Longer Read Lengths
^ Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C et al. (2013). "Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data". Nat. Methods 10 (6): 563–9. doi:10.1038/nmeth.2474. PMID 23644548.
^ ^a ^b De novo bacterial genome assembly: a solved problem? | In between lines of code
^ Rasko DA, Webster DR, Sahl JW, Bashir A, Boisen N, Scheutz F et al. (25 August 2011). "Origins of the Strain Causing an Outbreak of Hemolytic–Uremic Syndrome in Germany". N Engl J Med 365 (8): 709–717. doi:10.1056/NEJMoa1106920. PMID 21793740.
^ Tran B, Brown AM, Bedard PL, Winquist E, Goss GD, Hotte SJ et al. (1 January 2012). "Feasibility of real time next generation sequencing of cancer genes linked to drug response: Results from a clinical trial". Int. J. Cancer 132 (7): 1547–1555. doi:10.1002/ijc.27817. PMID 22948899. (subscription required)
^ Murray IA, Clark TA, Morgan RD, Boitano M, Anton BP, Luong K et al. (2 October 2012). "The methylomes of six bacteria". Nucleic Acids Research 40 (22): 11450–62. doi:10.1093/nar/gks891. PMC 3526280. PMID 23034806.
^ van Vliet AH (1 January 2010). "Next generation sequencing of microbial transcriptomes: challenges and opportunities". FEMS Microbiology Letters 302 (1): 1–7. doi:10.1111/j.1574-6968.2009.01767.x. PMID 19735299.
^ ^a ^b Huang YF, Chen SC, Chiang YS, Chen TH, Chiu KP (2012). "Palindromic sequence impedes sequencing-by-ligation mechanism". BMC systems biology. 6 Suppl 2: S10. doi:10.1186/1752-0509-6-S2-S10. PMID 23281822.
^ Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM et al. (9 Sep 2005). "Accurate multiplex polony sequencing of an evolved bacterial genome.". Science 309 (5741): 1728–32. Bibcode:2005Sci...309.1728S. doi:10.1126/science.1117389. PMID 16081699.
^ Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG et al. (2008). "Accurate whole human genome sequencing using reversible terminator chemistry". Nature 456 (7218): 53–59. doi:10.1038/nature07517. PMC 2581791. PMID 18987734.
^ Mardis ER (2008). "Next-generation DNA sequencing methods". Annu Rev Genomics Hum Genet 9: 387–402. doi:10.1146/annurev.genom.9.081307.164359. PMID 18576944.
^ Valouev A, Ichikawa J, Tonthat T, Stuart J, Ranade S, Peckham H et al. (July 2008). "A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning". Genome Res. 18 (7): 1051–63. doi:10.1101/gr.076463.108. PMC 2493394. PMID 18477713.
^ Rusk N (2011). "Torrents of sequence". Nat Meth 8 (1): 44–44. doi:10.1038/nmeth.f.330.
^ ^a ^b Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG et al. (2010). "Human Genome Sequencing Using Unchained Base Reads in Self-Assembling DNA Nanoarrays". Science 327 (5961): 78–81. Bibcode:2010Sci...327...78D. doi:10.1126/science.1181498. PMID 19892942.
^ Porreca GJ (2010). "Genome Sequencing on Nanoballs". Nature Biotechnology 28 (1): 43–44. doi:10.1038/nbt0110-43. PMID 20062041.
^ Complete Genomics Press release, 2010
^ HeliScope Gene Sequencing / Genetic Analyzer System : Helicos BioSciences
^ Thompson JF, Steinmann KE (October 2010). "Single molecule sequencing with a HeliScope genetic analysis system.". Current Protocols in Molecular Biology. Chapter 7: Unit7.10. doi:10.1002/0471142727.mb0710s92. PMC 2954431. PMID 20890904.
^ Harris TD, Buzby PR, Babcock H, Beer E, Bowers J, Braslavsky I et al. (4 Apr 2008). "Single-molecule DNA sequencing of a viral genome.". Science 320 (5872): 106–9. Bibcode:2008Sci...320..106H. doi:10.1126/science.1150427. PMID 18388294.
^ PacBio Sales Start to Pick Up as Company Delivers on Product Enhancements | In Sequence | Sequencing | GenomeWeb
^ "VisiGen Biotechnologies Inc. – Technology Overview". Visigenbio.com. Retrieved 2009-11-15.
^ "The Harvard Nanopore Group". Mcb.harvard.edu. Retrieved 2009-11-15.
^ "Nanopore Sequencing Could Slash DNA Analysis Costs".
^ US patent 20060029957, ZS Genetics, "Systems and methods of analyzing nucleic acid polymers and related components", issued 2005-07-14
^ Xu M, Fujita D, Hanagata N (December 2009). "Perspectives and challenges of emerging single-molecule DNA sequencing technologies". Small 5 (23): 2638–49. doi:10.1002/smll.200900976. PMID 19904762.
^ Schadt EE, Turner S, Kasarskis A (2010). "A window into third-generation sequencing". Human Molecular Genetics 19 (R2): R227–40. doi:10.1093/hmg/ddq416. PMID 20858600.
^ Stoddart D, Heron AJ, Mikhailova E, Maglia G, Bayley H (12 May 2009). "Single-nucleotide discrimination in immobilized DNA oligonucleotides with a biological nanopore". Proceedings of the National Academy of Sciences of the United States of America 106 (19): 7702–7. Bibcode:2009PNAS..106.7702S. doi:10.1073/pnas.0901054106. PMC 2683137. PMID 19380741.
^ ^a ^b dela Torre R, Larkin J, Singer A, Meller A (2012). "Fabrication and characterization of solid-state nanopore arrays for high-throughput DNA sequencing". Nanotechnology 23 (38): 385308. Bibcode:2012Nanot..23L5308D. doi:10.1088/0957-4484/23/38/385308. PMC 3557807. PMID 22948520.
^ ^a ^b Pathak, B., Lofas, H., Prasongkit, J., Grigoriev, A., Ahuja, R., & Scheicher, R. H. (9 January 2012). Double-functionalized nanopore-embedded gold electrodes for rapid DNA sequencing. Applied Physics Letters, 100, 2.)
^ Korlach J, Marks PJ, Cicero RL, Gray JJ, Murphy DL, Roitman DB et al. (2008). "Selective aluminum passivation for targeted immobilization of single DNA polymerase molecules in zero-mode waveguide nanostructures". Proceedings of the National Academy of Sciences 105 (4): 1176–1181. Bibcode:2008PNAS..105.1176K. doi:10.1073/pnas.0710982105. PMC 2234111. PMID 18216253.
^ Di Ventra M (2013). "Fast DNA sequencing by electrical means inches closer". Nanotechnology 24 (34): 342501. Bibcode:2013Nanot..24H2501D. doi:10.1088/0957-4484/24/34/342501. PMID 23899780.
^ Ohshiro T, Matsubara K, Tsutsui M, Furuhashi M, Taniguchi M, Kawai T (2012). "Single-molecule electrical random resequencing of DNA and RNA". Sci Rep 2: 501. doi:10.1038/srep00501. PMC 3392642. PMID 22787559.
^ Hanna GJ, Johnson VA, Kuritzkes DR, Richman DD, Martinez-Picado J, Sutton L et al. (1 July 2000). "Comparison of Sequencing by Hybridization and Cycle Sequencing for Genotyping of Human Immunodeficiency Virus Type 1 Reverse Transcriptase". J. Clin. Microbiol. 38 (7): 2715–21. PMC 87006. PMID 10878069.
^ ^a ^b Morey M, Fernández-Marmiesse A, Castiñeiras D, Fraga JM, Couce ML, Cocho JA (2013). "A glimpse into past, present, and future DNA sequencing". Molecular Genetics and Metabolism 110 (1–2): 3–24. doi:10.1016/j.ymgme.2013.04.024. PMID 23742747.
^ Qin Y, Schneider TM, Brenner MP (2012). Gibas, Cynthia, ed. "Sequencing by Hybridization of Long Targets". PLoS ONE 7 (5): e35819. Bibcode:2012PLoSO...735819Q. doi:10.1371/journal.pone.0035819. PMC 3344849. PMID 22574124.
^ Edwards JR, Ruparel H, Ju J (2005). "Mass-spectrometry DNA sequencing". Mutation Research 573 (1–2): 3–12. doi:10.1016/j.mrfmmm.2004.07.021. PMID 15829234.
^ Hall TA, Budowle B, Jiang Y, Blyn L, Eshoo M, Sannes-Lowery KA et al. (2005). "Base composition analysis of human mitochondrial DNA using electrospray ionization mass spectrometry: A novel tool for the identification and differentiation of humans". Analytical Biochemistry 344 (1): 53–69. doi:10.1016/j.ab.2005.05.028. PMID 16054106.
^ Howard R, Encheva V, Thomson J, Bache K, Chan YT, Cowen S et al. (15 Jun 2011). "Comparative analysis of human mitochondrial DNA from World War I bone samples by DNA sequencing and ESI-TOF mass spectrometry". Forensic science international. Genetics 7 (1): 1–9. doi:10.1016/j.fsigen.2011.05.009. PMID 21683667.
^ Monforte JA, Becker CH (1 March 1997). "High-throughput DNA analysis by time-of-flight mass spectrometry". Nature Medicine 3 (3): 360–362. doi:10.1038/nm0397-360. PMID 9055869.
^ Beres SB, Carroll RK, Shea PR, Sitkiewicz I, Martinez-Gutierrez JC, Low DE et al. (8 February 2010). "Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics". Proceedings of the National Academy of Sciences 107 (9): 4371–4376. Bibcode:2010PNAS..107.4371B. doi:10.1073/pnas.0911295107. PMC 2840111. PMID 20142485.
^ Kan CW, Fredlake CP, Doherty EA, Barron AE (1 November 2004). "DNA sequencing and genotyping in miniaturized electrophoresis systems". Electrophoresis 25 (21–22): 3564–3588. doi:10.1002/elps.200406161. PMID 15565709.
^ Chen YJ, Roller EE, Huang X (2010). "DNA sequencing by denaturation: experimental proof of concept with an integrated fluidic device". Lab on Chip 10 (9): 1153–1159. doi:10.1039/b921417h. PMC 2881221. PMID 20390134.
^ Bell DC, Thomas WK, Murtagh KM, Dionne CA, Graham AC, Anderson JE et al. (9 Oct 2012). "DNA Base Identification by Electron Microscopy". Microscopy and microanalysis : the official journal of Microscopy Society of America, Microbeam Analysis Society, Microscopical Society of Canada 18 (5): 1–5. Bibcode:2012MiMic..18.1049B. doi:10.1017/S1431927612012615. PMID 23046798.
^ Pareek CS, Smoczynski R, Tretyn A (November 2011). "Sequencing technologies and genome sequencing". Journal of applied genetics 52 (4): 413–35. doi:10.1007/s13353-011-0057-x. PMC 3189340. PMID 21698376.
^ Pareek CS, Smoczynski R, Tretyn A (2011). "Sequencing technologies and genome sequencing". Journal of Applied Genetics 52 (4): 413–435. doi:10.1007/s13353-011-0057-x. PMC 3189340. PMID 21698376.
^ Fujimori S, Hirai N, Ohashi H, Masuoka K, Nishikimi A, Fukui Y et al. (2012). "Next-generation sequencing coupled with a cell-free display technology for high-throughput production of reliable interactome data". Scientific reports 2: 691. Bibcode:2012NatSR...2E.691F. doi:10.1038/srep00691. PMC 3466446. PMID 23056904.
^ "PRIZE Overview: Archon X PRIZE for Genomics"
^ The Future of DNA Sequencing
^ Severin J, Lizio M, Harshbarger J, Kawaji H, Daub CO, Hayashizaki Y et al. (2014). "Interactive visualization and analysis of large-scale sequencing datasets using ZENBU". Nat. Biotechnol. 32 (3): 217–9. doi:10.1038/nbt.2840. PMID 24727769.
^ Shmilovici A,Ben-Gal I (2007). "Using a VOM model for reconstructing potential coding regions in EST sequences". Computational Statistics 22 (1): 49–69. doi:10.1007/s00180-007-0021-8.
^ Del Fabbro C, Scalabrin S, Morgante M, Giorgi FM (2013). "An Extensive Evaluation of Read Trimming Effects on Illumina NGS Data Analysis". PLoS ONE 8 (12): e85024. Bibcode:2013PLoSO...885024D. doi:10.1371/journal.pone.0085024. PMC 3871669. PMID 24376861.

External links

Wikibooks has a book on the topic of: Next Generation Sequencing (NGS)

A wikibook on next generation sequencing
A free didactic directory for DNA sequencing analysis.

UpToDate Contents

全文を閲覧するには購読必要です。 To read the full text you will need to subscribe.

1. 次世代DNAシークエンシングの原理および臨床応用 principles and clinical applications of next generation dna sequencing
2. 小頭症に対する臨床遺伝学的アプローチ microcephaly a clinical genetics approach
3. 遺伝子検査での二次的所見 secondary findings from genetic testing
4. オーダーメイド医療 personalized medicine
5. 先天異常：評価アプローチ birth defects approach to evaluation

English Journal

Methods for cancer epigenome analysis.

Nagarajan RP, Fouse SD, Bell RJ, Costello JF.SourceUniversity of California, San Francisco, CA, USA.
Advances in experimental medicine and biology.Adv Exp Med Biol.2013;754:313-38.
Accurate detection of epimutations in tumor cells is crucial for -understanding the molecular pathogenesis of cancer. Alterations in DNA methylation in cancer are functionally important and clinically relevant, but even this well-studied area is continually re-evaluated in light of unanticipated res
PMID 22956508

Rapid DNA extraction protocol for detection of alpha-1 antitrypsin deficiency from dried blood spots by real-time PCR.

Struniawski R, Szpechcinski A, Poplawska B, Skronski M, Chorostowska-Wynimko J.SourceLaboratory of Molecular Diagnostics and Immunology, National Institute of Tuberculosis and Lung Diseases, Warsaw, Poland.
Advances in experimental medicine and biology.Adv Exp Med Biol.2013;756:29-37.
The dried blood spot (DBS) specimens have been successfully employed for the large-scale diagnostics of α1-antitrypsin (AAT) deficiency as an easy to collect and transport alternative to plasma/serum. In the present study we propose a fast, efficient, and cost effective protocol of DNA extraction f
PMID 22836616

Japanese Journal

Recent advances in forest tree biotechnology

Suzuki Shiro,Suzuki Hideyuki
Plant Biotechnology 31(1), 1-9, 2014-03
NAID 40020023798

MePIC, Metagenomic Pathogen Identification for Clinical Specimens

Takeuchi Fumihiko,Sekizuka Tsuyoshi,Yamashita Akifumi [他],Ogasawara Yumiko,Mizuta Katsumi,Kuroda Makoto
Japanese Journal of Infectious Diseases 67(1), 62-65, 2014
… Next-generation DNA sequencing technologies have led to a new method of identifying the causative agents of infectious diseases. … Last, the percentages of the organisms' genomic sequences in the specimen (i.e., the metagenome) are estimated, and the pathogen is identified. …
NAID 130003399236

Recent advances in forest tree biotechnology

Suzuki Shiro,Suzuki Hideyuki
Plant Biotechnology, 2014
… In the last decade, forest tree biotechnology has considerably progressed: genomic sequences of several forest tree species have been decoded, efficient Agrobacterim-mediated genetic transformation and regeneration systems have been established in a number of forest tree species, and many reports have been published on the metabolic engineering of a major wood component, lignin, in forest trees. …
NAID 130003391366

★リンクテーブル★

リンク元	「ゲノム配列決定」「ゲノムシークエンシング」
関連記事	「sequencing」「genomic」

「ゲノム配列決定」

	Library resources about; DNA sequencing
	Resources in your library; Resources in other libraries;

　　[★]

英: genomic sequencing
関: ゲノムシークエンシング

「ゲノムシークエンシング」

　　[★]

英: genomic sequencing
関: ゲノム配列決定

「sequencing」

　　[★]

n.

配列決定、塩基配列決定、塩基配列決定法、シークエンシング

関: sequence

「genomic」

　　[★]

ゲノムの

関: genome、genomically

[olsvik1993-1] Olsvik O, Wahlberg J, Petterson B, Uhlén M, Popovic T, Wachsmuth IK et al. (January 1993). "Use of automated sequencing of polymerase chain reaction-generated amplicons to identify three types of cholera toxin subunit B in Vibrio cholerae O1 strains". J. Clin. Microbiol. 31 (1): 22–25. PMC 262614. PMID 7678018.

[pmid18992322-2] Pettersson E, Lundeberg J, Ahmadian A (February 2009). "Generations of sequencing technologies". Genomics 93 (2): 105–11. doi:10.1016/j.ygeno.2008.10.003. PMID 18992322.

[pmid13168976-3] Watson JD, Crick FH (1953). "The structure of DNA". Cold Spring Harb. Symp. Quant. Biol. 18: 123–31. doi:10.1101/SQB.1953.018.01.020. PMID 13168976.

[4] Min Jou W, Haegeman G, Ysebaert M, Fiers W (May 1972). "Nucleotide sequence of the gene coding for the bacteriophage MS2 coat protein". Nature 237 (5350): 82–8. Bibcode:1972Natur.237...82J. doi:10.1038/237082a0. PMID 4555447.

[5] Fiers W, Contreras R, Duerinck F, Haegeman G, Iserentant D, Merregaert J et al. (April 1976). "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene". Nature 260 (5551): 500–7. Bibcode:1976Natur.260..500F. doi:10.1038/260500a0. PMID 1264203.

[6] . Cornell University http://mbg.cornell.edu/faculty-staff/faculty/wu.cfm. Missing or empty |title= (help)

[7] PADMANABHAN, R; Ray Wu; Ernest Jay (June 1974). "Chemical Synthesis of a Primer and Its Use in the Sequence Analysis of the Lysozyme Gene of Bacteriophage T4". Proceedings of the National Academy of Sciences 71 (6): 2510–2514. Bibcode:1974PNAS...71.2510P. doi:10.1073/pnas.71.6.2510.

[8] Onaga LA (June 2014). "Ray Wu as Fifth Business: Demonstrating Collective Memory in the History of DNA Sequencing". Studies in the History and Philosophy of Science. Part C 46: 1–14. doi:10.1016/j.shpsc.2013.12.006. PMID 24565976.

[pmid4553110-9] Wu R (1972). "Nucleotide sequence analysis of DNA". Nature New Biol. 236 (68): 198–200. doi:10.1038/newbio236198a0. PMID 4553110.

[pmid4560009-10] Padmanabhan R, Wu R (1972). "Nucleotide sequence analysis of DNA. IX. Use of oligonucleotides of defined sequence as primers in DNA sequence analysis". Biochem. Biophys. Res. Commun. 48 (5): 1295–302. PMID 4560009.

[pmid4358929-11] Wu R, Tu CD, Padmanabhan R (1973). "Nucleotide sequence analysis of DNA. XII. The chemical synthesis and sequence analysis of a dodecadeoxynucleotide which binds to the endolysin gene of bacteriophage lambda". Biochem. Biophys. Res. Commun. 55 (4): 1092–9. PMID 4358929.

[12] Jay E, Bambara R, Padmanabhan R, Wu R (March 1974). "DNA sequence analysis: a general, simple and rapid method for sequencing large oligodeoxyribonucleotide fragments by mapping". Nucleic Acids Research 1 (3): 331–353. doi:10.1093/nar/1.3.331. PMC 344020. PMID 10793670.

[Sanger1977-13] Sanger F, Nicklen S, Coulson AR (December 1977). "DNA sequencing with chain-terminating inhibitors". Proc. Natl. Acad. Sci. U.S.A. 74 (12): 5463–7. Bibcode:1977PNAS...74.5463S. doi:10.1073/pnas.74.12.5463. PMC 431765. PMID 271968.

[Maxam77-14] Maxam AM, Gilbert W (February 1977). "A new method for sequencing DNA". Proc. Natl. Acad. Sci. U.S.A. 74 (2): 560–4. Bibcode:1977PNAS...74..560M. doi:10.1073/pnas.74.2.560. PMC 392330. PMID 265521.

[15] Gilbert, W. DNA sequencing and gene structure. Nobel lecture, 8 December 1980.

[16] Gilbert W, Maxam A (December 1973). "The Nucleotide Sequence of the lac Operator". Proc. Natl. Acad. Sci. U.S.A. 70 (12): 3581–4. Bibcode:1973PNAS...70.3581G. doi:10.1073/pnas.70.12.3581. PMC 427284. PMID 4587255.

[17] Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA et al. (February 1977). "Nucleotide sequence of bacteriophage phi X174 DNA". Nature 265 (5596): 687–95. Bibcode:1977Natur.265..687S. doi:10.1038/265687a0. PMID 870828.

[18] Beck S, Pohl FM (1984). "DNA sequencing with direct blotting electrophoresis". EMBO J 3 (12): 2905–2909. PMC 557787. PMID 6396083.

[19] United States Patent 4,631,122 (1986)

[Feldmann_1994-20] Feldmann H, Aigle M, Aljinovic G, André B, Baclet MC, Barthe C et al. (1994). "Complete DNA sequence of yeast chromosome II". EMBO J. 13 (24): 5795–809. PMC 395553. PMID 7813418.

[21] Smith LM, Sanders JZ, Kaiser RJ, Hughes P, Dodd C, Connell CR et al. (12 June 1986). "Fluorescence Detection in Automated DNA Sequence Analysis". Nature 321 (6071): 674–79. Bibcode:1986Natur.321..674S. doi:10.1038/321674a0. PMID 3713851.

[22] Prober JM, Trainor GL, Dam RJ, Hobbs FW, Robertson CW, Zagursky RJ et al. (16 Oct 1987). "A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides". Science 238 (4825): 336–41. Bibcode:1987Sci...238..336P. doi:10.1126/science.2443975. PMID 2443975.

[pmid2047873-23] Adams MD, Kelley JM, Gocayne JD, Dubnick M, Polymeropoulos MH, Xiao H et al. (June 1991). "Complementary DNA sequencing: expressed sequence tags and human genome project". Science 252 (5013): 1651–6. Bibcode:1991Sci...252.1651A. doi:10.1126/science.2047873. PMID 2047873.

[24] Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR et al. (July 1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science 269 (5223): 496–512. Bibcode:1995Sci...269..496F. doi:10.1126/science.7542800. PMID 7542800.

[Lander_2001-25] Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J et al. (February 2001). "Initial sequencing and analysis of the human genome". Nature 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. PMID 11237011.

[Venter_2001-26] Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG et al. (February 2001). "The sequence of the human genome". Science 291 (5507): 1304–51. Bibcode:2001Sci...291.1304V. doi:10.1126/science.1058040. PMID 11181995.

[Ronaghi-27] Ronaghi M, Karamohamed S, Pettersson B, Uhlén M, Nyrén P (1996). "Real-time DNA sequencing using detection of pyrophosphate release". Analytical Biochemistry 242 (1): 84–9. doi:10.1006/abio.1996.0432. PMID 8923969.

[DNA_colony_patents-28] Kawashima, Eric H.; Laurent Farinelli; Pascal Mayer (2005-05-12). "Patent: Method of nucleic acid amplification". Retrieved 2012-12-22{{inconsistent citations}}

[Brenner_2000-29] Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D et al. (2000). "Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays". Nature Biotechnology (Nature Biotechnology) 18 (6): 630–634. doi:10.1038/76469. PMID 10835600.

[Stein_2008-30] Stein RA (1 September 2008). "Next-Generation Sequencing Update". Genetic Engineering & Biotechnology News 28 (15).

[pmid18165802-31] Schuster SC (January 2008). "Next-generation sequencing transforms today's biology". Nat. Methods 5 (1): 16–8. doi:10.1038/nmeth1156. PMID 18165802.

[32] Ewing B, Green P (March 1998). "Base-calling of automated sequencer traces using phred. II. Error probabilities". Genome Res. 8 (3): 186–94. doi:10.1101/gr.8.3.186 (inactive 2015-01-01). PMID 9521922.

[Sanger75-33] Sanger F, Coulson AR (May 1975). "A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase". J. Mol. Biol. 94 (3): 441–8. doi:10.1016/0022-2836(75)90213-2. PMID 1100841.

[34] Wetterstrand, Kris. "DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP)". National Human Genome Research Institute. Retrieved 30 May 2013.

[pmid23147856-35] Quail MA, Gu Y, Swerdlow H, Mayho M (2012). "Evaluation and optimisation of preparative semi-automated electrophoresis systems for Illumina library preparation". Electrophoresis 33 (23): 3521–8. doi:10.1002/elps.201200128. PMID 23147856.

[pmid22713159-36] Duhaime MB, Deng L, Poulos BT, Sullivan MB (2012). "Towards quantitative metagenomics of wild viruses and other ultra-low concentration DNA samples: a rigorous assessment and optimization of the linker amplification method". Environ. Microbiol. 14 (9): 2526–37. doi:10.1111/j.1462-2920.2012.02791.x. PMC 3466414. PMID 22713159.

[pmid22675423-37] Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE (2012). "Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species". PLoS ONE 7 (5): e37135. doi:10.1371/journal.pone.0037135. PMC 3365034. PMID 22675423.

[Williams2006ePCR-38] Williams R, Peisajovich SG, Miller OJ, Magdassi S, Tawfik DS, Griffiths AD (2006). "Amplification of complex gene libraries by emulsion PCR". Nature methods 3 (7): 545–550. doi:10.1038/nmeth896. PMID 16791213.

[Margulies_2005-39] Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA et al. (September 2005). "Genome Sequencing in Open Microfabricated High Density Picoliter Reactors". Nature 437 (7057): 376–80. Bibcode:2005Natur.437..376M. doi:10.1038/nature03959. PMC 1464427. PMID 16056220.

[polony_sequencing-40] Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM et al. (2005). "Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome". Science 309 (5741): 1728–32. Bibcode:2005Sci...309.1728S. doi:10.1126/science.1117389. PMID 16081699.

[solid_sequencing-41] Applied Biosystems' SOLiD technology

[42] Staden R (11 Jun 1979). "A strategy of DNA sequencing employing computer programs.". Nucleic Acids Research 6 (7): 2601–10. doi:10.1093/nar/6.7.2601. PMC 327874. PMID 461197.

[DNA_colony_presentation-43] P. Mayer,L. Farinelli, G. Matton, C. Adessi, G. Turcatti, J. J. Mermod, E. Kawashima.DNA colony massively parallel sequencing ams98 presentation

[Mosaic_patent-44] U.S. Patent 5,641,658

[45] Braslavsky I, Hebert B, Kartalov E, Quake SR (April 2003). "Sequence information can be obtained from single DNA molecules". Proc. Natl. Acad. Sci. U.S.A. 100 (7): 3960–4. Bibcode:2003PNAS..100.3960B. doi:10.1073/pnas.0230489100. PMC 153030. PMID 12651960.

[pmid19900591-46] Magalhães JP, Finch CE, Janssens G (2010). "Next-generation sequencing in aging research: emerging applications, problems, pitfalls and possible solutions". Ageing Research Reviews 9 (3): 315–323. doi:10.1016/j.arr.2009.10.006. PMC 2878865. PMID 19900591.

[hall2007-47] Hall N (May 2007). "Advanced sequencing technologies and their wider impact in microbiology". J. Exp. Biol. 209 (Pt 9): 1518–1525. doi:10.1242/jeb.001370. PMID 17449817.

[church2006-48] Church GM (January 2006). "Genomes for all". Sci. Am. 294 (1): 46–54. doi:10.1038/scientificamerican0106-46. PMID 16468433. (subscription required)

[kalb1992-49] Kalb, Gilbert; Moxley, Robert (1992). Massively Parallel, Optical, and Neural Computing in the United States. IOS Press. ISBN 90-5199-097-9. ^{[page needed]}

[tenBosch2008-50] ten Bosch JR, Grody WW (2008). "Keeping Up with the Next Generation". The Journal of Molecular Diagnostics 10 (6): 484–492. doi:10.2353/jmoldx.2008.080027. PMC 2570630. PMID 18832462.

[Tucker2009-51] Tucker T, Marra M, Friedman JM (2009). "Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine". The American Journal of Human Genetics 85 (2): 142–154. doi:10.1016/j.ajhg.2009.06.022. PMC 2725244. PMID 19679224.

[quail2012-52] Quail MA, Smith M, Coupland P, Otto TD, Harris SR, Connor TR et al. (1 January 2012). "A tale of three next generation sequencing platforms: comparison of Ion torrent, pacific biosciences and illumina MiSeq sequencers". BMC Genomics 13 (1): 341. doi:10.1186/1471-2164-13-341. PMC 3431227. PMID 22827831.

[lin2012-53] Liu L, Li Y, Li S, Hu N, He Y, Pong R et al. (1 January 2012). "Comparison of Next-Generation Sequencing Systems". Journal of Biomedicine and Biotechnology (Hindawi Publishing Corporation) 2012: 1–11. doi:10.1155/2012/251364. PMID 22829749.

[54] New Products: PacBio's RS II; Cufflinks | In Sequence | Sequencing | GenomeWeb

[autogenerated1-55] "After a Year of Testing, Two Early PacBio Customers Expect More Routine Use of RS Sequencer in 2012". GenomeWeb. 10 January 2012. (registration required)

[56] Pacific Biosciences Introduces New Chemistry With Longer Read Lengths

[pmid23644548-57] Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C et al. (2013). "Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data". Nat. Methods 10 (6): 563–9. doi:10.1038/nmeth.2474. PMID 23644548.

[flxlexblog.wordpress.com-58] De novo bacterial genome assembly: a solved problem? | In between lines of code

[rasko2011-59] Rasko DA, Webster DR, Sahl JW, Bashir A, Boisen N, Scheutz F et al. (25 August 2011). "Origins of the Strain Causing an Outbreak of Hemolytic–Uremic Syndrome in Germany". N Engl J Med 365 (8): 709–717. doi:10.1056/NEJMoa1106920. PMID 21793740.

[tran2012-60] Tran B, Brown AM, Bedard PL, Winquist E, Goss GD, Hotte SJ et al. (1 January 2012). "Feasibility of real time next generation sequencing of cancer genes linked to drug response: Results from a clinical trial". Int. J. Cancer 132 (7): 1547–1555. doi:10.1002/ijc.27817. PMID 22948899. (subscription required)

[61] Murray IA, Clark TA, Morgan RD, Boitano M, Anton BP, Luong K et al. (2 October 2012). "The methylomes of six bacteria". Nucleic Acids Research 40 (22): 11450–62. doi:10.1093/nar/gks891. PMC 3526280. PMID 23034806.

[vliet2010-62] van Vliet AH (1 January 2010). "Next generation sequencing of microbial transcriptomes: challenges and opportunities". FEMS Microbiology Letters 302 (1): 1–7. doi:10.1111/j.1574-6968.2009.01767.x. PMID 19735299.

[Yu-Feng_Huang.2C_Sheng-Chung_Chen.2C_Yih-Shien_Chiang.2C_Tzu-Han_Chen_.26_Kuo-Ping_Chiu_2012_S10-63] Huang YF, Chen SC, Chiang YS, Chen TH, Chiu KP (2012). "Palindromic sequence impedes sequencing-by-ligation mechanism". BMC systems biology. 6 Suppl 2: S10. doi:10.1186/1752-0509-6-S2-S10. PMID 23281822.

[64] Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM et al. (9 Sep 2005). "Accurate multiplex polony sequencing of an evolved bacterial genome.". Science 309 (5741): 1728–32. Bibcode:2005Sci...309.1728S. doi:10.1126/science.1117389. PMID 16081699.

[Bentley_2008-65] Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG et al. (2008). "Accurate whole human genome sequencing using reversible terminator chemistry". Nature 456 (7218): 53–59. doi:10.1038/nature07517. PMC 2581791. PMID 18987734.

[pmid18576944-66] Mardis ER (2008). "Next-generation DNA sequencing methods". Annu Rev Genomics Hum Genet 9: 387–402. doi:10.1146/annurev.genom.9.081307.164359. PMID 18576944.

[pmid18477713-67] Valouev A, Ichikawa J, Tonthat T, Stuart J, Ranade S, Peckham H et al. (July 2008). "A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning". Genome Res. 18 (7): 1051–63. doi:10.1101/gr.076463.108. PMC 2493394. PMID 18477713.

[rusk-68] Rusk N (2011). "Torrents of sequence". Nat Meth 8 (1): 44–44. doi:10.1038/nmeth.f.330.

[Drmanac_2010-69] Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG et al. (2010). "Human Genome Sequencing Using Unchained Base Reads in Self-Assembling DNA Nanoarrays". Science 327 (5961): 78–81. Bibcode:2010Sci...327...78D. doi:10.1126/science.1181498. PMID 19892942.

[70] Porreca GJ (2010). "Genome Sequencing on Nanoballs". Nature Biotechnology 28 (1): 43–44. doi:10.1038/nbt0110-43. PMID 20062041.

[71] Complete Genomics Press release, 2010

[72] HeliScope Gene Sequencing / Genetic Analyzer System : Helicos BioSciences

[73] Thompson JF, Steinmann KE (October 2010). "Single molecule sequencing with a HeliScope genetic analysis system.". Current Protocols in Molecular Biology. Chapter 7: Unit7.10. doi:10.1002/0471142727.mb0710s92. PMC 2954431. PMID 20890904.

[74] Harris TD, Buzby PR, Babcock H, Beer E, Bowers J, Braslavsky I et al. (4 Apr 2008). "Single-molecule DNA sequencing of a viral genome.". Science 320 (5872): 106–9. Bibcode:2008Sci...320..106H. doi:10.1126/science.1150427. PMID 18388294.

[75] PacBio Sales Start to Pick Up as Company Delivers on Product Enhancements | In Sequence | Sequencing | GenomeWeb

[76] "VisiGen Biotechnologies Inc. – Technology Overview". Visigenbio.com. Retrieved 2009-11-15.

[77] "The Harvard Nanopore Group". Mcb.harvard.edu. Retrieved 2009-11-15.

[Physorg-78] "Nanopore Sequencing Could Slash DNA Analysis Costs".

[79] US patent 20060029957, ZS Genetics, "Systems and methods of analyzing nucleic acid polymers and related components", issued 2005-07-14

[80] Xu M, Fujita D, Hanagata N (December 2009). "Perspectives and challenges of emerging single-molecule DNA sequencing technologies". Small 5 (23): 2638–49. doi:10.1002/smll.200900976. PMID 19904762.

[81] Schadt EE, Turner S, Kasarskis A (2010). "A window into third-generation sequencing". Human Molecular Genetics 19 (R2): R227–40. doi:10.1093/hmg/ddq416. PMID 20858600.

[82] Stoddart D, Heron AJ, Mikhailova E, Maglia G, Bayley H (12 May 2009). "Single-nucleotide discrimination in immobilized DNA oligonucleotides with a biological nanopore". Proceedings of the National Academy of Sciences of the United States of America 106 (19): 7702–7. Bibcode:2009PNAS..106.7702S. doi:10.1073/pnas.0901054106. PMC 2683137. PMID 19380741.

[Torre_2012-83] Torre R, Larkin J, Singer A, Meller A (2012). "Fabrication and characterization of solid-state nanopore arrays for high-throughput DNA sequencing". Nanotechnology 23 (38): 385308. Bibcode:2012Nanot..23L5308D. doi:10.1088/0957-4484/23/38/385308. PMC 3557807. PMID 22948520.

[Pathak_2012-84] Pathak, B., Lofas, H., Prasongkit, J., Grigoriev, A., Ahuja, R., & Scheicher, R. H. (9 January 2012). Double-functionalized nanopore-embedded gold electrodes for rapid DNA sequencing. Applied Physics Letters, 100, 2.)

[Korlach_2008-85] Korlach J, Marks PJ, Cicero RL, Gray JJ, Murphy DL, Roitman DB et al. (2008). "Selective aluminum passivation for targeted immobilization of single DNA polymerase molecules in zero-mode waveguide nanostructures". Proceedings of the National Academy of Sciences 105 (4): 1176–1181. Bibcode:2008PNAS..105.1176K. doi:10.1073/pnas.0710982105. PMC 2234111. PMID 18216253.

[86] Di Ventra M (2013). "Fast DNA sequencing by electrical means inches closer". Nanotechnology 24 (34): 342501. Bibcode:2013Nanot..24H2501D. doi:10.1088/0957-4484/24/34/342501. PMID 23899780.

[pmid22787559-87] Ohshiro T, Matsubara K, Tsutsui M, Furuhashi M, Taniguchi M, Kawai T (2012). "Single-molecule electrical random resequencing of DNA and RNA". Sci Rep 2: 501. doi:10.1038/srep00501. PMC 3392642. PMID 22787559.

[88] Hanna GJ, Johnson VA, Kuritzkes DR, Richman DD, Martinez-Picado J, Sutton L et al. (1 July 2000). "Comparison of Sequencing by Hybridization and Cycle Sequencing for Genotyping of Human Immunodeficiency Virus Type 1 Reverse Transcriptase". J. Clin. Microbiol. 38 (7): 2715–21. PMC 87006. PMID 10878069.

[Morey-89] Morey M, Fernández-Marmiesse A, Castiñeiras D, Fraga JM, Couce ML, Cocho JA (2013). "A glimpse into past, present, and future DNA sequencing". Molecular Genetics and Metabolism 110 (1–2): 3–24. doi:10.1016/j.ymgme.2013.04.024. PMID 23742747.

[Qin-90] Qin Y, Schneider TM, Brenner MP (2012). Gibas, Cynthia, ed. "Sequencing by Hybridization of Long Targets". PLoS ONE 7 (5): e35819. Bibcode:2012PLoSO...735819Q. doi:10.1371/journal.pone.0035819. PMC 3344849. PMID 22574124.

[91] Edwards JR, Ruparel H, Ju J (2005). "Mass-spectrometry DNA sequencing". Mutation Research 573 (1–2): 3–12. doi:10.1016/j.mrfmmm.2004.07.021. PMID 15829234.

[92] Hall TA, Budowle B, Jiang Y, Blyn L, Eshoo M, Sannes-Lowery KA et al. (2005). "Base composition analysis of human mitochondrial DNA using electrospray ionization mass spectrometry: A novel tool for the identification and differentiation of humans". Analytical Biochemistry 344 (1): 53–69. doi:10.1016/j.ab.2005.05.028. PMID 16054106.

[93] Howard R, Encheva V, Thomson J, Bache K, Chan YT, Cowen S et al. (15 Jun 2011). "Comparative analysis of human mitochondrial DNA from World War I bone samples by DNA sequencing and ESI-TOF mass spectrometry". Forensic science international. Genetics 7 (1): 1–9. doi:10.1016/j.fsigen.2011.05.009. PMID 21683667.

[94] Monforte JA, Becker CH (1 March 1997). "High-throughput DNA analysis by time-of-flight mass spectrometry". Nature Medicine 3 (3): 360–362. doi:10.1038/nm0397-360. PMID 9055869.

[95] Beres SB, Carroll RK, Shea PR, Sitkiewicz I, Martinez-Gutierrez JC, Low DE et al. (8 February 2010). "Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics". Proceedings of the National Academy of Sciences 107 (9): 4371–4376. Bibcode:2010PNAS..107.4371B. doi:10.1073/pnas.0911295107. PMC 2840111. PMID 20142485.

[96] Kan CW, Fredlake CP, Doherty EA, Barron AE (1 November 2004). "DNA sequencing and genotyping in miniaturized electrophoresis systems". Electrophoresis 25 (21–22): 3564–3588. doi:10.1002/elps.200406161. PMID 15565709.

[97] Chen YJ, Roller EE, Huang X (2010). "DNA sequencing by denaturation: experimental proof of concept with an integrated fluidic device". Lab on Chip 10 (9): 1153–1159. doi:10.1039/b921417h. PMC 2881221. PMID 20390134.

[98] Bell DC, Thomas WK, Murtagh KM, Dionne CA, Graham AC, Anderson JE et al. (9 Oct 2012). "DNA Base Identification by Electron Microscopy". Microscopy and microanalysis : the official journal of Microscopy Society of America, Microbeam Analysis Society, Microscopical Society of Canada 18 (5): 1–5. Bibcode:2012MiMic..18.1049B. doi:10.1017/S1431927612012615. PMID 23046798.

[99] Pareek CS, Smoczynski R, Tretyn A (November 2011). "Sequencing technologies and genome sequencing". Journal of applied genetics 52 (4): 413–35. doi:10.1007/s13353-011-0057-x. PMC 3189340. PMID 21698376.

[Pareek_CS-100] Pareek CS, Smoczynski R, Tretyn A (2011). "Sequencing technologies and genome sequencing". Journal of Applied Genetics 52 (4): 413–435. doi:10.1007/s13353-011-0057-x. PMC 3189340. PMID 21698376.

[101] Fujimori S, Hirai N, Ohashi H, Masuoka K, Nishikimi A, Fukui Y et al. (2012). "Next-generation sequencing coupled with a cell-free display technology for high-throughput production of reliable interactome data". Scientific reports 2: 691. Bibcode:2012NatSR...2E.691F. doi:10.1038/srep00691. PMC 3466446. PMID 23056904.

[102] "PRIZE Overview: Archon X PRIZE for Genomics"

[103] The Future of DNA Sequencing

[pmid24727769-104] Severin J, Lizio M, Harshbarger J, Kawaji H, Daub CO, Hayashizaki Y et al. (2014). "Interactive visualization and analysis of large-scale sequencing datasets using ZENBU". Nat. Biotechnol. 32 (3): 217–9. doi:10.1038/nbt.2840. PMID 24727769.

[105] Shmilovici A,Ben-Gal I (2007). "Using a VOM model for reconstructing potential coding regions in EST sequences". Computational Statistics 22 (1): 49–69. doi:10.1007/s00180-007-0021-8.

[106] Del Fabbro C, Scalabrin S, Morgante M, Giorgi FM (2013). "An Extensive Evaluation of Read Trimming Effects on Illumina NGS Data Analysis". PLoS ONE 8 (12): e85024. Bibcode:2013PLoSO...885024D. doi:10.1371/journal.pone.0085024. PMC 3871669. PMID 24376861.

Genetics
Part of a series on

Key components
Chromosome DNA RNA Genome Heredity Mutation Nucleotide Variation
Outline Index Glossary
History and topics
Introduction History Evolution (molecular) Population genetics Mendelian inheritance Quantitative genetics Molecular genetics
Research
DNA sequencing Genetic engineering Genomics ( template) Medical genetics
Branches of genetics
Personalized Medicine
Personalized Medicine
Biology portal Molecular and cellular biology portal
v t e

匿名

検索

案内

genomic sequencing