Sequencing based approaches of gene expression analysis generate millions of sequence tags, thus providing the dynamic range required to investigate genes of low abundance. Currently available digital gene expression analysis systems offer the potential for high-throughput transcriptomic measurements, however truly quantitative data are routinely not obtained. The most widely used RNA-seq protocol relies upon fragmentation of mRNA generating a library of uniformly distributed fragments of mRNA. This protocol requires large amounts of starting material (100ng of mRNA) limiting its application in many fields such as in developmental biology, where it is impractical to get such large amounts. Furthermore, this protocol maintains the relative order of transcript expression resulting in poor representation of low abundance transcripts at current sequencing depths. Multireads and biases introduced by transcript length and random hexamer primer hybridization further restrict reliable quantitation of low abundance transcripts for large mammalian transcriptomes. While random priming strategies amplify starting material (mRNA or cDNA) by exploiting hybridization and extension potential of hexamer/heptamer primers, they often result in low yield of good quality reads arising out of mis-hybridization of primers and primer dimerization. In a recent experiment, the inventors used a widely available sequencer to generate sequence tags via random priming strategy. Only 18% of the reads mapped uniquely to the transcriptome and low abundant transcripts were significantly under-represented because of poor dynamic range. Since many genes (signal transduction, transcription factors) are only expressed at relatively low levels, currently available strategies fall short in statistically quantifying these genes.
Researchers from UC San Diego have developed a patented cDNA expression profiling strategy, involving amplification of the majority (>80%) of mammalian transcriptome using a defined set of heptamer primers. The amplification protocol allows for efficient amplification of regions of interest from picograms of cDNA while minimizing mis-hybridization of primers and primer dimerization. The strategy reproducibly yields high levels of amplification necessary for sequencing-library generation from low starting material and offers a dynamic range of over five orders of magnitude in RNA concentrations. This invention reliably analyzes the mammalian transcriptome by employing ultra high throughput approaches and works on high or ultra high throughput sequencing platforms (e.g. Illumina's gene analyzer).
The inventor's system consists of software and novel primer sequences developed to hybridize to unique sequences identified in 74% of known genes. Care was taken to design this system to optimize amplification of target sequences that have low expression levels. The fragments produced are of sizes suitable for high-throughput sequencing and cover more than 90% of the transcriptome. This invention provides a platform for analyzing multiple samples at the same time thus reducing the operational cost.
Seeking commercialization partners for this patented technology.
Country | Type | Number | Dated | Case |
United States Of America | Issued Patent | 9,920,367 | 03/20/2018 | 2010-121 |