questions?. novel ncrnas are abundant: ex: mirnas mirnas were the second major story in 2001 (after...
TRANSCRIPT
![Page 1: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/1.jpg)
Questions?
![Page 2: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/2.jpg)
Novel ncRNAs are abundant: Ex: miRNAs
• miRNAs were the second major story in 2001 (after the genome).
• Subsequently, many other non-coding genes have been found
![Page 3: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/3.jpg)
![Page 4: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/4.jpg)
Scientific American, 2006
![Page 5: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/5.jpg)
Bacterial Riboswitches
• -Breaker Lab
![Page 6: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/6.jpg)
ncRNA gene finding
• The RNA world hypothesis:– RNA are as important as protein coding
genes.– Many undiscovered ncRNA exist
• Computational methods for discovering ncRNA are not mature.
• What are the clues to non-coding genes?– Structure: Given a sequence, what is the structure
into which it can fold with minimum energy?
![Page 7: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/7.jpg)
RNA structure: Basics
• Key: RNA is single-stranded. Think of a string over 4 letters, AC,G, and U.
• The complementary bases form pairs.• A <-> U, C <-> G, G <-> U
• Base-pairing defines a secondary structure. The base-pairing is usually non-crossing.
![Page 8: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/8.jpg)
RNA structure: Basics
• Key: RNA is single-stranded. Think of a string over 4 letters, AC,G, and U.
• The complementary bases form pairs.• Base-pairing defines a secondary structure.
The base-pairing is usually non-crossing.
![Page 9: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/9.jpg)
RNA structure: pseudoknots
• Sometimes, unpaired bases in loops form ‘crossing pairs’. These are pseudoknots.
![Page 10: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/10.jpg)
De novo RNA structure prediction
• Any set of non-crossing base-pairs defines a secondary structure.
• Abstract Question:– Given an RNA string find a structure that
maximizes the number of non-crossing base-pairs
– Incorporate the true energetics of folding– Incorporate Pseudo-knots
![Page 11: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/11.jpg)
ncRNA discovery
• Q: Given genomic DNA, discover all regions likely to be ncRNA
• ncRNA (unlike other DNA) should have secondary structure
• Possible Approach: – Find all substrings that fold into a low
energy structure.
![Page 12: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/12.jpg)
Unfortunately…
– Random DNA (with high GC content) often folds into low-energy structures.
– What other signals determine non-coding genes?
![Page 13: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/13.jpg)
Discovering ncRNA: Approach 2
• Consider each ncRNA family separately. Compute features that are distinct from other sequences.
• Many families are of particular interest– miRNA– Riboswitches
![Page 14: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/14.jpg)
ncRNA: miRNA
• ncRNA ~22 nt in length• Pairs to sites within the 3’ UTR,
specifying translational repression.• Similar to siRNA (involved in RNAi)• Unlike siRNA, miRNA do not need
perfect base complementarity• Until recently, no computational
techniques to predict miRNA• Most predictions based on cloning
small RNAs from size fractionated samples
-Burge Lab
![Page 15: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/15.jpg)
Comparative approach to discovering ncRNA
• Given a pair of conserved sequences, are they conserved because they encode ncRNA?
• Q: How would you compute such conserved pairs in the first place?
![Page 16: Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes](https://reader036.vdocuments.mx/reader036/viewer/2022062803/56649f385503460f94c55223/html5/thumbnails/16.jpg)
Comparative Approach to discovering ncRNA
• Given a query ncRNA (sequence & structure), compute all homologs that are similar in sequence and structure.
• How can you do it efficiently?
query
db sequence
• We will answer these questions