motif andpatterndatabase

15
Motif and Pattern Databases And some practical approaches Sucheta Tripathy 10/2/2016

Upload: sucheta-tripathy

Post on 26-Jan-2017

270 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Motif andpatterndatabase

Motif and Pattern Databases And some practical approaches

Sucheta Tripathy10/2/2016

Page 2: Motif andpatterndatabase

Motifs

• Defined as a nucleotide or amino acid sequence pattern that is widespread and is associated with a biological function.– A sequence motif = A structural Motif.– A sequence motif residing in the coding region

may encode a structural motif.– Non-coding nucleotide motifs may have regulatory

role. May have recognition sites for DNA binding proteins.

Page 3: Motif andpatterndatabase
Page 4: Motif andpatterndatabase

Motifs, profiles and patterns

• Conserved region of a DNA or protein – Motif• Qualitative expression of a motif – Pattern

– Regular Expression– C[TA]TTG{X}

• Quantitative expression of a motif – Profile – Position Specific Scoring Matrices (PSSMs)– Weight matrices

Page 5: Motif andpatterndatabase

Motifs/Patterns

N{P}[ST]{P}[FILV]Qxxx[RK]Gxxx[RK]xx[FILVWY][] -> or (Probability information is lost){} -> Not() -> repeated^ -> Beginning

Page 6: Motif andpatterndatabase

Profiles

• Quantitative representation.• More useful for training dataset.

TCTAGAAGATGGCAGTGGCGAAGATCTAGAAAATGACAGTGGCGAAGATCTAGAAAATGGCAGTAGCGAAGATCTACTA AATGA TAGTAGCGAAGA

A 0,0,0,100 ,0, 75,100, 75 ATGT 100,0,100,0,0, 25, 0, 0 ATGG 0, 0, 0, 0, 75 ,0, 0, 25 ATGC 0,100,0,0, 25 ,0, 0, 0 ATG

Page 7: Motif andpatterndatabase

De novo prediction of Motifs

• MEME; EXTREME; AlignAce, Amadeus, CisModule, FIRE, Gibbs Motif Sampler, PhyloGibbs, SeSiMCMC, ChIPMunk and Weeder. SCOPE, MotifVoter, and Mprofiler

MEME (Multiple Expectation Maximization for Motif Elicitation)

Page 8: Motif andpatterndatabase

Figure 3. Resources

MacIsaac KD, Fraenkel E (2006) Practical Strategies for Discovering Regulatory DNA Sequence Motifs. PLoS Comput Biol 2(4): e36. doi:10.1371/journal.pcbi.0020036http://journals.plos.org/ploscompbiol/article?id=info:doi/10.1371/journal.pcbi.0020036

Page 9: Motif andpatterndatabase
Page 10: Motif andpatterndatabase
Page 11: Motif andpatterndatabase

MRLSFVPLLQLSRLVVSTQHSTKMSTVYRTCKMNEIALSLLAPTQPLDADQGVMSPMASSDQTTSIGDFRFLRTHHDKEERGLLVTSLTKGLAETSFPYR YTSMCATICSITHSRADAAPAKQAH

Page 12: Motif andpatterndatabase
Page 13: Motif andpatterndatabase

Prosite

Page 14: Motif andpatterndatabase
Page 15: Motif andpatterndatabase

ATGCGTCTCTCCTTCGTTCCACTACTGCAGCTCTCTCGTCTGGTCGTTAGCACACAACATAGTACGAAAATGAGCACAGTATACCGTACCTGCAAAATGAATGAAATAGCTCTCTCGTTGCTGGCGCCAACGCAGCCATTGGACGCTGACCAGGGTGTTATGTCACCGATGGCCTCATCAGACCAGACAACCTCAATTGG TGACTTTCGGTTCCTGAGAACCCACCACGATAAAGAAGAGCGGGGCTTGCTGGTTACCAGCCTCACAAAAGGTTTGGCTGAAACATCATTTCCGTATCGATACACTTCGATGTGCGCAACTATTTGTTCAATTACGCATTCTCGGGCAGATGCTGCGCCTGCGAAGCAGGCGCACTA

Scan this sequence and get me the motif

OR Build a PSSM

ATGCGTCTCTCATGCCTCTGTCATGCGTCTCTCATGCGTCTCTCATGCGTCTATC