
Post on 18-Jan-2016


Synthetic Sequence Design for Signal Location

Yaw-Ling Lin (林耀鈴)

Dept. of Computer Science and Information Engineering

College of Computing and Informatics, Providence University, Taiwan

E-mail: yllin@pu.edu.tw

http://www.cs.pu.edu.tw/~yawlin

112/04/21 Synthetic Design for Signal Location 1

Outline

• Motivation

• Introduction

• Terminology Definition

• Signal location search

• Group testing designs

• Adjacent levels of the Hasse diagram

• Suggested Designations

• Conclusion

Synthetic Biology


What have we done with synthesis

Introduction

• Large-scale synthesis opens new doors for rapid signal detection:

– Replace a wild-type gene coding sequence (W) with a different but synonymous encoding (D).

– If the phenotype changes (e.g., the organism dies), it implies that there must be a critical signal at some location within that region.
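The W-to-D replacement described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`translate`, `recode_segment`), the toy sequence, and the rule "pick the alphabetically first alternative codon" are hypothetical and not taken from the talk; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq):
    """Translate a coding sequence (length divisible by 3) into amino acids."""
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, len(seq), 3))

def recode_segment(wild, start, end):
    """Return a design sequence D: codons with index in [start, end) are
    replaced by a different synonymous codon whenever one exists
    (hypothetical selection rule: the alphabetically first alternative)."""
    out = []
    for idx in range(len(wild) // 3):
        codon = wild[3 * idx:3 * idx + 3]
        if start <= idx < end:
            alternatives = sorted(
                c for c, aa in CODON_TO_AA.items()
                if aa == CODON_TO_AA[codon] and c != codon
            )
            if alternatives:
                codon = alternatives[0]
        out.append(codon)
    return "".join(out)

wild = "ATGGCTTTGAAATGGTAA"            # toy wild-type sequence W
design = recode_segment(wild, 1, 4)    # recode codons 1..3 only
print(design, translate(design) == translate(wild))
```

The protein is unchanged by construction, so a phenotype change after the swap would point to a nucleotide-level signal inside the recoded codons.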

Contribution

• A Group-Testing Approach for Biological Signal Location.

• Group Testing for Expensive Pools.

• Improved Designs for Consecutive Positive Group Testing.

• Middle-Levels Conjecture Equivalence.

• Web link: http://www.algorithm.cs.sunysb.edu/signalSearch.

Biological Signal Location

Design criteria for sequence signal search

• Experiences with polio and adenovirus.

• To construct t tests (design sequences) capable of pinpointing the location of a signal of length at most m (~20 to 60) bases as tightly as possible in a region of length g (~2k nt).

• We aim to partition the region into n segments and construct t tests to determine which segment contains the critical signal.

A Simple Design and Challenges

• In the previous design, n = 16 and t = 4.

• Multiple Signals? Region Boundaries?

• Experimental Robustness?
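One design consistent with n = 16 and t = 4 is a plain binary encoding: test j recodes exactly those segments whose index has bit j set. The figure for this slide did not survive, so treating the simple design as a binary encoding is an assumption, and the helper names below are hypothetical. The sketch also shows why region boundaries are a challenge: a signal straddling two adjacent segments yields the OR of two columns, which can decode to an unrelated segment.

```python
def binary_design(t):
    """t x 2**t 0/1 matrix; entry [j][s] is 1 if test j recodes segment s."""
    n = 2 ** t
    return [[(s >> j) & 1 for s in range(n)] for j in range(t)]

def outcomes(design, signal_segments):
    """A test is positive if it recodes any segment touched by the signal."""
    return [int(any(row[s] for s in signal_segments)) for row in design]

def decode(outcome):
    """Read the outcome vector as a binary number (bit j = result of test j)."""
    return sum(bit << j for j, bit in enumerate(outcome))

design = binary_design(4)                    # n = 16 segments, t = 4 tests
print(decode(outcomes(design, [11])))        # signal inside segment 11
print(decode(outcomes(design, [7, 8])))      # signal straddling segments 7 and 8
```

In this sketch the single-segment case decodes correctly, while the straddling case decodes to segment 15. A signal in segment 0 produces an all-negative outcome, which cannot be told apart from having no signal at all.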

2-consecutive positive matrix

A cyclic 2-consecutive positive detectable matrix: each column is a k-set (out of t elements), and the unions of every two cyclically adjacent k-sets are distinct (k+1)-sets.
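The property can be checked mechanically. The sketch below represents each column as the set of tests that include it; the function names and the small t = 5, k = 2 example are hypothetical and chosen only for illustration, not taken from the talk.

```python
def is_cyclic_2_consecutive(columns):
    """Check that all columns are k-sets and that the unions of cyclically
    adjacent columns are pairwise distinct (k+1)-sets."""
    cols = [frozenset(c) for c in columns]
    n, k = len(cols), len(cols[0])
    unions = [cols[i] | cols[(i + 1) % n] for i in range(n)]
    return (all(len(c) == k for c in cols)
            and all(len(u) == k + 1 for u in unions)
            and len(set(unions)) == n)

def decode(columns, positives):
    """Return every single column, or cyclically adjacent pair of columns,
    whose tests match the observed set of positive tests."""
    cols = [frozenset(c) for c in columns]
    n, positives = len(cols), frozenset(positives)
    hits = [(i,) for i in range(n) if cols[i] == positives]
    hits += [(i, (i + 1) % n) for i in range(n)
             if cols[i] | cols[(i + 1) % n] == positives]
    return hits

# Five columns over t = 5 tests, each a 2-set sharing one test with its neighbor.
example = [{0, 1}, {1, 2}, {2, 3}, {3, 4}, {4, 0}]
print(is_cyclic_2_consecutive(example))
print(decode(example, {2, 3}))       # one positive column
print(decode(example, {1, 2, 3}))    # two adjacent positive columns
```

Because single columns have k tests and adjacent pairs have k+1, the size of the positive set already tells the two cases apart. Unique decoding of a single positive additionally needs the columns themselves to be distinct, which `decode` exposes by returning more than one hit.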

Middle Level Conjecture

Middle Level Coverage

Adjacent Level Lemma

Cycles crossing adjacent levels

Shimada and Amano (2011), running time about 81 days:

Consecutive Positives Detectable Matrix

Main result: Non-adaptive Group Testing

Designing Consecutive Positives Detectable Matrix

Experiment Results

Consecutive Positives Detectable Matrix

Design Efficiency

• Our design:

• Colbourn’s design (1999):

• In particular, for r=3, d=3, Colbourn’s design creates a 10 x 16 matrix, while our design, M3(7,3), gives a 10 x 105 matrix.
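To see what the extra columns buy, the short calculation below converts column counts into segment lengths. It assumes each matrix column corresponds to one segment of the region and takes g = 2000 nt from the design criteria stated earlier in the talk; the function name is hypothetical.

```python
def segment_length(g, n):
    """Average segment length (nt) when a region of g nt is split into n segments."""
    return g / n

g = 2000  # region length in nt (assumed, from "~2k nt")
for name, n in [("Colbourn, 10 x 16", 16), ("M3(7,3), 10 x 105", 105)]:
    print(f"{name}: {n} segments, about {segment_length(g, n):.1f} nt each")
```

Under these assumptions, the same 10 tests narrow the signal down to a segment of roughly 19 nt instead of 125 nt.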

Conclusion

• We give a new class of consecutive positive group testing designs, which offer a better tradeoff of cost, resolution, and robustness than previous designs for signal search.

• Let n be the number of distinct regions, and d the number of consecutive positive regions. The design identifies the positive regions using t tests, where

• Given the target sequence, we propose one-round and two-round designs that maximize the number of inspected items n (and therefore minimize the boundary resolution).

• Future work: fault-detecting decoding algorithms.

Conclusion (Theory)

• Equivalence of middle level conjecture to the adjacent level conjecture.

• Improvement of the consecutive positive matrix design.

• Future and ongoing work:

o More than one consecutive positives.

o Efficient algorithms for false reads.

o TagSNP selection in the haplotype block.

o Further experiments on related biomedical haplotype data.

Thank You!

Any Questions?


What Weekday is Today?

• Magic Number:

- 4/4, 6/6, 8/8, 10/10, 12/12

- 7/11, 9/5 [also 11/7, 5/9]

- 3/0? [implying 2/28, 2/0 = 1/31]

• Extension:

- 365 = 52 * 7 + 1

- Leap Year?

• 2012:3; 2013:4; 2014:5; 2015:6; 2016:1

• 20yy: [5yy/4]+2 mod 7
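The rule on this slide (commonly known as the Doomsday rule) can be written out as a short Python sketch. It reads [5yy/4] as integer (floor) division and numbers weekdays 0 = Sunday through 6 = Saturday, which matches the listed values (2012:3 is a Wednesday). The table and function names are hypothetical, and the sketch covers years 2000 to 2099 only.

```python
import datetime

# Day of each month that falls on the year's "magic" weekday (non-leap year).
# 3/0 is the last day of February; 1/31 is congruent to 1/3 modulo 7.
MAGIC_DAY = {1: 3, 2: 28, 3: 0, 4: 4, 5: 9, 6: 6,
             7: 11, 8: 8, 9: 5, 10: 10, 11: 7, 12: 12}

def magic_weekday(yy):
    """Weekday (0 = Sunday) of the magic dates in year 20yy: [5yy/4] + 2 mod 7."""
    return (5 * yy // 4 + 2) % 7

def weekday(yy, month, day):
    """Weekday (0 = Sunday) of month/day in year 20yy."""
    anchor = MAGIC_DAY[month]
    if yy % 4 == 0 and month <= 2:
        anchor += 1  # leap year: the January and February magic days shift by one
    return (magic_weekday(yy) + day - anchor) % 7

print([magic_weekday(yy) for yy in range(12, 17)])  # compare with the slide's list
# Cross-check one date against the standard library (isoweekday: Monday = 1).
print(weekday(16, 1, 18), datetime.date(2016, 1, 18).isoweekday() % 7)
```

The leap-year adjustment reflects the "Leap Year?" bullet: only the January and February anchors move, because every later month is counted from the last day of February.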
