second generation sequencing - australian oilseeds

9
1 Second Generation Sequencing ? ? $5.8k $6k $84k >$2500k Cost per GB ? 2 GB 3 GB 3 GB 100 MB 0.1 MB Throughput per run ? 85 M 85 M 60 M 400 K 96 Reads per run ? 28 25-35 35-50 250-400 800-1100 Read length (bp) 2009 2008 2007 2006 2007 2000 Launched Pacific Bioscience Helicos HeliScope AB SOLiD Illumina Solexa Roche GSFLX ABI 3730 Sequencing Machine

Upload: others

Post on 11-Feb-2022

6 views

Category:

Documents


0 download

TRANSCRIPT

1

Second Generation Sequencing

??$5.8k$6k$84k>$2500k Cost per GB

?2 GB3 GB3 GB100 MB0.1 MBThroughput per run

?85 M85 M60 M400 K96Reads per run

?2825-3535-50250-400800-1100Read length (bp)

200920082007200620072000Launched

Pacific BioscienceHelicos HeliScopeAB SOLiDIllumina SolexaRoche GSFLX

ABI 3730Sequencing Machine

2

Size comparisons

bp distance

Solexa read 35 1.9 m me

Sanger read 900 50 m swimming pool

BAC 120,000 6.7 Km

rice genome 400,000,000 22,222 Km distance to UK

canola genome 1,200,000,000 67, 000 Km

3

Brassica genome sequencing

A1 UK/ChinaA2 CanadaA3 KoreaA4 UK/ChinaA5 JapanA6 India? France?A7 Australia?A8 UK/ChinaA9 KoreaA10 Canada

4

http://www.brassica.info/resource/sequencing/status.php

5

New Chinese Initiative

Hanzhong Wang, Shengyi Liu Wei Hua

Jian Wu, Sanwen Huang, Rifei Sun, Xixiang Li

Peixiang Ni, Ruiqiang Li Hongkun Zheng

Korea:UK:

Canada:Australia:

The Netherlands:

Beom-Seok Park, Yong Pyo LimIan BancroftIsobel ParkinDavid EdwardsGuusje Bonnema, Mark Aarts

Oil Crops Research Institute (OCRI) CAAS

Institute of Vegetables and Flowers (IVF)

6

New Chinese Initiative

~50x coverage Illumina short reads

Contigs without N

• Longest = 45,025 bp

• N50 = 4,342 bp

Scaffolds containing N

• Longest = 293,912 bp

• N50 = 30,723 bp

7

Change of plan for completion

• Sequence BACs to phase 1, use paired read data to finish to phase 3

• Focus on assigned chromosomes but also sequence and map BACs from chromosomes 7 (and 6?)

8

New data, new capabilities

• Shotgun sequence B. oleracea and B. napus?

• Re-sequence different varieties to identify SNPs?

• Identify all gene variants in the genome and compare between genomes

9

GRDC related projects

ARC Linkage: Bayer CropScience

Developing technology for the cost effective de novo sequencing and analysis of complex genomes.

ARC Linkage: Bayer CropScience and DPI NWS

Characterising genetic variation in Brassica napus.

ARC Linkage: Biomatters Ltd.

Next generation metagenomics.

ARC Discovery: Jacqueline Batley and Regine Delourme

Co-evolution of the host pathogen interaction between Leptosphaeria maculans and Brassica species.