considerations for analyzing targeted ngs data exome tim hague, cto
TRANSCRIPT
![Page 1: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/1.jpg)
Considerations for Analyzing Targeted NGS Data
Exome
Tim Hague, CTO
Omixon WorkshopsConsiderations for Analyzing Targeted NGS Data - ExomeTim Hague, CEO
![Page 2: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/2.jpg)
![Page 3: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/3.jpg)
3 sets of full exome sequences for the same individual, targeted by 3 different kits
One set had data problems because reads were from 2 different sequencers
Remaining 2 sets were analyzed both by the customer and by Omixon
Exome Analysis
![Page 4: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/4.jpg)
Illumina TruSeq ~62 Mbp
Nimblegen SeqCap EZ Exome ~64 Mbp
~35 Mbp overlap between targets
Exons, ORFs and putative translated regions captured
40M and 37M read pairs resp., 101bp length
Exome Targets
![Page 5: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/5.jpg)
In this case we are comparing two full NGS analysis pipelines
Including the mapping/alignment and a multi-step variant call pipeline
The Omixon pipeline for this analysis uses two variant callers
The Omixon pipeline also uses recalibration and indel realignment
Full Analysis Pipelines
![Page 6: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/6.jpg)
Finding Long Indels 1.
![Page 7: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/7.jpg)
Better Indel Resolution 1.
![Page 8: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/8.jpg)
Better Indel Resolution 2.
![Page 9: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/9.jpg)
If indels are important to an analysis then this needs to be taken into account, from the planning stage onwards
BWA does better when indel realignment is used, in combination with paired data
Indel Handling
![Page 10: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/10.jpg)
Less Low Quality False Positives
![Page 11: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/11.jpg)
Some of these low quality variants can be removed by filtering, after variant call
Quality and coverage cut-offs have to be parameterized properly in the alignment and variant call
Quality recalibration can also help to reduce low quality false positives
Quality and Coverage
![Page 12: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/12.jpg)
Variations Next to Coding Areas
![Page 13: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/13.jpg)
Most of the exon kits also provide variant calls close to the coding regions
These should be included in the analysis if possible
Splicing and Promoters
![Page 14: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/14.jpg)
Less False Positives in Complex Regions 1.
![Page 15: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/15.jpg)
Less False Positives in Complex Regions 2.
![Page 16: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/16.jpg)
Less False Positives in Complex Regions 3.
![Page 17: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/17.jpg)
Less False Positives in Complex Regions 4.Higher Coverage
![Page 18: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/18.jpg)
Less False Positives in Complex Regions 5.Lower Coverage
![Page 19: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/19.jpg)
Mismappings due to pseudogenes or repeats – or just complex regions?
Sometime more coverage can actually be bad
Need to watch out for non-specific read mappings (reads mapping to multiple places)
Complex Regions
![Page 20: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/20.jpg)
Regions Where Both Aligners are Confused 1.
![Page 21: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/21.jpg)
Regions Where Both Aligners are Confused 2.
![Page 22: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/22.jpg)
Some regions are extremely difficult to map with any techniques
A different approach may be required to mapping/alignment
A different approach may be required to variant call (local de novo, phasing etc)
Very Complex Regions
![Page 23: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/23.jpg)
Problems with Sex Chromosomes
There are may heterozygous calls in the X and Y chromosomes that are certainly false positives or incorrect calls.
This is true for both pipelines, the read specificity and variant call procedure has to be improved for these chromosomes.
![Page 24: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/24.jpg)
These kinds of comparative studies can be useful in analyzing the effectiveness of exome sequencing
Different exome kits can give different results
The data analysis and variant call tools chosen for the analysis can also have a big impact
There is some potential to improve the quality of the customer's exome analysis pipeline
Summary
![Page 26: Considerations for Analyzing Targeted NGS Data Exome Tim Hague, CTO](https://reader038.vdocuments.mx/reader038/viewer/2022110208/56649da65503460f94a91dbc/html5/thumbnails/26.jpg)
Download our Omixon Target™ Evaluation Version
Today
OMIXON.COM