evolutionary analyses support that asp (anti sense protein

28
Evolutionary analyses support that ASP (Anti Sense Protein) overlapping ORF could be the 10th gene of HIV-1 M pandemic group JOBIM 2015 Elodie Cassan 1,2,3 Anne-Muriel Arigon Chifolleau 1,2 Antoine Gross 3 Olivier Gascuel 1,2 1 LIRMM, UMR5506 CNRS, Universit´ e de Montpellier, France 2 Computational Biology Institute, Montpellier, France 3 CPBS, FRE3689 CNRS, Montpellier, France

Upload: others

Post on 16-Oct-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Evolutionary analyses support that ASP (Anti Sense Protein

Evolutionary analyses support that ASP (AntiSense Protein) overlapping ORF could be the 10th

gene of HIV-1 M pandemic groupJOBIM 2015

Elodie Cassan1,2,3 Anne-Muriel Arigon Chifolleau1,2

Antoine Gross3 Olivier Gascuel1,2

1LIRMM, UMR5506 CNRS, Universite de Montpellier, France

2Computational Biology Institute, Montpellier, France

3CPBS, FRE3689 CNRS, Montpellier, France

Page 2: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

HIV-1

2 types of Human Immunodeficiency Virus (HIV) : HIV-1 and HIV-2

More than 35 million people infected

Fast evolution + attack of immune cells=> no vaccine today

Origin : inter-speciestransmissions of simian viruses(SIV)

HIV-1 : Group M, N, O et P

Group M : pandemic group

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 1 / 17

Page 3: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

HIV-1 genome

9 genes (+ ASP gene ? ?)

Env gene region : presence of a long overlapping andanti-sens Open Reading Frame (ORF)

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 2 / 17

Page 4: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Overlapping genes and evolutionary constraints

Strategy to reduce the size of the viral genome

Strong evolutionary constraints especially on frame -2

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 3 / 17

Page 5: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Overlapping genes and evolutionary constraints

Strategy to reduce the size of the viral genome

Strong evolutionary constraints especially on frame -2

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 3 / 17

Page 6: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Overlapping gene, and constraints

Strategy to reduce the size of the viral genome

Strong evolutionary constraints especially on frame -2

Frame +2/ + 3 −1 −2 −3

# of amino acids 4 2.5 < 2 10

Table : Average number of amino acids choice due to local constraints(considering +1 frame as fixed)

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 3 / 17

Page 7: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Presence of ASP ORF ?

Group M

Presence of ASP ORF

Subtype A : ASP ORF is shorter

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 4 / 17

Page 8: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Presence of ASP ORF ?

Group M

Presence of ASP ORF

Subtype A : ASP ORF is shorter

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 4 / 17

Page 9: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Presence of ASP ORF ?

Group M

Presence of ASP ORF

Subtype A : ASP ORF is shorter

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 4 / 17

Page 10: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Presence of ASP ORF ?

Group out-of-M :

No ASP ORF

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 4 / 17

Page 11: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Emergence of ASP ORF ?

Phylogenetic tree

Group M : Los Alamos reference sequencesGroups out-of-M : selection of 10 sequences in group O + allothers sequences

Repartition of Start (red triangle) and Stop codons (blackcross) in the ASP region.

For each group or subtype : the frequency of Start codon andthe average length of ASP ORF (obtained from our wholedata by weighting the sequences so that each patient has totalweight 1).

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 5 / 17

Page 12: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 6 / 17

Page 13: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 6 / 17

Page 14: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 6 / 17

Page 15: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 6 / 17

Page 16: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Measuring the selective pressure

Selection pressure ?

Most constrained frame

Unique solution for 55% of ASPsites (HXB2)

Mir & Shober 2014, selectionpressure in frame +1 and -2 are∼ undistinguishable.

Detection of selection pressureinduced by ASP protein isdifficult with conventionalmethods.

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 7 / 17

Page 17: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Start codon may appear/disappear

Ile or Met on env => Ile or Met on ASP

xyC<=>xyT synonymous for every xy

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 8 / 17

Page 18: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Start codon may appear/disappear

Ile or Met on env => Ile or Met on ASP

xyC<=>xyT synonymous for every xy

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 9 / 17

Page 19: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Selection pressure to conserve the Start codon in group M

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 10 / 17

Page 20: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

What about Stop codons ?

Some Stop codons are potential and can synonymouslyappear/disappear

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 11 / 17

Page 21: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Potential Stop Codons

Sites with appearance of Stop codons without modificationof env. Blue = potential Stops, Red = existing Stops

Presence of Stop codons in out-of-M groups

Statistical test

Figure : Potential Stops (blue) and existing Stops (red) on ASP

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 12 / 17

Page 22: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Potential Stop Codons

Sites with appearance of Stop codons without modificationof env. Blue = potential Stops, Red = existing Stops

Presence of Stop codons in out-of-M groups

Statistical test

Figure : Potential Stops (blue) and existing Stops (red) on ASP

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 12 / 17

Page 23: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Potential Stop Codons

Sites with appearance of Stop codons without modificationof env. Blue = potential Stops, Red = existing Stops

Presence of Stop codons in out-of-M groups

Statistical test

Figure : Potential Stops (blue) and existing Stops (red) on ASP

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 12 / 17

Page 24: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Principle

Pairwise sequences analysis (Inspired by the method of Firth,Nucleic Acids Research, 2014)

Count synonymous mutations − > Stop codon

Comparison with theoretical count based on an evolutionarymodel

Z test

Z =Theo − Obs

2 ∗ variance

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 13 / 17

Page 25: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Group M results

ASP region :

Z = 7.12, p-value = 10−12 (with RRE)√

Z = 3.6, p-value = 2.10−4 (without RRE)√

env-ASP region :

Z = -1.85, p-value = 0.063 ×

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 14 / 17

Page 26: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Group M results

ASP region :

Z = 7.12, p-value = 10−12 (with RRE)√

Z = 3.6, p-value = 2.10−4 (without RRE)√

env-ASP region :

Z = -1.85, p-value = 0.063 ×

Group out-of-M results

ASP region :

Z = 1.46, p-value=0.14 (with RRE) ×Z=-0.1, p-value=0.92 (without RRE) ×

env-ASP region :

Z = -0.43, p-value=0.66 ×

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 15 / 17

Page 27: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Summary

Existence of the ASP ORF in pandemic subtypes (C, B ...) ofHuman Immunodeficiency Virus 1 M.

Absence of the ASP ORF in other non-pandemic human andsimian groups.

Phylogenetic analyses indicate that ASP ORF appearedrecently.

ASP protein is subjected to significant selection pressure.

ASP ORF could be the 10th gene of HIV-1 M andseems to be correlated with human pandemy.

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 16 / 17

Page 28: Evolutionary analyses support that ASP (Anti Sense Protein

Introduction Presence of ASP ORF Emergence of ASP ORF Measuring the selective pressure Summary

Merci de votre attention

Elodie CASSAN ASP, the 10th gene of HIV-1 M pandemic group ? 17 / 17