use of causative variants and snp weighting in a single

12
Use of causative variants and SNP weighting in a single-step GBLUP context Fragomeni BO 1 , Lourenco DAL 1 , Legarra A 2 , Tooker ME 3 , VanRaden PM 3 , Misztal I 1 1 University of Georgia, Athens, USA 2 INRA, Castanet-Tolosan, France 3 AGIL ARS-USDA, Beltsville, USA

Upload: others

Post on 06-Jan-2022

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Use of causative variants and SNP weighting in a single

Use of causative variants and SNP weighting in a single-step

GBLUP contextFragomeni BO1, Lourenco DAL1, Legarra A2, Tooker ME 3, VanRaden PM3,

Misztal I1

1University of Georgia, Athens, USA

2INRA, Castanet-Tolosan, France

3AGIL ARS-USDA, Beltsville, USA

Page 2: Use of causative variants and SNP weighting in a single

Motivation

• Decreasing costs of whole genome sequence

• Revived interest in causative variants for prediction

• Several authors are finding and using causative variants• No improvement :

• Binsbergen et al., 2015 and Erbe et al., 2016

• Up to 5% improvement:• Brondum et al. 2015 and Vanraden et al., 2017

Page 3: Use of causative variants and SNP weighting in a single

Motivation

•ssGBLUP was able to reach accuracies close to 1 with simulated causative variants•When priori used for weights were the simulated

QTN effects

•GWA estimated weights had limited impact•GWA Methodology – no limitation in minimum and

maximum weights

Page 4: Use of causative variants and SNP weighting in a single

Objective

•Test different SNP weighting methods in GBLUP and ssGBLUP in field data which includes causative variants.

Page 5: Use of causative variants and SNP weighting in a single

Field Data

• 4M Records for Stature

• 3M Cows

• 4.6M Animals in pedigree

• h2=0.44

• 27k Genotyped Sires• 54k SNP

• 54k SNP + 17k Causative Variants (VanRaden et al., 2017)

Page 6: Use of causative variants and SNP weighting in a single

Analysis

•GBLUP• Multi-step approach• Daughter deviation as

phenotypes• Genomic Relationship

Matrix• Homogeneous or

heterogeneous residual variance

• ssGBLUP• Same model as national

evaluation for type traits• No deregressions• Matrix combining

pedigree and genomic information (H)

Page 7: Use of causative variants and SNP weighting in a single

Weighted Genomic relationship matrix

𝐆 = 𝐙𝑫𝐙′𝜎𝑠

2

𝜎𝑎2 =

𝐙𝑫𝐙′

𝑖 2𝑝𝑖 𝑞𝑖

•Default• 𝑑𝑖 = 1

• Linear weights• 𝑑𝑖 = 𝑢𝑖

2

•Non-linear A weights

• 𝑑𝑖 = 1.125 𝑢𝑖

𝑠𝑑 𝑢−2

• Value capped at 10

• Fast-Bayes A

• 𝑑𝑖 =𝑆𝑁𝑃𝑒𝑓𝑓𝑖

2 +𝑑𝑓∗𝑆2

𝑑𝑓+1

Page 8: Use of causative variants and SNP weighting in a single

Weighted

0

1

2

3

4

5

6

7

8

9

10

WEIGHTS

Non Linear A Linear

Page 9: Use of causative variants and SNP weighting in a single

GBLUP – 54K SNP

HOMOGENEOUS RESIDUAL VARIANCE HETEROGENEOUS RESIDUAL VARIANCE

54.954.5

56.6

52

53

54

55

56

57

58

59

No Weights Linear NonLinear

58.7

56.3

58.6

53

54

55

56

57

58

59

No Weights Linear NonLinear

Page 10: Use of causative variants and SNP weighting in a single

Weighted and unweighted reliabilities

Unweighted GRM Weighted GRM

58.959.5

60.8

57

58

59

60

61

62

GBLUP_HET ssGBLUP-IND ssGBLUP

58.7

59.5

60.9

57

58

59

60

61

62

GBLUP_HET ssGBLUP-IND ssGBLUP

Page 11: Use of causative variants and SNP weighting in a single

Including causative variants

54.9

56.4

58.2 58.4

60.9

55.556.3

58.7 58.9

60.8

59.5

50

52

54

56

58

60

62

GBLUP -Homogeneous

GBLUP - Linear GBLUP - NoWeights

GBLUP - NonLinear ssGBLUP

54k 70k ssGBLUP - DGV

HETEROGENEOUS RESIDUALS

Page 12: Use of causative variants and SNP weighting in a single

Conclusion

• Gains with inclusion of causative variants is limited when trait is polygenic

• Gains with causative variants has more impact in GBLUP than in ssGBLUP• More data is used in single-step methodology, therefore impact of prior is less

important

• Non-linear methodology is better for weighting marker effects than linear weights