table 1 soybean sample composition

1
Table 1 Soybean sample composition Detection of Roundup Ready™ Soybeans by Near-Infrared Spectroscopy S.A. Roussel*, C.L. Hardy**, C.R. Hurburgh, Jr.*, G.R. Rippke* Iowa State University,. Ames, IA * Grain Quality Laboratory ** Center for Crops Utilization Research Department of Agricultural & Biosystems Engineering The controversy over genetically modified (GMO) grains has created the need for a rapid screening method for grain handlers. Maintaining purity of an otherwise indistinguishable product is not an operation familiar to commodity markets. Near-infrared analyzers would be well suited to screening inbound deliveries, if there were sufficient spectral differences between GMO and conventional varieties. While NIR units cannot detect DNA, spectral differences may be caused by larger structural changes accompanying the modification. Roundup Ready™ soybean (RR) was the first genetically modified grain to be widely adopted by US farmers (0% of soybean acres grown in 1996 to more than 50% in 1999). International Diffuse Reflectance Conference (IDRC-2000), August 13-18 th , 2000, Chambersburg, PA. Figure 2 shows the region of greatest spectral response to be 900–950nm. A preliminary study demonstrated a spectral difference between RR and conventional soybeans (Figure 1). Samples of known varieties from Iowa county Additional samples (Table 1) were scanned in 1999 on near infrared spectrometers (Figure The objective of this study is to develop the protocol for distinguishing Roundup Ready™ from conventional soybeans using near infrared spectrometry. RR N on-R R Sam ple set (n RR /n non-R R ) Factor Avg. Std.D ev. Avg. Std.D ev. M oisture 11.5 0.63 10.9 0.81 Protein 37.5 0.93 36.5 0.84 Sam ple SetI 1998 harvestsam ples (53/61) Oil 18.9 0.48 19.5 0.69 M oisture 11.2 0.63 10.7 1.71 Protein 36.0 1.50 35.8 1.61 Sam ple SetII 1999 harvest,breeder, strip plotsam ples (308/341) Oil 18.5 0.82 18.3 1.00 M oisture 8.73 0.96 9.2 0.93 Protein 35.7 1.29 35.5 1.45 Sam ple SetIII 1999 Iow a Soybean Yield Testsam ples (3882/3482) Oil 18.2 0.84 18.3 0.80 Basis 13% moisture, predicted by the Infratec, calibration SB00994 After the preliminary study based on Sample Set I, the general progression of evaluation was: 1. New PLS calibrations and ANN models (Figure 5) were created using Sample Sets I & II (Table I). The constituent values as well as the spectra were used as input variables. 2. PLS, ANN (Figure 5), and LWR (Figure 6) calibrations were generated using a very large database (Sample Set III in Table I). 3.2 3.3 3.4 3.5 3.6 3.7 850 870 890 910 930 950 970 990 1010 1030 W avelength (nm ) ln (1/t) R oundup R eady Soybeans (51 sam ples) Non-R oundup R eady Soybeans (63 sam ples) Near-infrared transm ittance spectra,Infratec 1229; Roundup Ready vs.Non-Roundup Ready soybeans strip plots in 1998 comprised a calibration set of 53 RR soybeans and 61 non-RR soybeans (Table 1). On an independent set of 39 samples, this calibration classified RR soybeans with 85% accuracy and non-RR soybeans with 95% accuracy. However, it gave less than 70% accuracy on the following year’s grain (1999). -300 -200 -100 0 100 200 300 850 870 890 910 930 950 970 990 1010 1030 W avelength (nm ) Regression coefficientvalue W aterband, 960 nm Active R egion R egression coefficients for1998 R oundup R eady soybean calibrations C H -, C H 2 -, C H 3 -, R -O H activity Figure 2 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 850 870 890 910 930 950 970 990 1010 1030 W avelength (nm ) A verage difference,R R -non-R R ,ln (1/T)units O riginal calibration set(n=114) O riginal validation set(n=39) Early harvest1999 (n=43) W etbreedersam ples (n=96) D ry breedersam ples (n=270) 1999 strip plots (n=107) D ifferences in N IR transm ission spectra (R oundup R eady -non-R oundup R eady) in 6 soybean sam ple sets,1998-1999 Figure 4 Figure 4 shows the Infratec spectral differences between RR and non-RR samples. The direction of differences (RR higher than non-RR) remained but the magnitude of the difference was not consistent. Figure 4 indicates that a linear PLS model might not describe the differences accurately. Figure 3 Foss/ Tecator Infratec 1229 Grain Analyzer To prevent confounding, the data was balanced in other constituents (Moisture, Protein, Oil) known to affect the spectra (Table 1). The classification accuracy of various models was compared: Partial Least Squares (PLS) (Unscrambler TM v. 7.5) Locally Weighted Regression (LWR) (Matlab TM v. 5.3) Artificial Neural Networks (ANN) (Matlab TM v. 5.3) Materials Methods 850 875 900 925 950 975 1000 1025 1050 3.4 3.45 3.5 3.55 3.6 3.65 3.7 3.75 3.8 3.85 3.9 Soybean N IR spectrum W avelengths (nm ) ln(1/T) Roundup Ready soybeans Non Roundup Ready soybeans Moisture Protein Oil Fiber Principal Components Figure 5 : Artificial Neural Networks The best ANN model (Figure 5) used 25 principal components generated from the 100- wavelength spectra and 4 constituent 5 to 8 hidden nodes, and 2 outputs values (moisture, protein, oil, and fiber) as inputs, encoding for RR and non- RR classes. C alibration set Predicted testset % correct classification M odel and param eters Year (RR/non-RR) Year (RR/non-RR) RR N on-R R P LS 16LVs 66 72 PLS+ a 17LVs Sam ple SetI& II 308 /341 Sam ple SetII (cv) 308 /341 71 76 P LS 12LVs 62 53 PLS+ a 10LVs 59 69 ANN (25+4:8:2) b Sam ple SetI& II C alibration 272 /275 S am ple SetI& II TestSet 37 /55 81 87 P LS a 16LVs S am ple SetIII c 4302 /3880 Sam ple S etIII (cv) 4302 /3880 89 87 P LS 16LVs 77 79 PLS+ a 15LVs 77 79 LW R 17LVs,850ng 93 92 ANN (25+4:5:2) Sam ple SetIII 3882 /3482 TestSetIII 420 /398 91 89 Table 2 Roundup Readysoybean classification results (cv) -Classification errors computed using cross-validation LVs - Latent variables ng - neighbors a Includes the constituents (moisture, protein, oil and fiber) as independent variables b (25+4:8:2): Input nodes: 25 principal components + 4 constituents; hidden nodes: 8; output nodes: 2 c Sample Set III and Test Set III were combined for cross-validation. Results 300 500 700 900 1100 1300 1500 13 14 15 16 17 18 19 7 7.5 8 8.5 9 9.5 10 10.5 11 N um berofneighbors LW R prediction ofthe 818-sam ple testset Numberoflatentvariables P ercentage ofclassification error 8.3% 7.6% Figure 6 Figure 6 shows the various parameters tested with LWR: 300 - 1500 neighbors 13 - 19 latent variables α = 0 - 1(proportion of x- and y- distances) Spectra + constituents or spectra only The lowest classification error (7.6%) was obtained with 850 neighbors and 17 latent variables and α=0 (Figure 6). The discrimination between RR and conventional soybeans has been achieved using non linear (ANN) and local (LWR) models a on large NIR spectral database. The optical standardization of spectra issued from different NIR spectrometers may further improve the GMO classification. The difference in the organic composition between RR and non-RR soybeans detected by NIR measurements must Among the 3 models tested, Locally Weighted Regression achieved the best classification performance (92.4% accuracy). Artificial Neural Networks classified the RR soybeans nearly as accurately as did LWR. Partial Least Squares produced unsatisfactory classification accuracy. Figure 1 Conclusions http://www.iowagrain.org http://www.ag.iastate.edu/centers/ccur/

Upload: jorryn

Post on 19-Jan-2016

47 views

Category:

Documents


0 download

DESCRIPTION

Moisture. Protein. Oil. Fiber. Roundup Ready soybeans. Principal Components. Non Roundup Ready soybeans. Detection of Roundup Ready ™ Soybeans by Near-Infrared Spectroscopy S.A. Roussel*, C.L. Hardy**, C.R. Hurburgh, Jr.*, G.R. Rippke* Iowa State University, . Ames, IA - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Table 1 Soybean sample composition

Table 1 Soybean sample composition

Detection of Roundup Ready™ Soybeans by Near-Infrared Spectroscopy

S.A. Roussel*, C.L. Hardy**, C.R. Hurburgh, Jr.*, G.R. Rippke* Iowa State University,. Ames, IA

* Grain Quality Laboratory ** Center for Crops Utilization ResearchDepartment of Agricultural & Biosystems Engineering

The controversy over genetically modified (GMO) grains has created the need for a rapid screening method for grain handlers. Maintaining purity of an otherwise indistinguishable product is not an operation familiar to commodity markets.

Near-infrared analyzers would be well suited to screening inbound deliveries, if there were sufficient spectral differences between GMO and conventional varieties. While NIR units cannot detect DNA, spectral differences may be caused by larger structural changes accompanying the modification.

Roundup Ready™ soybean (RR) was the first genetically modified grain to be widely adopted by US farmers (0% of soybean acres grown in 1996 to more than 50% in 1999).

10th International Diffuse Reflectance Conference (IDRC-2000), August 13-18th, 2000, Chambersburg, PA.

Figure 2 shows the region of greatest spectral response to be 900–950nm.

A preliminary study demonstrated a spectral difference between RR and conventional soybeans (Figure 1). Samples of known varieties from Iowa county

Additional samples (Table 1) were scanned in 1999 on near infrared spectrometers (Figure 3).

The objective of this study is to develop the protocol for distinguishing Roundup Ready™ from conventional soybeans using near infrared spectrometry.

RR Non-RRSample set(nRR/nnon-RR)

FactorAvg. Std. Dev. Avg. Std. Dev.

Moisture 11.5 0.63 10.9 0.81

Protein 37.5 0.93 36.5 0.84Sample Set I

1998 harvest samples(53/61) Oil 18.9 0.48 19.5 0.69

Moisture 11.2 0.63 10.7 1.71

Protein 36.0 1.50 35.8 1.61Sample Set II

1999 harvest, breeder, strip plot samples (308/341) Oil 18.5 0.82 18.3 1.00

Moisture 8.73 0.96 9.2 0.93

Protein 35.7 1.29 35.5 1.45Sample Set III

1999 Iowa SoybeanYield Test samples (3882/3482) Oil 18.2 0.84 18.3 0.80

Basis 13% moisture, predicted by the Infratec, calibration SB00994

After the preliminary study based on Sample Set I, the general progression of evaluation was:

1. New PLS calibrations and ANN models (Figure 5) were created using Sample Sets I & II (Table I). The constituent values as well as the spectra were used as input variables.2. PLS, ANN (Figure 5), and LWR (Figure 6) calibrations were generated using a very large database (Sample Set III in Table I).

3.2

3.3

3.4

3.5

3.6

3.7

850 870 890 910 930 950 970 990 1010 1030Wavelength (nm)

ln (1

/t)

Roundup Ready Soybeans(51 samples)

Non-Roundup Ready Soybeans

(63 samples)

Near-infrared transmittance spectra, Infratec 1229; Roundup Ready vs. Non-Roundup Ready soybeans

strip plots in 1998 comprised a calibration set of 53 RR soybeans and 61 non-RR

soybeans (Table 1). On an independent set of 39 samples, this calibration classified

RR soybeans with 85% accuracy and non-RR soybeans with 95% accuracy.

However, it gave less than 70% accuracy on the following year’s grain (1999).

-300

-200

-100

0

100

200

300

850 870 890 910 930 950 970 990 1010 1030

Wavelength (nm)

Reg

ress

ion

coef

ficie

nt v

alue Water band, 960 nm

Active Region

Regression coefficients for 1998 Roundup Ready soybean calibrations

CH-, CH2-, CH3-, R-OH activity

Figure 2

0.00

0.01

0.02

0.03

0.04

0.05

0.06

0.07

850 870 890 910 930 950 970 990 1010 1030

Wavelength (nm)

Ave

rag

e d

iffe

ren

ce, R

R -

no

n-R

R, l

n (

1/T

) u

nits

Original calibration set (n=114)

Original validation set (n=39)

Early harvest 1999 (n=43)

Wet breeder samples (n=96)

Dry breeder samples (n=270)

1999 strip plots (n=107)

Differences in NIR transmission spectra (Roundup Ready - non-Roundup Ready)

in 6 soybean sample sets, 1998-1999

Figure 4

Figure 4 shows the Infratec spectral differences between RR and non-RR samples. The direction of differences (RR higher than non-RR) remained but the magnitude of the difference was not consistent. Figure 4 indicates that a linear PLS model might not describe the differences accurately.

Figure 3Foss/Tecator Infratec 1229 Grain Analyzer

To prevent confounding, the data was balanced in other constituents (Moisture, Protein, Oil) known to affect the spectra (Table 1).

The classification accuracy of various models was compared: Partial Least Squares (PLS) (UnscramblerTM v. 7.5) Locally Weighted Regression (LWR) (MatlabTM v. 5.3) Artificial Neural Networks (ANN) (MatlabTM v. 5.3)

Materials

Methods

850 875 900 925 950 975 1000 1025 10503.4

3.45

3.5

3.55

3.6

3.65

3.7

3.75

3.8

3.85

3.9

Soybean NIR spectrum

Wavelengths (nm)

ln(1

/T)

Roundup Readysoybeans

Non Roundup Ready soybeans

Moisture

Protein

Oil

Fiber

Pri

ncip

al

Co

mp

one

nts

Figure 5: ArtificialNeural Networks

The best ANN model (Figure 5) used 25 principal components generated from the 100-wavelength spectra and 4 constituent

5 to 8 hidden nodes, and 2 outputs

values (moisture, protein, oil, and fiber) as inputs,

encoding for RR and non-RR classes.

Calibration set Predicted test set% correct

classificationModel and parameters Year (RR/non-RR) Year (RR/non-RR) RR Non-RRPLS 16LVs 66 72PLS+

a17LVs

Sample Set I & II 308 / 341 Sample Set II (cv)

308 / 34171 76

PLS 12LVs 62 53PLS+

a10LVs 59 69

ANN (25+4:8:2)b

Sample Set I & IICalibration

272 / 275Sample Set I & II

Test Set37 / 55

81 87PLS

a16LVs Sample Set III

c4302 / 3880 Sample Set III

(cv)4302 / 3880 89 87

PLS 16LVs 77 79PLS+

a15LVs 77 79

LWR 17LVs, 850ng 93 92ANN (25+4:5:2)

Sample Set III 3882 / 3482 Test Set III 420 / 398

91 89

Table 2 Roundup Ready™ soybean classification results

(cv) -Classification errors computed using cross-validation LVs - Latent variables ng - neighborsa Includes the constituents (moisture, protein, oil and fiber) as independent variablesb (25+4:8:2): Input nodes: 25 principal components + 4 constituents; hidden nodes: 8; output nodes: 2c Sample Set III and Test Set III were combined for cross-validation.

Results

300500

700900

11001300

1500

13 14 15 16 17 18 19

7

7.5

8

8.5

9

9.5

10

10.5

11

Number of neighbors

LWR prediction of the 818-sample test set

Number of latent variables

Pe

rce

nta

ge

of c

lass

ifica

tion

err

or

8.3%

7.6%

Figure 6Figure 6 shows the various parameters tested with LWR:

300 - 1500 neighbors 13 - 19 latent variables α = 0 - 1(proportion of

x- and y-distances) Spectra + constituents

or spectra onlyThe lowest classification error (7.6%) was obtained with 850 neighbors and 17 latent variables and α=0 (Figure 6).

The discrimination between RR and conventional soybeans has been achieved using non linear (ANN) and local (LWR) models a on large NIR spectral database.

The optical standardization of spectra issued from different NIR spectrometers may further improve the GMO classification.

The difference in the organic composition between RR and non-RR soybeans detected by NIR measurements must be more clearly defined.

Among the 3 models tested, Locally Weighted Regression achieved the best classification performance (92.4% accuracy). Artificial Neural Networks classified the RR soybeans nearly as accurately as did LWR. Partial Least Squares produced unsatisfactory classification accuracy.

Figure 1

Conclusions

http://www.iowagrain.org http://www.ag.iastate.edu/centers/ccur/