siamese network with multi-level features for patch-based ... · • tasseled cap transformations...

25
Siamese Network with Multi - Level Features for Patch - based Change Detection in Satellite Imagery 2018 IEEE Global Conference on Signal and Information Processing Faiz Ur Rahman 1 , Bhavan Vasu 1 , Jared Van Cor 2 John Kerekes 2 , Andreas Savakis 1 1 Vision and Image Processing Laboratory 2 Digital Imaging and Remote Sensing Laboratory Rochester Institute of Technology Rochester, New York, USA

Upload: others

Post on 16-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

SiameseNetworkwithMulti-LevelFeaturesforPatch-basedChangeDetectioninSatelliteImagery

2018 IEEE Global Conference on Signal and Information Processing

Faiz Ur Rahman1, Bhavan Vasu1, Jared Van Cor2

John Kerekes2, Andreas Savakis1

1Vision and Image Processing Laboratory2Digital Imaging and Remote Sensing Laboratory

Rochester Institute of TechnologyRochester, New York, USA

Page 2: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Outline

• ChangeDetection andMotivation

• CNN’sandSiameseNetworks

• SimulationDataset:DIRSIG

• Results

– UsingFullyConnectedDecisionNetwork

– UseofBootstrapping

– UseofEuclideanDistanceandThresholding

• Conclusions

Page 3: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

ChangeDetection

§ Identifyingchangesofinterest ineachsetofimagesisafundamental taskincomputervisionwithnumerousapplicationslike

• Faultdetection

• Disastermanagement

• Cropmonitoring

• Aerialsurveying• Weinvestigatedchangesinhelicopter

detection(present/not present)

3Fig1:Vector Illustration of GIS Spatial Data Layers Concept for Business Analysis, Geographic Information System, Icons Design, Liner Style. Source: Adobe Stock Images

Fig1

Page 4: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

ChangeDetection:AlgebraicApproaches

Earlierattemptstodetectchangesvariedincomplexityfromsomebasicalgebraicmethodstoseveralmachinelearningmethods.Somebasicalgebraicmethodswere:

• Imagedifferencing[3]• Connectionvectoranalysis[4]• Ratioing[5]

Buttheaboveapproachesyieldahighfalsepositiveduetoilluminationchangesweredeveloped forsinglesourceimagery.

42018 IEEE Global Conference on Signal and Information Processing

[3] D. Muchoney and B. Haack, “Change detection for monitoring forest defoliation,” Photogrammetric Engineering and Remote Sensing, vol. 60, pp. 1243–1251, 1994.

[4] E. F. Lambin, “Change detection at multiple temporal scales: seasonal and annual variations in landscape variables,” Photogrammetric Engineering and Remote Sensing, vol. 62, no. 8, pp. 931–938, 1996.

[5] A. Prakash and R. P. Gupta, “Land-use mapping and change detection in a coal mining area - a case study in the Jharia coalfield, India,” International Journal of Remote Sensing, vol. 19, no. 3, pp. 391–410, 1998.

Page 5: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

ChangeDetection:AdvancedApproachesSometechniques frommachinelearningwerealsoadoptedtoreducedataredundancybetweenbandsandpredictpixelsincluding:

• PrincipalComponentAnalysis(PCA)[6]• TasseledCaptransformations [7]• Gramm-Schmidt[8]• Chi-square [9]• Imageregression topredictpixels[10]

Thesewerelimitedasallchangesweredetectednotjustchangesofinterest.

52018 IEEE Global Conference on Signal and Information Processing

[6] G. Byrne, P. Crapper, and K. Mayo, “Monitoring land-cover change by principal component analysis of multi-temporal landsat data,” Remote sensing of Environment, vol. 10, no. 3, pp. 175–184, 1980.

[7] K. C. Seto, C. Woodcock, C. Song, X. Huang, J. Lu, and R. Kaufmann, “Monitoring land-use change in the pearl river delta using Landsat,” International Journal of Remote Sensing, vol. 23, pp. 1985–2004, 2002.

[8] J. B. Collins and C. E. Woodcock, “An assessment of several linear change detection techniques for mapping forest mortality using multitemporal Landsat data,” Remote sensing of environment, vol. 56, pp. 66–77, 1996.

[9] M. K. Ridd and J. Liu, “A comparison of four algorithms for change detection in an urban environment,” Remote sensing of environment, vol. 63, no. 2, pp. 95–100, 1998

[10] A. Singh, “Spectral separability of tropical forest cover classes,” International Journal of Remote Sensing, vol. 8, no. 7, pp. 971–979, 1987.

Page 6: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

DeepCNNasaFeatureExtractorDeepLearningApproaches:Theworkin[1]proposedusingadeepconvolutionalneuralnetworkasafeatureextractorfollowedbyastageofsegmentationandthresholding.

6[1] A. M. El Amin, Q. Liu, and Y. Wang, “Convolutional neural network features based change detection in satellite images,” in First International Workshop on Pattern Recognition International Society for Optics and Photonics, 2016.

Fig 2: Block diagram of convolutional neural network features based change detection

Page 7: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

SiameseArchitectures• ASiameseArchitecturehastwoidenticalchannels,thatareusuallybased

onstandardCNNarchitectures. Eachofthechannelsprocessesdifferentimages.

• PotentialapplicationsofSiamesenetworksinclude- PersonRe-Identification- ObjectTracking- ChangeDetection

• ASiameseCNNwasusedforchangedetectionwiththresholdsegmentation- Y.Zhan,K.Fu,M.Yan,X.Sun,H.Wang,andX.Qiu,“Changedetectionbasedon

deepSiameseconvolutionalnetworkforopticalaerialimages,”IEEEGeoscienceandRemoteSensingLetters,2017.

• AfullyconvolutionalSiamesearchitecturewasusedforchangedetectionofhyperspectral images- R.Daudt,B.LeSaux,A.Boulch,“FullyConvolutionalSiameseNetworksfor

ChangeDetection,“ IEEEInt.Conf.ImageProcessing(ICIP),2018.

72018 IEEE Global Conference on Signal and Information Processing

Page 8: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

SiameseNetworkArchitecture

8

Fig 4: Siamese Neural Network Architecture with Decision Network

• Our Siamese network has two identical convolutional networks that merge into a common decision network.

• The convolutional networks are VGG16 architectures pre-trained on ImageNet.

• Both VGG16 networks share their trainable parameters.

• The decision network is trained to detect changes between the Target and Reference Images.

2018 IEEE Global Conference on Signal and Information Processing

Page 9: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Multi-LevelFeatureConcatenation

9

Fig 5: Siamese Neural Network multi-level feature concatenation.

Blocks SizeOfFilters

Block5 3X3X1024

Block5&Block4 6X6X2048

Block5,Block4&Block3 12X12X2560

2018 IEEE Global Conference on Signal and Information Processing

Block 5

Block 4

Block 3

Block 2

Block 1

Page 10: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

DecisionNetworkforSiameseNetwork

Fig6:Decisionnetworkarchitecture

• DecisionNetworkconsistsoftwoFullyConnected(FC)layers.

• Thistensorvariableconsistsof1024tensors,512x3x3fromeachnetwork(forBlock5).

• ThefirstFClayertakesasinputtheoutputsofthetwoSiameseVGG16blocks.

• ThesecondFClayeroutputscorrespond totwoclasses:Change andNoChange.

102018 IEEE Global Conference on Signal and Information Processing

Page 11: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Decision Network Training

• Theconvolutionnetwork(Siamesechannel)isusedtoextractfeaturesthatarestoredfortrainingthedecisionnetwork.

• Weinterpolatethefeaturemapsto48X48,thisenablesustostackfeaturesfromdifferentlayers.

• ExtractedfeaturesfromdifferentVGGblocksareconcatenatedbeforepassingthemtothefullyconnected layer.

• WeexperimentwithVGGfeaturesfromjustBlock5,concatenatingBlock4andBlock5andthisprocessiscontinueduntilthewehavefeaturesfromBlock1,Block2,Block3,Block4&Block5.

112018 IEEE Global Conference on Signal and Information Processing

Page 12: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

DigitalImagingandRemoteSensingImageGeneration(DIRSIG)

• RITdevelopedDIRSIGisleadingtoolforsceneandimagechainsimulations• Commonusesinclude:

⁃ Sensor/systemperformanceevaluation⁃ Algorithmtrainingandevaluation⁃ Rapidparametricanalysis

• ImageModalities⁃ Visiblethroughthermalinfrared(0.4- 20.0microns)⁃ PassiveSensing

⁃ Broad-band,multi-spectral(MS),hyperspectral(HS)imaging

⁃ Polarization(usuallylimitedbymaterialproperties)⁃ ActiveLasersensing

⁃ TopographicLADARandatmospheric/gasLIDAR• Instruments

⁃ Singlepixel,1Darraysand2Darrays⁃ Filter,diffraction/refractionorinterferogram-based⁃ Photoncollection

• Platforms⁃ Ground,airorspaceonstaticormovingplatforms

• Phenomenology• Passivethermodynamics,secondarylightsources,dynamic

scenecontent(movinggeometry),volumeradiometry(plumes,water),etc.

MegaScene Tile #1- 5,000+ objects - 500+ million facets- 1.6 km2 (0.6 mi2)

122018 IEEE Global Conference on Signal and Information Processing

Page 13: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Change Detection Dataset

• The DIRSIG simulation environment was used to create our training and testing datasets.

• Images of helicopters and backgrounds were simulated to be representative of pan-sharpened DigitalGlobe WorldView-2 imagery.

• Image chips for Target and Reference images of size 80 × 80 were generated under different backgrounds and illumination, with a variety of spectral and structural properties.

• We made sure that our data samples contained enough variation to allow the network to learn changes related to the presence of helicopters and ignore illumination changes.

132018 IEEE Global Conference on Signal and Information Processing

Page 14: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Example Simulated Image Chip Pairs

142018 IEEE Global Conference on Signal and Information Processing

Our dataset consisted of

• 11,700 image pairs for training.

• 5,000 image pairs for testing.

• Variations in illumination for target/no target pairs

Image Pair 1&2

Image Pair 3&4

Image Pair 5&6

Image Pair 7&8

Illumination A

No Target Target

Illumination B

No Target Target

Page 15: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Decision Network Training (Block 5)

• Decisionnetworktrainedonfeatures fromBlock5.

• Trainingaccuracy95.6%.

152018 IEEE Global Conference on Signal and Information Processing

Decisionnetworkwastrainedfor80 epochswithabatchsizeof150andlearningrateof0.01 usingSGD lossonaTITANGTX960.Samehyperparametersareusedforfurtherexperiments.

Fig 8: Training curve for decision network trained with Block 5 features.

Iteration

Accu

racy

(Fra

ctio

n)

Page 16: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Example Results (Block5Features)

162018 IEEE Global Conference on Signal and Information Processing

ReferenceImage

TargetImage

Output:

GroundTruth:

NoChangeChange ChangeChange

NoChangeChange ChangeNoChange

Fig 9: Results for change detection using features from Block 5 to train decision network.

Page 17: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Training Curve (Block4&Block5)

• DecisionnetworktrainedoncombinedfeaturesfromBlock4andBlock5.

• Learningrate0.01withSGD Loss.

• TrainingAccuracyis96.7%.

172018 IEEE Global Conference on Signal and Information Processing

Fig 10: Training curve for decision network trained with Block 4 and Block 5 features.

Page 18: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Example Results (Block5&Block4Features)

182018 IEEE Global Conference on Signal and Information Processing

ReferenceImage

TargetImage

Output:

GroundTruth: NoChangeChangeNoChange NoChange

NoChangeChangeNoChange Change

Fig 11: Results for change detection using features from block 4 and block 5 to train decision network.

Page 19: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Results with Fully Connected Decision Network

TestDataConfusionMatrices

Block5

Prediction→Truth Change NOChange

Change 4873 633

NOChange 127 4367

Prediction→Truth Change NOChange

Change 4951 411

NOChange 49 4589

Block5&Block4

NumberOfblocks TestAccuracy(%)

Block5 92.4

Block5&Block4 95.9

Block5,Block4&Block3 93.5

Block5,Block4,Block3&Block2 89.1

Block5,Block4,Block3,Block2&Block1 84.8

Accuracy

192018 IEEE Global Conference on Signal and Information Processing

Page 20: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Bootstrapping

20

• Bootstrappingofimagechipstoincreasethenumberoftruepositivesdetected.

• Bootstrappingisperformedbyre-trainingthefinaldecisionlayerwithallfalsenegativesandfalsepositives.

202018 IEEE Global Conference on Signal and Information Processing

Accuracy

FeaturesUsed TestAccuracy(%)

Block5&Block4(Bootstrapping) 96.5

Page 21: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Results with Bootstrapping

212018 IEEE Global Conference on Signal and Information Processing

ReferenceImage

TargetImage

Output:

GroundTruth: NoChangeChangeNoChange Change

NoChangeChangeNoChange NoChange

Fig 12: Results for change detection using features from block 4 and block 5 with bootstrapping

Page 22: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Siamese Network withEuclidian Distance Comparison

• Euclidian distance is a simple yet effectivedistance measure for comparingsamples p and q in n-dim feature space.

• Euclidian Distance above a Threshold T=0.76is used to detect a Change, else No Change.

• This architecture does not require supervision(training) other than the selection of thethreshold parameter.

222018 IEEE Global Conference on Signal and Information Processing

Fig 13: Siamese network with Euclidean distance

VGG

16

VGG

16

Page 23: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Results Using Euclidean DistanceFeatures TestAccuracy(%) Comments

Block5 91.8

Block5&4 93.4 Givesbetterresultswithmorefeatures

Block5&4&3 90.6 Startsgivingfalsetoomanyfalsepositives

Block5&4&3&2 88.5 Toomanyfalsepositives

Block5&4Euclidean(threshold=0.52) 86.9 AlltruePositiveswith

manyfalsepositives

• Weobservedthatthealgorithmgenerallygetsconfusedwhenhelicopter isonthetarmac

• WeexperimentedwithdecreasingthethresholdfortheEuclideandistanceto0.52(from0.76)todetectall thetruepositives,butthisresulted inmanyfalsepositives.

232018 IEEE Global Conference on Signal and Information Processing

Page 24: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

2018 IEEE Global Conference on Signal and Information Processing

Results using Euclidean (Block 5 & Block 4)

24

SourceImage

TargetImage

Output:

GroundTruth: ChangeNoChangeChangeNoChange

Fig 14: Results for change detection using features from Block 5 and Block 4with Euclidean distance.

ChangeNoChangeChangeChange

Page 25: Siamese Network with Multi-Level Features for Patch-based ... · • Tasseled Cap transformations [7] • Gramm-Schmidt [8] • Chi-square [9] • Image regression to predict pixels

Conclusions

25

• We proposed a Siamese architecture that can detect changes in the presence of helicopters between two satellite image pairs.

• Experiments with multilevel VGG features and a decision network showed that the features from Block 4 and Block 5, when combined, were the most useful at detecting changes.

• Bootstrapping was seen to aid in increasing the true positive rate of our Siamese architecture.

• The Siamese architecture with a trained decision network outperformed a Siamese network with simple thresholding of Euclidean distance.

2018 IEEE Global Conference on Signal and Information Processing 25