geometric robustness and certification our algorithm ...Β Β· our algorithm experimental evaluation...

1
Our algorithm Experimental evaluation Experimental evaluation Geometric robustness and certification Problem : , = (, ) : , , = ( + ,+ ) , = ( βˆ’ , + ) ( 0 ) ∈ 0 ( 0 ) ∈ Step 1: Approximation via Monte Carlo sampling β‰ˆ 1 =1 , βˆ’ + + ≀ , ෝ , Step 2: Bound the maximum violation : β†’ = ෝ + βˆ’ , . = + 1/2( β€² ) βˆ’ ≀ + 1/2 βˆ’ | β€² | ≀ | | β€² ∈ , ∈ , , βˆ€ ∈ ≀ + βˆ’ , βˆ€ ∈ . Optimization problem ∢= ΰΆ± ∈ , βˆ’ + ∢= ΰΆ± ∈ + βˆ’ , + ≀ , ≀ + , βˆ€ ∈ . Step 3: Sound constraints βˆ— βˆ— ෝ |( βˆ— βˆ— ) βˆ’ ( ෝ )| < + > Asymptotically optimal constraints ෝ + βˆ’ , ≀ βˆ€ ∈ , βˆ’ ෝ + ≀ βˆ€ ∈ ෝ βˆ’ ෝ + Comparison of training techniques Experiments on large networks ResNetTiny Properties accuracy certification rate ∈ ResNet18 Networks

Upload: others

Post on 20-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Geometric robustness and certification Our algorithm ...Β Β· Our algorithm Experimental evaluation Experimental evaluation Geometric robustness and certification Problem: πœ† , =(πœ†

Our algorithm Experimental evaluation

Experimental evaluation

Geometric robustness and certification

Problem

: π‘‡πœ† π‘₯, 𝑦 = (πœ†π‘₯, πœ†π‘¦)

: 𝑇𝛿π‘₯, 𝛿𝑦 π‘₯, 𝑦 = (π‘₯ + 𝛿π‘₯ , 𝑦 + 𝛿𝑦)

π‘‡πœ™ π‘₯, 𝑦 = (π‘₯π‘π‘œπ‘  πœ™ βˆ’ 𝑦𝑠𝑖𝑛 πœ™ ,

π‘₯𝑠𝑖𝑛 πœ™ + π‘¦π‘π‘œπ‘  πœ™ )

π‘…πœ™(π‘₯0) πœ™ ∈

π‘₯0𝐢(π‘…πœ™ π‘₯0

) πœ™ ∈

Step 1: Approximation via Monte Carlo sampling

𝐿 π’˜π‘™ 𝑏𝑙 β‰ˆ1

𝑁

𝑖=1

𝑁

πΌπœΏπ‘– π‘₯, 𝑦 βˆ’ π’˜π‘™π‘‡πœΏπ‘– + 𝑏𝑙

π’˜π‘™π‘‡πœΏπ‘– + 𝑏𝑙 ≀ πΌπœΏπ‘– π‘₯, 𝑦

ΰ·π’˜π‘™ , 𝑏𝑙

Step 2: Bound the maximum violation

𝑓: 𝐷 β†’ 𝑅

𝑓 𝜿 = ΰ·π’˜π‘™π‘‡πœΏ + 𝑏𝑙 βˆ’ 𝐼𝜿 π‘₯, 𝑦 .

𝑓

𝑓 𝜿 = 𝑓 πœΏπ‘ + 1/2𝛻𝑓(πœΏβ€²)𝑇 𝜿 βˆ’ πœΏπ‘β‰€ 𝑓 πœΏπ‘ + 1/2 𝑳 𝑇 𝜿 βˆ’ πœΏπ‘

|πœ•π‘–π‘“ πœΏβ€² | ≀ |𝐿𝑖| πœΏβ€² ∈ 𝐷

𝑓𝑙, 𝑒

𝑓 𝜿 ∈ 𝑙, 𝑒 , βˆ€πœΏ ∈ 𝐷

𝑓 𝜿 ≀ 𝑓 πœΏπ‘ + 𝑒 βˆ’ 𝑓 πœΏπ‘ , βˆ€πœΏ ∈ 𝐷.

Optimization problem

π’˜π‘™ π‘π‘™π’˜π‘’ 𝑏𝑒

𝐿 π’˜π‘™ 𝑏𝑙 ∢= ࢱ𝜿∈𝐷

𝐼𝜿 π‘₯, 𝑦 βˆ’ π’˜π‘™π‘‡πœΏ + 𝑏𝑙 π‘‘πœΏ

π‘ˆ π’˜π‘’ 𝑏𝑒 ∢= ࢱ𝜿∈𝐷

π’˜π‘’π‘‡πœΏ + 𝑏𝑒 βˆ’ 𝐼𝜿 π‘₯, 𝑦 π‘‘πœΏ

π’˜π‘™π‘‡πœΏ + 𝑏𝑙 ≀ 𝐼𝜿 π‘₯, 𝑦 ≀ π’˜π‘’

π‘‡πœΏ + 𝑏𝑒, βˆ€πœΏ ∈ 𝐷.

Step 3: Sound constraints

𝑁 πœ–π’˜π‘™βˆ— 𝑏𝑙

βˆ—

ΰ·π’˜π‘™π‘π‘™ 𝛿

𝑁𝛿 |𝐿(π’˜π‘™βˆ— 𝑏𝑙

βˆ—) βˆ’ 𝐿(ΰ·π’˜π‘™π‘π‘™)| < 𝛿 + πœ– 𝑁 > 𝑁𝛿

Asymptotically optimal constraints

ΰ·π’˜π‘™π‘‡πœΏ + 𝑏𝑙 βˆ’ 𝐼𝜿 π‘₯, 𝑦 ≀ 𝛿𝑙 βˆ€πœΏ ∈ 𝐷

𝐼𝜿 π‘₯, 𝑦 βˆ’ ΰ·π’˜π‘’π‘‡πœΏ + 𝑏𝑒 ≀ 𝛿𝑒 βˆ€πœΏ ∈ 𝐷

π’˜π‘™ ΰ·π’˜π‘™ 𝑏𝑙 𝑏𝑙 βˆ’ π›Ώπ‘™π’˜π‘’ ΰ·π’˜π‘’ 𝑏𝑒 𝑏𝑒 + 𝛿𝑒

Comparison of training techniques

Experiments on large networks

ResNetTiny

Properties

accuracycertification rate

𝐼𝜿 𝜿 ∈ 𝐷

ResNet18

Networks