backy - osbconf - open source backup conference · backy vm backup beyond bacula/bareos ... bacula:...

30
backy VM backup beyond Bacula/Bareos Christian Theune @theuni ct@flyingcircus.io

Upload: others

Post on 19-Sep-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

backy

VM backup beyond Bacula/BareosChristian Theune

@theuni

[email protected]

Page 2: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Mea Culpa

Page 3: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

And I almost

missed it — again

Page 4: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Backup!!11!!

Page 5: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

• flyingcircus.io

• DevOps as a Service

• custom, mission-critical web

applications

Page 6: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Part I - Oh the Pain

Page 7: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

The story unfolds …

Page 8: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient
Page 9: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient
Page 10: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient
Page 11: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient
Page 12: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Root Cause Analysis

Page 13: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

http://flyingcircus.io/

postmortems/13266.pdf

Page 14: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Restore script bottleneck:

global lock

Page 15: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Undetected inconsistency in

important customer database

Page 16: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Bacula: complexity and the

VTL

Page 17: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Not “everything” backed up.

Page 18: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

24 hours are not a sufficient

RPO in quite a few cases

Page 19: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Paper cuts

• Hard link farms

• Boot loaders

• The director as a “most valuable bottleneck”

Page 20: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Recap

• Restore fiddly to script

• Undetected inconsistency that was hard to deal with

• Blind spots

• Daily Interval

• Overall complexity, performance and the VTL

• Paper cuts

Page 21: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Part II - Make a wish

Page 22: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Simplicity

• Restore with basic Unix tools

• No VTL

• Not mixing data of different VMs

Page 23: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Reliability

• Verification / Scrubbing / (Repair)

• High frequency

• Integration with storage snapshots

• Not inventing new formats

Page 24: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Operability

• Avoid bottlenecks / head-of-line blocking

• Efficient deltas for large files (ZODB)

• Parallelisation (multiple jobs and multiple servers)

• Simple scripting and environment-specific integration

• Coordination: pre/post actions on storage, hypervisor,

VM …

Page 25: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Operability II

• Simple Nagios integration to ensure we notice RPO/

SLA failures

• RTO-compliance during mass-restore

• Self-service for customers to restore files or VMs

Page 26: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

Part III - Let’s do this!

Page 27: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

–Probably someone, maybe me

“One size fits all … not”

It’s all about size

Page 28: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

It’s all about size: backy

Page 29: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

It’s all about size: Bacula

Page 30: Backy - OSBConf - Open Source Backup Conference · backy VM backup beyond Bacula/Bareos ... Bacula: complexity and the VTL. Not “everything” backed up. 24 hours are not a sufficient

It’s all about size: Bareos