monitoring that will make your engineers give up - gil zellner, gigaspaces - devopsdays tel aviv...

21
Ignite Session DevOpsDays TLV 2015

Upload: devopsdays-tel-aviv

Post on 26-Jan-2017

306 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Ignite SessionDevOpsDays TLV 2015

Page 2: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Monitoring that will make your engineers give up

Gil Zellner (CloudifyDev at Gigaspaces)

Page 3: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

tl;dr

Page 4: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Why is monitoring so important ?

Page 5: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 6: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 7: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 8: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 9: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

solution: alert only things that meet the following criteria:

1) actionable - can I do something about this?2) does this currently or immediately break the business?3) this cannot wait till morning

Page 10: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Next day

Page 11: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

2nd Deadly sin of monitoringSingle team does monitoring, everyone else is second tier

Page 12: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Solution: direct alerts to relevant parties1) only person who can fix the problem gets alerted, others get emails2) system needs to be smart enough to make the choice, and fixed when it

makes a mistake in waking up the wrong person

Page 13: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Alerte générale!

Page 14: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 15: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 16: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Solution: Monitoring needs to be a part of the designthe empty error - classic example - null pointer exceptions in java

make your developers accountable for empty errors

Page 17: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 18: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015
Page 19: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

solutions:self correcting metrics. if an alert goes off for a metric, and we decide it wasn’t a real error - a dialog for changing the threshold should pop up.

Page 20: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

solution example: netflixstarts per minute

Page 21: Monitoring That Will Make Your Engineers Give Up - Gil Zellner, GigaSpaces - DevOpsDays Tel Aviv 2015

Bad artists copy, great artists steal

We’re hiring:

http://jobs.gigaspaces.com/ email:[email protected]

Twitter: @Heathenaspargus