appsphere 15 - achieving stability and end-to-end monitoring

Post on 15-Jan-2017

369 Views

Category:

Documents

3 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Achieving stability and End to End Monitoring Stace Baal, eHarmony Inc.Director of Engineering

The Problem

The Mess or Miss of Data

What we’re working on

The Rollout

The Business Transaction

The Drill Down

Copyright © 2015 AppDynamics. All rights reserved. 2

Dashboards

Health Rules

Proactive

That shouldn’t happen?!

Out of the datacenter – EUM

What’s next?

THE PROBLEM

The Problem

The Problem

Mature, large brand in the midst of changing architecture• Instability from Monolithic legacy application• Reactive not proactive with production issues• Little visibility into how new services interacted with legacy apps• Lack of data on customer experience

The Problem

THE MESS OR MISS OF DATA

The mess of data

We needed one source to truth to quickly diagnose issues• Fragmentation of monitoring systems• Lots of holes, lack of time for maintenance as services evolve• Ineffective alerting, noise• User facing outages

The mess of data

Copyright © 2015 AppDynamics. All rights reserved. 10

Why APM?

Challenges that caused us to choose AppDynamics• Stability• Time to Market: Needed a turnkey solution• Minimal internal resources• Engineering driven solution• Ability to track and trend baselines• Single pane of glass

THE ROLLOUT

How do we roll this out?

Challenges that caused us to choose AppDynamics• Deploy• Define• Diagnose• Notify• Discovery

Tiers

Business Transactions

Dashboards

Health Rules

Snapshots

Rolling out AppDynamics

Decide what to cover and how to divide it up• Create applications• Define and organize tiers• Deploy

– Start Front to Back– Chef Integration– Staging – Production, rolling deploy

• Wow, that went fast… – Time to market vs. homgrown solutions

Example call graph

Rolling out AppDynamics

Copyright © 2015 AppDynamics. All rights reserved. 16

• 4 Applications– User Front– User Back– Batch– Staging

• 38 Tiers (and counting)• EUM, Web & Mobile

THE BUSINESS TRANSACTION

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 18

Defining Business Transactions

BT: Business transaction correlated across service tiers• Visibility into what is most important• Start simple, broad transactions key to the business• Add granularity over time• May take some time, but it’s worth it

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 20

One Transaction

Several URLs

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 21

One Transaction

Correlated Tiers

Defining Business Transactions

BT is the foundation for all diagnostic tools• Correlation• Trends and dynamic baselines• Health rules• Drilldown and discovery

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 23

THE DRILLDOWN

Drilling down into Transaction Snapshots

Your diagnostic window into a transaction• Single pane view across tiers• Deep code-level visibility• Database and exit calls• Custom diagnostic data• Server health and utilization

Drilling down into Transaction Snapshots

Copyright © 2015 AppDynamics. All rights reserved. 26

Drilling down into Transaction Snapshots

Copyright © 2015 AppDynamics. All rights reserved. 27

DASHBOARDS

Dashboards

Dashboards

Your tool to bring everything together• At a glance diagnostic and status• Comprehensive tier and node monitoring• Connecting the dots between systems• Trending over time• Easily surface data between teams

Dashboards

Copyright © 2015 AppDynamics. All rights reserved. 31

Traffic

Performance

Dashboards

Copyright © 2015 AppDynamics. All rights reserved. 32

WEB

API

Service

Dashboards

Copyright © 2015 AppDynamics. All rights reserved. 33

PROACTIVE

So now what?

• Data and tools drive a cultural shift• Alert and surface issues before than can become outages• Health Rules are able to be predictive• Socialize the system beyond on-call • Empower teams and increase visibility• Encourage engineers to drilldown and optimize

THAT SHOULDN’T HAPPEN?!

That shouldn’t happen?!

That shouldn’t happen?!

That Random Spike

Copyright © 2015 AppDynamics. All rights reserved. 38

That shouldn’t happen?!

After the release

Copyright © 2015 AppDynamics. All rights reserved. 39

OUT OF THE DATACENTER - EUM

End user Monitoring

Extending visibility to the end user experience• Web: Pages & Ajax Requests• Mobile: Network Requests• Mobile: Crash Reporting• Custom Events & Timers• User snapshots correlated with BT snapshots• Justify and prioritize infrastructure and application enhancements• Correlate performance to business metrics • CDN and 3rd party monitoring

Stability & End to End Monitoring

Copyright © 2015 AppDynamics. All rights reserved. 42

WHAT’S NEXT?

Thank You

top related