facebook: infrastructure at scale using emerging hardware bill jia, phd manager, performance and...

32

Upload: angelina-newton

Post on 27-Dec-2015

220 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013
Page 2: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Facebook: Infrastructure at Scale

Using Emerging Hardware

Bill Jia, PhDManager, Performance and Capacity Engineering

Oct 8th, 2013

Page 3: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Agenda

1 Facebook Scale & Infrastructure

2 Efficiency at Facebook

3 Disaggregated Rack

4 New Components – Emerging Hardware

Page 4: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Data Centers in 5 regions.

Facebook Scale

84%

of monthly active users are outside of the U.S. (as of Q2’13)

Page 5: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Facebook Stats

• 1.15 billion users (6/2013)

• ~700 million people use facebook daily

• 350+ million photos added per day (1/2013)

• 240+ billion photos

• 4.5 billion likes, posts and comments per day (5/2013)

Page 6: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Cost and Efficiency

• Infrastructure spend in 2012 (from our 10-K):•“...$1.24 billion for capital expenditures related to the purchase of servers, networking equipment, storage infrastructure, and the construction of data centers.”

• Efficiency work has been a top priority for several years

Page 7: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Architecture

Service Cluster Back-End Cluster

Front-End Cluster

WebWeb

AdsAds few racks

Cache Cache

Search Photos Msg Others UDB ADS-DB Tao Leader

MultifeedMultifeed few racksOther smallOther small services

Page 8: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Lots of “vanity free” servers.

Page 9: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Life of a “hit”Front-End Back-End

Web

MC

MC

MC

MC

Ads

Database

L

Feedagg

requeststarts

Time

requestcomplete

s

LLLLL

Page 10: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Standard Systems

IWeb

IIIDatabase

IVHadoop

VPhotos

VIFeed

CPU High High High Low High

Memory Low High Medium Low High

Disk Low High IOPSFlash High High Medium

Services Web, Chat Database Hadoop(big data) Photos, Video Multifeed,

Search, Ads

Five Standard Servers

Page 11: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Five Server Types

Advantages:• Volume pricing

• Re-purposing

• Easier operations - simpler repairs, drivers, DC headcount

• New servers allocated in hours rather than months

Drawbacks:• 40 major services; 200 minor ones - not all fit perfectly

• The needs of the service change over time.

Page 12: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Agenda

1 Facebook Scale & Infrastructure

2 Efficiency at FB

3 Disaggregated Rack

4 New Components

Page 13: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Efficiency at FB

Data Centers

• Heat management, electrical efficiency & operations

Servers

• “Vanity free” design & supply chain optimization

Software

• Horizontal wins like HPHP/HHVM, cache, db, web & service optimizations

Page 14: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Next Opportunities?

Disaggregated Rack

• Better component/service fit

• Extending component useful life

Developing New Components

• Alternatives in CPU, RAM, Disk & Flash

Page 15: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Agenda

1 Facebook Scale & Infrastructure

2 Efficiency at FB

3 Disaggregated Rack

4 New Components

Page 16: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

A rack of news feed servers...

COMPUTE

RAM

STORAGE

Type-6 ServerType-6 Server

Network SwitchNetwork Switch

Type-6 ServerType-6 Server

Type-6 ServerType-6 Server

Type-6 ServerType-6 Server

=>

5.8 TB

80 TB

.

.

.

FLASH30 TB

Type-6 ServerType-6 Server

80 processors640 cores

Leaf Aggregator

AL

AL

AL

.

.

.

.

The application lives on a rack of equipment--not a single server.

Page 17: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Compute

• Standard Server• N processors

• 8 or 16 DIMM slots

• no hard drive - small flash boot partition.

• big NIC - 10 Gbps or more

Page 18: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Ram Sled

•Hardware• 128GB to 512GB

• compute: FPGA, ASIC, mobile processor or desktop processor

•Performance• 450k to 1 million key/value gets/sec

•Cost• Excluding RAM cost: $500 to $700 or a few dollars per GB

Page 19: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Storage Sled (Knox)

•Hardware• 15 drives

• Replace SAS expander w/ small server

•Performance• 3k IOPS

•Cost• Excluding drives: $500 to $700 or less than $0.01 per GB

Page 20: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Flash Sled

•Hardware• 175GB to 18TB of flash

•Performance• 600k IOPS

•Cost• Excluding flash cost: $500 to $700

NIC at 70% utilization

IOPS Capacity

1 Gbps 21k 175 GB

10 Gb 210k 1.75 TB

25 Gb 525k 4.4 TB

40 Gb 840k 7.7 TB

50 Gb 1.05M 8.8 TB

100 Gb 2.1M 17.5 TB

Page 21: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Three Disaggregated Rack Wins

•Server/Service Fit - across services

•Server/Service Fit - over time

•Longer useful life through smarter hardware refreshes.

Page 22: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Longer Useful Life

Today servers are typically kept in production for about 3 years.

With disaggregated rack:•Compute - 3 to 6 years

•RAM sled - 5 years or more

•Disk sled - 4 to 5 years depending on usage

•Flash sled - 6 years depending on write volume

Page 23: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Disaggregated Rack

Strengths:• Volume pricing, serviceability, etc.

• Custom Configurations

• Hardware evolves with service

• Smarter Technology Refreshes

• Speed of Innovation

Potential issues:• Physical changes required

• Interface overhead

Page 24: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Approximate Win Estimates

Conservative assumptions show a 12% to 20% opex savings.

More aggressive assumptions promise between 14% and 30% opex savings.

* These are reasonable savings estimates of what may be possible across several use cases.

Page 25: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Agenda

1 Facebook Scale & Infrastructure

2 Efficiency at FB

3 Disaggregated Rack

4 New Components

Page 26: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

20 years of the same...The servers we deploy in the datacenter are pretty much identical to a 386 computer from 20 years ago.

•New Technologies: • Math coprocessor

• SMP - N processors per server

• Multi-core processors

• GPUs - vector math

• Flash memory

The exponential improvement of CPU, RAM, disk and NIC density has been incredible.

Page 27: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

CPU

Today: Server CPUs are more compute-dense versions of desktop processors.

Mobile phones are driving development of lower power processors. System on a Chip (SoC) processors for this market integrate several modules into a single package to minimize energy consumption and board real-estate.

Next: Smaller Server CPUs derived from mobile processors may become compelling alternatives.

Page 28: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

RAMToday: The DDR standard RAM modules were developed for desktop servers.

The DDR standard is difficult to hit on a cost/performance basis with alternative technologies.

Next: Lower latency memory technology is interesting for “near RAM.” Slower memory (between RAM and Flash) could be accommodated for “far RAM.”

Page 29: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Storage (ZB of data; 1 ZB = 1 billion TB)

Today: Photo and data storage uses 3.5” disk drives almost exclusively.

Applications and file systems expect low-latency writes and reads and unlimited write cycles. By changing these assumptions optical and solid-state media may be viable.

Next: Cold and big capacity storage. Lower power density and slower performance

Page 30: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Flash (solid state storage)

Today: Flash SSD drives are commonly used in databases and applications that need low-latency high-throughput storage.

SLC – Premium MLC: Expensive, performant, high enduring (10K)

MLC: Less enpensive, performant, lower enduring (3K)

TLC: Cheap, performant (100-300)

Worm TLC: Extremely cheap (1)

Page 31: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013

Network InterfacesToday: 10 Gbps NICs are becoming standard in servers.

40Gbps are emerging and in production

Next: By focusing on rack distances (3 meters) and allowing proprietary technologies, 100 Gbps could be affordable in a few years.

Page 32: Facebook: Infrastructure at Scale Using Emerging Hardware Bill Jia, PhD Manager, Performance and Capacity Engineering Oct 8 th, 2013