february 2016 webinar series - use aws cloud storage as the foundation for hybrid strategy
TRANSCRIPT
© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Isaiah Weiner, AWS Partner Solutions Architecture
2/23/2016
Use AWS Cloud Storage as the Foundation for Hybrid Strategy
Getting Started
Clarifying Block vs. File vs. Object Use Case: Backup and Recovery Use Case: Big Data Use Case: Active Archive Use Case: Primary Storage
Block vs. File vs. Object
< 5 ms✔
< 10 ms✔✔Standards-based,Legacy app support
< 100 ms✔
✔ + Customizable!Modern stack support,
scalable interfaces
LatencyDataMetadataAccess
What should I use, and when?
Economics Easy to Use Reduce risk Agility, Scale Pay as you go No upfront investment No commitment No risky capacity
planning
Self service administration
SDKs for simple integration
Durable and Secure Avoid risks of physical
media handling
Reduce time to market Focus on your business,
not your infrastructure
Amazon S3Durable object storage
for all types of data
Amazon EBSBlock storage for use
with Amazon EC2
Amazon GlacierArchival storage for infrequently accessed data
Amazon EFSFile storage for use with Amazon EC2
Backup and Recovery
Backup and Recovery to the Cloud
Amazon S3
Amazon GlacierAWS
DirectConnect
InternetAmazon S3-IA
Applicationservers
Cloud Gateway
Local disk
MediaServer
Cloud Gateway
HTTPS/API
Applicationservers
Cloud Connector
Local diskMedia
Server with cloud
connector
HTTPS/API
Gateway: AWS Storage Gateway
Amazon EBS snapshots
Amazon S3
Amazon Glacier
AWSStorage Gateway
appliance
Applicationserver
AWSStorage Gateway
backendAWS
DirectConnect
Internet
Customer premises
Gateway: NetApp AltaVault
Common backup applications integrated with AltaVaultSolve backup & archive headaches with cloud-integrated storage
90% reduction in time, cost, and data volumes Shrink recovery times from days to minutes 85% of backup & software providers supported
Glacier
On Premises
AWS
Cloud-integrated storage appliance
NetApp AltaVault
FAS
E-SeriesNon-NetApp
Storage
Seamlessly integrates into existing storage and backup
software environment
Deduplicates, compresses, and encrypts
Caches recent backups locally, vaults older copies to
the cloud
NetApp SnapProtect Arcserve CommVault Simpana EMC NetWorker HP Data Protector IBM Tivoli
Storage Mgr
Symantec Backup Exec
Symantec NetBackup
Veeam Microsoft SQL
Server Oracle RMAN
S3
AltaVault also available on marketplace to protect cloud-native workloads
Store data in the public or private cloud of choice
Big Data
S3 + EMR
EMR cluster
Amazon S3
EMR cluster
corporate data center
AWSDirect
Connect
Internet
Applicationserver
S3 + RedShift
Amazon S3corporate data center
AWSDirect
Connect
Internet
Applicationserver
Dive Deep on Big Data with S3AWS re:Invent 2015: DAT201 Introduction to Amazon Redshifthttps://www.youtube.com/watch?v=DIj1bFjiqd8
AWS re:Invent 2015: DAT308 How Yahoo! Analyzes Billions of Events a Day on Amazon RedShifthttps://www.youtube.com/watch?v=3qmzwqnC67kDAT308 Slides:http://www.slideshare.net/AmazonWebServices/dat308-yahoo-analyzes-billions-of-events-a-day-on-amazon-redshift
AWS re:Invent 2015: BDT305 Amazon EMR Deep Dive and Best Practiceshttps://www.youtube.com/watch?v=4HseALaLllcBDT305 Slides:http://www.slideshare.net/AmazonWebServices/bdt305-amazon-emr-deep-dive-and-best-practices
BDT314: Running a Big Data and Analytics Application on Amazon EMR and Amazon Redshift with a Focus on Securityhttp://www.slideshare.net/AmazonWebServices/bdt314-a-big-data-analytics-app-on-amazon-emr-amazon-redshift
Active Archive
AWS Import/Export Disk
• Accelerates moving large amounts of data into and out of Amazon S3, Glacier and EBS
• Transfers your data directly onto and off of customer owned storage devices
• Uses Amazon high-speed internal network to complete the transfer
• Supports up to eSATA and USB 2,3 attached drives up to 6 TB and 16 TB arrays
AWS Import/Export
What is Snowball? Petabyte scale data transport
E-ink shipping label
Ruggedizedcase“8.5G Impact”
All data encrypted end-to-end
Rain & dust resistant
Tamper-resistant case & electronics
50 TB10GE network
How it works
How fast is Snowball?• Less than 1 day to transfer 250TB via 5x10G connections with 5
Snowballs, less than 1 week including shipping• Number of days to transfer 250TB via the Internet at typical utilizations
Internet Connection SpeedUtilization 1Gbps 500Mbps 300Mbps 150Mbps
25% 95 190 316 63250% 47 95 158 31675% 32 63 105 211
How fast is Snowball? Less than 1 day to transfer 250TB via 5x10G connections with 5
Snowballs, less than 1 week including shipping Number of days to transfer 250TB via the Internet at typical utilizations
Internet Connection SpeedUtilization 1Gbps 500Mbps 300Mbps 150Mbps
25% 95 190 316 63250% 47 95 158 31675% 32 63 105 211
How fast is Snowball? Less than 1 day to transfer 250TB via 5x10G connections with 5
Snowballs, less than 1 week including shipping Number of days to transfer 250TB via the Internet at typical utilizations
Internet Connection SpeedUtilization 1Gbps 500Mbps 300Mbps 150Mbps
25% 95 190 316 63250% 47 95 158 31675% 32 63 105 211
When to use AWS Import/Export Snowball
Cloud Migration
Disaster Recovery
DatacenterDecommission
ContentDistribution
AWS Snowball AWS Import/Export Disk
When to use Disk vs Snowball?
Import only, Export coming soon Currently available in US East
and US West 2 Import to S3 only Supports large data transfers,
from TBs to PBs
Supports import and export for S3 buckets and EBS snapshot import in:
US East (N. Virginia) US West (Oregon) US West (Northern California) EU (Ireland) Asia Pacific (Singapore)
Supports import into Glacier in: US East (N. Virginia) US West (Oregon) US West (Northern California) EU (Ireland) regions.
Use Amazon Glacierfor lowest-cost, durable cold
storage of archival data
Use Amazon S3 for reliable,
durable primary storage
Use Amazon S3 Infrequent Access
Storage for secondary backups
at a lower cost
S3-IA
Tiering on AWS: optimize your storage spend
Key prefix “logs/”
Transition objects to Glacier 30 days after creation
Delete 365 days after creation date
<LifecycleConfiguration> <Rule>
<ID>archive-in-30-days</ID> <Prefix>logs/</Prefix> <Status>Enabled</Status> <Transition>
<Days>30</Days>
<StorageClass>GLACIER</StorageClass> </Transition> <Expiration>
<Days>365</Days> </Expiration>
</Rule></LifecycleConfiguration
S3 lifecycle policies
What about WORM?
SEC Rule 17a-4(f) FINRA Rule 4511 CFTC Regulation 1.31
Have: Need: Glacier Vault Lock
Dive Deep on Active Archive
AWS re:Invent 2015: STG202 AWS Import/Export Snowball: Large-Scale Data Ingest into AWShttps://www.youtube.com/watch?v=86ogJHFSJRoSlides:http://www.slideshare.net/AmazonWebServices/stg202-aws-importexport-snowball-largescale-data-ingest-into-aws
Third-Party SEC 17a-4(f) Assessment for Vault Lockhttps://aws.amazon.com/blogs/aws/glacier-cohasset-assessment/
Service details and pricinghttps://aws.amazon.com/importexport/
Primary Storage
Hybrid Primary Storage Approaches
Java Python (boto) PHP .NET Ruby Node.js
iOS Android AWS Toolkit for Visual
Studio
AWS Toolkit for Eclipse
AWS Tools for Windows
PowerShell
AWS CLI
JavaScript
Hybrid Primary Storage Approaches
Customer Datacenter
Amazon S3
Existing Storage
Infrastructure
On-PremisesApplicationServers
NFS, CIFS, SMB, iSCSI, FC, AoE, …
Works withexisting
applications
NAS or SAN Resources
?
Primary Storage Technology Partners
S3 backed, NFS/SMB server
Natural fit for hybrid strategy
Originally a NAS accelerator
Past founder successes: CMU AFS, Transarc (IBM), Spinnaker Networks (NetApp)
Primary Storage Technology Partners
Global filesystem backed by S3 supports NFS/SMB
Natural fit for hybrid strategy
Past founder successes: Alteon (Nortel), Aruba Networks (HP)
Dive Deep with Primary Storage Use Cases
AWS re:Invent 2015: STG308 How Electronic Arts, State of Texas, & H3 Biomedicine Use AWShttps://www.youtube.com/watch?v=PmPriuFEz1kSlides:http://www.slideshare.net/AmazonWebServices/stg308-how-ea-state-of-texas-h3-biomedicine-protect-data
Thank you!