Download - Hadoop and HBase on Amazon Web Services
![Page 2: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/2.jpg)
Thank you.
![Page 3: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/3.jpg)
Introducing Hadoop3
![Page 4: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/4.jpg)
HBase on AWSg
Introducing Hadoop3
![Page 5: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/5.jpg)
Cost optimizationv
HBase on AWSg
Introducing Hadoop3
![Page 6: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/6.jpg)
Data for competitive advantage.
![Page 7: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/7.jpg)
Customer segmentation, financial modeling, system analysis,line-of-sight,business intelligence...
Using data
![Page 8: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/8.jpg)
Generation
Collection & storage
Analytics & computation
Collaboration & sharing
![Page 9: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/9.jpg)
Cost of data generationis falling.
![Page 10: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/10.jpg)
lower cost, increased throughput
Generation
Collection & storage
Analytics & computation
Collaboration & sharing
![Page 11: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/11.jpg)
HIGHLY CONSTRAINED
Generation
Collection & storage
Analytics & computation
Collaboration & sharing
![Page 12: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/12.jpg)
Very high barrier to turning data into information.
![Page 13: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/13.jpg)
Move from a data generation challengeto analytics challenge.
![Page 14: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/14.jpg)
Enter the AWS Cloud.
![Page 15: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/15.jpg)
Remove the constraints.
![Page 16: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/16.jpg)
Enable data-driven innovation.
![Page 17: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/17.jpg)
Move to a distributed data approach.
![Page 18: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/18.jpg)
Maturation of two things.
![Page 19: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/19.jpg)
Maturation of two things.
Software for distributed storage and analysis
![Page 20: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/20.jpg)
Maturation of two things.
Software for distributed storage and analysis
Infrastructure for distributed storage and analysis
![Page 21: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/21.jpg)
Frameworks for data-intensive workloads.
Software
Distributed by design.
![Page 22: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/22.jpg)
Platform for data-intensive workloads.
Infrastructure
Distributed by design.
![Page 23: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/23.jpg)
Support the data life cycle.
![Page 24: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/24.jpg)
HIGHLY CONSTRAINED
Generation
Collection & storage
Analytics & computation
Collaboration & sharing
![Page 25: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/25.jpg)
Generation
Collection & storage
Analytics & computation
Collaboration & sharing
![Page 26: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/26.jpg)
Lower the barrier to entry.
![Page 27: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/27.jpg)
Accelerate time to market and increase agility.
![Page 28: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/28.jpg)
Enable new business opportunities.
![Page 29: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/29.jpg)
Washington Post
NASA
![Page 30: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/30.jpg)
“AWS enables Pfizer to explore di!cult or deep scientific questions in a timely, scalable manner and helps us make better decisions more quickly”
Michael Miller, Pfizer
![Page 31: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/31.jpg)
Introducing Hadoop3
![Page 32: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/32.jpg)
Maturation of two things.
Software for distributed storage and analysis
Infrastructure for distributed storage and analysis
![Page 33: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/33.jpg)
Maturation of two things.
Software for distributed storage and analysis
Infrastructure for distributed storage and analysis
![Page 34: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/34.jpg)
Apache Hadoop
Software for distributed storage and analysis
Implements the map/reduce pattern
Focus on your data
![Page 35: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/35.jpg)
Built for uncertainty
Hadoop provides tools to navigate data
Allows discovery
Query flexibility at scale
![Page 36: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/36.jpg)
Built for flexibility
Java native
Executes code in any language
Just a distribution mechanism
![Page 37: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/37.jpg)
Rich ecosystem
Diverse tools
Machine learning, recommendations, predictive analytics, segmentation, real time analysis
Lots of innovation
![Page 38: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/38.jpg)
But...
A very big project
500k+ lines of code
Challenging to configure and optimize
![Page 39: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/39.jpg)
Undi!erentiated heavy liftingG
![Page 40: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/40.jpg)
Amazon Elastic MapReduce
![Page 41: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/41.jpg)
Amazon Elastic MapReduce
Web service for data processing
Hosted Hadoop
Configured and optimized
![Page 42: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/42.jpg)
Amazon Elastic MapReduce
Job flows
Elastic platform
Maintain clusters or run once and terminate
Debugging tools
![Page 43: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/43.jpg)
Input data
S3
![Page 44: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/44.jpg)
Elastic MapReduce
Code
Input data
S3
![Page 45: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/45.jpg)
Elastic MapReduce
Code Name node
Input data
S3
![Page 46: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/46.jpg)
Elastic MapReduce
Code Name node
Input data
S3
Elastic cluster
![Page 47: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/47.jpg)
Elastic MapReduce
Code Name node
Input data
S3
Elastic cluster
HDFS
![Page 48: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/48.jpg)
Elastic MapReduce
Code Name node
Input data
S3
Elastic cluster
HDFSQueries
+ BIVia JDBC, Pig, Hive
![Page 49: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/49.jpg)
Elastic MapReduce
Code Name node
OutputS3 + SimpleDB
Input data
S3
Elastic cluster
HDFSQueries
+ BIVia JDBC, Pig, Hive
![Page 50: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/50.jpg)
OutputS3 + SimpleDB
Input data
S3
![Page 51: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/51.jpg)
![Page 52: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/52.jpg)
![Page 53: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/53.jpg)
![Page 54: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/54.jpg)
![Page 55: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/55.jpg)
![Page 56: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/56.jpg)
![Page 57: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/57.jpg)
![Page 58: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/58.jpg)
![Page 59: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/59.jpg)
![Page 60: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/60.jpg)
![Page 61: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/61.jpg)
![Page 62: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/62.jpg)
Hadoop all the way down
Amazon Hadoop distribution
HDFS
Streaming interface
Hive, Pig, Mahout, Spark, Shark
![Page 63: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/63.jpg)
Data integration
Optimized and integrated into AWS environment
Reads and writes to S3
Analytics on DynamoDB data
Can process data from any source: Cassandra, Mongo, Couch, Amazon RDS
![Page 64: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/64.jpg)
Data movement
Multi-part upload
Import/Export
AWS Direct Connect
Aspera
![Page 65: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/65.jpg)
Cluster scalability
Resize running job flows
Add capacity for shorter runs
Remove capacity during o! peak hours
Balance scale and cost
![Page 66: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/66.jpg)
Cluster scalability
14 hours remaining
![Page 67: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/67.jpg)
Cluster scalability
7 hours remaining
![Page 68: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/68.jpg)
Cluster scalability
3 hours remaining
![Page 69: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/69.jpg)
Cluster scalability
Steady state Steady stateLarge batch task
![Page 70: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/70.jpg)
Cluster availability
Canonical source of data
Any one in the engineering team
IAM integration
Monitoring
![Page 71: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/71.jpg)
Click stream analysis for retail
3.5 billion records71 million unique cookies1.7 million targeted ads
13 Tb of clickstream logs
Each day
![Page 72: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/72.jpg)
Click stream analysis for retail
Workflow time from 2 days to 8 hours
Procurement time from 2 months to 5 minutes
$13k per month
500% increase return on advertising spend
![Page 73: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/73.jpg)
![Page 74: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/74.jpg)
Months of user click-through data Search terms Ads displayed Premium listing inventory
Amazon S3
Log data stored in Amazon S3
![Page 75: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/75.jpg)
Hadoop Cluster
Amazon EMR Amazon S3
Elastic Map Reduce spins up 200 instance cluster
![Page 76: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/76.jpg)
Hadoop Cluster
Amazon EMR Amazon S3
Find patterns across logs. Write results to S3.
![Page 77: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/77.jpg)
Hadoop in the AWS Cloud
Elastic MapReduce for hosted Hadoop
Optimized, configured, ready to roll
Focus on the business benefit of data
Hadoop all the way down
![Page 78: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/78.jpg)
Maturation of two things.
Software for distributed storage and analysis
Infrastructure for distributed storage and analysis
![Page 79: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/79.jpg)
HBase on AWSg
![Page 80: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/80.jpg)
Vibrant ecosystem
Mahout for machine learning
Mesos for cluster management
Spark for fast analytics
HBase for unstructured data
![Page 81: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/81.jpg)
HBase
NoSQL data store
Runs on top of HDFS
Scalable
Rapid retrieval across large datasets
![Page 82: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/82.jpg)
Architecture
Huge, distributed map/hash
Distributed
Implements Bloom filters
Sortable
![Page 83: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/83.jpg)
Column based
Columns are similar to fields
Rows are records
![Page 84: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/84.jpg)
Built for data
Built to scale across billions of rows
The more data, the better the relative performance
![Page 85: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/85.jpg)
But...
Large, complex project
Running in production can be challenging
Distributed system
![Page 86: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/86.jpg)
Undi!erentiated heavy liftingG
![Page 87: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/87.jpg)
HBase for Elastic MapReduce
![Page 88: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/88.jpg)
![Page 89: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/89.jpg)
![Page 90: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/90.jpg)
![Page 91: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/91.jpg)
![Page 92: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/92.jpg)
![Page 93: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/93.jpg)
![Page 94: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/94.jpg)
![Page 95: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/95.jpg)
![Page 96: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/96.jpg)
Using HBase
Social media firehose
Customer information
Usage and application logs
Hadoop analytics
![Page 97: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/97.jpg)
Generation
Collection & storage
Analytics & computation
Collaboration & sharing
![Page 98: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/98.jpg)
Amazon DynamoDB
NoSQL database service
Provisioned throughput
Unlimited storage
Very easy to use
![Page 99: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/99.jpg)
DynamoDB & Amazon EMR
SQL like queries
Query flexibility at scale
Integrate queries across datasets
Hive
![Page 100: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/100.jpg)
NoSQL on the AWS Marketplace
CouchDB
Cassandra
MongoDB
aws.amazon.com/marketplace
![Page 101: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/101.jpg)
Cost optimizationv
![Page 102: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/102.jpg)
Lowered prices 19 times in the past six years.
![Page 103: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/103.jpg)
On-demand
![Page 104: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/104.jpg)
Reserved capacity
![Page 105: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/105.jpg)
100%
Reserved capacity
![Page 106: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/106.jpg)
100%
Reserved capacity
On-demand
![Page 107: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/107.jpg)
100%
Reserved capacity
On-demand
![Page 108: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/108.jpg)
Spot market
![Page 109: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/109.jpg)
![Page 110: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/110.jpg)
![Page 111: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/111.jpg)
![Page 112: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/112.jpg)
$0.08 vs $0.007(yesterday evening)
![Page 113: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/113.jpg)
![Page 114: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/114.jpg)
![Page 115: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/115.jpg)
Reserved Instance Marketplace
![Page 116: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/116.jpg)
![Page 117: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/117.jpg)
Cost optimizationv
HBase on AWSg
Introducing Hadoop3
![Page 118: Hadoop and HBase on Amazon Web Services](https://reader033.vdocuments.mx/reader033/viewer/2022060111/5563a2ddd8b42a01658b51fc/html5/thumbnails/118.jpg)
aws.amazon.com/elasticmapreduceB