Google Cloud Platform Blog
Product updates, customer stories, and tips and tricks on Google Cloud Platform
Ezakus runs 1000 nodes of Hadoop on Google Compute Engine
Thursday, August 28, 2014
Ezakus, a leading data management platform, relies on Hadoop to process 600 million digital touch points generated by 40 million web and mobile users.
Fast growth created challenges in managing Ezakus’s existing Hadoop installation, so the company tested several alternatives for running Hadoop. Its benchmarks found that Hadoop on Google Compute Engine delivered processing speeds three to four times faster than the next-best cloud provider.
“Our benchmark tests used the Cloudera Hadoop distribution,” said Olivier Gardinetti, CTO. “We were careful to use identical infrastructure - the same logical CPU count, the same memory capacity and so forth. We also ran each test several times to ensure that outliers weren't skewing the results.”
In the first test, which used MapReduce for basic statistical processing of 20,469,283 browsing-history entries collected over one month, Compute Engine computed the stats in 1 minute and 3 seconds, four times faster than the alternative tested. In a second test with more complex queries, Compute Engine finished in 7 minutes and 47 seconds, three times faster than the closest alternative, which took 23 minutes and 31 seconds.
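As a quick sanity check on the second benchmark, the reported run times can be converted to seconds and divided. This is only an illustrative sketch: the helper name `to_seconds` is hypothetical, and the only inputs are the two durations quoted above.

```python
def to_seconds(minutes: int, seconds: int) -> int:
    """Convert a minutes-and-seconds duration to total seconds."""
    return minutes * 60 + seconds

# Durations reported for the second (complex query) test.
compute_engine = to_seconds(7, 47)        # 467 seconds
closest_alternative = to_seconds(23, 31)  # 1411 seconds

# Speedup = slower run time divided by faster run time.
speedup = closest_alternative / compute_engine
print(f"Speedup: {speedup:.2f}x")  # Speedup: 3.02x
```

The ratio comes out at roughly 3.02, consistent with the "three times faster" figure quoted for this test.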
Ezakus can now provide more performance and predictions and serve more clients, “because we can more easily deploy all the servers in a very short time,” said Gardinetti. To learn more about their migration to Google Cloud Platform and subsequent results for their business, read the case study here.
-Posted by Ori Weinroth, Product Marketing Manager