PoweredBy

Applications and organizations using Hive include (alphabetically):

Bizo

We use Hive for reporting and ad hoc queries.

Chitika

We use Hive for data mining and analysis on our 435M monthly global users.

CNET

We use Hive for data mining, internal log analysis and ad hoc queries.

Digg

We use Hive for data mining, internal log analysis, R&D, and reporting/analytics.

eHarmony

We use Hadoop to store copies of internal log and dimension data sources and use it as a source for reporting/analytics and machine learning. Currently have a 640 machine cluster with ~5000 cores and 2PB raw storage. Each (commodity) node has 8 cores and 4 TB of storage.

Grooveshark

We use Hive for user analytics, dataset cleaning, and machine learning R&D.

hi5

We use Hive for analytics, machine learning and social graph analysis.

HubSpot

We use Hive as part of a larger Hadoop pipeline to serve near-realtime web analytics.

Last.fm

We use Hive for various ad hoc queries.

MedHelp Find a Doctor

We implemented Hive to analyse large amounts of doctors across the United States, and for internal analytics for over 1M pageview/day.

NexR

We use hive for replacing Oracle DW, big data analysis and integrating R. We develop the enterprise Hive.

Papertrail

We use Hive as a customer-facing analysis destination for our hosted syslog and app log management service.

Rocket Fuel

We use Hive to host all our fact and dimension data. Off this warehouse, we do reporting, analytics, machine learning and model building, and various ad hoc queries.

SaaSPulse

We use Hive for analytics, machine learning and customer interaction analysis of web applications.

Scribd

We use hive for machine learning, data mining, ad-hoc querying, and both internal and user-facing analytics

TaoBao

We use Hive for data mining, internal log analysis and ad-hoc queries. We also do some extensively developing work on Hive.

Trending Topics

Hot Wikipedia Topics, Served Fresh Daily. Powered by Cloudera Hadoop Distribution & Hive on EC2. We use Hive for log data normalization and building sample datasets for trend detection R&D.

VideoEgg

We use Hive as the core database for our data warehouse where we track and analyze all the usage data of the ads across our network.