Druid is a high-performance, column-oriented, distributed data store.

Download GitHub


Interactive Queries

Issue sub-second ad-hoc queries to group, filter, and aggregate data. Druid is ideal for powering multi-tenant user-facing applications.


Real-time Streams

Explore events immediately after they occur. Ingest data in streams or batches to unify real-time and historical views.


Horizontally Scalable

Existing Druid clusters have scaled to petabytes of data and trillions of events, ingesting millions of events every second. Druid is extremely cost effective, even at scale.


Deploy Anywhere

Druid runs on commodity hardware. Deploy it in the cloud or on-premise. Integrate with existing big data systems such as Hadoop, Spark, Kafka, Storm, Flink, and Samza.


Vibrant Community

Druid is a community led project. Join the fast growing community and work with developers from across the world.

Learn More:


Powered by Druid

Learn more about how many different organizations use Druid in production.


Quickstart

Try the quickstart and get started in minutes. Load your own data and query it.


Introduction

Learn more about the high level architecture and concepts behind Druid.


Powerful UIs and Rich Client Libraries

Visualize data with Pivot and Superset. Query data in many different languages.