Search and analytics at cloud scale

Rockset is a real-time search and analytics database designed to serve millisecond-latency analytical queries on event streams, CDC streams, and vectors.

Cloud-native efficiency

Compute-compute separation

Compute-storage separation

Autoscaling in the cloud

Fully-managed cloud service

Rockset isolates streaming ingest compute from query compute for predictable performance even in the face of high-volume writes or reads. Avoid compute contention, overprovisioning, and replicas.


Real-time streaming ingest

Schemaless Ingest

Rockset automatically and continuously infers schema based on fields and types, even if types are mixed. Ingest semi-structured data or nested objects and arrays and execute relational SQL queries over these constructs.
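To make the idea of mixed-type schema inference concrete, here is a minimal Python sketch. It is not Rockset's implementation: the function names (`type_name`, `infer_schema`), the path notation, and the sample documents are all assumptions made for illustration. It walks nested objects and arrays and records every type observed at each field path.

```python
from collections import defaultdict


def type_name(value):
    """Map a Python value to a coarse JSON-style type name."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return "bool"
    if isinstance(value, int):
        return "int"
    if isinstance(value, float):
        return "float"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    if isinstance(value, dict):
        return "object"
    return type(value).__name__


def infer_schema(documents):
    """Return {field_path: sorted list of type names seen at that path}."""
    observed = defaultdict(set)

    def walk(value, path):
        observed[path].add(type_name(value))
        if isinstance(value, dict):
            for key, child in value.items():
                walk(child, f"{path}.{key}")
        elif isinstance(value, list):
            for child in value:
                walk(child, f"{path}[*]")

    for doc in documents:
        for key, value in doc.items():
            walk(value, key)
    return {path: sorted(types) for path, types in observed.items()}


# Hypothetical sample documents with mixed types for the same field.
docs = [
    {"user_id": 1, "tags": ["a", "b"], "geo": {"city": "Oslo"}},
    {"user_id": "u-2", "tags": [], "geo": None},
]
schema = infer_schema(docs)
```

Here `user_id` ends up with both `int` and `string` recorded, which is the kind of mixed-type field the paragraph above describes.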

Built-In Connectors

Ingest data with native connectors for streams, databases, data lakes, and cloud data warehouses. Ingest from anywhere using the write API or CDC templates. Ingest millions of events per second and query within 2 seconds. No ETL tools required.

Ingest Transformations

Pre-aggregate and transform data at ingest time using SQL. Reduce the cost of storing and querying high-volume streaming data by up to 100x, and eliminate the need for separate pipelines to process streaming data.
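The effect of pre-aggregating at ingest time can be sketched in a few lines of Python. In Rockset the transformation is written in SQL; the code below only mirrors the idea of grouping raw events into time buckets before they are stored. The `rollup` function, the field names (`ts`, `page`, `bytes`), and the one-minute bucket are assumptions chosen for the example.

```python
from collections import defaultdict


def rollup(events, bucket_seconds=60):
    """Pre-aggregate raw events into one row per (time bucket, page)."""
    totals = defaultdict(lambda: {"views": 0, "bytes": 0})
    for event in events:
        # Truncate the timestamp down to the start of its bucket.
        bucket = event["ts"] - event["ts"] % bucket_seconds
        row = totals[(bucket, event["page"])]
        row["views"] += 1
        row["bytes"] += event["bytes"]
    return [
        {"bucket": bucket, "page": page, **row}
        for (bucket, page), row in sorted(totals.items())
    ]


# Hypothetical raw events; timestamps are seconds since an arbitrary start.
raw = [
    {"ts": 0, "page": "/home", "bytes": 100},
    {"ts": 15, "page": "/home", "bytes": 200},
    {"ts": 59, "page": "/docs", "bytes": 50},
    {"ts": 61, "page": "/home", "bytes": 10},
]
rows = rollup(raw)
```

Four raw events collapse into three stored rows here. The storage saving grows with the number of events that fall into the same bucket and group.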

Real-Time Speed

Ingest streaming data, index it, and query it with end-to-end latency of less than a second.

In-Place Updates

Insert, update, and delete data in place without expensive merge operations. Avoid the slow, error-prone workarounds for append-only databases.

Millisecond-latency search & analytics

Converged Index

Rockset stores all data in a Converged Index, which combines row, column, and inverted indexes. This enables fast, compute-efficient queries, regardless of the access pattern or shape of the data.
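As a rough illustration of why indexing the same data three ways helps, the toy Python class below keeps a row store, a column store, and an inverted index side by side, and serves each access pattern from the structure that suits it. This is a conceptual sketch only, not how Rockset lays out data; the class name `ToyConvergedIndex`, its methods, and the sample documents are assumptions, and it handles only flat documents with hashable values.

```python
from collections import defaultdict


class ToyConvergedIndex:
    """Index every document three ways: by row, by column, and inverted."""

    def __init__(self):
        self.rows = {}                      # doc_id -> full document
        self.columns = defaultdict(dict)    # field -> {doc_id: value}
        self.inverted = defaultdict(set)    # (field, value) -> {doc_id}

    def add(self, doc_id, doc):
        self.rows[doc_id] = doc
        for field, value in doc.items():
            self.columns[field][doc_id] = value
            self.inverted[(field, value)].add(doc_id)

    def get(self, doc_id):
        """Point lookup: served from the row store."""
        return self.rows[doc_id]

    def column_sum(self, field):
        """Aggregation over one field: scans only that column."""
        return sum(self.columns[field].values())

    def find(self, field, value):
        """Selective filter: served from the inverted index."""
        return sorted(self.inverted.get((field, value), set()))


# Hypothetical sample data.
index = ToyConvergedIndex()
index.add(1, {"city": "Oslo", "amount": 10})
index.add(2, {"city": "Lima", "amount": 5})
index.add(3, {"city": "Oslo", "amount": 7})

oslo_ids = index.find("city", "Oslo")
total = index.column_sum("amount")
```

The trade-off in this toy version is visible in `add`: every write updates three structures so that reads can pick the cheapest one.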

Full-Featured SQL

Run SQL search, aggregations, and joins directly on semi-structured data. Rockset uses SQL as its native query language.

High-Performance Joins

Rockset supports multiple strategies to optimize join performance, so users do not have to denormalize data or perform application-side joins.

Vector Search

Run fast KNN search with metadata filters to deliver relevant results with just milliseconds of latency.
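The combination of a metadata filter and a nearest-neighbor ranking can be sketched as follows. This is an exact, brute-force version in plain Python, written only to show the logic; it says nothing about the index structures or distance functions Rockset uses. The `knn` function, the `category` field, and the two-dimensional embeddings are assumptions for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def knn(items, query, k, predicate):
    """Return the ids of the k items most similar to `query`,
    considering only items that pass the metadata predicate."""
    candidates = [item for item in items if predicate(item)]
    ranked = sorted(
        candidates,
        key=lambda item: cosine_similarity(item["embedding"], query),
        reverse=True,
    )
    return [item["id"] for item in ranked[:k]]


# Hypothetical catalog with tiny embeddings.
catalog = [
    {"id": "a", "category": "shoes", "embedding": [1.0, 0.0]},
    {"id": "b", "category": "shoes", "embedding": [0.6, 0.8]},
    {"id": "c", "category": "hats", "embedding": [1.0, 0.1]},
    {"id": "d", "category": "shoes", "embedding": [0.0, 1.0]},
]
result = knn(catalog, [1.0, 0.0], 2, lambda item: item["category"] == "shoes")
```

Item `c` is very close to the query vector but is excluded by the filter, so the two nearest matches come from the `shoes` category only.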

Developer productivity

Query Lambdas

Query Lambdas are named, parameterized SQL queries that can be executed from dedicated REST endpoints. Enforce version control and integrate them into CI/CD workflows.
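The idea of a named, versioned, parameterized query can be illustrated locally with Python's built-in `sqlite3` module. This is a stand-in for the concept, not the Rockset API: a real Query Lambda is stored in Rockset and invoked over REST, whereas the `QUERIES` registry, the `run_named_query` helper, and the `orders` table below are all hypothetical.

```python
import sqlite3

# Hypothetical registry: (name, version) -> SQL text with named parameters.
QUERIES = {
    ("top_orders", "v1"): (
        "SELECT id, amount FROM orders "
        "WHERE amount >= :min_amount ORDER BY amount DESC"
    ),
    ("top_orders", "v2"): (
        "SELECT id, amount FROM orders "
        "WHERE amount >= :min_amount ORDER BY amount DESC LIMIT :limit"
    ),
}


def run_named_query(conn, name, version, params):
    """Look up a stored query by name and version, then bind parameters.
    Callers never send raw SQL, only a name, a version, and values."""
    sql = QUERIES[(name, version)]
    return conn.execute(sql, params).fetchall()


# In-memory database with sample data, standing in for a collection.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, 5.0), (2, 50.0), (3, 20.0)],
)

v1 = run_named_query(conn, "top_orders", "v1", {"min_amount": 10})
v2 = run_named_query(conn, "top_orders", "v2", {"min_amount": 10, "limit": 1})
```

Because each version is addressed explicitly, an application can keep calling `v1` while `v2` is reviewed and rolled out, which is what makes this pattern fit version control and CI/CD workflows.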

Resource Automation

Automatically deploy Rockset integrations, collections, resources, and third-party dependencies using the Rockset Terraform Provider. Use the dbt adapter to load data and create collections using SQL SELECT statements.

UDFs

JavaScript user-defined functions (UDFs) can perform operations beyond built-in SQL functions. They run in an isolated, sandboxed environment and provide support for advanced math functions, transpositions, and more.

SDKs

Rockset provides client libraries for Node.js, Java, Go, and Python, as well as the Rockset CLI, all of which wrap the Rockset API. Client libraries can be used to programmatically insert, update, and query data from your application's code.

Production ready

Reliability

Continuous backups with data stored on SSDs for performance and cloud storage for durability
Disaster recovery with multi-region hot-cold and hot-hot deployment options
Enterprise SLAs

Compliance

SOC 2 Type II
HIPAA
GDPR
CCPA

Security

Single Sign-On with Okta and custom SSO
Data Masking
Data encrypted at rest and in transit
Role-Based Access Control (RBAC)
IP Allowlisting
AWS PrivateLink
Virtual Private Rockset in your VPC

Monitoring

Performance metric endpoint