What is Rockset?
Rockset is a real-time analytics database which enables queries on massive, semi-structured data without operational burden. Rockset is serverless and fully managed. It offloads the work of managing configuration, cluster provisioning, denormalization, and shard / index management. Rockset is also SOC 2 Type II compliant and offers encryption at rest and in flight, securing and protecting any sensitive data. Most teams can ingest data into Rockset and start executing queries in less than 15 minutes.
With Rockset, ingested data is queryable within one second and analytical queries against that data typically execute in milliseconds. Rockset supports schemaless ingest for structured, semi-structured, geo, time-series, and embeddings data. Via Rockset’s Converged Index(™), all data is automatically indexed three ways - column, row, and search - at the time of ingestion. The SQL query optimizer examines each query and chooses an execution plan for optimal performance. You will typically see query executions in the 100s of milliseconds on fresh data. Rockset is compute optimized, making it suitable for serving high concurrency applications in the sub-100TB range (or larger than 100s of TBs with rollups).
Rockset provides the following key features and benefits:
Rockset has pre-built integrations for:
- Amazon DynamoDB
- Amazon Kinesis
- Amazon S3
- Amazon MSK
- Apache Kafka
- Azure Blob Storage
- Azure Event Hubs
- Azure Service Bus
- Google Cloud Storage
- Microsoft SQL Server
As you follow our step-by-step tutorials, Rockset will automatically load your data within seconds so you can begin making SQL queries immediately. New data can be queried with a p95 (95th percentile latency) of two seconds. Rockset can initially load the data in bulk and then continuously ingest upwards of millions of events per second, to stay in sync with your data source. No ETL tools are required for this process.
Rockset can also follow CDC streams from both RDBMS and NoSQL data stores.
If your data is not in any of these sources or you want to stream events directly into Rockset, you can always use the Write API to directly write data to your Rockset collections. To request support for a new data source, please drop us a note at email@example.com.
Rockset ingests your data without the need for pre-built schemas. Smart schemas are automatically generated based on the exact fields and types present in the ingested data. The smart schema represents and enables SQL queries for semi-structured data, nested objects and arrays, mixed types and nulls. You can also define your own ingest transformation to be applied as documents are ingested into Rockset to create new fields, manipulate existing ones, or configure rollups from your data source.
Learn more about how smart schemas are generated in Rockset.
Full SQL Support
Rockset supports full SQL including:
And queries over all types of fields (including heavily nested objects and arrays) and on any semi-structured data. This enables the use and flexibility of SQL queries over data in supported data sources, even if they don't natively support SQL.
See our SQL Reference for the full list of all functions available for writing SQL queries in Rockset.
Query Lambdas are named, parameterized SQL queries stored in Rockset that can be executed from a dedicated REST endpoint. With Query Lambdas, you can save and enforce version control for your SQL queries and integrate them into your CI/CD workflows.
Use the Rockset CLI to create, manage, and deploy your Query Lambdas directly from your local computer. Query Lambdas are also fully supported in Rockset’s official client libraries and the Rockset API.
Watch our tutorial on how to build applications using Query Lambdas.
Rockset has several client libraries which wrap the Rockset API. The client libraries can be used to programmatically insert, update, and query data from your application's code:
If you want to use Rockset and compose SQL queries inside your IDE, check out our VS Code Extension.
How does it work?
The following subsections describe key aspects on how Rockset works:
- Converged Index™
- Scale Compute and Storage Independently
- Serverless Auto-Scaling in the Cloud
- Enterprise-Grade Security
All fields, including deeply nested fields, are automatically indexed in a Converged Index™ as each record is ingested. They include three indexes:
- Inverted index
- Columnar index
- Row index
A Converged Index™ allows analytical queries on large datasets to return in milliseconds. Using Rockset, you will never have to manually define or create your indexes or update them over time. You can also customize Rockset for efficient, cost-optimized, and massive-scale applications.
Read more about how Rockset builds a Converged Index™ and other design concepts in Rockset’s Architecture Whitepaper.
Scale Compute and Storage Independently
Using Rockset, you can scale compute and storage resources independently for the best balance of price and performance. As your data size grows, you can choose the right amount of compute for the query performance you need at any given time. Hot storage and ingest costs are charged at a fixed rate, while compute resources are based on your Virtual Instance Type.
See Rockset’s full pricing model.
Serverless Auto-Scaling in the Cloud
Rockset uses a modern, cloud-native architecture that auto-scales in the cloud, and automates cluster provisioning and index management. This significantly minimizes any operational overhead because you will never need to provision capacity or manage servers.
For more information about Rockset's architecture and its performance benchmarks, see the Evaluating Data Latency for Real-Time Databases white paper.
Stored data is encrypted using AES-256, and SSL is used in transit. In addition, you can mask sensitive information using an ingest transformation. Read more about our security features including SAML, OAuth, and Okta for single sign-on in the Security section of our documentation. See our Data Privacy Addendumfor additional information.
To get started, create a Rockset account. See our Quick Start to get started running queries on Rockset with sample data. Or, start by load data from your own data sources! If you have any questions, please contact us at firstname.lastname@example.org.