Resources
From podcasts to how-to guides, white papers and videos, find what you need below.


DeveloperWeek 2021
Rockset CTO and Co-Founder Dhruba Borthakur is speaking at DeveloperWeek 2021 on empowering application developers with serverless real-time analytics. Join us at the virtual event February 17th-19th.

Reimagining Real-Time Analytics in the Cloud
Rockset co-founders Venkat Venkataramani and Dhruba Borthakur share the trends they are seeing in the market and their vision for a real-time cloud data stack.

Real-Time Analytics at Speed and Scale: When Managing Elasticsearch Gets Too Hard
In this tech talk, we assess Elasticsearch and Rockset for real-time analytics on real-time ingest, data flexibility and scalability.

Postman Galaxy 2021
We'll be speaking at Postman Galaxy 2021 on making application developers more productive with data as an API. Join us February 2nd-4th!

CMU: Quarantine 2020 Database Talks with Dhruba Borthakur
Dhruba Borthakur presents on real-time indexing for fast queries on massive semi-structured data at Carnegie Mellon University.

Real-time Analytics on DynamoDB: The Ultimate Guide
In this ebook, we discuss the analytical query performance of DynamoDB and compare different options for ETL and analytics tools including Athena, Hive/Spark, Elasticsearch, ElastiCache for Redis, and more.

How We Scaled It: Facebook's Online Data Infrastructure to 1B+ Users
An inside look into building massively scalable online infrastructure at Facebook.

Real-Time Analytics on Data Lakes
In this talk, Rockset Co-founder and CTO Dhruba Borthakur explains how real-time indexing on data lakes provides up to 125X faster queries than Athena.

Evaluating Data Latency for Real-Time Databases
RockBench is a benchmark designed to measure the data latency of a real-time database. This paper describes RockBench in detail and the results from running the benchmark on Rockset, a real-time indexing database.

Matter Uses Rockset to Bring AI-Powered Sustainable Insights to Investors
With Rockset, Danish fintech Matter has the flexibility to run analytical queries on semi-structured data in S3 and DynamoDB as part of their NLP architecture.

Future of Real-time Analytics
In this whitepaper, we’re sharing the 5 key principles for real-time analytics of the future.

Scaling Real-time Gaming Leaderboards for Millions of Players
A tech talk on the growing need for real-time leaderboards that can handle user activity at massive scale and real-time aggregations and joins.

Running Real-time A/B Experiments at Massive Scale
In this talk, Rockset Co-founder and CEO Venkat Venkataramani will cover the changing face of real-time analytics and what that means for the data stack.

eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data
eGoGames improves user experience, detects fraud, and makes business decisions using Rockset for real-time analytics on gaming data in Amazon DynamoDB and S3.

Elasticsearch v. Rockset Whitepaper
Rockset's Converged Index™ enables faster time to market and up to 50% lower TCO as compared to Elasticsearch’s search index, for real-time analytics use cases. This is achieved by optimizing for hardware and developer efficiency in the cloud.

Serverless Real-time Indexing: A Low Ops Alternative to Elasticsearch
In this talk, we compare and contrast Elasticsearch and Rockset as indexing data stores for serving low latency queries.

StoryFire - Scaling a Social Video Platform on MongoDB and Rockset
StoryFire uses Rockset to index data from their transactional MongoDB database to achieve performance and scale for analytical queries on their social video platform.

Designing a Real-Time ETA Prediction System Using Kafka, DynamoDB and Rockset
Generate ETA predictions for a delivery service using real-time location and order data from Kafka and DynamoDB.

Scaling MongoDB: Best Practices for Sharding, Indexing and Performance Isolation
A tech talk on how organizations tackle the challenges of sharding, indexing and offloading queries to scale MongoDB.

Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset
Implementing a real-time recommendations API for an event ticketing system by indexing MongoDB data in Rockset for fast SQL.

JOINs and Aggregations Using Real-Time Indexing on MongoDB Atlas
Learn how Rockset builds real-time indexes on MongoDB data for search, aggregations and joins.

Create APIs for Aggregations and Joins on MongoDB in Under 15 minutes
Build a Python application to create and execute APIs on aggregations and joins using Rockset and MongoDB.

Using MongoDB Change Streams for Indexing with Elasticsearch vs Rockset
Learn how Rockset indexes data from MongoDB change data capture (CDC) streams and how it compares to indexing in Elasticsearch.

Understanding Rockset
A guide to help you understand how Rockset works including the basics of integrating data sources, creating collections, running SQL queries and building real-time applications.

Query Performance Assessment
An inside look into the performance of Rockset, illustrating real-world numbers for what you can expect when querying data in Rockset.

How Standard Cognition Builds AI-powered Autonomous Checkout on Computer Vision Data
A tech talk on building real-time applications on computer vision data with customer Standard Cognition.

How to Use KSQL Stream Processing and Real-Time Databases to Analyze Streaming Data in Kafka
We discuss when stream processing with KSQL and Kafka Streams, and when a real-time database like Rockset, is the better fit for analyzing Kafka data.

RocksDB Meetup 2020 at Rockset
Two RocksDB talks on stateful stream processing at LinkedIn and characterizing key-value workloads at Facebook.

Where's My Tesla? Creating a Data API Using Kafka, Rockset and Postman to Find Out
We demonstrate how to expose real-time IoT data in Kafka through the Rockset REST API in this example.

SQL on NoSQL - Enabling Real-Time Analytics on DynamoDB
This tech talk covers how change data capture with DynamoDB Streams can be used to enable complex queries for analytics.

Real-Time Analytics on Connected Car IoT Data Streams from Apache Kafka
In this IoT example, we examine how to enable complex analytic queries on real-time Kafka streams from connected car sensors.

Standard Cognition Uses Rockset to Deliver Data APIs and Real-Time Metrics for Vision AI
Standard Cognition, an AI-powered computer vision company, uses Rockset to enable their developers to deliver data APIs and product improvements.

How Real-Time Updates Work on Rockset with Kshitij
Kshitij is a software engineer at Rockset. This video covers how real-time updates work on Rockset via Rockset's Patch API and remote compaction for writes.

How Query Lambdas Make it Easy to Build on Rockset + Developer Tool Demo
Scott is an engineer at Rockset and helped build Query Lambdas. In this developer video, he explains and demonstrates how Query Lambdas make it easier to build applications.

Rockset's Developer Tool Demo: CLI, Developer UI, and VSCode Plugin
There are now more ways to create and execute Query Lambdas than just the console! Check out the demo to see the new release of our developer tools.

Smart Schema: Enabling SQL on Schemaless Data
Purvi is a software engineer at Rockset, where she helped build Smart Schema. This video covers how you can go from raw data (without ever knowing the shape of your data) to insights with the help of Smart Schema.

Rockset Product Overview with Nadine
Nadine shares how to use Rockset to power data-driven applications.

How Rockset Does Query Plan Optimization
Ari is a software engineer at Rockset, where he helps build all aspects of query planning and execution. This video walks through examples of how Rockset performs query plan optimization.

How Rockset's Converged Index Works
Igor is a Founding Engineer at Rockset, where he works on data indexing and the distributed SQL query engine. This video goes over how Rockset indexes your data via a Converged Index.

Balancing Coding And Management With Dhruba Borthakur
Dhruba shares his experience as Rockset CTO and how he still finds time to write code in addition to building a strong technical team.

SQL API for Real-Time Kafka Analytics in 3 Steps
Learn how to create a SQL API for real-time Kafka analytics on the Twitter Streaming API, using AWS Lambda and Rockset.

Best Practices for Analyzing Kafka Event Streams
Dhruba Borthakur discusses various design patterns for building analytics on Kafka event streams in this tech talk.

Rockset Demo Replay
Recorded version of our weekly live demo, where we go from raw data to real-time applications and dashboards, powered by Rockset.

Analytics on Kafka Event Streams Using Druid, Elasticsearch and Rockset
We examine how different data backends - Druid, Elasticsearch and Rockset - can be used alongside Kafka for analytics on event data streams.

Using Tableau with Kafka: How to Build a Real-Time SQL Dashboard on Streaming Data
Build a real-time Tableau dashboard for operational monitoring and analytics on streaming event data from Kafka.

Fast Analytics On Semi-Structured And Structured Data In The Cloud
Venkat Venkataramani and Shruti Bhat explain how Rockset is architected to allow for fast and flexible SQL analytics on your data.

How We Analyze and Visualize Kubernetes Events in Real Time at Rockset
Learn how we rolled our own tool for analysis and visualization of Kubernetes events, and try the open-source dashboard for yourself.

Outside Lands, Airbnb Prices, and Rockset’s Geospatial Queries
How to use Rockset's fast geospatial indexes with Airbnb data.

Using Tableau with DynamoDB: How to Build a Real-Time SQL Dashboard on NoSQL Data
We create an example dashboard in Tableau on data in DynamoDB, using Rockset as the SQL intelligence layer.

3 cost-cutting tips for Amazon DynamoDB
How to avoid costly mistakes with DynamoDB partition keys, read/write capacity modes, and global secondary indexes.

Tableau Operational Dashboards and Reporting on DynamoDB
We review several approaches to building operational dashboards and reporting on DynamoDB data, using SQL engines like Redshift and Athena.

Custom Live Dashboards on DynamoDB
We cover different approaches to live dashboards on DynamoDB, using DynamoDB Streams, Lambda, and ElastiCache.

Tech Talk: ALT architecture for low-latency queries on large datasets
We present how the Aggregator-Leaf-Tailer architecture is used to implement Rockset's low-latency, serverless operational analytics engine.

Facebook Data Infrastructure with Dhruba Borthakur
Dhruba joins the show to discuss his time at Facebook building data infrastructure. He takes us through the major projects he worked on.

Redshift with Rockset: High performance queries for operational analytics
Run high-performance queries on data from Redshift tables by continuously ingesting and indexing Redshift data through a Rockset-Redshift integration.

Data Council SF 19: Architecting a Low-Latency Schemaless SQL Engine
Learn about the challenges when adapting SQL to principles of strong dynamic typing and the idea of a Converged Index.

Venkat Venkataramani - Valuing People
Venkat Venkataramani talks with Dave Rael about leadership, humility, contribution, commitment to doing the right things, and valuing people over software and data.

Bringing Scalable Real-Time Analytics to the Enterprise
Dhruba Borthakur and Shruti Bhat share how Rockset is bringing operational analytics to the enterprise by simplifying data architecture in the cloud.

Using Tableau for Live Dashboards on Event Data
Connect a Tableau live dashboard to a real-time event stream of complex JSON in a few easy steps.

Up and Coming: Three Startups with Amazing Potential
Venkat Venkataramani discusses how Rockset is innovating in cloud data management to minimize the effort needed to get from data to apps.

FULL Uses Rockset with DynamoDB for Live Dashboard to Manage Remote Workforce
FULL Creative uses Rockset to build live dashboards and run complex SQL on contact center call data in DynamoDB.

Building a Serverless Analytics App to Capture and Query Clickstream Data
We built a web app that collects clickstream data as free-form JSON and runs SQL queries on the live data in a completely serverless fashion.

NorCalDB2019: Powering fast SQL over semi-structured data in Rockset
We discuss and demonstrate some of the key aspects of Rockset: strong dynamic typing, the Converged Index, and continuous auto-scaling.

Decore Uses Rockset for Search & Analytics on DynamoDB
Decore needed to enable ad hoc queries in their crypto accounting software service, so they turned to Rockset for fast analytics on DynamoDB.

Kafka Meetup: Operational Analytics on Event Streams in Kafka
We demonstrate how to create stateful microservices to analyze event streams using Kafka and Rockset.

Analytics on DynamoDB: Comparing Athena, Spark and Elastic
We compare options for real-time analytics on DynamoDB in terms of ease of setup, maintenance, query capability, and latency.

Strata 2019: A Data System for Low-Latency Queries for Search and Analytics
Learn how Rockset's Converged Index—combining columnar and search indexes—powers fast search and analytics.

Fynd - How Does a Growing E-Commerce Portal Respond to Consumer Behavior in Real Time?
Fynd uses Rockset to perform fast queries on real-time event streams, so they can react to consumer behavior as it happens.

The Path to Better Pollution Forecasting Goes Through Nested JSON
Pittsburgh-based developer Doug Balog collects and analyzes nested JSON weather data to improve pollution forecasts in his community.

Implementing a Sensor Network Simply and Efficiently - An MIT Smart City Project
An MIT team collaborates with a school in Brazil on a smart city project to analyze weather sensor data using Rockset.

How to Build a Facebook Messenger Chatbot Powered by Fast SQL on CSV
Build a chatbot that provides instant responses, leveraging fast SQL queries on CSV data.

Using Smart Schema to Accelerate Insights from Nested JSON
Use Rockset's Smart Schema to understand complex, nested JSON and enable immediate queries using SQL on raw data.

How to Run SQL on PDF Files
Run SQL queries on data from PDF files, and join PDFs with JSON, CSV, XLSX, and other data.

RocksDB with Dhruba Borthakur and Igor Canadi
Dhruba and Igor explain the technology behind RocksDB, which backs some of the most demanding applications and databases around.

Visualize Data in Rockset with Redash
Create visualizations on your Rockset data quickly and easily using native integration with Redash.

Running Fast SQL on DynamoDB Tables
Run SQL queries on your DynamoDB tables without any ETL and without impacting your production workloads.

Rockset adds Excel spreadsheet support
Run complex SQL across multiple Excel spreadsheets and join XLSX files with JSON, Parquet or CSV data.

Real-Time Analytics Using SQL on Streaming Data with Apache Kafka and Rockset
Connect Kafka and Rockset to obtain real-time analytics with ad hoc SQL queries on event streams.

Building a Serverless Microservice Using Rockset and AWS Lambda
Use SQL to join and query JSON and CSV data, and build a serverless microservice using AWS Lambda.

Rockset Data Platform with Venkat Venkataramani
Venkat discusses his journey from scaling data infrastructure at Facebook to transforming how data-driven apps are built at Rockset.

Building a Live Dashboard on Streaming Data Using Amazon Kinesis and Rockset
Serve real-time analytics on Twitter data, using SQL on streaming data from Amazon Kinesis.

Running SQL on Nested JSON
Make raw JSON immediately queryable through fast SQL queries, without ETL, data pipelines, or fixed schema.

Revolutionizing Big Data Apps
Rockset CEO Venkat Venkataramani talks about building apps without pipelines, and serverless search and analytics with native SQL.

From Useful Data to Useful Applications with Rockset
Rockset CEO Venkat Venkataramani explains how Rockset makes it really simple to build data-driven apps with very fast SQL on raw data.

Speeding Cloud-Native App Development with Rockset
Rockset CTO Dhruba Borthakur speaks with ActualTech about what Rockset does and how it changes the game for enterprises.

CUBEConversation: Venkat Venkataramani, Rockset & Jerry Chen, Greylock
Venkat and Jerry discuss Rockset's vision and the future of cloud and data.

Building Live Apps on Raw Data with Rockset
As the first cloud service that runs SQL directly on raw data, Rockset greatly simplifies app development and data science on complex data sets.

Rockset Concepts, Design & Architecture
Learn how Rockset's architecture enables highly parallelized execution of complex queries across diverse data sets.