Featured Post

September 21, 2021

Engineering

How We Improved the Concurrency and Scalability of Our Redis Rate Limiting System

We use a rate limiting system, based on Redis, to protect services from overload. Learn how we increased its concurrency and scalability in this blog.

Akshay Nanavati

Latest Posts

September 15, 2021

Kafka

Product

Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data

We’re introducing a new fully-managed Kafka Integration with native support for Confluent Cloud and Apache Kafka. Get started with real-time analytics on event streams from Apache Kafka in minutes.

Boyang Chen

September 10, 2021

Company

Why I Joined Rockset

Daniel Lu, Rockset’s new Director of Product Marketing, shares why he is excited about Rockset.

Daniel Lu

September 8, 2021

Developer

Hello World: Join the New Rockset Developer Community

We are unveiling our community, developer mascot, and Real-time Rockstars!

Nadine Farah

September 7, 2021

Kafka

See Rockset’s Rollups for Streaming Data at Kafka Summit 2021

Rockset, a Gold Sponsor of Kafka Summit Americas 2021, to present and demo SQL-based rollups on streaming data.

Giovanni Tropeano

September 3, 2021

Product

Faster Results and a Better Experience with New Pagination in Rockset

Rockset’s new pagination approach enables customers to query large amounts of data faster and more consistently.

Rafael Kabesa

August 31, 2021

Real-Time Analytics

Product

How Rockset Enables SQL-Based Rollups for Streaming Data

Learn how Rockset enables SQL-based rollups on streaming data for complex and accurate real-time analytics.

Venkat Venkataramani

August 25, 2021

Druid

Product

Kafka

Kinesis

Real-Time Analytics

Rollups on Streaming Data: Rockset vs Apache Druid

Continuously roll up and transform streaming data from any source using SQL. Learn how rollups in Rockset compare to Apache Druid.

Vibhuti Bhushan

August 5, 2021

Company

Real-Time Analytics

Why Rockset & Why Now

Ryan Precious shares his thoughts on real-time analytics and the opportunity ahead for Rockset.

Ryan Precious

August 4, 2021

Snowflake

Real-Time Analytics

Real-Time Data Ingestion: Snowflake, Snowpipe and Rockset

We examine the performance and cost of real-time data ingestion in Snowflake and Snowpipe as compared to Rockset.

Shawn Adams

July 29, 2021

DynamoDB

Product

Engineering

20x Faster Ingestion with Rockset's New DynamoDB Connector

Get 20x faster ingestion on DynamoDB tables with Rockset's improved connector, which uses DynamoDB's export to S3 functionality.

Purvi Desai

July 22, 2021

DynamoDB

Real-Time Analytics

Scaling Real-Time Gaming Leaderboards with DynamoDB and Rockset

Learn how DynamoDB and Rockset deliver the ultimate data stack for real-time analytics in gaming.

Julie Mills

July 19, 2021

Druid

Developer

How to Handle Nested Data in Apache Druid

Nested data needs to be flattened upon ingestion when using Apache Druid. We look at how to ingest and query nested data in Druid and alternatives to flattening data.

Shawn Adams

July 15, 2021

SQL

Real-Time Analytics

Product

Real-Time Analytics with dbt + Rockset

The dbt-Rockset adapter makes it easy to perform SQL transformations for real-time analytics. Load data into Rockset and create collections by writing SQL SELECT statements in dbt.

Sam Crowder

July 8, 2021

MongoDB

5 Can't Miss MongoDB.live Talks

As we gear up for MongoDB.live on July 13-14, here are some conference talks we're looking forward to attending.

Kevin Leong

July 7, 2021

SQL

Druid

How to Handle Database Joins in Apache Druid

This article focuses on implementing database joins in Apache Druid, looks at some limitations developers face, and explores possible solutions.

Shawn Adams

July 1, 2021

Developer

Create a Data API on MySQL Data with Rockset

We’ll be uploading, analyzing, and creating a data API on Airbnb data from Amazon RDS MySQL in Rockset.

Nadine Farah

June 29, 2021

Product

Production Visibility: Metrics Monitoring and Alerting

Rockset introduced console metrics and an integration for third-party monitoring tools to provide greater visibility for production workloads.

Brian Liang

June 18, 2021

Engineering

Company

My New Grad Experience at Rockset

Karen joined Rockset two years ago as a fresh CS grad. She shares highlights of her Rockset experience as a software engineer on the backend team.

Karen Li

June 17, 2021

Real-Time Analytics

The Emergence of Real-Time Analytics

Real-time analytics is now within reach of all companies from lean startups to large enterprises.

Julie Mills

June 7, 2021

Big Ideas

DynamoDB

MongoDB

MySQL

PostgreSQL

Change Data Capture: What It Is and How to Use It

Change data capture (CDC) is a useful tool in many data architectures. Learn what CDC is, how it is implemented and when to use it.

Lewis Gavin

June 4, 2021

Engineering

RocksDB

Rockset Converged Index Adds Clustered Search Index for 70% Query Latency Reduction

We share how a new storage format for the search index in Rockset’s Converged Index reduced query latencies by as much as 70% and the size of the search index by about 20%.

Sandeep Dhoot

June 1, 2021

Developer

MySQL

Real-Time Analytics

Getting Started with Real-Time Analytics on MySQL Using Rockset

In this blog, we walk you through how to scale your Amazon RDS MySQL analytical workload with Rockset.

Nadine Farah

May 27, 2021

Product

Elasticsearch

RocksDB

Compare and Contrast Search Indexing With Real-Time Converged Indexing

Elasticsearch and Rockset as indexing data stores for serving low latency queries.

Giovanni Tropeano

May 24, 2021

Big Ideas

What Is a Serverless Database and Why Use One

Serverless is commonly associated with functions and Lambdas, but engineering teams should also be knowledgeable about serverless databases and the benefits they provide.

Ben Rogojan

May 21, 2021

Real-Time Analytics

Popular Use Cases for Real-Time Analytics

While real-time analytics is in demand, implementing it is not without its challenges.

Julie Mills

May 17, 2021

Real-Time Analytics

3 Reasons Why Real-Time Analytics Is More Affordable Than You Think

If you are considering real-time analytics, here are some ways to ensure you are taking the most cost-effective approach.

Kevin Leong

May 14, 2021

Developer

SQL

Find and Replace Text with SQL Regular Expressions in Rockset

When we tried to unnest a field, we got multiple errors. Check out this blog to see how we use regex to debug the errors and replace the problematic characters!

Nadine Farah

May 13, 2021

Real-Time Analytics

SaaS Industry Trends in Real-Time Analytics

Multiple industries are seeing real-time analytics trends emerge due to customer application usage. Requirements for instant access to data are driving app development teams to invest heavily in embedded real-time analytics.

Giovanni Tropeano

May 11, 2021

Real-Time Analytics

Data Applications

Building Data Applications Powered by Real-Time Analytics

We share 3 key criteria for your real-time analytics platform that will fuel long-term success with data apps.

Shruti Bhat

May 5, 2021

Developer

Working with Mixed Data Types within a Field Using Rockset

When working with mixed field types, you’ll have to adjust your queries to take into consideration data types and values you don’t want to use. Here, we work through an example by ordering movies by release year.

Nadine Farah

April 28, 2021

Company

Engineering

Leading Design as a UX Team of 1

Aditi shares her experience leading design in Rockset’s fast-paced, developer-first environment.

Aditi Dhar

April 27, 2021

Developer

Flattening a JSON Object So It’s Queryable Using Rockset

You will often need to flatten a JSON object so you can query it. In this post, we’ll show how to do so using the UNNEST function in Rockset.

Nadine Farah

April 15, 2021

Real-Time Analytics

Product

PostgreSQL

MySQL

Powering Real-Time Analytics at Scale on MySQL and PostgreSQL

Enable sub-second, high-concurrency analytics for MySQL and PostgreSQL using Rockset for real-time external indexing.

Justin Liu

April 12, 2021

Case Study

DynamoDB

Real-Time Analytics

Elasticsearch

Data Applications

Case Study: Bringing Real-Time Analytics to Construction Logistics at Command Alkon

Command Alkon offers a SaaS application to digitize construction logistics, allowing suppliers, transportation providers and contractors on jobsites to analyze and collaborate on data in real time.

Kevin Leong

April 5, 2021

Case Study

Elasticsearch

SQL

Sequoia Capital: Why We Moved from Elasticsearch to Rockset

We spoke with Sequoia’s head of engineering, Jake Quist, and VP of data science, Hem Wadhar, about their reasons for moving their internal analytics off Elasticsearch to Rockset.

Kevin Leong

March 31, 2021

Case Study

Data Applications

Snowflake

Real-Time Analytics

Case Study: Ritual’s Move to Real-Time Analytics to Personalize the Multivitamin Experience

Ritual, a health-meets-technology company, personalized the cart checkout experience, email promotions and banners using Rockset. Learn how Ritual effectively monetized new product lines with real-time analytics.

Julie Mills

March 23, 2021

Engineering

On the Pursuit of Happiness (aka Squashing 502/504 Errors)

We recount our experience hunting down, diagnosing and fixing 502 and 504 errors to improve product quality and user experience.

Hieu Pham

March 15, 2021

Elasticsearch

Elasticsearch or Rockset for Real-Time Analytics: Real-Time Ingestion and Indexing

In part 3 of our Elasticsearch and Rockset comparison, we examine how well Elasticsearch and Rockset ingest and index real-time data.

Shawn Adams

March 12, 2021

Engineering

Big Ideas

5 Tips for Recruiting Top Engineering Talent in Startups

Rockset CEO Venkat Venkataramani and engineering leaders Nimrod Hoofien (Gusto) and Adam Wolff (Robinhood) share best practices for recruiting great engineers.

Julie Mills

March 5, 2021

Big Ideas

Snowflake

Space-Time Tradeoff: Examining Snowflake's Compute Cost

In this post, we explore how developers should think about space, time, storage and compute cost as it relates to modern data analytics offerings like Snowflake and Rockset.

Shruti Bhat

February 25, 2021

Elasticsearch

Elasticsearch or Rockset for Real-Time Analytics: How Much Query Flexibility Do You Have?

In part 2 of our Elasticsearch and Rockset comparison, we take a look at query flexibility and its impact on developer productivity.

Shawn Adams

February 18, 2021

Druid

Engineering

Real-Time Analytics

Rockset Is Up to 9.4x Faster than Apache Druid on the Star Schema Benchmark

We evaluated Rockset on the Star Schema Benchmark and found up to 9.4x query runtime speedup compared to Druid. We discuss our benchmarking exercise, results and analysis in this blog post.

Kevin Leong

February 9, 2021

Real-Time Analytics

Indexing Amazon S3 for Real-Time Analytics on Data Lakes

We explore how indexing Amazon S3 data can enable low-latency, high-concurrency queries for real-time analytics.

Shawn Adams

January 19, 2021

Elasticsearch

Elasticsearch or Rockset for Real-Time Analytics: Managing Clusters vs Going Serverless

In part 1 of our Elasticsearch and Rockset comparison, we explore the operational costs associated with both real-time analytics solutions.

Shawn Adams

December 22, 2020

Elasticsearch

How to Join Data in Elasticsearch vs Rockset

In this blog post, we'll look at what it takes to join data sets in Elasticsearch and in Rockset, using the same online marketplace example.

Lewis Gavin

December 17, 2020

Data Applications

Build Internal Apps in Minutes with Retool and Rockset: A Customer 360 Example

Learn how to integrate Rockset with Retool on a customer 360 sample app, using data APIs and pre-built UI components.

Ben Rogojan

December 10, 2020

Engineering

Company

What I've Learned in 2020: A Technical Version

Hieu shares thoughts on columnar databases, RocksDB, SQL engines and his year as an engineer at Rockset.

Hieu Pham

November 24, 2020

Engineering

RocksDB

Real-Time Analytics

How Rockset’s Converged Index Powers Real-Time Analytics

Rockset enables millisecond-latency queries on terabytes of data because all data ingested is indexed multiple ways in its Converged Index. Learn how the Converged Index works in this blog post.

Shawn Adams

November 19, 2020

Engineering

SQL

Smart Schema: Enabling SQL Queries on Semi-Structured Data

We explain and show how users can perform schemaless ingestion of their data and then use Rockset's Smart Schema to enable SQL queries directly on that data.

Shawn Adams

November 12, 2020

MongoDB

Elasticsearch

Real-Time Analytics

Using Elasticsearch to Offload Real-Time Analytics from MongoDB

This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database using Elasticsearch for indexing.

Shawn Adams

October 27, 2020

Company

Real-Time Analytics

Rockset Raises $40M Series B to Empower Developers Building Real-Time Analytics

Rockset is the real-time cloud database built for modern data apps, bringing speed, scale and simplicity to developers building real-time analytics.

Venkat Venkataramani

October 27, 2020

Engineering

Company

Why I Am Joining Rockset

Nathan Bronson is joining Rockset to make real-time data infrastructure simple for users at scale.

Nathan Bronson

October 26, 2020

Case Study

PostgreSQL

Data Applications

Real-Time Analytics

Case Study: Rumble’s Real-Time Leaderboards Empower Users to Lead Healthier Lifestyles

Learn how Rockset powers Rumble's real-time leaderboards, which serve to motivate its users to keep active.

Nadine Farah

October 8, 2020

MongoDB

3 Tools to Help Debug Slow Queries in MongoDB

How can you investigate query performance issues in MongoDB? We give an overview of 3 tools available for troubleshooting slow queries in MongoDB Atlas.

Ben Rogojan

October 1, 2020

Kafka

MongoDB

Data Applications

Real-Time Analytics

Building a Real-Time Customer 360 on Kafka, MongoDB and Rockset

A step-by-step guide to building a real-time customer 360 using seconds-old purchase data from MongoDB and marketing data from Kafka.

Lewis Gavin

September 25, 2020

MongoDB

3 Ways to Offload Read-Heavy Applications from MongoDB

Offloading read-heavy analytics from an operational database, like MongoDB, is a common architectural pattern. This post examines 3 options for offloading MongoDB to a secondary system.

Ben Rogojan

September 15, 2020

Real-Time Analytics

Engineering

Rockset: 1 Billion Events in a Day with 1-Second Data Latency

This post introduces RockBench, a benchmark for measuring the data latency of real-time databases.

Dhruba Borthakur

September 3, 2020

MongoDB

PostgreSQL

Offload Real-Time Reporting and Analytics from MongoDB Using PostgreSQL

This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database to PostgreSQL.

Shawn Adams

August 27, 2020

DynamoDB

Case Study

Case Study: Matter Uses Rockset to Bring AI-Powered Sustainable Insights to Investors

With Rockset, Danish fintech Matter has the flexibility to run analytical queries on semi-structured data in S3 and DynamoDB as part of their NLP architecture.

Alexander Harrington

August 25, 2020

MongoDB

Handling Slow Queries in MongoDB - Part 2: Solutions

We discuss the advantages and disadvantages of various strategies for improving the performance of our MongoDB database.

Justin Liu

August 20, 2020

Developer

Product

Announcing the New Rockset Developer Tools

We released Rockset Developer Tools, including a new CLI tool and a new VS Code extension, to make it easier to develop real-time data applications on Rockset.

Tanmay Chordia

August 18, 2020

Real-Time Analytics

Changing face of real-time analytics

We explore the continuum of real-time analytics, from live, interactive dashboards to online applications that automatically take action on real-time data.

Shruti Bhat

August 13, 2020

Big Ideas

The future is serverless: what about your data stack?

Serverless architectures offer ease of use and cost advantages. We explore what serverless means for your data stack.

Shruti Bhat

August 11, 2020

Real-Time Analytics

Analytics-on-the-fly: from batch to real-time user engagement

Companies need to embrace real-time analytics to compete and survive. Only those that have invested in a real-time data stack will thrive.

Dhruba Borthakur

August 10, 2020

Real-Time Analytics

Rapid Experimentation and Growth Using Real-Time Analytics

Learn how to build for the requirements of a massive-scale A/B experiments platform.

Venkat Venkataramani

August 10, 2020

Case Study

DynamoDB

Real-Time Analytics

Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data

eGoGames improves user experience, detects fraud, and makes business decisions using Rockset for real-time analytics on gaming data in Amazon DynamoDB and S3.

Kevin Leong

August 7, 2020

MongoDB

Handling Slow Queries in MongoDB - Part 1: Investigation

Explore various methods of identifying slow queries on MongoDB and understand how to improve them.

Justin Liu

July 29, 2020

MongoDB

Performance Isolation for Your Primary MongoDB Cluster

Performance of your primary MongoDB cluster is crucial. We look at how using multiple MongoDB clusters can help with performance isolation.

Dai Shi

July 23, 2020

MongoDB

Improving MongoDB Read Performance - Indexing, Replication and Sharding

Real-time analytics demands low-latency complex queries. Learn how to speed up read performance by indexing, replication and sharding in MongoDB.

Shawn Adams

July 21, 2020

Big Ideas

Real-Time Analytics

Building Real-Time Data Architectures to Foster Innovation

Lessons on building real-time data architectures based on experiences growing Facebook users 30x, from 50 million to 1.5 billion.

Venkat Venkataramani

July 16, 2020

MongoDB

Engineering

Indexing on MongoDB Using Rockset - How It Works

An in-depth look at indexing MongoDB data in Rockset and how it compares to indexing in MongoDB itself.

Ben Hannel

July 14, 2020

Case Study

MongoDB

Case Study: StoryFire - Scaling a Social Video Platform on MongoDB and Rockset

Learn how StoryFire uses Rockset to index data from their transactional MongoDB database to achieve performance and scale.

Ben Hagan

July 8, 2020

DynamoDB

Kafka

Data Applications

Real-Time Analytics

Designing a Real-Time ETA Prediction System Using Kafka, DynamoDB and Rockset

Generate ETA predictions for a delivery service using real-time location and order data from Kafka and DynamoDB.

Kartik Khare

June 23, 2020

MongoDB

Data Applications

Real-Time Analytics

Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset

Implementing a real-time recommendations API for an event ticketing system by indexing MongoDB data in Rockset for fast SQL.

Lewis Gavin

June 16, 2020

MongoDB

Product

JOINs and Aggregations Using Real-Time Indexing on MongoDB Atlas

We explore how real-time indexing on MongoDB enables fast aggregation and join queries, and how Rockset is specifically designed to meet real-time indexing requirements.

Kevin Leong

June 9, 2020

MongoDB

MongoDB Performance Tuning - Top 5 Resources

A compilation of MongoDB performance tuning resources, covering topics such as sharding, indexing, schema design and performance isolation.

Kevin Leong

June 4, 2020

RocksDB

Engineering

Remote Compactions in RocksDB-Cloud

We modified RocksDB-Cloud to allow remote compactions in order to optimize RocksDB for cloud environments.

Hieu Pham

June 2, 2020

MongoDB

Top 10 sessions for MongoDB.live 2020

Sessions to look forward to at MongoDB.live 2020.

Nadine Farah

May 19, 2020

MongoDB

Create APIs for Aggregations and Joins on MongoDB in Under 15 Minutes

Build a Python application to create and execute APIs on aggregations and joins using Rockset and MongoDB.

Nadine Farah

May 6, 2020

MongoDB

Engineering

Elasticsearch

Using MongoDB Change Streams for Indexing with Elasticsearch vs Rockset

Learn how Rockset indexes data from MongoDB change data capture (CDC) streams and how it compares to indexing in Elasticsearch.

Kshitij Wadhwa

April 28, 2020

Engineering

Index Scan: Using Rockset's Search Index to Speed up Range Scans Over a Specific Field

Rockset uses Converged Indexing to make different types of queries run fast. We look at how Rockset's Index Scan uses the search index to accelerate range scans.

Karen Li

April 20, 2020

Elasticsearch

SQL

Can I Do SQL-Style Joins in Elasticsearch?

We explore how to perform the equivalent of SQL joins when using Elasticsearch. While joins are primarily a SQL concept, they are equally important in NoSQL.

Shawn Adams

April 3, 2020

DynamoDB

Case Study

Dashboards

Fleet Management System – An End-to-End Streaming Data Pipeline

This post outlines a fleet management solution using IoT and data technologies, such as DynamoDB, AWS IoT Core, AWS Lambda, and Rockset.

Abhijeet Upadhyay

March 19, 2020

Kafka

Real-Time Analytics

How to Use KSQL Stream Processing and Real-Time Databases to Analyze Streaming Data in Kafka

We discuss when stream processing, with KSQL and Kafka Streams, and when a real-time database like Rockset are best used for analyzing Kafka data.

Ari Ekmekji

March 12, 2020

Developer

Product

Query Lambdas: Increasing Developer Velocity for Application Development

We’re proud to release Query Lambdas, a new product feature that rethinks the data application development workflow.

Scott Morris

March 5, 2020

Kafka

Best Practices for Analyzing Kafka Event Streams

What are the key considerations when selecting an analytics stack for building data applications on Kafka event streams?

Kevin Leong

February 28, 2020

MongoDB

Product

Real-time external indexing for aggregations and joins on MongoDB collections

This is a tech preview of an integration that will allow you to index your MongoDB data in row, column and inverted indexes, and run millisecond-latency SQL queries in real time.

Shruti Bhat

February 14, 2020

Kafka

Data Applications

Where's My Tesla? Creating a Data API Using Kafka, Rockset and Postman to Find Out

We demonstrate how to expose real-time IoT data in Kafka through the Rockset REST API in this example.

Lewis Gavin

February 7, 2020

Kafka

Dashboards

Real-Time Analytics

Real-Time Analytics on Connected Car IoT Data Streams from Apache Kafka

In this IoT example, we examine how to enable complex analytic queries on real-time Kafka streams from connected car sensors.

Shawn Adams

January 28, 2020

Case Study

Data Applications

Case Study: Standard Cognition Uses Rockset to Deliver Data APIs and Real-Time Metrics for Vision AI

Standard Cognition, an AI-powered computer vision company, uses Rockset to enable their developers to deliver data APIs and product improvements.

Kevin Leong

January 23, 2020

RocksDB

RocksDB Is Eating the Database World

An overview of what makes RocksDB well-suited to power many of the world's high-performance distributed data systems.

Ethan Hamilton

January 17, 2020

Kafka

SQL

Real-Time Analytics

Data Applications

SQL API for Real-Time Kafka Analytics in 3 Steps

Learn how to create a SQL API for real-time Kafka analytics on the Twitter Streaming API, using AWS Lambda and Rockset.

Tanmay Chordia

January 10, 2020

DynamoDB

Joining Data in DynamoDB and S3 for Live, Ad-Hoc Analysis

Using SQL to join DynamoDB and S3 data, operations teams can perform live, ad-hoc analysis across multiple cloud systems.

Ben Rogojan

December 9, 2019

Big Ideas

What Data Engineers Think About - Variety, Volume, Velocity and Real-Time Analytics

Data engineers are often tasked with moving and preparing data to facilitate analytics. This guest post examines several considerations for data engineers designing for real-time analytics.

Lewis Gavin

November 6, 2019

Kafka

Elasticsearch

Druid

Analytics on Kafka Event Streams Using Druid, Elasticsearch and Rockset

We discuss how different data backends - Druid, Elasticsearch and Rockset - can be used alongside Kafka for analytics on event data streams.

Anirudh Ramanathan

October 31, 2019

DynamoDB

5 Use Cases for DynamoDB

This guest post lays out the benefits of using DynamoDB, including 5 real-life examples, along with recommendations for performing analytics on DynamoDB data.

Ben Rogojan

October 21, 2019

Engineering

Company

The Role of UX in Making Rockset the Shortest Path from Data to Applications

Learn how our UX team continually improves common user workflows in Rockset to simplify development of data-driven applications.

Aditi Dhar

October 10, 2019

Kafka

Dashboards

Real-Time Analytics

Using Tableau with Kafka: How to Build a Real-Time SQL Dashboard on Streaming Data

Build a real-time Tableau dashboard for operational monitoring and analytics on streaming event data from Kafka.

Scott Morris

October 1, 2019

Engineering

Dashboards

How We Analyze and Visualize Kubernetes Events in Real Time at Rockset

Learn how we rolled our own tool for analysis and visualization of Kubernetes events, and try the open-source dashboard for yourself.

Rui Aguiar

September 20, 2019

Engineering

Outside Lands, Airbnb Prices, and Rockset’s Geospatial Queries

How to use Rockset's fast geospatial indexes with Airbnb data.

Ben Hannel

September 13, 2019

Dashboards

Engineering

Grafana Time-Series Dashboards with the Rockset-Grafana Plugin

How Rockset uses Grafana dashboards for monitoring production systems, Kubernetes, and GitHub metrics, and how we built a Rockset-Grafana plugin.

Rui Aguiar

September 6, 2019

Kafka

Real-Time Analytics

Real-Time Analytics in the World of Virtual Reality and Live Streaming

An architecture for real-time decision-making and live dashboards on VR data in Kafka, coming from live-streamed events.

Sebastian Zangaro

August 29, 2019

DynamoDB

Dashboards

Using Tableau with DynamoDB: How to Build a Real-Time SQL Dashboard on NoSQL Data

We create an example dashboard in Tableau on data in DynamoDB, using Rockset as the SQL intelligence layer.

Vahid Fazel-Rezai

August 27, 2019

DynamoDB

3 cost-cutting tips for Amazon DynamoDB

How to avoid costly mistakes with DynamoDB partition keys, read/write capacity modes, and global secondary indexes.

Anirudh Ramanathan

August 23, 2019

DynamoDB

Engineering

How We Reduced DynamoDB Costs by Using DynamoDB Streams and Scans More Efficiently

Get an inside look at some of the techniques we used to reduce the cost of ingesting data from DynamoDB.

Aditi Srinivasan

August 21, 2019

Engineering

Kafka

The Kafka Connect Plugin for Rockset and How It Works

Get an in-depth look at the Kafka Connect Plugin for Rockset and the process to get it listed in Confluent Hub.

Jacob Klegar

August 21, 2019

Engineering

RocksDB

Optimizing Bulk Load in RocksDB

Discover an effective technique for quickly loading data into RocksDB.

Igor Canadi

August 16, 2019

Data Applications

Data-Driven Decisions for Where to Park in SF

We built an app to estimate the risk of a car break-in based on historical incidents.

Vahid Fazel-Rezai

August 13, 2019

Dashboards

DynamoDB

Tableau Operational Dashboards and Reporting on DynamoDB - Evaluating Redshift and Athena

We review several approaches to building Tableau operational dashboards and reporting on DynamoDB data, using SQL engines like Redshift and Athena.

Ari Ekmekji

August 12, 2019

DynamoDB

Real-Time Analytics

Real-Time Analytics on DynamoDB - Using DynamoDB Streams with Lambda and ElastiCache

We cover different approaches to real-time analytics on DynamoDB, using DynamoDB Streams, Lambda, and ElastiCache.

Ari Ekmekji

July 30, 2019

Real-Time Analytics

From Good to Great: How Operational Analytics Gives Businesses a Real-Time Edge

All businesses today are a series of real-time events. But what separates the good from the great is how they capture and operationalize that data.

Shruti Bhat

July 25, 2019

Real-Time Analytics

Operational Analytics: What every software engineer should know about low-latency queries on large data sets

What are the characteristics of an Operational Analytics processing system, and how does it differ from OLTP, OLAP and other data systems?

Dhruba Borthakur

July 18, 2019

Engineering

SQL

SQL Query Planning for Operational Analytics

We discuss how SQL query planning is implemented to support operational analytics requirements, like low latency and high concurrency, in Rockset.

Purvi Desai

July 9, 2019

MySQL

PostgreSQL

SQL

Methods for Running SQL on JSON in PostgreSQL, MySQL and Other Relational Databases

We examine various options for running SQL on JSON in relational databases, like PostgreSQL and MySQL, and in Rockset.

Shawn Adams

June 27, 2019

Engineering

RocksDB

How we use RocksDB at Rockset

This blog describes how we use RocksDB at Rockset and how we tuned it for optimal performance.

Sandeep Dhoot

June 13, 2019

Product

Building a SQL Development Environment for Messy, Semi-Structured Data

Learn how and why Rockset developed a new SQL development environment for messy, semi-structured data.

Scott Morris

June 6, 2019

Engineering

IValue: efficient representation of dynamic types in C++

This post shows one of many challenges that we encountered while building a fully dynamically typed SQL database: how we manipulate values of unknown types in our query execution backend, while approaching the performance of using native types directly.

Tudor Bosman

May 31, 2019

Dashboards

Real-Time Analytics

Using Tableau for Live Dashboards on Event Data

Connect a Tableau live dashboard to a real-time event stream of complex JSON in a few easy steps.

Haneesh Reddy Poddutoori

May 24, 2019

Case Study

DynamoDB

Dashboards

Real-Time Analytics

Case Study: FULL Uses Rockset with DynamoDB for Live Dashboard to Manage Remote Workforce

FULL Creative uses Rockset to build live dashboards and run complex SQL on contact center call data in DynamoDB.

Kevin Leong

May 23, 2019

Engineering

Converged Index™: The Secret Sauce Behind Rockset's Fast Queries

Learn how Rockset delivers low-latency SQL for search and analytics using a combination of row, column, and search indexes.

Igor Canadi

May 17, 2019

Data Applications

Building a Serverless Analytics App to Capture and Query Clickstream Data

We built a web app that collects clickstream data as free-form JSON and runs SQL queries on the live data in a completely serverless fashion. We also seek to answer age-old questions besetting developers: tabs or spaces, vim or emacs?

Vahid Fazel-Rezai

May 17, 2019

Big Ideas

Developer Pulse: 5 Things Developers Love

When the existential question of spaces vs. tabs came up in our team, we ran a real-time survey to collect thousands of data points around it. We also wanted to settle the debate around other developer issues like SQL vs NoSQL.

Shruti Bhat

May 6, 2019

Case Study

DynamoDB

Data Applications

Case Study: Decore Uses Rockset for Search & Analytics on DynamoDB

Decore needed to enable ad hoc queries in their crypto accounting software service, so they turned to Rockset for fast analytics on DynamoDB.

Kevin Leong

April 29, 2019

DynamoDB

Analytics on DynamoDB: Comparing Elasticsearch, Athena and Spark

We compare options for real-time analytics on DynamoDB - Elasticsearch, Athena, and Spark - in terms of ease of setup, maintenance, query capability, and latency.

Anirudh Ramanathan

April 29, 2019

DynamoDB

Secondary Indexes For Analytics On DynamoDB

Learn how to support analytical queries on DynamoDB without prohibitive scan costs - using secondary indexes.

Anirudh Ramanathan

March 27, 2019

Product

SQL

From Schemaless Ingest to Smart Schema: Enabling SQL on Raw Data

Rockset's schemaless SQL platform automatically infers schema at read time, allowing you to analyze messy data using SQL.

Purvi Desai

March 21, 2019

Big Ideas

Company

Product

Serverless Data Management: A SQL Search and Analytics Engine

Designed from the ground up for serverless data management, Rockset makes SQL search and analytics simple and accessible.

Venkat Venkataramani

March 19, 2019

Case Study

Kafka

Dashboards

Case Study: Fynd Uses Kafka and Rockset to Respond to E-Commerce Consumer Behavior in Real Time

Fynd uses Rockset to perform fast queries on real-time Kafka event streams, so they can react to consumer behavior as it happens.

Kevin Leong

March 19, 2019

Case Study

Case Study: The Path to Better Pollution Forecasting Goes Through Nested JSON

Pittsburgh-based developer Doug Balog collects and analyzes nested JSON weather data to improve pollution forecasts in his community.

Kevin Leong

March 19, 2019

Case Study

Case Study: Implementing Real-Time IoT Analytics Simply and Efficiently - An MIT Smart City Project

An MIT team collaborates with a school in Brazil on a smart city project to analyze weather sensor data using Rockset.

Kevin Leong

February 28, 2019

Data Applications

How to Build a Facebook Messenger Chatbot Powered by Fast SQL on CSV

Build a chatbot that provides instant responses, leveraging fast SQL queries on CSV data.

Kshitij Wadhwa

February 21, 2019

SQL

Product

Using Smart Schema to Accelerate Insights from Nested JSON

Use Rockset's Smart Schema to understand complex, nested JSON and enable immediate queries using SQL on raw data.

Purvi Desai

February 21, 2019

Product

SQL

How to Run SQL on PDF Files

Run SQL queries on data from PDF files, and join PDFs with JSON, CSV, XLSX, and other data.

Kshitij Wadhwa

February 13, 2019

Engineering

Company

Distributed Aggregation Queries - A Rockset Intern Story

Rockset distributes aggregation queries to reduce query latency and memory requirements. This was an intern project by Ashwath, Rockset's first-ever intern.

Ashwath Thirumalai

February 6, 2019

Engineering

Aggregator Leaf Tailer: An Alternative to Lambda Architecture for Real-Time Analytics

The Aggregator Leaf Tailer architecture takes advantage of powerful indexing and cloud scalability to enable live analytics on real-time event streams.

Dhruba Borthakur

January 23, 2019

DynamoDB

SQL

Running Fast SQL on DynamoDB Tables

Run fast SQL queries on data from DynamoDB tables by continuously ingesting and indexing DynamoDB data through a Rockset-DynamoDB integration.

Kshitij Wadhwa

January 23, 2019

Dashboards

Product

Live Dashboards with Redash and Rockset

Build live dashboards by connecting Redash to Rockset to create visualizations quickly and easily.

Igor Canadi

January 21, 2019

SQL

Product

Rockset adds Excel spreadsheet support: Use SQL across XLSX files and join with other JSON, CSV or Parquet data

Run complex SQL across multiple Excel spreadsheets and join XLSX files with JSON, Parquet or CSV data.

Shruti Bhat

January 16, 2019

Kafka

Real-Time Analytics

SQL

Real-Time Analytics Using SQL on Streaming Data with Apache Kafka and Rockset

Connect Kafka and Rockset to obtain real-time analytics with ad hoc SQL queries on event streams.

Shawn Adams

January 10, 2019

Product

SQL

How to Do Data Science Using SQL on Raw JSON

Learn how to query nested JSON and CSV using SQL (including joins), without any upfront data preparation or complex data pipelines.

Anirudh Ramanathan

January 8, 2019

Kinesis

Data Applications

Building a Serverless Microservice Using Rockset and AWS Lambda

Build serverless microservices, data APIs, and data-driven applications. Use SQL to join and query JSON and CSV data using AWS Lambda and Rockset.

Kevin Leong

December 20, 2018

Dashboards

Kinesis

Real-Time Analytics

Live Dashboards on Streaming Data - A Tutorial Using Amazon Kinesis and Rockset

Serve a live dashboard using SQL on streaming Twitter data from Amazon Kinesis.

Haneesh Reddy Poddutoori

December 7, 2018

SQL

Product

Running SQL on Nested JSON

Make raw JSON immediately queryable through fast SQL queries, without ETL, data pipelines, or fixed schema.

Anirudh Ramanathan

November 7, 2018

Engineering

RocksDB

Rockset's RocksDB-Cloud Library - Enabling the Next Generation of Cloud Native Databases

David Cohen, System Architect at Intel, explores how RocksDB-Cloud can be used to build an open-source cloud-friendly storage system.

David Cohen

November 1, 2018

Engineering

SQL

Dynamic Typing in SQL

Rockset Chief Architect Tudor Bosman discusses strong dynamic typing in SQL, and how it is implemented in Rockset.

Tudor Bosman