Featured Post

Outside Lands, Airbnb Prices, and Rockset’s Geospatial Queries

How to use Rockset's fast geospatial indexes
Ben Hannel | September 20th, 2019
Read More
Follow our stories and unique insights.

Latest Posts

Grafana Time-Series Dashboards with the Rockset-Grafana Plugin

How Rockset uses Grafana dashboards for monitoring production systems, Kubernetes, and GitHub metrics, and how we built a Rockset-Grafana plugin.
Rui Aguiar | September 13th, 2019

Real-Time Analytics in the World of Virtual Reality and Live Streaming

An architecture for real-time decision-making and live dashboards on VR data in Kafka, coming from live-streamed events.
Sebastian Zangaro | September 6th, 2019

Using Tableau with DynamoDB: How to Build a Real-Time SQL Dashboard on NoSQL Data

We create an example dashboard in Tableau on data in DynamoDB, using Rockset as the SQL intelligence layer.
Vahid Fazel-Rezai | August 29th, 2019

3 cost-cutting tips for Amazon DynamoDB

How to avoid costly mistakes with DynamoDB partition keys, read/write capacity modes, and global secondary indexes.
Anirudh Ramanathan | August 27th, 2019

Operational Analytics - The Last Mile In Data and Analytics

We explore operational analytics and why providing insights in real time to large numbers of users is crucial for organizations.
Jay Maloney | August 27th, 2019

How We Reduced DynamoDB Costs by Using DynamoDB Streams and Scans More Efficiently

Get an inside look at the some of the techniques we used to reduce the cost of ingesting data from DynamoDB.
Aditi Srinivasan | August 23rd, 2019

The Kafka Connect Plugin for Rockset and How It Works

Get an in-depth look at the Kafka Connect Plugin for Rockset and the process to get it listed in Confluent Hub.
Jacob Klegar | August 21st, 2019

Optimizing Bulk Load in RocksDB

What’s the fastest we can load data into RocksDB?
Igor Canadi | August 21st, 2019

Data-Driven Decisions for Where to Park in SF

We built an app to estimate the risk of a car break-in based on historical incidents.
Vahid Fazel-Rezai | August 16th, 2019

Tableau Operational Dashboards and Reporting on DynamoDB - Evaluating Redshift and Athena

We review several approaches to building Tableau operational dashboards and reporting on DynamoDB data, using SQL engines like Redshift and Athena.
Ari Ekmekji | August 13th, 2019

Custom Live Dashboards on DynamoDB - Using DynamoDB Streams with Lambda and ElastiCache

We cover different approaches to live dashboards on DynamoDB, using DynamoDB Streams, Lambda, and ElastiCache.
Ari Ekmekji | August 12th, 2019

From Good to Great: How Operational Analytics Gives Businesses a Real-Time Edge

All businesses today are a series of real-time events. But what separates the good from the great is how they capture and operationalize that data.
Shruti Bhat | July 30th, 2019

Client-Side SQL Query Parsing with ANTLR

Learn how Rockset does basic ANTLR parsing in the browser to separate out SQL statements in a string.
Rahul Patel | July 26th, 2019

Operational Analytics: What every software engineer should know about low-latency queries on large data sets

What are the characteristics of an Operational Analytics processing system, and how does it differ from OLTP, OLAP and other data systems?
Dhruba Borthakur | July 25th, 2019

SQL Query Planning for Operational Analytics

We discuss how SQL query planning is implemented to support operational analytics requirements, like low latency and high concurrency, in Rockset.
Purvi Desai | July 18th, 2019

Methods for Running SQL on JSON in PostgreSQL, MySQL and Other Relational Databases

We examine various options for running SQL on JSON in relational databases, like PostgreSQL and MySQL, and in Rockset.
Shawn Adams | July 9th, 2019

How we use RocksDB at Rockset

This blog describes how we use RocksDB at Rockset and how we tuned it for optimal performance.
Sandeep Dhoot | June 27th, 2019

Redshift with Rockset: High performance queries for operational analytics

Run high performance queries for operational analytics on data from Redshift tables by continuously ingesting and indexing Redshift data through a Rockset-Redshift integration.
Kshitij Wadhwa | June 20th, 2019

Building a SQL Development Environment for Messy, Semi-Structured Data

Learn how and why Rockset developed a new SQL development environment for messy, semi-structured data.
Scott Morris | June 13th, 2019

IValue: efficient representation of dynamic types in C++

This post shows one of many challenges that we encountered while building a fully dynamically typed SQL database: how we manipulate values of unknown types in our query execution backend, while approaching the performance of using native types directly.
Tudor Bosman | June 6th, 2019

Using Tableau for Live Dashboards on Event Data

Connect a Tableau live dashboard to a real-time event stream of complex JSON in a few easy steps.
Haneesh Reddy Poddutoori | May 31st, 2019

Case Study: FULL Uses Rockset with DynamoDB for Live Dashboard to Manage Remote Workforce

FULL Creative uses Rockset to build live dashboards and run complex SQL on contact center call data in DynamoDB.
Kevin Leong | May 24th, 2019

Converged Indexing: The Secret Sauce Behind Rockset's Fast Queries

Learn how Rockset delivers low-latency SQL for search and analytics using a combination of row, column, and search indexes.
Igor Canadi | May 23rd, 2019

Developer Pulse: 5 Things Developers Love

When the existential question of spaces vs. tabs came up in our team, we ran a real-time survey to collect thousands of data points around it. We also wanted to settle the debate around other developer issues like SQL vs NoSQL.
Shruti Bhat | May 17th, 2019

Building a Serverless Analytics App to Capture and Query Clickstream Data

We built a web app that collects clickstream data as free-form JSON and runs SQL queries on the live data in a completely serverless fashion. We also seek to answer age-old questions besetting developers: tabs or spaces, vim or emacs?
Vahid Fazel-Rezai | May 17th, 2019

Case Study: Decore Uses Rockset for Search & Analytics on DynamoDB

Decore needed to enable ad hoc queries in their crypto accounting software service, so they turned to Rockset for fast analytics on DynamoDB.
Kevin Leong | May 6th, 2019

Analytics on DynamoDB: Comparing Athena, Spark and Elastic

We compare options for real-time analytics on DynamoDB- Athena, Spark and Elastic - in terms of ease of setup, maintenance, query capability, latency. We also evaluate which use cases each of them are best suited for.
Anirudh Ramanathan | April 29th, 2019

Secondary Indexes For Analytics On DynamoDB

In this post I explore how to support analytical queries on DynamoDB without prohibitive scan costs - using secondary indexes. I also evaluate the pros and cons of this approach in contrast to extracting data to Athena, Spark or Elastic for analytics
Anirudh Ramanathan | April 29th, 2019

From Schemaless Ingest to Smart Schema: Enabling SQL on Raw Data

Rockset's schemaless SQL platform automatically infers schema at read time, allowing you to analyze messy data using SQL.
Purvi Desai | March 27th, 2019

Serverless Data Management: A SQL Search and Analytics Engine

Designed from the ground up for serverless data management, Rockset makes SQL search and analytics simple and accessible.
Venkat Venkataramani | March 21st, 2019

Case Study: Fynd Uses Kafka and Rockset to Respond to E-Commerce Consumer Behavior in Real Time

Fynd uses Rockset to perform fast queries on real-time Kafka event streams, so they can react to consumer behavior as it happens.
Kevin Leong | March 19th, 2019

Case Study: The Path to Better Pollution Forecasting Goes Through Nested JSON

Pittsburgh-based developer Doug Balog collects and analyzes nested JSON weather data to improve pollution forecasts in his community.
Kevin Leong | March 19th, 2019

Case Study: Implementing Real-Time IoT Analytics Simply and Efficiently - An MIT Smart City Project

An MIT team collaborates with a school in Brazil on a smart city project to analyze weather sensor data using Rockset.
Kevin Leong | March 19th, 2019

How to Build a Facebook Messenger Chatbot Powered by Fast SQL on CSV

Build a chatbot that provides instant responses, leveraging fast SQL queries on CSV data.
Kshitij Wadhwa | February 28th, 2019

How to Run SQL on PDF Files

Run SQL queries on data from PDF files, and join PDFs with JSON, CSV, XLSX, and other data.
Kshitij Wadhwa | February 21st, 2019

Using Smart Schema to Accelerate Insights from Nested JSON

Use Rockset's Smart Schema to understand complex, nested JSON and enable immediate queries using SQL on raw data.
Purvi Desai | February 21st, 2019

Distributed Aggregation Queries - A Rockset Intern Story

Rockset distributes aggregation queries to reduce query latency and memory requirements. This was an intern project by Ashwath, Rockset's first ever intern.
Ashwath Thirumalai | February 13th, 2019

Love Data, Cloud, and T-shirts?

Become a Rockset user this month, and we'll send you your own Serverless SQL T-shirt
Kevin Leong | February 8th, 2019

Aggregator Leaf Tailer: An Alternative to Lambda Architecture for Real-Time Analytics

The Aggregator Leaf Tailer architecture takes advantage of powerful indexing and cloud scalability to enable live analytics on real-time event streams.
Dhruba Borthakur | February 6th, 2019

Running Fast SQL on DynamoDB Tables

Run fast SQL queries on data from DynamoDB tables by continuously ingesting and indexing DynamoDB data through a Rockset-DynamoDB integration.
Kshitij Wadhwa | January 23rd, 2019

Live Dashboards with Redash and Rockset

Build live dashboards by connecting Redash to Rockset to create visualizations quickly and easily.
Igor Canadi | January 23rd, 2019

Rockset adds Excel spreadsheet support: Use SQL across XLSX files and join with other JSON, CSV or Parquet data

Run complex SQL across multiple Excel spreadsheets and join XLSX files with JSON, Parquet or CSV data.
Shruti Bhat | January 21st, 2019

Real-Time Analytics Using SQL on Streaming Data with Apache Kafka and Rockset

Connect Kafka and Rockset to obtain real-time analytics with ad hoc SQL queries on event streams.
Shawn Adams | January 16th, 2019

How to Do Data Science Using SQL on Raw JSON

How to query nested JSON and CSV using SQL (including joins), without any upfront data preparation or complex data pipelines - for interactive data science using Python notebooks.
Anirudh Ramanathan | January 10th, 2019

Building a Serverless Microservice Using Rockset and AWS Lambda

Build serverless microservices, data APIs, and data-driven applications. Use SQL to join and query JSON and CSV data using AWS Lambda and Rockset.
Kevin Leong | January 8th, 2019

Live Dashboards on Streaming Data - A Tutorial Using Amazon Kinesis and Rockset

Serve a live dashboard using SQL on streaming Twitter data from Amazon Kinesis.
Haneesh Reddy Poddutoori | December 20th, 2018

Running SQL on Nested JSON

Make raw JSON immediately queryable through fast SQL queries, without ETL, data pipelines, or fixed schema.
Anirudh Ramanathan | December 7th, 2018

Rockset's RocksDB-Cloud Library - Enabling the Next Generation of Cloud Native Databases

David Cohen, System Architect at Intel, explores how RocksDB-Cloud can be be used to build an open-source cloud-friendly storage system.
David Cohen | November 7th, 2018

Dynamic Typing in SQL

Rockset Chief Architect Tudor Bosman discusses strong dynamic typing in SQL, and how it is implemented in Rockset.
Tudor Bosman | November 1st, 2018

Why SQL on Raw Data?

SQL on unstructured data is hard. But storage and compute in the cloud are making SQL on raw data a reality.
Peter Bailis | November 1st, 2018

Cloud Native: What It Means in the Data World

Rockset CTO and co-founder Dhruba Borthakur discusses what Cloud-Native data processing entails, and how best to build for the cloud today.
Dhruba Borthakur | October 30th, 2018

The Road Ahead: From Open Source to Open Services

Rockset CTO and co-founder Dhruba Borthakur discusses the shift from Open Source to Open Services in data infrastructure, and how Open Services will become the new standard.
Dhruba Borthakur | October 19th, 2018