Featured Post

June 21, 2024

Company

OpenAI Acquires Rockset

We are thrilled to join the OpenAI team and bring our technology and expertise to building safe and beneficial AGI.

Venkat Venkataramani

Follow our stories and unique insights.

Latest Posts

May 21, 2024

How To

Use Cases

How to Build a Chatbot Using Retrieval Augmented Generation (RAG)

Discover how to build a chatbot using RAG with Rockset as a vector database and OpenAI's GPT-4 as the LLM.

Ankit Khare

April 22, 2024

How To

Use Cases

How to Build a Recommender System using Rockset and OpenAI Embedding Models

Discover how to build a recommender system using Rockset as a vector database and OpenAI embeddings. This tutorial covers creating a dynamic web app with CSS, HTML, JavaScript, and Flask, integrating Rockset and OpenAI APIs for a robust recommendation system.

Ankit Khare

April 2, 2024

Product

Reducing Costs

How We Optimized Rockset's Hot Storage Tier to Improve Efficiency By More Than 200%

Rockset’s new tiered pricing is as low as $0.13/GB-month, making real-time data more affordable than ever before.

Rafael Kabesa

March 27, 2024

Dashboards

How To

Explo and Rockset One-Click Integration for Real-Time Embedded Analytics

Rockset users can integrate with Explo to provide their customers a quality embedded analytics experience. In this article, we step through how to integrate Rockset with Explo to create charts and dashboards in your applications.

Brian Bakerman

March 18, 2024

Indexing

Streaming

Kafka

How To

Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset

We discuss how RAG fits into the paradigm of real-time data processing and show an example product recommendation application using both Kafka and Flink on Confluent Cloud together with Rockset.

Julie Mills

March 15, 2024

Engineering

Profiling Individual Queries in a Concurrent System

This blog introduces trampoline histories, a technique Rockset has developed to efficiently attach application-level information (query IDs) to the samples of a CPU profile.

Nathan Bronson

February 22, 2024

DynamoDB

Indexing

Understanding DynamoDB Secondary Indexes

Discover the challenges secondary indexes solve in DynamoDB, including the optimal circumstances and methods for their effective application.

Alex DeBrie

February 16, 2024

Kafka

Case Study

Dashboards

Real-Time Analytics

Streaming

How Klarna Scales Buy Now Pay Later with Real-Time Anomaly Detection

With Rockset, Klarna was able to identify and alert teams to issues with partner and merchant integrations in real time, saving the company millions of dollars.

Julie Mills

January 31, 2024

Product

Real-Time Analytics

Reducing Costs

Rockset Ushers in the New Era of Search and AI with a 30% Lower Price

Rockset releases the general purpose instance class, autoscaling, microbatching and incremental materializations to make search and analytics applications more affordable than ever before.

Julie Mills

January 23, 2024

Developer

Elasticsearch

CDC

Streaming

How to Update Documents in Elasticsearch

A walkthrough of the different options available for updates in Elasticsearch, including full updates, partial updates and scripted updates.

Shawn Adams

January 19, 2024

Product

How To

SQL

Mutable Data in Rockset

We explore the concept of data mutability in Rockset and cover examples demonstrating how to manipulate Rockset data using SQL.

Luka Lovosevic

December 21, 2023

Elasticsearch

SQL

Choosing Between Nested Queries and Parent-Child Relationships in Elasticsearch

In this blog, we’ll discuss how you can design your data model in Elasticsearch to handle relationships using the nested field type and parent-child relationships.

Julie Mills

December 19, 2023

RocksDB

Use Cases

How To

A Blueprint for a Real-World Recommendation System

A comprehensive exploration of the general blueprint of modern recommendation systems, this guide focuses on the intricate details of each stage and delves deeply into the infrastructure challenges involved in building these systems.

Ankit Khare

December 14, 2023

Product

Using Query Logs in Rockset

Learn how query logs are implemented in Rockset and how they can provide greater visibility into your queries.

Julius Hochmuth

December 1, 2023

How To

How to Do Load Testing with Rockset

This blog discusses the motivation behind load testing and provides a step-by-step guide to performing load testing on Rockset.

Luka Lovosevic

November 7, 2023

Product

Elasticsearch

Indexing

How Rockset Built Vector Search for Scale in the Cloud

Learn how Rockset built similarity indexes using FAISS-IVF that are memory-efficient and optimized for immediate insertion and recall.

Julie Mills

November 6, 2023

Company

Celebrating Engineering Innovation at Index Conference 2023

A recap of the first edition of Index, the conference for engineers building search, analytics and AI applications at scale.

Kevin Leong

October 31, 2023

Product

Customer-Managed Encryption Keys in Rockset

Learn how you can use customer-managed encryption keys, also called bring your own key, in Rockset.

Esteban Talavera

October 26, 2023

Case Study

Indexing

JetBlue Scales Real-Time AI on Rockset

"Iteration and the speed of new ML products was the most important to us. With Rockset, we found a database that could keep up with the fast pace of innovation at JetBlue," says Sai Ravuru, Senior Manager of Data Science and Analytics at JetBlue.

Julie Mills

October 17, 2023

Product

Creating and Restoring from Snapshots in Rockset

Understand how snapshots work in Rockset, when to use them and how users can create and restore from snapshots in the console.

Yashwanth Nannapaneni

October 13, 2023

Big Ideas

Introduction to Semantic Search: Embeddings, Similarity Metrics and Vector Databases

What does it take to implement semantic search? This article explains vector embeddings, nearest neighbor search and what to look for in a vector database.

M.Joel Dubinko

October 4, 2023

Elasticsearch

Elasticsearch Reindexing: When to Reindex, Best Practices and Alternatives

Reindexing in Elasticsearch is often necessary to handle changing data or improve performance. Understand situations when reindexing is required, guidance for performing a reindex, and alternatives to reindexing.

Lewis Gavin

September 26, 2023

Data Applications

Elasticsearch

Kafka

Streaming

Indexing

Case Study

Real-time AI: Live Recommendations Using Confluent and Rockset

We discuss using Confluent Cloud’s data streaming platform and Rockset’s vector search capabilities to power real-time AI applications.

Kevin Leong

September 19, 2023

Engineering

Performance

4x Faster Search Query Performance with Rockset’s Row Store Cache

The Rockset engineering team implemented a RowStoreCache to improve search performance after seeing an opportunity to speed up the fetching of values from the row store.

Nithin Venkatesh

September 12, 2023

Big Ideas

Introduction to Semantic Search: From Keyword to Vector Search

This article provides a brief history of semantic search, covering the evolution of search from keyword to vectors.

M.Joel Dubinko

September 11, 2023

Elasticsearch

SQL

How To

Can I Do SQL-Style Joins in Elasticsearch?

We explore how to perform the equivalent of SQL joins when using Elasticsearch. While joins are primarily an SQL concept, they are equally important in NoSQL

Shawn Adams

August 29, 2023

Big Ideas

Company

Elasticsearch

Redefining Search and Analytics for the AI Era

Rockset is on a mission to bring the power of search and AI to every digital disruptor in the world. Today, we are thrilled to announce a major milestone in our journey towards redefining search and analytics for the AI era.

Venkat Venkataramani

August 28, 2023

Product

5 Tasks You Can Automate in Rockset Using Scheduled Query Lambdas

Scheduled Query Lambdas are a useful feature in Rockset, allowing users to automate alerts, view creation, exports and more.

Luka Lovosevic

August 28, 2023

Big Ideas

Indexing

6 Hard Problems Scaling Vector Search

You’ve decided to use vector search in your application. Almost immediately upon productionizing vector search, you will run into hard and potentially unanticipated difficulties. This blog attempts to arm you with some knowledge of your future.

Louis Brandy

August 2, 2023

Case Study

Snowflake

Real-Time Analytics

Data Applications

How Windward Built Real-Time Logistics Tracking and AI Insights for the Maritime Industry

Learn how Windward built a real-time data platform that enables rapid innovation in AI for the maritime industry.

Julie Mills

June 12, 2023

Elasticsearch

DynamoDB

Case Study

Snowflake

Use Cases

Performance

Real-Time Clinical Trial Monitoring at Clinical ink

How Clinical ink built a real-time 360-degree view of patients and their outcomes across global clinical trials by migrating from OpenSearch to Rockset for DynamoDB indexing.

Alex Doan

June 8, 2023

Engineering

Kafka

Kinesis

Streaming

Performance

CDC

When Real-Time Matters: Rockset Delivers 70ms Data Latency at 20MB/s Streaming Ingest

We’re often asked how low we’re capable of pushing our end-to-end data latency, i.e. the time it takes to receive data, index it, and make it available for querying. To answer this question, we ran a benchmark to push data latency as low as we could.

John Solitario

June 8, 2023

DynamoDB

Elasticsearch

Indexing

A Guide to DynamoDB Secondary Indexes: GSI, LSI, Elasticsearch and Rockset

Secondary indexing is a common strategy to boost search and analytics performance in DynamoDB. In this guide, we discuss the pros and cons of using DynamoDB GSIs and LSIs along with external secondary indexes such as Elasticsearch and Rockset.

Kevin Leong

June 6, 2023

Engineering

RocksDB

How Rockset Separates Compute and Storage Using RocksDB

We describe how Rockset achieves compute-storage separation without performance degradation.

Esteban Talavera

May 31, 2023

Performance

Engineering

May the Speed Be with You: 20K QPS on Rockset

We ran a 20K QPS workload on Rockset while ingesting data at 10MB/s and maintaining query latency at 200ms in a recent customer engagement. Read more about how Rockset achieved this scale and performance.

Purvi Desai

May 8, 2023

Use Cases

Real-Time Analytics

Indexing

5 Use Cases for Vector Search

In this blog, we capture engineering stories from 5 early adopters of vector search (Pinterest, Spotify, eBay, Airbnb and DoorDash) who have integrated AI into their applications.

Julie Mills

May 3, 2023

Elasticsearch

Real-Time Analytics

Performance

Streaming

Benchmarking Elasticsearch and Rockset: Rockset achieves up to 4X faster streaming data ingestion

We evaluated Elasticsearch and Rockset streaming ingestion performance on throughput and latency. In this blog, we walk through the benchmark framework, configuration and results.

Julie Mills

April 27, 2023

Dashboards

Data Applications

Developer

Engineering

IoT

Kafka

Kinesis

Real-Time Analytics

Snowflake

SQL

Use Cases

How To

Reducing Costs

Streaming

Three Reference Architectures for Real-Time Analytics On Streaming Data

In part three of "Making Sense of Real-Time Analytics On Streaming Data", we provide reference architectures for anomaly detection, IoT, and recommendation systems.

Scott Dwyer

April 18, 2023

Big Ideas

Product

Real-Time Analytics

Introducing Vector Search on Rockset: How to run semantic search with OpenAI and Rockset

We’re excited to introduce vector search on Rockset to power fast and efficient search experiences, personalization engines, fraud detection systems and more.

John Solitario

April 17, 2023

Big Ideas

Developer

Product

Real-Time Analytics

Rockset and Feast Feature Store for Real-Time Machine Learning

To better serve real-time machine learning, Rockset integrates with the Feast Feature Store which acts as a centralized platform for deploying, monitoring and managing production ML features.

Daniel Lin

April 11, 2023

RocksDB

Engineering

Real-Time Analytics

Tech Overview of Compute-Compute Separation- A New Cloud Architecture for Real-Time Analytics

The high-level implementation of compute-compute separation, a new cloud architecture with multiple, isolated clusters for ingest compute and query compute on shared real-time data.

Julie Mills

March 28, 2023

Big Ideas

Data Applications

Developer

Druid

Elasticsearch

Engineering

Kafka

Kinesis

Real-Time Analytics

Streaming

Stream Processing vs. Real-Time Analytics Databases

Learn about conceptual differences between stream processing and RTA databases and develop a framework for choosing the right tool.

Scott Dwyer

March 27, 2023

Data Applications

Engineering

Kafka

PostgreSQL

Real-Time Analytics

CDC

How To

Real-Time CDC With Rockset And Confluent Cloud

Learn how Rockset and Confluent Cloud provide a real-time CDC analytics pipeline that requires zero code and zero infrastructure to manage.

Patrick Druley

March 9, 2023

Developer

Engineering

web3

Use Cases

How To

How To Query The Ethereum Blockchain

Learn how to query Ethereum data using clients, RPC node providers, and SQL queries on public datasets.

Justin Liu

March 1, 2023

Real-Time Analytics

RocksDB

Engineering

Introducing Compute-Compute Separation for Real-Time Analytics

Rockset unveils compute-compute separation that eliminates the challenge of compute contention and makes it possible to build efficient, reliable real-time applications at massive scale.

Venkat Venkataramani

March 1, 2023

Real-Time Analytics

Product

How To

Data Applications

A Breakthrough Architecture for Real-Time Analytics- An Overview of Compute-Compute Separation in Rockset

Rockset introduces a new architecture that enables separate virtual instances to isolate streaming ingestion from queries and one application from another.

Rafael Kabesa

February 25, 2023

Use Cases

Streaming

How To

Real-Time Analytics

Kinesis

Kafka

Making Sense of Real-Time Analytics on Streaming Data: The Landscape

This blog series will help demystify streaming data and provide engineering leaders a guide for incorporating streaming data into their analytics pipelines.

Scott Dwyer

February 9, 2023

DynamoDB

How To

Using DynamoDB Single-Table Design with Rockset

Single-table design is a popular data modeling technique in DynamoDB. We present several options for performing real-time analytics on single-table models using Rockset.

Tyler Denton

February 8, 2023

Real-Time Analytics

Druid

ClickHouse

Top Real-Time Analytics Databases in 2023: Rockset, Apache Druid, ClickHouse and Pinot

Learn how Rockset, Druid, ClickHouse and Pinot compare for real-time analytics in real-world use cases.

Shruti Bhat

January 31, 2023

Case Study

S3

Developing Global Labor Market Intelligence at SkyHive Using Rockset and Databricks

SkyHive builds a platform for labor market intelligence, using Databricks for ML processing and Rockset to serve their user-facing application.

Mohan Reddy

January 26, 2023

How To

How to Use Terraform with Rockset

Learn how Terraform can be used to automate the configuration and deployment of Rockset resources.

Martin Englund

January 11, 2023

DynamoDB

Elasticsearch

Real-Time Analytics

Using Elasticsearch to Offload Search and Analytics from DynamoDB

A walkthrough of how to offload text search, complex filters and aggregations from DynamoDB to Elasticsearch.

Julie Mills

January 9, 2023

ClickHouse

Case Study

Snowflake

Dashboards

MongoDB

Scaling Our SaaS Sales Training Platform with Real-Time Analytics from Rockset

As users and data volumes grew, ConveYour needed to scale their customer-facing dashboards. Learn how their developer team achieved scalability, concurrency and low ops using Rockset.

Stephen Rhyne

January 3, 2023

Big Ideas

Real-Time Analytics

Streaming

Real-Time Data Predictions for 2023

This blog compiles real-time data predictions from industry leaders so you know what’s coming in 2023.

Julie Mills

January 1, 2023

DynamoDB

Use Cases

NoSQL

5 Use Cases for DynamoDB in 2023

This guest post lays out the benefits of using DynamoDB, including 5 real-life examples, along with recommendations for performing analytics on DynamoDB data

Ben Rogojan

December 27, 2022

Elasticsearch

Developer

How to Solve 4 Elasticsearch Performance Challenges at Scale

We walk through solutions to common Elasticsearch performance challenges at scale including slow indexing, search speed, shard and index sizing, and multi-tenancy.

Julie Mills

December 14, 2022

Kafka

Streaming

Using the Amazon MSK Native Connector to Simplify Real-Time Analytics on Kafka

Rockset's native connector allows users to easily ingest and query streaming data from Amazon MSK, Amazon's managed Kafka service.

Avi Shah

December 13, 2022

Developer

An Open-Source Go Module to Secure the Command Line Using the OAuth2 Device Authorization Flow

We show you how we implemented a Go module that secures the CLI using an OAuth2 device authorization flow that supports both Auth0 and Okta SSO providers.

Martin Englund

November 29, 2022

Big Ideas

CDC

Breaking Down Cost Barriers For Real-Time Change Data Capture (CDC)

Learn how to improve the efficiency of real-time CDC with Rockset

Ari Ekmekji

November 21, 2022

Company

AWS re:Invent 2022: Rockset Will Be There…Will You?

See Rockset live at AWS re:Invent in Las Vegas. Join real-time analytics demos at our booth and architecture sessions in our executive suite.

Ashley Andrada

November 15, 2022

Real-Time Analytics

Performance

Product

Rockset Achieves 84% Better Performance on the Star Schema Benchmark with Intel Ice Lake

As a result of ongoing enhancements, we released software that leverages 3rd Gen Intel® Xeon® Scalable processors and delivers 84% faster performance.

Julie Mills

November 2, 2022

Engineering

Product

The New Rockset Query Editor Experience

We're excited to announce the release of a new query editor in the Rockset Console with improved performance and an updated design.

Kristie Lim

November 2, 2022

Elasticsearch

Real-Time Analytics

5 Steps for Migrating from Elasticsearch to Rockset for Real-Time Analytics

Best practices from customers who migrated from Elasticsearch to Rockset in days to weeks by avoiding common migration pitfalls.

Patrick Druley

October 26, 2022

Case Study

web3

DynamoDB

Case Study: How Rockset's Real-Time Analytics Platform Propels the Growth of Our NFT Marketplace

Own the Moment uses Rockset to build the real-time analytics and leaderboards that are core to their NFT and fantasy sports platform.

Scott Mitchell

October 21, 2022

Kafka

How To

S3

Building Real-Time Recommendations with Kafka, S3, Rockset and Retool

Step through a real-time recommendations example using Kafka, S3, Rockset and Retool.

Nadine Farah

October 21, 2022

Big Ideas

Product

Public SQL Endpoints in Rockset

Learn how to share SQL query results and metadata with public endpoints

Scott Dwyer

October 13, 2022

Big Ideas

Snowflake

Data Warehouse

Reducing Costs

How To

7 Practical Ways to Cut Snowflake Compute Cost

Ok, so Snowflake is expensive. But what do I do about it? Here are 7 Practical Ways to Cut Snowflake Compute Cost

Shruti Bhat

October 11, 2022

Elasticsearch

CDC

Streaming

Updates, Inserts, Deletes: Challenges to avoid when indexing mutable data in Elasticsearch

We examine common challenges when indexing mutable data such as CDC streams in Elasticsearch and contrast with Rockset, as well as provide practical techniques for using these systems for real-time search and analytics.

Julie Mills

October 6, 2022

Case Study

DynamoDB

Dashboards

PyTorch Infra's Journey to Rockset

The PyTorch infra team at Meta runs thousands of tests to validate every change as part of their Continuous Integration. Learn how they moved to Rockset to deliver metrics on the health of their CI.

Jane Xu

October 4, 2022

ClickHouse

Streaming

CDC

Comparing ClickHouse vs Rockset for Event and CDC Streams

We compare ClickHouse and Rockset for real-time analytics on event and CDC streams, examining their similarities and differences across architecture, data ingestion, querying and operations.

Kevin Leong

September 20, 2022

Big Ideas

Real-Time Analytics

web3

3 Use Cases for Real-Time Blockchain Analytics

Learn about emerging use cases for real-time blockchain analytics and some key considerations for developers building dApps.

Sid Chhibber

September 13, 2022

DynamoDB

DynamoDB Filtering and Aggregation Queries Using SQL on Rockset

Learn how to build an application that handles high-volume transactions as well as filtering and aggregation using a combination of DynamoDB and Rockset.

Alex DeBrie

September 2, 2022

Data Applications

Use Cases

Real-Time Analytics

Expert Roundtable: How to Build Real-Time Personalization and Recommendation Systems

Hear experts share why real-time personalization offers greater accuracy and efficiency compared to offline alternatives, along with best practices for getting to real time.

Dhruba Borthakur

August 26, 2022

Case Study

IoT

Case Study: iYOTAH Brings Real-Time IoT Analytics to Dairy Farming with Its AgTech SaaS Platform

iYOTAH uses real-time IoT data to moooo-ve dairy farming into a smart future.

Daniel Lu

August 16, 2022

Kinesis

Kafka

Streaming

How To

Kafka vs Kinesis: How to Choose

Which is the best stream processing solution for your needs and environment?

Patrick Druley

August 11, 2022

Big Ideas

Real-Time Analytics

Expert Roundtable: Batch vs Streaming in the Modern Data Stack [Video]

Data engineering experts come together to discuss where batch and streaming analytics fit in the modern data stack.

Shruti Bhat

August 5, 2022

Case Study

Elasticsearch

Kafka

Use Cases

Case Study: How Rockset Turbocharges Real-Time Personalization at Whatnot

Whatnot implemented real-time personalization for their live shopping platform using Rockset, which proved a more efficient alternative to Elasticsearch.

Emmanuel Fuentes

July 29, 2022

Snowflake

Real-Time Analytics

Data Warehouse

Can BigQuery, Snowflake, and Redshift Handle Real-Time Data Analytics?

In this article, we’ll explore the strengths and shortcomings of three prominent data warehouses today for real-time analytics

Daniel Lu

July 28, 2022

MongoDB

Kafka

CDC

Streaming

How To

NoSQL

MongoDB CDC: When to Use Kafka, Debezium, Change Streams and Rockset

Change data capture from MongoDB is a reliable and performant way to move MongoDB data to a complementary system for search and analytics. We review several options for CDC on MongoDB.

Lewis Gavin

July 22, 2022

Big Ideas

DynamoDB

MongoDB

SQL

Expert Talk TLDR: SQL vs NoSQL Databases in the Modern Data Stack

Top takeaways from a recent panel of seasoned data architects and data practitioners steeped in NoSQL databases.

Daniel Lu

July 21, 2022

Dashboards

Case Study

MongoDB

Case Study: Is Your NoSQL Data Hindering Real-Time Analytics? Savvy Solved It with Rockset.

Savvy provides real-time analytics for growth teams using its service to create no-code interactive experiences. Learn how they built this functionality using Rockset on MongoDB data.

Jeremy Evans

July 12, 2022

Developer

Kinesis

SQL

Streaming SQL Joins in Rockset

We compare building collections in Rockset using JOINs at query time and at ingestion time and why you might choose each approach.

Tyler Denton

July 8, 2022

Company

Rockset's Summer Road Trip!

Rockset was talking fast and efficient real-time analytics in New York, Las Vegas and San Francisco in June. You can still catch us July 12 in New York at AWS Summit.

Ashley Andrada

July 6, 2022

Big Ideas

Real-Time Analytics

Why Real-Time Analytics Requires Both the Flexibility of NoSQL and Strict Schemas of SQL Systems

Modern, real-time use cases require databases that strongly enforce schemas and have the flexibility to automatically redefine those schemas based on the data itself.

Dhruba Borthakur

June 21, 2022

Snowflake

Product

Kafka

Kinesis

Real-Time Analytics

Streaming

Data Warehouse

Joining Streaming and Historical Data for Real-Time Analytics: Your Options With Snowflake, Snowpipe and Rockset

The new Snowflake-Rockset connector gives Snowflake users a cost-efficient option for real-time analytics on streaming data from Kafka and historical data in Snowflake.

Vibhuti Bhushan

June 14, 2022

Real-Time Analytics

Company

Engineering

Rockset Architecture Whiteboard Session With CTO Dhruba Borthakur

Learn about Rockset's ALT architecture and how data is ingested, stored and queried.

Dhruba Borthakur

June 7, 2022

MongoDB

DynamoDB

NoSQL

MongoDB vs DynamoDB Head-to-Head: Which Should You Choose?

We compare MongoDB and DynamoDB, their pros and cons, data types, cost, reliability, performance and security.

Shawn Adams

June 3, 2022

Case Study

Elasticsearch

Case Study: Zembula and Rockset Power Real-Time Marketing Email Personalization

Low-ops and cost-effective, Rockset is helping Zembula scale our next 100x growth.

Robert Haydock

May 25, 2022

Developer

Office Hours

Office Hours Recap: Optimize Cost and Query Latency With SQL Transformations and Real-Time Rollups

Recap of a recent Rockset Office Hours.

Nadine Farah

May 17, 2022

Real-Time Analytics

Big Ideas

SQL

SQL and Complex Queries Are Needed for Real-Time Analytics

Modern, cloud-native SQL databases deliver what today's data-driven businesses require.

Dhruba Borthakur

May 12, 2022

Real-Time Analytics

Big Ideas

Handling Bursty Traffic in Real-Time Analytics Applications

We examine the database architecture choices for handling bursty data traffic.

Dhruba Borthakur

May 10, 2022

Real-Time Analytics

DynamoDB

CDC on DynamoDB

We look at how CDC works with DynamoDB and its potential use cases.

Lewis Gavin

May 5, 2022

Engineering

Company

A Real-Time Rockset Intern Experience

The real real on interning at Rockset.

Shreya Shekhar

May 3, 2022

Real-Time Analytics

Engineering

Kafka

How Rockset Handles Data Deduplication

What data duplication is, how it plagues teams adopting real-time analytics, and what Rockset does to resolve duplication issues.

Tyler Denton

April 28, 2022

Company

Reflections of a Rockset UXer

Time flies when you're UXing at Rockset.

Aditi Dhar

April 26, 2022

Kafka

Real-Time Analytics

Streaming Data and Real-Time Analytics With Kafka + Rockset

Real-time analytics for streaming data is alive, growing and affordable for today’s modern real-time data stack.

Vibhuti Bhushan

April 19, 2022

Real-Time Analytics

Big Ideas

The Real-Time Revolution and Digital Economics in the COVID Era

Driven by COVID, economists are finally embracing streaming and real-time data – just like the business world.

Shruti Bhat

April 15, 2022

Real-Time Analytics

Big Ideas

Data Applications

Handling Out-of-Order Data in Real-Time Analytics Applications

Mutability is the most important capability for real-time analytics applications, but close behind is the ability to handle out-of-order data.

Dhruba Borthakur

April 12, 2022

Company

Kafka

DynamoDB

Rockset Goes on the Road!

Rockset will be exhibiting at three events this month in San Francisco and London.

Ashley Andrada

April 5, 2022

Druid

ClickHouse

Performance

Rockset Beats ClickHouse and Druid on the Star Schema Benchmark (SSB)

Rockset is 1.67 times faster than ClickHouse and 1.12 times faster than Druid on the Star Schema Benchmark.

Ben Hannel

March 31, 2022

Case Study

MongoDB

Developer

Case Study: How Rockset Made Me a Day Three Hero at Sounding Board

From Rockset trial to usable and reportable real-time information in just three days.

Jon Farr

March 29, 2022

Case Study

MySQL

Real-Time Analytics

Case Study: How Dimona Built a Real-Time Inventory Management System on Rockset

Dimona needed a better technology solution, one that could handle massive data sets and query them fast.

Igor Blumberg

March 25, 2022

Case Study

MongoDB

DynamoDB

Case Study: Rockset Enables Real-Time Operational Analytics in Hardware Manufacturing for PCH

Rockset delivers ad hoc complex queries within seconds, a huge improvement over the one-hour latency PCH was seeing before.

Daniel Lu

March 24, 2022

Real-Time Analytics

Developer

Elasticsearch

Druid

Empowering Developers With Query Flexibility

Query flexibility enables developers to prototype and build new features quickly, increasing overall productivity.

Nadine Farah

March 22, 2022

Real-Time Analytics

Kafka

Streaming

Streaming Analytics With KSQL vs. A Real-Time Analytics Database

The arguments for and against two approaches to data analytics and their optimal use cases

Lewis Gavin

March 17, 2022

Real-Time Analytics

MongoDB

PostgreSQL

Druid

ClickHouse

How Mutable Databases Make It Easy To Do Real-Time Updates

Three reasons why you need a mutable database for real-time updates

Nadine Farah

March 15, 2022

DynamoDB

Case Study

IoT

Case Study: Complementing DynamoDB with Rockset for Real-Time IoT Analytics at 1NCE

Thanks to Rockset, 1NCE is able to provide customers with fast and valuable insight into their data

Jan Sulaiman

March 10, 2022

Real-Time Analytics

Big Ideas

Why Mutability Is Essential for Real-Time Data Analytics

Mutability enables updates to existing records in a data store and is key to real-time analytics.

Dhruba Borthakur

March 4, 2022

Kinesis

How Rockset Supports Kinesis Shard Autoscaling to Handle Varying Throughputs

On-demand capacity increases efficiency and supports cost savings

Sudhindra Tirupati Nagaraj

March 3, 2022

Real-Time Analytics

SQL

Real-Time Analytics on Oracle and MSSQL With Rockset

Rockset announces early access for Oracle and Microsoft SQL Server integrations

Vibhuti Bhushan

February 24, 2022

Kinesis

Elasticsearch

Druid

Real-Time Analytics

Real-Time Analytics on Kinesis Event Streams Using Rockset, Druid, Elasticsearch and Redshift

An overview of popular options for RTA on Kinesis event streams highlighting ideal use cases and associated tradeoffs.

Scott Dwyer

February 17, 2022

Big Ideas

Engineering

17 New Things Every Modern Data Engineer Should Know in 2022

We asked data industry thought leaders to tell us what we should be paying attention to in coming months. Here is what they told us.

Shruti Bhat

February 14, 2022

Real-Time Analytics

Big Ideas

Top 5 Reasons for Moving From Batch To Real-Time Analytics

Fast analytics on fresh data beats slow analytics on stale data every time.

Venkat Venkataramani

February 10, 2022

MongoDB

How To

NoSQL

How To Join Data in MongoDB

Choosing between $lookup, denormalization and alternatives for joining data in MongoDB.

Shawn Adams

February 2, 2022

MongoDB

Real-Time Analytics

NoSQL

Slow Queries

Five Ways to Run Analytics on MongoDB – Their Pros and Cons

Your choices range from performing analytics directly in MongoDB to moving data to a data store better equipped for real-time analytics.

Shawn Adams

January 28, 2022

DynamoDB

Case Study

Case Study: Real-Time Insights Help Propel 10X Growth at E-Learning Provider Seesaw

Rockset, along with DynamoDB, Hightouch, and Retool, enabled Seesaw to obtain actionable, real-time insights that helped grow their e-learning platform.

Daniel Lu

January 25, 2022

Snowflake

How To

Slow Queries

Data Warehouse

What Do I Do When My Snowflake Query Is Slow? Part 2: Solutions

Part two of a two-part series on improving Snowflake query performance

Shawn Adams

January 20, 2022

Snowflake

How To

Slow Queries

Data Warehouse

What Do I Do When My Snowflake Query Is Slow? Part 1: Diagnosis

Part one of a two-part series on improving Snowflake query performance

Shawn Adams

January 5, 2022

Real-Time Analytics

SQL

Mythbusting: The Venerable SQL Database and Today’s Real-Time Analytics

The SQL database that came of age in the 1980s still has a critical role today in moving data-driven companies from batch to real-time analytics.

Dhruba Borthakur

December 21, 2021

Company

Engineering

Developer

How We Use Rockset's Real-Time Analytics to Debug Distributed Systems

Jonathan, a software engineering intern at Rockset, describes how Rockset uses its own tech to debug its highly distributed ingest system.

Jonathan Kula

December 17, 2021

SQL

Developer

Powering SQL Draw with Rockset, Retool and dbt

SQL Draw is a Slack-based game that uses Rockset, Retool and dbt to create fun drawings with cartesian geometry, creativity and teamwork.

James Weakley

December 10, 2021

Company

Wrap-up of Rockset at AWS re:Invent 2021

November 29 to December 3, 2021 in Las Vegas, NV

Rod Bauer

December 9, 2021

Big Ideas

Real-Time Analytics

Streaming

The Rise of Streaming Data and the Modern Real-Time Data Stack

Now more than 10 years old, the modern data stack is ripe for innovation. The inevitable next stage? Real-time insights delivered straight to users — the modern real-time data stack.

Shruti Bhat

December 1, 2021

Company

Engineering

Why Rockset Is My Next Job After Facebook

Louis Brandy, director of engineering, shares his thoughts on joining Rockset.

Louis Brandy

November 9, 2021

MySQL

PostgreSQL

OLTP

CDC

How To

How to Implement CDC for MySQL and Postgres

We examine different options for implementing change data capture (CDC) from MySQL and Postgres and make recommendations for when to use each.

Lewis Gavin

November 5, 2021

Case Study

PostgreSQL

Case Study: Powering Customer-Facing Dashboards at Scale Using Rockset with PostgreSQL at DataBrain

Learn how Rockset’s PostgreSQL integration helped DataBrain scale smoothly as its production data size and query volume exploded.

Daniel Lu

November 4, 2021

S3

Data Lakes

Getting Started with Apache Spark, S3 and Rockset for Real-Time Analytics

Get fast query performance with Apache Spark + Rockset to power data apps.

Nadine Farah

November 2, 2021

Product

Rockset’s Reverse ETL Integrations Extend the Modern Real-Time Data Stack

Rockset’s new partner integrations with leading reverse ETL platforms Census, Hightouch and Omnata will enable everyday business tools to consume real-time customer insights seamlessly from Rockset.

Daniel Lu

October 26, 2021

Case Study

DynamoDB

Case Study: Fast and Simple — Building Rich Patient Dashboards for Speech Therapists with Rockset

Rockset is used to power interactive visualizations of the rehabilitation data of speech-impaired patients for their speech therapists and other caregivers.

Antonio Domínguez

October 20, 2021

Product

Real-Time Data Transformations with dbt + Rockset

The dbt-Rockset adapter 2.0 supports all four core dbt materializations. Learn about how to transform data in real-time using dbt and Rockset.

Justin Liu

October 15, 2021

Big Ideas

What Is a Cloud Database? IaaS, PaaS, SaaS and DBaaS Explained

Cloud databases are not created equal. We discuss what these different terms mean with respect to cloud databases: IaaS, PaaS, SaaS and DBaaS.

Shawn Adams

September 29, 2021

Product

Rockset Elevates Security Posture with RBAC Custom Roles & Views

New security features enable customers to enforce least-privilege access to all resources within Rockset.

Rafael Kabesa

September 29, 2021

Company

Product

Rockset Is Now SOC 2 Type II Compliant

The Rockset team is proud to announce that we have been accredited as SOC 2 Type II compliant.

Martin Englund

September 21, 2021

Engineering

How To

How We Improved the Concurrency and Scalability of Our Redis Rate Limiting System

We use a rate limiting system, based on Redis, to protect services from overload. Learn how we increased its concurrency and scalability in this blog.

Akshay Nanavati

September 15, 2021

Kafka

Product

Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data

We’re introducing a new fully-managed Kafka Integration with native support for Confluent Cloud and Apache Kafka. Get started with real-time analytics on event streams from Apache Kafka in minutes.

Boyang Chen

September 8, 2021

Developer

Hello World: Join the New Rockset Developer Community

We are unveiling our community, developer mascot, and Real-time Rockstars!

Nadine Farah

September 7, 2021

Kafka

See Rockset’s Rollups for Streaming Data at Kafka Summit 2021

Rockset, a Gold Sponsor of Kafka Summit Americas 2021, to present and demo SQL-based rollups on streaming data.

Giovanni Tropeano

September 3, 2021

Product

Faster Results and a Better Experience with New Pagination in Rockset

Rockset’s new pagination approach enables customers to query large amounts of data faster and more consistently.

Rafael Kabesa

August 31, 2021

Real-Time Analytics

Product

How Rockset Enables SQL-Based Rollups for Streaming Data

Learn how Rockset enables SQL-based rollups on streaming data for complex and accurate real-time analytics.

Venkat Venkataramani

August 25, 2021

Druid

Product

Kafka

Kinesis

Real-Time Analytics

Rollups on Streaming Data: Rockset vs Apache Druid

Continuously roll up and transform streaming data from any source using SQL. Learn how rollups in Rockset compare to Apache Druid.

Vibhuti Bhushan

August 4, 2021

Snowflake

Real-Time Analytics

Streaming

How To

Slow Queries

Real-Time Data Ingestion: Snowflake, Snowpipe and Rockset

We examine the performance and cost of real-time data ingestion in Snowflake and Snowpipe as compared to Rockset.

Shawn Adams

July 29, 2021

DynamoDB

Product

Engineering

20x Faster Ingestion with Rockset's New DynamoDB Connector

Get 20x faster ingestion on DynamoDB tables with Rockset's improved connector, which uses DynamoDB's export to S3 functionality.

Purvi Desai

July 22, 2021

DynamoDB

Real-Time Analytics

Scaling Real-Time Gaming Leaderboards with DynamoDB and Rockset

Learn how DynamoDB and Rockset deliver the ultimate data stack for real-time analytics in gaming.

Julie Mills

July 19, 2021

Druid

How to Handle Nested Data in Apache Druid vs Rockset

Nested data needs to be flattened upon ingestion when using Apache Druid. We look at how to ingest and query nested data in Druid vs alternatives like Rockset.

Shawn Adams

July 15, 2021

SQL

Real-Time Analytics

Product

Real-Time Analytics with dbt + Rockset

The dbt-Rockset adapter makes it easy to perform SQL transformations for real-time analytics. Load data into Rockset and create collections by writing SQL SELECT statements in dbt.

Sam Crowder

July 8, 2021

MongoDB

5 Can't Miss MongoDB.live Talks

As we gear up for MongoDB.live on July 13-14, here are some conference talks we're looking forward to attending.

Kevin Leong

July 7, 2021

SQL

Druid

How To

How to Handle Database Joins in Apache Druid vs Rockset

This article focuses on implementing database joins in Apache Druid, explores workarounds like denormalization and examines alternative solutions like Rockset.

Shawn Adams

July 1, 2021

Developer

Create a Data API on MySQL Data with Rockset

We’ll be uploading, analyzing, and creating a data API on Airbnb data from Amazon RDS MySQL in Rockset.

Nadine Farah

June 29, 2021

Product

Production Visibility: Metrics Monitoring and Alerting

Rockset introduced console metrics and an integration for third-party monitoring tools to provide greater visibility for production workloads.

Brian Liang

June 18, 2021

Engineering

Company

My New Grad Experience at Rockset

Karen joined Rockset two years ago as a fresh CS grad. She shares highlights of her Rockset experience as a software engineer on the backend team.

Karen Li

June 17, 2021

Real-Time Analytics

The Emergence of Real-Time Analytics

Real-time analytics is now within reach of all companies from lean startups to large enterprises.

Julie Mills

June 7, 2021

Big Ideas

DynamoDB

MongoDB

MySQL

PostgreSQL

NoSQL

CDC

OLTP

Change Data Capture: What It Is and How to Use It

Change data capture (CDC) is a useful tool in many data architectures. Learn what CDC is, how it is implemented and when to use it.

Lewis Gavin

June 4, 2021

Engineering

RocksDB

Rockset Converged Index Adds Clustered Search Index for 70% Query Latency Reduction

We share how a new storage format for the search index in Rockset’s Converged Index reduced query latencies by as much as 70% and the size of the search index by about 20%.

Sandeep Dhoot

June 1, 2021

Developer

MySQL

Real-Time Analytics

Getting Started with Real-Time Analytics on MySQL Using Rockset

In this blog, we walk you through how to scale your Amazon RDS MySQL analytical workload with Rockset.

Nadine Farah

May 27, 2021

Elasticsearch

Compare and Contrast Search Indexing With Real-Time Converged Indexing

Elasticsearch and Rockset as indexing data stores for serving low-latency queries.

Giovanni Tropeano

May 24, 2021

Big Ideas

What Is a Serverless Database and Why Use One

Serverless is commonly associated with functions and Lambdas, but engineering teams should also be knowledgeable about serverless databases and the benefits they provide.

Ben Rogojan

May 21, 2021

Real-Time Analytics

Use Cases

Big Ideas

Popular Use Cases for Real-Time Analytics

While real-time analytics is in demand, implementing it is not without its challenges.

Julie Mills

May 17, 2021

Real-Time Analytics

3 Reasons Why Real-Time Analytics Is More Affordable Than You Think

If you are considering real-time analytics, here are some ways to ensure you are taking the most cost-effective approach.

Kevin Leong

May 14, 2021

SQL

Find and Replace Text with SQL Regular Expressions in Rockset

When we tried to unnest a field, we got multiple errors. Check out this blog to see how we used regex to debug the errors and replace the problematic characters!

Nadine Farah

May 13, 2021

Real-Time Analytics

SaaS Industry Trends in Real-Time Analytics

Multiple industries are seeing real-time analytics trends emerge due to customer application usage. Requirements for instant access to data are driving app development teams to invest heavily in embedded real-time analytics.

Giovanni Tropeano

May 11, 2021

Real-Time Analytics

Data Applications

Building Data Applications Powered by Real-Time Analytics

We share 3 key criteria for your real-time analytics platform that will fuel long-term success with data apps.

Shruti Bhat

May 5, 2021

Developer

Working with Mixed Data Types within a Field Using Rockset

When working with mixed field types, you’ll have to adjust your queries to take into consideration data types and values you don’t want to use. Here, we work through an example by ordering movies by release year.

Nadine Farah

April 28, 2021

Company

Engineering

Leading Design as a UX Team of 1

Aditi shares her experience leading design in Rockset’s fast-paced, developer-first environment.

Aditi Dhar

April 27, 2021

Developer

Flattening a JSON Object So It’s Queryable Using Rockset

You will often need to flatten a JSON object so you can query it. In this post, we’ll show how to do so using the UNNEST function in Rockset.

Nadine Farah

April 15, 2021

Real-Time Analytics

Product

PostgreSQL

MySQL

Powering Real-Time Analytics at Scale on MySQL and PostgreSQL

Enable sub-second, high-concurrency analytics for MySQL and PostgreSQL using Rockset for real-time external indexing.

Justin Liu

April 12, 2021

Case Study

DynamoDB

Real-Time Analytics

Elasticsearch

Data Applications

Case Study: Bringing Real-Time Analytics to Construction Logistics at Command Alkon

Command Alkon offers a SaaS application to digitize construction logistics, allowing suppliers, transportation providers and contractors on jobsites to analyze and collaborate on data in real time.

Kevin Leong

April 5, 2021

Case Study

Elasticsearch

SQL

Use Cases

Case Study: Sequoia Capital — Why We Moved from Elasticsearch to Rockset

We spoke with Sequoia’s head of engineering, Jake Quist, and VP of data science, Hem Wadhar, about their reasons for moving their internal analytics off Elasticsearch to Rockset.

Kevin Leong

March 31, 2021

Case Study

Data Applications

Snowflake

Real-Time Analytics

Case Study: Ritual’s Move to Real-Time Analytics to Personalize the Multivitamin Experience

Ritual, a health-meets-technology company, personalized the cart checkout experience, email promotions and banners using Rockset. Learn how Ritual effectively monetized new product lines with real-time analytics.

Julie Mills

March 23, 2021

Engineering

On the Pursuit of Happiness (aka Squashing 502/504 Errors)

We recount our experience hunting down, diagnosing and fixing 502 and 504 errors to improve product quality and user experience.

Hieu Pham

March 15, 2021

Elasticsearch

Elasticsearch or Rockset for Real-Time Analytics: Real-Time Ingestion and Indexing

In part 3 of our Elasticsearch and Rockset comparison, we examine how well Elasticsearch and Rockset ingest and index real-time data.

Shawn Adams

March 12, 2021

Engineering

Big Ideas

5 Tips for Recruiting Top Engineering Talent in Startups

Rockset CEO Venkat Venkataramani and engineering leaders Nimrod Hoofien (Gusto) and Adam Wolff (Robinhood) share best practices for recruiting great engineers.

Julie Mills

March 5, 2021

Big Ideas

Snowflake

Reducing Costs

Data Warehouse

Space-Time Tradeoff: Examining Snowflake's Compute Cost

In this post, we explore how developers should think about space, time, storage and compute cost as it relates to modern data analytics offerings like Snowflake and Rockset.

Shruti Bhat

February 25, 2021

Elasticsearch

Elasticsearch or Rockset for Real-Time Analytics: How Much Query Flexibility Do You Have?

In part 2 of our Elasticsearch and Rockset comparison, we take a look at query flexibility and its impact on developer productivity.

Shawn Adams

February 18, 2021

Druid

Engineering

Real-Time Analytics

Rockset Is Up to 9.4x Faster than Apache Druid on the Star Schema Benchmark

We evaluated Rockset on the Star Schema Benchmark and found up to 9.4x query runtime speedup compared to Druid. We discuss our benchmarking exercise, results and analysis in this blog post.

Kevin Leong

February 9, 2021

Real-Time Analytics

Indexing

Data Lakes

S3

Indexing Amazon S3 for Real-Time Analytics on Data Lakes

We explore how indexing Amazon S3 data can enable low-latency, high-concurrency queries for real-time analytics.

Shawn Adams

January 19, 2021

Elasticsearch

Elasticsearch or Rockset for Real-Time Analytics: Managing Clusters vs Going Serverless

In part 1 of our Elasticsearch and Rockset comparison, we explore the operational costs associated with both real-time analytics solutions.

Shawn Adams

December 22, 2020

Elasticsearch

How to Join Data in Elasticsearch vs Rockset

In this blog post, we'll look at what it takes to join data sets in Elasticsearch and in Rockset, using the same online marketplace example.

Lewis Gavin

December 17, 2020

Data Applications

Build Internal Apps in Minutes with Retool and Rockset: A Customer 360 Example

Learn how to integrate Rockset with Retool on a customer 360 sample app, using data APIs and pre-built UI components.

Ben Rogojan

December 10, 2020

Engineering

Company

What I've Learned in 2020: A Technical Version

Hieu shares thoughts on columnar databases, RocksDB, SQL engines and his year as an engineer at Rockset.

Hieu Pham

November 24, 2020

Engineering

RocksDB

Real-Time Analytics

How Rockset’s Converged Index Powers Real-Time Analytics

Rockset enables millisecond-latency queries on terabytes of data because all data ingested is indexed multiple ways in its Converged Index. Learn how the Converged Index works in this blog post.

Shawn Adams

November 19, 2020

Engineering

SQL

Smart Schema: Enabling SQL Queries on Semi-Structured Data

We explain and show how users can perform schemaless ingestion of their data and then use Rockset's Smart Schema to enable SQL queries directly on that data.

Shawn Adams

November 12, 2020

MongoDB

Elasticsearch

Real-Time Analytics

How To

Reducing Costs

NoSQL

Using Elasticsearch to Offload Real-Time Analytics from MongoDB

This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database using Elasticsearch for indexing.

Shawn Adams

October 27, 2020

Company

Real-Time Analytics

Rockset Raises $40M Series B to Empower Developers Building Real-Time Analytics

Rockset is the real-time cloud database built for modern data apps, bringing speed, scale and simplicity to developers building real-time analytics.

Venkat Venkataramani

October 27, 2020

Engineering

Company

Why I Am Joining Rockset

Nathan Bronson is joining Rockset to make real-time data infrastructure simple for users at scale.

Nathan Bronson

October 26, 2020

Case Study

PostgreSQL

Data Applications

Real-Time Analytics

Case Study: Rumble’s Real-Time Leaderboards Empower Users to Lead Healthier Lifestyles

Learn how Rockset powers Rumble's real-time leaderboards, which serve to motivate its users to keep active.

Nadine Farah

October 8, 2020

MongoDB

How To

Slow Queries

NoSQL

3 Tools to Help Debug Slow Queries in MongoDB

How can you investigate query performance issues in MongoDB? We give an overview of 3 tools available for troubleshooting slow queries in MongoDB Atlas.

Ben Rogojan

October 1, 2020

Kafka

MongoDB

Data Applications

Real-Time Analytics

Building a Real-Time Customer 360 on Kafka, MongoDB and Rockset

A step-by-step guide to building a real-time customer 360 using seconds-old purchase data from MongoDB and marketing data from Kafka.

Lewis Gavin

September 25, 2020

Slow Queries

How To

NoSQL

3 Ways to Offload Read-Heavy Applications from MongoDB

Offloading read-heavy analytics from an operational database, like MongoDB, is a common architectural pattern. This post examines 3 options for offloading MongoDB to a secondary system.

Ben Rogojan

September 15, 2020

Real-Time Analytics

Engineering

Rockset: 1 Billion Events in a Day with 1-Second Data Latency

This post introduces RockBench, a benchmark for measuring the data latency of real-time databases.

Dhruba Borthakur

September 3, 2020

MongoDB

PostgreSQL

Offload Real-Time Reporting and Analytics from MongoDB Using PostgreSQL

This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database to PostgreSQL.

Shawn Adams

August 27, 2020

DynamoDB

Case Study

Case Study: Matter Uses Rockset to Bring AI-Powered Sustainable Insights to Investors

With Rockset, Danish fintech Matter has the flexibility to run analytical queries on semi-structured data in S3 and DynamoDB as part of their NLP architecture.

Alexander Harrington

August 25, 2020

MongoDB

How To

Slow Queries

NoSQL

Handling Slow Queries in MongoDB - Part 2: Solutions

We discuss the advantages and disadvantages of various strategies for improving the performance of our MongoDB database.

Justin Liu

August 20, 2020

Developer

Product

Announcing the New Rockset Developer Tools

We released Rockset Developer Tools, including a new CLI tool and a new VS Code extension, to make it easier to develop real-time data applications on Rockset.

Tanmay Chordia

August 18, 2020

Real-Time Analytics

Changing face of real-time analytics

We explore the continuum of real-time analytics, from live, interactive dashboards to online applications that automatically take action on real-time data.

Shruti Bhat

August 13, 2020

Big Ideas

The Future is Serverless: What About Your Data Stack?

Serverless architectures offer ease of use and cost advantages. We explore what serverless means for your data stack.

Shruti Bhat

August 11, 2020

Real-Time Analytics

Analytics-on-the-fly: from batch to real-time user engagement

Companies need to embrace real-time analytics to compete and survive. Only those that have invested in a real-time data stack will thrive.

Dhruba Borthakur

August 10, 2020

Real-Time Analytics

Rapid Experimentation and Growth Using Real-Time Analytics

Learn how to build for the requirements of a massive-scale A/B experiments platform.

Venkat Venkataramani

August 10, 2020

Case Study

DynamoDB

Real-Time Analytics

Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data

eGoGames improves user experience, detects fraud, and makes business decisions using Rockset for real-time analytics on gaming data in Amazon DynamoDB and S3.

Kevin Leong

August 7, 2020

MongoDB

Slow Queries

How To

NoSQL

Handling Slow Queries in MongoDB - Part 1: Investigation

Explore various methods of identifying slow queries on MongoDB and understand how to improve them.

Justin Liu

July 29, 2020

MongoDB

Performance Isolation for Your Primary MongoDB Cluster

Performance of your primary MongoDB cluster is crucial. We look at how using multiple MongoDB clusters can help with performance isolation.

Dai Shi

July 23, 2020

MongoDB

How To

Slow Queries

NoSQL

Improving MongoDB Read Performance - Indexing, Replication and Sharding

Real-time analytics demands low-latency complex queries. Learn how to speed up read performance by indexing, replication and sharding in MongoDB.

Shawn Adams

July 21, 2020

Big Ideas

Real-Time Analytics

Lessons from Scaling Facebook's Online Data Infrastructure

Lessons on building real-time data architectures based on experiences growing Facebook users 30x, from 50 million to 1.5 billion.

Venkat Venkataramani

July 16, 2020

MongoDB

Engineering

Indexing on MongoDB Using Rockset - How It Works

An in-depth look at indexing MongoDB data in Rockset and how it compares to indexing in MongoDB itself.

Ben Hannel

July 14, 2020

Case Study

MongoDB

Case Study: StoryFire - Scaling a Social Video Platform on MongoDB and Rockset

Learn how StoryFire uses Rockset to index data from their transactional MongoDB database to achieve performance and scale.

Ben Hagan

July 8, 2020

DynamoDB

Kafka

Data Applications

Real-Time Analytics

Designing a Real-Time ETA Prediction System Using Kafka, DynamoDB and Rockset

Generate ETA predictions for a delivery service using real-time location and order data from Kafka and DynamoDB.

Kartik Khare

June 23, 2020

MongoDB

Data Applications

Real-Time Analytics

Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset

Implementing a real-time recommendations API for an event ticketing system by indexing MongoDB data in Rockset for fast SQL.

Lewis Gavin

June 16, 2020

MongoDB

Product

JOINs and Aggregations Using Real-Time Indexing on MongoDB Atlas

We explore how real-time indexing on MongoDB enables fast aggregation and join queries, and how Rockset is specifically designed to meet real-time indexing requirements.

Kevin Leong

June 9, 2020

MongoDB

MongoDB Performance Tuning - Top 5 Resources

A compilation of MongoDB performance tuning resources, covering topics such as sharding, indexing, schema design and performance isolation.

Kevin Leong

June 4, 2020

RocksDB

Engineering

Big Ideas

Remote Compactions in RocksDB-Cloud

We modified RocksDB-Cloud to allow remote compactions in order to optimize RocksDB for cloud environments.

Hieu Pham

June 2, 2020

MongoDB

Top 10 sessions for MongoDB.live 2020

Sessions to look forward to at MongoDB.live 2020.

Nadine Farah

May 19, 2020

MongoDB

Create APIs for Aggregations and Joins on MongoDB in Under 15 Minutes

Build a Python application to create and execute APIs on aggregations and joins using Rockset and MongoDB.

Nadine Farah

May 6, 2020

MongoDB

Engineering

Elasticsearch

Using MongoDB Change Streams for Indexing with Elasticsearch vs Rockset

Learn how Rockset indexes data from MongoDB change data capture (CDC) streams and how it compares to indexing in Elasticsearch.

Kshitij Wadhwa

April 28, 2020

Engineering

Index Scan: Using Rockset's Search Index to Speed up Range Scans Over a Specific Field

Rockset uses Converged Indexing to make different types of queries run fast. We look at how Rockset's Index Scan uses the search index to accelerate range scans.

Karen Li

April 3, 2020

DynamoDB

Case Study

Dashboards

IoT

Case Study: Fleet Management System – An End-to-End Streaming Data Pipeline

This post outlines a fleet management solution using IoT and data technologies, such as DynamoDB, AWS IoT Core, AWS Lambda, and Rockset.

Abhijeet Upadhyay

March 19, 2020

Kafka

Real-Time Analytics

How To

Streaming

How to Use KSQL Stream Processing and Real-Time Databases to Analyze Streaming Data in Kafka

We discuss when stream processing, with KSQL and Kafka Streams, and when a real-time database like Rockset are best used for analyzing Kafka data.

Ari Ekmekji

March 12, 2020

Developer

Product

Query Lambdas: Increasing Developer Velocity for Application Development

We’re now proud to release a new product feature - Query Lambdas - that similarly rethinks the data application development workflow.

Scott Morris

March 5, 2020

Kafka

Best Practices for Analyzing Kafka Event Streams

What are the key considerations when selecting an analytics stack for building data applications on Kafka event streams?

Kevin Leong

February 28, 2020

MongoDB

Product

Real-Time External Indexing For Aggregations and Joins on MongoDB Collections

This is a tech preview of an integration that will allow you to index your MongoDB data in row, column and inverted indexes, and run millisecond-latency SQL queries in real-time.

Shruti Bhat

February 14, 2020

Kafka

Data Applications

IoT

Dashboards

Streaming

Use Cases

Where's My Tesla? Creating a Data API Using Kafka, Rockset and Postman to Find Out

We demonstrate how to expose real-time IoT data in Kafka through the Rockset REST API in this example.

Lewis Gavin

February 7, 2020

Kafka

Dashboards

Real-Time Analytics

IoT

Use Cases

Streaming

Real-Time Analytics on Connected Car IoT Data Streams from Apache Kafka

In this IoT example, we examine how to enable complex analytic queries on real-time Kafka streams from connected car sensors.

Shawn Adams

January 28, 2020

Case Study

Data Applications

Case Study: Standard Cognition Uses Rockset to Deliver Data APIs and Real-Time Metrics for Vision AI

Standard Cognition, an AI-powered computer vision company, uses Rockset to enable their developers to deliver data APIs and product improvements.

Kevin Leong

January 23, 2020

RocksDB

Big Ideas

RocksDB Is Eating the Database World

An overview of what makes RocksDB well-suited to power many of the world's high-performance distributed data systems.

Ethan Hamilton

January 17, 2020

Kafka

SQL

Real-Time Analytics

Data Applications

SQL API for Real-Time Kafka Analytics in 3 Steps

Learn how to create a SQL API for real-time Kafka analytics on the Twitter Streaming API, using AWS Lambda and Rockset.

Tanmay Chordia

January 10, 2020

DynamoDB

Joining Data in DynamoDB and S3 for Live, Ad-Hoc Analysis

Using SQL to join DynamoDB and S3 data, operations teams can perform live, ad-hoc analysis across multiple cloud systems.

Ben Rogojan

December 9, 2019

Big Ideas

What Data Engineers Think About - Variety, Volume, Velocity and Real-Time Analytics

Data engineers are often tasked with moving and preparing data to facilitate analytics. This guest post examines several considerations for data engineers designing for real-time analytics.

Lewis Gavin

November 6, 2019

Kafka

Elasticsearch

Druid

Analytics on Kafka Event Streams Using Druid, Elasticsearch and Rockset

We discuss how different data backends - Druid, Elasticsearch and Rockset - can be used alongside Kafka for analytics on event data streams.

Anirudh Ramanathan

October 21, 2019

Engineering

Company

The Role of UX in Making Rockset the Shortest Path from Data to Applications

Learn how our UX team continually improves common user workflows in Rockset to simplify development of data-driven applications.

Aditi Dhar

October 10, 2019

Kafka

Dashboards

Real-Time Analytics

How To

Streaming

Using Tableau with Kafka: How to Build a Real-Time SQL Dashboard on Streaming Data

Build a real-time Tableau dashboard for operational monitoring and analytics on streaming event data from Kafka.

Scott Morris

October 1, 2019

Engineering

Dashboards

Big Ideas

Use Cases

How We Analyze and Visualize Kubernetes Events in Real Time at Rockset

Learn how we rolled our own tool for analysis and visualization of Kubernetes events, and try the open-source dashboard for yourself.

Rui Aguiar

September 20, 2019

Engineering

Outside Lands, Airbnb Prices, and Rockset’s Geospatial Queries

How to use Rockset's fast geospatial indexes with Airbnb data.

Ben Hannel

September 13, 2019

Dashboards

Engineering

Grafana Time-Series Dashboards with the Rockset-Grafana Plugin

How Rockset uses Grafana dashboards for monitoring production systems, Kubernetes, and GitHub metrics, and how we built a Rockset-Grafana plugin.

Rui Aguiar

September 6, 2019

Kafka

Real-Time Analytics

Real-Time Analytics in the World of Virtual Reality and Live Streaming

An architecture for real-time decision-making and live dashboards on VR data in Kafka, coming from live-streamed events.

Sebastian Zangaro

August 29, 2019

DynamoDB

Dashboards

Using Tableau with DynamoDB: How to Build a Real-Time SQL Dashboard on NoSQL Data

We create an example dashboard in Tableau on data in DynamoDB, using Rockset as the SQL intelligence layer.

Vahid Fazel-Rezai

August 27, 2019

DynamoDB

Reducing Costs

NoSQL

3 cost-cutting tips for Amazon DynamoDB

How to avoid costly mistakes with DynamoDB partition keys, read/write capacity modes, and global secondary indexes.

Anirudh Ramanathan

August 23, 2019

DynamoDB

Engineering

How We Reduced DynamoDB Costs by Using DynamoDB Streams and Scans More Efficiently

Get an inside look at some of the techniques we used to reduce the cost of ingesting data from DynamoDB.

Aditi Srinivasan

August 21, 2019

Engineering

RocksDB

Big Ideas

Optimizing Bulk Load in RocksDB

Discover an effective technique for quickly loading data into RocksDB.

Igor Canadi

August 21, 2019

Engineering

Kafka

The Kafka Connect Plugin for Rockset and How It Works

Get an in-depth look at the Kafka Connect Plugin for Rockset and the process to get it listed in Confluent Hub.

Jacob Klegar

August 16, 2019

Data Applications

Data-Driven Decisions for Where to Park in SF

We built an app to estimate the risk of a car break-in based on historical incidents.

Vahid Fazel-Rezai

August 13, 2019

Dashboards

DynamoDB

Tableau Operational Dashboards and Reporting on DynamoDB - Evaluating Redshift and Athena

We review several approaches to building Tableau operational dashboards and reporting on DynamoDB data, using SQL engines like Redshift and Athena.

Ari Ekmekji

August 12, 2019

DynamoDB

Real-Time Analytics

Dashboards

How To

Use Cases

NoSQL

Real-Time Analytics on DynamoDB - Using DynamoDB Streams with Lambda and ElastiCache

We cover different approaches to real-time analytics on DynamoDB, using DynamoDB Streams, Lambda, and ElastiCache.

Ari Ekmekji

July 30, 2019

Real-Time Analytics

From Good to Great: How Operational Analytics Gives Businesses a Real-Time Edge

All businesses today are a series of real-time events. But what separates the good from the great is how they capture and operationalize that data.

Shruti Bhat

July 25, 2019

Real-Time Analytics

Use Cases

Big Ideas

Operational Analytics: What every software engineer should know about low-latency queries on large data sets

What are the characteristics of an Operational Analytics processing system, and how does it differ from OLTP, OLAP and other data systems?

Dhruba Borthakur

July 18, 2019

Engineering

SQL

SQL Query Planning for Operational Analytics

We discuss how SQL query planning is implemented to support operational analytics requirements, like low latency and high concurrency, in Rockset.

Purvi Desai

July 9, 2019

MySQL

PostgreSQL

SQL

Methods for Running SQL on JSON in PostgreSQL, MySQL and Other Relational Databases

We examine various options for running SQL on JSON in relational databases, like PostgreSQL and MySQL, and in Rockset.

Shawn Adams

June 27, 2019

Engineering

RocksDB

Big Ideas

How We Use RocksDB at Rockset

This blog post describes how we use RocksDB at Rockset and how we tuned it for optimal performance.

Sandeep Dhoot

June 13, 2019

Product

Building a SQL Development Environment for Messy, Semi-Structured Data

Learn how and why Rockset developed a new SQL development environment for messy, semi-structured data.

Scott Morris

June 6, 2019

Engineering

IValue: efficient representation of dynamic types in C++

This post shows one of many challenges that we encountered while building a fully dynamically typed SQL database: how we manipulate values of unknown types in our query execution backend, while approaching the performance of using native types directly.

Tudor Bosman

May 31, 2019

Dashboards

Real-Time Analytics

Using Tableau for Live Dashboards on Event Data

Connect a Tableau live dashboard to a real-time event stream of complex JSON in a few easy steps.

Haneesh Reddy Poddutoori

May 24, 2019

Case Study

DynamoDB

Dashboards

Real-Time Analytics

Case Study: FULL Uses Rockset with DynamoDB for Live Dashboard to Manage Remote Workforce

FULL Creative uses Rockset to build live dashboards and run complex SQL on contact center call data in DynamoDB.

Kevin Leong

May 23, 2019

Engineering

Product

Indexing

Big Ideas

Real-Time Analytics

Converged Index™: The Secret Sauce Behind Rockset's Fast Queries

Learn how Rockset delivers low-latency SQL for search and analytics using compute-efficient indexing.

Igor Canadi

May 17, 2019

Data Applications

Building a Serverless Analytics App to Capture and Query Clickstream Data

We built a web app that collects clickstream data as free-form JSON and runs SQL queries on the live data in a completely serverless fashion. We also seek to answer age-old questions besetting developers: tabs or spaces, vim or emacs?

Vahid Fazel-Rezai

May 17, 2019

Big Ideas

Developer Pulse: 5 Things Developers Love

When the existential question of spaces vs. tabs came up in our team, we ran a real-time survey to collect thousands of data points around it. We also wanted to settle the debate around other developer issues like SQL vs NoSQL.

Shruti Bhat

May 6, 2019

Case Study

DynamoDB

Data Applications

Case Study: Decore Uses Rockset for Search & Analytics on DynamoDB

Decore needed to enable ad hoc queries in their crypto accounting software service, so they turned to Rockset for fast analytics on DynamoDB.

Kevin Leong

April 29, 2019

DynamoDB

NoSQL

Data Warehouse

SQL

Elasticsearch

Analytics on DynamoDB: Comparing Elasticsearch, Athena and Spark

We compare options for real-time analytics on DynamoDB - Elasticsearch, Athena, and Spark - in terms of ease of setup, maintenance, query capability, and latency.

Anirudh Ramanathan

April 29, 2019

DynamoDB

Secondary Indexes For Analytics On DynamoDB

Learn how to support analytical queries on DynamoDB without prohibitive scan costs - using secondary indexes.

Anirudh Ramanathan

March 27, 2019

Product

SQL

From Schemaless Ingest to Smart Schema: Enabling SQL on Raw Data

Rockset's schemaless SQL platform automatically infers schema at read time, allowing you to analyze messy data using SQL.

Purvi Desai

March 21, 2019

Big Ideas

Company

Product

Serverless Data Management: A SQL Search and Analytics Engine

Designed from the ground up for serverless data management, Rockset makes SQL search and analytics simple and accessible.

Venkat Venkataramani

March 19, 2019

Case Study

IoT

Case Study: Implementing Real-Time IoT Analytics Simply and Efficiently - An MIT Smart City Project

An MIT team collaborates with a school in Brazil on a smart city project to analyze weather sensor data using Rockset.

Kevin Leong

March 19, 2019

Case Study

Kafka

Dashboards

Case Study: Fynd Uses Kafka and Rockset to Respond to E-Commerce Consumer Behavior in Real Time

Fynd uses Rockset to perform fast queries on real-time Kafka event streams, so they can react to consumer behavior as it happens.

Kevin Leong

March 19, 2019

Case Study

Case Study: The Path to Better Pollution Forecasting Goes Through Nested JSON

Pittsburgh-based developer Doug Balog collects and analyzes nested JSON weather data to improve pollution forecasts in his community.

Kevin Leong

February 28, 2019

Data Applications

How to Build a Facebook Messenger Chatbot Powered by Fast SQL on CSV

Build a chatbot that provides instant responses, leveraging fast SQL queries on CSV data.

Kshitij Wadhwa

February 21, 2019

SQL

Product

Using Smart Schema to Accelerate Insights from Nested JSON

Use Rockset's Smart Schema to understand complex, nested JSON and enable immediate queries using SQL on raw data.

Purvi Desai

February 21, 2019

Product

SQL

How to Run SQL on PDF Files

Run SQL queries on data from PDF files, and join PDFs with JSON, CSV, XLSX, and other data.

Kshitij Wadhwa

February 13, 2019

Engineering

Company

Distributed Aggregation Queries - A Rockset Intern Story

Rockset distributes aggregation queries to reduce query latency and memory requirements. This was an intern project by Ashwath, Rockset's first-ever intern.

Ashwath Thirumalai

February 6, 2019

Engineering

Aggregator Leaf Tailer: An Alternative to Lambda Architecture for Real-Time Analytics

The Aggregator Leaf Tailer architecture takes advantage of powerful indexing and cloud scalability to enable live analytics on real-time event streams.

Dhruba Borthakur

January 23, 2019

DynamoDB

SQL

Running Fast SQL on DynamoDB Tables

Run fast SQL queries on data from DynamoDB tables by continuously ingesting and indexing DynamoDB data through a Rockset-DynamoDB integration.

Kshitij Wadhwa

January 23, 2019

Dashboards

Product

Live Dashboards with Redash and Rockset

Build live dashboards by connecting Redash to Rockset to create visualizations quickly and easily.

Igor Canadi

January 21, 2019

SQL

Product

Rockset adds Excel spreadsheet support: Use SQL across XLSX files and join with other JSON, CSV or Parquet data

Run complex SQL across multiple Excel spreadsheets and join XLSX files with JSON, Parquet or CSV data.

Shruti Bhat

January 16, 2019

Kafka

Real-Time Analytics

SQL

Real-Time Analytics Using SQL on Streaming Data with Apache Kafka and Rockset

Connect Kafka and Rockset to obtain real-time analytics with ad hoc SQL queries on event streams.

Shawn Adams

January 10, 2019

Product

SQL

How to Do Data Science Using SQL on Raw JSON

Learn how to query nested JSON and CSV using SQL (including joins), without any upfront data preparation or complex data pipelines.

Anirudh Ramanathan

January 8, 2019

Kinesis

Data Applications

Building a Serverless Microservice Using Rockset and AWS Lambda

Build serverless microservices, data APIs, and data-driven applications. Use SQL to join and query JSON and CSV data using AWS Lambda and Rockset.

Kevin Leong

December 20, 2018

Dashboards

Kinesis

Real-Time Analytics

Live Dashboards on Streaming Data - A Tutorial Using Amazon Kinesis and Rockset

Serve a live dashboard using SQL on streaming Twitter data from Amazon Kinesis.

Haneesh Reddy Poddutoori

December 7, 2018

SQL

Product

Running SQL on Nested JSON

Make raw JSON immediately queryable through fast SQL queries, without ETL, data pipelines, or fixed schema.

Anirudh Ramanathan

November 7, 2018

Engineering

RocksDB

Rockset's RocksDB-Cloud Library - Enabling the Next Generation of Cloud Native Databases

David Cohen, System Architect at Intel, explores how RocksDB-Cloud can be used to build an open-source cloud-friendly storage system.

David Cohen

November 1, 2018

Engineering

SQL

Big Ideas

Dynamic Typing in SQL

Rockset Chief Architect Tudor Bosman discusses strong dynamic typing in SQL, and how it is implemented in Rockset.

Tudor Bosman

November 1, 2018

Big Ideas

SQL

Why SQL on Raw Data?

SQL on unstructured data is hard. But storage and compute in the cloud are making SQL on raw data a reality.

Peter Bailis

October 30, 2018

Big Ideas

Cloud Native: What It Means in the Data World

Rockset CTO and co-founder Dhruba Borthakur discusses what Cloud-Native data processing entails, and how best to build for the cloud today.

Dhruba Borthakur

October 19, 2018

Big Ideas

The Road Ahead: From Open Source to Open Services

Rockset CTO and co-founder Dhruba Borthakur discusses the shift from Open Source to Open Services in data infrastructure, and how Open Services will become the new standard.

Dhruba Borthakur