Featured Post

June 21, 2024

Company

OpenAI Acquires Rockset

We are thrilled to join the OpenAI team and bring our technology and expertise to building safe and beneficial AGI.

Venkat Venkataramani

Follow our stories and unique insights.

Latest Posts

May 21, 2024

How To

Use Cases

How to Build a Chatbot Using Retrieval Augmented Generation (RAG)

Discover how to build a Chatbot using RAG with Rockset as a vector database and OpenAI's GPT-4 as the LLM.

Ankit Khare

April 22, 2024

How To

Use Cases

How to Build a Recommender System using Rockset and OpenAI Embedding Models

Discover how to build a recommender system using Rockset as a vector database and OpenAI embeddings. This tutorial covers creating a dynamic web app with CSS, HTML, Js, and Flask, integrating Rockset and OpenAI APIs for a robust recommendation system.

Ankit Khare

April 2, 2024

Product

Reducing Costs

How We Optimized Rockset's Hot Storage Tier to Improve Efficiency By More Than 200%

Rockset’s new tiered pricing is as low as $0.13/GB-month, making real-time data more affordable than ever before.

Rafael Kabesa

March 27, 2024

Dashboards

How To

Explo and Rockset One-Click Integration for Real-Time Embedded Analytics

Rockset users can integrate with Explo to provide their customers a quality embedded analytics experience. In this article, we step through how to integrate Rockset with Explo to create charts and dashboards in your applications.

Brian Bakerman

March 18, 2024

Indexing

Streaming

Kafka

How To

Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset

We discuss how RAG fits into the paradigm of real-time data processing and show an example product recommendation application using both Kafka and Flink on Confluent Cloud together with Rockset.

Julie Mills

March 15, 2024

Engineering

Profiling Individual Queries in a Concurrent System

This blog introduces trampoline histories, a technique Rockset has developed to efficiently attach application-level information (query IDs) to the samples of a CPU profile.

Nathan Bronson

February 22, 2024

DynamoDB

Indexing

Understanding DynamoDB Secondary Indexes

Discover the challenges secondary indexes solve in DynamoDB, including the optimal circumstances and methods for their effective application.

Alex DeBrie

February 16, 2024

Kafka

Case Study

Dashboards

Real-Time Analytics

Streaming

How Klarna Scales Buy Now Pay Later with Real-Time Anomaly Detection

With Rockset, Klarna was able to identify and alert teams to issues with partner and merchant integrations in real time, saving the company millions of dollars.

Julie Mills

January 31, 2024

Product

Real-Time Analytics

Reducing Costs

Rockset Ushers in the New Era of Search and AI with a 30% Lower Price

Rockset releases the general purpose instance class, autoscaling, microbatching and incremental materializations to make search and analytics applications more affordable than ever before.

Julie Mills

January 23, 2024

Developer

Elasticsearch

CDC

Streaming

How to Update Documents in Elasticsearch

A walk through of the the different options available for updates in Elasticsearch, including full updates, partial updates and scripted updates.

Shawn Adams

January 19, 2024

Product

How To

SQL

Mutable Data in Rockset

We explore the concept of data mutability in Rockset and cover examples demonstrating how to manipulate Rockset data using SQL.

Luka Lovosevic

December 21, 2023

Elasticsearch

SQL

Choosing Between Nested Queries and Parent-Child Relationships in Elasticsearch

In this blog, we’ll discuss how you can design your data model in Elasticsearch to handle relationships using the nested field type and parent-child relationships.

Julie Mills

December 19, 2023

RocksDB

Use Cases

How To

A Blueprint for a Real-World Recommendation System

A comprehensive exploration of the general blueprint of modern recommendation systems, this guide focuses on the intricate details of each stage and delves deeply into the infrastructure challenges involved in building these systems.

Ankit Khare

December 14, 2023

Product

Using Query Logs in Rockset

Learn how query logs are implemented in Rockset and how they can greater visibility into your queries.

Julius Hochmuth

December 1, 2023

How To

How to Do Load Testing with Rockset

This blog discusses the motivation behind load testing and provides a step-by-step guide to performing load testing on Rockset.

Luka Lovosevic

November 7, 2023

Product

Elasticsearch

Indexing

How Rockset Built Vector Search for Scale in the Cloud

Learn how Rockset built similarity indexes using FAISS-IVF that are memory-efficient and optimized for immediate insertion and recall.

Julie Mills

November 6, 2023

Company

Celebrating Engineering Innovation at Index Conference 2023

A recap of the first edition of Index, the conference for engineers building search, analytics and AI applications at scale.

Kevin Leong

October 31, 2023

Product

Customer-Managed Encryption Keys in Rockset

Learn how you can use customer-managed encryption keys, also called bring your own key, in Rockset.

Esteban Talavera

October 26, 2023

Case Study

Indexing

JetBlue Scales Real-Time AI on Rockset

"Iteration and the speed of new ML products was the most important to us. With Rockset, we found a database that could keep up with the fast pace of innovation at JetBlue," says Sai Ravuru, Senior Manager of Data Science and Analytics at JetBlue.

Julie Mills

October 17, 2023

Product

Creating and Restoring from Snapshots in Rockset

Understand how snapshots work in Rockset, when to use them and how users can create and restore from snapshots in the console.

Yashwanth Nannapaneni

October 13, 2023

Big Ideas

Introduction to Semantic Search: Embeddings, Similarity Metrics and Vector Databases

What does it take to implement semantic search? This article explains vector embeddings, nearest neighbor search and what to look for in a vector database.

M.Joel Dubinko

October 4, 2023

Elasticsearch

Elasticsearch Reindexing: When to Reindex, Best Practices and Alternatives

Reindexing in Elasticsearch is often necessary to handle changing data or improve performance. Understand situations when reindexing is required, guidance for performing a reindex, and alternatives to reindexing.

Lewis Gavin

September 26, 2023

Data Applications

Elasticsearch

Kafka

Streaming

Indexing

Case Study

Real-time AI: Live Recommendations Using Confluent and Rockset

We discuss using Confluent Cloud’s data streaming platform and Rockset’s vector search capabilities to power real-time AI applications.

Kevin Leong

September 19, 2023

Engineering

Performance

4x Faster Search Query Performance with Rockset’s Row Store Cache

The Rocket engineering team implemented a RowStoreCache to improve search performance after seeing an opportunity to speed up the fetching of values from the row store.

Nithin Venkatesh

September 12, 2023

Big Ideas

Introduction to Semantic Search: From Keyword to Vector Search

This article provides a brief history of semantic search, covering the evolution of search from keyword to vectors.

M.Joel Dubinko

September 11, 2023

Elasticsearch

SQL

How To

Can I Do SQL-Style Joins in Elasticsearch?

We explore how to perform the equivalent of SQL joins when using Elasticsearch. While joins are primarily an SQL concept, they are equally important in NoSQL

Shawn Adams

August 29, 2023

Big Ideas

Company

Elasticsearch

Redefining Search and Analytics for the AI Era

Rockset is on a mission to bring the power of search and AI to every digital disruptor in the world. Today, we are thrilled to announce a major milestone in our journey towards redefining search and analytics for the AI era.

Venkat Venkataramani

August 28, 2023

Product

5 Tasks You Can Automate in Rockset Using Scheduled Query Lambdas

Scheduled Query Lambdas are a useful feature in Rockset, allowing users to automate alerts, view creation, exports and more.

Luka Lovosevic

August 28, 2023

Big Ideas

Indexing

6 Hard Problems Scaling Vector Search

You’ve decided to use vector search in your application. Almost immediately upon productionizing vector search, you will run into hard and potentially unanticipated difficulties. This blog attempts to arm you with some knowledge of your future.

Louis Brandy

August 2, 2023

Case Study

Snowflake

Real-Time Analytics

Data Applications

How Windward Built Real-Time Logistics Tracking and AI Insights for the Maritime Industry

Learn how Windward built a real-time data platform that enables rapid innovation in AI for the maritime industry.

Julie Mills

June 12, 2023

Elasticsearch

DynamoDB

Case Study

Snowflake

Use Cases

Performance

Real-Time Clinical Trial Monitoring at Clinical ink

How Clinical ink built a real-time 360-degree view of patients and their outcomes across global clinical trials by migrating from Opensearch to Rockset for DynamoDB indexing.

Alex Doan

June 8, 2023

Engineering

Kafka

Kinesis

Streaming

Performance

CDC

When Real-Time Matters: Rockset Delivers 70ms Data Latency at 20MB/s Streaming Ingest

We’re often asked how low we’re capable of pushing our end-to-end data latency, i.e. the time it takes to receive data, index it, and make it available for querying. To answer this question, we ran a benchmark to push data latency as low as we could.

John Solitario

June 8, 2023

DynamoDB

Elasticsearch

Indexing

A Guide to DynamoDB Secondary Indexes: GSI, LSI, Elasticsearch and Rockset

Secondary indexing is a common strategy to boost search and analytics performance in DynamoDB. In this guide, we discuss the pros and cons of using DynamoDB GSIs and LSIs along with external secondary indexes such as Elasticsearch and Rockset.

Kevin Leong

June 6, 2023

Engineering

RocksDB

How Rockset Separates Compute and Storage Using RocksDB

We describe how Rockset achieves compute-storage separation without performance degradation.

Esteban Talavera

May 31, 2023

Performance

Engineering

May the Speed Be with You: 20K QPS on Rockset

We ran a 20K QPS workload on Rockset while ingesting data at 10MB/s and maintaining query latency at 200ms in a recent customer engagement. Read more about how Rockset achieved this scale and performance.

Purvi Desai

May 8, 2023

Use Cases

Real-Time Analytics

Indexing

5 Use Cases for Vector Search

In this blog, we capture engineering stories from 5 early adopters of vector search- Pinterest, Spotify, eBay, Airbnb and Doordash- who have integrated AI into their applications.

Julie Mills

May 3, 2023

Elasticsearch

Real-Time Analytics

Performance

Streaming

Benchmarking Elasticsearch and Rockset: Rockset achieves up to 4X faster streaming data ingestion

We evaluated Elasticsearch and Rockset streaming ingestion performance on throughput and latency. In this blog, we walk through the benchmark framework, configuration and results.

Julie Mills

April 27, 2023

Dashboards

Data Applications

Developer

Engineering

IoT

Kafka

Kinesis

Real-Time Analytics

Snowflake

SQL

Use Cases

How To

Reducing Costs

Streaming

Three Reference Architectures for Real-Time Analytics On Streaming Data

In part three of "Making Sense of Real-Time Analytics On Streaming Data", we provide reference architectures for anomaly detection, IoT, and recommendation systems.

Scott Dwyer

April 18, 2023

Big Ideas

Product

Real-Time Analytics

Introducing Vector Search on Rockset: How to run semantic search with OpenAI and Rockset

We’re excited to introduce vector search on Rockset to power fast and efficient search experiences, personalization engines, fraud detection systems and more.

John Solitario

April 17, 2023

Big Ideas

Developer

Product

Real-Time Analytics

Rockset and Feast Feature Store for Real-Time Machine Learning

To better serve real-time machine learning, Rockset integrates with the Feast Feature Store which acts as a centralized platform for deploying, monitoring and managing production ML features.

Daniel Lin

April 11, 2023

RocksDB

Engineering

Real-Time Analytics

Tech Overview of Compute-Compute Separation- A New Cloud Architecture for Real-Time Analytics

The high-level implementation of compute-compute separation, a new cloud architecture with multiple, isolated clusters for ingest compute and query compute on shared real-time data.

Julie Mills

March 28, 2023

Big Ideas

Data Applications

Developer

Druid

Elasticsearch

Engineering

Kafka

Kinesis

Real-Time Analytics

Streaming

Stream Processing vs. Real-Time Analytics Databases

Learn about conceptual differences between stream processing and RTA databases and develop a framework for choosing the right tool. .

Scott Dwyer

March 27, 2023

Data Applications

Engineering

Kafka

PostgreSQL

Real-Time Analytics

CDC

How To

Real-Time CDC With Rockset And Confluent Cloud

Learn how Rockset and Confluent Cloud provide a real-time CDC analytics pipeline that requires zero code and zero infrastructure to manage.

Patrick Druley

March 9, 2023

Developer

Engineering

web3

Use Cases

How To

How To Query The Ethereum Blockchain

Learn how to query Ethereum data using clients, RPC node providers, and using SQL queries on public datasets.

Justin Liu

March 1, 2023

Real-Time Analytics

RocksDB

Engineering

Introducing Compute-Compute Separation for Real-Time Analytics

Rockset unveils compute-compute separation that eliminates the challenge of compute contention and makes it possible to build efficient, reliable real-time applications at massive scale.

Venkat Venkataramani

March 1, 2023

Real-Time Analytics

Product

How To

Data Applications

A Breakthrough Architecture for Real-Time Analytics- An Overview of Compute-Compute Separation in Rockset

Rockset introduces a new architecture that enables separate virtual instances to isolate streaming ingestion from queries and one application from another.

Rafael Kabesa

February 25, 2023

Use Cases

Streaming

How To

Real-Time Analytics

Kinesis

Kafka

Making Sense of Real-Time Analytics on Streaming Data: The Landscape

This blog series will help demystify streaming data and provide engineering leaders a guide for incorporating streaming data into their analytics pipelines.

Scott Dwyer

February 9, 2023

DynamoDB

How To

Using DynamoDB Single-Table Design with Rockset

Single-table design is a popular data modeling technique in DynamoDB. We present several options for performing real-time analytics on single-table models using Rockset.

Tyler Denton

February 8, 2023

Real-Time Analytics

Druid

ClickHouse

Top Real-Time Analytics Databases in 2023: Rockset, Apache Druid, ClickHouse and Pinot

Learn how Rockset, Druid, ClickHouse and Pinot compare for real-time analytics in real-world use cases.

Shruti Bhat

January 31, 2023

Case Study

S3

Developing Global Labor Market Intelligence at SkyHive Using Rockset and Databricks

SkyHive builds a platform for labor market intelligence, using Databricks for ML processing and Rockset to serve their user-facing application.

Mohan Reddy

January 26, 2023

How To

How to Use Terraform with Rockset

Learn how Terraform can be used to automate the configuration and deployment of Rockset resources.

Martin Englund

January 11, 2023

DynamoDB

Elasticsearch

Real-Time Analytics

Using Elasticsearch to Offload Search and Analytics from DynamoDB

A walkthrough of how to offload text search, complex filters and aggregations from DynamoDB to Elasticsearch.

Julie Mills

January 9, 2023

ClickHouse

Case Study

Snowflake

Dashboards

MongoDB

Scaling Our SaaS Sales Training Platform with Real-Time Analytics from Rockset

As users and data volumes grew, ConveYour needed to scale their customer-facing dashboards. Learn how their developer team achieved scalability, concurrency and low ops using Rockset.

Stephen Rhyne

January 3, 2023

Big Ideas

Real-Time Analytics

Streaming

Real-Time Data Predictions for 2023

This blog compiles real-time data predictions from industry leaders so you know what’s coming in 2023.

Julie Mills

January 1, 2023

DynamoDB

Use Cases

NoSQL

5 Use Cases for DynamoDB in 2023

This guest post lays out the benefits of using DynamoDB, including 5 real-life examples, along with recommendations for performing analytics on DynamoDB data

Ben Rogojan

December 27, 2022

Elasticsearch

Developer

How to Solve 4 Elasticsearch Performance Challenges at Scale

We walk through solutions to common Elasticsearch performance challenges at scale including slow indexing, search speed, shard and index sizing, and multi-tenancy.

Julie Mills

December 14, 2022

Kafka

Streaming

Using the Amazon MSK Native Connector to Simplify Real-Time Analytics on Kafka

Rockset's native connector allows users to easily ingest and query streaming data from Amazon MSK, Amazon's managed Kafka service.

Avi Shah

December 13, 2022

Developer

An Open-Source Go Module to Secure the Command Line Using the OAuth2 Device Authorization Flow

We show you how we implemented a Go module that secures the CLI using an OAuth2 device authorization flow that supports both Auth0 and Okta SSO providers.

Martin Englund

November 29, 2022

Big Ideas

CDC

Breaking Down Cost Barriers For Real-Time Change Data Capture (CDC)

Learn how to improve the efficiency of real-time CDC with Rockset

Ari Ekmekji

November 21, 2022

Company

AWS re:Invent 2022: Rockset Will Be There…Will You?

See Rockset live at AWS re:Invent in Las Vegas. Join real-time analytics demos at our booth and architecture sessions in our executive suite.

Ashley Andrada

November 15, 2022

Real-Time Analytics

Performance

Product

Rockset Achieves 84% Better Performance on the Star Schema Benchmark with Intel Ice Lake

As a result of ongoing enhancements, we released software that leverages 3rd Gen Intel® Xeon® Scalable processors and delivers 84% faster performance.

Julie Mills

November 2, 2022

Engineering

Product

The New Rockset Query Editor Experience

We're excited to announce the release of a new query editor in the Rockset Console with improved performance and an updated design.

Kristie Lim

November 2, 2022

Elasticsearch

Real-Time Analytics

5 Steps for Migrating from Elasticsearch to Rockset for Real-Time Analytics

Best practices from customers who migrated from Elasticsearch to Rockset in days to weeks by avoiding common migration pitfalls.

Patrick Druley

October 26, 2022

Case Study

web3

DynamoDB

Case Study: How Rockset's Real-Time Analytics Platform Propels the Growth of Our NFT Marketplace

Own the Moment uses Rockset to build the real-time analytics and leaderboards that are core to their NFT and fantasy sports platform.

Scott Mitchell

October 21, 2022

Kafka

How To

S3

Building Real-Time Recommendations with Kafka, S3, Rockset and Retool

Step through a real-time recommendations example using Kafka, S3, Rockset and Retool.

Nadine Farah

October 21, 2022

Big Ideas

Product

Public SQL Endpoints in Rockset

Learn how to share SQL query results and metadata with public endpoints

Scott Dwyer

October 13, 2022

Big Ideas

Snowflake

Data Warehouse

Reducing Costs

How To

7 Practical Ways to Cut Snowflake Compute Cost

Ok, so Snowflake is expensive. But what do I do about it? Here are 7 Practical Ways to Cut Snowflake Compute Cost

Shruti Bhat

October 11, 2022

Elasticsearch

CDC

Streaming

Updates, Inserts, Deletes: Challenges to avoid when indexing mutable data in Elasticsearch

We examine common challenges when indexing mutable data such as CDC streams in Elasticsearch and contrast with Rockset, as well as provide practical techniques for using these systems for real-time search and analytics.

Julie Mills

October 6, 2022

Case Study

DynamoDB

Dashboards

PyTorch Infra's Journey to Rockset

The PyTorch infra team at Meta runs thousands of tests to validate every change as part of their Continuous Integration. Learn how they moved to Rockset to deliver metrics on the health of their CI.

Jane Xu

October 4, 2022

ClickHouse

Streaming

CDC

Comparing ClickHouse vs Rockset for Event and CDC Streams

We compare ClickHouse and Rockset for real-time analytics on event and CDC streams, examining their similarities and differences across architecture, data ingestion, querying and operations.

Kevin Leong

September 20, 2022

Big Ideas

Real-Time Analytics

web3

3 Use Cases for Real-Time Blockchain Analytics

Learn about emerging use cases for real-time blockchain analytics and some key considerations for developers building dApps.

Sid Chhibber

September 13, 2022

DynamoDB

DynamoDB Filtering and Aggregation Queries Using SQL on Rockset

Learn how to build an application that handles high-volume transactions as well as filtering and aggregation using a combination of DynamoDB and Rockset.

Alex DeBrie

September 2, 2022

Data Applications

Use Cases

Real-Time Analytics

Expert Roundtable: How to Build Real-Time Personalization and Recommendation Systems

Hear experts share why real-time personalization offers greater accuracy and efficiency compared to offline alternatives, along with best practices for getting to real time.

Dhruba Borthakur

August 26, 2022

Case Study

IoT

Case Study: iYOTAH Brings Real-Time IoT Analytics to Dairy Farming with Its AgTech SaaS Platform

iYOTAH uses real-time IoT data to moooo-ve dairy farming into a smart future.

Daniel Lu

August 16, 2022

Kinesis

Kafka

Streaming

How To

Kafka vs Kinesis: How to Choose

Which is the best stream processing solution for your needs and environment?

Patrick Druley

August 11, 2022

Big Ideas

Real-Time Analytics

Expert Roundtable: Batch vs Streaming in the Modern Data Stack [Video]

Data engineering experts come together to discuss where batch and streaming analytics fit in the modern data stack.

Shruti Bhat

August 5, 2022

Case Study

Elasticsearch

Kafka

Use Cases

Case Study: How Rockset Turbocharges Real-Time Personalization at Whatnot

Whatnot implemented real-time personalization for their live shopping platform using Rockset, which proved a more efficient alternative to Elasticsearch.

Emmanuel Fuentes

July 29, 2022

Snowflake

Real-Time Analytics

Data Warehouse

Can BigQuery, Snowflake, and Redshift Handle Real-Time Data Analytics?

In this article, we’ll explore the strengths and shortcomings of three prominent data warehouses today for real-time analytics

Daniel Lu

July 28, 2022

MongoDB

Kafka

CDC

Streaming

How To

NoSQL

MongoDB CDC: When to Use Kafka, Debezium, Change Streams and Rockset

Change data capture from MongoDB is a reliable and performant way to move MongoDB data to a complementary system for search and analytics. We review several options for CDC on MongoDB.

Lewis Gavin

July 22, 2022

Big Ideas

DynamoDB

MongoDB

SQL

Expert Talk TLDR: SQL vs NoSQL Databases in the Modern Data Stack

Top takeaways from a recent panel of seasoned data architects and data practitioners steeped in NoSQL databases.

Daniel Lu

July 21, 2022

Dashboards

Case Study

MongoDB

Case Study: Is Your NoSQL Data Hindering Real-Time Analytics? Savvy Solved It with Rockset.

Savvy provides real-time analytics for growth teams using its service to create no-code interactive experiences. Learn how they built this functionality using Rockset on MongoDB data.

Jeremy Evans

July 12, 2022

Developer

Kinesis

SQL

Streaming SQL Joins in Rockset

We compare building collections in Rockset using JOINs at query time and at ingestion time and why you might choose each approach.

Tyler Denton

July 8, 2022

Company

Rockset's Summer Road Trip!

Rockset was talking fast and efficient real-time analytics in New York, Las Vegas and San Francisco in June. You can still catch us July 12 in New York at AWS Summit.

Ashley Andrada

July 6, 2022

Big Ideas

Real-Time Analytics

Why Real-Time Analytics Requires Both the Flexibility of NoSQL and Strict Schemas of SQL Systems

Modern, real-time use cases require databases that strongly enforce schemas and have the flexibility to automatically redefine those schemas based on the data itself.

Dhruba Borthakur

June 21, 2022

Snowflake

Product

Kafka

Kinesis

Real-Time Analytics

Streaming

Data Warehouse

Joining Streaming and Historical Data for Real-Time Analytics: Your Options With Snowflake, Snowpipe and Rockset

New Snowflake-Rockset connector provides Snowflake users cost-efficient option for real-time analytics on streaming data from Kafka and historical data in Snowflake.

Vibhuti Bhushan

June 14, 2022

Real-Time Analytics

Company

Engineering

Rockset Architecture Whiteboard Session With CTO Dhruba Borthakur

Learn about Rockset's ALT architecture and how data is ingested, stored and queried.

Dhruba Borthakur

June 7, 2022

MongoDB

DynamoDB

NoSQL

MongoDB vs DynamoDB Head-to-Head: Which Should You Choose?

We compare MongoDB and DynamoDB, their pros and cons, data types, cost, reliability, performance and security.

Shawn Adams

June 3, 2022

Case Study

Elasticsearch

Case Study: Zembula and Rockset Power Real-Time Marketing Email Personalization

Low-ops and cost-effective, Rockset is helping Zembula scale our next 100x growth.

Robert Haydock

May 25, 2022

Developer

Office Hours

Office Hours Recap: Optimize Cost and Query Latency With SQL Transformations and Real-Time Rollups

Recap of a recent Rockset Office Hours.

Nadine Farah

May 17, 2022

Real-Time Analytics

Big Ideas

SQL

SQL and Complex Queries Are Needed for Real-Time Analytics

Modern, cloud-native SQL databases deliver what today's data-driven businesses require.

Dhruba Borthakur

May 12, 2022

Real-Time Analytics

Big Ideas

Handling Bursty Traffic in Real-Time Analytics Applications

We examine the database architecture choices for handling bursty data traffic.

Dhruba Borthakur

May 10, 2022

Real-Time Analytics

DynamoDB

CDC on DynamoDB

We look at how CDC works with DynamoDB and its potential use cases.

Lewis Gavin

May 5, 2022

Engineering

Company

A Real-Time Rockset Intern Experience

The real real on interning at Rockset.

Shreya Shekhar

May 3, 2022

Real-Time Analytics

Engineering

Kafka

How Rockset Handles Data Deduplication

What is data duplication, how it plagues teams adopting real-time analytics, and what Rockset does to resolve duplication issues.

Tyler Denton

April 28, 2022

Company

Reflections of a Rockset UXer

Time flies when you're UXing at Rockset.

Aditi Dhar

April 26, 2022

Kafka

Real-Time Analytics

Streaming Data and Real-Time Analytics With Kafka + Rockset

Real-time analytics for streaming data is alive, growing and affordable for today’s modern real-time data stack.

Vibhuti Bhushan

April 19, 2022

Real-Time Analytics

Big Ideas

The Real-Time Revolution and Digital Economics in the COVID Era

Driven by COVID, economists are finally embracing streaming and real-time data – just like the business world.

Shruti Bhat

April 15, 2022

Real-Time Analytics

Big Ideas

Data Applications

Handling Out-of-Order Data in Real-Time Analytics Applications

Mutability is the most important capability for real-time analytics applications, but close behind is the ability to handle out-of-order data.

Dhruba Borthakur

April 12, 2022

Company

Kafka

DynamoDB

Rockset Goes on the Road!

Rockset will be exhibiting at three events this month in San Francisco and London.

Ashley Andrada

April 5, 2022

Druid

ClickHouse

Performance

Rockset Beats ClickHouse and Druid on the Star Schema Benchmark (SSB)

Rockset is 1.67 times faster than ClickHouse and 1.12 times faster than Druid on the Star Schema Benchmark.

Ben Hannel

March 31, 2022

Case Study

MongoDB

Developer

Case Study: How Rockset Made Me a Day Three Hero at Sounding Board

From Rockset trial to usable and reportable real-time information in just three days.

Jon Farr

March 29, 2022

Case Study

MySQL

Real-Time Analytics

Case Study: How Dimona Built a Real-Time Inventory Management System on Rockset

Dimona needed a better technology solution, one that could handle massive data sets and query them fast.

Igor Blumberg

March 25, 2022

Case Study

MongoDB

DynamoDB

Case Study: Rockset Enables Real-Time Operational Analytics in Hardware Manufacturing for PCH

Rockset delivers ad hoc complex queries within seconds, a huge improvement over the one-hour latency PCH was seeing before.

Daniel Lu

March 24, 2022

Real-Time Analytics

Developer

Elasticsearch

Druid

Empowering Developers With Query Flexibility

Query flexibility enables developers to prototype and build new features quickly, increasing overall productivity.

Nadine Farah

March 22, 2022

Real-Time Analytics

Kafka

Streaming

Streaming Analytics With KSQL vs. A Real-Time Analytics Database

The arguments for and against two approaches to data analytics and their optimal use cases

Lewis Gavin

March 17, 2022

Real-Time Analytics

MongoDB

PostgreSQL

Druid

ClickHouse

How Mutable Databases Make It Easy To Do Real-Time Updates

Three reasons why you need a mutable database for real-time updates

Nadine Farah

March 15, 2022

DynamoDB

Case Study

IoT

Case Study: Complementing DynamoDB with Rockset for Real-Time IoT Analytics at 1NCE

Thanks to Rockset, 1NCE is able to provide customers with fast and valuable insight into their data

Jan Sulaiman

March 10, 2022

Real-Time Analytics

Big Ideas

Why Mutability Is Essential for Real-Time Data Analytics

Mutability enables updates to existing records in a data store and is key to real-time analytics.

Dhruba Borthakur

March 4, 2022

Kinesis

How Rockset Supports Kinesis Shard Autoscaling to Handle Varying Throughputs

On-demand capacity increases efficiency and supports cost savings

Sudhindra Tirupati Nagaraj

March 3, 2022

Real-Time Analytics

SQL

Real-Time Analytics on Oracle and MSSQL With Rockset

Rockset announces early access for Oracle and Microsoft SQL Server integrations

Vibhuti Bhushan

February 24, 2022

Kinesis

Elasticsearch

Druid

Real-Time Analytics

Real-Time Analytics on Kinesis Event Streams Using Rockset, Druid, Elasticsearch and Redshift

An overview of popular options for RTA on Kinesis event streams highlighting ideal use cases and associated tradeoffs.

Scott Dwyer

February 17, 2022

Big Ideas

Engineering

17 New Things Every Modern Data Engineer Should Know in 2022

We asked data industry thought leaders to tell us what we should be paying attention to in coming months. Here is what they told us.

Shruti Bhat

February 14, 2022

Real-Time Analytics

Big Ideas

Top 5 Reasons for Moving From Batch To Real-Time Analytics

Fast analytics on fresh data beats slow analytics on stale data every time.

Venkat Venkataramani

February 10, 2022

MongoDB

How To

NoSQL

How To Join Data in MongoDB

Choosing between $lookup, denormalization and alternatives for joining data in MongoDB.

Shawn Adams

February 2, 2022

MongoDB

Real-Time Analytics

NoSQL

Slow Queries

Five Ways to Run Analytics on MongoDB – Their Pros and Cons

Your choices range from performing analytics directly in MongoDB to moving data to a data store better equipped for real-time analytics.

Shawn Adams

January 28, 2022

DynamoDB

Case Study

Case Study: Real-Time Insights Help Propel 10X Growth at E-Learning Provider Seesaw

Rockset, along with DynamoDB, Hightouch, and Retool, enabled Seesaw to obtain actionable, real-time insights that helped grow their e-learning platform.

Daniel Lu

January 25, 2022

Snowflake

How To

Slow Queries

Data Warehouse

What Do I Do When My Snowflake Query Is Slow? Part 2: Solutions

Part two of a two part series on improving Snowflake query performance

Shawn Adams

January 20, 2022

Snowflake

How To

Slow Queries

Data Warehouse

What Do I Do When My Snowflake Query Is Slow? Part 1: Diagnosis

Part one of a two part series on improving Snowflake query performance

Shawn Adams

January 5, 2022

Real-Time Analytics

SQL

Mythbusting: The Venerable SQL Database and Today’s Real-Time Analytics

The SQL database that came of age in the 1980s still has a critical role today in moving data-driven companies from batch to real-time analytics.

Dhruba Borthakur

December 21, 2021

Company

Engineering

Developer

How We Use Rockset's Real-Time Analytics to Debug Distributed Systems

Jonathan, a software engineering intern at Rockset, describes how Rockset uses its own tech to debug its highly distributed ingest system.

Jonathan Kula

December 17, 2021

SQL

Developer

Powering SQL Draw with Rockset, Retool and dbt

SQL Draw is a Slack-based game that uses Rockset, Retool and dbt to create fun drawings with cartesian geometry, creativity and teamwork.

James Weakley

December 10, 2021

Company

Wrap-up of Rockset at AWS re: Invent 2021

November 29 to December 3, 2021 in Las Vegas, NV

Rod Bauer

December 9, 2021

Big Ideas

Real-Time Analytics

Streaming

The Rise of Streaming Data and the Modern Real-Time Data Stack

Now more than 10 years old, the modern data stack is ripe for innovation. The inevitable next stage? Real-time insights delivered straight to users — the modern real-time data stack.

Shruti Bhat

December 1, 2021

Company

Engineering

Why Rockset Is My Next Job After Facebook

Louis Brandy, director of engineering, shares his thoughts on joining Rockset.

Louis Brandy

November 9, 2021

MySQL

PostgreSQL

OLTP

CDC

How To

How to Implement CDC for MySQL and Postgres

We examine different options for implementing change data capture (CDC) from MySQL and Postgres and make recommendations for when to use each.

Lewis Gavin

November 5, 2021

Case Study

PostgreSQL

Case Study: Powering Customer-Facing Dashboards at Scale Using Rockset with PostgreSQL at DataBrain

Learn how Rockset’s PostgreSQL integration helped DataBrain scale smoothly as its production data size and query volume exploded.

Daniel Lu

November 4, 2021

S3

Data Lakes

Getting Started with Apache Spark, S3 and Rockset for Real-Time Analytics

Get fast query performance with Apache Spark + Rockset to power data apps.

Nadine Farah

November 2, 2021

Product

Rockset’s Reverse ETL Integrations Extend the Modern Real-Time Data Stack

Rockset’s new partner integrations with leading reverse ETL platforms Census, Hightouch and Omnata will enable everyday business tools to consume real-time customer insights seamlessly from Rockset.

Daniel Lu

October 26, 2021

Case Study

DynamoDB

Case Study: Fast and Simple — Building Rich Patient Dashboards for Speech Therapists with Rockset

Rockset is used to power interactive visualizations of the rehabilitation data of speech-impaired patients for their speech therapists and other caregivers.

Antonio Domínguez

October 20, 2021

Product

Real-Time Data Transformations with dbt + Rockset

The dbt-Rockset adapter 2.0 supports all four core dbt materializations. Learn about how to transform data in real-time using dbt and Rockset.

Justin Liu

October 15, 2021

Big Ideas

What Is a Cloud Database? IaaS, PaaS, SaaS and DBaaS Explained

Cloud databases are not created equal. We discuss what these different terms mean with respect to cloud databases: IaaS, PaaS, SaaS and DBaaS.

Shawn Adams

September 29, 2021

Product

Rockset Elevates Security Posture with RBAC Custom Roles & Views

New security features enable customers to enforce least privileged access to all resources within Rockset

Rafael Kabesa

September 29, 2021

Company

Product

Rockset Is Now SOC 2 Type II Compliant

The Rockset team is proud to announce that we have been accredited as SOC 2 Type II compliant.

Martin Englund

September 21, 2021

Engineering

How To

How We Improved the Concurrency and Scalability of Our Redis Rate Limiting System

We use a rate limiting system, based on Redis, to protect services from overload. Learn how we increased its concurrency and scalability in this blog.

Akshay Nanavati

September 15, 2021

Kafka

Product

Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data

We’re introducing a new fully-managed Kafka Integration with native support for Confluent Cloud and Apache Kafka. Get started with real-time analytics on event streams from Apache Kafka in minutes.

Boyang Chen