June 21, 2024
Company
OpenAI Acquires Rockset
We are thrilled to join the OpenAI team and bring our technology and expertise to building safe and beneficial AGI.
Venkat Venkataramani
May 21, 2024
How To
Use Cases
How to Build a Chatbot Using Retrieval Augmented Generation (RAG)
Discover how to build a Chatbot using RAG with Rockset as a vector database and OpenAI's GPT-4 as the LLM.
Ankit Khare
April 22, 2024
How To
Use Cases
How to Build a Recommender System using Rockset and OpenAI Embedding Models
Discover how to build a recommender system using Rockset as a vector database and OpenAI embeddings. This tutorial covers creating a dynamic web app with CSS, HTML, JavaScript, and Flask, integrating Rockset and OpenAI APIs for a robust recommendation system.
Ankit Khare
April 2, 2024
Product
Reducing Costs
How We Optimized Rockset's Hot Storage Tier to Improve Efficiency By More Than 200%
Rockset’s new tiered pricing is as low as $0.13/GB-month, making real-time data more affordable than ever before.
Rafael Kabesa
March 27, 2024
Dashboards
How To
Explo and Rockset One-Click Integration for Real-Time Embedded Analytics
Rockset users can integrate with Explo to provide their customers a quality embedded analytics experience. In this article, we step through how to integrate Rockset with Explo to create charts and dashboards in your applications.
Brian Bakerman
March 18, 2024
Indexing
Streaming
Kafka
How To
Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset
We discuss how RAG fits into the paradigm of real-time data processing and show an example product recommendation application using both Kafka and Flink on Confluent Cloud together with Rockset.
Julie Mills
March 15, 2024
Engineering
Profiling Individual Queries in a Concurrent System
This blog introduces trampoline histories, a technique Rockset has developed to efficiently attach application-level information (query IDs) to the samples of a CPU profile.
Nathan Bronson
February 22, 2024
DynamoDB
Indexing
Understanding DynamoDB Secondary Indexes
Discover the challenges secondary indexes solve in DynamoDB, including the optimal circumstances and methods for their effective application.
Alex DeBrie
February 16, 2024
Kafka
Case Study
Dashboards
Real-Time Analytics
Streaming
How Klarna Scales Buy Now Pay Later with Real-Time Anomaly Detection
With Rockset, Klarna was able to identify and alert teams to issues with partner and merchant integrations in real time, saving the company millions of dollars.
Julie Mills
January 31, 2024
Product
Real-Time Analytics
Reducing Costs
Rockset Ushers in the New Era of Search and AI with a 30% Lower Price
Rockset releases the general purpose instance class, autoscaling, microbatching and incremental materializations to make search and analytics applications more affordable than ever before.
Julie Mills
January 23, 2024
Developer
Elasticsearch
CDC
Streaming
How to Update Documents in Elasticsearch
A walkthrough of the different options available for updates in Elasticsearch, including full updates, partial updates and scripted updates.
Shawn Adams
January 19, 2024
Product
How To
SQL
Mutable Data in Rockset
We explore the concept of data mutability in Rockset and cover examples demonstrating how to manipulate Rockset data using SQL.
Luka Lovosevic
December 21, 2023
Elasticsearch
SQL
Choosing Between Nested Queries and Parent-Child Relationships in Elasticsearch
In this blog, we’ll discuss how you can design your data model in Elasticsearch to handle relationships using the nested field type and parent-child relationships.
Julie Mills
December 19, 2023
RocksDB
Use Cases
How To
A Blueprint for a Real-World Recommendation System
A comprehensive exploration of the general blueprint of modern recommendation systems, this guide focuses on the intricate details of each stage and delves deeply into the infrastructure challenges involved in building these systems.
Ankit Khare
December 14, 2023
Product
Using Query Logs in Rockset
Learn how query logs are implemented in Rockset and how they can provide greater visibility into your queries.
Julius Hochmuth
December 1, 2023
How To
How to Do Load Testing with Rockset
This blog discusses the motivation behind load testing and provides a step-by-step guide to performing load testing on Rockset.
Luka Lovosevic
November 7, 2023
Product
Elasticsearch
Indexing
How Rockset Built Vector Search for Scale in the Cloud
Learn how Rockset built similarity indexes using FAISS-IVF that are memory-efficient and optimized for immediate insertion and recall.
Julie Mills
November 6, 2023
Company
Celebrating Engineering Innovation at Index Conference 2023
A recap of the first edition of Index, the conference for engineers building search, analytics and AI applications at scale.
Kevin Leong
October 31, 2023
Product
Customer-Managed Encryption Keys in Rockset
Learn how you can use customer-managed encryption keys, also called bring your own key, in Rockset.
Esteban Talavera
October 26, 2023
Case Study
Indexing
JetBlue Scales Real-Time AI on Rockset
"Iteration and the speed of new ML products was the most important to us. With Rockset, we found a database that could keep up with the fast pace of innovation at JetBlue," says Sai Ravuru, Senior Manager of Data Science and Analytics at JetBlue.
Julie Mills
October 17, 2023
Product
Creating and Restoring from Snapshots in Rockset
Understand how snapshots work in Rockset, when to use them and how users can create and restore from snapshots in the console.
Yashwanth Nannapaneni
October 13, 2023
Big Ideas
Introduction to Semantic Search: Embeddings, Similarity Metrics and Vector Databases
What does it take to implement semantic search? This article explains vector embeddings, nearest neighbor search and what to look for in a vector database.
M.Joel Dubinko
October 4, 2023
Elasticsearch
Elasticsearch Reindexing: When to Reindex, Best Practices and Alternatives
Reindexing in Elasticsearch is often necessary to handle changing data or improve performance. Understand situations when reindexing is required, guidance for performing a reindex, and alternatives to reindexing.
Lewis Gavin
September 26, 2023
Data Applications
Elasticsearch
Kafka
Streaming
Indexing
Case Study
Real-time AI: Live Recommendations Using Confluent and Rockset
We discuss using Confluent Cloud’s data streaming platform and Rockset’s vector search capabilities to power real-time AI applications.
Kevin Leong
September 19, 2023
Engineering
Performance
4x Faster Search Query Performance with Rockset’s Row Store Cache
The Rockset engineering team implemented a RowStoreCache to improve search performance after seeing an opportunity to speed up the fetching of values from the row store.
Nithin Venkatesh
September 12, 2023
Big Ideas
Introduction to Semantic Search: From Keyword to Vector Search
This article provides a brief history of semantic search, covering the evolution of search from keyword to vectors.
M.Joel Dubinko
September 11, 2023
Elasticsearch
SQL
How To
Can I Do SQL-Style Joins in Elasticsearch?
We explore how to perform the equivalent of SQL joins when using Elasticsearch. While joins are primarily an SQL concept, they are equally important in NoSQL
Shawn Adams
August 29, 2023
Big Ideas
Company
Elasticsearch
Redefining Search and Analytics for the AI Era
Rockset is on a mission to bring the power of search and AI to every digital disruptor in the world. Today, we are thrilled to announce a major milestone in our journey towards redefining search and analytics for the AI era.
Venkat Venkataramani
August 28, 2023
Product
5 Tasks You Can Automate in Rockset Using Scheduled Query Lambdas
Scheduled Query Lambdas are a useful feature in Rockset, allowing users to automate alerts, view creation, exports and more.
Luka Lovosevic
August 28, 2023
Big Ideas
Indexing
6 Hard Problems Scaling Vector Search
You’ve decided to use vector search in your application. Almost immediately upon productionizing vector search, you will run into hard and potentially unanticipated difficulties. This blog attempts to arm you with some knowledge of your future.
Louis Brandy
August 2, 2023
Case Study
Snowflake
Real-Time Analytics
Data Applications
How Windward Built Real-Time Logistics Tracking and AI Insights for the Maritime Industry
Learn how Windward built a real-time data platform that enables rapid innovation in AI for the maritime industry.
Julie Mills
June 12, 2023
Elasticsearch
DynamoDB
Case Study
Snowflake
Use Cases
Performance
Real-Time Clinical Trial Monitoring at Clinical ink
How Clinical ink built a real-time 360-degree view of patients and their outcomes across global clinical trials by migrating from OpenSearch to Rockset for DynamoDB indexing.
Alex Doan
June 8, 2023
Engineering
Kafka
Kinesis
Streaming
Performance
CDC
When Real-Time Matters: Rockset Delivers 70ms Data Latency at 20MB/s Streaming Ingest
We’re often asked how low we’re capable of pushing our end-to-end data latency, i.e. the time it takes to receive data, index it, and make it available for querying. To answer this question, we ran a benchmark to push data latency as low as we could.
John Solitario
June 8, 2023
DynamoDB
Elasticsearch
Indexing
A Guide to DynamoDB Secondary Indexes: GSI, LSI, Elasticsearch and Rockset
Secondary indexing is a common strategy to boost search and analytics performance in DynamoDB. In this guide, we discuss the pros and cons of using DynamoDB GSIs and LSIs along with external secondary indexes such as Elasticsearch and Rockset.
Kevin Leong
June 6, 2023
Engineering
RocksDB
How Rockset Separates Compute and Storage Using RocksDB
We describe how Rockset achieves compute-storage separation without performance degradation.
Esteban Talavera
May 31, 2023
Performance
Engineering
May the Speed Be with You: 20K QPS on Rockset
We ran a 20K QPS workload on Rockset while ingesting data at 10MB/s and maintaining query latency at 200ms in a recent customer engagement. Read more about how Rockset achieved this scale and performance.
Purvi Desai
May 8, 2023
Use Cases
Real-Time Analytics
Indexing
5 Use Cases for Vector Search
In this blog, we capture engineering stories from 5 early adopters of vector search (Pinterest, Spotify, eBay, Airbnb and DoorDash) who have integrated AI into their applications.
Julie Mills
May 3, 2023
Elasticsearch
Real-Time Analytics
Performance
Streaming
Benchmarking Elasticsearch and Rockset: Rockset achieves up to 4X faster streaming data ingestion
We evaluated Elasticsearch and Rockset streaming ingestion performance on throughput and latency. In this blog, we walk through the benchmark framework, configuration and results.
Julie Mills
April 27, 2023
Dashboards
Data Applications
Developer
Engineering
IoT
Kafka
Kinesis
Real-Time Analytics
Snowflake
SQL
Use Cases
How To
Reducing Costs
Streaming
Three Reference Architectures for Real-Time Analytics On Streaming Data
In part three of "Making Sense of Real-Time Analytics On Streaming Data", we provide reference architectures for anomaly detection, IoT, and recommendation systems.
Scott Dwyer
April 18, 2023
Big Ideas
Product
Real-Time Analytics
Introducing Vector Search on Rockset: How to run semantic search with OpenAI and Rockset
We’re excited to introduce vector search on Rockset to power fast and efficient search experiences, personalization engines, fraud detection systems and more.
John Solitario
April 17, 2023
Big Ideas
Developer
Product
Real-Time Analytics
Rockset and Feast Feature Store for Real-Time Machine Learning
To better serve real-time machine learning, Rockset integrates with the Feast Feature Store which acts as a centralized platform for deploying, monitoring and managing production ML features.
Daniel Lin
April 11, 2023
RocksDB
Engineering
Real-Time Analytics
Tech Overview of Compute-Compute Separation- A New Cloud Architecture for Real-Time Analytics
The high-level implementation of compute-compute separation, a new cloud architecture with multiple, isolated clusters for ingest compute and query compute on shared real-time data.
Julie Mills
March 28, 2023
Big Ideas
Data Applications
Developer
Druid
Elasticsearch
Engineering
Kafka
Kinesis
Real-Time Analytics
Streaming
Stream Processing vs. Real-Time Analytics Databases
Learn about conceptual differences between stream processing and RTA databases and develop a framework for choosing the right tool.
Scott Dwyer
March 27, 2023
Data Applications
Engineering
Kafka
PostgreSQL
Real-Time Analytics
CDC
How To
Real-Time CDC With Rockset And Confluent Cloud
Learn how Rockset and Confluent Cloud provide a real-time CDC analytics pipeline that requires zero code and zero infrastructure to manage.
Patrick Druley
March 9, 2023
Developer
Engineering
web3
Use Cases
How To
How To Query The Ethereum Blockchain
Learn how to query Ethereum data using clients, RPC node providers, and using SQL queries on public datasets.
Justin Liu
March 1, 2023
Real-Time Analytics
RocksDB
Engineering
Introducing Compute-Compute Separation for Real-Time Analytics
Rockset unveils compute-compute separation that eliminates the challenge of compute contention and makes it possible to build efficient, reliable real-time applications at massive scale.
Venkat Venkataramani
March 1, 2023
Real-Time Analytics
Product
How To
Data Applications
A Breakthrough Architecture for Real-Time Analytics- An Overview of Compute-Compute Separation in Rockset
Rockset introduces a new architecture that enables separate virtual instances to isolate streaming ingestion from queries and one application from another.
Rafael Kabesa
February 25, 2023
Use Cases
Streaming
How To
Real-Time Analytics
Kinesis
Kafka
Making Sense of Real-Time Analytics on Streaming Data: The Landscape
This blog series will help demystify streaming data and provide engineering leaders a guide for incorporating streaming data into their analytics pipelines.
Scott Dwyer
February 9, 2023
DynamoDB
How To
Using DynamoDB Single-Table Design with Rockset
Single-table design is a popular data modeling technique in DynamoDB. We present several options for performing real-time analytics on single-table models using Rockset.
Tyler Denton
February 8, 2023
Real-Time Analytics
Druid
ClickHouse
Top Real-Time Analytics Databases in 2023: Rockset, Apache Druid, ClickHouse and Pinot
Learn how Rockset, Druid, ClickHouse and Pinot compare for real-time analytics in real-world use cases.
Shruti Bhat
January 31, 2023
Case Study
S3
Developing Global Labor Market Intelligence at SkyHive Using Rockset and Databricks
SkyHive builds a platform for labor market intelligence, using Databricks for ML processing and Rockset to serve their user-facing application.
Mohan Reddy
January 26, 2023
How To
How to Use Terraform with Rockset
Learn how Terraform can be used to automate the configuration and deployment of Rockset resources.
Martin Englund
January 11, 2023
DynamoDB
Elasticsearch
Real-Time Analytics
Using Elasticsearch to Offload Search and Analytics from DynamoDB
A walkthrough of how to offload text search, complex filters and aggregations from DynamoDB to Elasticsearch.
Julie Mills
January 9, 2023
ClickHouse
Case Study
Snowflake
Dashboards
MongoDB
Scaling Our SaaS Sales Training Platform with Real-Time Analytics from Rockset
As users and data volumes grew, ConveYour needed to scale their customer-facing dashboards. Learn how their developer team achieved scalability, concurrency and low ops using Rockset.
Stephen Rhyne
January 3, 2023
Big Ideas
Real-Time Analytics
Streaming
Real-Time Data Predictions for 2023
This blog compiles real-time data predictions from industry leaders so you know what’s coming in 2023.
Julie Mills
January 1, 2023
DynamoDB
Use Cases
NoSQL
5 Use Cases for DynamoDB in 2023
This guest post lays out the benefits of using DynamoDB, including 5 real-life examples, along with recommendations for performing analytics on DynamoDB data
Ben Rogojan
December 27, 2022
Elasticsearch
Developer
How to Solve 4 Elasticsearch Performance Challenges at Scale
We walk through solutions to common Elasticsearch performance challenges at scale including slow indexing, search speed, shard and index sizing, and multi-tenancy.
Julie Mills
December 14, 2022
Kafka
Streaming
Using the Amazon MSK Native Connector to Simplify Real-Time Analytics on Kafka
Rockset's native connector allows users to easily ingest and query streaming data from Amazon MSK, Amazon's managed Kafka service.
Avi Shah
December 13, 2022
Developer
An Open-Source Go Module to Secure the Command Line Using the OAuth2 Device Authorization Flow
We show you how we implemented a Go module that secures the CLI using an OAuth2 device authorization flow that supports both Auth0 and Okta SSO providers.
Martin Englund
November 29, 2022
Big Ideas
CDC
Breaking Down Cost Barriers For Real-Time Change Data Capture (CDC)
Learn how to improve the efficiency of real-time CDC with Rockset
Ari Ekmekji
November 21, 2022
Company
AWS re:Invent 2022: Rockset Will Be There…Will You?
See Rockset live at AWS re:Invent in Las Vegas. Join real-time analytics demos at our booth and architecture sessions in our executive suite.
Ashley Andrada
November 15, 2022
Real-Time Analytics
Performance
Product
Rockset Achieves 84% Better Performance on the Star Schema Benchmark with Intel Ice Lake
As a result of ongoing enhancements, we released software that leverages 3rd Gen Intel® Xeon® Scalable processors and delivers 84% faster performance.
Julie Mills
November 2, 2022
Engineering
Product
The New Rockset Query Editor Experience
We're excited to announce the release of a new query editor in the Rockset Console with improved performance and an updated design.
Kristie Lim
November 2, 2022
Elasticsearch
Real-Time Analytics
5 Steps for Migrating from Elasticsearch to Rockset for Real-Time Analytics
Best practices from customers who migrated from Elasticsearch to Rockset in days to weeks by avoiding common migration pitfalls.
Patrick Druley
October 26, 2022
Case Study
web3
DynamoDB
Case Study: How Rockset's Real-Time Analytics Platform Propels the Growth of Our NFT Marketplace
Own the Moment uses Rockset to build the real-time analytics and leaderboards that are core to their NFT and fantasy sports platform.
Scott Mitchell
October 21, 2022
Kafka
How To
S3
Building Real-Time Recommendations with Kafka, S3, Rockset and Retool
Step through a real-time recommendations example using Kafka, S3, Rockset and Retool.
Nadine Farah
October 21, 2022
Big Ideas
Product
Public SQL Endpoints in Rockset
Learn how to share SQL query results and metadata with public endpoints
Scott Dwyer
October 13, 2022
Big Ideas
Snowflake
Data Warehouse
Reducing Costs
How To
7 Practical Ways to Cut Snowflake Compute Cost
Ok, so Snowflake is expensive. But what do I do about it? Here are 7 Practical Ways to Cut Snowflake Compute Cost
Shruti Bhat
October 11, 2022
Elasticsearch
CDC
Streaming
Updates, Inserts, Deletes: Challenges to avoid when indexing mutable data in Elasticsearch
We examine common challenges when indexing mutable data such as CDC streams in Elasticsearch and contrast with Rockset, as well as provide practical techniques for using these systems for real-time search and analytics.
Julie Mills
October 6, 2022
Case Study
DynamoDB
Dashboards
PyTorch Infra's Journey to Rockset
The PyTorch infra team at Meta runs thousands of tests to validate every change as part of their Continuous Integration. Learn how they moved to Rockset to deliver metrics on the health of their CI.
Jane Xu
October 4, 2022
ClickHouse
Streaming
CDC
Comparing ClickHouse vs Rockset for Event and CDC Streams
We compare ClickHouse and Rockset for real-time analytics on event and CDC streams, examining their similarities and differences across architecture, data ingestion, querying and operations.
Kevin Leong
September 20, 2022
Big Ideas
Real-Time Analytics
web3
3 Use Cases for Real-Time Blockchain Analytics
Learn about emerging use cases for real-time blockchain analytics and some key considerations for developers building dApps.
Sid Chhibber
September 13, 2022
DynamoDB
DynamoDB Filtering and Aggregation Queries Using SQL on Rockset
Learn how to build an application that handles high-volume transactions as well as filtering and aggregation using a combination of DynamoDB and Rockset.
Alex DeBrie
September 2, 2022
Data Applications
Use Cases
Real-Time Analytics
Expert Roundtable: How to Build Real-Time Personalization and Recommendation Systems
Hear experts share why real-time personalization offers greater accuracy and efficiency compared to offline alternatives, along with best practices for getting to real time.
Dhruba Borthakur
August 26, 2022
Case Study
IoT
Case Study: iYOTAH Brings Real-Time IoT Analytics to Dairy Farming with Its AgTech SaaS Platform
iYOTAH uses real-time IoT data to moooo-ve dairy farming into a smart future.
Daniel Lu
August 16, 2022
Kinesis
Kafka
Streaming
How To
Kafka vs Kinesis: How to Choose
Which is the best stream processing solution for your needs and environment?
Patrick Druley
August 11, 2022
Big Ideas
Real-Time Analytics
Expert Roundtable: Batch vs Streaming in the Modern Data Stack [Video]
Data engineering experts come together to discuss where batch and streaming analytics fit in the modern data stack.
Shruti Bhat
August 5, 2022
Case Study
Elasticsearch
Kafka
Use Cases
Case Study: How Rockset Turbocharges Real-Time Personalization at Whatnot
Whatnot implemented real-time personalization for their live shopping platform using Rockset, which proved a more efficient alternative to Elasticsearch.
Emmanuel Fuentes
July 29, 2022
Snowflake
Real-Time Analytics
Data Warehouse
Can BigQuery, Snowflake, and Redshift Handle Real-Time Data Analytics?
In this article, we’ll explore the strengths and shortcomings of three prominent data warehouses today for real-time analytics
Daniel Lu
July 28, 2022
MongoDB
Kafka
CDC
Streaming
How To
NoSQL
MongoDB CDC: When to Use Kafka, Debezium, Change Streams and Rockset
Change data capture from MongoDB is a reliable and performant way to move MongoDB data to a complementary system for search and analytics. We review several options for CDC on MongoDB.
Lewis Gavin
July 22, 2022
Big Ideas
DynamoDB
MongoDB
SQL
Expert Talk TLDR: SQL vs NoSQL Databases in the Modern Data Stack
Top takeaways from a recent panel of seasoned data architects and data practitioners steeped in NoSQL databases.
Daniel Lu
July 21, 2022
Dashboards
Case Study
MongoDB
Case Study: Is Your NoSQL Data Hindering Real-Time Analytics? Savvy Solved It with Rockset.
Savvy provides real-time analytics for growth teams using its service to create no-code interactive experiences. Learn how they built this functionality using Rockset on MongoDB data.
Jeremy Evans
July 12, 2022
Developer
Kinesis
SQL
Streaming SQL Joins in Rockset
We compare building collections in Rockset using JOINs at query time and at ingestion time and why you might choose each approach.
Tyler Denton
July 8, 2022
Company
Rockset's Summer Road Trip!
Rockset was talking fast and efficient real-time analytics in New York, Las Vegas and San Francisco in June. You can still catch us July 12 in New York at AWS Summit.
Ashley Andrada
July 6, 2022
Big Ideas
Real-Time Analytics
Why Real-Time Analytics Requires Both the Flexibility of NoSQL and Strict Schemas of SQL Systems
Modern, real-time use cases require databases that strongly enforce schemas and have the flexibility to automatically redefine those schemas based on the data itself.
Dhruba Borthakur
June 21, 2022
Snowflake
Product
Kafka
Kinesis
Real-Time Analytics
Streaming
Data Warehouse
Joining Streaming and Historical Data for Real-Time Analytics: Your Options With Snowflake, Snowpipe and Rockset
The new Snowflake-Rockset connector provides Snowflake users a cost-efficient option for real-time analytics on streaming data from Kafka and historical data in Snowflake.
Vibhuti Bhushan
June 14, 2022
Real-Time Analytics
Company
Engineering
Rockset Architecture Whiteboard Session With CTO Dhruba Borthakur
Learn about Rockset's ALT architecture and how data is ingested, stored and queried.
Dhruba Borthakur
June 7, 2022
MongoDB
DynamoDB
NoSQL
MongoDB vs DynamoDB Head-to-Head: Which Should You Choose?
We compare MongoDB and DynamoDB, their pros and cons, data types, cost, reliability, performance and security.
Shawn Adams
June 3, 2022
Case Study
Elasticsearch
Case Study: Zembula and Rockset Power Real-Time Marketing Email Personalization
Low-ops and cost-effective, Rockset is helping Zembula scale our next 100x growth.
Robert Haydock
May 25, 2022
Developer
Office Hours
Office Hours Recap: Optimize Cost and Query Latency With SQL Transformations and Real-Time Rollups
Recap of a recent Rockset Office Hours.
Nadine Farah
May 17, 2022
Real-Time Analytics
Big Ideas
SQL
SQL and Complex Queries Are Needed for Real-Time Analytics
Modern, cloud-native SQL databases deliver what today's data-driven businesses require.
Dhruba Borthakur
May 12, 2022
Real-Time Analytics
Big Ideas
Handling Bursty Traffic in Real-Time Analytics Applications
We examine the database architecture choices for handling bursty data traffic.
Dhruba Borthakur
May 10, 2022
Real-Time Analytics
DynamoDB
CDC on DynamoDB
We look at how CDC works with DynamoDB and its potential use cases.
Lewis Gavin
May 5, 2022
Engineering
Company
A Real-Time Rockset Intern Experience
The real real on interning at Rockset.
Shreya Shekhar
May 3, 2022
Real-Time Analytics
Engineering
Kafka
How Rockset Handles Data Deduplication
What data duplication is, how it plagues teams adopting real-time analytics, and what Rockset does to resolve duplication issues.
Tyler Denton
April 28, 2022
Company
Reflections of a Rockset UXer
Time flies when you're UXing at Rockset.
Aditi Dhar
April 26, 2022
Kafka
Real-Time Analytics
Streaming Data and Real-Time Analytics With Kafka + Rockset
Real-time analytics for streaming data is alive, growing and affordable for today’s modern real-time data stack.
Vibhuti Bhushan
April 19, 2022
Real-Time Analytics
Big Ideas
The Real-Time Revolution and Digital Economics in the COVID Era
Driven by COVID, economists are finally embracing streaming and real-time data – just like the business world.
Shruti Bhat
April 15, 2022
Real-Time Analytics
Big Ideas
Data Applications
Handling Out-of-Order Data in Real-Time Analytics Applications
Mutability is the most important capability for real-time analytics applications, but close behind is the ability to handle out-of-order data.
Dhruba Borthakur
April 12, 2022
Company
Kafka
DynamoDB
Rockset Goes on the Road!
Rockset will be exhibiting at three events this month in San Francisco and London.
Ashley Andrada
April 5, 2022
Druid
ClickHouse
Performance
Rockset Beats ClickHouse and Druid on the Star Schema Benchmark (SSB)
Rockset is 1.67 times faster than ClickHouse and 1.12 times faster than Druid on the Star Schema Benchmark.
Ben Hannel
March 31, 2022
Case Study
MongoDB
Developer
Case Study: How Rockset Made Me a Day Three Hero at Sounding Board
From Rockset trial to usable and reportable real-time information in just three days.
Jon Farr
March 29, 2022
Case Study
MySQL
Real-Time Analytics
Case Study: How Dimona Built a Real-Time Inventory Management System on Rockset
Dimona needed a better technology solution, one that could handle massive data sets and query them fast.
Igor Blumberg
March 25, 2022
Case Study
MongoDB
DynamoDB
Case Study: Rockset Enables Real-Time Operational Analytics in Hardware Manufacturing for PCH
Rockset delivers ad hoc complex queries within seconds, a huge improvement over the one-hour latency PCH was seeing before.
Daniel Lu
March 24, 2022
Real-Time Analytics
Developer
Elasticsearch
Druid
Empowering Developers With Query Flexibility
Query flexibility enables developers to prototype and build new features quickly, increasing overall productivity.
Nadine Farah
March 22, 2022
Real-Time Analytics
Kafka
Streaming
Streaming Analytics With KSQL vs. A Real-Time Analytics Database
The arguments for and against two approaches to data analytics and their optimal use cases
Lewis Gavin
March 17, 2022
Real-Time Analytics
MongoDB
PostgreSQL
Druid
ClickHouse
How Mutable Databases Make It Easy To Do Real-Time Updates
Three reasons why you need a mutable database for real-time updates
Nadine Farah
March 15, 2022
DynamoDB
Case Study
IoT
Case Study: Complementing DynamoDB with Rockset for Real-Time IoT Analytics at 1NCE
Thanks to Rockset, 1NCE is able to provide customers with fast and valuable insight into their data
Jan Sulaiman
March 10, 2022
Real-Time Analytics
Big Ideas
Why Mutability Is Essential for Real-Time Data Analytics
Mutability enables updates to existing records in a data store and is key to real-time analytics.
Dhruba Borthakur
March 4, 2022
Kinesis
How Rockset Supports Kinesis Shard Autoscaling to Handle Varying Throughputs
On-demand capacity increases efficiency and supports cost savings
Sudhindra Tirupati Nagaraj
March 3, 2022
Real-Time Analytics
SQL
Real-Time Analytics on Oracle and MSSQL With Rockset
Rockset announces early access for Oracle and Microsoft SQL Server integrations
Vibhuti Bhushan
February 24, 2022
Kinesis
Elasticsearch
Druid
Real-Time Analytics
Real-Time Analytics on Kinesis Event Streams Using Rockset, Druid, Elasticsearch and Redshift
An overview of popular options for RTA on Kinesis event streams highlighting ideal use cases and associated tradeoffs.
Scott Dwyer
February 17, 2022
Big Ideas
Engineering
17 New Things Every Modern Data Engineer Should Know in 2022
We asked data industry thought leaders to tell us what we should be paying attention to in coming months. Here is what they told us.
Shruti Bhat
February 14, 2022
Real-Time Analytics
Big Ideas
Top 5 Reasons for Moving From Batch To Real-Time Analytics
Fast analytics on fresh data beats slow analytics on stale data every time.
Venkat Venkataramani
February 10, 2022
MongoDB
How To
NoSQL
How To Join Data in MongoDB
Choosing between $lookup, denormalization and alternatives for joining data in MongoDB.
Shawn Adams
February 2, 2022
MongoDB
Real-Time Analytics
NoSQL
Slow Queries
Five Ways to Run Analytics on MongoDB – Their Pros and Cons
Your choices range from performing analytics directly in MongoDB to moving data to a data store better equipped for real-time analytics.
Shawn Adams
January 28, 2022
DynamoDB
Case Study
Case Study: Real-Time Insights Help Propel 10X Growth at E-Learning Provider Seesaw
Rockset, along with DynamoDB, Hightouch, and Retool, enabled Seesaw to obtain actionable, real-time insights that helped grow their e-learning platform.
Daniel Lu
January 25, 2022
Snowflake
How To
Slow Queries
Data Warehouse
What Do I Do When My Snowflake Query Is Slow? Part 2: Solutions
Part two of a two-part series on improving Snowflake query performance
Shawn Adams
January 20, 2022
Snowflake
How To
Slow Queries
Data Warehouse
What Do I Do When My Snowflake Query Is Slow? Part 1: Diagnosis
Part one of a two-part series on improving Snowflake query performance
Shawn Adams
January 5, 2022
Real-Time Analytics
SQL
Mythbusting: The Venerable SQL Database and Today’s Real-Time Analytics
The SQL database that came of age in the 1980s still has a critical role today in moving data-driven companies from batch to real-time analytics.
Dhruba Borthakur
December 21, 2021
Company
Engineering
Developer
How We Use Rockset's Real-Time Analytics to Debug Distributed Systems
Jonathan, a software engineering intern at Rockset, describes how Rockset uses its own tech to debug its highly distributed ingest system.
Jonathan Kula
December 17, 2021
SQL
Developer
Powering SQL Draw with Rockset, Retool and dbt
SQL Draw is a Slack-based game that uses Rockset, Retool and dbt to create fun drawings with cartesian geometry, creativity and teamwork.
James Weakley
December 10, 2021
Company
Wrap-up of Rockset at AWS re:Invent 2021
November 29 to December 3, 2021 in Las Vegas, NV
Rod Bauer
December 9, 2021
Big Ideas
Real-Time Analytics
Streaming
The Rise of Streaming Data and the Modern Real-Time Data Stack
Now more than 10 years old, the modern data stack is ripe for innovation. The inevitable next stage? Real-time insights delivered straight to users — the modern real-time data stack.
Shruti Bhat
December 1, 2021
Company
Engineering
Why Rockset Is My Next Job After Facebook
Louis Brandy, director of engineering, shares his thoughts on joining Rockset.
Louis Brandy
November 9, 2021
MySQL
PostgreSQL
OLTP
CDC
How To
How to Implement CDC for MySQL and Postgres
We examine different options for implementing change data capture (CDC) from MySQL and Postgres and make recommendations for when to use each.
Lewis Gavin
November 5, 2021
Case Study
PostgreSQL
Case Study: Powering Customer-Facing Dashboards at Scale Using Rockset with PostgreSQL at DataBrain
Learn how Rockset’s PostgreSQL integration helped DataBrain scale smoothly as its production data size and query volume exploded.
Daniel Lu
November 4, 2021
S3
Data Lakes
Getting Started with Apache Spark, S3 and Rockset for Real-Time Analytics
Get fast query performance with Apache Spark + Rockset to power data apps.
Nadine Farah
November 2, 2021
Product
Rockset’s Reverse ETL Integrations Extend the Modern Real-Time Data Stack
Rockset’s new partner integrations with leading reverse ETL platforms Census, Hightouch and Omnata will enable everyday business tools to consume real-time customer insights seamlessly from Rockset.
Daniel Lu
October 26, 2021
Case Study
DynamoDB
Case Study: Fast and Simple — Building Rich Patient Dashboards for Speech Therapists with Rockset
Rockset is used to power interactive visualizations of the rehabilitation data of speech-impaired patients for their speech therapists and other caregivers.
Antonio Domínguez
October 20, 2021
Product
Real-Time Data Transformations with dbt + Rockset
The dbt-Rockset adapter 2.0 supports all four core dbt materializations. Learn about how to transform data in real-time using dbt and Rockset.
Justin Liu
October 15, 2021
Big Ideas
What Is a Cloud Database? IaaS, PaaS, SaaS and DBaaS Explained
Cloud databases are not created equal. We discuss what these different terms mean with respect to cloud databases: IaaS, PaaS, SaaS and DBaaS.
Shawn Adams
September 29, 2021
Product
Rockset Elevates Security Posture with RBAC Custom Roles & Views
New security features enable customers to enforce least-privilege access to all resources within Rockset
Rafael Kabesa
September 29, 2021
Company
Product
Rockset Is Now SOC 2 Type II Compliant
The Rockset team is proud to announce that we have been accredited as SOC 2 Type II compliant.
Martin Englund
September 21, 2021
Engineering
How To
How We Improved the Concurrency and Scalability of Our Redis Rate Limiting System
We use a rate limiting system, based on Redis, to protect services from overload. Learn how we increased its concurrency and scalability in this blog.
Akshay Nanavati
September 15, 2021
Kafka
Product
Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data
We’re introducing a new fully-managed Kafka Integration with native support for Confluent Cloud and Apache Kafka. Get started with real-time analytics on event streams from Apache Kafka in minutes.
Boyang Chen
September 8, 2021
Developer
Hello World: Join the New Rockset Developer Community
We are unveiling our community, developer mascot, and Real-time Rockstars!
Nadine Farah
September 7, 2021
Kafka
See Rockset’s Rollups for Streaming Data at Kafka Summit 2021
Rockset, a Gold Sponsor of Kafka Summit Americas 2021, to present and demo SQL-based rollups on streaming data.
Giovanni Tropeano
September 3, 2021
Product
Faster Results and a Better Experience with New Pagination in Rockset
Rockset’s new pagination approach enables customers to query large amounts of data fast and more consistently
Rafael Kabesa
August 31, 2021
Real-Time Analytics
Product
How Rockset Enables SQL-Based Rollups for Streaming Data
Learn how Rockset enables SQL-based rollups on streaming data for complex and accurate real-time analytics.
Venkat Venkataramani
August 25, 2021
Druid
Product
Kafka
Kinesis
Real-Time Analytics
Rollups on Streaming Data: Rockset vs Apache Druid
Continuously roll up and transform streaming data from any source using SQL. Learn how rollups in Rockset compare to Apache Druid.
Vibhuti Bhushan
August 4, 2021
Snowflake
Real-Time Analytics
Streaming
How To
Slow Queries
Real-Time Data Ingestion: Snowflake, Snowpipe and Rockset
We examine the performance and cost of real-time data ingestion in Snowflake and Snowpipe as compared to Rockset.
Shawn Adams
July 29, 2021
DynamoDB
Product
Engineering
20x Faster Ingestion with Rockset's New DynamoDB Connector
Get 20x faster ingestion on DynamoDB tables with Rockset's improved connector, which uses DynamoDB's export to S3 functionality.
Purvi Desai
July 22, 2021
DynamoDB
Real-Time Analytics
Scaling Real-Time Gaming Leaderboards with DynamoDB and Rockset
Learn how DynamoDB and Rockset deliver the ultimate data stack for real-time analytics in gaming.
Julie Mills
July 19, 2021
Druid
How to Handle Nested Data in Apache Druid vs Rockset
Nested data needs to be flattened upon ingestion when using Apache Druid. We look at how to ingest and query nested data in Druid vs alternatives like Rockset.
Shawn Adams
July 15, 2021
SQL
Real-Time Analytics
Product
Real-Time Analytics with dbt + Rockset
The dbt-Rockset adapter makes it easy to perform SQL transformations for real-time analytics. Load data into Rockset and create collections by writing SQL SELECT statements in dbt.
Sam Crowder
July 8, 2021
MongoDB
5 Can't Miss MongoDB.live Talks
As we gear up for MongoDB.live on July 13-14, here are some conference talks we're looking forward to attending.
Kevin Leong
July 7, 2021
SQL
Druid
How To
How to Handle Database Joins in Apache Druid vs Rockset
This article focuses on implementing database joins in Apache Druid, explores workarounds like denormalization and examines alternative solutions like Rockset.
Shawn Adams
July 1, 2021
Developer
Create a Data API on MySQL Data with Rockset
We’ll be uploading, analyzing, and creating a data API on Airbnb data from Amazon RDS MySQL in Rockset.
Nadine Farah
June 29, 2021
Product
Production Visibility: Metrics Monitoring and Alerting
Rockset introduced console metrics and an integration for third-party monitoring tools to provide greater visibility for production workloads.
Brian Liang
June 18, 2021
Engineering
Company
My New Grad Experience at Rockset
Karen joined Rockset two years ago as a fresh CS grad. She shares highlights of her Rockset experience as a software engineer on the backend team.
Karen Li
June 17, 2021
Real-Time Analytics
The Emergence of Real-Time Analytics
Real-time analytics is now within reach of all companies from lean startups to large enterprises.
Julie Mills
June 7, 2021
Big Ideas
DynamoDB
MongoDB
MySQL
PostgreSQL
NoSQL
CDC
OLTP
Change Data Capture: What It Is and How to Use It
Change data capture (CDC) is a useful tool in many data architectures. Learn what CDC is, how it is implemented and when to use it.
Lewis Gavin
June 4, 2021
Engineering
RocksDB
Rockset Converged Index Adds Clustered Search Index for 70% Query Latency Reduction
We share how a new storage format for the search index in Rockset’s Converged Index reduced query latencies by as much as 70% and the size of the search index by about 20%.
Sandeep Dhoot
June 1, 2021
Developer
MySQL
Real-Time Analytics
Getting Started with Real-Time Analytics on MySQL Using Rockset
In this blog, we walk you through how to scale your Amazon RDS MySQL analytical workload with Rockset.
Nadine Farah
May 27, 2021
Elasticsearch
Compare and Contrast Search Indexing With Real-Time Converged Indexing
Elasticsearch and Rockset as indexing data stores for serving low latency queries.
Giovanni Tropeano
May 24, 2021
Big Ideas
What Is a Serverless Database and Why Use One
Serverless is commonly associated with functions and Lambdas, but engineering teams should also be knowledgeable about serverless databases and the benefits they provide.
Ben Rogojan
May 21, 2021
Real-Time Analytics
Use Cases
Big Ideas
Popular Use Cases for Real-Time Analytics
While real-time analytics is in demand, it’s not without its implementation challenges.
Julie Mills
May 17, 2021
Real-Time Analytics
3 Reasons Why Real-Time Analytics Is More Affordable Than You Think
If you are considering real-time analytics, here are some ways to ensure you are taking the most cost-effective approach.
Kevin Leong
May 14, 2021
SQL
Find and Replace Text with SQL Regular Expressions in Rockset
When we tried to unnest a field, we got multiple errors. Check out this blog to see how we used regex to debug the errors and replace the problematic characters!
Nadine Farah
May 13, 2021
Real-Time Analytics
SaaS Industry Trends in Real-Time Analytics
Multiple industries are seeing real-time analytics trends emerge due to customer application usage. Requirements for instant access to data are driving app development teams to invest heavily in embedded real-time analytics.
Giovanni Tropeano
May 11, 2021
Real-Time Analytics
Data Applications
Building Data Applications Powered by Real-Time Analytics
We share 3 key criteria for your real-time analytics platform that will fuel long-term success with data apps.
Shruti Bhat
May 5, 2021
Developer
Working with Mixed Data Types within a Field Using Rockset
When working with mixed field types, you’ll have to adjust your queries to take into consideration data types and values you don’t want to use. Here, we work through an example by ordering movies by release year.
Nadine Farah
April 28, 2021
Company
Engineering
Leading Design as a UX Team of 1
Aditi shares her experience leading design in Rockset’s fast-paced, developer-first environment.
Aditi Dhar
April 27, 2021
Developer
Flattening a JSON Object So It’s Queryable Using Rockset
You will often need to flatten a JSON object so you can query it. In this post, we’ll show how to do so using the UNNEST function in Rockset.
Nadine Farah
April 15, 2021
Real-Time Analytics
Product
PostgreSQL
MySQL
Powering Real-Time Analytics at Scale on MySQL and PostgreSQL
Enable sub-second, high-concurrency analytics for MySQL and PostgreSQL using Rockset for real-time external indexing.
Justin Liu
April 12, 2021
Case Study
DynamoDB
Real-Time Analytics
Elasticsearch
Data Applications
Case Study: Bringing Real-Time Analytics to Construction Logistics at Command Alkon
Command Alkon offers a SaaS application to digitize construction logistics, allowing suppliers, transportation providers and contractors on jobsites to analyze and collaborate on data in real time.
Kevin Leong
April 5, 2021
Case Study
Elasticsearch
SQL
Use Cases
Case Study: Sequoia Capital — Why We Moved from Elasticsearch to Rockset
We spoke with Sequoia’s head of engineering, Jake Quist, and VP of data science, Hem Wadhar, about their reasons for moving their internal analytics off Elasticsearch to Rockset.
Kevin Leong
March 31, 2021
Case Study
Data Applications
Snowflake
Real-Time Analytics
Case Study: Ritual’s Move to Real-Time Analytics to Personalize the Multivitamin Experience
Ritual, a health-meets-technology company, personalized the cart checkout experience, email promotions and banners using Rockset. Learn how Ritual effectively monetized new product lines with real-time analytics.
Julie Mills
March 23, 2021
Engineering
On the Pursuit of Happiness (aka Squashing 502/504 Errors)
We recount our experience hunting down, diagnosing and fixing 502 and 504 errors to improve product quality and user experience.
Hieu Pham
March 15, 2021
Elasticsearch
Elasticsearch or Rockset for Real-Time Analytics: Real-Time Ingestion and Indexing
In part 3 of our Elasticsearch and Rockset comparison, we examine how well Elasticsearch and Rockset ingest and index real-time data.
Shawn Adams
March 12, 2021
Engineering
Big Ideas
5 Tips for Recruiting Top Engineering Talent in Startups
Rockset CEO Venkat Venkataramani and engineering leaders Nimrod Hoofien (Gusto) and Adam Wolff (Robinhood) share best practices for recruiting great engineers.
Julie Mills
March 5, 2021
Big Ideas
Snowflake
Reducing Costs
Data Warehouse
Space-Time Tradeoff: Examining Snowflake's Compute Cost
In this post, we explore how developers should think about space, time, storage and compute cost as it relates to modern data analytics offerings like Snowflake and Rockset.
Shruti Bhat
February 25, 2021
Elasticsearch
Elasticsearch or Rockset for Real-Time Analytics: How Much Query Flexibility Do You Have?
In part 2 of our Elasticsearch and Rockset comparison, we take a look at query flexibility and its impact on developer productivity.
Shawn Adams
February 18, 2021
Druid
Engineering
Real-Time Analytics
Rockset Is Up to 9.4x Faster than Apache Druid on the Star Schema Benchmark
We evaluated Rockset on the Star Schema Benchmark and found up to 9.4x query runtime speedup compared to Druid. We discuss our benchmarking exercise, results and analysis in this blog post.
Kevin Leong
February 9, 2021
Real-Time Analytics
Indexing
Data Lakes
S3
Indexing Amazon S3 for Real-Time Analytics on Data Lakes
We explore how indexing Amazon S3 data can enable low-latency, high-concurrency queries for real-time analytics.
Shawn Adams
January 19, 2021
Elasticsearch
Elasticsearch or Rockset for Real-Time Analytics: Managing Clusters vs Going Serverless
In part 1 of our Elasticsearch and Rockset comparison, we explore the operational costs associated with both real-time analytics solutions.
Shawn Adams
December 22, 2020
Elasticsearch
How to Join Data in Elasticsearch vs Rockset
In this blog post, we'll look at what it takes to join data sets in Elasticsearch and in Rockset, using the same online marketplace example.
Lewis Gavin
December 17, 2020
Data Applications
Build Internal Apps in Minutes with Retool and Rockset: A Customer 360 Example
Learn how to integrate Rockset with Retool on a customer 360 sample app, using data APIs and pre-built UI components.
Ben Rogojan
December 10, 2020
Engineering
Company
What I've Learned in 2020: A Technical Version
Hieu shares thoughts on columnar databases, RocksDB, SQL engines and his year as an engineer at Rockset.
Hieu Pham
November 24, 2020
Engineering
RocksDB
Real-Time Analytics
How Rockset’s Converged Index Powers Real-Time Analytics
Rockset enables millisecond-latency queries on terabytes of data because all data ingested is indexed multiple ways in its Converged Index. Learn how the Converged Index works in this blog post.
Shawn Adams
November 19, 2020
Engineering
SQL
Smart Schema: Enabling SQL Queries on Semi-Structured Data
We explain and show how users can perform schemaless ingestion of their data and then use Rockset's Smart Schema to enable SQL queries directly on that data.
Shawn Adams
November 12, 2020
MongoDB
Elasticsearch
Real-Time Analytics
How To
Reducing Costs
NoSQL
Using Elasticsearch to Offload Real-Time Analytics from MongoDB
This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database using Elasticsearch for indexing.
Shawn Adams
October 27, 2020
Company
Real-Time Analytics
Rockset Raises $40M Series B to Empower Developers Building Real-Time Analytics
Rockset is the real-time cloud database built for modern data apps, bringing speed, scale and simplicity to developers building real-time analytics.
Venkat Venkataramani
October 27, 2020
Engineering
Company
Why I Am Joining Rockset
Nathan Bronson is joining Rockset to make real-time data infrastructure simple for users at scale.
Nathan Bronson
October 26, 2020
Case Study
PostgreSQL
Data Applications
Real-Time Analytics
Case Study: Rumble’s Real-Time Leaderboards Empower Users to Lead Healthier Lifestyles
Learn how Rockset powers Rumble's real-time leaderboards, which serve to motivate its users to keep active.
Nadine Farah
October 8, 2020
MongoDB
How To
Slow Queries
NoSQL
3 Tools to Help Debug Slow Queries in MongoDB
How can you investigate query performance issues in MongoDB? We give an overview of 3 tools available for troubleshooting slow queries in MongoDB Atlas.
Ben Rogojan
October 1, 2020
Kafka
MongoDB
Data Applications
Real-Time Analytics
Building a Real-Time Customer 360 on Kafka, MongoDB and Rockset
A step-by-step guide to building a real-time customer 360 using seconds-old purchase data from MongoDB and marketing data from Kafka.
Lewis Gavin
September 25, 2020
Slow Queries
How To
NoSQL
3 Ways to Offload Read-Heavy Applications from MongoDB
Offloading read-heavy analytics from an operational database, like MongoDB, is a common architectural pattern. This post examines 3 options for offloading MongoDB to a secondary system.
Ben Rogojan
September 15, 2020
Real-Time Analytics
Engineering
Rockset: 1 Billion Events in a Day with 1-Second Data Latency
This post introduces RockBench, a benchmark for measuring the data latency of real-time databases.
Dhruba Borthakur
September 3, 2020
MongoDB
PostgreSQL
Offload Real-Time Reporting and Analytics from MongoDB Using PostgreSQL
This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database to PostgreSQL.
Shawn Adams
August 27, 2020
DynamoDB
Case Study
Case Study: Matter Uses Rockset to Bring AI-Powered Sustainable Insights to Investors
With Rockset, Danish fintech Matter has the flexibility to run analytical queries on semi-structured data in S3 and DynamoDB as part of their NLP architecture.
Alexander Harrington
August 25, 2020
MongoDB
How To
Slow Queries
NoSQL
Handling Slow Queries in MongoDB - Part 2: Solutions
We discuss the advantages and disadvantages of various strategies for improving the performance of our MongoDB database
Justin Liu
August 20, 2020
Developer
Product
Announcing the New Rockset Developer Tools
We released Rockset Developer Tools, including a new CLI tool and a new VS Code extension, to make it easier to develop real-time data applications on Rockset.
Tanmay Chordia
August 18, 2020
Real-Time Analytics
Changing face of real-time analytics
We explore the continuum of real-time analytics, from live, interactive dashboards to online applications that automatically take action on real-time data.
Shruti Bhat
August 13, 2020
Big Ideas
The Future is Serverless: What About Your Data Stack?
Serverless architectures offer ease of use and cost advantages. We explore what serverless means for your data stack.
Shruti Bhat
August 11, 2020
Real-Time Analytics
Analytics-on-the-fly: from batch to real-time user engagement
Companies need to embrace real-time analytics to compete and survive. Only those that have invested in a real-time data stack will thrive.
Dhruba Borthakur
August 10, 2020
Real-Time Analytics
Rapid Experimentation and Growth Using Real-Time Analytics
Learn how to build for the requirements of a massive-scale A/B experiments platform.
Venkat Venkataramani
August 10, 2020
Case Study
DynamoDB
Real-Time Analytics
Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data
eGoGames improves user experience, detects fraud, and makes business decisions using Rockset for real-time analytics on gaming data in Amazon DynamoDB and S3.
Kevin Leong
August 7, 2020
MongoDB
Slow Queries
How To
NoSQL
Handling Slow Queries in MongoDB - Part 1: Investigation
Explore various methods of identifying slow queries on MongoDB and understand how to improve them.
Justin Liu
July 29, 2020
MongoDB
Performance Isolation for Your Primary MongoDB Cluster
Performance of your primary MongoDB cluster is crucial. We look at how using multiple MongoDB clusters can help with performance isolation.
Dai Shi
July 23, 2020
MongoDB
How To
Slow Queries
NoSQL
Improving MongoDB Read Performance - Indexing, Replication and Sharding
Real-time analytics demands low-latency complex queries. Learn how to speed up read performance by indexing, replication and sharding in MongoDB.
Shawn Adams
July 21, 2020
Big Ideas
Real-Time Analytics
Lessons from Scaling Facebook's Online Data Infrastructure
Lessons on building real-time data architectures based on experiences growing Facebook users 30x, from 50 million to 1.5 billion.
Venkat Venkataramani
July 16, 2020
MongoDB
Engineering
Indexing on MongoDB Using Rockset - How It Works
An in-depth look at indexing MongoDB data in Rockset and how it compares to indexing in MongoDB itself.
Ben Hannel
July 14, 2020
Case Study
MongoDB
Case Study: StoryFire - Scaling a Social Video Platform on MongoDB and Rockset
Learn how StoryFire uses Rockset to index data from their transactional MongoDB database to achieve performance and scale.
Ben Hagan
July 8, 2020
DynamoDB
Kafka
Data Applications
Real-Time Analytics
Designing a Real-Time ETA Prediction System Using Kafka, DynamoDB and Rockset
Generate ETA predictions for a delivery service using real-time location and order data from Kafka and DynamoDB.
Kartik Khare
June 23, 2020
MongoDB
Data Applications
Real-Time Analytics
Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset
Implementing a real-time recommendations API for an event ticketing system by indexing MongoDB data in Rockset for fast SQL.
Lewis Gavin
June 16, 2020
MongoDB
Product
JOINs and Aggregations Using Real-Time Indexing on MongoDB Atlas
We explore how real-time indexing on MongoDB enables fast aggregation and join queries, and how Rockset is specifically designed to meet real-time indexing requirements.
Kevin Leong
June 9, 2020
MongoDB
MongoDB Performance Tuning - Top 5 Resources
A compilation of MongoDB performance tuning resources, covering topics such as sharding, indexing, schema design and performance isolation.
Kevin Leong
June 4, 2020
RocksDB
Engineering
Big Ideas
Remote Compactions in RocksDB-Cloud
We modified RocksDB-Cloud to allow remote compactions in order to optimize RocksDB for cloud environments.
Hieu Pham
June 2, 2020
MongoDB
Top 10 sessions for MongoDB.live 2020
Sessions to look forward to for MongoDB.live 2020
Nadine Farah
May 19, 2020
MongoDB
Create APIs for Aggregations and Joins on MongoDB in Under 15 Minutes
Build a Python application to create and execute APIs on aggregations and joins using Rockset and MongoDB.
Nadine Farah
May 6, 2020
MongoDB
Engineering
Elasticsearch
Using MongoDB Change Streams for Indexing with Elasticsearch vs Rockset
Learn how Rockset indexes data from MongoDB change data capture (CDC) streams and how it compares to indexing in Elasticsearch.
Kshitij Wadhwa
April 28, 2020
Engineering
Index Scan: Using Rockset's Search Index to Speed up Range Scans Over a Specific Field
Rockset uses Converged Indexing to make different types of queries run fast. We look at how Rockset's Index Scan uses the search index to accelerate range scans.
Karen Li
April 3, 2020
DynamoDB
Case Study
Dashboards
IoT
Case Study: Fleet Management System – An End-to-End Streaming Data Pipeline
This post outlines a fleet management solution using IoT and data technologies, such as DynamoDB, AWS IoT Core, AWS Lambda, and Rockset.
Abhijeet Upadhyay
March 19, 2020
Kafka
Real-Time Analytics
How To
Streaming
How to Use KSQL Stream Processing and Real-Time Databases to Analyze Streaming Data in Kafka
We discuss when stream processing, with KSQL and Kafka Streams, and when a real-time database like Rockset are best used for analyzing Kafka data.
Ari Ekmekji
March 12, 2020
Developer
Product
Query Lambdas: Increasing Developer Velocity for Application Development
We’re now proud to release a new product feature - Query Lambdas - that similarly rethinks the data application development workflow.
Scott Morris
March 5, 2020
Kafka
Best Practices for Analyzing Kafka Event Streams
What are the key considerations when selecting an analytics stack for building data applications on Kafka event streams?
Kevin Leong
February 28, 2020
MongoDB
Product
Real-Time External Indexing For Aggregations and Joins on MongoDB Collections
This is a tech preview of an integration that will allow you to index your MongoDB data in row, column and inverted indexes, and run millisecond-latency SQL queries in real-time.
Shruti Bhat
February 14, 2020
Kafka
Data Applications
IoT
Dashboards
Streaming
Use Cases
Where's My Tesla? Creating a Data API Using Kafka, Rockset and Postman to Find Out
We demonstrate how to expose real-time IoT data in Kafka through the Rockset REST API in this example.
Lewis Gavin
February 7, 2020
Kafka
Dashboards
Real-Time Analytics
IoT
Use Cases
Streaming
Real-Time Analytics on Connected Car IoT Data Streams from Apache Kafka
In this IoT example, we examine how to enable complex analytic queries on real-time Kafka streams from connected car sensors.
Shawn Adams
January 28, 2020
Case Study
Data Applications
Case Study: Standard Cognition Uses Rockset to Deliver Data APIs and Real-Time Metrics for Vision AI
Standard Cognition, an AI-powered computer vision company, uses Rockset to enable their developers to deliver data APIs and product improvements.
Kevin Leong
January 23, 2020
RocksDB
Big Ideas
RocksDB Is Eating the Database World
An overview of what makes RocksDB well-suited to power many of the world's high-performance distributed data systems.
Ethan Hamilton
January 17, 2020
Kafka
SQL
Real-Time Analytics
Data Applications
SQL API for Real-Time Kafka Analytics in 3 Steps
Learn how to create a SQL API for real-time Kafka analytics on the Twitter Streaming API, using AWS Lambda and Rockset.
Tanmay Chordia
January 10, 2020
DynamoDB
Joining Data in DynamoDB and S3 for Live, Ad-Hoc Analysis
Using SQL to join DynamoDB and S3 data, operations teams can perform live, ad-hoc analysis across multiple cloud systems.
Ben Rogojan
December 9, 2019
Big Ideas
What Data Engineers Think About - Variety, Volume, Velocity and Real-Time Analytics
Data engineers are often tasked with moving and preparing data to facilitate analytics. This guest post examines several considerations for data engineers designing for real-time analytics.
Lewis Gavin
November 6, 2019
Kafka
Elasticsearch
Druid
Analytics on Kafka Event Streams Using Druid, Elasticsearch and Rockset
We discuss how different data backends - Druid, Elasticsearch and Rockset - can be used alongside Kafka for analytics on event data streams.
Anirudh Ramanathan
October 21, 2019
Engineering
Company
The Role of UX in Making Rockset the Shortest Path from Data to Applications
Learn how our UX team continually improves common user workflows in Rockset to simplify development of data-driven applications.
Aditi Dhar
October 10, 2019
Kafka
Dashboards
Real-Time Analytics
How To
Streaming
Using Tableau with Kafka: How to Build a Real-Time SQL Dashboard on Streaming Data
Build a real-time Tableau dashboard for operational monitoring and analytics on streaming event data from Kafka.
Scott Morris
October 1, 2019
Engineering
Dashboards
Big Ideas
Use Cases
How We Analyze and Visualize Kubernetes Events in Real Time at Rockset
Learn how we rolled our own tool for analysis and visualization of Kubernetes events, and try the open-source dashboard for yourself.
Rui Aguiar
September 20, 2019
Engineering
Outside Lands, Airbnb Prices, and Rockset’s Geospatial Queries
How to use Rockset's fast geospatial indexes with Airbnb data.
Ben Hannel
September 13, 2019
Dashboards
Engineering
Grafana Time-Series Dashboards with the Rockset-Grafana Plugin
How Rockset uses Grafana dashboards for monitoring production systems, Kubernetes, and GitHub metrics, and how we built a Rockset-Grafana plugin.
Rui Aguiar
September 6, 2019
Kafka
Real-Time Analytics
Real-Time Analytics in the World of Virtual Reality and Live Streaming
An architecture for real-time decision-making and live dashboards on VR data in Kafka, coming from live-streamed events.
Sebastian Zangaro
August 29, 2019
DynamoDB
Dashboards
Using Tableau with DynamoDB: How to Build a Real-Time SQL Dashboard on NoSQL Data
We create an example dashboard in Tableau on data in DynamoDB, using Rockset as the SQL intelligence layer.
Vahid Fazel-Rezai
August 27, 2019
DynamoDB
Reducing Costs
NoSQL
3 cost-cutting tips for Amazon DynamoDB
How to avoid costly mistakes with DynamoDB partition keys, read/write capacity modes, and global secondary indexes.
Anirudh Ramanathan
August 23, 2019
DynamoDB
Engineering
How We Reduced DynamoDB Costs by Using DynamoDB Streams and Scans More Efficiently
Get an inside look at some of the techniques we used to reduce the cost of ingesting data from DynamoDB.
Aditi Srinivasan
August 21, 2019
Engineering
RocksDB
Big Ideas
Optimizing Bulk Load in RocksDB
Discover an effective technique for quickly loading data into RocksDB.
Igor Canadi
August 21, 2019
Engineering
Kafka
The Kafka Connect Plugin for Rockset and How It Works
Get an in-depth look at the Kafka Connect Plugin for Rockset and the process to get it listed in Confluent Hub.
Jacob Klegar
August 16, 2019
Data Applications
Data-Driven Decisions for Where to Park in SF
We built an app to estimate the risk of a car break-in based on historical incidents.
Vahid Fazel-Rezai
August 13, 2019
Dashboards
DynamoDB
Tableau Operational Dashboards and Reporting on DynamoDB - Evaluating Redshift and Athena
We review several approaches to building Tableau operational dashboards and reporting on DynamoDB data, using SQL engines like Redshift and Athena.
Ari Ekmekji
August 12, 2019
DynamoDB
Real-Time Analytics
Dashboards
How To
Use Cases
NoSQL
Real-Time Analytics on DynamoDB - Using DynamoDB Streams with Lambda and ElastiCache
We cover different approaches to real-time analytics on DynamoDB, using DynamoDB Streams, Lambda, and ElastiCache.
Ari Ekmekji
July 30, 2019
Real-Time Analytics
From Good to Great: How Operational Analytics Gives Businesses a Real-Time Edge
All businesses today are a series of real-time events. But what separates the good from the great is how they capture and operationalize that data.
Shruti Bhat
July 25, 2019
Real-Time Analytics
Use Cases
Big Ideas
Operational Analytics: What every software engineer should know about low-latency queries on large data sets
What are the characteristics of an Operational Analytics processing system, and how does it differ from OLTP, OLAP and other data systems?
Dhruba Borthakur
July 18, 2019
Engineering
SQL
SQL Query Planning for Operational Analytics
We discuss how SQL query planning is implemented to support operational analytics requirements, like low latency and high concurrency, in Rockset.
Purvi Desai
July 9, 2019
MySQL
PostgreSQL
SQL
Methods for Running SQL on JSON in PostgreSQL, MySQL and Other Relational Databases
We examine various options for running SQL on JSON in relational databases, like PostgreSQL and MySQL, and in Rockset.
Shawn Adams
June 27, 2019
Engineering
RocksDB
Big Ideas
How We Use RocksDB at Rockset
This blog post describes how we use RocksDB at Rockset and how we tuned it for optimal performance.
Sandeep Dhoot
June 13, 2019
Product
Building a SQL Development Environment for Messy, Semi-Structured Data
Learn how and why Rockset developed a new SQL development environment for messy, semi-structured data.
Scott Morris
June 6, 2019
Engineering
IValue: efficient representation of dynamic types in C++
This post shows one of many challenges that we encountered while building a fully dynamically typed SQL database: how we manipulate values of unknown types in our query execution backend, while approaching the performance of using native types directly.
Tudor Bosman
May 31, 2019
Dashboards
Real-Time Analytics
Using Tableau for Live Dashboards on Event Data
Connect a Tableau live dashboard to a real-time event stream of complex JSON in a few easy steps.
Haneesh Reddy Poddutoori
May 24, 2019
Case Study
DynamoDB
Dashboards
Real-Time Analytics
Case Study: FULL Uses Rockset with DynamoDB for Live Dashboard to Manage Remote Workforce
FULL Creative uses Rockset to build live dashboards and run complex SQL on contact center call data in DynamoDB.
Kevin Leong
May 23, 2019
Engineering
Product
Indexing
Big Ideas
Real-Time Analytics
Converged Index™: The Secret Sauce Behind Rockset's Fast Queries
Learn how Rockset delivers low-latency SQL for search and analytics using compute-efficient indexing.
Igor Canadi
May 17, 2019
Data Applications
Building a Serverless Analytics App to Capture and Query Clickstream Data
We built a web app that collects clickstream data as free-form JSON and runs SQL queries on the live data in a completely serverless fashion. We also seek to answer age-old questions besetting developers: tabs or spaces, vim or emacs?
Vahid Fazel-Rezai
May 17, 2019
Big Ideas
Developer Pulse: 5 Things Developers Love
When the existential question of spaces vs. tabs came up in our team, we ran a real-time survey to collect thousands of data points around it. We also wanted to settle the debate around other developer issues like SQL vs NoSQL.
Shruti Bhat
May 6, 2019
Case Study
DynamoDB
Data Applications
Case Study: Decore Uses Rockset for Search & Analytics on DynamoDB
Decore needed to enable ad hoc queries in their crypto accounting software service, so they turned to Rockset for fast analytics on DynamoDB.
Kevin Leong
April 29, 2019
DynamoDB
NoSQL
Data Warehouse
SQL
Elasticsearch
Analytics on DynamoDB: Comparing Elasticsearch, Athena and Spark
We compare options for real-time analytics on DynamoDB - Elasticsearch, Athena, and Spark - in terms of ease of setup, maintenance, query capability, and latency.
Anirudh Ramanathan
April 29, 2019
DynamoDB
Secondary Indexes For Analytics On DynamoDB
Learn how to support analytical queries on DynamoDB without prohibitive scan costs - using secondary indexes.
Anirudh Ramanathan
March 27, 2019
Product
SQL
From Schemaless Ingest to Smart Schema: Enabling SQL on Raw Data
Rockset's schemaless SQL platform automatically infers schema at read time, allowing you to analyze messy data using SQL.
Purvi Desai
March 21, 2019
Big Ideas
Company
Product
Serverless Data Management: A SQL Search and Analytics Engine
Designed from the ground up for serverless data management, Rockset makes SQL search and analytics simple and accessible.
Venkat Venkataramani
March 19, 2019
Case Study
IoT
Case Study: Implementing Real-Time IoT Analytics Simply and Efficiently - An MIT Smart City Project
An MIT team collaborates with a school in Brazil on a smart city project to analyze weather sensor data using Rockset.
Kevin Leong
March 19, 2019
Case Study
Kafka
Dashboards
Case Study: Fynd Uses Kafka and Rockset to Respond to E-Commerce Consumer Behavior in Real Time
Fynd uses Rockset to perform fast queries on real-time Kafka event streams, so they can react to consumer behavior as it happens.
Kevin Leong
March 19, 2019
Case Study
Case Study: The Path to Better Pollution Forecasting Goes Through Nested JSON
Pittsburgh-based developer Doug Balog collects and analyzes nested JSON weather data to improve pollution forecasts in his community.
Kevin Leong
February 28, 2019
Data Applications
How to Build a Facebook Messenger Chatbot Powered by Fast SQL on CSV
Build a chatbot that provides instant responses, leveraging fast SQL queries on CSV data.
Kshitij Wadhwa
February 21, 2019
SQL
Product
Using Smart Schema to Accelerate Insights from Nested JSON
Use Rockset's Smart Schema to understand complex, nested JSON and enable immediate queries using SQL on raw data.
Purvi Desai
February 21, 2019
Product
SQL
How to Run SQL on PDF Files
Run SQL queries on data from PDF files, and join PDFs with JSON, CSV, XLSX, and other data.
Kshitij Wadhwa
February 13, 2019
Engineering
Company
Distributed Aggregation Queries - A Rockset Intern Story
Rockset distributes aggregation queries to reduce query latency and memory requirements. This was an intern project by Ashwath, Rockset's first-ever intern.
Ashwath Thirumalai
February 6, 2019
Engineering
Aggregator Leaf Tailer: An Alternative to Lambda Architecture for Real-Time Analytics
The Aggregator Leaf Tailer architecture takes advantage of powerful indexing and cloud scalability to enable live analytics on real-time event streams.
Dhruba Borthakur
January 23, 2019
DynamoDB
SQL
Running Fast SQL on DynamoDB Tables
Run fast SQL queries on data from DynamoDB tables by continuously ingesting and indexing DynamoDB data through a Rockset-DynamoDB integration.
Kshitij Wadhwa
January 23, 2019
Dashboards
Product
Live Dashboards with Redash and Rockset
Build live dashboards by connecting Redash to Rockset to create visualizations quickly and easily.
Igor Canadi
January 21, 2019
SQL
Product
Rockset adds Excel spreadsheet support: Use SQL across XLSX files and join with other JSON, CSV or Parquet data
Run complex SQL across multiple Excel spreadsheets and join XLSX files with JSON, Parquet or CSV data.
Shruti Bhat
January 16, 2019
Kafka
Real-Time Analytics
SQL
Real-Time Analytics Using SQL on Streaming Data with Apache Kafka and Rockset
Connect Kafka and Rockset to obtain real-time analytics with ad hoc SQL queries on event streams.
Shawn Adams
January 10, 2019
Product
SQL
How to Do Data Science Using SQL on Raw JSON
Learn how to query nested JSON and CSV using SQL (including joins), without any upfront data preparation or complex data pipelines.
Anirudh Ramanathan
January 8, 2019
Kinesis
Data Applications
Building a Serverless Microservice Using Rockset and AWS Lambda
Build serverless microservices, data APIs, and data-driven applications. Use SQL to join and query JSON and CSV data using AWS Lambda and Rockset.
Kevin Leong
December 20, 2018
Dashboards
Kinesis
Real-Time Analytics
Live Dashboards on Streaming Data - A Tutorial Using Amazon Kinesis and Rockset
Serve a live dashboard using SQL on streaming Twitter data from Amazon Kinesis.
Haneesh Reddy Poddutoori
December 7, 2018
SQL
Product
Running SQL on Nested JSON
Make raw JSON immediately queryable through fast SQL queries, without ETL, data pipelines, or fixed schema.
Anirudh Ramanathan
November 7, 2018
Engineering
RocksDB
Rockset's RocksDB-Cloud Library - Enabling the Next Generation of Cloud Native Databases
David Cohen, System Architect at Intel, explores how RocksDB-Cloud can be used to build an open-source cloud-friendly storage system.
David Cohen
November 1, 2018
Engineering
SQL
Big Ideas
Dynamic Typing in SQL
Rockset Chief Architect Tudor Bosman discusses strong dynamic typing in SQL, and how it is implemented in Rockset.
Tudor Bosman
November 1, 2018
Big Ideas
SQL
Why SQL on Raw Data?
SQL on unstructured data is hard. But storage and compute in the cloud are making SQL on raw data a reality.
Peter Bailis
October 30, 2018
Big Ideas
Cloud Native: What It Means in the Data World
Rockset CTO and co-founder Dhruba Borthakur discusses what Cloud-Native data processing entails, and how best to build for the cloud today.
Dhruba Borthakur
October 19, 2018
Big Ideas
The Road Ahead: From Open Source to Open Services
Rockset CTO and co-founder Dhruba Borthakur discusses the shift from Open Source to Open Services in data infrastructure, and how Open Services will become the new standard.
Dhruba Borthakur