June 21, 2024
Company
OpenAI Acquires Rockset
We are thrilled to join the OpenAI team and bring our technology and expertise to building safe and beneficial AGI.
Venkat Venkataramani
Follow our stories and unique insights.
May 21, 2024
How To
Use Cases
How to Build a Chatbot Using Retrieval Augmented Generation (RAG)
Discover how to build a Chatbot using RAG with Rockset as a vector database and OpenAI's GPT-4 as the LLM.
Ankit Khare
April 22, 2024
How To
Use Cases
How to Build a Recommender System using Rockset and OpenAI Embedding Models
Discover how to build a recommender system using Rockset as a vector database and OpenAI embeddings. This tutorial covers creating a dynamic web app with CSS, HTML, Js, and Flask, integrating Rockset and OpenAI APIs for a robust recommendation system.
Ankit Khare
April 2, 2024
Product
Reducing Costs
How We Optimized Rockset's Hot Storage Tier to Improve Efficiency By More Than 200%
Rockset’s new tiered pricing is as low as $0.13/GB-month, making real-time data more affordable than ever before.
Rafael Kabesa
March 27, 2024
Dashboards
How To
Explo and Rockset One-Click Integration for Real-Time Embedded Analytics
Rockset users can integrate with Explo to provide their customers a quality embedded analytics experience. In this article, we step through how to integrate Rockset with Explo to create charts and dashboards in your applications.
Brian Bakerman
March 18, 2024
Indexing
Streaming
Kafka
How To
Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset
We discuss how RAG fits into the paradigm of real-time data processing and show an example product recommendation application using both Kafka and Flink on Confluent Cloud together with Rockset.
Julie Mills
March 15, 2024
Engineering
Profiling Individual Queries in a Concurrent System
This blog introduces trampoline histories, a technique Rockset has developed to efficiently attach application-level information (query IDs) to the samples of a CPU profile.
Nathan Bronson
February 22, 2024
DynamoDB
Indexing
Understanding DynamoDB Secondary Indexes
Discover the challenges secondary indexes solve in DynamoDB, including the optimal circumstances and methods for their effective application.
Alex DeBrie
February 16, 2024
Kafka
Case Study
Dashboards
Real-Time Analytics
Streaming
How Klarna Scales Buy Now Pay Later with Real-Time Anomaly Detection
With Rockset, Klarna was able to identify and alert teams to issues with partner and merchant integrations in real time, saving the company millions of dollars.
Julie Mills
January 31, 2024
Product
Real-Time Analytics
Reducing Costs
Rockset Ushers in the New Era of Search and AI with a 30% Lower Price
Rockset releases the general purpose instance class, autoscaling, microbatching and incremental materializations to make search and analytics applications more affordable than ever before.
Julie Mills
January 23, 2024
Developer
Elasticsearch
CDC
Streaming
How to Update Documents in Elasticsearch
A walk through of the the different options available for updates in Elasticsearch, including full updates, partial updates and scripted updates.
Shawn Adams
January 19, 2024
Product
How To
SQL
Mutable Data in Rockset
We explore the concept of data mutability in Rockset and cover examples demonstrating how to manipulate Rockset data using SQL.
Luka Lovosevic
December 21, 2023
Elasticsearch
SQL
Choosing Between Nested Queries and Parent-Child Relationships in Elasticsearch
In this blog, we’ll discuss how you can design your data model in Elasticsearch to handle relationships using the nested field type and parent-child relationships.
Julie Mills
December 19, 2023
RocksDB
Use Cases
How To
A Blueprint for a Real-World Recommendation System
A comprehensive exploration of the general blueprint of modern recommendation systems, this guide focuses on the intricate details of each stage and delves deeply into the infrastructure challenges involved in building these systems.
Ankit Khare
December 14, 2023
Product
Using Query Logs in Rockset
Learn how query logs are implemented in Rockset and how they can greater visibility into your queries.
Julius Hochmuth
December 1, 2023
How To
How to Do Load Testing with Rockset
This blog discusses the motivation behind load testing and provides a step-by-step guide to performing load testing on Rockset.
Luka Lovosevic
November 7, 2023
Product
Elasticsearch
Indexing
How Rockset Built Vector Search for Scale in the Cloud
Learn how Rockset built similarity indexes using FAISS-IVF that are memory-efficient and optimized for immediate insertion and recall.
Julie Mills
November 6, 2023
Company
Celebrating Engineering Innovation at Index Conference 2023
A recap of the first edition of Index, the conference for engineers building search, analytics and AI applications at scale.
Kevin Leong
October 31, 2023
Product
Customer-Managed Encryption Keys in Rockset
Learn how you can use customer-managed encryption keys, also called bring your own key, in Rockset.
Esteban Talavera
October 26, 2023
Case Study
Indexing
JetBlue Scales Real-Time AI on Rockset
"Iteration and the speed of new ML products was the most important to us. With Rockset, we found a database that could keep up with the fast pace of innovation at JetBlue," says Sai Ravuru, Senior Manager of Data Science and Analytics at JetBlue.
Julie Mills
October 17, 2023
Product
Creating and Restoring from Snapshots in Rockset
Understand how snapshots work in Rockset, when to use them and how users can create and restore from snapshots in the console.
Yashwanth Nannapaneni
October 13, 2023
Big Ideas
Introduction to Semantic Search: Embeddings, Similarity Metrics and Vector Databases
What does it take to implement semantic search? This article explains vector embeddings, nearest neighbor search and what to look for in a vector database.
M.Joel Dubinko
October 4, 2023
Elasticsearch
Elasticsearch Reindexing: When to Reindex, Best Practices and Alternatives
Reindexing in Elasticsearch is often necessary to handle changing data or improve performance. Understand situations when reindexing is required, guidance for performing a reindex, and alternatives to reindexing.
Lewis Gavin
September 26, 2023
Data Applications
Elasticsearch
Kafka
Streaming
Indexing
Case Study
Real-time AI: Live Recommendations Using Confluent and Rockset
We discuss using Confluent Cloud’s data streaming platform and Rockset’s vector search capabilities to power real-time AI applications.
Kevin Leong
September 19, 2023
Engineering
Performance
4x Faster Search Query Performance with Rockset’s Row Store Cache
The Rocket engineering team implemented a RowStoreCache to improve search performance after seeing an opportunity to speed up the fetching of values from the row store.
Nithin Venkatesh
September 12, 2023
Big Ideas
Introduction to Semantic Search: From Keyword to Vector Search
This article provides a brief history of semantic search, covering the evolution of search from keyword to vectors.
M.Joel Dubinko
September 11, 2023
Elasticsearch
SQL
How To
Can I Do SQL-Style Joins in Elasticsearch?
We explore how to perform the equivalent of SQL joins when using Elasticsearch. While joins are primarily an SQL concept, they are equally important in NoSQL
Shawn Adams
August 29, 2023
Big Ideas
Company
Elasticsearch
Redefining Search and Analytics for the AI Era
Rockset is on a mission to bring the power of search and AI to every digital disruptor in the world. Today, we are thrilled to announce a major milestone in our journey towards redefining search and analytics for the AI era.
Venkat Venkataramani
August 28, 2023
Product
5 Tasks You Can Automate in Rockset Using Scheduled Query Lambdas
Scheduled Query Lambdas are a useful feature in Rockset, allowing users to automate alerts, view creation, exports and more.
Luka Lovosevic
August 28, 2023
Big Ideas
Indexing
6 Hard Problems Scaling Vector Search
You’ve decided to use vector search in your application. Almost immediately upon productionizing vector search, you will run into hard and potentially unanticipated difficulties. This blog attempts to arm you with some knowledge of your future.
Louis Brandy
August 2, 2023
Case Study
Snowflake
Real-Time Analytics
Data Applications
How Windward Built Real-Time Logistics Tracking and AI Insights for the Maritime Industry
Learn how Windward built a real-time data platform that enables rapid innovation in AI for the maritime industry.
Julie Mills
June 12, 2023
Elasticsearch
DynamoDB
Case Study
Snowflake
Use Cases
Performance
Real-Time Clinical Trial Monitoring at Clinical ink
How Clinical ink built a real-time 360-degree view of patients and their outcomes across global clinical trials by migrating from Opensearch to Rockset for DynamoDB indexing.
Alex Doan
June 8, 2023
Engineering
Kafka
Kinesis
Streaming
Performance
CDC
When Real-Time Matters: Rockset Delivers 70ms Data Latency at 20MB/s Streaming Ingest
We’re often asked how low we’re capable of pushing our end-to-end data latency, i.e. the time it takes to receive data, index it, and make it available for querying. To answer this question, we ran a benchmark to push data latency as low as we could.
John Solitario
June 8, 2023
DynamoDB
Elasticsearch
Indexing
A Guide to DynamoDB Secondary Indexes: GSI, LSI, Elasticsearch and Rockset
Secondary indexing is a common strategy to boost search and analytics performance in DynamoDB. In this guide, we discuss the pros and cons of using DynamoDB GSIs and LSIs along with external secondary indexes such as Elasticsearch and Rockset.
Kevin Leong
June 6, 2023
Engineering
RocksDB
How Rockset Separates Compute and Storage Using RocksDB
We describe how Rockset achieves compute-storage separation without performance degradation.
Esteban Talavera
May 31, 2023
Performance
Engineering
May the Speed Be with You: 20K QPS on Rockset
We ran a 20K QPS workload on Rockset while ingesting data at 10MB/s and maintaining query latency at 200ms in a recent customer engagement. Read more about how Rockset achieved this scale and performance.
Purvi Desai
May 8, 2023
Use Cases
Real-Time Analytics
Indexing
5 Use Cases for Vector Search
In this blog, we capture engineering stories from 5 early adopters of vector search- Pinterest, Spotify, eBay, Airbnb and Doordash- who have integrated AI into their applications.
Julie Mills
May 3, 2023
Elasticsearch
Real-Time Analytics
Performance
Streaming
Benchmarking Elasticsearch and Rockset: Rockset achieves up to 4X faster streaming data ingestion
We evaluated Elasticsearch and Rockset streaming ingestion performance on throughput and latency. In this blog, we walk through the benchmark framework, configuration and results.
Julie Mills
April 27, 2023
Dashboards
Data Applications
Developer
Engineering
IoT
Kafka
Kinesis
Real-Time Analytics
Snowflake
SQL
Use Cases
How To
Reducing Costs
Streaming
Three Reference Architectures for Real-Time Analytics On Streaming Data
In part three of "Making Sense of Real-Time Analytics On Streaming Data", we provide reference architectures for anomaly detection, IoT, and recommendation systems.
Scott Dwyer
April 18, 2023
Big Ideas
Product
Real-Time Analytics
Introducing Vector Search on Rockset: How to run semantic search with OpenAI and Rockset
We’re excited to introduce vector search on Rockset to power fast and efficient search experiences, personalization engines, fraud detection systems and more.
John Solitario
April 17, 2023
Big Ideas
Developer
Product
Real-Time Analytics
Rockset and Feast Feature Store for Real-Time Machine Learning
To better serve real-time machine learning, Rockset integrates with the Feast Feature Store which acts as a centralized platform for deploying, monitoring and managing production ML features.
Daniel Lin
April 11, 2023
RocksDB
Engineering
Real-Time Analytics
Tech Overview of Compute-Compute Separation- A New Cloud Architecture for Real-Time Analytics
The high-level implementation of compute-compute separation, a new cloud architecture with multiple, isolated clusters for ingest compute and query compute on shared real-time data.
Julie Mills
March 28, 2023
Big Ideas
Data Applications
Developer
Druid
Elasticsearch
Engineering
Kafka
Kinesis
Real-Time Analytics
Streaming
Stream Processing vs. Real-Time Analytics Databases
Learn about conceptual differences between stream processing and RTA databases and develop a framework for choosing the right tool. .
Scott Dwyer
March 27, 2023
Data Applications
Engineering
Kafka
PostgreSQL
Real-Time Analytics
CDC
How To
Real-Time CDC With Rockset And Confluent Cloud
Learn how Rockset and Confluent Cloud provide a real-time CDC analytics pipeline that requires zero code and zero infrastructure to manage.
Patrick Druley
March 9, 2023
Developer
Engineering
web3
Use Cases
How To
How To Query The Ethereum Blockchain
Learn how to query Ethereum data using clients, RPC node providers, and using SQL queries on public datasets.
Justin Liu
March 1, 2023
Real-Time Analytics
RocksDB
Engineering
Introducing Compute-Compute Separation for Real-Time Analytics
Rockset unveils compute-compute separation that eliminates the challenge of compute contention and makes it possible to build efficient, reliable real-time applications at massive scale.
Venkat Venkataramani
March 1, 2023
Real-Time Analytics
Product
How To
Data Applications
A Breakthrough Architecture for Real-Time Analytics- An Overview of Compute-Compute Separation in Rockset
Rockset introduces a new architecture that enables separate virtual instances to isolate streaming ingestion from queries and one application from another.
Rafael Kabesa
February 25, 2023
Use Cases
Streaming
How To
Real-Time Analytics
Kinesis
Kafka
Making Sense of Real-Time Analytics on Streaming Data: The Landscape
This blog series will help demystify streaming data and provide engineering leaders a guide for incorporating streaming data into their analytics pipelines.
Scott Dwyer
February 9, 2023
DynamoDB
How To
Using DynamoDB Single-Table Design with Rockset
Single-table design is a popular data modeling technique in DynamoDB. We present several options for performing real-time analytics on single-table models using Rockset.
Tyler Denton
February 8, 2023
Real-Time Analytics
Druid
ClickHouse
Top Real-Time Analytics Databases in 2023: Rockset, Apache Druid, ClickHouse and Pinot
Learn how Rockset, Druid, ClickHouse and Pinot compare for real-time analytics in real-world use cases.
Shruti Bhat
January 31, 2023
Case Study
S3
Developing Global Labor Market Intelligence at SkyHive Using Rockset and Databricks
SkyHive builds a platform for labor market intelligence, using Databricks for ML processing and Rockset to serve their user-facing application.
Mohan Reddy
January 26, 2023
How To
How to Use Terraform with Rockset
Learn how Terraform can be used to automate the configuration and deployment of Rockset resources.
Martin Englund
January 11, 2023
DynamoDB
Elasticsearch
Real-Time Analytics
Using Elasticsearch to Offload Search and Analytics from DynamoDB
A walkthrough of how to offload text search, complex filters and aggregations from DynamoDB to Elasticsearch.
Julie Mills
January 9, 2023
ClickHouse
Case Study
Snowflake
Dashboards
MongoDB
Scaling Our SaaS Sales Training Platform with Real-Time Analytics from Rockset
As users and data volumes grew, ConveYour needed to scale their customer-facing dashboards. Learn how their developer team achieved scalability, concurrency and low ops using Rockset.
Stephen Rhyne
January 3, 2023
Big Ideas
Real-Time Analytics
Streaming
Real-Time Data Predictions for 2023
This blog compiles real-time data predictions from industry leaders so you know what’s coming in 2023.
Julie Mills
January 1, 2023
DynamoDB
Use Cases
NoSQL
5 Use Cases for DynamoDB in 2023
This guest post lays out the benefits of using DynamoDB, including 5 real-life examples, along with recommendations for performing analytics on DynamoDB data
Ben Rogojan
December 27, 2022
Elasticsearch
Developer
How to Solve 4 Elasticsearch Performance Challenges at Scale
We walk through solutions to common Elasticsearch performance challenges at scale including slow indexing, search speed, shard and index sizing, and multi-tenancy.
Julie Mills
December 14, 2022
Kafka
Streaming
Using the Amazon MSK Native Connector to Simplify Real-Time Analytics on Kafka
Rockset's native connector allows users to easily ingest and query streaming data from Amazon MSK, Amazon's managed Kafka service.
Avi Shah
December 13, 2022
Developer
An Open-Source Go Module to Secure the Command Line Using the OAuth2 Device Authorization Flow
We show you how we implemented a Go module that secures the CLI using an OAuth2 device authorization flow that supports both Auth0 and Okta SSO providers.
Martin Englund
November 29, 2022
Big Ideas
CDC
Breaking Down Cost Barriers For Real-Time Change Data Capture (CDC)
Learn how to improve the efficiency of real-time CDC with Rockset
Ari Ekmekji
November 21, 2022
Company
AWS re:Invent 2022: Rockset Will Be There…Will You?
See Rockset live at AWS re:Invent in Las Vegas. Join real-time analytics demos at our booth and architecture sessions in our executive suite.
Ashley Andrada
November 15, 2022
Real-Time Analytics
Performance
Product
Rockset Achieves 84% Better Performance on the Star Schema Benchmark with Intel Ice Lake
As a result of ongoing enhancements, we released software that leverages 3rd Gen Intel® Xeon® Scalable processors and delivers 84% faster performance.
Julie Mills
November 2, 2022
Engineering
Product
The New Rockset Query Editor Experience
We're excited to announce the release of a new query editor in the Rockset Console with improved performance and an updated design.
Kristie Lim
November 2, 2022
Elasticsearch
Real-Time Analytics
5 Steps for Migrating from Elasticsearch to Rockset for Real-Time Analytics
Best practices from customers who migrated from Elasticsearch to Rockset in days to weeks by avoiding common migration pitfalls.
Patrick Druley
October 26, 2022
Case Study
web3
DynamoDB
Case Study: How Rockset's Real-Time Analytics Platform Propels the Growth of Our NFT Marketplace
Own the Moment uses Rockset to build the real-time analytics and leaderboards that are core to their NFT and fantasy sports platform.
Scott Mitchell
October 21, 2022
Kafka
How To
S3
Building Real-Time Recommendations with Kafka, S3, Rockset and Retool
Step through a real-time recommendations example using Kafka, S3, Rockset and Retool.
Nadine Farah
October 21, 2022
Big Ideas
Product
Public SQL Endpoints in Rockset
Learn how to share SQL query results and metadata with public endpoints
Scott Dwyer
October 13, 2022
Big Ideas
Snowflake
Data Warehouse
Reducing Costs
How To
7 Practical Ways to Cut Snowflake Compute Cost
Ok, so Snowflake is expensive. But what do I do about it? Here are 7 Practical Ways to Cut Snowflake Compute Cost
Shruti Bhat
October 11, 2022
Elasticsearch
CDC
Streaming
Updates, Inserts, Deletes: Challenges to avoid when indexing mutable data in Elasticsearch
We examine common challenges when indexing mutable data such as CDC streams in Elasticsearch and contrast with Rockset, as well as provide practical techniques for using these systems for real-time search and analytics.
Julie Mills
October 6, 2022
Case Study
DynamoDB
Dashboards
PyTorch Infra's Journey to Rockset
The PyTorch infra team at Meta runs thousands of tests to validate every change as part of their Continuous Integration. Learn how they moved to Rockset to deliver metrics on the health of their CI.
Jane Xu
October 4, 2022
ClickHouse
Streaming
CDC
Comparing ClickHouse vs Rockset for Event and CDC Streams
We compare ClickHouse and Rockset for real-time analytics on event and CDC streams, examining their similarities and differences across architecture, data ingestion, querying and operations.
Kevin Leong
September 20, 2022
Big Ideas
Real-Time Analytics
web3
3 Use Cases for Real-Time Blockchain Analytics
Learn about emerging use cases for real-time blockchain analytics and some key considerations for developers building dApps.
Sid Chhibber
September 13, 2022
DynamoDB
DynamoDB Filtering and Aggregation Queries Using SQL on Rockset
Learn how to build an application that handles high-volume transactions as well as filtering and aggregation using a combination of DynamoDB and Rockset.
Alex DeBrie
September 2, 2022
Data Applications
Use Cases
Real-Time Analytics
Expert Roundtable: How to Build Real-Time Personalization and Recommendation Systems
Hear experts share why real-time personalization offers greater accuracy and efficiency compared to offline alternatives, along with best practices for getting to real time.
Dhruba Borthakur
August 26, 2022
Case Study
IoT
Case Study: iYOTAH Brings Real-Time IoT Analytics to Dairy Farming with Its AgTech SaaS Platform
iYOTAH uses real-time IoT data to moooo-ve dairy farming into a smart future.
Daniel Lu
August 16, 2022
Kinesis
Kafka
Streaming
How To
Kafka vs Kinesis: How to Choose
Which is the best stream processing solution for your needs and environment?
Patrick Druley
August 11, 2022
Big Ideas
Real-Time Analytics
Expert Roundtable: Batch vs Streaming in the Modern Data Stack [Video]
Data engineering experts come together to discuss where batch and streaming analytics fit in the modern data stack.
Shruti Bhat
August 5, 2022
Case Study
Elasticsearch
Kafka
Use Cases
Case Study: How Rockset Turbocharges Real-Time Personalization at Whatnot
Whatnot implemented real-time personalization for their live shopping platform using Rockset, which proved a more efficient alternative to Elasticsearch.
Emmanuel Fuentes
July 29, 2022
Snowflake
Real-Time Analytics
Data Warehouse
Can BigQuery, Snowflake, and Redshift Handle Real-Time Data Analytics?
In this article, we’ll explore the strengths and shortcomings of three prominent data warehouses today for real-time analytics
Daniel Lu
July 28, 2022
MongoDB
Kafka
CDC
Streaming
How To
NoSQL
MongoDB CDC: When to Use Kafka, Debezium, Change Streams and Rockset
Change data capture from MongoDB is a reliable and performant way to move MongoDB data to a complementary system for search and analytics. We review several options for CDC on MongoDB.
Lewis Gavin
July 22, 2022
Big Ideas
DynamoDB
MongoDB
SQL
Expert Talk TLDR: SQL vs NoSQL Databases in the Modern Data Stack
Top takeaways from a recent panel of seasoned data architects and data practitioners steeped in NoSQL databases.
Daniel Lu
July 21, 2022
Dashboards
Case Study
MongoDB
Case Study: Is Your NoSQL Data Hindering Real-Time Analytics? Savvy Solved It with Rockset.
Savvy provides real-time analytics for growth teams using its service to create no-code interactive experiences. Learn how they built this functionality using Rockset on MongoDB data.
Jeremy Evans
July 12, 2022
Developer
Kinesis
SQL
Streaming SQL Joins in Rockset
We compare building collections in Rockset using JOINs at query time and at ingestion time and why you might choose each approach.
Tyler Denton
July 8, 2022
Company
Rockset's Summer Road Trip!
Rockset was talking fast and efficient real-time analytics in New York, Las Vegas and San Francisco in June. You can still catch us July 12 in New York at AWS Summit.
Ashley Andrada
July 6, 2022
Big Ideas
Real-Time Analytics
Why Real-Time Analytics Requires Both the Flexibility of NoSQL and Strict Schemas of SQL Systems
Modern, real-time use cases require databases that strongly enforce schemas and have the flexibility to automatically redefine those schemas based on the data itself.
Dhruba Borthakur
June 21, 2022
Snowflake
Product
Kafka
Kinesis
Real-Time Analytics
Streaming
Data Warehouse
Joining Streaming and Historical Data for Real-Time Analytics: Your Options With Snowflake, Snowpipe and Rockset
New Snowflake-Rockset connector provides Snowflake users cost-efficient option for real-time analytics on streaming data from Kafka and historical data in Snowflake.
Vibhuti Bhushan
June 14, 2022
Real-Time Analytics
Company
Engineering
Rockset Architecture Whiteboard Session With CTO Dhruba Borthakur
Learn about Rockset's ALT architecture and how data is ingested, stored and queried.
Dhruba Borthakur
June 7, 2022
MongoDB
DynamoDB
NoSQL
MongoDB vs DynamoDB Head-to-Head: Which Should You Choose?
We compare MongoDB and DynamoDB, their pros and cons, data types, cost, reliability, performance and security.
Shawn Adams
June 3, 2022
Case Study
Elasticsearch
Case Study: Zembula and Rockset Power Real-Time Marketing Email Personalization
Low-ops and cost-effective, Rockset is helping Zembula scale our next 100x growth.
Robert Haydock
May 25, 2022
Developer
Office Hours
Office Hours Recap: Optimize Cost and Query Latency With SQL Transformations and Real-Time Rollups
Recap of a recent Rockset Office Hours.
Nadine Farah
May 17, 2022
Real-Time Analytics
Big Ideas
SQL
SQL and Complex Queries Are Needed for Real-Time Analytics
Modern, cloud-native SQL databases deliver what today's data-driven businesses require.
Dhruba Borthakur
May 12, 2022
Real-Time Analytics
Big Ideas
Handling Bursty Traffic in Real-Time Analytics Applications
We examine the database architecture choices for handling bursty data traffic.
Dhruba Borthakur
May 10, 2022
Real-Time Analytics
DynamoDB
CDC on DynamoDB
We look at how CDC works with DynamoDB and its potential use cases.
Lewis Gavin
May 5, 2022
Engineering
Company
A Real-Time Rockset Intern Experience
The real real on interning at Rockset.
Shreya Shekhar
May 3, 2022
Real-Time Analytics
Engineering
Kafka
How Rockset Handles Data Deduplication
What is data duplication, how it plagues teams adopting real-time analytics, and what Rockset does to resolve duplication issues.
Tyler Denton
April 28, 2022
Company
Reflections of a Rockset UXer
Time flies when you're UXing at Rockset.
Aditi Dhar
April 26, 2022
Kafka
Real-Time Analytics
Streaming Data and Real-Time Analytics With Kafka + Rockset
Real-time analytics for streaming data is alive, growing and affordable for today’s modern real-time data stack.
Vibhuti Bhushan
April 19, 2022
Real-Time Analytics
Big Ideas
The Real-Time Revolution and Digital Economics in the COVID Era
Driven by COVID, economists are finally embracing streaming and real-time data – just like the business world.
Shruti Bhat
April 15, 2022
Real-Time Analytics
Big Ideas
Data Applications
Handling Out-of-Order Data in Real-Time Analytics Applications
Mutability is the most important capability for real-time analytics applications, but close behind is the ability to handle out-of-order data.
Dhruba Borthakur
April 12, 2022
Company
Kafka
DynamoDB
Rockset Goes on the Road!
Rockset will be exhibiting at three events this month in San Francisco and London.
Ashley Andrada
April 5, 2022
Druid
ClickHouse
Performance
Rockset Beats ClickHouse and Druid on the Star Schema Benchmark (SSB)
Rockset is 1.67 times faster than ClickHouse and 1.12 times faster than Druid on the Star Schema Benchmark.
Ben Hannel
March 31, 2022
Case Study
MongoDB
Developer
Case Study: How Rockset Made Me a Day Three Hero at Sounding Board
From Rockset trial to usable and reportable real-time information in just three days.
Jon Farr
March 29, 2022
Case Study
MySQL
Real-Time Analytics
Case Study: How Dimona Built a Real-Time Inventory Management System on Rockset
Dimona needed a better technology solution, one that could handle massive data sets and query them fast.
Igor Blumberg
March 25, 2022
Case Study
MongoDB
DynamoDB
Case Study: Rockset Enables Real-Time Operational Analytics in Hardware Manufacturing for PCH
Rockset delivers ad hoc complex queries within seconds, a huge improvement over the one-hour latency PCH was seeing before.
Daniel Lu
March 24, 2022
Real-Time Analytics
Developer
Elasticsearch
Druid
Empowering Developers With Query Flexibility
Query flexibility enables developers to prototype and build new features quickly, increasing overall productivity.
Nadine Farah
March 22, 2022
Real-Time Analytics
Kafka
Streaming
Streaming Analytics With KSQL vs. A Real-Time Analytics Database
The arguments for and against two approaches to data analytics and their optimal use cases
Lewis Gavin
March 17, 2022
Real-Time Analytics
MongoDB
PostgreSQL
Druid
ClickHouse
How Mutable Databases Make It Easy To Do Real-Time Updates
Three reasons why you need a mutable database for real-time updates
Nadine Farah
March 15, 2022
DynamoDB
Case Study
IoT
Case Study: Complementing DynamoDB with Rockset for Real-Time IoT Analytics at 1NCE
Thanks to Rockset, 1NCE is able to provide customers with fast and valuable insight into their data
Jan Sulaiman
March 10, 2022
Real-Time Analytics
Big Ideas
Why Mutability Is Essential for Real-Time Data Analytics
Mutability enables updates to existing records in a data store and is key to real-time analytics.
Dhruba Borthakur
March 4, 2022
Kinesis
How Rockset Supports Kinesis Shard Autoscaling to Handle Varying Throughputs
On-demand capacity increases efficiency and supports cost savings
Sudhindra Tirupati Nagaraj
March 3, 2022
Real-Time Analytics
SQL
Real-Time Analytics on Oracle and MSSQL With Rockset
Rockset announces early access for Oracle and Microsoft SQL Server integrations
Vibhuti Bhushan
February 24, 2022
Kinesis
Elasticsearch
Druid
Real-Time Analytics
Real-Time Analytics on Kinesis Event Streams Using Rockset, Druid, Elasticsearch and Redshift
An overview of popular options for RTA on Kinesis event streams highlighting ideal use cases and associated tradeoffs.
Scott Dwyer
February 17, 2022
Big Ideas
Engineering
17 New Things Every Modern Data Engineer Should Know in 2022
We asked data industry thought leaders to tell us what we should be paying attention to in coming months. Here is what they told us.
Shruti Bhat
February 14, 2022
Real-Time Analytics
Big Ideas
Top 5 Reasons for Moving From Batch To Real-Time Analytics
Fast analytics on fresh data beats slow analytics on stale data every time.
Venkat Venkataramani
February 10, 2022
MongoDB
How To
NoSQL
How To Join Data in MongoDB
Choosing between $lookup, denormalization and alternatives for joining data in MongoDB.
Shawn Adams
February 2, 2022
MongoDB
Real-Time Analytics
NoSQL
Slow Queries
Five Ways to Run Analytics on MongoDB – Their Pros and Cons
Your choices range from performing analytics directly in MongoDB to moving data to a data store better equipped for real-time analytics.
Shawn Adams
January 28, 2022
DynamoDB
Case Study
Case Study: Real-Time Insights Help Propel 10X Growth at E-Learning Provider Seesaw
Rockset, along with DynamoDB, Hightouch, and Retool, enabled Seesaw to obtain actionable, real-time insights that helped grow their e-learning platform.
Daniel Lu
January 25, 2022
Snowflake
How To
Slow Queries
Data Warehouse
What Do I Do When My Snowflake Query Is Slow? Part 2: Solutions
Part two of a two part series on improving Snowflake query performance
Shawn Adams
January 20, 2022
Snowflake
How To
Slow Queries
Data Warehouse
What Do I Do When My Snowflake Query Is Slow? Part 1: Diagnosis
Part one of a two part series on improving Snowflake query performance
Shawn Adams
January 5, 2022
Real-Time Analytics
SQL
Mythbusting: The Venerable SQL Database and Today’s Real-Time Analytics
The SQL database that came of age in the 1980s still has a critical role today in moving data-driven companies from batch to real-time analytics.
Dhruba Borthakur
December 21, 2021
Company
Engineering
Developer
How We Use Rockset's Real-Time Analytics to Debug Distributed Systems
Jonathan, a software engineering intern at Rockset, describes how Rockset uses its own tech to debug its highly distributed ingest system.
Jonathan Kula
December 17, 2021
SQL
Developer
Powering SQL Draw with Rockset, Retool and dbt
SQL Draw is a Slack-based game that uses Rockset, Retool and dbt to create fun drawings with cartesian geometry, creativity and teamwork.
James Weakley
December 10, 2021
Company
Wrap-up of Rockset at AWS re: Invent 2021
November 29 to December 3, 2021 in Las Vegas, NV
Rod Bauer
December 9, 2021
Big Ideas
Real-Time Analytics
Streaming
The Rise of Streaming Data and the Modern Real-Time Data Stack
Now more than 10 years old, the modern data stack is ripe for innovation. The inevitable next stage? Real-time insights delivered straight to users — the modern real-time data stack.
Shruti Bhat
December 1, 2021
Company
Engineering
Why Rockset Is My Next Job After Facebook
Louis Brandy, director of engineering, shares his thoughts on joining Rockset.
Louis Brandy
November 9, 2021
MySQL
PostgreSQL
OLTP
CDC
How To
How to Implement CDC for MySQL and Postgres
We examine different options for implementing change data capture (CDC) from MySQL and Postgres and make recommendations for when to use each.
Lewis Gavin
November 5, 2021
Case Study
PostgreSQL
Case Study: Powering Customer-Facing Dashboards at Scale Using Rockset with PostgreSQL at DataBrain
Learn how Rockset’s PostgreSQL integration helped DataBrain scale smoothly as its production data size and query volume exploded.
Daniel Lu
November 4, 2021
S3
Data Lakes
Getting Started with Apache Spark, S3 and Rockset for Real-Time Analytics
Get fast query performance with Apache Spark + Rockset to power data apps.
Nadine Farah
November 2, 2021
Product
Rockset’s Reverse ETL Integrations Extend the Modern Real-Time Data Stack
Rockset’s new partner integrations with leading reverse ETL platforms Census, Hightouch and Omnata will enable everyday business tools to consume real-time customer insights seamlessly from Rockset.
Daniel Lu
October 26, 2021
Case Study
DynamoDB
Case Study: Fast and Simple — Building Rich Patient Dashboards for Speech Therapists with Rockset
Rockset is used to power interactive visualizations of the rehabilitation data of speech-impaired patients for their speech therapists and other caregivers.
Antonio Domínguez
October 20, 2021
Product
Real-Time Data Transformations with dbt + Rockset
The dbt-Rockset adapter 2.0 supports all four core dbt materializations. Learn about how to transform data in real-time using dbt and Rockset.
Justin Liu
October 15, 2021
Big Ideas
What Is a Cloud Database? IaaS, PaaS, SaaS and DBaaS Explained
Cloud databases are not created equal. We discuss what these different terms mean with respect to cloud databases: IaaS, PaaS, SaaS and DBaaS.
Shawn Adams
September 29, 2021
Product
Rockset Elevates Security Posture with RBAC Custom Roles & Views
New security features enable customers to enforce least privileged access to all resources within Rockset
Rafael Kabesa
September 29, 2021
Company
Product
Rockset Is Now SOC 2 Type II Compliant
The Rockset team is proud to announce that we have been accredited as SOC 2 Type II compliant.
Martin Englund
September 21, 2021
Engineering
How To
How We Improved the Concurrency and Scalability of Our Redis Rate Limiting System
We use a rate limiting system, based on Redis, to protect services from overload. Learn how we increased its concurrency and scalability in this blog.
Akshay Nanavati
September 15, 2021
Kafka
Product
Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data
We’re introducing a new fully-managed Kafka Integration with native support for Confluent Cloud and Apache Kafka. Get started with real-time analytics on event streams from Apache Kafka in minutes.
Boyang Chen