April 15th, 2021
Fast SQL
Powering Real-Time Analytics at Scale on MySQL and PostgreSQL
Enable sub-second, high-concurrency analytics for MySQL and PostgreSQL using Rockset for real-time external indexing.
Justin Liu
Follow our stories and unique insights.
April 12th, 2021
Case Study
Case Study: Bringing Real-Time Analytics to Construction Logistics at Command Alkon
Command Alkon offers a SaaS application to digitize construction logistics, allowing suppliers, transportation providers and contractors on jobsites to analyze and collaborate on data in real time.
Kevin Leong
April 5th, 2021
Case Study
Sequoia Capital: Why We Moved from Elasticsearch to Rockset
We spoke with Sequoia’s head of engineering, Jake Quist, and VP of data science, Hem Wadhar, about their reasons for moving their internal analytics off Elasticsearch to Rockset.
Kevin Leong
March 31st, 2021
Case Study
Case Study: Ritual’s Move to Real-Time Analytics to Personalize the Multivitamin Experience
Ritual, a health-meets-technology company, personalized the cart checkout experience, email promotions and banners using Rockset. Learn how Ritual effectively monetized new product lines with real-time analytics.
Julie Mills
March 23rd, 2021
Engineering
On the Pursuit of Happiness (aka Squashing 502/504 Errors)
We recount our experience hunting down, diagnosing and fixing 502 and 504 errors to improve product quality and user experience.
Hieu Pham
March 15th, 2021
APIs & Applications
Elasticsearch or Rockset for Real-Time Analytics: Real-Time Ingestion and Indexing
In part 3 of our Elasticsearch and Rockset comparison, we examine how well Elasticsearch and Rockset ingest and index real-time data.
Shawn Adams
March 12th, 2021
Engineering
Big Ideas
5 Tips for Recruiting Top Engineering Talent in Startups
Rockset CEO Venkat Venkataramani and engineering leaders Nimrod Hoofien (Gusto) and Adam Wolff (Robinhood) share best practices for recruiting great engineers.
Julie Mills
March 5th, 2021
Big Ideas
Space-Time Tradeoff and Your Snowflake Compute Cost
In this post, we discuss how you should think about space, time, storage and compute in the context of modern data analytics offerings like Snowflake and Rockset.
Shruti Bhat
February 25th, 2021
APIs & Applications
Elasticsearch or Rockset for Real-Time Analytics: How Much Query Flexibility Do You Have?
In part 2 of our Elasticsearch and Rockset comparison, we take a look at query flexibility and its impact on developer productivity.
Shawn Adams
February 18th, 2021
Fast SQL
Rockset Is Up to 9.4x Faster than Apache Druid on the Star Schema Benchmark
We evaluated Rockset on the Star Schema Benchmark and found up to 9.4x query runtime speedup compared to Druid. We discuss our benchmarking exercise, results and analysis in this blog post.
Kevin Leong
February 9th, 2021
Fast SQL
Indexing Amazon S3 for Real-Time Analytics on Data Lakes
We explore how indexing Amazon S3 data can enable low-latency, high-concurrency queries for real-time analytics.
Shawn Adams
January 27th, 2021
Engineering
Big Ideas
Hello Rockset
Mark Callaghan shares his thoughts on joining Rockset, where he will focus on performance engineering.
Mark Callaghan
January 19th, 2021
APIs & Applications
Elasticsearch or Rockset for Real-Time Analytics: Managing Clusters vs Going Serverless
In part 1 of our Elasticsearch and Rockset comparison, we explore the operational costs associated with both real-time analytics solutions.
Shawn Adams
December 22nd, 2020
APIs & Applications
How to Join Data in Elasticsearch vs Rockset
In this blog post, we'll look at what it takes to join data sets in Elasticsearch and in Rockset, using the same online marketplace example.
Lewis Gavin
December 17th, 2020
APIs & Applications
Build Internal Apps in Minutes with Retool and Rockset: A Customer 360 Example
Build performant internal apps with Retool and Rockset by just using data APIs and pre-built UI components. We show you how to integrate Rockset with Retool on a customer 360 sample app.
Ben Rogojan
December 10th, 2020
Engineering
What I've Learned in 2020: A Technical Version
Hieu shares thoughts on columnar databases, RocksDB, SQL engines and his year as an engineer at Rockset.
Hieu Pham
November 24th, 2020
Fast SQL
Engineering
RocksDB
How Rockset’s Converged Index Powers Real-Time Analytics
Rockset enables millisecond-latency queries on terabytes of data because all data ingested is indexed multiple ways in its Converged Index. Learn how the Converged Index works in this blog post.
Shawn Adams
November 19th, 2020
Fast SQL
Engineering
Smart Schema: Enabling SQL Queries on Semi-Structured Data
We explain and show how users can perform schemaless ingestion of their data and then use Rockset's Smart Schema to enable SQL queries directly on that data.
Shawn Adams
November 12th, 2020
MongoDB
Using Elasticsearch to Offload Real-Time Analytics from MongoDB
This post weighs the advantages and disadvantages of moving read-heavy analytics off a primary MongoDB database using Elasticsearch for indexing.
Shawn Adams
October 27th, 2020
Big Ideas
Rockset Raises $40M Series B to Empower Developers Building Real-Time Analytics
Rockset is the real-time cloud database built for modern data apps, bringing speed, scale and simplicity to developers building real-time analytics.
Venkat Venkataramani
October 27th, 2020
Big Ideas
Engineering
Why I Am Joining Rockset
Nathan Bronson is joining Rockset to make real-time data infrastructure simple for users at scale.
Nathan Bronson
October 26th, 2020
APIs & Applications
Case Study
Case Study: Rumble’s Real-Time Leaderboards Empower Users to Lead Healthier Lifestyles
Rumble encourages people to lead healthy lifestyles by providing incentives based on how much walking they do. Rockset to powers Rumble's real-time leaderboards, which serve to motivate its users to keep active.
Nadine Farah
October 8th, 2020
MongoDB
3 Tools to Help Debug Slow Queries in MongoDB
How can you investigate query performance issues in MongoDB? We give an overview of 3 tools available for troubleshooting slow queries in MongoDB Atlas.
Ben Rogojan
October 1st, 2020
Kafka
MongoDB
APIs & Applications
Building a Real-Time Customer 360 on Kafka, MongoDB and Rockset
A step-by-step guide to building a real-time customer 360 using seconds-old purchase data from MongoDB and marketing data from Kafka.
Lewis Gavin
September 25th, 2020
MongoDB
3 Ways to Offload Read-Heavy Applications from MongoDB
Offloading read-heavy analytics from an operational database, like MongoDB, is a common architectural pattern. This post examines 3 options for offloading MongoDB to a secondary system.
Ben Rogojan
September 15th, 2020
Big Ideas
Rockset: 1 Billion Events in a Day with 1-Second Data Latency
This post introduces RockBench, a benchmark for measuring the data latency of real-time databases. We ran RockBench on Rockset, which maintained a 1-second data latency while ingesting 1 billion events in a day.
Dhruba Borthakur
September 3rd, 2020
MongoDB
Offload Real-Time Reporting and Analytics from MongoDB Using PostgreSQL
This post weighs the advantages and disadvantages of moving real-time analytics off a primary MongoDB database to PostgreSQL.
Shawn Adams
August 27th, 2020
DynamoDB
Case Study
Case Study: Matter Uses Rockset to Bring AI-Powered Sustainable Insights to Investors
With Rockset, Danish fintech Matter has the flexibility to run analytical queries on semi-structured data in S3 and DynamoDB as part of their NLP architecture.
Alexander Harrington
August 25th, 2020
MongoDB
Handling Slow Queries in MongoDB - Part 2: Solutions
Now that we've identified and investigated the slow and problematic queries on MongoDB, we discuss the advantages and disadvantages to various strategies for improving the performance of our database.
Justin Liu
August 20th, 2020
APIs & Applications
Announcing the New Rockset Developer Tools
We released Rockset Developer Tools, including a new CLI tool and a new VS Code extension, to make it easier to develop real-time data applications on Rockset.
Tanmay Chordia
August 18th, 2020
Big Ideas
Changing face of real-time analytics
We explore the continuum of real-time analytics, from live, interactive dashboards to online applications that automatically take action on real-time data.
Shruti Bhat
August 13th, 2020
Big Ideas
The future is serverless: what about your data stack?
Serverless architectures offer ease of use and cost advantages. We explore what serverless means for your data stack.
Shruti Bhat
August 11th, 2020
Big Ideas
Analytics-on-the-fly: from batch to real-time user engagement
Companies need to embrace real-time analytics to compete and survive. Only those that have invested in a real-time data stack will thrive.
Dhruba Borthakur
August 10th, 2020
Case Study
DynamoDB
Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data
eGoGames improves user experience, detects fraud, and makes business decisions using Rockset for real-time analytics on gaming data in Amazon DynamoDB and S3.
Kevin Leong
August 10th, 2020
Big Ideas
Rapid Experimentation and Growth Using Real-Time Analytics
A real-time A/B experiments system is the best showcase of the power of real-time analytics. This post explains how to build for the requirements of a massive-scale A/B experiments platform.
Venkat Venkataramani
August 7th, 2020
MongoDB
Handling Slow Queries in MongoDB - Part 1: Investigation
Latency is one of the most essential factors in the performance of any application at scale. We explore various methods of identifying slow queries on MongoDB and understand how to improve them.
Justin Liu
July 29th, 2020
MongoDB
Performance Isolation for Your Primary MongoDB Cluster
Performance of your primary MongoDB cluster is crucial. We look at how using multiple MongoDB clusters can help with performance isolation.
Dai Shi
July 23rd, 2020
MongoDB
Improving MongoDB Read Performance - Indexing, Replication and Sharding
Real-time analytics demands low-latency complex queries. We discuss various options for speeding up read performance, including indexing, replication and sharding in MongoDB.
Shawn Adams
July 21st, 2020
Big Ideas
Building Real-Time Data Architectures to Foster Innovation
Lessons on building real-time data architectures based on experiences growing Facebook users 30x, from 50 million to 1.5 billion.
Venkat Venkataramani
July 16th, 2020
MongoDB
Engineering
Indexing on MongoDB Using Rockset - How It Works
An in-depth look at indexing MongoDB data in Rockset and how it compares to indexing in MongoDB itself.
Ben Hannel
July 14th, 2020
Case Study
MongoDB
Case Study: StoryFire - Scaling a Social Video Platform on MongoDB and Rockset
StoryFire uses Rockset to index data from their transactional MongoDB database to achieve performance and scale for analytical queries on their social video platform.
Ben Hagan
July 8th, 2020
APIs & Applications
DynamoDB
Kafka
Designing a Real-Time ETA Prediction System Using Kafka, DynamoDB and Rockset
Generate ETA predictions for a delivery service using real-time location and order data from Kafka and DynamoDB.
Kartik Khare
June 23rd, 2020
MongoDB
APIs & Applications
Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset
Implementing a real-time recommendations API for an event ticketing system by indexing MongoDB data in Rockset for fast SQL.
Lewis Gavin
June 16th, 2020
MongoDB
JOINs and Aggregations Using Real-Time Indexing on MongoDB Atlas
We explore how real-time indexing on MongoDB enables fast aggregation and join queries, and how Rockset is specifically designed to meet real-time indexing requirements.
Kevin Leong
June 9th, 2020
MongoDB
MongoDB Performance Tuning - Top 5 Resources
A compilation of MongoDB performance tuning resources, covering topics such as sharding, indexing, schema design and performance isolation.
Kevin Leong
June 4th, 2020
RocksDB
Engineering
Remote Compactions in RocksDB-Cloud
To optimize RocksDB for cloud environments, we modified RocksDB-Cloud to allow remote compactions, separating compaction compute from storage, for greater efficiency.
Hieu Pham
June 2nd, 2020
MongoDB
Big Ideas
Top 10 sessions for MongoDB.live 2020
Sessions to look forward to for MongoDB.live 2020
Nadine Farah
May 19th, 2020
APIs & Applications
MongoDB
Create APIs for Aggregations and Joins on MongoDB in Under 15 Minutes
Build a Python application to create and execute APIs on aggregations and joins using Rockset and MongoDB.
Nadine Farah
May 6th, 2020
MongoDB
Engineering
Using MongoDB Change Streams for Indexing with Elasticsearch vs Rockset
Learn how Rockset indexes data from MongoDB change data capture (CDC) streams and how it compares to indexing in Elasticsearch.
Kshitij Wadhwa
April 28th, 2020
Engineering
Fast SQL
Index Scan: Using Rockset's Search Index to Speed up Range Scans Over a Specific Field
Rockset uses Converged Indexing to make different types of queries run fast. We look at how Rockset's Index Scan uses the search index to accelerate range scans.
Karen Li
April 20th, 2020
Fast SQL
APIs & Applications
Can I Do SQL-Style Joins in Elasticsearch?
While joins are primarily an SQL concept, they are equally important in the NoSQL world as well. We explore how to perform the equivalent of SQL joins when using Elasticsearch.
Shawn Adams
April 3rd, 2020
DynamoDB
Case Study
Dashboards
Fleet Management System – An End-to-End Streaming Data Pipeline
This post outlines a fleet management solution using IoT and data technologies, such as DynamoDB, AWS IoT Core, AWS Lambda, and Rockset.
Abhijeet Upadhyay
March 19th, 2020
Kafka
How to Use KSQL Stream Processing and Real-Time Databases to Analyze Streaming Data in Kafka
We discuss when stream processing, with KSQL and Kafka Streams, and when a real-time database like Rockset are best used for analyzing Kafka data.
Ari Ekmekji
March 12th, 2020
Dashboards
APIs & Applications
Query Lambdas: Increasing Developer Velocity for Application Development
Just as Rockset has reimagined and dramatically simplified the traditional ETL pipeline on the data-loading side, we’re now proud to release a new product feature - Query Lambdas - that similarly rethinks the data application development workflow.
Scott Morris
March 5th, 2020
Kafka
Best Practices for Analyzing Kafka Event Streams
What are the key considerations when selecting an analytics stack for building data applications on Kafka event streams?
Kevin Leong
February 28th, 2020
Big Ideas
MongoDB
Real-time external indexing for aggregations and joins on MongoDB collections
This is a tech preview of an integration that will allow you to index your MongoDB data in row, column and inverted indexes, and run millisecond-latency SQL queries in real-time.
Shruti Bhat
February 14th, 2020
Kafka
APIs & Applications
Where's My Tesla? Creating a Data API Using Kafka, Rockset and Postman to Find Out
We demonstrate how to expose real-time IoT data in Kafka through the Rockset REST API in this example.
Lewis Gavin
February 7th, 2020
Kafka
Dashboards
Real-Time Analytics on Connected Car IoT Data Streams from Apache Kafka
In this IoT example, we examine how to enable complex analytic queries on real-time Kafka streams from connected car sensors.
Shawn Adams
January 28th, 2020
Case Study
APIs & Applications
Case Study: Standard Cognition Uses Rockset to Deliver Data APIs and Real-Time Metrics for Vision AI
Standard Cognition, an AI-powered computer vision company, uses Rockset to enable their developers to deliver data APIs and product improvements.
Kevin Leong
January 23rd, 2020
RocksDB
RocksDB Is Eating the Database World
An overview of what makes RocksDB well-suited to power many of the world's high-performance distributed data systems.
Ethan Hamilton
January 17th, 2020
Dashboards
APIs & Applications
Kafka
SQL API for Real-Time Kafka Analytics in 3 Steps
Learn how to create a SQL API for real-time Kafka analytics on the Twitter Streaming API, using AWS Lambda and Rockset.
Tanmay Chordia
January 10th, 2020
DynamoDB
Fast SQL
Joining Data in DynamoDB and S3 for Live, Ad-Hoc Analysis
Using SQL to join DynamoDB and S3 data, operations teams can perform live, ad-hoc analysis across multiple cloud systems.
Ben Rogojan
December 9th, 2019
Big Ideas
What Data Engineers Think About - Variety, Volume, Velocity and Real-Time Analytics
Data engineers are often tasked with moving and preparing data to facilitate analytics. This guest post examines several considerations for data engineers designing for real-time analytics.
Lewis Gavin
November 6th, 2019
Kafka
Analytics on Kafka Event Streams Using Druid, Elasticsearch and Rockset
We discuss how different data backends - Druid, Elasticsearch and Rockset - can be used alongside Kafka for analytics on event data streams.
Anirudh Ramanathan
October 31st, 2019
DynamoDB
5 Use Cases for DynamoDB
This guest post lays out the benefits of using DynamoDB, including 5 real-life examples, along with recommendations for performing analytics on DynamoDB data.
Ben Rogojan
October 21st, 2019
Engineering
The Role of UX in Making Rockset the Shortest Path from Data to Applications
Learn how our UX team continually improves common user workflows in Rockset to simplify development of data-driven applications.
Aditi Dhar
October 10th, 2019
Kafka
Dashboards
Using Tableau with Kafka: How to Build a Real-Time SQL Dashboard on Streaming Data
Build a real-time Tableau dashboard for operational monitoring and analytics on streaming event data from Kafka.
Scott Morris
October 1st, 2019
Engineering
Dashboards
How We Analyze and Visualize Kubernetes Events in Real Time at Rockset
Learn how we rolled our own tool for analysis and visualization of Kubernetes events, and try the open-source dashboard for yourself.
Rui Aguiar
September 20th, 2019
Engineering
Outside Lands, Airbnb Prices, and Rockset’s Geospatial Queries
How to use Rockset's fast geospatial indexes with Airbnb data.
Ben Hannel
September 13th, 2019
Dashboards
Grafana Time-Series Dashboards with the Rockset-Grafana Plugin
How Rockset uses Grafana dashboards for monitoring production systems, Kubernetes, and GitHub metrics, and how we built a Rockset-Grafana plugin.
Rui Aguiar
September 6th, 2019
Kafka
Big Ideas
Real-Time Analytics in the World of Virtual Reality and Live Streaming
An architecture for real-time decision-making and live dashboards on VR data in Kafka, coming from live-streamed events.
Sebastian Zangaro
August 29th, 2019
DynamoDB
Dashboards
Using Tableau with DynamoDB: How to Build a Real-Time SQL Dashboard on NoSQL Data
We create an example dashboard in Tableau on data in DynamoDB, using Rockset as the SQL intelligence layer.
Vahid Fazel-Rezai
August 27th, 2019
DynamoDB
3 cost-cutting tips for Amazon DynamoDB
How to avoid costly mistakes with DynamoDB partition keys, read/write capacity modes, and global secondary indexes.
Anirudh Ramanathan
August 27th, 2019
Big Ideas
Operational Analytics - The Last Mile In Data and Analytics
We explore operational analytics and why providing insights in real time to large numbers of users is crucial for organizations.
Jay Maloney
August 23rd, 2019
DynamoDB
Engineering
How We Reduced DynamoDB Costs by Using DynamoDB Streams and Scans More Efficiently
Get an inside look at the some of the techniques we used to reduce the cost of ingesting data from DynamoDB.
Aditi Srinivasan
August 21st, 2019
Engineering
Kafka
The Kafka Connect Plugin for Rockset and How It Works
Get an in-depth look at the Kafka Connect Plugin for Rockset and the process to get it listed in Confluent Hub.
Jacob Klegar
August 21st, 2019
Engineering
RocksDB
Optimizing Bulk Load in RocksDB
What’s the fastest we can load data into RocksDB?
Igor Canadi
August 16th, 2019
APIs & Applications
Data-Driven Decisions for Where to Park in SF
We built an app to estimate the risk of a car break-in based on historical incidents.
Vahid Fazel-Rezai
August 13th, 2019
Dashboards
DynamoDB
Tableau Operational Dashboards and Reporting on DynamoDB - Evaluating Redshift and Athena
We review several approaches to building Tableau operational dashboards and reporting on DynamoDB data, using SQL engines like Redshift and Athena.
Ari Ekmekji
August 12th, 2019
Dashboards
DynamoDB
APIs & Applications
Real-Time Analytics on DynamoDB - Using DynamoDB Streams with Lambda and ElastiCache
We cover different approaches to real-time analytics on DynamoDB, using DynamoDB Streams, Lambda, and ElastiCache.
Ari Ekmekji
July 30th, 2019
Big Ideas
From Good to Great: How Operational Analytics Gives Businesses a Real-Time Edge
All businesses today are a series of real-time events. But what separates the good from the great is how they capture and operationalize that data.
Shruti Bhat
July 25th, 2019
Big Ideas
Operational Analytics: What every software engineer should know about low-latency queries on large data sets
What are the characteristics of an Operational Analytics processing system, and how does it differ from OLTP, OLAP and other data systems?
Dhruba Borthakur
July 18th, 2019
Engineering
SQL Query Planning for Operational Analytics
We discuss how SQL query planning is implemented to support operational analytics requirements, like low latency and high concurrency, in Rockset.
Purvi Desai
July 9th, 2019
Fast SQL
Methods for Running SQL on JSON in PostgreSQL, MySQL and Other Relational Databases
We examine various options for running SQL on JSON in relational databases, like PostgreSQL and MySQL, and in Rockset.
Shawn Adams
June 27th, 2019
Engineering
RocksDB
How we use RocksDB at Rockset
This blog describes how we use RocksDB at Rockset and how we tuned it for optimal performance.
Sandeep Dhoot
June 20th, 2019
Fast SQL
Redshift with Rockset: High performance queries for operational analytics
Run high performance queries for operational analytics on data from Redshift tables by continuously ingesting and indexing Redshift data through a Rockset-Redshift integration.
Kshitij Wadhwa
June 13th, 2019
Fast SQL
Building a SQL Development Environment for Messy, Semi-Structured Data
Learn how and why Rockset developed a new SQL development environment for messy, semi-structured data.
Scott Morris
June 6th, 2019
Engineering
IValue: efficient representation of dynamic types in C++
This post shows one of many challenges that we encountered while building a fully dynamically typed SQL database: how we manipulate values of unknown types in our query execution backend, while approaching the performance of using native types directly.
Tudor Bosman
May 31st, 2019
Dashboards
Using Tableau for Live Dashboards on Event Data
Connect a Tableau live dashboard to a real-time event stream of complex JSON in a few easy steps.
Haneesh Reddy Poddutoori
May 24th, 2019
Case Study
DynamoDB
Dashboards
Case Study: FULL Uses Rockset with DynamoDB for Live Dashboard to Manage Remote Workforce
FULL Creative uses Rockset to build live dashboards and run complex SQL on contact center call data in DynamoDB.
Kevin Leong
May 23rd, 2019
Engineering
Converged Index™: The Secret Sauce Behind Rockset's Fast Queries
Learn how Rockset delivers low-latency SQL for search and analytics using a combination of row, column, and search indexes.
Igor Canadi
May 17th, 2019
APIs & Applications
Building a Serverless Analytics App to Capture and Query Clickstream Data
We built a web app that collects clickstream data as free-form JSON and runs SQL queries on the live data in a completely serverless fashion. We also seek to answer age-old questions besetting developers: tabs or spaces, vim or emacs?
Vahid Fazel-Rezai
May 17th, 2019
Big Ideas
Developer Pulse: 5 Things Developers Love
When the existential question of spaces vs. tabs came up in our team, we ran a real-time survey to collect thousands of data points around it. We also wanted to settle the debate around other developer issues like SQL vs NoSQL.
Shruti Bhat
May 6th, 2019
Case Study
DynamoDB
APIs & Applications
Case Study: Decore Uses Rockset for Search & Analytics on DynamoDB
Decore needed to enable ad hoc queries in their crypto accounting software service, so they turned to Rockset for fast analytics on DynamoDB.
Kevin Leong
April 29th, 2019
DynamoDB
Analytics on DynamoDB: Comparing Elasticsearch, Athena, and Spark
We compare options for real-time analytics on DynamoDB - Elasticsearch, Athena, and Spark - in terms of ease of setup, maintenance, query capability, latency. We also evaluate which use cases each of them are best suited for.
Anirudh Ramanathan
April 29th, 2019
DynamoDB
Secondary Indexes For Analytics On DynamoDB
In this post I explore how to support analytical queries on DynamoDB without prohibitive scan costs - using secondary indexes. I also evaluate the pros and cons of this approach in contrast to extracting data to Athena, Spark or Elastic for analytics
Anirudh Ramanathan
March 27th, 2019
Fast SQL
From Schemaless Ingest to Smart Schema: Enabling SQL on Raw Data
Rockset's schemaless SQL platform automatically infers schema at read time, allowing you to analyze messy data using SQL.
Purvi Desai
March 21st, 2019
Big Ideas
Serverless Data Management: A SQL Search and Analytics Engine
Designed from the ground up for serverless data management, Rockset makes SQL search and analytics simple and accessible.
Venkat Venkataramani
March 19th, 2019
Case Study
Kafka
Dashboards
Case Study: Fynd Uses Kafka and Rockset to Respond to E-Commerce Consumer Behavior in Real Time
Fynd uses Rockset to perform fast queries on real-time Kafka event streams, so they can react to consumer behavior as it happens.
Kevin Leong
March 19th, 2019
Case Study
Case Study: Implementing Real-Time IoT Analytics Simply and Efficiently - An MIT Smart City Project
An MIT team collaborates with a school in Brazil on a smart city project to analyze weather sensor data using Rockset.
Kevin Leong
March 19th, 2019
Case Study
Fast SQL
Case Study: The Path to Better Pollution Forecasting Goes Through Nested JSON
Pittsburgh-based developer Doug Balog collects and analyzes nested JSON weather data to improve pollution forecasts in his community.
Kevin Leong
February 28th, 2019
APIs & Applications
Fast SQL
How to Build a Facebook Messenger Chatbot Powered by Fast SQL on CSV
Build a chatbot that provides instant responses, leveraging fast SQL queries on CSV data.
Kshitij Wadhwa
February 21st, 2019
Fast SQL
Using Smart Schema to Accelerate Insights from Nested JSON
Use Rockset's Smart Schema to understand complex, nested JSON and enable immediate queries using SQL on raw data.
Purvi Desai
February 21st, 2019
Fast SQL
How to Run SQL on PDF Files
Run SQL queries on data from PDF files, and join PDFs with JSON, CSV, XLSX, and other data.
Kshitij Wadhwa
February 13th, 2019
Engineering
Distributed Aggregation Queries - A Rockset Intern Story
Rockset distributes aggregation queries to reduce query latency and memory requirements. This was an intern project by Ashwath, Rockset's first ever intern.
Ashwath Thirumalai
February 6th, 2019
Engineering
Aggregator Leaf Tailer: An Alternative to Lambda Architecture for Real-Time Analytics
The Aggregator Leaf Tailer architecture takes advantage of powerful indexing and cloud scalability to enable live analytics on real-time event streams.
Dhruba Borthakur
January 23rd, 2019
DynamoDB
Running Fast SQL on DynamoDB Tables
Run fast SQL queries on data from DynamoDB tables by continuously ingesting and indexing DynamoDB data through a Rockset-DynamoDB integration.
Kshitij Wadhwa
January 23rd, 2019
Dashboards
Live Dashboards with Redash and Rockset
Build live dashboards by connecting Redash to Rockset to create visualizations quickly and easily.
Igor Canadi
January 21st, 2019
Fast SQL
Rockset adds Excel spreadsheet support: Use SQL across XLSX files and join with other JSON, CSV or Parquet data
Run complex SQL across multiple Excel spreadsheets and join XLSX files with JSON, Parquet or CSV data.
Shruti Bhat
January 16th, 2019
Kafka
Real-Time Analytics Using SQL on Streaming Data with Apache Kafka and Rockset
Connect Kafka and Rockset to obtain real-time analytics with ad hoc SQL queries on event streams.
Shawn Adams
January 10th, 2019
Fast SQL
How to Do Data Science Using SQL on Raw JSON
How to query nested JSON and CSV using SQL (including joins), without any upfront data preparation or complex data pipelines - for interactive data science using Python notebooks.
Anirudh Ramanathan
January 8th, 2019
APIs & Applications
Building a Serverless Microservice Using Rockset and AWS Lambda
Build serverless microservices, data APIs, and data-driven applications. Use SQL to join and query JSON and CSV data using AWS Lambda and Rockset.
Kevin Leong
December 20th, 2018
Dashboards
Live Dashboards on Streaming Data - A Tutorial Using Amazon Kinesis and Rockset
Serve a live dashboard using SQL on streaming Twitter data from Amazon Kinesis.
Haneesh Reddy Poddutoori
December 7th, 2018
Fast SQL
Running SQL on Nested JSON
Make raw JSON immediately queryable through fast SQL queries, without ETL, data pipelines, or fixed schema.
Anirudh Ramanathan
November 7th, 2018
Engineering
RocksDB
Rockset's RocksDB-Cloud Library - Enabling the Next Generation of Cloud Native Databases
David Cohen, System Architect at Intel, explores how RocksDB-Cloud can be be used to build an open-source cloud-friendly storage system.
David Cohen
November 1st, 2018
Fast SQL
Engineering
Dynamic Typing in SQL
Rockset Chief Architect Tudor Bosman discusses strong dynamic typing in SQL, and how it is implemented in Rockset.
Tudor Bosman
November 1st, 2018
Big Ideas
Why SQL on Raw Data?
SQL on unstructured data is hard. But storage and compute in the cloud are making SQL on raw data a reality.
Peter Bailis
October 30th, 2018
Big Ideas
Cloud Native: What It Means in the Data World
Rockset CTO and co-founder Dhruba Borthakur discusses what Cloud-Native data processing entails, and how best to build for the cloud today.
Dhruba Borthakur
October 19th, 2018
Big Ideas
The Road Ahead: From Open Source to Open Services
Rockset CTO and co-founder Dhruba Borthakur discusses the shift from Open Source to Open Services in data infrastructure, and how Open Services will become the new standard.
Dhruba Borthakur