Rockset Product Overview with Nadine
Nadine shares how to use Rockset to power data-driven applications.>
Show Notes
Nadine:
Hi everyone. My name is Nadine and I'm a Senior Developer Advocate here at Rockset, and today, I'm going to give you a product overview. Rockset is a real-time indexing database serving low latency, high-concurrency queries at scale. If you're familiar with Search, then you'll find Rockset's converge indexing really interesting. Rockset is used for analytical queries, and not just the dashboard kind. Think leaderboards, personalization or recommendation engines, location search, and so on. We're talking real-time analytics for apps.
Nadine:
There are many challenges when building real-time analytical applications. You'll need to have real-time sync with your data source. You'll need to handle semi-structured data. You'll need to have multiple indexes for low-latency queries and so on, but Rockset handles all of this out of the box. So let's see how you can get started.
Nadine:
With Rockset, you can get started with either the command line or the console. Let's go ahead and jump into the console. So create integration takes about three or four minutes, where you'll provide read access to your data source. Rockset will do a bulk load followed by continuous ingest. As you see, Rockset has multiple data source connectors. So you can click and connect to your data source such as MongoDB, Amazon S3, Apache Kafka, and DynamoDB. After Rockset bulk loads the data, it'll switch to continuous ingest to stay in sync with your source so you never have to worry about data being out of sync.
Nadine:
So now let's go ahead and create a collection from our integration. I'm going to go ahead and click on the collections tab and click on a collection I already made with the data that Rockset ingested. So Rockset creates smart schemas. Smart schemas are automatically generated schemas based on the exact fields and types present in the ingested data.
Nadine:
The schema represents semi-structured data, nested objects and arrays, mixed types and nulls, enabling relational SQL queries. So if I hover above a particular field, you'll see the percentage of each data types shows up in that field. So here we have float types and we have in types. Similarly, for cloud ceiling dot value, you'll see there are null types and there are in types. So now let's go ahead and query the data now that it is in Rockset. So I'm going to go ahead and navigate to the query editor and we're going to go ahead and take a look at this query. As you can see from the sample data set, we can do complex joints and we can also do aggregations. And at this point we haven't had to do any denormalization or data clean-up. We can actually just run this query on semi-structured data.
Nadine:
And this is just a sample data set, but if you were to run a search query in a hundred billion records, it would return in under a hundred milliseconds, and this is used to converge indexing. So all fields, including deeply nested fields, are indexed in an inverted index, like elastic surge, column restore and row index, and our query optimizer selects the best index for the query. This also means you don't have to do any more index management. You get fast queries out of the box, and we also support high-volume rights while every field and document is being indexed.
Nadine:
So now that we ran a query, we'll be able to create a rest endpoint and turn it into an API with a query lambda. I'm going to go ahead and click on create query lambda and fill out those details. Afterwards, you'll be able to get a code snippet that you'll embed and execute in your application. This is what it looks like. With query lambdas, you can enforce version control and integrate it to your CICD workflow. This is the same query lambda that we created earlier and I'm going to go ahead and run the project. And as you see, we're getting the same results that we did when we ran it in the console.
Nadine:
So if we go to the Rockset docs, you'll see all the languages we support for the client libraries. So Rockset is serverless, as you see. It autoscales in the cloud and automates cluster provisioning and index management. So if you want to learn more, go ahead and try out our quick start guide with a free developer account.