Introduction InfluxDB 3.0
InfluxDB is more than just a source time series database. There are also client libraries which enable developers to easily integrate InfluxDB into their applications. Telegraf is an open source collection agent for application metrics and events. InfluxDB 3.0 also offers interoperability with a lot of data analytics and visualization tools so that you execute your analytics workload with the tools that you’re already familiar with.
The new InfluxDB engine is built with Rust, Apache Arrow (and Arrow Flight), DataFusion, and Parquet. This technology stack makes InfluxDB 3.0 an ideal choice for storing time series data because it offers increased resource management, high performance, and interoperability. Let’s take a second to understand how each piece contributes benefits to InfluxDB:
- Rust is a programming language that is very performant and offers fine grain memory management. This allows users to have more operator control over memory usage (for certain versions of InfluxDB v3).
- Apache Arrow is a framework for defining in-memory columnar data. The columnar data representation enables really fast compression, which enables InfluxDB users to write over 4 million values per second.
- Parquet is a column-oriented durable file format. Parquet files are 16 times cheaper to store than CSV files. Additionally
- Arrow Flight is a “new general-purpose client-server framework to simplify high performance transport of large datasets over network interfaces.” Arrow and Arrow Flight enable interoperability with other tools that leverage those technologies as well including: Pandas, Kafka, Snowflake, Spark, Clickhouse, and more.
- DataFusion is an “extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.” DataFusion lets InfluxDB users query with SQL (and eventually maybe Python), so you don’t have to worry about learning a new query language.
Get started using InfluxDB 3.0
The easiest way to get started using InfluxDB 3.0 is by signing up for a free InfluxDB Cloud Serverless trial. Then I recommend loading some sample data with any of these approaches:
- Install Telegraf. Then navigate to the InfluxDB UI and follow this short video to configure it through the InfluxDB UI. I recommend configuring a CPU input plugin so you get some system stats.
- Write the Air Sensors Sample dataset through the UI.
- Navigate to the Buckets Page.
- Create a new bucket “airSensors”.
- Click the +Add Data button with the Line Protocol option
- Navigate to this page and copy and paste the line protocol data in the Enter Manually box and write it to your bucket.