OpenTSDB has a thriving community who contributed and requested a number of new features.
While 3.0 is still a ways off, we'll be pushing some of the new features into a new branch of the repo. Some are in progress and other features are planned. If you have any features that you want to see, let us know.
- Distributed Queries - Based on the great work of Turn on Splicer we have a distributed query layer to split queries amongst multiple TSDs for greater throughput.
- Query Caching - Improve queries with time-sharded caching of results.
- Improved Expressions - Perform group by, downsampling and arithmetic modifications in any order. Potentially support UDFs as well.
- Anomaly Processing/Forecasting - Integrate with modeling libraries (such as EGADs ) for deeper time series analysis.
- Rollup/Pre-Aggregates - Support for storing and querying time-based rolled up data and/or pre-aggregated values.
- Distributed Percentile - Store histograms (or sketches) for calculating proper percentiles over multiple sources.
- Expressions - Query time computations using time series data. For example, dividing one metric by another.
- Graphite Style Functions - Additional filtering and mutation of data at query time using Graphite style functions.
- Calendar Based Downsampling - The ability to align downsampled data on Gregorian calendar boundaries.
- Bigtable Support - Run TSDB in the cloud using Google's hosted Bigtable service.
- Cassandra Support - Support for running OpenTSDB on legacy Cassandra clusters.
- Write Filters - Block or allow time series or UID assignments based on plugins or whitelists.
- New Aggregators - None for returning raw data. First and Last to return the first or last data points during downsampling.
- Meta Data Cache Plugin - A new API for caching meta data to improve query performance.
- Startup Plugins - APIs to help with service discovery on TSD startup.
- Example Java API usage classes.
- Appends - Support writing all data points for an hour in a single column. This saves the need for TSD compactions and reduces network traffic at query time.
- Salting - Enables greater distribution of writes for high cardinality metrics as well as asynchronous scanning for improved query speed. (Non backwards compatible)
- Random Metric UIDs - Enables better distribution of writes when creating new metrics
- Storage Exception Plugin - Enables various handling of data points when HBase is unavailable
- Secure AsyncHBase - Access HBase clusters requiring Kerberos or simple authentication along with optional encryption.
- Fill Policy - Enable emitting NaNs or Nulls via the JSON query endpoint when data points are "missing"
- Count and Percentiles - New aggregator functions
- More Stats - Gives greater insight into query performance via the query stats endpoint and new stats for threads, region clients and the JVM
- Annotations - Scan for multiple annotations only via the /api/annotations endpoint
- Query Filters - New filters for flexibility including case (in)sensitive literals, wildcards and regular expressions.
- Override Tag Widths - You can now override tag widths in the config instead of having to recompile the code.
- Compaction Tuning - New parameters allow for tuning the TSD compaction process.
- Delete Data And UIDs - Allow for deleting data at query time as well as removing UIDs from the system.
- Synchronous Writing - The HTTP Put API now supports synchronous writing to make sure data is flushed to HBase.
- Query Stats - Query details are now logged that include timing statistics. A new endpoint also shows running and completed queries.
- Downsampling - Timestamps are now aligned on modulus boundaries, reducing the need to interpolation across series.
- Last Data Point API - Query for the last data point for specific time series within a certain time window
- Duplicates - Handle duplicate data points at query time or during FSCK
- FSCK - An updated FSCK utility that iterates over the main data table, finding and fixing errors
- Read/Write Modes - Block assigning UIDs on individual TSDs for backup clusters
- UID Cache - Preload portions of the UID table on startup to improve writes
- Lock-less UID Assignment - Drastically improves write speed when storing new metrics, tag names, or values
- Restful API - Provides access to all of OpenTSDB's features as well as offering new options, defaulting to JSON
- Cross Origin Resource Sharing - For the API so you can make AJAX calls easily
- Store Data Via HTTP - Write data points over HTTP as an alternative to Telnet
- Configuration File - A key/value file shared by the TSD and command line tools
- Pluggable Serializers - Enable different inputs and outputs for the API
- Annotations - Record meta data about specific time series or data points
- Meta Data - Record meta data for each time series, metrics, tag names, or values
- Trees - Flatten metric and tag combinations into a single name for navigation or usage with different tools
- Search Plugins - Send meta data to search engines to delve into your data and figure out what's in your database
- Real-Time Publishing Plugin - Send data to external systems as they arrive to your TSD
- Ingest Plugins - Accept data points in different formats
- Millisecond Resolution - Optionally store data with millisecond precision
- Variable Length Encoding - Use less storage space for smaller integer values
- Non-Interpolating Aggregation Functions - For situations where you require raw data
- Rate Counter Calculations - Handle roll-over and anomaly supression
- Additional Statistics - Including the number of UIDs assigned and available
Thank you to everyone who has contributed to 2.4. Help us out by sharing your ideas and code at GitHub