Elasticsearch vs. CtrlB

Telemetry data explosion

Data being generated is growing at a staggering rate of 23% YoY, whereas IT budgets grow by about 5% YoY in the best cases. The bigger challenge is that while the data keeps growing, the value extracted from it does not.

Data volume grows; hence, the cost to manage that volume grows, but the value derived from it plateaus.
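To make the gap concrete, here is a minimal sketch (using the 23% and 5% figures above as assumptions) of how the two curves diverge once both compound:

```python
# Rough illustration: 23% YoY data growth vs. 5% YoY budget growth, compounded.
data, budget = 1.0, 1.0
for year in range(1, 6):
    data *= 1.23
    budget *= 1.05
    print(f"Year {year}: data x{data:.2f}, budget x{budget:.2f}, gap x{data / budget:.2f}")

# After 5 years: data is roughly 2.8x, budget roughly 1.3x -- a ~2.2x gap.
```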

Challenges with managing Elasticsearch on larger data volumes

Let us first look at the architecture of the ELK stack to understand the underlying issue.

Typical ELK stack architecture with Data Nodes, Master Nodes, Logstash, and Kibana

  • Logstash collects logs from Kafka or other supported sources and writes them to stateless nodes called ES Writers, which put the data on the data nodes.
  • Kibana/Grafana sends a query to a stateless ES Query node, which fans it out to the data nodes that serve the query (both paths are sketched below).
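As a rough sketch of those two paths, assuming the Elasticsearch 8.x Python client, a local cluster, and hypothetical index names:

```python
from elasticsearch import Elasticsearch, helpers

es = Elasticsearch("http://localhost:9200")  # hypothetical endpoint

# Write path: a stateless writer drains a batch (e.g. from Kafka) and bulk-indexes
# it; the cluster then routes each document to a shard on some data node.
events = [{"service": "checkout", "level": "error", "msg": "timeout calling payments"}]
helpers.bulk(es, ({"_index": "logs-2024.01.01", "_source": e} for e in events))

# Read path: Kibana (or any client) sends one search request; the coordinating
# node fans it out to every matching shard and merges the results.
resp = es.search(index="logs-*", query={"match": {"level": "error"}}, size=10)
print(resp["hits"]["total"])
```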

High operational overhead.

Managing 100+ data nodes is tedious and error-prone.

Cluster-wide operations can take days or weeks; one cannot simply isolate a few nodes and operate on them alone, and these operations impact read/write performance.

There are several single points of failure: during bulk indexing, if one indexer is slow, overall indexing throughput drops, because everything moves only as fast as the slowest node. With hundreds of nodes, a few of them will always be sitting at their P99 latency.
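The "slowest node sets the pace" point follows directly from fan-out. A back-of-the-envelope calculation (assuming each node independently responds slowly 1% of the time, i.e. its own P99) shows how quickly tail latency becomes the norm:

```python
# Probability that a query fanning out to all n_nodes hits at least one slow node.
def p_slow_fanout(n_nodes: int, p_slow: float = 0.01) -> float:
    return 1 - (1 - p_slow) ** n_nodes

for n in (10, 50, 100, 300):
    print(n, round(p_slow_fanout(n), 3))
# 10 -> ~0.10, 100 -> ~0.63, 300 -> ~0.95
```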

Difficult to handle log spikes.

Peak provisioning leads to unused resources, as teams have to size their clusters for peak load with log spikes in mind.

Log spikes lead to ingestion lag, i.e. loss of real-time visibility into the system: there is not enough capacity to ingest all the messages, so ingestion slows down or stops.

During an incident, when there is a spike, adding capacity makes ES start redistributing shards, which further reduces the capacity available at that moment.
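To illustrate how much manual babysitting this takes (not a fix for the underlying problem): a common stop-gap, sketched here with the Elasticsearch Python client, is to pause rebalancing while capacity is being added and re-enable it once the cluster is healthy again.

```python
from elasticsearch import Elasticsearch

es = Elasticsearch("http://localhost:9200")  # hypothetical endpoint

# Temporarily stop shard rebalancing so newly added capacity is not immediately
# consumed by data movement. Transient settings reset when the cluster restarts.
es.cluster.put_settings(transient={"cluster.routing.rebalance.enable": "none"})

# ... scale out / ride out the spike ...

# Re-enable rebalancing once the incident is over.
es.cluster.put_settings(transient={"cluster.routing.rebalance.enable": "all"})
```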

Multi-tenancy and data reliability.

ES rejects requests with field conflicts: if a field such as id is an integer in one message and a string in another, ES will reject the event that arrives later.

Visibility is also lost if the cluster is redeployed or the index rolls over: the new index may pick up the other type for the id field, and all the alerts/dashboards built on it break.
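A minimal reproduction of that failure mode, assuming a local cluster, the 8.x Python client, dynamic mapping, and a hypothetical app-logs index:

```python
from elasticsearch import Elasticsearch, BadRequestError

es = Elasticsearch("http://localhost:9200")  # hypothetical endpoint

# The first document dynamically maps "id" as a number.
es.index(index="app-logs", document={"id": 123, "msg": "user created"})

# A later document from another team sends "id" as a non-numeric string; with the
# field already mapped as a number, Elasticsearch rejects the event.
try:
    es.index(index="app-logs", document={"id": "user-123", "msg": "user created"})
except BadRequestError as err:
    print("event dropped:", err)
```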

Backups needed for data reliability add to the infra cost.

How CtrlB solves for observability at scale

CtrlB can be divided into two parts -

  • CtrlB Flow - An observability pipeline that lets developers/ops route data from any source to any destination while analyzing it in the stream.
  • CtrlB Explore - Queryable storage on top of S3/blob storage with sub-second latency, optimized for observability data.


Let us look at the architecture of CtrlB Explore.

Separation of compute & storage


  • Kafka writes to the stateless Interface Node, which in turn writes to the elastic compute nodes.
  • For queries, the Interface Node receives the query and fans it out to the elastic compute nodes, which scale out to serve the query and then shrink back (a conceptual sketch of this pattern follows).
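The sketch below is conceptual only and does not describe CtrlB's actual implementation; all names in it are hypothetical. It illustrates the pattern above: a stateless interface fans a query out to short-lived workers that each scan one segment held in object storage, then merges the partial results.

```python
from concurrent.futures import ThreadPoolExecutor

# fetch_segment is a hypothetical stand-in for an S3/blob read; here it just
# returns in-memory sample rows so the sketch runs on its own.
SAMPLE_SEGMENTS = {
    "s3://logs/segment-001": [{"level": "error", "msg": "timeout"},
                              {"level": "info", "msg": "ok"}],
    "s3://logs/segment-002": [{"level": "error", "msg": "oom"}],
}

def fetch_segment(uri):
    return SAMPLE_SEGMENTS[uri]

def scan_segment(uri, predicate):
    return [row for row in fetch_segment(uri) if predicate(row)]

def run_query(uris, predicate):
    # Worker count scales with the query, then the pool is torn down:
    # compute is elastic, while the data stays in object storage.
    with ThreadPoolExecutor(max_workers=len(uris)) as pool:
        partials = pool.map(lambda u: scan_segment(u, predicate), uris)
    return [row for part in partials for row in part]

print(run_query(list(SAMPLE_SEGMENTS), lambda r: r["level"] == "error"))
```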

CtrlB Advantages

  • Cut down your observability cost by up to 80-90%
  • Take control of your observability data and choose what is important for you and what is not. Pay for data value, not volume.
  • Eliminate vendor lock-in and give your teams the flexibility and freedom to use any tool they like.
  • Have a central place to govern data, retract PII, react to alerts faster, etc.

Interested in knowing more or ready to take control of your observability data? Reach out to us at - [email protected]

Data/Reference

Cost comparison for 90-day retention (chart not reproduced here).
