登录查看更多内容

Streamlining Log Management with OpenTelemetry - Best Practices for Capturing, Parsing, and Storing Logs

Greptime

Industry-leading time series database for IoT and Observability with real-time insights and lower costs at any scale.

发布日期: 2025年1月9日

OpenTelemetry (OTel) is an open-source standard designed specifically for monitoring the health of applications. By collecting metrics, logs, and traces, it provides a comprehensive view of the system, helping operations teams quickly identify and resolve issues.

In our previous blog What is OpenTelemetry — Metrics, Logs, and Traces for Application Health Monitoring we introduced the different components of OTel and how they work together.

In this article, we’ll focus on logs, one of the three pillars of observability. Logs are essential for understanding events within a system, providing detailed, time-stamped records of operations. However, the unstructured nature of logs can make it challenging to extract meaningful insights. That’s where OpenTelemetry’s standardized Log Data Model steps in, offering a structured framework to make logs easier to capture, query, and analyze.

Let’s dive into how OpenTelemetry transforms raw log data into actionable insights and the tools it provides for efficient log collection and processing.

Capturing Logs in OpenTelemetry

Logs are one of the most fundamental ways developers observe events happening within their systems. Logs are fundamentally unstructured data streams which provide developers with the flexibility to emit signals about their applications as they see fit. Though the lack of structure makes it hard to create standardization across tooling to support the capturing, parsing, and saving of log files.

This is where the OpenTelemetry Log Model provides significant value. Its standardized structure offers a deeper understanding of each log's context, helping to answer questions about when, where, why, and how a log was emitted.

Log Data Model field

Examples of log data model fields and what types of questions they answer

1. Tracing Context

The TraceId and SpanId fields allow logs to be correlated with distributed traces, enabling developers to follow the path of a request across multiple services.
Example question: "What other events occurred during this specific user transaction?"

2. Temporal Information

The Timestamp and ObservedTimestamp fields help answer questions about when events occurred and how long they took to be processed.
Example question: "Is there a delay between when events occur and when they're observed in our system?"

3. Severity and Importance

SeverityText and SeverityNumber fields allow for easy filtering and prioritization of logs.
Example question: "How many critical errors occurred in the last hour?"

4. Contextual Attributes

The Attributes field allows for custom key-value pairs to be added, providing additional context.
Example question: "What was the user ID associated with this error?"

5. Source Identification

The Resource field helps identify where the log came from (e.g., which service, host, or container).
Example question: "Which specific microservice is generating the most errors?"

6. Instrumentation Details

The InstrumentationScope field provides information about the library or module that generated the log.
Example question: "Are there any patterns in errors coming from a specific third-party library?"

7. Flexible Message Content

The Body field allows for structured or unstructured log messages, supporting both legacy logging practices and more modern structured logging approaches.
Example question: "What are the most common errors we're seeing around our widget service?"

By adhering to this model, logs become much more than just text entries. They become rich, queryable data points that can be easily correlated with other telemetry data (like traces and metrics) to provide a comprehensive view of system behavior and performance.

Converting Logs to the Log Data Model

OpenTelemetry provides two main mechanisms for converting logs to this data model.

Log SDK

The Log SDK consists of libraries provided by OpenTelemetry which help bridge existing logging libraries to the OTLP format expected by the other downstream OpenTelemetry systems. Directly using the SDK is best for simple, straightforward implementations where developers want to save the logs into a database with minimal setup and maintenance of other components.

领英推荐

AI data management: optimize your company’s operations…

N-iX 1 个月前

Revolutionizing Data Management with Intelligent Data…

Dr. Jagreet Kaur 9 个月前

Enhancing Data Quality Management Through Automation

XenonStack 3 个月前

Setting up the Log SDK involves configuring existing language logging frameworks like Python's logging, Go's log, or Java's Log4j with an OpenTelemetry LogHandler. After configuring your standard logging library with important details like the name of the log, what kind of logs you want to capture (stdout, stderr, debug), and the destination of your logs (any OTLP compatible backend), invocations of those loggers throughout your application will be sent to the configured destination with the configured attributes.

Below is an example instrumenting a Python application with the OpenTelemetry Logging SDK.

import logging
from opentelemetry.sdk._logs import LogEmitterProvider, LoggingHandler
from opentelemetry.sdk._logs.export import BatchLogProcessor, ConsoleLogExporter, OTLPLogExporter
from opentelemetry.sdk.resources import Resource
from opentelemetry.exporter.otlp.proto.grpc._log_exporter import OTLPLogExporter
from opentelemetry.sdk._logs import set_log_emitter_provider

log_emitter_provider = LogEmitterProvider(resource=Resource.create({"service.name": "my-python-app"}))

# Set the global log emitter provider (this will bridge the Python logging with OpenTelemetry)
set_log_emitter_provider(log_emitter_provider)

# Configure the OTLP exporter (sending to OpenTelemetry-compatible backend)
otlp_exporter = OTLPLogExporter(endpoint="https://greptimedb:4000", insecure=True)

# Add a batch log processor for sending logs asynchronously
log_processor = BatchLogProcessor(otlp_exporter)
log_emitter_provider.add_log_processor(log_processor)

# Optionally, add a ConsoleLogExporter for debugging
# console_exporter = ConsoleLogExporter()
# log_emitter_provider.add_log_processor(BatchLogProcessor(console_exporter))

# Set up logging integration
handler = LoggingHandler(level=logging.INFO, log_emitter_provider=log_emitter_provider)
logging.getLogger().addHandler(handler)

# Example usage of logging with OpenTelemetry bridge
logger = logging.getLogger(__name__)
logger.setLevel(logging.INFO)

# Example log message
logger.info("This is an OpenTelemetry log message!")

Log Collector

The OpenTelemetry Collector is a powerful component that can be used to collect, process, and export logs from various sources. It's particularly useful in more complex environments or when you need additional flexibility in handling logs, though setting it up comes with extra challenges and components to maintain. Here are some key points about using the Collector for logs:

1.Versatile Log Collection

The Collector can ingest logs from multiple sources, including:

Application logs sent directly via OTLP
Log files on disk
System logs (e.g., syslog)
Container logs (e.g., from Docker or Kubernetes)

2. Processing Capabilities

The Collector can perform various operations on logs before forwarding them:

Filtering: Remove unnecessary logs
Transformation: Modify log structure or content
Enrichment: Add metadata or attributes to logs

3. Multiple Export Options

Logs can be sent to various backends, including:

OpenTelemetry-native backends
Popular logging systems (e.g., Elasticsearch, Splunk)
Cloud provider services (e.g., AWS CloudWatch, Google Cloud Logging)

4. Scalability and Performance

The Collector can handle high volumes of logs and provides features like batching and retry mechanisms for reliable delivery.

5. Configuration Flexibility

You can easily adjust the Collector's behavior through YAML configuration files, allowing for quick changes without modifying application code.

Here's a basic example of a Collector configuration for logs:

receivers:
  otlp:
    protocols:
      grpc:
      http:

processors:
  batch:

exporters:
  logging:
    loglevel: debug
  otlp:
    endpoint: 'otel-collector:4317'
    tls:
      insecure: true

service:
  pipelines:
    logs:
      receivers: [otlp]
      processors: [batch]
      exporters: [logging, otlp]

This configuration sets up the Collector to receive logs via OTLP, batch them for efficiency, and then export them to both a logging exporter (for debugging) and another OTLP endpoint.

Using the Collector provides a centralized way to manage logs across your entire infrastructure, offering greater flexibility and control over log processing and routing compared to the Log SDK approach.

GreptimeDB as OpenTelemetry Log Collector

GreptimeDB is a cloud-native time-series database designed for real-time, efficient data storage and analysis, especially in observability scenarios. With native support for OpenTelemetry, GreptimeDB can act as a collector, allowing users to easily ingest, store, and analyze observability data. This simplifies log pipelines while providing a scalable and powerful backend for monitoring.

For more details, check out the GreptimeDB OpenTelemetry documentation.

Choosing Collector vs Exporter

These two mechanisms help developers support log capture for any usecase that is needed. The main questions to ask yourself when determining what is the best option are around the complexity of your environment.

For example:

Will I be collecting logs from many different sources?
Do I have several additional transformations to perform against these logs?
Am I willing to invest the development effort to manage this additional component?

If you need to collect from several sources and desire to perform many complex operations, it will be best to invest upfront in setting up a log collection agent like fluent bit or Grafana alloy. Otherwise, if you just want to simply collect logs from a few sources with lower overhead and complexity, try out the logging sdk.

If you need help setting up your Logging pipeline with OpenTelemetry, reach out to our team to discuss how you can get started with collecting your logs and building a more resilient deployment.

Streamlining Log Management with OpenTelemetry - Best Practices for Capturing, Parsing, and Storing Logs

Greptime

Industry-leading time series database for IoT and Observability with real-time insights and lower costs at any scale.

Capturing Logs in OpenTelemetry

Log Data Model field

Examples of log data model fields and what types of questions they answer

Converting Logs to the Log Data Model

Log SDK

领英推荐

Log Collector

Choosing Collector vs Exporter

Observability Unlocked

311 位关注者

Greptime的更多文章

社区洞察

其他会员也浏览了

The Evolution of Data Quality in Financial Institutions: A Five-Year Journey

Transform Raw Logs into Actionable Insights: A Guide to Parsing for OT SIEM

June 14, 2024

Data Governance is Non-Negotiable: A Board of Directors’ Perspective on Establishing a Strong Foundation for AI-Driven Initiatives

The Dark, Unstructured and Hidden World of Financial Services Data

Data Governance: Mastering the Info Asset in the Digital Age

The Data Challenge: Shifting from DevSecOps to MLSecOps

CULTURE AS A BARRIER TO DATA-DRIVEN ORGANIZATION

A Comparative Analysis of Tosca Live Compare vs. Traditional Data Comparison Tools

Efficient Vector Retrieval - a perspective

Capturing Logs in OpenTelemetry

Log Data Model field

Examples of log data model fields and what types of questions they answer

Converting Logs to the Log Data Model

Log SDK

领英推荐

Log Collector

Choosing Collector vs Exporter

Observability Unlocked

311 位关注者

Greptime的更多文章

GreptimeDB Takes on the Billion-JSON-Document Challenge - Outperforms ClickHouse, VictoriaLogs, and Competitors

How Vector Remap Enhances Log Data Parsing and Storage in Observability

VictoriaLogs Source?Reading

Full Power of Vector Unleashed | In-Depth Look at Adaptive Request Concurrency Mechanism

Choosing the Right Log Aggregation Tool for Performance, Compression, and Cost Efficiency

What is Log Aggregation? Key Factors to consider for a good Log Management System

An OpenTelemetry Python Example — Building a Tesla Data Monitor

Scaling Prometheus: Deploying a Database Cluster for Long-Term Storage in Kubernetes

Are you Listening? - Emit and Capture Signals with OpenTelemetry Instrumentation in NodeJS

What is OpenTelemetry —— an Introduction for Beginners

社区洞察

其他会员也浏览了

The Evolution of Data Quality in Financial Institutions: A Five-Year Journey

Transform Raw Logs into Actionable Insights: A Guide to Parsing for OT SIEM

June 14, 2024

Data Governance is Non-Negotiable: A Board of Directors’ Perspective on Establishing a Strong Foundation for AI-Driven Initiatives

The Dark, Unstructured and Hidden World of Financial Services Data

Data Governance: Mastering the Info Asset in the Digital Age

The Data Challenge: Shifting from DevSecOps to MLSecOps

CULTURE AS A BARRIER TO DATA-DRIVEN ORGANIZATION

A Comparative Analysis of Tosca Live Compare vs. Traditional Data Comparison Tools

Efficient Vector Retrieval - a perspective