Data Pipeline Monitoring
Rajaraman Sathyamurthy
Associate Director & Senior Architect, Data Architecture
There are many reasons a data pipeline could break. Most of the time it is due to issues at the data source: the servers we ingest data from could be down, or there could be connectivity issues, authentication issues, and so on. When this happens, data goes missing and dashboards are unable to show correct or complete data.
Users of the dashboards or the application owners would notice this and contact us for remediation. But do you really want your customers to escalate an issue before you troubleshoot and fix it? That is reactive. So how can you monitor data pipelines proactively?
In ELK, you can use watchers to monitor the indices at the required frequency. The monitoring frequency depends on how often the data is ingested or refreshed and can differ for each index. When you have a dedicated monitoring cluster (based on Elastic) to monitor all ELK clusters in the environment, you can enable cross-cluster search (CCS). Cross-cluster replication (CCR) is also an option, but it would require a larger monitoring cluster (more storage). The watcher can be configured to send alerts to the respective application stakeholders, support team, and others, who can proactively take corrective action without having to wait for end users to report issues.
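As a minimal sketch of this idea (the cluster URL, credentials, the remote cluster alias prod-cluster, the index pattern app-logs-*, and the time windows below are all assumptions, not values from our environment), the watcher below runs every 15 minutes on the monitoring cluster, searches the remote cluster via CCS, and fires when no documents have arrived in the last 30 minutes:

```python
import requests

# Assumed endpoint and names, for illustration only.
MONITORING_CLUSTER = "https://monitoring-cluster.example.com:9200"
WATCH_ID = "app-logs-freshness"

# Watch definition: every 15 minutes, search the remote cluster through
# cross-cluster search (alias "prod-cluster") and alert if no documents
# landed in app-logs-* during the last 30 minutes.
watch = {
    "trigger": {"schedule": {"interval": "15m"}},
    "input": {
        "search": {
            "request": {
                "indices": ["prod-cluster:app-logs-*"],  # CCS syntax: <cluster-alias>:<index-pattern>
                "body": {
                    "size": 0,
                    "query": {"range": {"@timestamp": {"gte": "now-30m"}}},
                },
            }
        }
    },
    # Fire only when the recent document count drops to zero.
    "condition": {"compare": {"ctx.payload.hits.total": {"lte": 0}}},
    "actions": {
        "log_missing_data": {
            "logging": {
                "text": "No documents ingested into app-logs-* in the last 30 minutes."
            }
        }
    },
}

# Register the watch via the Watcher API on the monitoring cluster.
resp = requests.put(
    f"{MONITORING_CLUSTER}/_watcher/watch/{WATCH_ID}",
    json=watch,
    auth=("elastic", "changeme"),  # replace with real credentials or an API key
    timeout=30,
)
resp.raise_for_status()
print(resp.json())
```

The interval and the "now-30m" freshness window would be tuned per index to match its ingestion or refresh frequency, as noted above.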
The diagram shows integration with Slack for outbound alert notifications, but by using the respective webhook you can integrate with ServiceNow, MS Teams, and other tools according to your needs.
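To route the alert to a chat or ticketing tool, the logging action in the sketch above can be swapped for a watcher webhook action. The snippet below is an assumed example posting to a Slack incoming webhook (the webhook path is a placeholder); the same structure can point at an MS Teams or ServiceNow endpoint with the appropriate host, path, and payload:

```python
# Webhook action posting to a Slack incoming webhook (path is a placeholder).
# Change host/path/body to target MS Teams, ServiceNow, or any HTTP endpoint.
slack_webhook_action = {
    "notify_slack": {
        "webhook": {
            "scheme": "https",
            "host": "hooks.slack.com",
            "port": 443,
            "method": "post",
            "path": "/services/T0000/B0000/XXXXXXXX",  # placeholder webhook path
            "headers": {"Content-Type": "application/json"},
            "body": '{"text": "Data pipeline alert: no documents in app-logs-* for 30 minutes."}',
        }
    }
}

# Merge into the watch definition before the PUT call shown earlier.
watch["actions"].update(slack_webhook_action)
```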
Team: Parthiban P; Liju Thomas; Rajesh Mehra; Rajaraman Sathyamurthy