Filter the Noise
Syed Nadeem
DevOps Architect | Kubernetes Expert | System Design Innovator | Multi Cloud Expert | Transforming Ideas into Robust Architectures
Modern-day systems generate an overwhelming amount of noise in the form of logs, events, and notifications. This noise can drown out important alerts and critical information, making it challenging to identify and address significant issues promptly. IoT devices and microservices can produce millions of garbage log entries, and investigating something within that pile can make you feel as if you are trapped beneath a mountain. While the long-term solutions live at the application and service level, there are quite a few short-term measures that can help you filter the digital noise.
Log Parsing: This involves meticulously examining and extracting structured data from log entries. Start by identifying the key fields you are looking for, then define regular expressions or pattern-matching rules in your scripts or tools. The key here is not to do parsing manually: adopt a tool like Graylog, Fluentd, or a Grok-based pipeline, which should act as the log-centralization point for all applications and services. All of these tools come with built-in streams and filters that are excellent for parsing.
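As a minimal sketch of what "identify the key fields, then pattern-match" looks like in practice (the log format, field names, and regex here are hypothetical; a centralized tool's streams and filters do the same job at scale):

```python
import re

# Hypothetical log line format -- adapt the regex to your own logs.
LOG_PATTERN = re.compile(
    r'(?P<timestamp>\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})\s+'
    r'(?P<level>INFO|WARN|ERROR)\s+'
    r'(?P<service>[\w-]+)\s+'
    r'(?P<message>.*)'
)

def parse_line(line):
    """Extract structured fields from a raw log line; None if it doesn't match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None

# Example: keep only ERROR entries from a stream of raw lines.
lines = [
    "2024-05-01T12:00:00 INFO  checkout-api request served",
    "2024-05-01T12:00:01 ERROR payment-svc timeout calling gateway",
]
errors = [p for line in lines if (p := parse_line(line)) and p["level"] == "ERROR"]
print(errors[0]["service"])  # payment-svc
```

Once every entry is a dictionary of named fields instead of a raw string, filtering, counting, and routing become one-liners, which is exactly what the streams and filters in Graylog or Fluentd give you out of the box.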
Intelligent Alerting: Artificial intelligence has advanced alerting by leaps and bounds in recent times. Based on sample data collected over a time frame, we can get not only live alerts but also alerts predicting future events. Isn't that crazy :)
For example, based on your current usage, AWS can predict and alert on what you will spend in future months. Datadog can automatically identify abnormal behavior in metrics and generate intelligent alerts, and BigPanda can automatically correlate related alerts, filter out noise, and surface high-priority incidents.
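At its simplest, this kind of anomaly detection is statistical: flag any data point that strays too far from recent behavior. The sketch below uses a rolling z-score as a toy stand-in for the far more sophisticated models these managed services run; the latency numbers and thresholds are illustrative:

```python
from statistics import mean, stdev

def zscore_alerts(samples, window=10, threshold=3.0):
    """Flag points deviating more than `threshold` standard deviations
    from the trailing window's mean. A toy anomaly detector -- real
    services model seasonality, trend, and much more."""
    alerts = []
    for i in range(window, len(samples)):
        baseline = samples[i - window:i]
        mu, sigma = mean(baseline), stdev(baseline)
        if sigma > 0 and abs(samples[i] - mu) / sigma > threshold:
            alerts.append((i, samples[i]))
    return alerts

# Steady latency around 100 ms, then a sudden spike at the end.
latencies = [100, 101, 99, 100, 102, 98, 100, 101, 99, 100, 100, 450]
print(zscore_alerts(latencies))  # [(11, 450)]
```

The point is that the alert fires on *deviation from learned behavior*, not on a hand-picked fixed number, which is what makes this class of alerting so much quieter than naive rules.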
Fewer Tools: Less is good. That's right: choose only the tools you need, and never try to find a need for a tool you have already chosen. There will be experts suggesting all the fancy new tools, and that's okay, but if you don't have a need that requires a new tool, take that advice as a side note and move on. Have strictly one solution for log management and one solution for alerting and reporting. In other words, you don't need two antivirus programs going crazy on one laptop.
Defining Thresholds and Aggregating Metrics: This is the most important part. To decide that something is serious and needs immediate attention, you need to define thresholds, and doing so also improves the security and reliability of the overall system. If you are using Prometheus, use static or dynamic thresholds and aggregate metrics with functions like sum(), avg(), rate(), and increase() to define what is acceptable. If you are using Fluentd, make use of record_transformer and kubernetes_metadata to transform logs before deciding what is acceptable. In Sysdig, use filters like contains and matches to get counts and, eventually, metrics.
Once a threshold or limit is reached, an event-driven procedure needs to kick in automatically. For example, a metrics system can tell you your system's current capacity, and when it reaches 60% you automatically want extra nodes to be added (what we call HPA or auto-scaling). Or if an IP keeps appearing in your logs and is not from your known CIDR ranges, you automatically want to block it. The examples are many but the logic is simple: define and make use of thresholds and aggregated metrics.
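Both examples, scaling on a capacity threshold and blocking an unknown source, boil down to one decision step: aggregate, compare against the threshold, act. The sketch below is illustrative; the function names, the 60% trigger, and the CIDR ranges are assumptions, and in production the actions would map to HPA and firewall automation rather than returned strings:

```python
import ipaddress

SCALE_OUT_THRESHOLD = 60.0  # percent; the article's example trigger point

def avg_cpu(samples):
    """Aggregate per-node CPU readings into one fleet-wide metric
    (the Python analogue of a PromQL avg())."""
    return sum(samples) / len(samples)

def plan_actions(cpu_samples, source_ip, known_cidrs):
    """Toy event-driven decision step: threshold breach -> scale out,
    traffic from outside known CIDRs -> block."""
    actions = []
    if avg_cpu(cpu_samples) >= SCALE_OUT_THRESHOLD:
        actions.append("scale-out")
    ip = ipaddress.ip_address(source_ip)
    if not any(ip in ipaddress.ip_network(cidr) for cidr in known_cidrs):
        actions.append(f"block {source_ip}")
    return actions

print(plan_actions([70, 65, 72], "203.0.113.9",
                   ["10.0.0.0/8", "192.168.0.0/16"]))
# ['scale-out', 'block 203.0.113.9']
```

The design choice worth noting: thresholds are applied to *aggregated* metrics (the fleet average), not to individual raw samples, which is what keeps a single noisy node from paging you at 3 a.m.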