登录查看更多内容

IT Infrastructure Monitoring & Observability Engineer

Luis C.

Observability (Prometheus, Grafana, OpenTelemetry, Graylog) & IT Infrastructure Monitoring Engineer | Nagios | Zabbix | Golang & Ansible Fan | Certified Coach, Speaker, & Trainer by Maxwell Leadership

发布日期: 2024年3月5日

In recent days, a colleague asked me to help her create a profile for what would be a Monitoring and Observability Engineer, and this was the result.

Professional Profile: IT Infrastructure Monitoring & Observability Engineer

Professional Summary: Highly experienced IT Infrastructure Monitoring & Observability Engineer with over 5 years of expertise in designing, implementing, and managing monitoring and observability solutions for complex IT infrastructures. Proficient with leading monitoring tools such as Nagios, Zabbix, Centreon, and Icinga, and specialized in observability technologies including Prometheus, Grafana, Jaeger, and DTrace. Skilled in managing both Linux and Windows environments and knowledgeable in a variety of databases including MariaDB, MongoDB, Cassandra, PostgreSQL, Oracle, and SQL Server. Dedicated to enhancing IT infrastructure reliability, performance, and security through detailed observation and data analysis.

Technical Skills:

IT Infrastructure Monitoring: Advanced expertise in setting up and managing Nagios, Zabbix, Centreon, and Icinga for comprehensive monitoring of IT infrastructures, including servers, networks, and critical services.
Observability and Data Analysis: Expert in deploying observability solutions with Prometheus for metrics collection, Grafana for data visualization, Jaeger for distributed tracing, and DTrace for real-time dynamic system analysis.
Cloud and Container Technologies: Proficient in monitoring and observability in cloud environments (AWS, Azure, GCP) and container technologies (Kubernetes, Docker), utilizing tools like Amazon CloudWatch, Azure Monitor, and Google Operations.
Automation and Infrastructure as Code (IaC): Experience in automating monitoring solution deployments using Ansible, Terraform, or CloudFormation, ensuring scalable and efficient infrastructure management.
Log and Event Analysis: Knowledge in log aggregation and analysis tools such as ELK Stack (Elasticsearch, Logstash, Kibana) or Splunk, adding critical dimension to observability practices.
Operating System Management: Competent in Linux and Windows system administration, including task automation and system performance optimization.
Database Administration: Advanced knowledge in managing relational and NoSQL databases, including MariaDB, MongoDB, Cassandra, PostgreSQL, Oracle, and SQL Server, ensuring performance, availability, and security.

Oracle Cloud 1 年前

Value Stories- CLEAR SKIES AHEAD

Redington X 1 年前

10 Questions to Guide Your Oracle Strategy:…

Steven Kaplan 1 个月前

Roles and Responsibilities:

Design and implement monitoring and observability solutions across the IT infrastructure and applications, ensuring comprehensive and real-time visibility into system status and performance.
Configure and maintain infrastructure monitoring tools (Nagios, Zabbix, Centreon, Icinga) to detect and alert on performance, availability, and security issues.
Deploy and manage observability solutions using Prometheus and Grafana, creating custom dashboards for key metrics visualization and data-driven decision-making.
Utilize Jaeger and DTrace for detailed tracking and analysis of performance issues and errors in applications and operating systems.
Implement cloud and container monitoring strategies, ensuring integration and visibility across dynamic and scalable environments.
Develop and manage Infrastructure as Code (IaC) for deploying and managing monitoring and observability tools, ensuring consistent and reproducible practices across environments.
Proactively analyze and predict trends using AI/ML to identify potential issues and prevent incidents before they occur.
Ensure monitoring tools and practices comply with industry security standards and regulations, including patch and vulnerability management.
Collaborate with development and operations teams to incorporate monitoring and observability practices into the software development lifecycle, promoting a DevOps culture.
Conduct proactive and post-mortem analyses to identify root causes of incidents and develop solutions to prevent future problems.

What do you think? Please leave your comments.

要查看或添加评论，请登录

Building your own port scanner in Go ;-)

2024年11月14日
Step-by-Step Guide: Monitoring MariaDB with GoLang

2024年11月2日
?No Usas LinkedIn? Esto es lo que Estás Perdiendo en tu Vida Profesional

2024年10月21日
Crecimiento Sin Límites: Aplicando las 15 Leyes Indispensables del Crecimiento de John C. Maxwell

2024年10月10日
Crecimiento Sin Límites: Aplicando las 15 Leyes Indispensables del Crecimiento de John C. Maxwell

2024年10月10日
Ya está aquí!

2024年10月3日
Workshop Odoo Recursos Humanos

2024年9月17日
Workshop Odoo Recursos Humanos

2024年9月17日
Workshop: Odoo Contabilidad

2024年9月5日
Shadows and Light: The Journey of an Unborn Being

2024年8月17日

查看全部

IT Infrastructure Monitoring & Observability Engineer

Luis C.

Observability (Prometheus, Grafana, OpenTelemetry, Graylog) & IT Infrastructure Monitoring Engineer | Nagios | Zabbix | Golang & Ansible Fan | Certified Coach, Speaker, & Trainer by Maxwell Leadership

Professional Profile: IT Infrastructure Monitoring & Observability Engineer

领英推荐

更多精彩文章

社区洞察

其他会员也浏览了

Infinidat Announces Tight Integration with Kasten by Veeam for Container-Based Workload Backup and Veeam Data Platform v12 Certification

Managing Terraform State

Case Study: Modernizing Infrastructure Applications for Telecommunications Services

Transitioning from Oracle SiteGuard to Oracle Cloud Infrastructure (OCI) Full Stack Disaster Recovery (DR)

Ensuring Your Oracle Database Stays Up and Running: A Guide to High Availability and Disaster Recovery

Infrastructure automation: why should enterprises embrace it?

The Role of Ansible in Streamlining and Scaling Your Infrastructure

Oracle Virtual Machine aka Oracle VM

IT Infrastructure Management Tools Market May Set an Epic Growth Story | IBM (United States), SolarWinds (United States), Splunk (United States)

Professional Profile: IT Infrastructure Monitoring & Observability Engineer

领英推荐

Building your own port scanner in Go ;-)

2024年11月14日

Step-by-Step Guide: Monitoring MariaDB with GoLang

2024年11月2日

?No Usas LinkedIn? Esto es lo que Estás Perdiendo en tu Vida Profesional

2024年10月21日

Crecimiento Sin Límites: Aplicando las 15 Leyes Indispensables del Crecimiento de John C. Maxwell

2024年10月10日

Crecimiento Sin Límites: Aplicando las 15 Leyes Indispensables del Crecimiento de John C. Maxwell

2024年10月10日

Ya está aquí!

2024年10月3日

Workshop Odoo Recursos Humanos

2024年9月17日

Workshop Odoo Recursos Humanos

2024年9月17日

Workshop: Odoo Contabilidad

2024年9月5日

Shadows and Light: The Journey of an Unborn Being

2024年8月17日

社区洞察

其他会员也浏览了

Infinidat Announces Tight Integration with Kasten by Veeam for Container-Based Workload Backup and Veeam Data Platform v12 Certification

Managing Terraform State

Case Study: Modernizing Infrastructure Applications for Telecommunications Services

Transitioning from Oracle SiteGuard to Oracle Cloud Infrastructure (OCI) Full Stack Disaster Recovery (DR)

Ensuring Your Oracle Database Stays Up and Running: A Guide to High Availability and Disaster Recovery

Infrastructure automation: why should enterprises embrace it?

The Role of Ansible in Streamlining and Scaling Your Infrastructure

Oracle Virtual Machine aka Oracle VM

IT Infrastructure Management Tools Market May Set an Epic Growth Story | IBM (United States), SolarWinds (United States), Splunk (United States)