登录查看更多内容

Enhancing the Availability and Reliability of ISP Network Infrastructures

Leonardo Furtado

Senior Network Development Engineer

发布日期: 2022年2月6日

Introduction

This article presents relevant thoughts about the correct sizing of one of the most primary "functional towers" of an ISP's networking infrastructure. Among the leading key business performance indicators (KPI) of an ISP, two are viewed as primaries: "Performance" and "Availability." The reasons for these determinations are pretty simple, as we analyze the effects of each case as follows:

What good is a high-speed Internet connection, but whose service is unavailable to the customer reasonably or very often?
What good is an Internet service that is very available or with an unprecedented uptime or availability rate if its performance is poor and delivers content experience under the desired SLA (low throughput, E2E latency, jitter, or packet loss) for its subscribers?

As we can see, these two indicators are the basic ones in networking infrastructures, as subscribers/customers can easily perceive those when using data, video, or voice services.

This article focuses more closely on the relationship between the Availability and Reliability indicators ("functional towers" of a technical design). I will comment more closely on other areas such as Performance and others in future articles.

Definition of the Availability Concept and other Peripheral Functional Towers

I mainly treat Availability as a "functional tower," as it is possible to identify, typify, categorize, and merge sets of technical specifications and processes into this concept, leading to much better network uptime altogether. This strategy includes the design of proper physical, electromechanical, and logical specifications (i.e., hardware in redundant configuration; power and cooling requirements; reliability block diagram, clusters of devices, network links, etc.). Then blend it all with software-level systemic approaches that include services, resources, or facilities, such as protocols and the sort, to increase the desired state of the Availability indicator. Improving this indicator means a whole new thing regarding customer satisfaction, competitiveness, and infrastructure costs!

Availability aims to provide the obvious: ideally, whenever a user (customer or subscriber, whatever you prefer to call them) wants to use the contracted product or service, it is available, ready for whatever the interests of that user. On the other hand, whenever the user tries to access something online and it is unavailable, its downtime frequency characterizes it, and we all know what happens next. Networks are not immune to failures, so we need to predict and anticipate these incidents so that service restoration times meet users' expectations and tolerances.

The Availability indicator is affected by combining two other functional towers that participate in the same proposed mission, supporting each other, which satisfies users with their contracted services. These disciplines would be Reliability and Resilience, respectively.

When studying the concepts of computer network reliability, we can identify issues such as manufacturing quality of networking gear, and the presence or lack of specialized technologies, both physical and logical, in addition to other mechanisms, peripheral resources, and processes that participate in aggregating the intended redundancy + reliability + resiliency = availability set. In my personal view, the reliability of a network by itself is also an indicator of a functional tower of its own. Still, it adds up positively to the overall (and desired) state of network availability.

Resilience, in turn, is related to how a device and the network as a whole react in situations where infrastructure failures (link, devices) occur, whether these failures are equipment components or incidents of logical context.

I particularly like to treat these three as follows: the intended beacon indicator is Availability, which can be calculated and improved by sets of technological specifications derived from the principles of Reliability and Resilience.

The Challenges of Providers in the Question of Network Availability

Internet Service Providers (ISP) need to understand the fundamentals of redundancy + reliability + resilience = availability with absolute clarity so that their infrastructures can be modified to meet or exceed their customers' expectations and desired service level aggreements. Among the many challenges, we can list some situations or truths on the subject:

Device availability is not directly related to the overall availability of a network. Those two are different things!
The availability of a device and its due redundancy are often in conflict, as some "simpler" devices may be more reliable. In contrast, some more reliable devices tend not to be simple to deploy and integrate!
High costs are an eternal question and need to be well balanced.

How much availability does your network infrastructure need, and how much are you willing to pay for it?

One thing that may not seem obvious to many individuals and companies: way too much redundancy can be terrible because, in addition to significantly increasing the costs of the project and the infrastructure as a whole, it makes the logical functions of the network equally way too complex. Think about it! And it can even become a problem to your intended network uptime and operational management goals.

Perhaps one of the biggest challenges here is designing a redundant, reliable, and resilient infrastructure with the desired/ideal Availability indicator or state. The choice of quantity or quality (of redundancy) in a network cannot be treated as "how do you like your steak done?" (rare, medium, well-done), analogies here; that is, it is not exactly a matter of personal choice. Infrastructure projects aiming at better availability need to have confident and ideal physical and logical redundancy standards, which cannot be too scarce or excessive. The costs of adopting these approaches must be understood and compatible with the business missions and strived outcomes. And the same rules must apply to the financial reality of the network operator (you or your company).

领英推荐

Benefits of SD-WAN Solutions for Enterprises

Vi Business India 11 个月前

Software-Defined Networking: Revolutionizing Modern…

Outworks Solutions Private Ltd. 2 个月前

MPLS-TP or IP/MPLS Networking Protocol: Which Is Best…

Belden Inc. 1 年前

Here's my first tip:

Determine DOWNTIME COSTS first, then determine and balance the Availability costs. It will be easier for you to accept the harsh reality of the investments required when you clearly understand the business impacts of a failure, whether it's a simple low-spectrum annoyance failure or a catastrophe on your network.

How much are you willing to lose financially with a failure in your network?
How much are you willing to lose in terms of customer base, market share, reputation, and the like due to disasters in your network?
How much are you willing to invest to properly mitigate the many risks that can cause trauma to your business, from small but inconvenient unavailability to major headaches with massive failures and impacts?

Practice precisely the above three questions before you even try to design your next infrastructure project!

Matching Downtime Costs versus Availability Costs

Above all, seek to identify and quantify the following impacts on your business.

Immediate impacts:

Loss of revenue
Unexpected and undesired corrective maintenance or repair costs
Contracted SLA penalties
Customer dissatisfaction
Delays in internal and external projects
Negative distractions in business

Long-term impacts:

Damage to the company/ISP reputation
Customer churn; subscriber evasion
Undesirable "favoring" to direct competitors
Legal actions against your business
Loss of trust, both from ISP employees/collaborators, the market, and customers

Check out the unfolding of this story in the full version of this article, available on the Wiki do Brasil Peering Forum (BPF), written in Brazilian Portuguese:

https://wiki.brasilpeeringforum.org/w/Aprimorando_a_Disponibilidade_da_rede_do_ISP

In this full version of the article, I present some critical fundamentals about MTBF, MTTR, MDT, concepts of parallel and serial physical and logical redundancy, technological facilities (protocols, services), and many ideas related to this subject. Ultimately, where all this falls in and affects or adds positively to the availability metrics.

Let me know your thoughts about this subject!

Until next time!

Leonardo Furtado

Leonardo Furtado - Newsletter

5,742 位关注者

Hiracelmo Neto

Network Engineer

2 年

O Mestre

1 次回应

José Carlos Borges de Couto

3 年

Very good!!!

1 次回应

Alexandre Silva N.

IT Network Analyst and Consultant / Consultor e Analista de Redes e TIC

3 年

Leo, esse material é OURO PURO! Obrigado por compartilhar!

3 次回应

Aislam Souza

3 年

Very good.

1 次回应

查看更多评论

要查看或添加评论，请登录

Leonardo Furtado的更多文章

Mastering Design Review Meetings for Network Engineering: Ensuring Efficiency and Value

2024年1月8日

Mastering Design Review Meetings for Network Engineering: Ensuring Efficiency and Value

What is a design review meeting? Network engineering and design review meetings serve to present an ideal design…
Beyond Earthly Limits: Crafting the Blueprint for Tomorrow's Computer Network Technologies

2023年12月14日

Beyond Earthly Limits: Crafting the Blueprint for Tomorrow's Computer Network Technologies

And so, our story begins… "Once upon a time, in a world not so different from our own, humanity was on the verge of a…

6 条评论
Elevating Your Network Engineering Game: A Deep Dive into Today's Essential Skills and Strategies

2023年12月6日

Elevating Your Network Engineering Game: A Deep Dive into Today's Essential Skills and Strategies

This article delves into the essential elements of modern network engineering, such as deployability, scaling…

16 条评论
Balancing Act: Network Automation Using Pure Python vs. Established Libraries - A Comprehensive Analysis of Pros and Cons

2023年11月1日

Balancing Act: Network Automation Using Pure Python vs. Established Libraries - A Comprehensive Analysis of Pros and Cons

In the constantly evolving scenario of network automation, the availability of powerful libraries and tools, such as…

13 条评论
Unlock Your Full Potential with the Power of Leadership Principles!

2023年6月10日

Unlock Your Full Potential with the Power of Leadership Principles!

A friendly and warm welcome to this post. If you find yourself kind of lost in the middle of so much hustle and bustle…

3 条评论
O Poder do Subconsciente na Carreira: Como Superar a Autossabotagem e Se Tornar um Profissional Competente e Motivado

2023年5月14日

O Poder do Subconsciente na Carreira: Como Superar a Autossabotagem e Se Tornar um Profissional Competente e Motivado

A mente humana é uma máquina incrível em praticamente todos os aspectos possíveis e imagináveis! Por este motivo, é…

41 条评论
Dicas para o desenvolvimento de conhecimentos e habilidades técnicas

2023年4月16日

Dicas para o desenvolvimento de conhecimentos e habilidades técnicas

Saiba priorizar os seus esfor?os para o seu desenvolvimento de conhecimentos e habilidades técnicas Ao desenvolver as…

10 条评论
Dicas para evoluir em sua carreira técnica

2023年4月15日

Dicas para evoluir em sua carreira técnica

Ultimamente, tenho recebido muitos InMails no LinkedIn, principalmente de pessoas que est?o lutando para seguir em…

11 条评论
Pensando fora da caixa, e tomando decis?es acertadas nos projetos de redes

2022年7月17日

Pensando fora da caixa, e tomando decis?es acertadas nos projetos de redes

Sobre o tema My experiences, cases listed, and opinions expressed are solely my own and do not express the views…

7 条评论
Como a "Sua História" poderá transformá-lo em um profissional excepcional

2022年3月31日

Como a "Sua História" poderá transformá-lo em um profissional excepcional

Vivemos num mundo aceleradíssimo, com coisas extremamente corridas, cotidianos de alta performance, repletos de…

31 条评论

See all articles

Enhancing the Availability and Reliability of ISP Network Infrastructures

Leonardo Furtado

Senior Network Development Engineer

Introduction

Definition of the Availability Concept and other Peripheral Functional Towers

The Challenges of Providers in the Question of Network Availability

How much availability does your network infrastructure need, and how much are you willing to pay for it?

领英推荐

Matching Downtime Costs versus Availability Costs

Leonardo Furtado - Newsletter

5,742 位关注者

Leonardo Furtado的更多文章

社区洞察

其他会员也浏览了

Everything about SD-WAN

From Virtual Roads to Digital Superhighways: 5 Key benefits of SD-WAN for businesses

The Impact of AI on Network Infrastructure

Managed switches enable organizations to configure and monitor network settings and handle administrative tasks from a centralized console.

Unlock network potential with QSFP-100G-CWDM4-ISP integration

Real-World Examples of Cisco SDN Solutions in Action

The Perfect Solution for Reliable Network Connectivity

Understanding Single-Homed and Multi-Homed Network Designs

Understanding Network Switches: A Guide for IT Professionals

Design of MPLS Network for Rail System

Introduction

Definition of the Availability Concept and other Peripheral Functional Towers

The Challenges of Providers in the Question of Network Availability

How much availability does your network infrastructure need, and how much are you willing to pay for it?

领英推荐

Matching Downtime Costs versus Availability Costs

Leonardo Furtado - Newsletter

5,742 位关注者

Leonardo Furtado的更多文章

Mastering Design Review Meetings for Network Engineering: Ensuring Efficiency and Value

Beyond Earthly Limits: Crafting the Blueprint for Tomorrow's Computer Network Technologies

Elevating Your Network Engineering Game: A Deep Dive into Today's Essential Skills and Strategies

Balancing Act: Network Automation Using Pure Python vs. Established Libraries - A Comprehensive Analysis of Pros and Cons

Unlock Your Full Potential with the Power of Leadership Principles!

O Poder do Subconsciente na Carreira: Como Superar a Autossabotagem e Se Tornar um Profissional Competente e Motivado

Dicas para o desenvolvimento de conhecimentos e habilidades técnicas

Dicas para evoluir em sua carreira técnica

Pensando fora da caixa, e tomando decis?es acertadas nos projetos de redes

Como a "Sua História" poderá transformá-lo em um profissional excepcional

社区洞察

其他会员也浏览了

Everything about SD-WAN

From Virtual Roads to Digital Superhighways: 5 Key benefits of SD-WAN for businesses

The Impact of AI on Network Infrastructure

Managed switches enable organizations to configure and monitor network settings and handle administrative tasks from a centralized console.

Unlock network potential with QSFP-100G-CWDM4-ISP integration

Real-World Examples of Cisco SDN Solutions in Action

The Perfect Solution for Reliable Network Connectivity

Understanding Single-Homed and Multi-Homed Network Designs

Understanding Network Switches: A Guide for IT Professionals

Design of MPLS Network for Rail System