The potential of SONiC, an open source network operating system and unleashing the power of Dual ToR (Active/Active)
Ravi Kishore Chitakani
Hyperscale Data Center Networking Expert | RDMA | AI/ML Networking | Network Automation | Cisco SDN | Cloud Security | SONiC | VXLAN EVPN
In the evolving landscape of networking technology, traditional routing and switching devices were once confined to tightly integrated hardware and software systems. Vendors offered closed-source proprietary stacks, restricting network operators from implementing tailored features and stalling innovation. This conventional model proved costly, time-consuming, and inherently unscalable, necessitating vendor intervention for any device modifications.
Recognizing the limitations of this approach, the industry pivoted towards a more flexible paradigm. Enter the era of white-box switches and versatile Network Operating Systems (NOSs) designed to accommodate multiple vendors and diverse Application Specific Integrated Circuits (ASICs). This transformative model, known as "disaggregated networking," represents a fundamental shift. Here, software and hardware are decoupled, allowing switching, such as Broadcom, to seamlessly integrate with various NOS platforms, including the innovative SONiC (Software for Open Networking in the Cloud).
This shift not only heralds a new era of customization and innovation in the networking sphere but also empowers network operators to adapt swiftly to changing demands without relying on vendor constraints. In this article, we explore the concept of disaggregated networking with SONiC architecture, its benefits, and industry implications. Highlighting the significance of Active-Active dual Top of Rack (ToR) setup, we discuss how they promote load balancing and continuous operation. This piece simplifies the layers of innovation, customization, and resilience shaping the future of networking.
SONiC (The Software for Open Networking in the Cloud):
Disaggregating the hardware from software in whitebox switches has driven the constant development and maintenance of open-source Network Operating Systems (NOS). SONiC is a Linux-based open-source NOS initially developed by Microsoft and the Open Compute Project (OCP), and currently hosted by the Linux Foundation. SONiC offers a comprehensive set of network functionalities, including BGP and Remote Direct Memory Access (RDMA), and is compatible with switches from various vendors and ASICs.
SONiC comprises several modules, housed either in Docker containers or the Linux-host system itself. Docker containers are lightweight, self-sufficient packages containing the necessary elements for running an application, including code, runtime, system tools, libraries, and settings. In SONiC's high-level architecture, it operates within the user space. Each module within SONiC has a specific role, such as managing DHCP requests, handling Link Layer Discovery Protocol (LLDP) functions, providing a Command Line Interface (CLI) and system configuration options, and running FRR or Quagga routing stacks.
Main Components:
Switch Abstraction Interface (SAI):
The Switch Abstraction Interface (SAI) serves as the backbone of the SONiC dataplane configuration. This open-source toolkit provides a vital bridge between SONiC's generic functionalities and the intricate hardware-specific details of various networking platforms. Acting as an isolation layer, SAI ensures seamless communication between SONiC and diverse switch components, enabling SONiC to function effectively across a wide array of hardware.
SAI's significance lies in its establishment of a standardized interface dedicated to controlling switching Application-Specific Integrated Circuits (ASICs). This meticulously crafted interface offers a vendor-independent approach, allowing the management of diverse switching entities, including hardware ASICs, Network Processing Units (NPUs), and software switches, in a uniform manner. By providing a consistent method of interaction, SAI simplifies the complexity associated with managing different types of switches.
Furthermore, SAI's architecture goes beyond standardization. It empowers network engineers by enabling the exposure of vendor-specific functionalities and extensions. This flexibility permits the customization of networking solutions according to specific requirements. As a result, SAI not only ensures interoperability and operational efficiency but also promotes innovation by allowing tailored integrations and optimizations.
领英推荐
SAI acts as a pivotal component, enabling SONiC to seamlessly integrate with diverse switch platforms. Its standardized approach, vendor independence, and customizable nature not only enhance SONiC's functionality but also foster innovation within the networking ecosystem.
High Availability with Active-Active Dual ToR:
ToR (Top of Rack) switch is a single point of failure for all the rack of servers. SONiC OS introduces dual Top of Rack (ToR) support to boost network reliability by minimizing the risk of a single point of failure. In the event of a failure in one ToR switch or its associated link, the second ToR switch acts as a backup, maintaining uninterrupted connectivity.
.Active-Active dual ToR support with a single NIC ensures load balancing, optimized bandwidth utilization, and continuous operation even if one ToR switch fails. Each server will have a NIC connected to 2 ToRs with 2 100Gbps DAC cables.
Server Functionalities:
The server NIC plays a critical role in managing traffic flow ensuring both high availability and efficient communication between ToR devices and applications running on the server host.
In summary, the server NIC acts as a pivotal component in this network architecture, ensuring robust southbound and northbound traffic handling, dynamic control through gRPC communication, and intelligent replication of critical control messages to both ToRs.
ToR Functionalities:
The evolving landscape of networking technology has seen a transformative shift from traditional closed-source systems to the realm of disaggregated networking. This paradigm, marked by the decoupling of software and hardware, has opened the doors to unparalleled customization and innovation within the industry. At the heart of this transformation stands SONiC (Software for Open Networking in the Cloud), a Linux-based open-source Network Operating System developed by Microsoft and the Open Compute Project (OCP). In dual ToR design, the server NIC takes center stage, managing traffic flow, enabling dynamic control through gRPC communication, and intelligently replicating essential control messages to both ToRs. This approach not only fosters innovation and adaptability but also empowers network operators to meet changing demands seamlessly, breaking free from vendor constraints and shaping the future of networking with unprecedented flexibility, scalability, and reliability.