GFTs, Hyperscaler Magic Pixie Dust
Scott Schweitzer, CISSP
Positioning Achronix FPGAs as 400GbE DPU Leaders
Recent experience has shown that Hyperscalers are gaga about Generic Flow Tables (GFT) because they appreciate the importance of improving latency by fast-tracking networking flows and understanding the hidden value of flow-tracking data. Then there's the underly element of control. Depending on your networking requirements, your need for a "Lite" versus a "Full-featured" set of GFTs can vary. On the surface, the difference between a GFT record size for a lite table versus a full-featured one could be on the order of 64 bytes, not much, honestly, but this is a problem of scale, and it will quickly translate into Gigabytes. For example, vendors commonly requested that GFTs track 16 million active flows per pair of 100 GbE links. Today, we're designing SmartNICs for 400 GbE, so tracking has to be scaled to support 64 million active flows. These numbers are table stakes and commonly accepted throughout the industry. Multiply that by an additional 64 bytes per flow, and we're talking about another four Gigabytes of additional very-fast memory to house the GFTs. So what constitutes a lite versus a Full-featured table?
Like a traffic cop at a busy intersection, a GFT with lite table records, by definition, associates a specific flow to an action. The sole function of this table is extreme performance-based access control and routing. The GFT can quickly and easily fast-track subsequent packets for known flows by rapidly dropping, passing, or modifying them. Fast-tracking steers a packet to a direct memory access (DMA) ring for rapid transmission into host memory. Flows are defined by their five-tuple, a set of five values grouped. Typically this is the source and destination, address and port, and protocol for a given packet. Assuming the GFT supports IPv4 and IPv6, a lite GFT record could be packed into less than 64 bytes. Some might argue that mathematic tricks could further reduce the record size by sacrificing some accuracy. Still, those tricks cost clock cycles for each packet transiting the GFT pipeline stage, and time is one of the most precious assets when fast-tracking packets. Also, knowing how computers store and retrieve data, a case could be made to retain a record size of 64 bytes, as memory accesses using smaller, especially odder sizes, could prove significantly less efficient. A lite GFT record of 32 bytes would be ideal, but that is impractical as each of the two IPv6 addresses are 16 bytes, and we still have to include the ports, protocol, and action.??
By contrast, a full-featured GFT table record could retain and report all the flow data one would typically collect from enterprise-class network switches. This data, especially in a highly virtualized environment at the server level, can be of enormous value, especially to hyperscalers. Bumping the GFT record size up to 128 bytes is enough to provide this level of flow data reporting. Furthermore, if this flow table is stored in a high-speed memory subsystem like GDDR6, performance further benefits from these records being in multiples of 32 bytes.
So what are the benefits of a full-featured GFT with 128-byte flow records to hyperscalers or large enterprises:
领英推荐
Throughout this article, I talk about GFTs in the plural because some implementations might want to leverage more than one GFT within a SmartNIC—for example, ingress and egress GFTs with different attributes and features. There are several other use cases for multiple levels of GFTs, but for today, this is as far as we can go. Next, we'll discuss the access control use cases around a well-structured GFT. ???
If you're reading this and found the above interesting but are not planning to attend the?SmartNIC Summit?in San Jose in mid-June, you should rethink that decision. We will have a fantastic collection of professionals in the field of SmartNICs there holding talks, panels, and exhibiting the latest in technology. If this is your jam, I'd strongly recommend?registering?to attend.
?
Vice President - Dispersive
1 年Not to mention tho, that most flows are now "to the cloud" which means that blocking based on IP information has become of decreasing value. e.g. Multi-site IPs and CDNs are transits for so many domains that blocking at IP layer now less relevant and effectual..
FPGA Engineer at Major defense contractor
1 年looking forward to it