登录查看更多内容

CDN Cache and Machine Learning

Andrew Antonopoulos

Senior Solutions Architect at Sony Professional Solutions Europe

发布日期: 2020年6月17日

The majority of the Internet’s content is delivered by global caching networks, also known as Content Delivery Networks (CDNs). CDNs enhance performance by caching content in servers located in user proximity. This proximity enables fast content delivery but requires CDNs to operate servers in hundreds of networks around the world.

A major operational cost factor is the bandwidth cost between CDN caching servers and data centers storing the original copies of web content. Hence, CDNs aim to maximize the fraction of bytes served locally from the cache, which is also known as the byte hit ratio (BHR).

To improve efficiency, CDNs seek to remove their dependence on manual parameter tuning. Fortunately, recent advances in reinforcement learning (RL) promise a general approach to systems that “manage resources on their own”.

Existing proposals for caching rely on “model-free” RL where the system starts without any knowledge (or bias) about the task at hand. Such systems learn to make decisions from experience interacting with its environment, where good behavior is reinforced via a reward function. While model-free RL is very popular, recent discussions in the RL community highlight three key challenges

First, millions of learning samples are typically required, which leads to slow reaction times in dynamic environments
Second, overfitting to past samples happens frequently, which complicates dealing with unexpected situations and can lead to unintended behaviors
Third, debugging and maintenance is complicated due to high sensitivity to hyperparameters and random seeds

For Internet-facing systems, these challenges are a significant roadblock. For example, CDN servers face quickly changing conditions that include unexpected (or even adversarial) traffic patterns. CDN server also need to be easily maintainable while serving requests at 40+ Gbit/s. While more sophisticated learning techniques, such as model-based RL promise faster and more robust learning rates, they typically lead to significantly higher complexity and computational overhead.

领英推荐

Tracking Job IDs: Enhancing Observability and…

VAST Data 2 周前

IFIP Insights August

International Federation for Information Processing (IFIP) 6 个月前

openEuler Monthly Bulletin – August

openEuler 5 个月前

For this reason, a lightweight and robust machine learning will be necessary, read more in this paper from Daniel S. Berger Carnegie Mellon University:

#Machinelearning #datascience #cdn #artificialintelligence

要查看或添加评论，请登录

Andrew Antonopoulos的更多文章

Sustainable ML - Monitor Power Consumption

2024年5月25日

Sustainable ML - Monitor Power Consumption

Training models will also consider the power consumption of the hardware. The following paper compares the most common…
TensorFlow Serving API & gRPC

2024年5月25日

TensorFlow Serving API & gRPC

To serve models for production applications, one can use REST API or gRPC. gRPC is a high-performance, binary, and…
Blockchain & Web3 Technology

2024年5月22日

Blockchain & Web3 Technology

Blockchain is a technology that securely stores transactional information by linking blocks together in a specific…
NVIDIA Mixed Precision - Loss & Accuracy - Part 2

2024年5月20日

NVIDIA Mixed Precision - Loss & Accuracy - Part 2

Part 1 explained how Nvidia's mixed precision can help reduce power consumption. However, we also need to consider…
NVIDIA Mixed Precision & Power Consumption - Part 1

2024年5月14日

NVIDIA Mixed Precision & Power Consumption - Part 1

Deep Learning has enabled progress in many different applications and can be used for developing models for…
Nvidia GPU & TensorFlow for ML in Ubuntu 24.04 LTS

2024年5月13日

Nvidia GPU & TensorFlow for ML in Ubuntu 24.04 LTS

Tensorflow announced that it would stop supporting GPUs for Windows. The latest support version was 2.

5 条评论
FreeBSD 13 & TCP BBR Congestion Control

2022年4月29日

FreeBSD 13 & TCP BBR Congestion Control

Finally TCP BBR is available for FreeBSD new release 13.x.

2 条评论
Kubernetes - Open Source Tools

2020年6月17日

Kubernetes - Open Source Tools

Kubernetes (also known as k8s or “kube”) is a very popular container orchestration platform that automates many of the…
Cache-Control Headers

2020年6月17日

Cache-Control Headers

The performance of content that is available via web sites and applications can be significantly improved by reusing…
OTT & Mobile Battle in Africa

2019年9月5日

OTT & Mobile Battle in Africa

OTT and specially SVOD is growing in Africa. Recently big OTT providers such as Netflix, muvi, Showmax, iFlix, MTN and…

See all articles

CDN Cache and Machine Learning

Andrew Antonopoulos

Senior Solutions Architect at Sony Professional Solutions Europe

领英推荐

Andrew Antonopoulos的更多文章

社区洞察

其他会员也浏览了

How We at Avesha Elastic GPU Service(EGS) Enable Real-Time Data Fluidity with Inference

Gift-Wrapping Your Data: Unwrap the Benefits of Katonic MLOps Platform This Holiday Season

Simplyblock Byte - Issue 2024-08/1

What is parallel processing?

Gooxi 8-GPU Server Sets New Benchmark, Boosting AI Training and Inference Performance by 35%

Introducing Microsoft 'Singularity' AI Infrastructure Service

Breakdown the BMC: Felafax

KubeCon + CloudNativeCon Europe 2024 list of sessions with YouTube links

Picking Teams in AI

OpenAI’s Stargate Project: A $500 Billion Leap in AI Infrastructure Development

领英推荐

Andrew Antonopoulos的更多文章

Sustainable ML - Monitor Power Consumption

TensorFlow Serving API & gRPC

Blockchain & Web3 Technology

NVIDIA Mixed Precision - Loss & Accuracy - Part 2

NVIDIA Mixed Precision & Power Consumption - Part 1

Nvidia GPU & TensorFlow for ML in Ubuntu 24.04 LTS

FreeBSD 13 & TCP BBR Congestion Control

Kubernetes - Open Source Tools

Cache-Control Headers

OTT & Mobile Battle in Africa

社区洞察

其他会员也浏览了

How We at Avesha Elastic GPU Service(EGS) Enable Real-Time Data Fluidity with Inference

Gift-Wrapping Your Data: Unwrap the Benefits of Katonic MLOps Platform This Holiday Season

Simplyblock Byte - Issue 2024-08/1

What is parallel processing?

Gooxi 8-GPU Server Sets New Benchmark, Boosting AI Training and Inference Performance by 35%

Introducing Microsoft 'Singularity' AI Infrastructure Service

Breakdown the BMC: Felafax

KubeCon + CloudNativeCon Europe 2024 list of sessions with YouTube links

Picking Teams in AI

OpenAI’s Stargate Project: A $500 Billion Leap in AI Infrastructure Development