Neural Networks in Netflix

What are Neural Networks?

Neural networks are a set of algorithms, loosely modeled on the human brain, that are designed to recognize patterns. They interpret data through a form of machine perception by labeling or clustering raw input data.

Let’s take a moment to consider the human brain. Made up of a network of neurons, the brain is a very complex structure.

It’s capable of quickly assessing and understanding the context of numerous different situations. Computers struggle to react to situations in a similar way. Artificial Neural Networks are a way of overcoming this limitation.

Applications in Deep Learning and Artificial Intelligence

Artificial neural networks (ANNs) are one of the main tools used in machine learning and the foundation of deep learning, so they play an increasingly important role in the development of artificial intelligence. Much of their rise is due to backpropagation, the technique that allows a network's hidden layers to adapt when the output doesn't match the one originally intended. The development of deep learning has pushed ANNs further: deep networks are made up of multiple layers, and different layers are able to analyze and extract different features. This makes the system more versatile, allowing it to identify new data or images, perform unsupervised learning, and take on more complex tasks.
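To make backpropagation concrete, here is a minimal sketch of a two-layer network trained on the XOR problem with NumPy. Everything here (the hidden layer size, the learning rate, the sigmoid activation) is an arbitrary choice for illustration; the point is only how the output error is propagated back so that the hidden layer can adapt.

```python
import numpy as np

# Toy dataset: XOR, a pattern a single layer cannot separate.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.5, size=(2, 8))   # input -> hidden weights
W2 = rng.normal(scale=0.5, size=(8, 1))   # hidden -> output weights

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 1.0
for step in range(10000):
    # Forward pass
    h = sigmoid(X @ W1)          # hidden activations
    out = sigmoid(h @ W2)        # network prediction

    # Backward pass: propagate the output error back through both layers
    err_out = (out - y) * out * (1 - out)
    err_hid = (err_out @ W2.T) * h * (1 - h)
    W2 -= lr * h.T @ err_out
    W1 -= lr * X.T @ err_hid

# Predictions should approach [0, 1, 1, 0]
print(np.round(sigmoid(sigmoid(X @ W1) @ W2), 2))
```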

Neural Networks in Netflix

Netflix is the world’s leading Internet television network, with over 160 million members in over 190 countries who enjoy hundreds of millions of hours of content per day, including original series, documentaries and feature films. We invest heavily in machine learning to continually improve our member experience and optimize the Netflix service end-to-end. As researchers, we innovate using machine learning in many areas, where we prototype, design, implement, evaluate, and productionize models and algorithms through both offline experiments and online A/B testing.

Machine learning impacts many exciting areas throughout our company. Historically, personalization has been the most well-known area, where machine learning powers our recommendation algorithms. We’re also using machine learning to help shape our catalog of movies and TV shows by learning characteristics that make content successful. We use it to optimize the production of original movies and TV shows in Netflix’s rapidly growing studio. Machine learning also enables us to optimize video and audio encoding, adaptive bitrate selection, and our in-house Content Delivery Network that accounts for more than a third of North American internet traffic. It also powers our advertising spend, channel mix, and advertising creative so that we can find new members who will enjoy Netflix.

Using machine learning pervasively across Netflix brings many new challenges where we need to push forward the state-of-the-art. This means coming up with new ideas and testing them out, be it new models and algorithms or improvements to existing ones, better metrics or evaluation methodologies, and addressing the challenges of scale. Our research spans many different algorithmic approaches including causal modeling, bandits, reinforcement learning, ensembles, neural networks, probabilistic graphical models, and matrix factorization.

Netflix streams to over 117M members worldwide. Well over half of those members live outside the United States, where there is a great opportunity to grow and bring Netflix to more consumers. Providing a quality streaming experience for this global audience is an immense technical challenge. A large portion of this is the engineering effort required to install and maintain servers throughout the world, as well as the algorithms for streaming content from those servers to our subscribers’ devices. As we expand rapidly to audiences with diverse viewing behavior, operating on networks and devices with widely varying capabilities, a “one size fits all” solution for streaming video becomes increasingly suboptimal. For example:

  • Viewing/browsing behavior on mobile devices is different than on Smart TVs
  • Cellular networks may be more volatile and unstable than fixed broadband networks
  • Networks in some markets may experience higher degrees of congestion
  • Different device groups have different capabilities and internet connection fidelity due to hardware differences

Network quality characterization and prediction

Network quality is difficult to characterize and predict. While the average bandwidth and round trip time supported by a network are well-known indicators of network quality, other characteristics such as stability and predictability make a big difference when it comes to video streaming. A richer characterization of network quality would prove useful for analyzing networks (for targeting/analyzing product improvements), determining initial video quality and/or adapting video quality throughout playback (more on that below).
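As a rough illustration of what a richer characterization could look like, the sketch below computes a few summary statistics of a throughput trace that capture stability as well as level. The feature names, the 1.5 Mbps threshold, and the synthetic trace are all invented for the example; they are not Netflix's actual metrics.

```python
import numpy as np

def characterize_throughput(kbps: np.ndarray) -> dict:
    """Summarize a throughput trace (samples in kbps) beyond its mean.

    The specific features and the 1.5 Mbps floor are illustrative only.
    """
    diffs = np.abs(np.diff(kbps))
    return {
        "mean_kbps": float(kbps.mean()),
        "p10_kbps": float(np.percentile(kbps, 10)),            # pessimistic level
        "coef_of_variation": float(kbps.std() / kbps.mean()),   # instability
        "mean_abs_change": float(diffs.mean()),                 # short-term volatility
        "frac_below_1500": float((kbps < 1500).mean()),         # time under an HD-ish floor
    }

# Hypothetical noisy trace: roughly a 3 Mbps connection with occasional dips.
rng = np.random.default_rng(1)
trace = 3000 + 600 * rng.standard_normal(900)
trace[rng.random(900) < 0.05] *= 0.3
print(characterize_throughput(trace))
```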

Below are a few examples of network throughput measured during real viewing sessions. You can see they are quite noisy and fluctuate within a wide range. Can we predict what throughput will look like in the next 15 minutes given the last 15 minutes of data? How can we incorporate longer-term historical information about the network and device? What kind of data can we provide from the server that would allow the device to adapt optimally? Even if we cannot predict exactly when a network drop will happen (this could be due to all kinds of things, e.g. a microwave turning on or going through a tunnel while streaming from a vehicle), can we at least characterize the distribution of throughput that we expect to see given historical data?

Since we are observing these traces at scale, there is opportunity to bring to bear more complex models that combine temporal pattern recognition with various contextual indicators to make more accurate predictions of network quality.
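One simple way to frame the prediction question is to regress statistics of the next window of throughput on features of the previous window. The sketch below does this with a plain ridge regression on synthetic traces; the window sizes, features, target (a pessimistic 10th-percentile throughput), and model choice are all assumptions for illustration, and the temporal models described above would be far richer.

```python
import numpy as np
from sklearn.linear_model import Ridge

def window_features(w):
    # Features of the past window of throughput samples.
    return [w.mean(), w.std(), np.percentile(w, 10), np.abs(np.diff(w)).mean()]

# Hypothetical training data: many (past window, next window) pairs per session.
rng = np.random.default_rng(2)
sessions = [2000 + 500 * rng.standard_normal(1800)
            + 300 * np.sin(np.linspace(0, 6, 1800))
            for _ in range(200)]

X, y = [], []
for s in sessions:
    past, future = s[:900], s[900:]
    X.append(window_features(past))
    y.append(np.percentile(future, 10))   # predict a pessimistic future throughput

model = Ridge(alpha=1.0).fit(np.array(X), np.array(y))
print("Predicted p10 throughput for a new window:",
      model.predict([window_features(sessions[0][:900])])[0])
```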

[Figure: example network throughput traces measured during real viewing sessions]

One useful application of network prediction is to adapt video quality during playback, which we describe in the following section.

Video quality adaptation during playback

Movies and shows are often encoded at different video qualities to support different network and device capabilities. Adaptive streaming algorithms are responsible for adapting which video quality is streamed throughout playback based on the current network and device conditions. The figure below illustrates the setup for video quality adaptation. Can we leverage data to determine the video quality that will optimize the quality of experience? The quality of experience can be measured in several ways, including the initial amount of time spent waiting for video to play, the overall video quality experienced by the user, the number of times playback paused to load more video into the buffer (a “rebuffer”), and the amount of perceptible fluctuation in quality during playback.

[Figure: video quality adaptation setup]
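For intuition only, these metrics are often folded into a single score so that different adaptation strategies can be compared. The weights and units below are invented for the sketch and are not Netflix's actual quality-of-experience model.

```python
def qoe_score(startup_sec, avg_bitrate_kbps, rebuffer_sec, n_quality_switches):
    """Toy composite quality-of-experience score; higher is better.

    The weights are arbitrary illustrative choices: value video quality,
    penalize time spent waiting and visible quality fluctuations.
    """
    return (0.001 * avg_bitrate_kbps
            - 1.0 * startup_sec
            - 3.0 * rebuffer_sec
            - 0.5 * n_quality_switches)

# Aggressive high-quality session vs. a conservative one (hypothetical numbers):
print(qoe_score(startup_sec=2.0, avg_bitrate_kbps=3000, rebuffer_sec=4.0, n_quality_switches=6))
print(qoe_score(startup_sec=3.5, avg_bitrate_kbps=1800, rebuffer_sec=0.0, n_quality_switches=2))
```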

These metrics can trade off with one another: we can choose to be aggressive and stream very high-quality video but increase the risk of a rebuffer. Or we can choose to download more video up front and reduce the rebuffer risk at the cost of increased wait time. The feedback signal of a given decision is delayed and sparse. For example, an aggressive switch to higher quality may not have immediate repercussions, but could gradually deplete the buffer and eventually lead to a rebuffer event on some occasions. This credit assignment problem is a well-known challenge when learning optimal control algorithms, and machine learning techniques (e.g., recent advances in reinforcement learning) have great potential to tackle these issues.
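To make the credit-assignment point concrete, here is a toy tabular Q-learning sketch over a drastically simplified buffer model. The bitrate ladder, state discretization, reward, and network dynamics are all invented for illustration and bear no relation to Netflix's production adaptation algorithms; they only show how a delayed rebuffer penalty can still shape earlier bitrate decisions.

```python
import numpy as np

rng = np.random.default_rng(3)
BITRATES = [500, 1500, 3000]          # kbps, an illustrative quality ladder
CHUNK_SEC = 4.0                       # seconds of video per downloaded chunk

def step(buffer_sec, action, throughput_kbps):
    """Download one chunk at BITRATES[action]; return new buffer level and reward."""
    download_sec = BITRATES[action] * CHUNK_SEC / throughput_kbps
    rebuffer = max(0.0, download_sec - buffer_sec)
    buffer_sec = max(0.0, buffer_sec - download_sec) + CHUNK_SEC
    reward = action - 10.0 * rebuffer          # value quality, punish stalls heavily
    return min(buffer_sec, 30.0), reward

def state(buffer_sec, throughput_kbps):
    # Coarse discretization of (buffer, throughput) for a tabular policy.
    return int(buffer_sec // 10), int(min(throughput_kbps, 3999) // 1000)

Q = np.zeros((4, 4, len(BITRATES)))
for episode in range(2000):
    buf, tput = 10.0, 2000.0
    for t in range(100):
        s = state(buf, tput)
        # Epsilon-greedy action selection over bitrate choices.
        a = rng.integers(len(BITRATES)) if rng.random() < 0.1 else int(np.argmax(Q[s]))
        tput = max(200.0, tput + 300 * rng.standard_normal())   # drifting network
        buf, r = step(buf, a, tput)
        s2 = state(buf, tput)
        Q[s][a] += 0.1 * (r + 0.95 * np.max(Q[s2]) - Q[s][a])

print("Greedy bitrate choice by (buffer, throughput) bucket:")
print(np.argmax(Q, axis=-1))
```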

Predictive caching

Another area in which statistical models can improve the streaming experience is by predicting what a user will play in order to cache (part of) it on the device before the user hits play, enabling the video to start faster and/or at a higher quality. For example, we can exploit the fact that a user who has been watching a particular series is very likely to play the next unwatched episode. By combining various aspects of their viewing history together with recent user interactions and other contextual variables, one can formulate this as a supervised learning problem where we want to maximize the model’s likelihood of caching what the user actually ended up playing, while respecting constraints around resource usage coming from the cache size and available bandwidth. We have seen substantial reductions in the time spent waiting for video to start when employing predictive caching models.
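As a hedged sketch of the supervised formulation described above, the example below scores candidate titles with a classifier and then greedily fills a cache budget with the highest-scoring ones. The features, the synthetic labels, and the greedy constraint handling are invented for illustration only.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical per-(member, title) features: is it the next unwatched episode of a
# series in progress, days since the last session, hour of day, and whether it sits
# in the member's top row of recommendations.
rng = np.random.default_rng(4)
n = 5000
X = np.column_stack([
    rng.integers(0, 2, n),           # next_unwatched_episode
    rng.exponential(3.0, n),         # days_since_last_session
    rng.integers(0, 24, n),          # hour_of_day
    rng.integers(0, 2, n),           # in_top_row
])
# Synthetic labels: "did the member actually play this title next?"
logits = 2.5 * X[:, 0] - 0.3 * X[:, 1] + 1.0 * X[:, 3] - 1.5
y = (rng.random(n) < 1 / (1 + np.exp(-logits))).astype(int)

model = LogisticRegression().fit(X, y)

def fill_cache(candidates, sizes_mb, budget_mb):
    """Greedily cache the highest-probability titles within the size budget."""
    probs = model.predict_proba(candidates)[:, 1]
    chosen, used = [], 0.0
    for i in np.argsort(-probs):
        if used + sizes_mb[i] <= budget_mb:
            chosen.append(i)
            used += sizes_mb[i]
    return chosen

picked = fill_cache(X[:20], sizes_mb=np.full(20, 300.0), budget_mb=900.0)
print("Cached candidate indices:", picked)
```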

Device anomaly detection

Netflix operates on over a thousand different types of devices, ranging from laptops to tablets to Smart TVs to mobile phones to streaming sticks. New devices are constantly entering this ecosystem, and existing devices often undergo updates to their firmware or interact with changes to our Netflix application. These changes usually go without a hitch, but at this scale it is not uncommon for them to cause problems for the user experience (e.g., the app will not start up properly, or playback will be inhibited or degraded in some way). In addition, there are gradual trends in device quality that can accumulate over time. For example, a chain of successive UI changes may slowly degrade performance on a particular device such that the degradation is not immediately noticeable after any individual change.

Detecting these changes is a challenging and manually intensive process. Alerting frameworks are a useful tool for surfacing potential issues but oftentimes it is tricky to determine the right criteria for labeling something as an actual problem. A “liberal” trigger will end up with too many false positives, resulting in a large amount of unnecessary manual investigation by our device reliability team, whereas a very strict trigger may miss out on the real problems. Fortunately, we have history on alerts that were triggered as well as the ultimate determination (made by a human) of whether or not the issue was in fact real and actionable. We can then use this to train a model that can predict the likelihood that a given set of measured conditions constitutes a real problem.
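Concretely, the historical alerts and their human verdicts can train an ordinary classifier whose predicted probability then decides which alerts are escalated to the device reliability team. The features, labels, and threshold below are invented for this sketch and are not Netflix's real telemetry.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Hypothetical features measured when an alert fires for a device group:
# relative jump in playback error rate, change in app start-up time (ms),
# number of affected sessions, and days since the last firmware/app change.
rng = np.random.default_rng(5)
n = 4000
X = np.column_stack([
    rng.exponential(0.5, n),      # error_rate_jump
    rng.normal(0, 200, n),        # startup_time_delta_ms
    rng.poisson(50, n),           # affected_sessions
    rng.integers(0, 60, n),       # days_since_last_change
])
# Synthetic stand-in for the human verdict: was the alert real and actionable?
y = ((X[:, 0] > 1.0) & (X[:, 2] > 40) | (X[:, 1] > 300)).astype(int)

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

def triage(alert_features, threshold=0.7):
    """Escalate only alerts the model thinks are likely real problems."""
    p = clf.predict_proba([alert_features])[0, 1]
    return ("escalate" if p >= threshold else "auto-dismiss"), round(float(p), 2)

print(triage([1.8, 50.0, 120, 2]))    # sharp error spike right after a change
print(triage([0.1, -20.0, 5, 45]))    # quiet device group, likely a false positive
```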

The aforementioned problems are a sampling of the technical challenges where we believe statistical modeling and machine learning methods can improve the state of the art. They share several properties:

  • there is sufficient data (over 117M members worldwide)
  • the data is high-dimensional and it is difficult to hand-craft the minimal set of informative variables for a particular problem
  • there is rich structure inherent in the data due to complex underlying phenomena (e.g., collective network usage, human preferences, device hardware capabilities)

Thanks!!
