LinkedIn's A.I. Breakthrough in Real-Time Personalization Explained
Michael Spencer
A.I. Writer, researcher and curator - full-time Newsletter publication manager.
LinkedIn Showcases Real-Time Features for Near Real-Time Personalization
Making relevant recommendations to LinkedIn Members. Blog summary.
If you enjoy articles about A.I. at the intersection of breaking news join AiSupremacy?here. I cannot continue to write without community support. (follow the link below).
https://aisupremacy.substack.com/subscribe
AiSupremacy is a Newsletter at the intersection of A.I. and breaking news.?You can keep up to date with the articles?here.
Fair warning this article summary gets a bit technical as it is based on LinkedIn's AI Engineering blog full of technical details.
In a world of streaming, videos stories and competition of notifications for human attention, real-time personalization on our app Feeds is more important than ever. For busy professionals hunting for a job, someone in their network or some topic they are interested in, speed is of the essence.
At LinkedIn, enabling this kind of economic mobility at scale is their job—that is, they continually want to connect every member of the professional workforce in the world to opportunity.
The quality of their social feed is debatable, with many people having their own points of view. However the majority of people who log-in to LinkedIn are job seekers, sales professionals or HR professionals with their own particular goals.
LinkedIn’s Growing Relationship with A.I.
In this article we’re mostly going to be talking about the machine learning steps behind how the recommendation engine is improving.
LinkedIn, which is now becoming more popular in South Asia, even has a course on?Artificial Intelligence. In fact today in 2022, there are many such courses on LinkedIn Learning. I don’t know if they are any good.
Machine learning is one of the liveliest areas in artificial intelligence. Machine learning algorithms allow computers to learn new things without being programmed. Even in an era of?Deepfake LinkedIn profiles, LinkedIn is updating its feed with new AI technologies making if faster and more personalized.
Organizations applying real-time machine learning are reportedly seeing increased return on investment especially when it come sto real-time personalization. LinkedIn?AI engineering team?is thinking about a lot of issues?as it pertains to security?and personalization among other factors.
From?Bayesian optimization?by?Yunbo Ouyang, to?fairness in its AI products?by?Heloise Logan, you get a sense of the diversity of projects that LinkedIn’s AI team is working on. Recently a more exciting topic came across my desk to do with real-time personalization. So the rest of this article will attempt to summarize it in some detail.
Real-Time Features for Near Real-Time Personalization
According to a basic definition, AI is the science and engineering of building intelligent computer programs that can achieve complex goals. LinkedIn now under Microsoft is doing just that.
Looking back on AI’s history at LinkedIn some context can be helpful.
In order to understand how AI systems help LinkedIn achieve its goals, it’s important step back and look at how these algorithms work.
So what’s new in 2022?
The blog article that we are going to summarize?Co-authored by:?Rupesh Gupta,?Sasha Ovsankin,?Qing Li,?Seunghyun Lee,?Benjamin Le, and?Sunil Khanal.
Do you enjoy A.I. articles at the intersection of breaking news, then help me continue to write on the subject. I cannot continue to write without support. Grateful for all tips, patronage and community contributions.
Improving LinkedIn Recommendation Engine
At LinkedIn, they are striving to serve the most relevant recommendations of their members, whether that’s a job they may be interested in, a member they may want to connect with, or another type of suggestion.
Speed Approaching Real-Time Personalization
However, there is usually a delay between when a member takes an action and when it can be leveraged to adapt recommendations for that member.
Figure 1. A conventional feature pipeline for leveraging past actions of a member to personalize recommendations.
LinkedIn’s solution is based on the following two ideas:
Understanding Past Behavior
Computation of features based on a member’s past actions
The team involved surveyed several AI teams at LinkedIn to understand how they compute features (through the conventional feature pipeline) based on a member’s past actions. They noticed a generic pattern in computation of a majority of these features. This computation comprises three steps:
Step 1:?Get relevant actions taken by a member over a duration of time. For example, get all the job-apply actions taken by a member over the last 7 days.
Step 2:?Look up certain attributes of the entities on which the above actions were taken. For example, look up the embedding (a numeric vector representation) of each job which the member applied to from the previous step.
Step 3:?Perform a summarization operation on the attributes of all the entities. For example, compute the average of the embeddings of all the jobs from the previous step.?
For datascience and AI generally speaking, it’s pretty interesting to see how they conceptualize this.
A recommender system might also compute a (member, item) pair feature based on the member’s past actions.
For example, the job recommender might use the following pair feature when scoring a candidate job recommendation jobi for a member: the number of times this member applied to any job in the same geographic location as jobi in the last 7 days.
领英推荐
Summary of requirements
The generic computation pattern above helped the team define the requirements for their desired solution. These requirements were:
Requirement 1: Ability to record any member action of interest within a few seconds.
Requirement 2: Ability to join any attributes of the entity on which an action was taken.?
Requirement 3: Ability to retrieve actions taken by a member (along with joined attributes) that meet certain criteria and compute features from those actions?in less than 100 milliseconds.?
So while our experience of LinkedIn’s feed might not be any better on a UX or content level, the personalization and speed behind it has improved significantly in recent months.
Design of solution
With the standard schema for representing any member action, we designed our solution as shown in Figure 2.?
Figure 2. LinkedIn AI Engineering team’s solution leveraging actions of a member in near real-time to adapt recommendations for that member in near real-time.
They introduced an?Apache Samza?stream processor to listen for and process events corresponding to member actions of interest from Kafka.
They chose to support the?Samza SQL API?for writing the processing logic in this processor. It limits processing logic to simple operations such as filtering, stream-table joins, and projections. This limitation helped them ensure that the stream processor is always simple and lightweight.
The exact processing logic can be different for different use cases, but generally looks like the following:
The store is configured to retain data for 96 hours, which means that an action is deleted from the?store 96 hours?after it is ingested—this keeps the size of the store under control.
The store is also configured to use “actor” as the primary key, so that the data is partitioned and sorted based on the “actor” column, which allows quick retrieval of actions taken by a specific member.
They chose Pinot as our store for several reasons. Their main ones were:
A recommender system can now query this Pinot store when recommendations need to be generated for a member.
Depending on the types of features required, the recommender system appropriately queries the Pinot and attributes stores to compute the near real-time features. It then uses these features, along with other features (such as those computed through the conventional feature pipeline) in a model to score candidate recommendations.
The near-real time features can capture the short-term intent and preferences of a member, while the other features can capture the longer-term intent and preferences.
After scoring, the recommender system also emits an event to log the computed features to HDFS. Since features based on a member’s actions can be very time sensitive (for example, a member may apply to two jobs within a minute, or click on two feed articles within seconds), logging them ensures that they have the correct value of these features associated with each impression of recommendations. This makes it easy to prepare training data for future iterations of the model.?
Results
Their solution has been successfully adopted by several recommender systems at LinkedIn to leverage actions of a member in near real-time to adapt recommendations for that member in near real-time.
So it seems even if your might find LinkedIn’s Feed cringe at times, the tech behind it is truly moving ahead with A.I. research by the engineering team. As far as functional things go like finding a job, the personalization of the user experience has been considerably refined.
The solution has been able to meet all the requirements:?
It has also resulted in significant gains in business metrics. The gains realized from the job, feed, and search typeahead recommenders and so forth.
This team also thanks fellow contributors and collaborators such as:?Jiaqi Ge,?Aditya Toomula,?Mayank Shrivastava,?Minhtu Nguyen,?Justin Zhang,?Xin Yang,?Ali Hooshmand,?Yuankun Xue,?Xin Hu,?Qian Li,?Hongyi Zhang,?Marco V. Varela,?Manas Somaiya,?Shraddha Sahay,?Raghavan Muthuregunathan,?Anand Kishore,?Daniel Gmach,?Joshua Hartman,?Shipeng Yu,?Abhimanyu Lad,?Tim Jurka,?Romer Rosales, and many others who helped them.
To read more of the details and workflow about how they achieved the solution you can read the original blog here:
At LinkedIn, they really do strive to serve the most relevant recommendations to their members, whether that’s a job you may be interested in, a member you may want to connect with, or another type of suggestion.
It’s pretty neat to try to understand some of the A.I. behind that recommendation process and how it’s improving even years after LinkedIn was born and grew to over 800 million users.
This is not a sponsored post, just something I was interested in seeing. I try to cover AI Google, Meta AI, Microsoft AI and Microsoft Research along with DeepMind and OpenAI in equal distribution so far as is possible.
The lead writer of the blog was?Rupesh Gupta, who has been a senior staff Engineer at LinkedIn for over 9 years. Finally if you are interested in following this sort of thing you can follow?LinkedIn’s Engineering blog here.
If you enjoy articles about A.I. at the intersection of breaking news join AiSupremacy?here. I cannot continue to write without community support. (follow the link below).
https://aisupremacy.substack.com/subscribe
AiSupremacy is a Newsletter at the intersection of A.I. and breaking news.?You can keep up to date with the articles?here.
NOTE FROM THE AUTHOR
I cannot continue to write without?tips, patronage and community support?from you, my readers and audience. I want to keep my articles free for the majority of my readers.
So by subscribing you are essentially?helping fund a network of Newsletters?whose aim is to inspire and inform. This is my only job and stream of income.
AiSupremacy is the fastest Substack Newsletter in AI at the intersection of breaking news.?It’s ranked #1 in Machine Learning as of January 22nd, 2022.
Thanks for reading!
Investigator SIU, ORMC
3 年So now I’ve got a question for the guru Michael Spencer. With the US debt headed towards $40 trillion and the technological Edge and likely the Chinese equivalent of the dollar becoming the new world currency what can we do to change the narrative and therefore the US trajectory?
Product Manager,
3 年Good having a break through can make the difference appreciate thank you ??