Consider Adding Operations Research Capabilities to Your Data Science Team
Salvatore Tirabassi
CFO Pro+Analytics | Top Fractional CFO Services | Growth Strategy | Modeling, Analytics, Transformation | 12 M&A & Exit Deals | $500M+ Capital Raised | 10 Yrs CFO | 15 Yrs VC & PE | Wharton MBA | New York & Remote
A few years ago, we started scaling our data science capabilities
Operations research and econometrics, I thought, would round out the team with more knowledge and collaboration possibilities. The vision of combining these diverse skills sets rests on the premise that highly trained analytical personnel will flourish in an environment of diverse backgrounds
For those of you less familiar with operations research, the techniques in that discipline, which lend themselves perfectly well to the typical data science python programming and use of large data sets, are more focused on solving for a clear objective, given stated constraints. Operations research techniques
For the most part, prescriptive analytics and predictive analytics
Here's an example of how the two methods differ when looking at the same problem.
Supposed you wanted to find the most least cost path (time and money) to drive from point A to point B. Over the years, point A and point B have been traveled between many times, using many forms of transportation and you have large data sets associated with the cost and time associated with each trip including associated variables such as time of day, weather, fuel costs, vehicle used, etc.
领英推荐
If you trained a tree algorithm with this data, it would predict the path between A and B (based on previous training data fed into it) that is most likely to satisfy the objective of least cost or highest efficiency. This means that the tree-based predictive model will try to mimic the past performances in the most optimal way. A typical predictive output might be a scored list of trip routes ranked by cost (time and money). The highest scored route would be the one it predicts to be the lowest cost. You could run the model for a time of day, day of week and in certain weather conditions by retraining it only for those instances and have different lists for different occasions.
An operations research model would work differently because it does not predict the right answer based on historical data, it prescriptively tells you the least cost path between A and B. Said another way, this model would be asked to provide the single correct solution to the trip based on constraints you give it and the solution might even be a path that was never taken historically. So, you would provide all of the inputs and way points associated with the trip. You could even give it probability driven inputs like the weather forecast for the specific trip date. This model would then solve for the single route that is most efficient, given constraints, after comparing all the possible choices of routes. It could even tell you what time of day you should leave for the trip in order to achieve the most efficient outcome. You could also have different models for different circumstances and each one have the best route prescriptively calculated for you so that if it was snowing, for example, you would already know how to get to point B. The predictive tree model
So, the two approaches might arrive at similar conclusions for similar circumstances, but they would arrive there differently. The tree approach would use a probabilistic approach to say this is likely the best route based on learning from all previous experiences. The operations research approach would say this is the best route because it looked at all possible choices and found the best one.
The tree algorithm deals with uncertainty better, so if you want choices or don’t know for sure how the trip circumstances are going to unfold, then the tree-based results will be helpful. On the other hand, if you know the inputs are highly certain and the circumstances are locked in but the combinations are too complex to figure out, then the operations research model will give you a reliable trip path to take.
The insight for us is that all of these skills and capabilities are data science and by bringing them under one umbrella we enable a more productive team environment with different questions posed to lead to more elegant and useful solutions. We look forward to adding an econometrician to our team in the future to further enhance our innovative problem solving
GTM | Winemaker
1 年The article raises an interesting point about strengthening analytical work through integrating operations research skills into data science teams. Combining such complementary approaches could maximize insightful outcomes. Thanks for sharing this, Salvatore Tirabassi.