Occam’s Razor

Buying a new car can be a pretty daunting experience unless you know exactly what you want. Deciding on a make and model is just the start - at this point you are presented with a huge array of features, add-ons and extras. It can be very tempting to nod along and end up with everything without thinking through the cost.

There are clear parallels for me when I think about the algorithmic choices available with Machine Learning. There are always newer, flashier algorithms, or ways to add more complexity to your current approach. Some of these choices will help you thrive.

Historically, in our UK business, logistic regression was the core algorithm underpinning our critical underwriting models, the ones that helped us make key business decisions. The approach was well understood. However, a great data scientist never rests on their laurels, and over the past 5 years the team has broadened its skillset to bring in more complex Machine Learning algorithms to support this key decision.

At each step of the way, the important thing is to balance increased complexity against the improvement in model predictive power. Occam’s Razor is a heuristic focusing on parsimony or simplification. The philosophy advocates that when you are presented with competing hypotheses or models giving similar predictions, you should select the solution with the fewest assumptions. In a data science application this means being intentional about the complexity you introduce through data or algorithms. Does this complexity lead to meaningful changes in your decisions or prediction levels? If not, then you should strip it all away.
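One way to make this trade-off explicit is a simple selection rule: among candidate models, keep the simplest one whose validation score is within a chosen tolerance of the best. The sketch below is only an illustration of that idea, not a description of any team's actual process. The `Candidate` class, the `pick_parsimonious` function, the ordinal `complexity` rank, the `min_gain` tolerance and the scores are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    complexity: int  # hypothetical ordinal rank: 1 = simplest
    score: float     # validation metric where higher is better (e.g. AUC)


def pick_parsimonious(candidates, min_gain=0.01):
    """Return the simplest candidate scoring within min_gain of the best."""
    if not candidates:
        raise ValueError("need at least one candidate")
    best = max(c.score for c in candidates)
    # Keep every model whose shortfall against the best is not meaningful.
    good_enough = [c for c in candidates if best - c.score < min_gain]
    # Prefer lower complexity; break ties with the higher score.
    return min(good_enough, key=lambda c: (c.complexity, -c.score))


# Illustrative placeholder scores, not real results.
candidates = [
    Candidate("logistic_regression", complexity=1, score=0.781),
    Candidate("gradient_boosting", complexity=2, score=0.786),
    Candidate("deep_ensemble", complexity=3, score=0.788),
]

chosen = pick_parsimonious(candidates, min_gain=0.01)
print(chosen.name)  # prints "logistic_regression"
```

The value of `min_gain` is where the business judgement sits: it encodes how much extra predictive power is needed before added complexity is worth its risks. Tightening it to 0.005 in this toy example would select `gradient_boosting` instead.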

Occam’s Razor remains more relevant than ever, and an effective, pragmatic data scientist will always look to balance simplicity against improved model performance.

Questions to ask:

  • If you are using a new technique: can you articulate the increased complexity in terms of risks?
  • If so, do you understand the incremental benefits?
  • Have you made an intentional (and documented) choice after weighing up these two dimensions?
  • How are you embedding an ongoing process to challenge and remove unnecessary complexity?

Last week: opening the watch

Next week: the jets and the sharks
