The Effort Behind an Algorithm

We all like the idea of magic. Some powerful spell that can solve our (business) problems for us. Unfortunately this just doesn’t exist. Almost always there is someone, a person, behind that magic, doing a lot of manual work. This is true of the world of data science.

You’ve likely heard about algorithms and machine learning. What you may not be fully aware of is how much manual effort is involved in building a great algorithm. By definition an algorithm is simply a process or set of rules to follow in a problem solving operation.

We often forget how unintelligent computers really are. An algorithm is only as effective as the amount of manual effort put forth up front to determine the best process or set of rules.

As an exercise, let’s examine a decision that we make on fairly regular basis:?

Should I fill my car with gas today or tomorrow?

Take two minutes and write down the steps involved in making this decision. I’ll guess you wrote something along these lines:

  1. Do I have time to do it today?
  2. How much driving will I be doing tomorrow?
  3. Do I have enough money to buy gas?

Even with just these three seemingly simple questions, there is now a complex web of information needed to make this decision. Check it out:

  • If I don’t have time today then the other questions don’t matter and the decision is made quickly that I have to wait until tomorrow.
  • If I don’t have enough money then, again, the other two questions don’t matter and I have to wait until tomorrow.
  • If I know I’m driving a lot tomorrow, have time right now, and enough money then it’s an easy yes to get gas today.

What’s interesting here is that there are only two outcomes to this decision: Either I get gas today or tomorrow. How I arrive at that outcome can look very different. The scenarios above are just a few of the possible pathways to reach one of two outcomes.

When data scientists are tasked with developing an algorithm to systematically make a decision, i.e. automate it, there is a ton of effort that goes into understanding how that decision gets made. The better we understand how the decision is made the better we can implement an algorithm which will produce the optimal outcomes (assuming the data is available).

There are often many underlying assumptions being made that also need to be addressed. In our example, I’m assuming:

  • You have a car
  • Your car needs gas instead of an all-electric one
  • You can only buy gas today or tomorrow instead of a different day
  • Your car will be working and drivable over the next 48 hours

If you’re reading this and metaphorically yelling: “You’re forgetting so many other factors that influence getting gas today or tomorrow!” I’m not forgetting. Even for decisions that only require a simple yes or no require a lot of information. We hope to have all the necessary information but realistically we often don’t.

If a small amount of information is missing we can likely still make the decision and be satisfied with the outcome. When a large amount of information is missing, I might as well flip a coin to determine the outcome.

In addition, some pieces of information are more important than others when making a decision. Knowing how much I will drive tomorrow may not be as important as knowing whether I have enough money to buy gas.

Knowing the possible answers and mapping out the different pathways to an outcome will produce a better algorithm than if we didn’t do this.

For data scientists it’s not only understanding how a decision is made but then also building the connections between information needed for the decision and available data that represents that information. When there aren’t enough data points to represent the information needed for a decision then the algorithm will be sub-optimal.

Agree? Disagree? Let me know in the comments.

Maureen Kalas

Fortune 100 strategy/ops + small biz mentoring = growth!

2 年

Love this. Agree with your point about data scientists needing to understand how the decision is made. Data driven solutions need to be built with a clear understanding of the business problem and, preferably, related aspects of the business itself.

要查看或添加评论,请登录

Polly Von Dollen的更多文章

社区洞察

其他会员也浏览了