Testing Data and Drawing the Threshold (3/5)
Gary M. Shiffman
Economist · 2x Artificial Intelligence company co-founder · Writer
In my previous articles, I introduced Machine Learning (ML), training data, and the sources of accuracy and bias, and I made assertions about building “better” algorithms. Now, let’s unpack “better” and how to measure algorithmic performance.
Remember, in this series, “chihuahua” can stand in for anything you seek to discover. You created a large sample of properly labeled data, the training data, and fed that to an algorithm, creating a chihuahua algorithm.
The output of any ML algorithm is a distribution. Along the x- or horizontal axis, you have a measure of chihuahua-ness, sometimes referred to as the algorithm’s confidence in “predicting” that an entity is a chihuahua. Along the y- or vertical axis, you have the count of entities.
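For readers who think in code, here is a minimal sketch of such a distribution in Python; the scores are simulated stand-ins, not the output of any real chihuahua model:

```python
import numpy as np

# Hypothetical confidence scores (0-10) produced by a trained model for
# each entity in a test set; in practice these come from the algorithm.
rng = np.random.default_rng(seed=42)
scores = np.clip(rng.normal(loc=4.0, scale=2.5, size=500), 0, 10)

# Count how many entities fall into each unit-wide score bin:
# the x-axis is "chihuahua-ness," the y-axis is the entity count.
counts, bin_edges = np.histogram(scores, bins=10, range=(0, 10))
for lo, hi, n in zip(bin_edges[:-1], bin_edges[1:], counts):
    print(f"score {lo:>4.1f}-{hi:>4.1f}: {'#' * (n // 5)} ({n})")
```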
Once the algorithm creates the distribution, the human must perform the single most important task: draw the threshold. In my decade-plus of working with ML systems, this is perhaps the most misrepresented aspect of the art of deploying AI/ML technologies into high-consequence operational environments.
Machines have no conscious awareness of right and wrong; humans must supply it. How many images should be treated as “alerts” and sent for human review? A data scientist might say that the algorithm “predicted” which entities are of interest to the operator. But the prediction requires a threshold, and a threshold depends upon particular risk profiles and risk preferences. The machine only creates the distribution, using training data provided by humans. The human makes the next move of drawing a threshold.
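To see why the threshold is a risk decision rather than a modeling decision, note that each candidate threshold implies a different alert volume for the human review team. A small illustrative sketch, again with made-up scores:

```python
import numpy as np

# Hypothetical scores, as in the previous sketch.
rng = np.random.default_rng(seed=42)
scores = np.clip(rng.normal(loc=4.0, scale=2.5, size=500), 0, 10)

# Each candidate threshold implies a different alert workload:
# a risk-averse operator tolerates more alerts (lower threshold);
# a resource-constrained one tolerates fewer (higher threshold).
for threshold in (5, 6, 7, 8, 9):
    alerts = int((scores >= threshold).sum())
    print(f"threshold {threshold}: {alerts} entities sent for human review")
```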
In a small population, a person can easily distinguish the chihuahuas from the not-chihuahuas. But to find the sought-after pattern across a large data set, the ML goes back to work, using the labeled data to create test data.
In the image here, showing 500 entities of test data in a post-algorithm distribution, only the labeled chihuahua images appear in color for the purposes of this article; the computer can “see” the label. The human-drawn threshold tells the system to treat scores of eight and above as-if chihuahua, and scores of seven and below as-if not-chihuahua. Now we can measure performance.
First, we count True Positives, False Positives, True Negatives, and False Negatives (a counting sketch in code follows the list below).
Above (right of) the threshold = Predicted Positive
Chihuahuas above the threshold = True Positive
Not-chihuahuas above the threshold = False Positive
Below (left of) the threshold = Predicted Negative
Chihuahuas below the threshold = False Negative
Not-Chihuahuas below the threshold = True Negative
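Here is a minimal sketch of that counting in Python, with hypothetical labels and scores standing in for real test data:

```python
import numpy as np

# Hypothetical test data: a score (0-10) and a human-provided label
# for each of 500 entities. True means "actually a chihuahua."
rng = np.random.default_rng(seed=42)
labels = rng.random(500) < 0.1  # ~10% of entities are chihuahuas
scores = np.where(labels,
                  np.clip(rng.normal(7.5, 1.5, 500), 0, 10),
                  np.clip(rng.normal(3.5, 1.5, 500), 0, 10))

THRESHOLD = 8  # the human-drawn line: 8 and above = predicted positive
predicted_positive = scores >= THRESHOLD

true_positives = int((predicted_positive & labels).sum())
false_positives = int((predicted_positive & ~labels).sum())
false_negatives = int((~predicted_positive & labels).sum())
true_negatives = int((~predicted_positive & ~labels).sum())

print(f"TP={true_positives} FP={false_positives} "
      f"FN={false_negatives} TN={true_negatives}")
```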
Looking at the image, every chihuahua above the threshold is called a true positive – the algorithm-human team got it right. Everything above the threshold that isn’t a chihuahua is a false positive.
Similarly, “not-chihuahuas” below the threshold are true negatives – a win for team algorithm-human. All chihuahuas below the threshold are false negatives – human traffickers and money launderers that evaded us, again. Counting and some simple math get us to the measurements of accuracy: effectiveness and efficiency. In the next article, I will dive into accuracy in more detail.
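The “simple math” is easy to sketch. One common reading, and the assumption of this illustration rather than the author’s stated definitions, maps effectiveness to recall (the share of actual chihuahuas caught) and efficiency to precision (the share of alerts that were real):

```python
# Two common accuracy measures computed from the four counts.
# (Mapping "effectiveness" to recall and "efficiency" to precision is
# this sketch's assumption; the next article defines the terms.)
def recall(tp: int, fn: int) -> float:
    """Effectiveness: share of actual chihuahuas the system caught."""
    return tp / (tp + fn) if (tp + fn) else 0.0

def precision(tp: int, fp: int) -> float:
    """Efficiency: share of alerts that were actual chihuahuas."""
    return tp / (tp + fp) if (tp + fp) else 0.0

# Example with illustrative counts from a hypothetical run:
print(f"effectiveness (recall):  {recall(40, 10):.2f}")    # 0.80
print(f"efficiency (precision):  {precision(40, 15):.2f}")  # 0.73
```

Moving the threshold trades one measure against the other: a lower threshold catches more chihuahuas but floods reviewers with false positives, while a higher threshold does the reverse.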
This is the third article of a 5-part series. See my video short on this topic or read Article 2 here.