登录查看更多内容

Understanding statistical inference

Ajit Jaokar

发布日期: 2024年6月26日

In previous posts, I mentioned that statistical inference is different from machine learning inference. The key difference is that, in statistical inference, it is assumed that the underlying distribution of the data can be known.(is knowable)

If you are trying to learn machine learning - then you have a reasonable idea of machine learning inference. Machine learning inference is the process of using a trained machine learning model to make predictions on new, unseen data. This is the stage where the model, which has already been trained on a dataset, is deployed to perform tasks such as classification, regression, object detection, etc., in real-world scenarios.

So, what are the steps involved in statistical inference?

Because of the need for the underlying distribution to be knowable, statistical inference involves two steps:

Firstly checking if the models assumptions about the underlying distribution are reasonable. This is mostly achieved using visual tests and?
Goodness of fit tests: Goodness of fit tests are statistical tests used to determine how well a statistical model fits a set of observations. Essentially, they help assess whether the observed data matches the expected data distribution based on a specific model. These tests are crucial for validating the assumptions of statistical models and ensuring their accuracy in representing the underlying data.

领英推荐

Unlocking the Secrets of Data with Distance-Based…

Tariq A. 3 周前

Understanding Support Vector Machines (SVM) and…

Nasr Ullah 4 个月前

Handling Imbalanced Datasets in Machine Learning

RAMA GOPALA KRISHNA MASANI 2 个月前

In this case, the Observed Data is the actual data collected from experiments or real-world observations. The Expected Data is the data that we would expect to see if the model or theoretical distribution we are testing is correct.?

Because of the process of sampling, in goodness of fit tests, the null hypothesis typically states that the observed data follows the expected distribution. The test aims to either confirm or reject this hypothesis based on the test statistic The test statistic is the value calculated from the observed and expected data. This value is compared to a critical value from a statistical distribution to determine whether to reject the null hypothesis. There are a wide range of Goodness of fit tests depending on the model. These, we shall cover in subsequent sections

Image source

https://pixabay.com/photos/mexico-cave-cenote-sinkhole-maya-5066180/??

https://en.wikipedia.org/wiki/Cenote cave (I think it looked like something knowabl/unknowable!)

Artificial Intelligence

115,463 位关注者

要查看或添加评论，请登录

Ajit Jaokar的更多文章

Free review copies of our book - 10X AI developer - Understanding the value in human AI collaboration

2025年3月5日

Free review copies of our book - 10X AI developer - Understanding the value in human AI collaboration

Introduction In our teaching at the University of Oxford, me and Anjali Jain developed an end to end methodology for…

8 条评论
Elevator pitch - Creating an AI first educational Institution for Lifelong learning and re-skilling in AI

2025年3月2日

Elevator pitch - Creating an AI first educational Institution for Lifelong learning and re-skilling in AI

Today, I presented a two minute pitch about the idea Creating an AI first educational Institution for Lifelong learning…

10 条评论
Creating an AI first educational Institution for Lifelong learning and re-skilling in AI

2025年3月1日

Creating an AI first educational Institution for Lifelong learning and re-skilling in AI

Background I have been developing the idea of forming a new type of institution dedicated to lifelong learning and…

20 条评论
Micro scenarios: Scenarios for data driven decision making .. Teaching AI to a 10 year old with the help of chatGPT

2025年2月28日

Micro scenarios: Scenarios for data driven decision making .. Teaching AI to a 10 year old with the help of chatGPT

Background I spoke with Sophie Wrobel this week who has been working with my ideas to teach AI to a ten year old using…

1 条评论
A prompt to explore the narrative creativity approach for problem solving

2025年2月27日

A prompt to explore the narrative creativity approach for problem solving

Extending my previous post on narrative creativity - here is a prompt to learn narrative creativity approach to problem…

4 条评论
Both Elon Musk and Satya Nadella shared about our project on AI in Agtech

2025年2月26日

Both Elon Musk and Satya Nadella shared about our project on AI in Agtech

I woke up this week to find my name all over the Indian media Apparently, both Satya Nadella and Elon Musk shared about…

16 条评论
How to build vertical Vertical LLM Agents - Design considerations

2025年2月24日

How to build vertical Vertical LLM Agents - Design considerations

In the last post, I shared about How to build vertical LLM agents. In this post, I want to share about design…

3 条评论
How to build a vertical LLM Agents

2025年2月23日

How to build a vertical LLM Agents

Background There is a lot of discussion about vertical LLM agents. Most of the excitement stems from a specific y…

5 条评论
How to create Google co scientist like features using Open AI deep research

2025年2月21日

How to create Google co scientist like features using Open AI deep research

I have been using OpenAI deep research and really enjoy it I saw the announcement about co scientist from Google And I…

6 条评论
Types of statistical Inference

2025年2月20日

Types of statistical Inference

I recently posted about Statistical Inference vs Machine Learning inference vs Deep learning inference. These ideas are…

5 条评论

See all articles

Understanding statistical inference

Ajit Jaokar

领英推荐

Artificial Intelligence

115,463 位关注者

Ajit Jaokar的更多文章

社区洞察

其他会员也浏览了

Simplifying Machine Learning’s Orthogonality and Orthonormality

What is RandomizedSearchCV in Machine Learning

Understanding the Essentials of Machine Learning: A Deep Dive into Module 6 / Chapter 3 of Tom M. Mitchell, Machine Learning Book -Decision Trees

Finding Connections in Data: Your Guide to Understanding Distance Measures in Machine Learning

XGBoost - What is and why it reins all ML algorithms

Class 15 - INTRO TO SCIKIT LEARN AND CLASSIFICATION Notes from the AI Basic Course by Irfan Malik & Dr Sheraz Naseer (Xeven Solutions)

Model Fine-Tuning

The Power of Prediction: Linear Regression in Machine Learning

Bias and Variance in Good Fit Models

领英推荐

Artificial Intelligence

115,463 位关注者

Ajit Jaokar的更多文章

Free review copies of our book - 10X AI developer - Understanding the value in human AI collaboration

Elevator pitch - Creating an AI first educational Institution for Lifelong learning and re-skilling in AI

Creating an AI first educational Institution for Lifelong learning and re-skilling in AI

Micro scenarios: Scenarios for data driven decision making .. Teaching AI to a 10 year old with the help of chatGPT

A prompt to explore the narrative creativity approach for problem solving

Both Elon Musk and Satya Nadella shared about our project on AI in Agtech

How to build vertical Vertical LLM Agents - Design considerations

How to build a vertical LLM Agents

How to create Google co scientist like features using Open AI deep research

Types of statistical Inference

社区洞察

其他会员也浏览了

Simplifying Machine Learning’s Orthogonality and Orthonormality

What is RandomizedSearchCV in Machine Learning

Understanding the Essentials of Machine Learning: A Deep Dive into Module 6 / Chapter 3 of Tom M. Mitchell, Machine Learning Book -Decision Trees

Finding Connections in Data: Your Guide to Understanding Distance Measures in Machine Learning

XGBoost - What is and why it reins all ML algorithms

Class 15 - INTRO TO SCIKIT LEARN AND CLASSIFICATION Notes from the AI Basic Course by Irfan Malik & Dr Sheraz Naseer (Xeven Solutions)

Model Fine-Tuning

The Power of Prediction: Linear Regression in Machine Learning

Bias and Variance in Good Fit Models