Probability Distribution - Part 2
Shiva Shankar Moogi
Data Science Intern at AI Aware with expertise in Data Science
Hello Folks,
In this blog, we will discuss continuous distributions. If you want to understand this topic, please check the previous blog first.
Continuous Distribution :
A continuous distribution describes a random variable that can take an infinite number of possible values within a range.
For a continuous distribution, we work with a probability density function (PDF) instead of the probability mass function (PMF) used for a discrete random variable.
Continuous Random Variable :
Probability of an outcome lying within a given interval. E.g.: the duration of a cricket match.
Discrete Random Variables :
Probability of an outcome being a discrete point in the given space. E.g.: the number of children in a family.
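To make the difference concrete, here is a minimal Python sketch. The numbers are made up for illustration: a waiting time assumed to be spread uniformly over 0 to 10 minutes stands in for the continuous case, and a hypothetical table of family sizes stands in for the discrete case.

```python
# Hypothetical continuous variable: a waiting time spread uniformly over 0-10 minutes.
low, high = 0.0, 10.0
density = 1.0 / (high - low)  # the PDF of a uniform distribution is constant


def prob_between(a, b):
    """P(a <= X <= b): the area under the PDF between a and b."""
    a, b = max(a, low), min(b, high)
    return max(b - a, 0.0) * density


p_interval = prob_between(2, 5)  # probability of an interval
p_point = prob_between(3, 3)     # a single point has zero width, so zero area

# Hypothetical discrete variable: number of children in a family (made-up PMF).
children_pmf = {0: 0.2, 1: 0.3, 2: 0.3, 3: 0.15, 4: 0.05}
p_two_children = children_pmf[2]  # probability of one exact value

print(f"P(2 <= wait <= 5) = {p_interval:.2f}")
print(f"P(wait = 3)       = {p_point:.2f}")
print(f"P(children = 2)   = {p_two_children:.2f}")
```

For the continuous variable only intervals carry probability, while for the discrete variable each individual value does.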
Now let's discuss the Normal Distribution:
- The normal distribution is often used to represent real-valued random variables whose distributions are not known.
- The area covered around the mean within a given multiple of the standard deviation (S.D.) is fixed.
- When we plot it as a graph, we get a bell-shaped curve.
The empirical rule, or the 68-95-99.7 rule, tells you where most of your values lie in a normal distribution:
Around 68% of values are within 1 standard deviation from the mean.
Around 95% of values are within 2 standard deviations from the mean.
Around 99.7% of values are within 3 standard deviations from the mean.
As we can see, the curve peaks at the mean, and Y (the density) decreases as X moves away from the mean in either direction.
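The 68-95-99.7 figures can be checked numerically. The sketch below assumes Python 3.8 or later, where the standard library provides `statistics.NormalDist`; it computes the area under a normal curve within 1, 2 and 3 standard deviations of the mean.

```python
from statistics import NormalDist

# Mean 0 and S.D. 1 are used here, but the areas are the same for any normal curve.
std_normal = NormalDist(mu=0, sigma=1)

areas = {}
for k in (1, 2, 3):
    # Area between -k and +k standard deviations = CDF(k) - CDF(-k)
    areas[k] = std_normal.cdf(k) - std_normal.cdf(-k)
    print(f"within {k} S.D. of the mean: {areas[k]:.4f}")
```

The printed areas are roughly 0.6827, 0.9545 and 0.9973, which is where the rounded 68%, 95% and 99.7% come from.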
Standard Normal Distribution:
Let's talk about the Standard Normal Distribution. Standardization is the process of transforming a variable into one with mean 0 and standard deviation 1.
- In simple words, when a normal distribution is standardized, the result is called the standard normal distribution. Standardizing only shifts and rescales the values; it does not turn a non-normal distribution into a normal one.
- The standardized values are also called Z-scores.
Let's take an example. Suppose we have a dataset of X values. Once each X value is standardized, the resulting values are called Z-scores.
A Z-score tells us how far a value deviates from the mean, measured in standard deviations. If the Z-score is positive, the value is greater than the mean; if it is negative, the value is less than the mean.
How to compute Z-scores?
We can calculate Z-scores using the following formula:
z = (x - μ) / σ
- x is the value of the data point
- μ is the mean of the population
- σ is the standard deviation of the population
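As a small worked example of the formula, the sketch below standardizes a made-up list of values using only the Python standard library. The data are hypothetical and are treated as the whole population, so the population standard deviation (`pstdev`) is used.

```python
from statistics import mean, pstdev

# Hypothetical data points, made up for illustration.
x_values = [2, 4, 4, 4, 5, 5, 7, 9]

mu = mean(x_values)       # population mean (5 for this data)
sigma = pstdev(x_values)  # population standard deviation (2.0 for this data)

# z = (x - mu) / sigma for every data point
z_scores = [(x - mu) / sigma for x in x_values]

for x, z in zip(x_values, z_scores):
    side = "above" if z > 0 else "below" if z < 0 else "at"
    print(f"x = {x}: z = {z:+.2f} ({side} the mean)")
```

Here the value 9 gets a Z-score of +2.00, meaning it lies two standard deviations above the mean, while 2 gets -1.50.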
Let's discuss the Gamma Distribution:
The gamma distribution is relevant when modelling waiting times.
It can be compared to the Poisson distribution: the Poisson distribution counts occurrences, while the gamma distribution characterizes the waiting time between occurrences.
If we define X as a random variable that measures the number of times a machine fails in a given period, then X is expected to follow a Poisson distribution.
If Y is another random variable that measures the time between two consecutive failures of that machine, then Y is expected to follow a Gamma distribution.
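The link between failure counts and waiting times can be illustrated with a small simulation. This is only a sketch under assumptions: the failure rate of 2 per hour is hypothetical, and the gaps between consecutive failures are drawn as exponential waiting times, which is the gamma distribution with shape 1. Summing three consecutive gaps gives the waiting time until the third failure, which follows a gamma distribution with shape 3.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the run is repeatable

rate = 2.0          # assumed average number of failures per hour (hypothetical)
n_failures = 20000

# Y: time between consecutive failures (exponential = gamma with shape 1).
gaps = [random.expovariate(rate) for _ in range(n_failures)]

# Turn the gaps into failure timestamps.
timestamps = []
t = 0.0
for g in gaps:
    t += g
    timestamps.append(t)

# X: number of failures in each complete hour.
hours = int(timestamps[-1])
counts = [0] * hours
for ts in timestamps:
    if ts < hours:
        counts[int(ts)] += 1

# Waiting time until the 3rd failure: sum of 3 consecutive gaps (gamma, shape 3).
wait_for_3 = [sum(gaps[i:i + 3]) for i in range(0, n_failures - 2, 3)]

print(f"mean failures per hour: {mean(counts):.3f} (expected about {rate})")
print(f"mean gap between failures: {mean(gaps):.3f} (expected about {1 / rate})")
print(f"mean wait for 3 failures: {mean(wait_for_3):.3f} (expected about {3 / rate})")
```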
The probability density function (PDF) of the gamma distribution is given by:
f(x) = (β^α * x^(α-1) * e^(-βx)) / Γ(α)
- x is the value of the random variable
- α is the shape parameter
- β is the rate parameter (the reciprocal of the scale parameter θ)
- Γ(α) is the gamma function evaluated at α
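The PDF above translates almost directly into Python using `math.gamma` from the standard library. This is a minimal sketch, with the function name `gamma_pdf` chosen here for illustration; it returns 0 for x <= 0 as a simplification, since the formula is meant for positive x.

```python
import math


def gamma_pdf(x, alpha, beta):
    """Gamma density with shape alpha and rate beta, for x > 0."""
    if x <= 0:
        return 0.0  # simplification: treat everything outside x > 0 as zero density
    return (beta ** alpha) * (x ** (alpha - 1)) * math.exp(-beta * x) / math.gamma(alpha)


# Example values (hypothetical parameters): shape 2, rate 1.
for x in (0.5, 1.0, 2.0, 4.0):
    print(f"f({x}) = {gamma_pdf(x, alpha=2.0, beta=1.0):.4f}")
```

With shape α = 1 the formula reduces to β * e^(-βx), the exponential density, which is a quick way to sanity-check the function.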
The cumulative distribution function (CDF) of the gamma distribution can be expressed in terms of the regularized incomplete gamma function, which is defined as:
P(a,x) = γ(a,x) / Γ(a)
- γ(a,x) is the lower incomplete gamma function
- Γ(a) is the gamma function evaluated at a
The CDF of the gamma distribution is then given by:
F(x) = P(α,βx)
- x is the value of the random variable
- α is the shape parameter
- β is the rate parameter
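The CDF can also be sketched without external libraries. The version below is one possible approach, not the only one: it evaluates the regularized incomplete gamma function with the power series γ(a,x) = x^a * e^(-x) * Σ x^n / (a(a+1)...(a+n)), which works well for moderate x. The function names are chosen here for illustration.

```python
import math


def regularized_lower_gamma(a, x, tol=1e-12, max_terms=10000):
    """P(a, x) = gamma(a, x) / Gamma(a), via a power series (suited to moderate x)."""
    if x <= 0:
        return 0.0
    term = 1.0 / a  # first term of the series: 1 / a
    total = term
    n = 0
    while abs(term) > tol * abs(total) and n < max_terms:
        n += 1
        term *= x / (a + n)  # next term = previous term * x / (a + n)
        total += term
    # Multiply by x^a * e^(-x) / Gamma(a), computed in log space for stability.
    return total * math.exp(-x + a * math.log(x) - math.lgamma(a))


def gamma_cdf(x, alpha, beta):
    """F(x) = P(alpha, beta * x) for shape alpha and rate beta."""
    return regularized_lower_gamma(alpha, beta * x)


# With shape 1 the CDF should match the exponential CDF, 1 - e^(-beta * x).
print(gamma_cdf(1.0, alpha=1.0, beta=2.0), 1 - math.exp(-2.0))
```

In practice a library routine would normally be used instead of a hand-written series; this version is only meant to show how F(x) = P(α, βx) fits together.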
FYI, for now just focus on understanding the topic. We will cover the practical side later, and we will use libraries there.