Introduction
Statistical learning is based on "well-formed"statistical models. This makes it easy to define likelihood.
Probability and Statistics
- Random Variable
A random variable (r.v) is a measurable function from a probability space to a measurable space
If you don't understand anything, that's okay. You just need to remember that this is an application of a large space we don't know about to our real world. If you want to fully understand this definition, take a look at the Appendix.
A sample is a realization of the random variable .
- Exemple 1
- Exemple 2
You can imagine that you're flipping a coin. You'll know it's either heads or tails (i.e. ) without knowing the pulse you're giving (i.e. ).
A random variable can be more complex: the next word you think of (with and can be the whole structure of your brain and your knowledge)...
- Law of X
The law of or the probability distribution of is its measure of probability on .
If you don't understand anything again, that's okay too. You just need to remember that the law of is a function that describes how the values of are distributed. Many of these are already known and have names: bernoulli, normal, gamma, etc. A part of the job of a probability/statistics researcher is to play with them. If you want to fully understand this definition, take a look at the Appendix.
Statistical models
Statistical modeling is the basis of all statistical inference. To model an experiment is to propose a theoretical law for the random variable .
- Statistical model
A statistical model is the tuple where is a measurable space, is a family of probability law (i.e ) and is just the number of variable.
It may seem complicated at first, but reassuringly it's not. Let's look at an example through an exercise.
- Exercise
- Tips
- Result
Give the statistical model of a identical flipping coin. A priori, we don't know if the coins are balanced.
- Forget if you didn't understand the Appendix.
- How many variables you will have?
- What value can the flipping coin take? I.e What is the arrival set?
- You don't have to take a all family of probability law: you can just take one law! In this way, which one do you choose? And what are the possible values of ?
The number of coin is so we have . Each coin can get the value 0 or 1, so the set is and the -algebra is of the measurable space. The law of each coin is a bernoulli noted and because we don't know if the coins are balanced, can be between and , so . So finaly,
Note: Maybe you're frustrated because you haven't found Bernouilli's law, that's normal. In statistical modeling, the family of laws often comes from the hat like that. It's with experience and by analyzing throws that we can get this law reflex. That's why we talk about modeling and not reproduction.
If the distribution is i.i.d, we can use the direct notation is the tuple
- Parametric model
is said parametric when has a finite dimension.
- Unparametric model
is said unparametric when has a unfinished dimension.
As unparametric is less restrictive than parametric, it can be used with a wide range of assumptions.
- Identifiability
is identifiable when