flowchart LR A("N") -->|heads|B("#") A("N") -->|tails|C("#") B("#") -->|1 to 5|D("#") B("#") -->|six|E("#") C("#") -->|1 to 5|F("#") C("#") -->|six|G("#")
Chance, belief and frequency
MVEN10 Risk Assessment in Environment and Public Health
Exercise overview
- Work in groups of 4-5
Background
Probability is a mathematical concept defined basic rules.
Definition: The probability of an event A, denoted P(A), is a number between 0 and 1, with P(A) = 0 corresponding to A being impossible, and P(A) = 1 to A being certain.
Complement rule: P(not A) = 1 - P(A)
Addition rule:
for mutually exclusive events A, B: P(A or B) = P(A) + P(B)
for non-mutually exclusive events A, B: P(A or B) = P(A) + P(B) - P(A and B)
Multiplication rule:
for independent events A, B: P(A and B) = P(A) x P(B)
for dependent events A, B: P(A and B) = P(A|B) x P(B) where P(A|B) is the conditional probability of A given B
There are different ways to interpret and use probability, sometimes within the same assessment. In this exercise you will be exposed to probability as a
Theoretical probability: The number of outcomes favouring the event, divided by the total number of outcomes, assuming the outcomes are all equally likely.
Frequency: The proportion of times, in the long run of identical circumstances that the event occurs.
Subjective probability: A persons confidence that an event will occur, expressed as a number between 0 and 1.
Purpose
- To understand common interpretations of probability and for what they are used
Content
Experiment to illustrate the frequency interpretation of probability
Theoretical probability vs Expected frequency
Subjective probability
Duration
30 minutes
Reporting
No reporting required
References
I have used examples and text from the book Teaching probability by Jenny Gage and David Spiegelhalter from 2016. Cambridge University Press.
Frequency
The experiment is setup as follows:
Assign one student to flip the symmetric coin of the type Antoninus Pius - Bronze Sestertius - Roman Empire using the virtual coin flipper on random.org
Record if the outcome is heads or tails.
Assign another student to throw a six sided dice using the virtual dice roller on random.org
Record if the outcome is a number in the range 1 to 5 or a six
Repeat N=5 times
Assign one student to record the outcomes in this frequency tree (replace N and # with numbers).
Answer the following questions:
- What is the observed frequency of the event “heads followed by a six”?
Expected frequency
Is this a reliable estimate of the expected frequency? If not, what can one do to make it more reliable?
What do you expect the frequency to be if N would be a very large number?
Define the events A = “heads” and B = “six”.
Specify P(A) and P(B).
Calculate P(A and B) using the multiplication rule for two independent events.
Don’t forget to multiply by N to get the expected frequency.
Repeat the experiment with N = 100 to verify if estimates of the expected frequencies become more reliable with a larger number of observations.
Chance
Now let us go back to the step where you specified the probabilities P(A) and P(B). How did you do that? One way to do it is to look at the outcome space, find the outcomes that correspond to the event and divide by the total number of outcomes.
For the coin the outcome space is “heads” and “tails”, i.e. n = 2. The event of a getting “heads” can occur in one of the outcomes, i.e. m = 1. Under the assumption that all outcomes are equally likely, the theoretical probability for “heads” is \(\frac{m}{n} = \frac{1}{2}\).
For the dice, the outcome space is 1, 2, 3, 4, 5, and 6, i.e. n = 6. The event of getting a “six” can occur in one way, i.e. m = 1. The theoretical probability for the event “six” is therefore \(\frac{1}{6}\).
Notice that theoretical probabilities can only be used in balanced situations such as dice, cards, or lottery tickets where it justified to assume symmetry (equal probability) for all possible outcomes.
Relative frequency
If we divide the frequency of an event by the number of trials N, we get the relative frequency which is a good estimate of a probability for the event to occur at the next iteration of the same experiment.
Let \(m\) be the number of times the event has occurred. \(E(\frac{m}{N})=\frac{E(m)}{N}=\frac{N\cdot P(event)}{N} = P(event)\)
Relative frequencies can be used to estimate the probability for an event as long as the observations are equally likely across the full outcome space.
The more observations (i.e. larger N) the better estimate.
The more extreme event, i.e. very low or high probability of occurring, the more observations are needed.
Be very skeptical to estimates of probabilities that are either 0 and 1, when the event is possible to occur.
Belief
Take one of the thumbtacks provided in the exercise and a cup. Put the thumbtack in the cup, shake and place the cup upside down on a table without revealing the outcome.
- What outcomes are possible?
Focus on the outcome that the thumbtack is having its head down with the needle pointing upwards.
- Let everyone in the group state their personal probability of this event as a number between 0 and 1, where 0 means that it is impossible to occur and 1 means that it is certain to occur.
Probabilities 1 (“the event will definitely occur”) or 0 (“the event will definitely not occur”) should be avoided, except when applied to statements that are logically true or false.
- Discuss if and why the probabilities differ.
This is an example of probability as a subjective probability that is purely a personal judgement based on available evidence and assumptions.
More evidence ought to result in smaller divergence in judgements. One way to illustrate this is to lift the cup and let everyone revise their judgement.
- Do that!
Given that the evidence is revealing the outcome, the subjective probabilities held by the students in the group should now be either 1 or 0.
In risk assessment, we seldom have such full certainty as in this example. Probabilities are almost inevitably based on judgements and assumptions such as random sampling.
Probability can be thought of as an expected frequency. Instead of saying that “the probability of the event is 0.20 (or 20%)”, you can say “out of 100 situations like this, we would expect the event to occur 20 times”.
By carefully stating the denominator (reference class), ambiguity about the meaning of probability can be avoided.
This advice applies to any of the interpretations.