Probability Fundamentals | Module 4 | Applied Statistics Course
Applied Statistics · Module 4

Probability Fundamentals

Your complete beginner-to-intermediate guide to probability in statistics, finance, auditing, and business decision-making

⚡ Quick Answer: What is Probability?

Probability is a number between 0 and 1 that measures how likely an event is to occur. A probability of 0 means the event is impossible; a probability of 1 means the event is certain. Everything in between represents varying degrees of likelihood — and mastering probability is the single most important foundation for all of statistics.

Learning Objectives

By the end of this module, you will be able to:

#ObjectiveSkill LevelApplication
1Define probability and explain its role in statisticsBeginnerAll quantitative fields
2Identify sample spaces, events, and outcomesBeginnerExperimental design
3Apply the Addition, Multiplication, and Complement rulesBeginnerRisk, audit, business
4Calculate conditional probability using the formal formulaIntermediateFinance, insurance, fraud detection
5Apply Bayes' Theorem to update beliefs with new evidenceIntermediateMedical diagnosis, credit risk, ML
6Build and interpret probability treesIntermediateDecision analysis
7Identify and apply Binomial, Poisson, Normal, and Uniform distributionsIntermediateData analysis, forecasting
8Apply probability to real-world financial, audit, and business problemsIntermediateProfessional practice

Section 4.1 — Introduction to Probability

๐ŸŽฏ Learning Goal

Understand what probability is, where it comes from, and why it is the essential language of statistics, finance, and data-driven decision-making.

What Is Probability?

Simple definition: Probability is a measure of how likely something is to happen. It is expressed as a number from 0 to 1, or equivalently as a percentage from 0% to 100%.

Academic definition: Probability is a real-valued function defined on a sample space that assigns to each event a non-negative number satisfying the Kolmogorov axioms: non-negativity, normalization (total probability = 1), and additivity for mutually exclusive events.

Statistical definition: Probability quantifies uncertainty. It is the long-run relative frequency of an event across an infinite number of repeated experiments under identical conditions, or a subjective degree of belief calibrated to coherent standards.

Probability ValueMeaningEveryday Example
0Impossible — will never happenRolling a 7 on a standard die
0.1 — 0.2Unlikely but possibleBeing selected in a lottery
0.5Equal chance — coin flipGetting heads on a fair coin
0.7 — 0.9Likely but not certainA good student passing an exam
1.0Certain — will always happenThe sun rising tomorrow

Why Probability Matters

Probability is not abstract mathematics — it is the engine behind every major decision framework in modern life:

FieldApplicationProbability Question
FinancePortfolio riskWhat is the probability this investment loses value?
AuditingSampling riskWhat is the probability of missing a material error?
MedicineDiagnosisGiven a positive test, what is the probability of the disease?
InsurancePremium pricingHow likely is a policyholder to file a claim this year?
BusinessSales forecastingWhat is the probability next month's revenue exceeds budget?
Machine LearningClassificationWhat is the probability this email is spam?
Credit RiskLoan decisionsWhat is the probability this borrower defaults within 12 months?
Quality ControlDefect detectionWhat is the probability a batch contains at least one defect?
"Probability theory is nothing but common sense reduced to calculation." — Pierre-Simon Laplace
๐Ÿ”‘ Section 4.1 Key Takeaways
  • Probability measures likelihood on a scale from 0 (impossible) to 1 (certain).
  • It applies universally — finance, auditing, medicine, data science, and daily decisions all rely on probability.
  • Probability is the mathematical foundation upon which sampling, hypothesis testing, and regression are built.
  • There are three perspectives: classical (equally likely outcomes), empirical (observed data), and subjective (informed belief).

Section 4.2 — Basic Probability Concepts

๐ŸŽฏ Learning Goal

Master the vocabulary of probability: experiments, outcomes, sample spaces, and events — the building blocks for every probability calculation you will ever perform.

Core Vocabulary

TermDefinitionDice ExampleBusiness Example
ExperimentAny process that produces a measurable resultRolling a six-sided dieLaunching a new product
OutcomeA single possible result of an experimentRolling a 4Product succeeds
Sample Space (S)The set of ALL possible outcomesS = {1, 2, 3, 4, 5, 6}S = {success, failure}
Event (E)A specific subset of the sample spaceRolling an even numberProduct achieves >10% market share
Simple EventAn event containing exactly one outcomeRolling a 3Exactly 5 units sold
Compound EventAn event containing two or more outcomesRolling a number > 4Sales between 100–200 units
Complementary Event (E')All outcomes NOT in event ENot rolling a 6Product does not achieve target
Mutually ExclusiveTwo events that cannot both occur at onceRolling a 2 AND rolling a 5 (same roll)Profit AND loss in same quarter
Independent EventsOccurrence of one does not affect the otherFirst roll result vs. second roll resultWeather in Tokyo vs. sales in London

The Probability Formula

๐Ÿ“ Core Probability Formula
P(E) = n(E) / n(S)
P(E) = Number of favorable outcomes ÷ Total possible outcomes

Variable breakdown:

  • P(E) — Probability of event E occurring (a value between 0 and 1)
  • n(E) — Number of outcomes in the event (favorable outcomes)
  • n(S) — Total number of outcomes in the sample space

Worked Example — Standard Die Roll

1
Define the experiment: Roll one fair six-sided die.
2
List the sample space: S = {1, 2, 3, 4, 5, 6} → n(S) = 6
3
Define the event: E = "rolling an even number" = {2, 4, 6} → n(E) = 3
4
Calculate: P(E) = 3/6 = 0.5 = 50%
5
Interpret: There is a 50% chance of rolling an even number on a fair die.

Types of Probability

TypeDefinitionFormula BasisExampleBest Used When
ClassicalBased on equally likely outcomes from theory alonen(E)/n(S)Tossing a coin: P(H) = 1/2Games, theoretical problems
EmpiricalBased on observed frequency of actual past eventsf/n (frequency/total)200 of 1,000 loans defaulted: P(default) = 0.20Historical data available
SubjectiveBased on expert judgment and informed beliefExpert estimateAnalyst estimates 65% chance competitor enters marketNo data, expert knowledge available
๐Ÿ“Œ Finance Application
In credit risk modelling, classical probability underpins fair-value pricing models; empirical probability drives PD (Probability of Default) estimates in Basel III models using historical loan performance data; and subjective probability is used by investment analysts when assessing emerging markets with limited historical data.
๐Ÿ”‘ Section 4.2 Key Takeaways
  • Every probability problem has an experiment, sample space, and one or more events.
  • P(E) = n(E)/n(S) only when all outcomes are equally likely.
  • Empirical probability uses observed data: P = frequency / total observations.
  • Subjective probability is valid when no data exists but expert judgment is available.

Section 4.3 — Probability Rules

๐ŸŽฏ Learning Goal

Apply the three fundamental probability rules — Addition, Multiplication, and Complement — to calculate probabilities of combined events in finance, auditing, and business contexts.

Rule 1 — The Complement Rule

๐Ÿ“ Complement Rule
P(E') = 1 − P(E)
The probability that an event does NOT occur equals 1 minus the probability it does occur.

Worked Example: A company estimates a 30% chance of winning a contract. What is the probability of NOT winning?

P(win) = 0.30 → P(not win) = 1 − 0.30 = 0.70 = 70%

Audit Application: If an auditor estimates a 5% risk that internal controls are ineffective (control risk = 0.05), then the probability that controls ARE effective = 1 − 0.05 = 0.95 (95%).

Rule 2 — The Addition Rule

๐Ÿ“ Addition Rule — General Form
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
For mutually exclusive events where P(A ∩ B) = 0: P(A ∪ B) = P(A) + P(B)

Variable breakdown:

  • P(A ∪ B) — Probability that A OR B (or both) occur
  • P(A ∩ B) — Probability that BOTH A and B occur simultaneously (joint probability)

Worked Example — Loan Portfolio

A bank portfolio contains two loans. Based on credit models:

  • P(Loan A defaults) = 0.15
  • P(Loan B defaults) = 0.10
  • P(Both A and B default) = 0.04

What is the probability that at least one loan defaults?

P(A ∪ B) = 0.15 + 0.10 − 0.04 = 0.21 = 21%

Interpretation: There is a 21% chance that at least one loan in this portfolio will default.

⚠️ Common Mistake
Forgetting to subtract P(A ∩ B). If you add P(A) + P(B) = 0.25 without subtracting the overlap, you count the joint event twice — this inflates the probability and leads to incorrect risk assessments.

Rule 3 — The Multiplication Rule

๐Ÿ“ Multiplication Rule
P(A ∩ B) = P(A) × P(B|A)
For independent events: P(A ∩ B) = P(A) × P(B)

P(B|A) is the conditional probability of B given that A has already occurred (covered in Section 4.4).

Worked Example — Quality Control

A factory produces items. 5% are defective. You randomly test two items independently.

What is the probability BOTH items are defective?

P(defective) = 0.05 for each item (independent events)

P(both defective) = 0.05 × 0.05 = 0.0025 = 0.25%

Independent vs. Dependent Events

DimensionIndependent EventsDependent Events
DefinitionOccurrence of A has NO effect on P(B)Occurrence of A CHANGES P(B)
TestP(A ∩ B) = P(A) × P(B)P(A ∩ B) = P(A) × P(B|A)
Formula usedSimple multiplicationConditional probability required
Statistical exampleTwo separate coin flipsDrawing cards without replacement
Business exampleSales in Tokyo vs. Lagos (unrelated markets)Sales this month vs. last month (trend relationship)
Finance exampleStock returns of uncorrelated assetsDefault of a subsidiary given parent default
Audit exampleTesting two unrelated account balancesTesting accounts where errors in one indicate errors in another

Probability Rules Summary

RuleFormulaWhen to UseKey Caution
ComplementP(E') = 1 − P(E)Finding "not E" probabilityAlways valid — no conditions
Addition (Mutually Exclusive)P(A∪B) = P(A) + P(B)A and B cannot happen togetherEvents truly cannot overlap
Addition (General)P(A∪B) = P(A)+P(B)−P(A∩B)A and B might overlapAlways use this unless exclusivity is proven
Multiplication (Independent)P(A∩B) = P(A)×P(B)A and B are independentVerify independence before applying
Multiplication (Dependent)P(A∩B) = P(A)×P(B|A)A affects the probability of BRequires conditional probability
๐Ÿ”‘ Section 4.3 Key Takeaways
  • The Complement Rule: P(not E) = 1 − P(E). Use it to find "at least one" probabilities efficiently.
  • The Addition Rule: Always subtract the joint probability to avoid double-counting overlapping events.
  • The Multiplication Rule: Use P(A)×P(B) only for independent events; use P(A)×P(B|A) for dependent events.
  • Before applying any rule, classify your events: mutually exclusive? Independent? Dependent?

Section 4.4 — Conditional Probability

๐ŸŽฏ Learning Goal

Calculate and interpret conditional probability — the probability that an event occurs given that another event has already occurred — and apply it to financial screening, audit risk, and customer analytics.

⚡ Quick Answer: What is Conditional Probability?

Conditional probability is the probability of event B occurring given that event A has already occurred. It is written P(B|A) and calculated as P(A ∩ B) ÷ P(A). It narrows the sample space to only the scenarios where A is true.

Why Conditional Probability Matters

In the real world, information arrives sequentially. Knowing that a loan applicant has previously defaulted changes the probability that they will default again. Knowing a test result is positive changes the probability of disease. Conditional probability is how we formally update probabilities with new information.

๐Ÿ“ Conditional Probability Formula
P(B | A) = P(A ∩ B) / P(A)
Read: "The probability of B given A equals the joint probability of A and B divided by the probability of A"

Variable breakdown:

  • P(B|A) — Conditional probability: probability of B occurring, given A has occurred
  • P(A ∩ B) — Joint probability: probability that BOTH A and B occur
  • P(A) — Marginal probability: probability that A occurs (must be > 0)

Step-by-Step Worked Example — Credit Screening

A bank's historical data on 10,000 loan applications shows:

Defaults (D)Does Not Default (D')Total
Poor Credit Score (P)4201,5802,000
Good Credit Score (G)807,9208,000
Total5009,50010,000

Question: Given that a borrower has a poor credit score, what is the probability they default?

1
Identify P(A): P(Poor Credit) = 2,000/10,000 = 0.20
2
Identify P(A ∩ B): P(Poor Credit AND Default) = 420/10,000 = 0.042
3
Apply formula: P(Default | Poor Credit) = 0.042 / 0.20 = 0.21 = 21%
4
Interpret: Among borrowers with poor credit, 21% are expected to default.
5
Compare: P(Default | Good Credit) = (80/10,000) / (8,000/10,000) = 0.008/0.80 = 0.01 = 1%. Poor credit borrowers default at 21× the rate of good credit borrowers.

Audit Example — Transaction Testing

During an audit, 1,000 transactions are reviewed. 60 contain errors. Of those 60 errors, 45 involve amounts over £50,000. What is the probability a transaction contains an error, given that it exceeds £50,000?

Assume 150 transactions exceed £50,000 total.

P(Error | >£50K) = P(Error AND >£50K) / P(>£50K) = (45/1000) / (150/1000) = 0.045 / 0.15 = 0.30 = 30%

Conclusion: High-value transactions have a 30% error rate — the auditor should prioritise sampling from this stratum.

⚠️ Common Mistakes in Conditional Probability
1. Confusing P(B|A) with P(A|B): P(Default | Poor Credit) ≠ P(Poor Credit | Default). These are entirely different questions with different answers — confusing them is called the "inverse fallacy."

2. Ignoring base rates: A diagnostic test with 95% accuracy doesn't mean 95% of positive results are true positives. The base rate of the condition matters enormously (this is what Bayes' Theorem addresses).
๐Ÿ”‘ Section 4.4 Key Takeaways
  • P(B|A) = P(A∩B) / P(A) — always divide the joint probability by the conditioning event's probability.
  • Conditional probability narrows the sample space to only the cases where A is true.
  • P(B|A) ≠ P(A|B) — confusing these is one of the most common errors in probability and statistics.
  • In finance: conditional probability models differentiated default risk by borrower segment.
  • In auditing: it allows risk-stratified sampling by directing attention to high-probability error areas.

Section 4.5 — Bayes' Theorem

๐ŸŽฏ Learning Goal

Apply Bayes' Theorem to update probability estimates as new evidence arrives — a critical tool in medical diagnosis, fraud detection, credit risk modelling, and machine learning classification.

⚡ Quick Answer: What is Bayes' Theorem?

Bayes' Theorem is a mathematical formula that updates the probability of a hypothesis when new evidence is obtained. It combines your prior belief with the likelihood of observing the evidence to produce a revised "posterior" probability. It is the formal mathematical foundation for rational belief updating.

The Formula

๐Ÿ“ Bayes' Theorem
P(A | B) = [ P(B | A) × P(A) ] / P(B)
Posterior = (Likelihood × Prior) / Evidence

Variable breakdown:

SymbolNameMeaning
P(A|B)Posterior probabilityUpdated probability of A after observing B
P(B|A)LikelihoodProbability of observing B if A is true
P(A)Prior probabilityInitial probability of A before observing B
P(B)Marginal probability / EvidenceTotal probability of observing B across all scenarios

The denominator P(B) is expanded using the Total Probability Rule:

๐Ÿ“ Expanded (Total Probability) Form
P(A | B) = P(B|A)·P(A) / [P(B|A)·P(A) + P(B|A')·P(A')]

Case Study 1 — Medical Diagnosis

๐Ÿฅ Case Study: Disease Screening

Problem: A disease affects 1% of the population. A diagnostic test is 99% sensitive (correctly identifies 99% of those who have the disease) and 95% specific (correctly identifies 95% of those who don't). A random person tests positive. What is the probability they actually have the disease?

Data:

  • P(Disease) = 0.01 (prior — base rate)
  • P(No Disease) = 0.99
  • P(Positive | Disease) = 0.99 (sensitivity)
  • P(Positive | No Disease) = 0.05 (1 − specificity = false positive rate)

Calculation:

P(Positive) = P(Pos|Disease)×P(Disease) + P(Pos|No Disease)×P(No Disease)
= (0.99×0.01) + (0.05×0.99) = 0.0099 + 0.0495 = 0.0594

P(Disease | Positive) = (0.99 × 0.01) / 0.0594 = 0.0099 / 0.0594 ≈ 0.167 = 16.7%

Interpretation: Despite the highly accurate test, a positive result means only a 16.7% chance of actually having the disease. Why? Because the disease is rare (1% base rate), so false positives vastly outnumber true positives in the population.

Decision: A positive result warrants further confirmatory testing rather than immediate treatment — understanding this prevents overdiagnosis and unnecessary harm.

Case Study 2 — Fraud Detection

๐Ÿ” Case Study: Financial Transaction Fraud

Problem: A bank's transaction monitoring system flags 2% of all transactions as fraudulent. The fraud detection algorithm correctly identifies 90% of actual fraudulent transactions (sensitivity). It generates false alarms for 3% of legitimate transactions. A transaction is flagged. What is the probability it is genuinely fraudulent?

Data:

  • P(Fraud) = 0.02; P(Legitimate) = 0.98
  • P(Alert | Fraud) = 0.90
  • P(Alert | Legitimate) = 0.03

Calculation:

P(Alert) = (0.90 × 0.02) + (0.03 × 0.98) = 0.018 + 0.0294 = 0.0474

P(Fraud | Alert) = (0.90 × 0.02) / 0.0474 = 0.018 / 0.0474 ≈ 0.38 = 38%

Interpretation: Only 38% of flagged transactions are truly fraudulent, despite the algorithm's 90% sensitivity. The remaining 62% are false positives — legitimate transactions incorrectly flagged. This is critical for operational efficiency: the fraud team should expect to investigate roughly 2.6 false alarms for every genuine fraud case.

Case Study 3 — Credit Risk

๐Ÿ’ณ Case Study: Loan Application Assessment

Problem: Based on historical data, 8% of loan applicants default within 3 years. A credit bureau score below 600 occurs in 25% of applicants overall, but in 70% of those who eventually default. A new applicant has a score below 600. What is their probability of defaulting?

Data:

  • P(Default) = 0.08; P(No Default) = 0.92
  • P(Score <600 | Default) = 0.70
  • P(Score <600 | No Default) = (0.25 − 0.08×0.70) / 0.92 ≈ 0.21

Calculation:

P(Score<600) = (0.70×0.08) + (0.21×0.92) = 0.056 + 0.193 = 0.249 ≈ 0.25

P(Default | Score<600) = (0.70 × 0.08) / 0.25 = 0.056 / 0.25 = 0.224 = 22.4%

Decision: The applicant's default probability has risen from the base rate of 8% to 22.4% based on their credit score. This materially affects pricing: the bank should charge a higher interest rate or require additional collateral to compensate for the elevated risk.

๐Ÿ”‘ Section 4.5 Key Takeaways
  • Bayes' Theorem updates prior probabilities with new evidence to produce posterior probabilities.
  • Low base rates (rare events) mean positive test results carry less information than intuition suggests — this is the base rate fallacy.
  • The denominator P(B) uses the Total Probability Rule: sum across all mutually exclusive scenarios.
  • Bayes' Theorem is the mathematical foundation of: spam filters, medical diagnostics, credit scoring, and Bayesian machine learning.

Section 4.6 — Probability Trees

๐ŸŽฏ Learning Goal

Build and interpret probability trees to visualise sequential events, calculate path probabilities, and support structured decision-making.

A probability tree is a diagram that maps all possible sequences of events, with branches labelled by their probabilities. Multiplying probabilities along a path gives the joint probability of that sequence. Adding across paths gives marginal probabilities.

How to Build a Probability Tree

1
Identify the first event and draw branches for each outcome. Label each branch with its probability.
2
For each outcome, draw sub-branches for the next event. Use conditional probabilities if events are dependent.
3
Verify: Probabilities at each node must sum to 1.
4
Calculate path probabilities by multiplying along each branch (chain rule).
5
Sum relevant paths to find the probability of any composite event.

Example — Product Launch Decision

A company launches a product. Market reception is either Strong (60%) or Weak (40%). If reception is strong, the probability of exceeding revenue target is 80%. If reception is weak, the probability of exceeding revenue target is 20%.

PathMarket ReceptionRevenue ResultCalculationPath Probability
Path 1Strong (0.60)Exceeds Target (0.80)0.60 × 0.800.48
Path 2Strong (0.60)Misses Target (0.20)0.60 × 0.200.12
Path 3Weak (0.40)Exceeds Target (0.20)0.40 × 0.200.08
Path 4Weak (0.40)Misses Target (0.80)0.40 × 0.800.32
Total (must = 1.00)1.00 ✓

P(Exceeds Revenue Target) = Path 1 + Path 3 = 0.48 + 0.08 = 0.56 = 56%

Audit Sampling Tree

An auditor tests transactions where 10% contain errors. She tests two transactions independently. The tree produces:

Transaction 1Transaction 2Probability
Error (0.10)Error (0.10)0.10 × 0.10 = 0.01
Error (0.10)No Error (0.90)0.10 × 0.90 = 0.09
No Error (0.90)Error (0.10)0.90 × 0.10 = 0.09
No Error (0.90)No Error (0.90)0.90 × 0.90 = 0.81
Total1.00 ✓

P(At least one error) = 1 − P(No errors) = 1 − 0.81 = 0.19 = 19%

๐Ÿ”‘ Section 4.6 Key Takeaways
  • Probability trees map all possible sequential outcomes — use them when events occur in stages.
  • Multiply probabilities along a path to get the joint probability of that sequence.
  • Add relevant path probabilities to find the probability of composite events (e.g., "at least one").
  • Branch probabilities at each node must always sum to exactly 1.

Section 4.7 — Probability Distributions

๐ŸŽฏ Learning Goal

Identify, select, and apply the four key probability distributions — Binomial, Poisson, Normal, and Uniform — to model real-world data in finance, operations, and research.

⚡ Quick Answer: What is a Probability Distribution?

A probability distribution is a mathematical function that describes every possible value a random variable can take and the probability associated with each value. Distributions are the bridge between individual probability calculations and statistical inference, hypothesis testing, and predictive modelling.

Overview of Key Distributions

DistributionTypeVariableKey ParameterClassic Application
BinomialDiscreteCount of successes in n trialsn (trials), p (probability)Loan defaults, quality defects, audit errors
PoissonDiscreteCount of events in fixed intervalฮป (average rate)System failures, fraud events per day, call arrivals
NormalContinuousSymmetric, bell-shapedฮผ (mean), ฯƒ (std dev)Stock returns, heights, exam scores
UniformContinuousEqual probability across rangea (min), b (max)Simulation inputs, random number generation

Binomial Distribution

๐Ÿ“ Binomial Probability Formula
P(X = k) = C(n,k) · p^k · (1−p)^(n−k)
C(n,k) = n! / [k!(n−k)!] · Mean = np · Variance = np(1−p)

Conditions (all four must hold):

  • Fixed number of trials: n
  • Each trial has only two outcomes: success or failure
  • Probability of success p is constant across all trials
  • Trials are independent of each other

Worked Example — Credit Defaults

A bank has 10 small business loans, each with an independent 15% probability of defaulting in the next year. What is the probability that exactly 2 of the 10 loans default?

1
Identify parameters: n = 10, k = 2, p = 0.15, (1−p) = 0.85
2
Calculate C(10,2): 10! / (2! × 8!) = (10 × 9) / (2 × 1) = 45
3
Calculate: P(X=2) = 45 × (0.15)² × (0.85)⁸ = 45 × 0.0225 × 0.2725 ≈ 0.2759 = 27.6%
4
Interpret: There is approximately a 27.6% chance that exactly 2 out of 10 loans default — the most likely single outcome. Expected defaults = 10 × 0.15 = 1.5 loans.

Poisson Distribution

๐Ÿ“ Poisson Probability Formula
P(X = k) = (e^−ฮป × ฮป^k) / k!
ฮป = average number of events · e ≈ 2.71828 · Mean = Variance = ฮป

Use when: Counting rare events over a fixed interval of time, space, or volume, where events occur independently and at a constant average rate.

Worked Example — Fraud Events

A bank experiences an average of 3 fraudulent transactions per day. What is the probability of exactly 5 fraudulent transactions occurring on a given day?

ฮป = 3, k = 5

P(X=5) = (e⁻³ × 3⁵) / 5! = (0.0498 × 243) / 120 = 12.1 / 120 ≈ 0.1008 = 10.1%

Normal Distribution

๐Ÿ“ Normal Distribution — Standardisation (Z-score)
Z = (X − ฮผ) / ฯƒ
Mean = ฮผ · Standard Deviation = ฯƒ · Symmetric bell curve around mean

The Empirical Rule (68-95-99.7 Rule):

Range% of Data IncludedFinance Example (Portfolio Return ฮผ=8%, ฯƒ=5%)
ฮผ ± 1ฯƒ68.27%Returns between 3% and 13%
ฮผ ± 2ฯƒ95.45%Returns between −2% and 18%
ฮผ ± 3ฯƒ99.73%Returns between −7% and 23%

Worked Example — Portfolio Returns (Value at Risk)

A portfolio has mean annual return ฮผ = 10%, standard deviation ฯƒ = 15%. What return is at the 5th percentile (the threshold below which the worst 5% of years fall)?

Z at 5th percentile = −1.645

X = ฮผ + Z×ฯƒ = 10% + (−1.645 × 15%) = 10% − 24.7% = −14.7%

Interpretation: In the worst 5% of years, this portfolio loses more than 14.7%. This is the 5% VaR (Value at Risk) — a key regulatory capital measure.

Uniform Distribution

๐Ÿ“ Uniform Distribution
P(a ≤ X ≤ b) = (b − a) / (total range) · f(x) = 1/(b−a)
All values in [a, b] are equally likely · Mean = (a+b)/2 · Variance = (b−a)²/12

Example:

A payment processing time is uniformly distributed between 1 and 5 days. What is the probability processing takes more than 3 days?

P(X > 3) = (5 − 3) / (5 − 1) = 2/4 = 0.50 = 50%

Complete Distribution Comparison

FeatureBinomialPoissonNormalUniform
TypeDiscreteDiscreteContinuousContinuous
Parametersn, pฮปฮผ, ฯƒa, b
Meannpฮปฮผ(a+b)/2
Variancenp(1−p)ฮปฯƒ²(b−a)²/12
ShapeSkewed to symmetricRight-skewed for small ฮปSymmetric bellFlat rectangle
Finance useLoan defaults, option pricing (CRR)Operational loss eventsAsset returns, portfolio riskSimulation inputs
Audit useSampling for attribute errorsError events per time periodContinuous control metricsRandom sampling selection
Key assumptionIndependent trials, constant pRare, independent events at constant rateSymmetric, many small influencesEqually likely outcomes
LimitationBinary outcomes onlyOnly for count dataNot for fat-tailed financial dataReal data rarely uniform
๐Ÿ”‘ Section 4.7 Key Takeaways
  • Binomial: fixed trials, binary outcome, constant probability, independent — use for counts of successes.
  • Poisson: count rare events per interval where ฮป = mean rate — mean equals variance.
  • Normal: symmetric, bell-shaped, defined by mean and standard deviation — 68-95-99.7 rule is fundamental.
  • Uniform: equal likelihood across a range — baseline model and simulation input.
  • Match the distribution to the data type and generating process, not just the shape of the data.

Section 4.8 — Real-World Applications

Probability in Finance

ApplicationProbability Concept UsedHow It Works
Value at Risk (VaR)Normal distributionFind return threshold below which losses occur with P = 1%, 5%
Probability of Default (PD)Empirical + BayesEstimate P(default) from historical data, updated by credit signals
Option Pricing (Black-Scholes)Log-normal distributionAsset prices assumed to follow log-normal process
Portfolio DiversificationJoint probability / correlationP(both assets fall) depends on their dependence structure
Monte Carlo SimulationAll distributionsGenerate thousands of random scenarios to estimate risk distributions

Probability in Auditing

Audit Risk ComponentProbability ConceptFormulaTypical Value
Inherent Risk (IR)Prior probability of errorAssessed judgmentally40–80%
Control Risk (CR)P(controls fail to prevent error)Assessed from control testing20–60%
Detection Risk (DR)P(auditor misses existing error)AR / (IR × CR)Set to achieve AR ≤ 5%
Audit Risk (AR)Joint probabilityAR = IR × CR × DR≤ 5% (professional standard)
Sampling RiskConfidence intervalsBased on sample size and tolerable error rate5% standard in most audits

Probability in Business

Business DecisionProbability ToolDecision Rule
New product launchDecision tree with probabilitiesLaunch if Expected Value > launch cost
Inventory managementPoisson distribution for demandSet safety stock to cover P(stockout) ≤ 5%
Customer churn predictionLogistic regression / BayesFlag customers with P(churn) > threshold for retention campaigns
Insurance pricingActuarial probability modelsPremium = E[claim] + risk loading + operating margin
Supplier selectionConditional probabilitySelect supplier with lowest P(late delivery | large order)

Section 4.9 — Case Study: Loan Default Risk

๐Ÿ“Š Full Case Study: Community Bank Loan Portfolio Risk Assessment

1. Business Problem

A community bank holds a portfolio of 500 small business loans. The risk management team needs to estimate the probability of default events in the coming year to determine required loan loss provisions (regulatory capital) and to set loan approval criteria for new applications.

2. Available Data

Data PointValueSource
Portfolio size500 loansLoan management system
Historical default rate (overall)6%5-year internal records
Sector A (retail) — 200 loans9% historical default rateCredit analysis
Sector B (manufacturing) — 300 loans4% historical default rateCredit analysis
New applicant credit scoreBelow 550Credit bureau
P(Score < 550 | Default)0.65Historical model
P(Score < 550 | No Default)0.08Historical model

3. Step 1 — Expected Number of Defaults (Binomial)

Treating defaults as approximately independent Binomial events:

Sector A (Retail): n = 200, p = 0.09 → E[defaults] = 200 × 0.09 = 18 loans

Sector B (Manufacturing): n = 300, p = 0.04 → E[defaults] = 300 × 0.04 = 12 loans

Portfolio total expected defaults: 18 + 12 = 30 loans out of 500 (6%)

4. Step 2 — Probability of 35 or More Defaults (Risk Tail)

Using Normal approximation to Binomial (n=500, p=0.06):

Mean = 500 × 0.06 = 30; SD = √(500 × 0.06 × 0.94) = √16.92 ≈ 4.11

Z = (35 − 30) / 4.11 = 1.22 → P(X ≥ 35) ≈ 1 − ฮฆ(1.22) = 1 − 0.889 = 11.1%

Interpretation: There is an 11.1% chance the portfolio experiences 35 or more defaults — a materially elevated loss scenario the bank must provision for.

5. Step 3 — Bayes' Theorem for New Applicant

A new applicant has a credit score below 550. Using Bayes' Theorem:

Prior P(Default) = 0.06

P(Score <550) = (0.65×0.06) + (0.08×0.94) = 0.039 + 0.0752 = 0.1142

P(Default | Score<550) = (0.65 × 0.06) / 0.1142 = 0.039 / 0.1142 = 0.342 = 34.2%

6. Risk Assessment and Decision

ScenarioDefault ProbabilityDecision Recommendation
Average portfolio loan6.0%Standard approval process
Retail sector loan9.0%Enhanced monitoring, higher rate
New applicant (score <550)34.2%Decline or require significant collateral
Portfolio tail (≥35 defaults)11.1%Maintain elevated loan loss provisions

7. Recommendations

  • Set loan loss provision to cover the 99th percentile scenario (Z=2.33): 30 + 2.33×4.11 ≈ 39.6 → provision for 40 defaults.
  • Apply sector-specific approval criteria: retail applicants face a higher default rate and require adjusted pricing or additional covenants.
  • Implement Bayesian score-based credit decision rule: applicants with score <550 face a 34.2% default rate — well above the bank's risk tolerance threshold of 10%.
  • Review portfolio concentration: 40% retail exposure driving disproportionate risk suggests sector diversification is warranted.

Common Probability Mistakes — The Critical 15

#MistakeWhy It HappensCorrect Approach
1Using accuracy over base rate in Bayes' problemsIgnoring base rates (how rare the event is)Always identify P(A) before applying Bayes
2Confusing P(B|A) with P(A|B)Assuming conditional probability is symmetricThese are different — always check which direction you need
3Adding probabilities of dependent eventsAssuming events are independent when they're notTest independence: check if P(A∩B) = P(A)×P(B)
4Forgetting to subtract P(A∩B) in Addition RuleCounting overlapping outcomes twiceUse P(A∪B) = P(A)+P(B)−P(A∩B) always
5Probability greater than 1 or less than 0Algebraic errors in calculationIf result outside [0,1], recheck; it's always invalid
6Gambler's Fallacy — "it's overdue"Misunderstanding independence over timeEach fair coin flip is independent; past results don't change future P
7Applying Binomial when events are not independentMisidentifying independenceVerify all four Binomial conditions before applying
8Using Normal distribution for small samplesAssuming normality without justificationCheck n ≥ 30 and data symmetry; use t-distribution for small n
9Confusing "or" (union) with "and" (intersection)Language ambiguity in probability problems"Or" = ∪ (at least one); "And" = ∩ (both)
10Equating probability with certainty at extreme valuesTreating P=0.99 as certainEven P=0.99 events fail 1% of the time; plan for tail risk
11Ignoring conditional probability in sequential eventsUsing unconditional probabilities throughout a treeUpdate probabilities at each node based on prior outcomes
12Misusing the Complement Rule for joint eventsTaking complement of a multi-event expression incorrectlyP(at least one) = 1 − P(none) — use this carefully
13Confusing Poisson and Binomial for count dataBoth model counts so seem interchangeableBinomial: fixed n, known p. Poisson: events per interval, unknown n
14Treating subjective probability as preciseExpert estimates reported with false precisionExpress as ranges or sensitivity analyses; acknowledge uncertainty
15Not verifying tree branch probabilities sum to 1Arithmetic errors in building treesAlways check: each node's branches must sum exactly to 1.00
✅ Module 4 — Master Key Takeaways
  • Probability measures likelihood: P(E) ∈ [0,1]; 0 = impossible, 1 = certain
  • Three rules are the core toolkit: Complement (1−P), Addition (∪ with overlap), Multiplication (∩ with conditioning)
  • Conditional probability P(B|A): Always divide by P(A); never confuse with P(A|B)
  • Bayes' Theorem: Updates prior beliefs with new evidence; base rates critically matter
  • Trees: Multiply along paths; add across paths for composite events
  • Distributions: Binomial (fixed trials), Poisson (rate-based counts), Normal (continuous symmetric), Uniform (equal likelihood)
  • Applications: VaR uses Normal; PD models use Bayes/empirical; audit risk uses multiplication rule; Binomial drives sampling theory

Section 4.10 — Practice and Assessment

Part A — 15 Numerical Practice Problems (with Solutions)

๐Ÿ“ Practice Problems
Work each problem independently before reading the solution. Show all steps.

P1. Basic Probability — Card Draw

A standard deck has 52 cards. What is the probability of drawing a red King?

Solution: Red Kings = 2 (King of Hearts + King of Diamonds). P = 2/52 = 1/26 ≈ 0.0385

P2. Complement Rule — Project Delivery

A project has a 35% chance of being delivered late. What is the probability it is delivered on time?

Solution: P(on time) = 1 − 0.35 = 0.65 = 65%

P3. Addition Rule — Survey Results

In a customer survey: P(satisfied) = 0.70, P(would recommend) = 0.65, P(satisfied AND would recommend) = 0.55. Find P(satisfied OR would recommend).

Solution: P(S∪R) = 0.70 + 0.65 − 0.55 = 0.80 = 80%

P4. Multiplication Rule — Audit Test

An auditor tests two transactions. P(Transaction 1 has error) = 0.08. P(Transaction 2 has error) = 0.08. Both are independent. Find P(both have errors).

Solution: P = 0.08 × 0.08 = 0.0064 = 0.64%

P5. Conditional Probability — Insurance

Of 1,000 policyholders: 200 are under 25 years old; 40 of those under 25 filed claims; 80 of the 800 over 25 filed claims. Find P(claim | under 25).

Solution: P(claim | under 25) = 40/200 = 0.20 = 20%. Compare: P(claim | over 25) = 80/800 = 10%. Young drivers claim at double the rate.

P6. Bayes' Theorem — Product Defects

Machine A produces 60% of output; Machine B produces 40%. Machine A has a 3% defect rate; Machine B has a 5% defect rate. A defective item is found. What is the probability it came from Machine B?

Solution: P(defect) = 0.60×0.03 + 0.40×0.05 = 0.018 + 0.020 = 0.038
P(B | defect) = (0.05×0.40)/0.038 = 0.020/0.038 = 0.526 = 52.6%

P7. Binomial — Quality Sampling

A sample of 8 items is drawn from a batch with 20% defect rate. Find P(exactly 3 defective).

Solution: P(X=3) = C(8,3)×(0.20)³×(0.80)⁵ = 56×0.008×0.3277 = 0.1468 ≈ 14.7%

P8. Poisson — Server Failures

A server experiences 2 crashes per month on average. Find P(exactly 4 crashes next month).

Solution: ฮป=2, k=4. P = e⁻²×2⁴/4! = 0.1353×16/24 = 0.0902 ≈ 9.0%

P9. Normal Distribution — Salary Analysis

Salaries are normally distributed: ฮผ = £45,000, ฯƒ = £8,000. What fraction of employees earns more than £53,000?

Solution: Z = (53,000 − 45,000)/8,000 = 1.00. P(X > 53,000) = 1 − ฮฆ(1.00) = 1 − 0.8413 = 0.1587 = 15.87%

P10. At Least One — System Reliability

Three independent backup systems each have a 5% failure probability. Find P(at least one fails).

Solution: P(none fail) = 0.95³ = 0.857. P(at least one fails) = 1 − 0.857 = 0.143 = 14.3%

P11. Binomial — Loan Portfolio

A portfolio of 20 loans each has P(default) = 0.05. What is the expected number of defaults and standard deviation?

Solution: E[X] = 20×0.05 = 1 loan. SD = √(20×0.05×0.95) = √0.95 = 0.975 loans

P12. Conditional + Bayes — Stress Test

Before a recession: P(firm fails) = 0.04. P(negative cash flow | failure) = 0.80. P(negative cash flow | survival) = 0.15. A firm shows negative cash flow. Find P(failure | negative cash flow).

Solution: P(NCF) = 0.80×0.04 + 0.15×0.96 = 0.032+0.144 = 0.176
P(fail|NCF) = 0.032/0.176 = 0.182 = 18.2%

P13. Normal — Inventory Management

Daily demand is normally distributed: ฮผ=100 units, ฯƒ=15 units. What safety stock is needed so that stockouts occur no more than 2.5% of the time?

Solution: Need P(demand ≤ stock) = 0.975 → Z = 1.96. Stock = 100 + 1.96×15 = 100+29.4 = 130 units (safety stock of 30 units)

P14. Probability Tree — Two-Stage Tender

Stage 1 success probability: 0.60. If Stage 1 succeeded, Stage 2 success probability: 0.70. Find P(both stages successful).

Solution: P = 0.60×0.70 = 0.42 = 42%

P15. Empirical Probability — Audit Sample

An auditor reviews 250 invoices and finds 18 with errors. Based on this sample, estimate the probability that a randomly selected invoice contains an error.

Solution: P(error) = 18/250 = 0.072 = 7.2% (empirical probability). This exceeds the tolerable error rate of 5%, indicating a control weakness requiring further investigation.

Part B — 25 Multiple Choice Questions

#QuestionAnswerExplanation
1P(A) = 0.40. What is P(A')?B) 0.60Complement: 1 − 0.40 = 0.60
2P(A) = 0.3, P(B) = 0.5, P(A∩B) = 0.2. P(A∪B) = ?C) 0.600.3+0.5−0.2 = 0.60
3Which distribution models the number of successes in 10 independent coin flips?A) BinomialFixed n, binary outcome, constant p, independent
4P(B|A) = 0.6, P(A) = 0.4. P(A∩B) = ?B) 0.24P(A∩B) = P(B|A)×P(A) = 0.6×0.4 = 0.24
5A disease occurs in 2% of the population. A test is 95% accurate. A positive test result's probability of being a true positive is:C) Less than 30%Base rate fallacy — low prevalence means many false positives
6In the Normal distribution, what % of values fall within ฮผ ± 2ฯƒ?B) 95.45%Empirical rule: 68-95.45-99.73
7Two events cannot happen simultaneously. They are:A) Mutually exclusiveMutually exclusive: P(A∩B) = 0
8The mean and variance of a Poisson distribution with ฮป=5 are:C) Both equal 5Poisson: mean = variance = ฮป
9Which probability type relies on historical data?B) EmpiricalEmpirical = observed frequency / total trials
10In Bayes' Theorem, P(A) before observing evidence is called the:A) Prior probabilityPosterior is the updated probability after evidence
11P(A) = 0.5, P(B) = 0.4, events independent. P(A∩B) = ?B) 0.20Independent: P(A∩B) = 0.5×0.4 = 0.20
12Which distribution applies to: "number of customer calls in one hour"?C) PoissonCount of independent events per fixed time interval at constant rate
13Audit risk = IR × CR × DR. If IR=0.8, CR=0.5, desired AR=0.05, what must DR be?B) 0.125DR = 0.05/(0.8×0.5) = 0.05/0.4 = 0.125
14A uniform distribution spans [2, 10]. Its mean is:C) 6Mean = (2+10)/2 = 6
15What is the probability of rolling a sum of 7 with two dice?B) 1/66 favorable outcomes out of 36: (1,6),(2,5),(3,4),(4,3),(5,2),(6,1)
16P(not getting a defect) = 0.92. P(getting a defect) = ?A) 0.08Complement: 1 − 0.92 = 0.08
17n=100, p=0.05 Binomial. Expected value and variance?C) E=5, Var=4.75E=np=5; Var=np(1-p)=100×0.05×0.95=4.75
18If P(A|B) = P(A), then A and B are:B) IndependentIndependence definition: conditioning on B doesn't change P(A)
19P(at least one success in 3 independent trials, p=0.3) = ?C) 0.6571 − P(none) = 1 − 0.7³ = 1 − 0.343 = 0.657
20Which metric measures the spread of a Normal distribution?A) Standard deviation (ฯƒ)ฯƒ determines the width of the bell curve
21Bayes' Theorem is most useful when:C) Updating probabilities as new evidence arrivesPosterior = (Likelihood × Prior) / Evidence
22A probability tree path shows: 0.4 → 0.7. The path probability is:B) 0.28Multiply along path: 0.4 × 0.7 = 0.28
23Which distribution is appropriate for continuous data with equal probability across all values in [a,b]?D) UniformUniform distribution: f(x) = 1/(b-a) for all x in [a,b]
24The formula P(A∩B) = P(A) × P(B|A) is the:A) General Multiplication RuleHolds for both dependent and independent events
25Z-score = 1.96 corresponds to approximately what percentile?C) 97.5th percentileฮฆ(1.96) ≈ 0.975; used in 95% confidence intervals

Frequently Asked Questions

What is the basic probability formula?
P(E) = Number of favorable outcomes / Total number of possible outcomes. This formula applies when all outcomes are equally likely. For unequal outcomes, use empirical (historical) frequency or theoretical models.
What is the difference between probability and statistics?
Probability starts from a known model and predicts outcomes (top-down: model → prediction). Statistics starts from observed outcomes and infers the underlying model (bottom-up: data → model). Both are deeply connected — statistics requires probability as its theoretical foundation.
What is conditional probability and why does it matter?
Conditional probability P(B|A) is the probability of B occurring given A has already occurred. It matters because real decisions involve partial information. A borrower's default probability given poor credit (22%) is very different from the unconditional default probability (6%). Ignoring conditioning leads to systematically wrong risk assessments.
What is Bayes' Theorem in simple terms?
Bayes' Theorem is a formula for updating your belief in something based on new evidence. Start with an initial probability (prior), multiply it by how likely you'd see the evidence if your belief were true (likelihood), and divide by the total probability of seeing that evidence. The result is your revised probability (posterior).
What is the difference between mutually exclusive and independent events?
Mutually exclusive events cannot both happen at the same time — if one occurs, the other cannot (e.g., one coin flip cannot be both heads and tails). Independent events don't affect each other's probability — knowing one occurred tells you nothing about whether the other occurred. Mutually exclusive events with positive probabilities are always dependent, not independent.
When should I use the Binomial versus Poisson distribution?
Use Binomial when: (1) you have a fixed number of trials n, (2) each trial is binary, (3) probability p is constant, and (4) trials are independent. Use Poisson when: counting events over a fixed interval (time, area, volume), you don't have a fixed maximum number of events, and events occur independently at a constant average rate ฮป. As a rule: if n is very large and p is very small, Binomial approaches Poisson with ฮป = np.
What does a probability of 0.05 mean?
A probability of 0.05 (5%) means the event is expected to occur in 5 out of every 100 trials on average under identical conditions. It does not mean the event will occur exactly 5 times in 100 trials — that's the expected value, not a certainty. It also does not mean the event "won't happen" — rare events do occur.
How is probability used in finance?
Finance uses probability in: Value at Risk (probability of loss exceeding a threshold), Probability of Default modelling (credit risk), option pricing (Black-Scholes uses log-normal distribution), Monte Carlo simulation (thousands of random scenarios for portfolio analysis), and portfolio diversification (joint probability of asset returns under correlation assumptions).
What is a probability distribution?
A probability distribution is a complete mathematical description of all possible values a random variable can take and the probability assigned to each value (or range of values for continuous variables). It tells you not just how likely one specific outcome is, but the entire landscape of possibilities and their likelihoods.
What is the Normal distribution and why is it so important?
The Normal distribution is a symmetric, bell-shaped continuous distribution defined by its mean (ฮผ) and standard deviation (ฯƒ). It is important because: (1) many natural phenomena approximate normality; (2) the Central Limit Theorem states that sample means follow a Normal distribution for large samples regardless of the underlying distribution; (3) it underlies t-tests, z-tests, confidence intervals, and regression. In finance, asset returns are often approximately modelled as Normal (though fat tails are a known limitation).
What is the difference between joint probability and conditional probability?
Joint probability P(A∩B) is the probability that both A and B occur simultaneously. Conditional probability P(B|A) is the probability of B given A has occurred. They are related by: P(A∩B) = P(A) × P(B|A). Joint probability is smaller than or equal to either P(A) or P(B); conditional probability can be larger than either depending on the relationship between the events.
What is the complement rule used for in practice?
The complement rule P(E') = 1 − P(E) is most useful when direct calculation is complex. The classic application is "at least one" probabilities: P(at least one success) = 1 − P(no successes). For example, P(at least one default in 50 loans with p=0.02 each) = 1 − (0.98)^50 = 1 − 0.364 = 0.636 = 63.6%.
Why do audit risk models use multiplication?
The Audit Risk Model (AR = IR × CR × DR) uses multiplication because the three risk components — Inherent Risk, Control Risk, and Detection Risk — are treated as independent events. A material misstatement reaches the financial statements only if inherent susceptibility to error (IR), control failure (CR), and auditor failure to detect (DR) all occur. The probability of all three occurring simultaneously = product of their individual probabilities.
What is the law of total probability?
The law of total probability states that if events A₁, A₂, ..., Aโ‚™ form a complete partition of the sample space (mutually exclusive and exhaustive), then P(B) = ฮฃ P(B|Aแตข) × P(Aแตข). It is used as the denominator in Bayes' Theorem and whenever you need to compute a marginal probability by summing across all conditioning scenarios.
How do I know which probability rule to apply?
Follow this decision framework: (1) Is the question about an event NOT happening? → Complement Rule. (2) Is the question about A OR B happening? → Addition Rule (subtract overlap if not mutually exclusive). (3) Is the question about A AND B both happening? → Multiplication Rule (use P(A)×P(B) if independent; P(A)×P(B|A) if dependent). (4) Does new information change the probability? → Conditional probability / Bayes' Theorem.

Final Summary — Module 4 Complete Recap

๐Ÿ“š Complete Module 4 Summary

Probability Fundamentals

  • Probability measures likelihood: P(E) ∈ [0,1]; P(impossible)=0; P(certain)=1
  • P(E) = n(E)/n(S) for equally likely outcomes
  • Three types: Classical (theory), Empirical (data), Subjective (judgment)

Probability Rules

  • Complement: P(E') = 1 − P(E)
  • Addition: P(A∪B) = P(A) + P(B) − P(A∩B)
  • Multiplication: P(A∩B) = P(A) × P(B|A); if independent: P(A)×P(B)

Conditional Probability

  • P(B|A) = P(A∩B) / P(A) — probability of B given A occurred
  • P(B|A) ≠ P(A|B) — these are different questions with different answers

Bayes' Theorem

  • P(A|B) = [P(B|A) × P(A)] / P(B) — update prior beliefs with evidence
  • Low base rates critically reduce the positive predictive value of tests

Probability Trees

  • Multiply along paths → add across relevant paths
  • Branch probabilities at each node must sum to 1

Probability Distributions

  • Binomial B(n,p): fixed trials, binary, independent. E=np, Var=np(1−p)
  • Poisson P(ฮป): events per interval. E=Var=ฮป
  • Normal N(ฮผ,ฯƒ): symmetric bell. Z=(X−ฮผ)/ฯƒ. 68-95-99.7 rule
  • Uniform U(a,b): equal probability. Mean=(a+b)/2

Applications

  • Finance: VaR (Normal), PD modelling (Bayes/empirical), option pricing (log-normal)
  • Audit: AR = IR × CR × DR; sampling risk via confidence intervals
  • Business: Decision trees, inventory management, customer analytics

▶ Continue Learning

Module 5 — Sampling and Estimation

In Module 5, you will see exactly how the probability foundations you have built in this module come alive in practice. Sampling theory uses the Binomial distribution to determine how many transactions an auditor must test. Estimation uses the Normal distribution to construct confidence intervals around sample means. Hypothesis testing — the most powerful tool in applied statistics — is entirely built on conditional probability: "What is the probability of observing this result if the null hypothesis were true?" Every concept in Module 5 and beyond is a direct application of what you learned here.

Topics in Module 5: Random sampling methods · Central Limit Theorem · Point estimation · Confidence intervals (z and t) · Sample size determination · Applications in auditing, finance, and research


Module 4: Probability Fundamentals · Applied Statistics Course · © Educational Content