The knowledge map
Probability becomes easier when each topic is seen as part of one connected system. Counting describes possible worlds; probability weights them; random variables translate outcomes into numbers; distributions describe those numbers; limit laws explain stable patterns.
Model uncertainty
Experiments, outcomes, events and probability axioms.
02Count possibilities
Permutations, combinations and Pascal's triangle.
03Update knowledge
Conditioning, independence and Bayesian reversal.
04Measure outcomes
Random variables, PMFs, PDFs and CDFs.
05Summarize behavior
Expectation, variance, covariance and correlation.
06Reveal stability
Law of Large Numbers and Central Limit Theorem.
07Compute by chance
Simulation and Monte Carlo approximation.
08Follow random evolution
Random walks, Markov chains and long-run behavior.
09Build rigor
Measure theory, convergence, martingales and Brownian motion.
10Use the toolkit
A compact reference for the most important formulas.
Events, sets and the probability model
A probability model is a disciplined description of uncertainty. It identifies what can happen, which collections of outcomes matter, and how much probability each event receives.
The three-part model
Sample space: the set of all possible outcomes. For two coin tosses: Ω = {HH, HT, TH, TT}.
Events: selected subsets of Ω. Example: “exactly one head” is {HT, TH}.
Probability measure: a rule assigning each event a number from 0 to 1.
Interactive event geometry
Choose a set operation and see which region is selected.
Non-negativity
Probability cannot be negative.
Normalization
Some outcome in the model must occur.
Countable additivity
When the events are pairwise disjoint.
Count before you calculate
In finite equally likely models, probability is a ratio of counts. The central question becomes: how many favorable outcomes exist, and how many outcomes exist in total?
Permutation & combination calculator
Order matters for permutations; it does not matter for combinations.
Pascal's triangle
Each value is the sum of the two values above it and equals a binomial coefficient.
Multiplication principle
If one stage has m possibilities and the next has n, together they have mn.
Permutations
Use when the sequence or assignment matters: rankings, passwords without repetition, schedules.
Combinations
Use when only membership matters: committees, subsets, lottery selections.
Probability after information arrives
Conditioning restricts the universe to outcomes compatible with new evidence. Bayes' theorem then reverses the direction of a conditional probability.
The logic of conditioning
Read this as: among outcomes where B happened, what fraction also belong to A?
Bayes' theorem
The prior P(A) is updated by the likelihood P(B|A), producing the posterior P(A|B).
A ⫫ B ⇔ P(A ∩ B)=P(A)P(B)
Bayesian diagnostic simulator
See why a highly accurate test can still produce surprising posterior probabilities.
Turn outcomes into numbers
A random variable is a function from outcomes to numerical values. This translation lets algebra and calculus operate on uncertainty.
Probability mass function
Probability is concentrated on separate values. The masses sum to one.
Probability density function
Probability is area under a density curve. A single exact point usually has probability zero.
Cumulative distribution
The CDF works for discrete, continuous and mixed distributions and never decreases.
Outcome ω
One possible result of the experiment.
Variable X(ω)
A numerical feature extracted from that outcome.
Distribution
How probability is allocated across values of X.
Expectation
The probability-weighted center.
Variance
The expected squared distance from the center.
Inference
Use observations to learn about hidden structure.
Explore the shapes of chance
Different mechanisms generate different distribution families. Adjust parameters and watch the location, spread, skew and discreteness change.
Bars represent probability mass; curves represent probability density. In a density, height is not probability by itself—area is.
Center, variability and relationship
Expectation summarizes long-run balance. Variance measures dispersion. Covariance and correlation describe how two random variables move together.
A weighted average over possible values.
The density-weighted center of mass.
Average squared deviation from the mean.
Standardized linear association from -1 to 1.
Weighted balance: a loaded die
Change the probability of rolling six; the remaining outcomes share the rest equally.
Three ideas that prevent common mistakes
Expectation need not be a possible outcome. A fair die has mean 3.5, though 3.5 cannot be rolled.
Linearity is powerful. E[X+Y]=E[X]+E[Y], even without independence.
Zero correlation is weaker than independence. Nonlinear dependence can remain invisible to correlation.
How randomness becomes regular
The Law of Large Numbers stabilizes averages. The Central Limit Theorem explains why sums and averages often have an approximately normal shape.
Law of Large Numbers
Watch the running proportion of heads settle near the true probability.
Central Limit Theorem
Average several dice repeatedly. As the number of dice grows, the histogram becomes bell-shaped.
Law of Large Numbers
It is about convergence of the sample average to the population mean.
Central Limit Theorem
It is about the limiting shape of standardized sums.
Monte Carlo: computation by random experiment
Simulation turns probability into a numerical laboratory. It can estimate quantities that are difficult to compute analytically and reveal behavior before a proof is available.
Estimate π with random points
The fraction landing inside a quarter-circle approaches π/4.
The simulation cycle
Model the experiment and its assumptions.
Generate pseudorandom outcomes from the model.
Measure the quantity of interest in each run.
Aggregate results into an estimator.
Assess error by repetition, variance or confidence bounds.
To reduce random error by a factor of 10, roughly 100 times as many trials are needed.
Probability through time
A stochastic process is a collection of random variables indexed by time or space. It models evolving uncertainty: queues, reliability, diffusion, finance, communications and many other systems.
One-dimensional random walk
At each step move +1 with probability p and -1 otherwise.
Two-state Markov chain
The next state depends only on the current state, not the complete past.
Beyond the introductory layer
Advanced probability provides the rigorous language and tools needed for infinite sample spaces, dependent processes and continuous-time motion.
Formula field guide
Use formulas as compressed statements of ideas. Before substituting numbers, identify the event, the assumptions and the meaning of each symbol.
Complement
Often the fastest route to “at least one.”
Union of two events
Subtract overlap once.
Total probability
Partition the sample space into cases.
Bayes' theorem
Reverse a conditional probability.
Binomial PMF
Number of successes in n independent Bernoulli trials.
Poisson PMF
Counts in a fixed interval under a constant rate model.
Normal density
Symmetric continuous model with mean μ and variance σ².
Exponential density
Waiting time under a constant event rate.
Standardization
Express a value in standard deviation units.
Variance shortcut
Frequently simplifies calculation.
Covariance of sums
The covariance term disappears under independence.
Conditional expectation
The law of total expectation.
A practical route through probability
Move from intuition to calculation, then from calculation to proof. Simulation should accompany every stage rather than being postponed until the end.
Learn the language
- Sets, events and sample spaces
- Counting principles
- Probability axioms
- Conditional probability and Bayes
- Independence
Learn the machinery
- Discrete and continuous variables
- PMF, PDF and CDF
- Expectation and variance
- Joint distributions and covariance
- Common distribution families
Learn the deep patterns
- Convergence concepts
- Law of Large Numbers
- Central Limit Theorem
- Markov chains and random walks
- Measure theory and stochastic processes
Check 1
A fair coin is tossed 10 times. Is the sequence HHHHHTTTTT more probable than HHTHTTHTHT?
Check 2
Can two events be mutually exclusive and independent when both have positive probability?
Check 3
Does the Central Limit Theorem say that the original population becomes normal?
Check 4
Why can a 95%-sensitive test still have a low probability of disease after a positive result?
Sources used to shape this atlas
The structure combines an intuitive introductory route with the deeper measure-theoretic and stochastic-process perspective found in advanced probability texts.