In elementary probability theory, the fundamental object of interest is the collection of outcomes of a given experiment or observation (the so-called sample space $\Omega$). One then studies probability functions on this space: functions $\mathbb{P}:\wp(\Omega) \to [0,1]$ which associate a number between $0$ and $1$ to subsets of $\Omega$ according to some mild axioms, namely that $\mathbb{P}(\Omega)=1$ and that $\mathbb{P}$ is ``additive'' on (countable) collections of pairwise disjoint sets. Often this analysis is broken into two diametrically opposed cases: those in which the objects in question are discrete (in which case a healthy dose of combinatorics is often what is needed to resolve questions), and those in which they are instead continuous (in which case the tools of calculus, particularly integrals, are the key ingredient).
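For concreteness, the additivity axiom can be written out as follows, where the sets $A_1, A_2, \dots \subseteq \Omega$ are notation introduced here for illustration and are assumed to be pairwise disjoint:
\[
\mathbb{P}\Bigl(\,\bigcup_{i=1}^{\infty} A_i\Bigr) = \sum_{i=1}^{\infty} \mathbb{P}(A_i).
\]
As a small discrete example, take a fair die with $\Omega = \{1,2,3,4,5,6\}$ and $\mathbb{P}(A) = |A|/6$. Then
\[
\mathbb{P}(\{2,4,6\}) = \mathbb{P}(\{2\}) + \mathbb{P}(\{4\}) + \mathbb{P}(\{6\}) = \tfrac{1}{6} + \tfrac{1}{6} + \tfrac{1}{6} = \tfrac{1}{2}.
\]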
The first pass at probability, however, often passes over subtleties which become more glaring as one's mathematical sophistication grows. Is it feasible to attach a probability to all subsets of $\Omega$ in a sensible way? How do probability functions behave under limits? Is the bifurcation between continuous and discrete accurate? And how does one handle probability functions which are modeled by non-integrable functions? In this course we give a proper treatment of probability by starting with a general approach, measure theory, that unifies the discrete and continuous cases. We then review the key facets of probability theory (random variables, expectation, etc.) in this light, and use the resulting framework to answer questions which the naive theory cannot easily resolve.
