Kardinalna aritmetika Lecture notes
Master's course taught by Alex Simpson. Warning: The notes are not optimized for HTML (yet). Without warranty.
Introduction
Every infinite subset of \(\R\) is either in bijection with \(\N\) or in bijection with \(\R\).
This is independent from ZFC.
We also have a more explicit stronger version.
\[ 2^{\aleph_0} = \aleph_1 \] as an equation in cardinal arithmetic. Note that this version implies the regular continuum hypothesis.
- Interrelationship with general math
- Touted as a foundation for mathematics
- Leads to new interesting areas of mathematics
- Provides the tools for studying independence questions
We give the axioms in a way that mirrors the way mathematicians actually use them.
Our axioms will tell us which classes can be sets:
- Assume a universe \(\mathcal{U}\) of all mathematical entities that we may use as elements of sets.
- A collection of elements of the universe is called a class.
- If \(P(x)\) is a property of elements of \(\mathcal{U}\), then \(\{x \mid P(x)\}\) satisfies \[ y \in \{x \mid P(x)\} \iff P(y). \]
- A set is a class that is itself an element of \(\mathcal{U}\).
Not every class is a set.
Let \[ \Class{R} = \{x \mid x \text{ is a set and } x \not\in x\} \] be a class. If \(\Class{R}\) were a set, then \[ \Class{R} \in \Class{R} \iff \Class{R} \not\in \Class{R}, \] a contradiction.
If \(\Class{X}, \Class{Y}\) are classes and \(\forall z.\, z \in \Class{X} \iff z \in \Class{Y}\), then \(\Class{X} = \Class{Y}\).
\(\emptyset := \{x \mid \bot\}\) is a set.
For any \(x, y\), \(\{x, y\} := \{z \mid z = x \lor z = y\}\) is a set. In particular \(\{x\} := \{x,x\}\) is a set.
For a class \(\Class{X}\) of sets, \[ \bigcup \Class{X} := \{z \mid \exists Y \in \Class{X}.\, z \in Y\}. \] If \(X\) is a set of sets, then \(\bigcup X\) is a set.
For a class \(\Class{X}\), we define its powerclass as \[ \mathcal{P}\Class{X} := \{Y \mid Y \text{ is a subset of } \Class{X}\}. \] If \(X\) is a set, then \(\mathcal{P}X\) is a set. We call it a powerset.
If \(X\) is a set and \(P(x)\) is a property of elements of \(\mathcal{U}\), then \[ \{x \in X \mid P(x)\} \] is a set.
If \(X\) is a set and \(F : X \rightarrow \mathcal{U}\) is a class function, then \(\operatorname{image}(F)\) is a set.
A class function from \(\Class{X}\) to \(\Class{Y}\) is a property \(F(x,y)\) of elements \(x,y\) of \(\mathcal{U}\) satisfying:
- For all \(x \in \Class{X}\) there exists a unique \(y \in \Class{Y}\) such that \(F(x,y)\).
- If \(F(x,y)\), then \(x \in \Class{X}\) and \(y \in \Class{Y}\).
Then \[ \operatorname{image}(F) := \{y \in \Class{Y} \mid \exists x \in \Class{X}.\, F(x,y)\}. \]
A product of classes \(\Class{X}\) and \(\Class{Y}\) is a class \(\Class{X} \times \Class{Y}\) together with class functions \(\pi_1 : \Class{X} \times \Class{Y} \rightarrow \Class{X}\) and \(\pi_2 : \Class{X} \times \Class{Y} \rightarrow \Class{Y}\) satisfying \[ \forall x \in \Class{X}.\, \forall y \in \Class{Y}.\, \exists! z \in \Class{X} \times \Class{Y}.\, \pi_1(z) = x \land \pi_2(z) = y. \]
Any two products of \(\Class{X}\) and \(\Class{Y}\) are isomorphic.
For every \(\Class{X}\) and \(\Class{Y}\), a product \(\Class{X} \times \Class{Y}\) with \(\pi_1, \pi_2\) exists.
Define \[ \Class{X} \times \Class{Y} := \left\{ \left\{ \{x\}, \{x, y\} \right\} \mid x \in \Class{X},\, y \in \Class{Y} \right\} \] and \[ \pi_1(z) := \text{the unique } x \in \Class{X} \text{ such that } \exists y \in \Class{Y}.\, z = \{\{x\}, \{x,y\}\}, \] and similarly for \(\pi_2\).
If \(X\) and \(Y\) are sets, then so is \(X \times Y\).
Suppose \(F : \Class{X} \rightarrow \Class{Y}\) is a class function. Its graph is \[ \Gamma(F) := \{(x, y) \in \Class{X} \times \Class{Y} \mid y = F(x)\} \subseteq \Class{X} \times \Class{Y}. \] If \(X\) is a set and \(F : X \rightarrow Y\) is a class function, then \(\Gamma(F)\) is a set by Replacement.
The exponential class is defined as \[ \Class{Y}^{X} := \left\{ \Gamma(F) \mid F : X \rightarrow \Class{Y} \text{ is a function} \right\}. \]
If \(Y\) is a set then \(Y^X\) is a set.
A natural number structure is a triple \((\Class{N}, 0, S)\) where \(\Class{N}\) is a class, \(0 \in \Class{N}\), and \(S : \Class{N} \rightarrow \Class{N}\) satisfying:
- \(S\) is injective: \(\forall x, y \in \Class{N}.\, S(x) = S(y) \implies x = y\).
- Acyclicity: \(\forall x \in \Class{N}.\, S(x) \neq 0\).
- Induction: if \(\Class{X}\) satisfies \(0 \in \Class{X}\) and \(\forall x \in \Class{N}.\, x \in \Class{X} \implies S(x) \in \Class{X}\), then \(\Class{N} \subseteq \Class{X}\).
A natural number structure exists.
Construct \(\Class{N}\) as the class of Von Neumann natural numbers (see below).
Suppose \(\mathcal{A}\) is a class, \(a \in \mathcal{A}\), and \(F : \mathcal{A} \rightarrow \mathcal{A}\). Then there exists a unique class function \(R : \Class{N} \rightarrow \mathcal{A}\) such that
- \(R(0) = a\),
- \(R(S(x)) = F(R(x))\).
Suppose \(\mathcal{A}\) is a class, \(\Class{Z}\) is a class, \(a : \Class{Z} \rightarrow \mathcal{A}\), and \(F : \Class{Z} \times \mathcal{A} \rightarrow \mathcal{A}\). Then there exists a unique class function \(R : \Class{Z} \times \Class{N} \rightarrow \mathcal{A}\) such that
- \(R(z, 0) = a(z)\),
- \(R(z, S(x)) = F(z, R(z, x))\).
Apply the PRT with \(\mathcal{A} = \Class{N}\), \(\Class{Z} = \Class{N}\), \(B(z) = z\), and \(F(z, x) = S(x)\). The unique resulting \[ R : \Class{N} \times \Class{N} \rightarrow \Class{N} \] satisfies
\begin{align*} R(z, 0) &= z, \\ R(z, S(x)) &= S(R(z, x)). \end{align*}This \(R\) is addition on \(\Class{N}\).
For a set \(n\) define \(n^{+} := n \cup \{n\}\). A set \(X\) is a vN-number if:
- For every subset \(Y \subseteq X\) satisfying \[ \emptyset \in X \implies \emptyset \in Y \quad \text{and} \quad \forall z \in Y.\, z^{ + } \in X \implies z^{ + } \in Y, \] it holds that \(Y = X\).
- \(X = \emptyset\) or \(\exists z \in X\) such that \(X = z^{+}\).
The class of all vN-numbers is \[ \mathrm{vN} := \{x \mid x \text{ is a set and a vN-number}\}. \]
The Von Neumann natural numbers structure is a set. Equivalently: \[ \exists X.\, \emptyset \in X \land \forall y \in X.\, y \text{ is a set} \implies y^{+} \in X. \]
The axioms of set theory (ZFC-style, synthetic formulation):
- Extensionality
- Empty Set
- Pairing
- Powerset
- Separation
- Replacement
- Axiom of Infinity
Comparison of cardinalities
Classes are collections of elements of the universe \(\Class{\mathcal{U}}\). Sets are classes that are themselves elements of \(\Class{\mathcal{U}}\). Sets are axiomasised by the axioms from lecture 1: extensionality, emptyset, pairing, union, powerset, separation, replacement, and infinity.
Let \((\N, \dots)\) be a chosen natural numbers structure. Then \(\Z, \Q\) and \(\R\) can be derived using standard mathematical constructions, e.g. Dedekind cuts.
For the time being we will work without the axiom of choice.
One intuition for sets is that they are "small" classes. There are two observations consistent with this:
- Suppose \(A\) is a set and \(\Class{B}\) a class and we have a surjective class function \(A \twoheadrightarrow \Class{B}\) then \(\Class{B}\) is a set (by replacement).
- Suppose \(\Class{A}\) is a class and \(B\) a set and we have an injective class function \(\Class{A} \rightarrowtail B\), then \(\Class{A}\) is a set (using separation and replacement).
Two sets \(A, B\) have the same cardinality (are equipotent), if there exists a bijective function \(f : A \rightarrow B\) between them. We write this as \[ \abs{A} = \abs{B}. \]
\noindent Note we don't yet have an independent meaning for \(\abs{A}\).
Since \(\abs{ ~ \cdot ~ } = \abs{ ~ \cdot ~}\) is given by an isomorphism, it is in particular an equivalence relation:
\begin{align*} \abs{A} &= \abs{A} \\ \abs{A} = \abs{B} &\implies \abs{B} = \abs{A} \\ \abs{A} = \abs{B} \land \abs{B} = \abs{C} &\implies \abs{A} = \abs{C} \end{align*}A set \(A\) has cardinality less than or equal to a set \(B\) if there exists an injective function \(i : A\rightarrowtail B\) from \(A\) to \(B\). We write \[ \abs{A} \leq \abs{B} \]
\noindent
The relation is obviously reflexive and transitive due to monomorphism composition. Antisymmetry follows from the following theorem.
Assume \(\abs{A} \leq \abs{B}\) and \(\abs{B} \leq \abs{A}\). Then \(\abs{A} = \abs{B}\).
\noindent It was reportedly first proved by Dedekind.
If \(B_0 \subseteq A_0\) and \(\abs{A_0} \leq \abs{B_0}\), then \(\abs{B_0} = \abs{A_0}\).
Let \(f : A_0 \rightarrow B_0\) be injective. Define two sequences of subsets of
\begin{align*} A_{n+1} &:= f(A_n) \\ B_{n+1} &:= f(B_n) \end{align*}Define \(C_n = A_n \setminus B_n \) for every \(n \geq 0\). Likewise define \[ C = \bigcup\limits_n C_n \text{ and } D = A_0 \setminus C. \]
We have
\begin{align*} f(C_n) &= f(A_n \setminus B_n) \\ &= f(A_n) \setminus f(B_n) \text{ because } f \text{ is injective} \\ &= A_{n+1} \setminus B_{n+1} \\ &= C_{n+1} \end{align*}So we have \[ f(C) = f\left(\bigcup\limits_{n\geq 0} C_n\right) = \bigcup\limits_{n \geq 0} f(C_n) = \bigcup\limits_{n \geq 0} C_{n+1} = C \setminus C_0 \] Then we have that \(f : C \rightarrow C \setminus C_0\) is a bijection. Then \[ B_0 = B \uplus C \setminus C_0 \cong D \uplus C = A_0. \]
Suppose \(\abs{A} \leq \abs{B}\) and \(\abs{B} \leq \abs{A}\). Then we have injective functions \[ f : A \rightarrow B \text{ and } g : B \rightarrow A. \] Define \(A_0 = A\) and \(B_0 = g(B)\). Then \[ g \circ f : A \rightarrow B_0 \] is injective so \(\abs{A_0} \leq \abs{B_0}\). By the lemma, we have \[ \abs{A} = \abs{A_0} = \abs{B_0} = \abs{B}. \]
A set \(A\) has cardinality strictly less than a set \(B\) if \[ \abs{A} < \abs{B} :\iff \abs{A} \leq \abs{B} \land \abs{A} \neq \abs{B}. \]
\leavevmode
- Strict ordering between cardinalities is irreflexive and transitive
- \(\abs{A} \leq \abs{B} < \abs{C} \leq \abs{D} \implies \abs{A} < \abs{D}\).
- It is not a linear relation, even with choice.
A set \(A\) has cardinality very strictly less than a set \(B\) if we have \(\abs{A} \ll \abs{B} :\iff \abs{A} \leq \abs{B}\) and there is no surjection from \(A\) to \(B\). If \(\abs{A} \ll \abs{B}\), then \(\abs{A} < \abs{B}\), and we have the sandwiching property \[ \abs{A} \leq \abs{B} \ll \abs{C} \leq \abs{D} \implies \abs{A} \ll \abs{D}. \]
\noindent
We have
\begin{align*} \abs{X} \leq \abs{Y} \text{ and } \abs{Y} \ll \abs{Z} &\implies X \ll \abs{Z} \\ \abs{X} \ll \abs{Y} \text{ and } \abs{Y} \leq \abs{Z} &\implies X \ll \abs{Z} \\ \end{align*}Suppose first we have \(\abs{X} \leq \abs{Y}\) and \(\abs{Y} \ll \abs{Z}\). Let \(i : X \rightarrow Y\) and \(j : Y \rightarrow Z\) be injections. Then since \(j \circ i\) is injective we have \(\abs{X} \leq \abs{Z}\). Suppose, for contradiction, that there exists a surjection \(g : X \rightarrow Z\). We define \(h : Y \rightarrow Z\) by \[ h(y) = \begin{cases} g(x) & \text{ if } y = i(x) \\ j(y) & \text{ if } y \not\in i(x) \end{cases} \] Then \(h\) is well-defined because \(i\) is injective. Consider any \(z \in Z\). Because \(g\) is surjective, we have \(z = g(x)\) for some \(x \in X\). Then \(z = h(i(x))\). This shows \(h\) is surjective, contradicting \(\abs{Y} \ll \abs{Z}\).
Now suppose we have \(\abs{X} \ll \abs{Y}\) and \(\abs{Y} \leq \abs{Z}\). This gives us the injections \(i : X \rightarrow Y\) and \(j : Y \rightarrow Z\). Now suppose for contradiction that \(g : X \rightarrow Z\) is surjective. Define \(h: X \rightarrow Y\) by \[ h(x) = \begin{cases} y & \text{ if } j(y) = g(x) \\ i(x) & \text{ if } g(x) \not\in j(y) \end{cases} \] Then \(h\) is well-defined because \(j\) is injective. Consider any \(y \in Y\). Since \(g\) is surjective, there exists \(x \in X\) such that \(g(x) = j(y)\). Hence \(h(x) = y\). This shows \(h\) is surjective, contradicting \(\abs{X} \ll \abs{Y}\).
The standard finite sets are, for \(n \in \N\), \[ [n] = \{m \in \N \mid m < n\}. \] Herein we define a function \([ ~ \cdot ~ ] : \N \rightarrow \pot \N\) using the recursion theorem.
\leavevmode
- \(\forall m, n \in \N .\, \abs{[m]} \leq \abs{[n]} \iff m \leq n\).
- \(\forall n \in \N .\, \abs{[n]} \ll \abs{\N}\).
In particular, \(\N\) is not finite.
Let us prove the first statement. First, if we have \(m \leq n\), then \(k \mapsto k\) is an injection on \([m] \rightarrow [n]\).
We prove the right implication by induction on \(n\), i.e. showing \[ \forall n \in \N .\, \left[ \forall m \in \N .\, \abs{[m]} \leq \abs{[n]} \implies m \leq n\right]. \] Assume \(n = 0\). We need to show \(\forall m \in \N .\, \abs{[m]} \leq \abs{[0]} \implies m \leq 0\). Suppose \(j : [m] \rightarrow [0]\) is injective. Because the codomain \([0]\) is \(\emptyset\), the domain \([m]\) is also \(\emptyset\). Hence we must have \(m = 0\) since for \(m > 0\) we get \(0 \in [m]\).
For \(n > 0\) we assume the induction hypothesis \(\forall m \in \N .\, \abs{[m]} \leq \abs{[n - 1]} \implies m \leq n - 1\). We show \(\forall m \in \N .\, \abs{[m]} \leq \abs{[n]}\implies m \leq n\). Suppose \(j : [m] \rightarrow [n]\) is injective.
If \(n - 1 \not\in j([m])\), we have \(j([m]) \subseteq [n - 1]\). So \(j\) defines an injection \([m] \rightarrow [n - 1]\). By the induction hypothesis, we get \(m \leq n - 1 \leq n\).
Now if \(n - 1 \in j([m])\), because \(j\) is injective, there is exactly one \(u \in [m]\) such that \(j(u) = j - 1\). Define \(h : [m - 1] \rightarrow [n - 1]\) with \[ h(i) = \begin{cases} j(i) & \text{ if } i < u \\ j(i + 1) & \text{ if } i \geq u \end{cases} \] Because \(j\) is injective, so is \(h\). By the induction hypothesis, it follows that \(m - 1 \leq n - 1\). Therefore \(m \leq n\).
A set \(A\) is finite if there exists \(n \in \N\) such that \(\abs{A} = \abs{[n]}\). In this case we define its cardinality \(\abs{A} = n\).
\noindent Note that this is well defined, since there is a unique such \(n\) by the previous proposition.
\leavevmode
- If we have \(A\) finite and \(A \twoheadrightarrow B\), then \(B\) is finite
- If \(B\) is finite and \(A \rightarrowtail B\), then \(A\) is finite
Think about how we could define finiteness without referring to the structure of \(\N\).
\leavevmode
- A set is infinite if it is not finite.
- A set \(X\) is Dedekind infinite if there exists some proper subset \(X' \subsetneq X\) such that \(\abs{X} = \abs{X'}\).
Consider the following.
- \(X\) is infinite.
- \(X\) is Dedekind infinite.
- There exists a proper injection \(i : X \rightarrowtail X\).
- \(\abs{\N} \leq \abs{X}\).
Then we have \[ (2) \iff (3) \iff (4) \implies (1). \] Furthermore, \(1 \implies 4\) holds under the axiom of dependent choice. It can also be proved using the weaker axiom of countable choice, but the proof is less intuitive. We will meet CC later in the course.
\leavevmode First, the equivalence \(2 \iff 3\) is easy. For \(4 \implies 3\), suppose \(e : \N \rightarrow X\) is injective. Define \(i : X \rightarrow X\) by \[ i(x) = \begin{cases} x & \text{if } x \neq e(\N) \\ e(n + 1) & \text{if } x = e(n) \end{cases} \] Then \(i\) is a proper injection because \(e(0) \not\in i(x)\).
For \(3 \implies 4\) suppose \(i : X \rightarrow X\) is a proper injection. Because it's proper, there exists a \(x_0 \in X \setminus i(x)\). Define \(e : \N \rightarrow X\) by \[ e(0) = x_0 \text{ and } e(n + 1) = i(e(n)). \] The injectivity of \(e\) is left as an exercise.
For \(4 \implies 1\) suppose that \(\abs{\N} \leq \abs{X}\). Suppose for contradiction that \(X\) is finite. Then there exists \(n\) with \(\abs{X} = \abs{[n]}\). So \(\abs{\N} \leq \abs{X} = \abs{[n]}\), showing that \(\abs{\N} \leq \abs{[n]}\). Since we also have \(\abs{[n]} \leq \abs{\N}\), we have \(\abs{\N} = \abs{[n]}\). This contradicts \(\abs{[n]} \ll \abs{\N}\) from the previous proposition.
Finally, for \(1 \implies 4\), suppose \(X\) is infinite. We define an injection, that is, a sequence \((x_n)_{n\in \N}\) of distinct elements of \(X\). Suppose \((x_i)_{0 \leq i < n}\) is given. Since \(X\) is infinite, the function \(i \mapsto x_i : [n] \rightarrow X\) is not a bijection, but it is an injection, hence it cannot be surjective. Thus we can choose \[ x_n \in X \setminus \{x_i \mid 0 \leq i < n\} \neq \emptyset. \] The above defines an infinite sequence \((x_n)_{n \in \N}\) of distinct elements of \(X\). This gives the injection \(i : \N \rightarrowtail X\) defined by \(i(n) = x_n\).
Note that this version of the proof relies on the axiom of dependent choice. The relation is defined on finite sequences of elements from \(X\), relating two sequences if the second extends the first. In particular, we apply dependent choice to the set \[ S = \left\{ s \mid s \text{ is a finite sequence of distinct elements of } X \right\} \] with the relation \[ sRs' \iff \abs{s'} = 1 + \abs{s} \text{ and } s \text{ and } s'\vert_{\abs{s}} \] where \(\abs{s}\) is the length of the sequence \(s\) and \(s\vert_n\) restricts a sequence \(s\) with \(\abs{s} \geq n\) to its first \(n\) elements.
Axiom of dependent choice
:ANKINOTETYPE: Basic :ANKIDECK: Math::KarAr::Lecture02 :ANKITAGS: KarAr :ID: 89e0cf67-4b57-4c5f-849f-948f49c3d788 :ANKINOTEID: 1772719178448
If a binary relation \(R \subseteq X \times X\) is total, that is \(\forall x \in X .\, \exists y \in X . xRy\), then for any \(x_0 \in X\), there exists an infinite sequence \((x_n)_{n \in \N}\) such that \[ \forall n \in \N .\, x_n R x_{n+1}. \]
\leavevmode First, the equivalence \(2 \iff 3\) is easy. For \(4 \implies 3\), suppose \(e : \N \rightarrow X\) is injective. Define \(i : X \rightarrow X\) by \[ i(x) = \begin{cases} x & \text{if } x \neq e(\N) \\ e(n + 1) & \text{if } x = e(n) \end{cases} \] Then \(i\) is a proper injection because \(e(0) \not\in i(x)\).
For \(3 \implies 4\) suppose \(i : X \rightarrow X\) is a proper injection. Because it's proper, there exists a \(x_0 \in X \setminus i(x)\). Define \(e : \N \rightarrow X\) by \[ e(0) = x_0 \text{ and } e(n + 1) = i(e(n)). \] The injectivity of \(e\) is left as an exercise.
For \(4 \implies 1\) suppose that \(\abs{\N} \leq \abs{X}\). Suppose for contradiction that \(X\) is finite. Then there exists \(n\) with \(\abs{X} = \abs{[n]}\). So \(\abs{\N} \leq \abs{X} = \abs{[n]}\), showing that \(\abs{\N} \leq \abs{[n]}\). Since we also have \(\abs{[n]} \leq \abs{\N}\), we have \(\abs{\N} = \abs{[n]}\). This contradicts \(\abs{[n]} \ll \abs{\N}\) from the previous proposition.
Finally, for \(1 \implies 4\), suppose \(X\) is infinite. We define an injection, that is, a sequence \((x_n)_{n\in \N}\) of distinct elements of \(X\). Suppose \((x_i)_{0 \leq i < n}\) is given. Since \(X\) is infinite, the function \(i \mapsto x_i : [n] \rightarrow X\) is not a bijection, but it is an injection, hence it cannot be surjective. Thus we can choose \[ x_n \in X \setminus \{x_i \mid 0 \leq i < n\} \neq \emptyset. \] The above defines an infinite sequence \((x_n)_{n \in \N}\) of distinct elements of \(X\). This gives the injection \(i : \N \rightarrowtail X\) defined by \(i(n) = x_n\).
Note that this version of the proof relies on the axiom of dependent choice. The relation is defined on finite sequences of elements from \(X\), relating two sequences if the second extends the first. In particular, we apply dependent choice to the set \[ S = \left\{ s \mid s \text{ is a finite sequence of distinct elements of } X \right\} \] with the relation \[ sRs' \iff \abs{s'} = 1 + \abs{s} \text{ and } s \text{ and } s'\vert_{\abs{s}} \] where \(\abs{s}\) is the length of the sequence \(s\) and \(s\vert_n\) restricts a sequence \(s\) with \(\abs{s} \geq n\) to its first \(n\) elements.
We define the cardinality of \(\N\) as \(\aleph_0 = \abs{\N}\).
\noindent
By the last theorem, \(X\) is Dedekind infinite if and only if \(\aleph_0 \leq X\).
\leavevmode
- A set \(X\) is countable if \(\abs{X} \leq \aleph_0\).
- If \(X\) is countable and infinite, then we call it countably infinite.
- If \(X\) is not countable, we say it is uncountable.
Examples of countable sets are \(\N, \Z, \Q, \N \times \N\).
\leavevmode
- If \(X\) is countable then \(X\) is finite or \(\abs{X} = \aleph_0\).
- Countable sets are closed under finite product, disjoint finite union, countable union, countable product, \(\dots\)
- Also if \(A\) is countable and \(A \twoheadrightarrow B\), then \(B\) is countable.
- Likewise if \(B\) is countable and \(A \rightarrowtail B\), then \(A\) is countable.
\noindent
Examples of countably infinite sets are \(\N, \N \times \N, \Z, \Q\). Uncountable sets are \(\R, \pot(\N), 2^{\N}\), where \(2 = \left\{ 0, 1 \right\}\). Instead of manually computing isomorphisms, we can use CSB to prove cardinal equalities.
For instance, showing \(\abs{\N} = \abs{\Q}\) entails showing:
- \(\abs{\N} \leq \abs{\Q}\), which is immediate because the inclusion \(n \mapsto n\) is injective.
- For \(\abs{\Q} \leq \abs{\N}\), the function \(q \mapsto 2^{\sgn(q) + 1} \cdot 3^m \cdot 5^n\), where we set \(m = n =0\) if \(q = 0\) and \(\frac{m}{n}\) is the reduced fraction for \(\abs{q}\) otherwise. This function is again injective.
Let us also show \(\abs{\R} = \abs{\pot(\N)}\)
- For \(\abs{\R} \leq \abs{\pot(\Q)}\), the function \(r \mapsto \left\{ q \in \Q \mid q < r \right\}\) is injective.
- \(\abs{\pot(\Q)} \leq \abs{\pot(\N)}\) holds since we have \(\abs{\Q} \leq \abs{\N}\).
- For \(\abs{\pot(\N)} \leq \abs{\R}\), the function \[ s \mapsto \sum\limits_{n = 0}^{\infty} \frac{1_S(n)}{3^n} \text{ where } 1_S(n) = \begin{cases} 1 & \text{if } n \in S \\ 0 & \text{if } n \not\in S \end{cases} \] is injective.
For every set \(X\), we have \(\abs{X} \ll \abs{\pot X}\).
We have the injection \(i : X \rightarrowtail \pot X\) defined by \(i(x) = \{x\}\). Thus \(\abs{X} \leq \abs{\pot X}\). Now suppose \(g : X \twoheadrightarrow \pot X\) is a surjection. Define \[ S = \{x \in X \mid x \not\in g(x)\}. \] So \(S \in \pot X\) and \[ \forall x \in X .\, x\in S \iff x \not\in g(x). \] Since \(g\) is a surjection, there exists \(y \in X\) such that \(g(y) = S\). But then \(y \in g(y) \iff y \in S \iff y \not\in g(y)\), a contradiction. Thus there is no surjection from \(X\) to \(\pot X\), so indeed \(\abs{X} \ll \abs{\pot X}\).
Since \(\R\) is in bijection with \(\pot \N\), we have \(\abs{\R} = \abs{\pot \N}\). By Cantor's theorem, \(\abs{\N} \ll \abs{\pot \N}\), so \(\R\) is uncountable.
Basic Cardinal Arithmetic
The powerclass operation \(\Class{X} \mapsto \pot \Class{X}\) induces an operation on class functions. That is, if \(f : \Class{X} \rightarrow \Class{Y}\) is a function, then we have a function \[ \pot f : \pot \Class{X} \rightarrow \pot \Class{Y} \] defined by \((\pot f)(A) = \{f(a) \mid a \in A\} = f(A)\). Note that \(\pot \) preserves identities and composition.
The operation \(f \mapsto \pot f\) is functorial, i.e.
- For any \(\Class{X}\), we have \(\pot(\operatorname{id}_{\Class{X}}) = \operatorname{id}_{\pot \mathcal{X}}\), and
- for any \(G : \Class{X} \rightarrow \Class{Y}\) and \(G : \Class{Y} \rightarrow \Class{Z}\), we have \[ \pot(G \circ F) = (\pot G) \circ (\pot F) : \pot \Class{X} \rightarrow \pot \Class{Z} \]
The operation \(X \mapsto \pot X\) on sets \(X\) preserves cardinal inequalities:
\begin{align*} \abs{X} = \abs{Y} &\implies \abs{\pot X} = \abs{\pot Y}, \\ \abs{X} \leq \abs{Y} &\implies \abs{\pot X} \leq \abs{\pot Y}. \end{align*}What the statements above are saying is that if \(f : X \rightarrow Y\) is a bijection or injection respectively, so is \(\pot f : \pot X \rightarrow \pot Y\). This is easy to verify directly, but a more methodological approach is to derive them via the functoriality of the powerclass map, since functors preserve isomorphisms and split monomorphisms.
In particular,
- A function \(f : X \rightarrow Y\) is bijective if and only if it is an isomorphism, i.e. \[ \exists f^{-1} : Y \rightarrow X \text{ such that } f^{-1} \circ f = \operatorname{id}_X \text{ and } f \circ f^{-1} = \operatorname{id}_{Y} \]
- Functors always preserve isomorphisms.
- A function \(f : X \rightarrow Y\) where \(X \neq \emptyset\) is injective if and only if it is a section, i.e. \[ \exists g : Y \rightarrow X \text{ such that } g\circ f = \operatorname{id}_X. \]
- Functors always preserve sections.
- Since \(\pot \emptyset = \left\{ \emptyset \right\}\) is a singleton, every function \(\pot \emptyset \rightarrow Z\) is likewise injective.
In lecture \(\)
We also consider the functoriality of the products, exponentials, and coproducts. They're also functorial, and preserve isomorphisms and monomorphisms.
If \(\abs{X} = \abs{X'}\) and \(\abs{Y} = \abs{Y'}\), then
\begin{align*} \abs{X \times Y} &= \abs{X' \times Y'}, \\ \abs{X + Y} &= \abs{X' + Y'}, \\ \abs{Y^X} &= \abs{(Y')^{(X')}}. \end{align*}Note in the first two cases we can also take \(\leq\). In the last case, \[ \abs{Y^X} = \abs{(Y')^{(X')}} \] holds, except when \(X = Y = Y' = \emptyset \neq X'\). The reason we have a corner case corresponds to the fact that functors preserve sections, not general monomorphisms.
We've already defined \(n\) as the cardinality \(\abs{[n]}\) and \(\aleph_0 = \abs{\N}\). We also define
\begin{align*} \abs{X} + \abs{Y} &:= \abs{X + Y}, \\ \abs{X} \cdot \abs{Y} &:= \abs{X \times Y}, \\ \abs{Y}^{\abs{X}} &:= \abs{Y^X}. \end{align*}The following hold for all sets \(X, Y, Z\).
Addition is a commutative monoid with unit 0:
\begin{align*} \abs{X} + 0 &= \abs{X} = 0 + \abs{X}, \\ \abs{X} + \abs{Y} &= \abs{Y} + \abs{X}, \\ \abs{X} + (\abs{Y} + \abs{Z}) &= (\abs{X} + \abs{Y}) + \abs{Z}. \end{align*}Multiplication is a commutative monoid with unit 1:
\begin{align*} \abs{X} \cdot 1 &= \abs{X} = 1 \cdot \abs{X}, \\ \abs{X} \cdot \abs{Y} &= \abs{Y} \cdot \abs{X}, \\ \abs{X} \cdot (\abs{Y} \cdot \abs{Z}) &= (\abs{X} \cdot \abs{Y}) \cdot \abs{Z}. \end{align*}Distributivity and annihilation:
\begin{align*} \abs{X} \cdot 0 &= 0, \\ \abs{X} \cdot (\abs{Y} + \abs{Z}) &= \abs{X} \cdot \abs{Y} + \abs{X} \cdot \abs{Z}. \end{align*}Exponentiation laws:
\begin{align*} \abs{X}^0 &= 1, \\ \abs{X}^1 &= \abs{X}, \\ \abs{X}^{\abs{Y} + \abs{Z}} &= \abs{X}^{\abs{Y}} \cdot \abs{X}^{\abs{Z}}, \\ \abs{X}^{\abs{Y} \cdot \abs{Z}} &= (\abs{X}^{\abs{Y}})^{\abs{Z}}, \\ 1^{\abs{X}} &= 1, \\ (\abs{X} \cdot \abs{Y})^{\abs{Z}} &= \abs{X}^{\abs{Z}} \cdot \abs{Y}^{\abs{Z}}. \end{align*}
For instance, to prove \(\abs{X}^{\abs{Y} \cdot \abs{Z}} = (\abs{X}^{\abs{Y}})^{\abs{Z}}\), we establish the bijection \(\Lambda : X^{Y \times Z} \rightarrow (X^Y)^Z\) \[ \Lambda(g : Y \times Z \rightarrow X) := \lambda z \in Z . \lambda y \in Y . g(y, z). \]
In ordinary arithmetic, we have
\begin{align*} x + z &= y + z \implies x = y, \\ x \cdot z &= y \cdot z \implies x = y \text{ (if \(z \neq 0\))}. \end{align*}But in cardinal arithmetic, cancellation fails. In particular: \[ \aleph_0 + 0 = \aleph_0 + \aleph_0, \] but \(0 \neq \aleph_0\). Likewise, \[ 1 \cdot \aleph_0 = \aleph_0 = \aleph_0 \cdot \aleph_0, \] but \(1 \neq \aleph_0\).
We say that \(\abs{X}\) is idempotent (or its own square) if \(\abs{X} = \abs{X} \cdot \abs{X}\).
For instance, \(\aleph_0\) is idempotent.
If \(\abs{X} \geq 2\) and \(\abs{X}\) is idempotent, then
- \(\aleph_0 \leq \abs{X}\), i.e., \(X\) is Dedekind infinite, and
- \(\abs{X} = \abs{X} + \abs{X}\).
Let's show the second point first. We have \[ \abs{X} \leq \abs{X} + \abs{X} \] since we have two obvious injections. Next, \[ \abs{X} + \abs{X} = \abs{X} \cdot 2 \leq \abs{X} \cdot \abs{X} = \abs{X} \] first by distributivity and then by assumption. Thus \(\abs{X} = \abs{X} + \abs{X}\) by the Cantor-Schröder-Bernstein theorem.
For the first point, note that \(X\) is Dedekind infinite, since we have the injection \[ x \mapsto f(0, x) \] by the bijection \(f : X \times X \rightarrow X\).
If \(\abs{X} \geq 2\) and \(\abs{X}\) is idempotent, then \(2^{\abs{X}}\) is also idempotent.
We have
\begin{align*} 2^{\abs{X}} \cdot 2^{\abs{X}} &= 2^{\abs{X} + \abs{X}} \\ &= 2^{\abs{X}} \text{ (by the previous lemma)} \end{align*}If \(X\) is infinite and \(Y \subseteq X\) satisfies \(\abs{Y} \ll \abs{X}\), then we might expect \[ \abs{X \setminus Y} = \abs{X}. \] This holds assuming the axiom of choice. The proof of this is nonetheless quite involved.
If \(\abs{X}\) is idempotent and \(Y \subseteq X\) is such that \(\abs{Y} \ll \abs{X}\), then \(\abs{X \setminus Y} = \abs{X}\).
Consider the following diagram: \[ Y \xrightarrow{\subseteq} X \xrightarrow{f} X \times X \xrightarrow{\pi_1} X \] Because \(\abs{Y} \ll \abs{X}\), the composition is not a surjection.
So there exists \(x_0 \in X\) such that for all \(y \in Y\), we have \(f(y) = (x_1, x_2)\) where \(x_1 \neq x_0\). We therefore have an injective function \[ X \xrightarrow{x \mapsto (x_0, x)} (X \times X) \setminus f(Y). \] Then \[ \abs{X} \leq \abs{(X \times X) \setminus f(Y)} = \abs{X \setminus Y} \] because \(f\) is a bijection.
Clearly we also have \(\abs{X \setminus Y} \leq \abs{X}\). By the Cantor-Schröder-Bernstein theorem, we have \(\abs{X \setminus Y} = \abs{X}\).
For any set \(X\), we have \[ \abs{X} \ll 2^{\abs{X}}. \]
If \(\aleph_0 \leq \abs{X} \leq 2^{\aleph_0}\), then either \(\abs{X} = \aleph_0\) or \(\abs{X} = 2^{\aleph_0}\).
If we have \[ \aleph_0 \leq \abs{Z} \leq \abs{X} \leq 2^{\abs{Z}}, \] then either \(\abs{X} = \abs{Z}\) or \(\abs{X} = 2^{\abs{Z}}\).
Gödel showed that GCH is consistent with the axioms of set theory.
Ordinals
We can come to the notion of an ordinal through two main ways.
- Ordinals measure order type: Ordinals represent the order type of a well-ordered set.
- Ordinals as extended numbers: We can think of ordinals as an extended number system for counting.
This counting proceeds through the natural numbers, then through \(\omega\), then continues as:
\begin{align*} 0, 1, 2, 3, 4, 5, \dots, \\ \omega, \omega + 1, \omega + 2, \omega + 3, \dots, \\ \omega \cdot 2, \omega \cdot 2 + 1, \dots, \omega \cdot 3, \omega \cdot 3 + 1, \dots, \\ \omega^2, \omega^2 + 1, \dots, \omega^2 + \omega, \dots, \omega^3, \dots \end{align*}Eventually we reach ordinals like \[ \omega^{\omega^{\omega^{\dots}}} \] and even a pattern of \(\omega\) many exponentiations. We call this ordinal \(\varepsilon_0\). This is the smallest ordinal \(\alpha\) satisfying \[ \alpha = \omega^{\alpha}, \] the smallest fixed point of exponentiation by \(\omega\). Then we continue to \(\varepsilon_1\) and so on.
But this collection is still countable. We can then ascend to \(\omega_1\), the smallest uncountable ordinal. Repeating the pattern, we obtain \(\omega_2\), \(\omega_{\omega}\), \(\omega_{\omega_1}\), and so forth.
We construct this structure by adjoining to the natural numbers the rule that for any collection of ordinals, we can always form the next smallest ordinal that follows all of them.
We would like to formalise this intuition of generalized counting. Note that we have \(0\) and a successor operation, like with the naturals. We need to add an operation that will take us "beyond" the current structure. It turns out the right one is the supremum which maps any set of ordinals to the least upper bound.
An ordinal structure is given by a class \(\Ord\) of ordinals, a partial order \(\leq\) on \(\Ord\) and
- an element \(0 \in \Ord\), i.e. zero,
- a class function \(s : \Ord \rightarrow \Ord\), i.e. the successor, and
- a class function \(\sup : \Ord \rightarrow \Ord\), i.e. the supremum.
The following must hold.
- \(\forall \alpha \in \Ord\), we have \(0 \leq \alpha\), i.e. \(0\) is a minimal element.
- \(s\) must be a successor operation, i.e. \[ \alpha < s(\alpha) \text{ and } \alpha \leq \beta \implies (\beta = \alpha \lor s(\alpha) \leq \beta). \]
- \(\sup\) is a supremum operation, namely for any set \(X \subseteq \Ord\), its supremum is the least upper bound, i.e. \[ X \leq \sup X \text{ and if }X \leq \alpha, \text{ then } \sup X \leq \alpha. \]
- Lastly, the principle of transfinite induction (TI) must hold, i.e. if \(\Class{X} \subseteq \Ord\) is such that
- (base case): \(0 \in \Class{X}\),
- (step case): \(\alpha \in \Class{X} \implies s(\alpha) \in \Class{X}\),
- (supremum case): For every subset \(A \subseteq \Class{X}\), we have \(\sup A \in \Class{X}\),
then \(\Class{X} = \Ord\).
For ease of readability, we often write \(\alpha + 1\) instead of \(s(\alpha)\).
\(\leq\)is a linear order on \(\Ord\).
Let \(\beta \in \Ord\) be arbitrary. We will prove that \[ \alpha \leq \beta \lor \beta \leq \alpha \] holds for all \(\alpha \in \Ord\) by transfinite induction on \(\alpha\).
- (Base case): When \(\alpha = 0\), we have \(0 \leq \beta\) by axiom.
(Step case): By induction hypotheses, we have \(\alpha \leq \beta\) or \(\beta \leq \alpha\). If \(\alpha \leq \beta\), then either \(\alpha = \beta\) or \(\alpha + 1 \leq \beta\) by the successor axiom. So either \(\beta \leq \alpha + 1\) or \(\alpha + 1 \leq \beta\) as required.
If \(\beta \leq \alpha\), then we have \(\beta \leq \alpha < \alpha + 1\), so we have \(\beta \leq \alpha + 1\).
- (Supremum case): Suppose that \(A\) is a set of ordinals \(\alpha\) satisfying \(\alpha \leq \beta \lor \beta \leq \alpha\). We need to show
\[
\sup A \leq \beta \lor \beta \leq \sup A.
\]
There are two possible cases.
- If \(\forall \alpha \in A\), we have \(\alpha \leq \beta\), then we have \(\sup A \leq B\).
- Elsewise we have \(\exists \alpha \in A\) such that \(\beta \leq \alpha\). In this case, we have \(\beta \leq \alpha \leq \sup A\).
For the split into the two cases, we used the Law of Excluded Middle.
Recall the standard finite set was given as \([n] = \left\{ x \in \N \mid x < n \right\}\).
The standard ordinal class is given as \[ [\alpha] := \left\{ \beta \in \Ord \mid \beta < \alpha \right\}. \]
This is sometimes written as \(\downarrow \alpha\).
For every ordinal \(\alpha\), the class \([\alpha]\) is a set.
We proceed by transfinite induction.
- (Base case): For \(\alpha = 0\), we have \([0] = \emptyset\) which is a set.
- (Step case): If \([\alpha]\)is a set, then \[ [s(\alpha)] = [\alpha] \cup \left\{ \alpha \right\}, \] which is clearly a set.
- (Supremum case): Suppose \([\alpha]\) is a set for every every \(\alpha \in A \subseteq \text{\Class{A}}\) is a set. Then \[ [\sup A] = \bigcup\limits_{\alpha \in A} [\alpha] \] is a set, because sets are closed under set-indexed unions.
\(\Ord\) is a proper class.
Suppose \(\Ord\) is a set.
Then \(\sup(\Ord)\) is a maximum element in the set of ordinals. But this cannot be, because \(s(\sup(\Ord))\) is strictly larger.
The ordinals separate into two kinds.
- \(\alpha \in \Ord\) is a successor ordinal if there exists \(\alpha' \in \Ord\) such that we have \[ \alpha = \alpha' + 1 \]
- \(\alpha \in \Ord\) is a limit ordinal if there exists a subset \(A \subseteq [\alpha]\) with \(\alpha = \sup A\). This is equivalently stated as \(\alpha = \sup [\alpha]\).
This split is clean.
Every ordinal is either a successor ordinal or a limit ordinal (but not both).
By transfinite induction, left as exercise.
We have two reformulations of transfinite induction.
If \(\Class{X} \in \Ord\) satisfies
- \(0 \in \Class{X}\),
- \(\alpha \in \Class{X} \implies s(\alpha) \in \Class{X}\),
- \([\lambda] \subseteq \Class{X} \implies \lambda \in \Class{X} \), where \(\lambda > 0\) is a limit ordinal,
then \(\Class{X} = \Ord\).
If \(\Class{X} \subseteq \Ord\) satisfies \[ [\alpha] \subseteq \Class{X} \implies \alpha \in \Class{X}, \] then \(\Class{X} = \Ord\).
This can be stated alternatively. If \(\Class{Y}\) satisfies progressivity: \[ \forall y \in \Ord .\, (\forall x < y .\, x \in \Class{Y}) \rightarrow y \in \Class{Y}, \] then \(\Class{Y} = \Ord\).
Let \(\Class{A}\) be a class with \(a \in \Class{A}\) and \(F : \Class{A} \rightarrow \Class{A}\), and \(L : \mathcal{P}\Class{A} \rightarrow \Class{A}\). There is a unique class function \[ T : \Ord \rightarrow \Class{A} \] satisfying
\begin{align*} T(0) &= a,\\ T(\alpha+1) &= F\big(T(\alpha)\big),\\ T(\lambda) &= L\big(\{T(\alpha)\mid \alpha<\lambda\}\big) \quad\text{for }\lambda>0\text{ a limit ordinal.} \end{align*}Let \(a \in \Class{A}\), \(F : \Class{A} \rightarrow \Class{A}\) and \(L : \mathcal{P} \Class{A} \rightarrow \Class{A}\) as above. We need to prove the existence of a suitable \(T : \Ord \rightarrow \Class{A}\).
We proceed by defining a set of approximations to \(T\) for every \(\alpha\).
For an ordinal \(\gamma\), a \(\gamma\)-approximation is a function \(t : [\gamma] \rightarrow \Class{A}\) that satisfies
\begin{align*} t(0) &= a \text{ if } 0 < \gamma \\ t(s(\alpha)) &= F(t(\alpha)) \text{ if } s(\alpha) < \gamma \\ t(\lambda) &= L(\left\{ t(\alpha) \mid \alpha < \lambda \right\}) \text{ if } \lambda < \gamma. \end{align*}Note that if \(t : [\gamma] \rightarrow \Class{A}\) is a \(\gamma\)-approximation, then \[ t\vert_{[\beta]} : [\beta] \rightarrow A \] is a \(\beta\)-approximation for any \(\beta < \gamma\).
We will prove: \[ \forall \gamma \in \Ord. \, \exists! \gamma-\text{approximation } t_{\gamma} : [\gamma] \rightarrow \Class{A}. \] Once we prove this, we can define \[ T(\alpha) := t_{\alpha + 1}(\alpha). \]
We prove the above statement using transfinite induction.
The TRT allows us to prove a lot of useful things.
Any two ordinal structures are isomorphic.
Exercise.
Suppose \(\Class{A}\) and \(\Class{Z}\) are classes with class functions
\begin{align*} B : \Class{Z} &\rightarrow \Class{A} \\ F : \Class{Z} \times \Class{A} &\rightarrow \Class{A} \\ L : \Class{Z} \times \mathcal{P}\Class{A} &\rightarrow \Class{A} \end{align*}Then there is a unique class function \(T : \Class{Z} \times \Ord \rightarrow \Class{A}\) satisfying
\begin{align*} T(z,0) &= B(z) \\ T(z,\alpha+1) &= F(z, T(z,\alpha)) \\ T(z,\lambda) &= L\bigl(z, \{T(z,\alpha)\mid \alpha<\lambda\}\bigr)\quad\text{for }\lambda>0\text{ a limit ordinal} \end{align*}The proof is essentially the same as the non-parametrised version.
Using the parametrised transfinite recursion theorem, we define \(T : \Ord \times \Ord \rightarrow \Ord\) using
\begin{align*} B(\alpha) &= \alpha \\ F(\alpha, \beta) &= \beta + 1 \\ L(\alpha, A) &= \sup A \end{align*}Then \(T : \Ord \times \Ord \rightarrow \Ord\) is the unique class function such that
\begin{align*} T(\alpha, 0) &= \alpha \\ T(\alpha, \beta + 1) &= F(\alpha, T(\alpha, \beta)) = T(\alpha, \beta) + 1 \\ T(\alpha, \lambda) &= L(\alpha, \left\{ T(\alpha, \beta) \mid \beta < \lambda \right\}) = \sup \left\{ T(\alpha, \beta) \mid \beta < \lambda \right\} \end{align*}Thus we have defined ordinal addition as the unique class function \((+) : \Ord \times \Ord \rightarrow \Ord\) satisfying
\begin{align*} \alpha + 0 &= \alpha \\ \alpha + (\beta + 1) &= (\alpha + \beta) + 1, \text{ i.e.} \\ \alpha + s(\beta) &= s(\alpha + \beta)\\ \alpha + \lambda &= \sup_{\beta < \lambda} \alpha + \beta \end{align*}This justifies the use of the notation \(\alpha + 1 = s(\alpha)\).
We have \[ 1 + \omega = \omega \neq \omega + 1 \] which means that ordinal addition is not commutative.
We likewise define ordinal multiplication as the unique class function \((\cdot) : \Ord \times \Ord \rightarrow \Ord\) satisfying
\begin{align*} \alpha \cdot 0 &= 0 \\ \alpha \cdot (\beta + 1) &= \alpha \cdot \beta + \alpha \\ \alpha \cdot \lambda &= \sup_{\beta < \lambda} \alpha \cdot \beta \qquad \lambda > 0 \text{ a limit ordinal} \end{align*}Ordinal multiplication is not commutative. For example, \[ 2 \cdot \omega = \omega \neq \omega + \omega = \omega \cdot 2. \]
We define ordinal exponentiation as the unique class function \((\cdot)^{(\cdot)} : \Ord \times \Ord \rightarrow \Ord\) satisfying
\begin{align*} \alpha^0 &= 1 \\ \alpha^{\beta+1} &= \alpha^\beta \cdot \alpha \\ \alpha^\lambda &= \sup_{\beta < \lambda} \alpha^\beta \qquad \lambda > 0 \text{ is a limit ordinal} \end{align*}We have \(\omega^2 = \omega \cdot \omega \neq \omega\) and \(\omega = 2^\omega \neq \omega^\omega\). Contrast with cardinal arithmetic: \(\aleph_0^2 = \aleph_0 \cdot \aleph_0 = \aleph_0\), but \(\aleph_0 \neq 2^{\aleph_0} = \aleph_0^{\aleph_0}\).
An ordinal structure exists.
We define \(\Class{\mathrm{vNOrd}}\) as the class of von Neumann ordinals, namely
\begin{align*} 0 &= \emptyset\\ &\vdots \\ n + 1 &= n \cup \left\{ n \right\} \\ &\vdots \\ \omega &= \left\{ 0, 1, 2, \dots \right\}\\ \omega + 1 &= \omega \cup \left\{ \omega \right\} \end{align*}In particular, we define a structure with the three operations
- \(0 := \emptyset\)
- \(s(\alpha) := \alpha \cup \left\{ \alpha \right\}\) and
- \(\sup(A) := \bigcup A \)
- and taking \(\leq ~ := ~ \subseteq\).
We can prove that the smallest class of sets closed unser the above operations is an ordinal structure.
Well-Ordered Sets
- In the previous lecture we saw ordinals as a numbering system for cardinals.
- Ordinals also show up as a tool for proving more general theorems.
- In this lecture we will come to know the ordinals as tools for classifying well-ordered sets.
If \(\Class{X} \subseteq \Ord\) is nonempty, then \(\Class{X}\) has a minimum element.
Suppose \(\Class{X}\) does not have a minimum element. We prove by transfinite induction, that \(\Ord \setminus \Class{X} = \Ord\). It is enough to show \(\Ord \setminus \Class{X}\) is progressive.
Suppose \(y \in \Ord \setminus \Class{X}\) satisfies \[ \forall x < y . \, x \in \Ord \setminus \Class{X}. \] Clearly \(y \not\in \Class{X}\) because if \(y \in \Class{X}\) it would be a minimum element.
For a class relation \(\Class{R} \subseteq \Class{X} \times \Class{X}\), we say:
- \(\Class{Y}\) is \(\Class{R}\)-progressive if \[ \forall x \in \Class{X} .\, (\forall y \in \Class{X} .\, y\Class{R} x \rightarrow y \in \Class{Y}) \rightarrow x \in \Class{Y}. \]
- \(y \in \Class{Y}\) is \(\Class{R}\)-minimal if \[ \neg \exists z \in \Class{Y} .\, z\Class{R} y. \]
- A sequence \((x_n)_{n \in \N}\) in \(\Class{X}\) is \(\Class{R}\)-decreasing if \[ \forall n \in \N.\, x_{n+1} \Class{R} x_n. \]
Suppose \(\Class{R} \subseteq \Class{X} \times \Class{X}\) is a relation. Consider the following statements.
- For every subclass \(\Class{Y} \subseteq \Class{X}\), if \(\Class{Y}\) is \(\Class{R}\)-progressive, then \(\Class{Y} = \Class{X}\).
- Every nonempty subclass \(\Class{Y} \subseteq \Class{X}\) has an \(\Class{R}\)-minimal element.
- There is no strict \(\Class{R}\)-decreasing infinite sequence in \(\Class{X}\).
Then \(1. \iff 2.\) and \(2. \implies 3.\). Also, \(3. \implies 2.\) holds under the axiom of dependent choice.
- \((2. \implies 3.)\): We prove the contrapositive. Suppose \((x_n)\) is \(\Class{R}\)-decreasing. The set \(\left\{ x_n \mid n \in \N \right\}\) is clearly nonempty and has no \(\Class{R}\)-minimal element.
- \((3. \implies 2.)\): Suppose \(\Class{Y} \subseteq \Class{X}\) is nonempty and has no \(\Class{R}\)-minimal element. Define
- \(x_0\) as some element in \(\Class{Y}\).
- \(x_{n+1}\) as some chosen element such that \(x_{n+1} \Class{R} x_n\), which exists because \(x_n\) is not minimal. By dependent choice we get the required sequence \((x_n)\).
A relation \(\Class{R} \subseteq \Class{X} \times \Class{X}\) is well-founded if 1. (equivalently) from the previous proposition holds.
Any well-founded relation is irreflexive and acyclic.
A well-ordered class is a class \(\Class{X}\) together with a total order relation \(\leq\) such that the corresponding strict order \(<\) is well-founded.
Of particular interest are well-ordered sets (well-orders).
The following are equivalent for any partially ordered set \((X, \leq)\).
- \((X, \leq)\) is a well-order.
- Every nonemtpy subset of \(X\) has a minimum element.
- \((\N, \leq)\)
- Likewise, \(([n], \leq)\) (and every finite linear order)
- Likewiser, for any ordinal \(\alpha\), the set \([\alpha]\) is a well-order under \(\leq\).
And actually, this last characterization is in some sense complete. In particular, we have the following theorem.
Every well-order is order-isomorphic to \([\alpha]\) for a unique ordinal \(\alpha\).
We write \(\operatorname{Ord}(X, \leq)\) for the unique \(\alpha\) such that \(X \cong [\alpha]\) for a well-order \((X, \leq)\). We call this the order type of \(X\).
Let us look at some corollaries before the proof.
The following are equivalent for well-orders \(A, B\).
- \(A \cong B\)
- \(\operatorname{Ord}(A) = \operatorname{Ord}(B)\)
An initial segment of a totally ordered class \((\Class{X}, \leq)\) is any \(\Class{Y} \subseteq \Class{X}\) that is down-closed, that is, \[ \forall y \in \Class{Y} , x \in \Class{X}.\, x < y \implies x \in \Class{Y}. \]
The following are equivalent for well-orders \(A, B\).
- There is an (order) embedding from \(A\) to \(B\).
- \(A\) is isomorphic to an initial segment of \(B\).
- \(\operatorname{Ord}(A) \leq \operatorname{Ord}(B)\).
For well-orders \(A\) and \(B\):
- If \(A\) and \(B\) embed into each other, then \(A \cong B\) as an order isomorphism.
- Either \(A\) embeds in \(B\) or \(B\) embeds in \(A\).
Let \((X, \leq_X)\) and \((Y, \leq_Y)\) be two well orders. We define a well-order on \(X + Y\) as "placing one after the other". as
\begin{align*} \iota_0(x) \leq_{X + Y} \iota_0(x') &\iff x \leq_X x' \\ \iota_1(y) \leq_{X + Y} \iota_1(y') &\iff y \leq_Y y' \\ \iota_0(x) \leq_{X + Y} \iota_1(y) &\iff \top \\ \iota_1(y) \leq_{X + Y} \iota_0(x) &\iff \bot \end{align*}\[ \operatorname{X + Y, \leq_{X + Y}} = \operatorname{Ord}(X, \leq_X) + \operatorname{Ord}(Y, \leq_Y). \]
Product of well-orders
Let \((X, \leq_X)\) and \((Y, \leq_Y)\) be two well orders. We define a well-order on \(X \times Y\) as the reverse lexicographic ordering.
\[ \operatorname{X * Y, \leq_{X \times Y}} = \operatorname{Ord}(X, \leq_X) \cdot \operatorname{Ord}(Y, \leq_Y). \]
For the exponential, we want to define a well-order on a function space. In particular, we will have to restrict the functions we consider.
Let \((X, \leq_X)\) and \((Y, \leq_Y)\) be two well orders. Then \[ \operatorname{Ord}(\operatorname{finsupp}[X, Y], \leq_{Y^X}) = \operatorname{Ord}(Y)^{\operatorname{Ord(X)}}. \] Here we have \[ \operatorname{finsupp}[X, Y] = \left\{ f : X \rightarrow Y \mid \text{ the set } \left\{ x \in X \mid f(x) \neq \text{ the least of } Y \right\} \text{ is finite}\right\} \] Then \[ f <_{Y^X} g \iff f \neq g \text{ and } f(x_0) < g(x_0) \] where \(x_0 = \max \left\{ x \in X \mid f(x) \neq g(x) \right\}\).
In the special case \(Y = (\N, \leq)\) we have \[ \operatorname{finsupp}(X, \N) = \left\{ f : X \rightarrow \N \mid \left\{ x \mid f(x) \neq 0 \right\} \text{ is finite} \right\}. \]
We can look at this as finite multisets. This says that one multiset is below another by looking at the highest element which is assigned a different multiplicity - the left hand set needs to have a smaller multiplicity. We call this ordering \(<_{\N^X}\) the (Deschowitz-manna) multiset ordering. This leads us to multiset induction.
If \(f : A \rightarrow A\) is an embedding for \(A\) a well-order, \(f\) is inflationary, namely \[ \forall x \in A.\, x \leq f(x). \]
Suppose \(f\) is not inflationary. Then there exists some element where \(x > f(x)\). But then the set \[ B = \left\{ x \mid x > f(x) \right\} \] is nonempty. Let \(x_0\) be the smallest element such that \(x_0 > f(x_0)\). Since \(f\) is an embedding, it preserves the strict order, so we have \[ f(x_0) > f(f(x_0)), \] so \(f(x_0) \in B\), contradicting the fact that \(x_0\) is the smallest element of \(B\).
- If \(I\) is a proper initial segment of a well-order \(A\), then \(I = [a]\) for some \(a \in A\).
- If a set \(I\) is a proper initial segment of \(\Ord\), then \(I = [\alpha]\) for some ordinal \(\alpha\).
- No well order is isomorphic to any proper initial segment of itself.
- Every well order has exactly one automorphism (the identity). We say it is rigid.
- If \(A, B\) are isomorphic well-orders, then they have a unique isomorphism between them.
- Suppose \(A\) is isomorphic to \([a]\) for some \(a \in A\). Let \(i : A \rightarrow [a]\) be an isomorphism. \[ A \xrightarrow{i} [a] \xrightarrow{\subseteq} A \] is an embedding but \(i(a) \in [a]\), which means \(i(a) < a\). This contradicts that embeddings are inflationary.
- Suppose \(f : A \rightarrow A\) is an automorphism. Then \(f\) and \(f^{-1}\) are embeddings, so in particular inflationary. This means \[ x \leq f(x) \text{ and } x \leq f^{-1}(x) \] so we have \[ x \leq f(x) \leq f^{-1}(f(x)) = x \] so \(f\) is the identity.
- Is trivial.
Transfinite recursion (uniform version)
:ANKINOTETYPE: Basic :ANKIDECK: Math::KarAr::Lecture05 :ANKITAGS: KarAr :ANKINOTEID: 1774987298741
Given \(\Class{A}\) a class and \(L : \pot \Class{A} \rightarrow \Class{A}\), there is a unique \(T : \Ord \rightarrow \Class{A}\) satisfying \[ T(\alpha) = L \left\{ T(\beta) \mid \beta < \alpha \right\}. \]
For uniqueness, suppose \([\alpha] \cong A \cong [\alpha']\). If \(\alpha < \alpha'\), then \([\alpha']\) is isomorphic to a proper initial segment, namely \([\alpha]\). Contradiction. Likewise for \(\alpha' < \alpha\). Thus \(\alpha = \alpha'\).
It remains to show than an \(\alpha\) such that \(A \cong [\alpha]\) exists. By transfinite recursion, define \(f : \Ord \rightarrow A \uplus \left\{ e \right\}\) by \[ f(\alpha) = \begin{cases} \min A \setminus \left\{ f(\beta) \mid \beta < \alpha \right\} & \text{when this set is nonempty} \\ e & \text{otherwise} \end{cases} \] This satisfies transfinite induction.
Cardinals
We define the cardinality of an ordinal as \[ \abs{\alpha} := \abs{[\alpha]}. \]
Do there exist uncountable ordinals?
Hartogs' lemma
For any set \(A\) there exists a smallest ordinal \(\underline{h}(A)\) for which \[ \abs{\underline{h}(A)} \not \leq \abs{A}. \]
Implicitly we're defining a class function \(\underline{h} : \Class{\text{Set}} \rightarrow \Ord\).
Consider the set \[ W := \left\{ R \subseteq A \times A \mid R \text{ is a well-order on a subset of } A \right\} \] Consider the function \(W \rightarrow \Ord\) \[ R \mapsto \operatorname{Ord}(A', R) \] where \(A' \subseteq A\) is the subset of \(A\) that \(R\) well-orders.
The image of the function is a set of ordinals. As a set it is not the whole of \(\Ord\), so we can define \[ \underline{h}(A) := \text{ the smallest ordinal outside this image}. \] We call this function the Hartogs' function.
\(\underline{h}(\N)\) does not embed in \(\N\) and is thus not countable. So it must be the smallest uncountable ordinal. We define \[ \omega_1 := \underline{h}(\N) \]
With the Axiom of Choice, we have \(\underline{h}(A) > \abs{A}\), while not assuming it we only get the weaker \[ \abs{\underline{h}(A)} \not\leq \abs{A} \]
For any ordinal \(\alpha\), we have that \[ \underline{h}(\alpha) := \text{ the smallest ordinal } \beta \text{ with } \abs{\beta} > \abs{\alpha}. \]
In the notes.
An initial ordinal is an ordinal \(\alpha\) that satisfies, for every \(\beta < \alpha\), we have \[ \abs{\beta} < \abs{\alpha}. \]
We think of it as the smallest ordinal of its own cardinality.
For every set \(A\), \(\underline{h}(A)\) is an initial ordinal.
Suppose \(\underline{h}(A)\) is not an initial ordinal. Then we have \(\beta < \abs{\underline{h}(A)}\) with \(\abs{\beta} = \abs{\underline{h}(A)}\), so there is a bijection \([\beta] \cong [\underline{h}(A)]\). Via the isomorphism, we get \(\abs{\beta} \not \leq \abs{A}\) which contradicts \(\underline{h}(A)\) being the Hargogs' ordinal.
- (Proposition C): Every natural number \(n < \omega\) is an initial ordinal.
- \(\omega\) is an initial ordinal.
Every infinite initial ordinal is a limit ordinal.
For a successor infinite ordinal \(\beta + 1\) we have \(\abs{\beta + 1} = \abs{1 + \beta} = \abs{\beta}\).
If \(I\) is a set of initial ordinals, then \(\sup I\) is an initial ordinal.
Suppose \(I\) is a set of innitial ordinals. Suppose \(\beta < \sup I\). Then there exists some \(\alpha \in I\) such that \(\beta < \alpha\). Then it must hold that \(\abs{\beta} < \abs{\alpha}\) because \(\alpha\) is initial. Thus \[ \abs{\beta} < \abs{\alpha} \leq \abs{\sup I}. \]
We shall count through (transfinitely) through the infinite initial ordinals \(\omega_{\alpha}\) for \(\alpha\) an ordinal. We start with
\begin{align*} \omega_0 &:= \omega \\ \omega_1 &:= \underline{h}([\omega_0]) \text{ the smallest uncountable ordinal}\\ \omega_2 &:= \underline{h}([\omega_1]) \\ \vdots & \\ \omega_{\omega} &= \sup_{\alpha < \omega} \omega_{\alpha} \text{ which is limit by lemma F} \\ \vdots & \end{align*}We are in essence carrying out a definition by transfinite recursion.
We define \[ \omega_{(-)} : \Ord \rightarrow \Ord \] by transfinite recursion by
\begin{align*} \omega_0 &:= \omega \\ \omega_{\alpha + 1} &:= \underline{h}([\omega_{\alpha}]) \\ \omega_{\lambda} &:= \sup_{\alpha < \lambda} \omega_{\alpha} \end{align*}- \(\omega_{\alpha}\) is always an infinite initial ordinal
- Every infinite initial ordinal arises as \(\omega_{\alpha}\) for a unique \(\alpha\).
We define the alephs as cardinalities of ordinals \[ \aleph_{\alpha} := \abs{\omega_{\alpha}}. \]
We have the notion of a smallest uncountable ordinal. We might wonder whether it holds that \(\abs{\omega_1} = \abs{\R}\), stated again, \(\abs{\R} = \aleph_1\), i.e. the continuum hypothesis.
If \(\alpha < \beta\), then \(\abs{\omega_{\alpha}} < \abs{\omega_{\beta}}\), hence \(\omega_\alpha < \omega_{\beta}\).
Proposition A shows shows that \(\abs{\omega_{\alpha} < \abs{\omega_{\alpha + 1}}}\). If \(\alpha < \beta\), then \(\beta \geq \alpha + 1\), so we get \[ \abs{\omega_{\alpha}} < \abs{\omega_{\alpha+1}} \leq \abs{\omega_{\beta}}. \] using the monotonicity of \(\omega_{(-)}\).
- We prove this using transfinite induction using our propositions and lemmas.
Uniqueness follows from proposition E. We now only need show every infinite initial ordinal arises as \(\omega_{\alpha}\) for some \(\alpha\). Let \(\gamma\) be an infinite initial ordinal. We need to find \(\alpha\) with \(\gamma = \omega_{\alpha}\). We define \[ I = \left\{ \beta \in \Ord \mid \omega_{\beta} < \gamma \right\}. \] \(I\) is a set because otherwise \(\beta \mapsto \omega_{\beta}\) would be an order embedding of \(\Ord\) in \([\gamma]\). Note that \(I\) is an initial segment set of \(\Ord\), so \(I = [\alpha]\). We have found \(\alpha\). We now need to show \(\gamma = \omega_{\alpha}\).
First note that \(\omega_{\alpha} \geq \gamma\) because \(\alpha\) is not in \(I\). We need only prove \(\omega_{\alpha} \leq \gamma\). This follows by case analysis on \(\alpha\) being either 0, \(\alpha' + 1\) or a limit ordinal.
A set \(X\) is well-orderable if there exists some well-order with carrier set \(X\).
- If \(Y \rightarrow X\) is an injection with \(X\) well-orderable, then \(Y\)is well-orderable.
- If \(X \rightarrow Y\) is a surjection with \(X\) well-orderable, then \(Y\) is well-orderable
Suppose \(X\) is well-orderable and infinite. Then \(X\) is in bijection with \([\alpha]\) because \(\alpha\) can be taken to be the order type of some well-ordering of \(X\). Then \[ \abs{X} = \abs{[\alpha]} = \aleph_{\beta} \] for some \(\beta \in \Ord\).
Also, if \(\abs{Y} = \aleph_{\beta}\), then \(Y\) is well-orderable.
We have
\begin{align*} \aleph_{\alpha} + \aleph_{\beta} &= \aleph_{\max(\alpha, \beta)} \\ \aleph_{\alpha} \cdot \aleph_{\beta} &= \aleph_{\max(\alpha, \beta)} \end{align*}This is easily proved by the following theorem.
Every aleph is its own square. \[ \aleph_{\alpha} = \aleph_{\alpha} \cdot \aleph_{\alpha}, \] i.e. \[ [\omega_{\alpha}] \cong [\omega_{\alpha}] \times [\omega_{\alpha}]. \]
We will prove this by defining an ordering on \(\Ord \times \Ord\) as \[ (0, 0) < (0, 1) < (1, 0) < (1, 1) < (0, 2) < (1, 2) < (2, 0) < (2, 1) < (2, 2) \] which we think of as organised in a square.
Generalising to the whole \(\Ord \times \Ord\), we define
\begin{align*} (\alpha_1, \alpha_2) \prec (\beta_1, \beta_2) \iff &\max(\alpha_1, \alpha_2) \prec \max(\beta_1, \beta_2) \\ \text{or } & \max(\alpha_1, \alpha_2) = \max(\beta_1, \beta_2) \text{ and } \alpha_1 < \beta_1 \\ \text{or } & \max(\alpha_1, \alpha_2) = \max(\beta_1, \beta_2) \text{ and } \alpha_1 = \beta_1 \text{ and } \alpha_2 < \beta_2 \end{align*}- \(\prec\) is a strict partial order on \(\Ord \times \Ord\)
- Every nonempty class \(\Class{X} \subseteq \Ord \times \Ord\) has a \(\prec\)-minimum element.
For every \(\alpha\), we have that \([\alpha] \times [\alpha]\) is an initial segment of \(\Ord \times \Ord\) ordered by \(\prec\).
I can probably rewrite this using a reference. Should look these up for org.
We want to show \[ \aleph_{\alpha} = \aleph_{\alpha} \cdot \aleph_{\alpha}. \] First, \((\leq)\) is obvious. We prove \(\aleph_{\alpha} \cdot \aleph_{\alpha} \leq \aleph_{\alpha}\) by transfinite induction on \(\alpha\).
- (Case \(\aleph = 0\)): We already know \(\aleph_0 = \aleph_0 \cdot \aleph_0\).
(Case \(\aleph > 0\)): For all \(\beta < \aleph\), we have \(\aleph_{\beta} \cdot \aleph_{\beta}\). The main step is to establish \(\operatorname{Ord}([\omega_{\alpha}] \times [\omega_{\alpha}], \prec) \leq \omega_{\alpha}\). Once we have proved this, we conclude the proof by \[ \aleph_{\alpha} \cdot \aleph_{\alpha} = \abs{[\omega_{\alpha}] \times [\omega_{\alpha}]} = \abs{\operatorname{Ord}([\omega_{\alpha}] \times [\omega_{\alpha}])} \leq \abs{\omega_{\alpha}} = \aleph_{\alpha}. \]
Let us prove \(\operatorname{Ord}([\omega_{\alpha}] \times [\omega_{\alpha}], \prec) \leq \omega_{\alpha}\). The proof is hairy and we're out of time.
Axiom of Choice
Alephs are the cardinals of well-orderable sets. We've also seen well-orderable sets are closed under \(\times\) and \(+\), likewise for subsets and images. We have not yet shown well-orderable sets are closed under exponentiation, powerset, or infinite products. None of these preserve well-orderability without further assumptions.
In particular, we will discuss the Axiom of Choice today, which will imply that all sets are well-orderable.
A choice function on a set \(S\) of sets is a function \(f : S \setminus \{\emptyset\} \rightarrow \Class{U}\) satisfying \[ \forall x \in S .\, X \neq \emptyset \implies f(x) \in x. \]
The following are equivalent for a set \(X\).
- \(X\) is well-orderable.
- \(\pot X\) has a choice function.
For the case \((1 \implies 2)\), let \(<\) be a well-order on \(X\). One choice function is \[ Y \neq \emptyset \mapsto \text{ the minimum element of } Y \text{ under } <. \] Now for \((2 \implies 1)\), define by transfinite recursion \(y_{(-)} : \Ord \rightarrow X \uplus \left\{ e \right\}\) by \[ y_{\alpha} = \begin{cases} g(X \setminus \left\{ y_{\beta} \mid \beta < \alpha \right\}) & \text{if } X \setminus \left\{ y_{\beta} \mid \beta < \alpha \right\} \neq \emptyset \\ e & \text{otherwise} \end{cases} \] By transfinite induction we prove
- \(\left\{ \alpha \mid y_{\alpha} \in X \right\}\) is an initial segment of \(\Ord\).
- If \(y_{\alpha} = y_{\beta} \in X\), then \(\alpha = \beta\).
- \(\left\{ \alpha \mid y_{\alpha} \in X \right\} = [\gamma]\) for some \(\gamma \in \Ord\).
- \(\{y_{\alpha} \mid \alpha < \gamma\} = X\)
It follows that \(y_{(-)} : [\gamma] \rightarrow X\) is a bijection. Hence \(X\) is well-orderable.
Which sets of sets can we prove to have choice functions?
- Any finite set of sets has a choice function.
- \(\pot(X)\), when \(X\) is well-orderable.
- In particular, \(\pot[\alpha]\) have choice functions.
We cannot prove, say, that \(\pot(\R)\) has a choice function.
Every set of sets has a choice function.
We will not assume it as an axiom, but first investigate its properties.
The following are equivalent.
- The axiom of choice
- (The well-ordering principle): Every set is well-orderable
- (Zorn's Lemma): If every chain in a partially ordered set has an upper bound, then the partially ordered set has a maximal element.
- \((1) \iff (2)\) by the previous theorem.
- \((3) \implies 1\): Let \(S\) be a set of nonempty sets. We want to construct a choice function for \(S\). Define a partial order \(F\) where \[ F = \left\{ f \mid f \text{ is a choice function on a subset } S' \subseteq S \right\} \] and \[ f \subseteq f' \iff \left( \dom(f) \subseteq \dom(f') \text{ and } \forall x \in \dom(f).\, f(x) = f'(x) \right). \] Let \(C \subseteq F\) be a chain. Then an upper bound is simply \[ \bigcup C, \text{ i.e. } X \mapsto f(X) \text{ for } f \in C \text{ and } x \in \dom(f). \] This is well-defined. Then \(F\) has a maximal element \(f_{\max}\). But \(\dom(f_{\max}) = S\).
- Let \(X, \leq\) be a partial order stisfying the assumptions of Zorn's Lemma. Let \(g\) be a choice function for \(\pot X\). Define \(y_{(-)} : \Ord \rightarrow X \uplus \left\{ e \right\}\) by transfinite recursion. \[ y_{\alpha} = \begin{cases} g(\left\{ x \in X \mid x \text{ is a strict upper bound for } \left\{ y_{\beta} \mid \beta < \alpha \right\} \right\}) \\ \text{if } \left\{ y_{\beta} \mid \beta < \alpha \right\} \subseteq X \text{ and the set of strict upper bounds is nonempty.} \\ e & \text{otherwise} \end{cases} \] By transfinite induction we prove that \(\left\{ \alpha \in \Ord \mid y_{\alpha} \in X \right\}\) is an initial segment of\(\Ord\) and equals \([\gamma]\) for some \(\gamma\) and \(\left\{ y_{\alpha} \mid \alpha < \gamma \right\}\) is a chain whose upper bound is a maximal element.
Each of the following is equivalent to the axiom of choice.
- Every surjective function has a section, i.e. if \(X \xrightarrow{g} Y\) is surjective, then \(\exists Y \xrightarrow{f} X\) such that \(g \circ f : Y \rightarrow Y = \operatorname{id}_Y\).
- Every equivalence relation has a set of representatives, i.e. if \(\sim \subseteq X \times X\) is an equivalence relation, then \(\exists B \subseteq X\) such that for every equivalence class \(A\), we have \(A \cap B\) is a singleton.
- Every cartesian product of nonempty sets is nonempty, i.e. given \(A_{(-)} : I \rightarrow \Class{\mathrm{Set}}\), \(\forall i \in I .\, A_i \neq \emptyset\), then \(\prod\limits_{i \in I} A_i \neq \emptyset\).
\noindent
Recall the axiom of dependent choice.
Given
\noindent We used this to show that every infinite set is Dedekind infinite. We can improve the result slightly.
Every countable set of sets has a choice function
\noindent This version suffices for the statement of infinity sizes.
Assuming the axiom of countable choice, a countable union of countable sets is countable.
Under the axiom of dependent choice, If a set has no descending chains, it is well-founded.
\[ (\mathrm{AC}) \implies (\mathrm{DC}) \implies (\mathrm{CC}) \]
First note that (CC) is equivalent to the statement that a countable product of nonempty sets is nonempty. Let us prove \((\mathrm{DC}) \implies \mathrm{CC}\). Suppose \((A_n)_{n \in \N}\) is a sequence of nonempty sets. Consider the relation on the set \(\sum\limits_{n \in N} A_n := \left\{ (n, x) \mid n \in \N, x \in A_n \right\}\). Then \[ (m, x)R(n, y) \iff n = m + 1 \] By (DC), \(\exists f : \N \rightarrow \sum\limits_{n \in \N} A_n\) such that \(f(0) = (0, x_0)\) and \(f(n) R f(n + 1)\). Then \[ \pi_2 \circ f \in \prod\limits_{n \in \N} A_n. \] Hence the product is nonempty.
An \(\alpha\)-sequence in \(X\) is a function \([\alpha] \rightarrow X\).
Suppose \(R \subseteq \bigcup\limits_{\beta < \alpha X^{[\beta]}} \times X\) is total. Then there exists \((x_{\beta}_{\beta < \alpha})\) satisfying \[ \forall \beta < \alpha .\, (x_{\gamma}_{\gamma < \beta}) R x_{\beta}. \]
\noindent It is somewhat difficult to find examples. There exist some in the field of stochastic analysis.
\[ (\mathrm{DC}) \iff (\mathrm{DC}_{\alpha}) \]
The axiom of choice is equivalent to \(\alpha\)-dependent choice for all ordinals: \[ (\mathrm{AC}) \iff \forall \alpha .\, (\mathrm{DC}_{\alpha}). \]
First assume the axiom of choice. Suppose \[ R \subseteq \left( \bigcup\limits_{\beta < \alpha} X^{[\beta]} \right) \times X \] is total. Let \(g\) be a choice function for \(\pot(X)\). We define
\begin{align*} x_{(-)} : [\alpha] &\rightarrow X \text{ by transfinite recursion} \\ x_{\beta} &= g \left\{ y \in X \mid (x_{\gamma})_{\gamma < \beta} R y \right\} \end{align*}By transfinite induction this satisfies the DCα property.
Now suppose \(X\) is not well-orderable. Let \(\alpha = \underline{h}(X)\) be the Hartogs' ordinal. Define \(R \subseteq \left( \bigcup\limits_{\beta < \alpha} X^{[\beta]} \right) \times X\) by \[ (x_{\gamma})_{\gamma < \beta} R y \iff y \in X \setminus \left\{ y_{\gamma} \mid \gamma < \beta \right\}. \] Then \(R\) is total because for any \(\beta < \alpha\) there is no surjective \(x_{(-)} : [\beta] \rightarrow X\) because \(X\) is not well-orderable. By DCα there exists a sequence \((x_{\beta}_{\beta < \alpha})\) satisfying the DCα property. By transfinite infuction that property and the one defined here imply \((x_{(-)}) : [\alpha] \rightarrow X\) is injective. This contradicts that \(\alpha = \underline{h}(X)\) is the Hartogs' ordinal.
\[ \alpha \leq \beta \implies (\mathrm{DC}_{\beta}) \implies (\mathrm{DC}_{\alpha}). \]
\[ (\mathrm{DC}_{\omega_{\alpha}}) \implies \forall \beta < \omega_{\alpha + 1}.\, (\mathrm{DC}_{\beta}) \]
Suppose \(P\) is a partial order satisfying
- every well-ordered chain has order type \(< \alpha\),
- every ordered chain has an upper bound.
Then \(P\) has a maximal element.
For a limit ordinal \(\lambda\), we have \[ (\mathrm{DC}_{\lambda}) \iff (\mathrm{ZL}_{\lambda}) \]
Cardinal arithmetic with choice
For this lecture, we will assume the axiom of choice. We will still flag its usage. The axiom makes some things more straightforward and flattens a lot of concepts.
- Every infinite cardinality is an aleph. If \(X\) is infinite, then \(\abs{X} = \aleph_{\alpha}\) for a unique ordinal \(\alpha\).
- Every infinite cardinality is its own square. This already implies choice!
- Cardinalities are linearly ordered. Since infinte cardinalities are alephs, they inherit the order from the ordinals. This is also equivalent to choice.
- Under the axiom of choice, the relations \(<\) become equivalent to \(\ll\).
- For all infinite sets \(X, Y\), we have \[ \abs{X} + \abs{Y} = \max(\abs{X}, \abs{Y}) = \abs{X} \times \abs{Y}. \]
Suppose \(X_{(-)} : I \rightarrow \Class{\text{Set}}\) to be a family of sets. We defined \[ \sum\limits_{i\in I} X_i := \left\{ (i, x) \mid i \in I, x \in X_i \right\} \] and \[ \prod\limits_{i \in I} X_i := \left\{ f : I \rightarrow \bigcup\limits_{i \in I} X_i \mid \forall i \in I .\, f(i) \in X_i \right\} \] as the family of choice functions. We sometimes write a choice function as \((X_i)_{i \in I}\) and call it a family of elements.
These operations are functorial. If \((f_i : X_i \rightarrow Y_i)_{i\in I}\) then we can derive the functions
\begin{align*} \sum\limits_{i\in I} X_i &\rightarrow \sum\limits_{i \in I} Y_i & \sum\limits_{i\in I} X_i &\rightarrow \prod\limits_{i \in I} Y_i \\ \sum\limits_{i\in I} f_i := (i, x) &\mapsto (i, f_i(x)) & \prod\limits_{i\in I} f_i := g &\mapsto (i \mapsto f_i(g(i))) \end{align*}which obviously preserve identity and composition, i.e. are factorial.
Suppose \((X_i)_{i \in I}\) and \((Y_i)_{i\in I}\) enjoy the property that \[ \forall i \in I.\, \abs{X_i} \leq \abs{Y_i}, \] then \[ \abs{\sum\limits_{i\in I} X_i} \leq \abs{\sum\limits_{i\in I} Y_i} \text{ and } \abs{\prod\limits_{i\in I} X_i} \leq \abs{\prod\limits_{i\in I} Y_i} \]
By assumption, for every \(i \in I\), there is an injection \(f_i : X_i \rightarrow Y_i\). Applying functorial actions to \((f_i : X_i \rightarrow Y_i)\), we get functions \(\sum\limits_{_{i\in I}} f_i : \sum\limits_{i\in I} X_i \rightarrow \sum\limits_{i\in I} Y_i\) and \(\prod\limits_{_{i\in I}} f_i : \prod\limits_{i\in I} X_i \rightarrow \prod\limits_{i\in I} Y_i\). It is easily checked that both functions are injective.
\noindent Note the proof uses the axiom of choice when we take a specific family \(f_i : X_i \rightarrow Y_i\).
Suppose \((X_i)_{i \in I}\) and \((Y_i)_{i\in I}\) enjoy the property that \[ \forall i \in I.\, \abs{X_i} = \abs{Y_i}, \] then \[ \abs{\sum\limits_{i\in I} X_i} = \abs{\sum\limits_{i\in I} Y_i} \text{ and } \abs{\prod\limits_{i\in I} X_i} = \abs{\prod\limits_{i\in I} Y_i} \]
CSB.
It also makes sense to talk about operations on families of cardinals. If \((\kappa_i)_{i \in I}\) is a family of cardinals, i.e. initial ordinals, then we say
\begin{align*} \sum\limits_{i \in I} \kappa_i &:= \abs{\sum\limits_{i \in I} [\kappa_i]} \\ \prod\limits_{i \in I} \kappa_i &:= \abs{\prod\limits_{i \in I} [\kappa_i]} \end{align*}Note that \([\kappa_i]\) is the set of ordinals less than \(\kappa_i\).
Suppose \(\lambda\) and \(\kappa\) are cardinalities. We can consider the sum \[ \sum\limits_{\alpha < \lambda} \kappa = \lambda \cdot \kappa. \] i.e. self-sum \(\kappa\) \(\lambda\)-many times. It's true in general that \[ \sum\limits_{i \in I}^{} X = I \times X \] (because we have no dependency in the second component). Let us consider the general case.
Give name
Suppose \(\lambda\) is an infinite cardinal and \((\kappa_{\alpha})_{\alpha < \lambda}\) are nonzero cardinals. Then \[ \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} = \lambda \cdot \left( \sup_{\alpha < \lambda} \kappa_{\alpha} \right). \]
We need to prove \[ \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} = \lambda \cdot \kappa. \] We prove this by showing the two inequalities. Since \(\kappa_{\alpha} \leq \kappa\) we have \[ \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} \leq \sum\limits_{\alpha < \lambda}^{} \kappa = \lambda \cdot \kappa. \] by proposition.
We have \(\kappa_{\alpha} \leq \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha}\) so \(\kappa \leq \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha}\) because \(\kappa\) is a supremum. Also we have \(\lambda = \sum\limits_{\alpha < \lambda}^{} 1 \leq \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha}\) so we get \[ \lambda \cdot \kappa \leq \left( \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} \right) \cdot \left( \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} \right) = \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} \] because \(\sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} \) is its own square.
If \(\lambda \leq \sup_{\alpha < \lambda} \kappa_{\alpha}\) then \[ \sum\limits_{\alpha < \lambda}^{} \kappa_{\alpha} = \sup_{\alpha < \lambda} \kappa_{\alpha}. \]
If \((\kappa_i)_{i \in I}\) and \((\lambda_i)_{i \in I}\) are families of cardinals with \(\kappa_i \ll \lambda_i\) for every \(i \in I\). Then \[ \sum\limits_{i \in I}^{} \kappa_i \ll \prod\limits_{i \in I}^{} \lambda_i. \]
\noindent
We can use this to rederive Cantor's theorem.
For any cardinality \(\kappa\), we have \(\kappa < 2^{\kappa}\).
\[ \kappa = \sum\limits_{\alpha < \kappa}^{} 1 < \prod\limits_{\alpha < \kappa}^{} 2 = 2^{\kappa} \] where we use König's theorem for the strict inequality.
We prove the strict inequality by showing the nonstrict inequality and showing it is not equality.
We define an injective function
\begin{align*} \sum\limits_{i \in I}^{} [\kappa_i] &\rightarrow \prod\limits_{i \in I}^{} [\lambda_i] \\ (i, \alpha) &\mapsto \left(j \mapsto \begin{cases} \alpha & \text{ if } j = i \\ \kappa_j & \text{ if } j \neq i \end{cases} \right) \end{align*}It's easily checked that this is injective.
We prove there is no bijection \(\sum\limits_{i \in I}^{} [\kappa_i] \cong \prod\limits_{i \in I}^{} [\lambda_i]\). Suppose for contradiction there is. Note that we can view the sum set as an \(I\)-indexed partition of the whole sum set. If we transport this via the bijection, we end up transporting the partition in the product. This would give us an isomorphism to a disjoint union \[ \prod\limits_{i \in I}^{} [\lambda_i] = \bigcup\limits_{i \in I}^{} X_i \] where \(X_i\) is the image of \(\left\{ i \right\} \times [\kappa_i]\) under the bijection.
For every \(i \in I\) the set \[ Y_i = \left\{ x_i \mid x_{(-)} \in X_i \right\} \subseteq [\lambda_i] \] Since \(x_{(-)} \mapsto x_i\) is a surjection from \(X_i \rightarrow Y_i\), we have \(\abs{Y_i} \leq \abs{X_i} = \kappa_i < \lambda_i\). Therefore \([\lambda_i] \setminus Y_i\) is nonempty, hence has a least element. Consider now the function \[ i \in I \setminus \text{the smallest ordinal in } [\lambda_i] \setminus Y_i. \] This is clearly in \(\prod\limits_{i \in I}^{} [\lambda_i]\). On the other hand it is not in any \(X_i\). This is a contradiction to the partition isommorphism.
\noindent
Cardinal exponentiation is a special case of products. In particular, \[ \kappa^{\lambda} = \prod\limits_{\alpha < \lambda}^{}. \] Cardinal exponentiation enjoys expected laws. \[ \left( \prod\limits_{i \in I}^{} \kappa_i \right)^{\lambda} = \prod\limits_{i \in I}^{} \kappa_i^{\lambda} \text{ and } \kappa^{\sum\limits_{i \in I}^{} \lambda_i} = \prod\limits_{i \in I}^{} \left( \kappa^{\lambda_i} \right) \]
The question now is to find a \(\gamma\) so that \(\aleph_{\alpha}^{\aleph_{\beta}} = \aleph_{\gamma}\). This question arguably has no good answer. One potential answer is given by the generalised continuum hypothesis (GCH): For any infinite \(\aleph_{\alpha}\) we have \[ 2^{\aleph_{\alpha}} = \aleph_{\alpha + 1}. \] If we have this, everything can be answered. Sadly both it and its negation are independent from our axioms.
We can prove two things.
- (A) \(2^{\aleph_{\alpha}} \geq \aleph_{\alpha + 1}\)
- (B) \(\alpha \leq \beta \implies 2^{\aleph_{\alpha}} \leq 2^{\aleph_{\beta}}\)
For the next statement we will need a few more definitions.
For every \(\alpha\), we have \[ \operatorname{cf}(2^{\aleph_{\alpha}}) > \aleph_{\alpha}. \]
For a limit ordinal \(\gamma\), an increasing sequence of ordinals of length \(\gamma\) is \((\alpha_\beta)_{\beta < \gamma}\) satisfying \[ \beta < \beta' \implies \alpha_{\beta} < \alpha_{\beta'}. \] The limit of such a sequence is \[ \lim_{\beta \rightarrow \gamma} \alpha_{\beta} := \sup_{\beta < \gamma} \alpha_{\beta}. \]
If \(\alpha\) is a limit ordinal, then its cofinality \(\operatorname{cf}(\alpha)\) is the smallest ordinal \(\gamma\) such that \(\alpha\) is the limit of an increasing sequence of ordinals of length \(\gamma\).
- \(\operatorname{cf}(\omega) = \omega\), since we can see \(\omega\) as a limit of the naturals.
- \(\operatorname{cf}(\omega^2) = \omega\) since \(\omega^2 = \lim \omega, \omega \cdot 2, \omega \cdot 3, \cdots\)
- \(\operatorname{cf}(\omega_1) = \omega_1\) since \(\omega_1 = \lim_{\alpha \rightarrow \omega} \alpha\)
- Similarly, \(\operatorname{cf}(\omega_2) = \omega_2\) and so on.
- \(\operatorname{cf}(\omega_{\omega}) = \omega\) since \[ \omega_{\omega} = \lim \omega, \omega_1, \omega_2, \dots \] i.e. we approximate it with an \(\omega\)-chain of ordinals.
\noindent
Theorem: initiality of cofinality noindent
We always have that \(\operatorname{cf}(\alpha)\) is always an infinite initial ordinal, i.e. an \(\aleph\).
\noindent Thus we often write \(\operatorname{cf}(\omega) = \omega = \aleph_0\) and the like.
- \(\operatorname{cf}(\aleph_0) = \operatorname{cf}(\omega) = \omega = \aleph_0\)
- \(\operatorname{cf}(\aleph_1) = \operatorname{cf}(\omega_1) = \omega_1 = \aleph_1\)
- \(\operatorname{cf}(\aleph_2) = \operatorname{cf}(\omega_2) = \omega_2 = \aleph_2\)
- \(\operatorname{cf}(\aleph_{\omega}) = \operatorname{cf}(\omega_{\omega}) = \omega = \aleph_0\)
By the cofinality theorem, we get
\begin{align*} 2^{\aleph_0} &\neq \aleph_0 \\ &= \aleph_1 \text{ is possible} \\ &= \aleph_2 \text{ is possible} \\ &\vdots\\ &\neq \aleph_{\omega} \text{ is ruled out by the theorem} \\ &= \aleph_{\omega + 1} \text{ is possible} \\ &\vdots \end{align*}Before we prove the above theorem C, we will have a few technical lemmata.
Every infinite cardinal \(\kappa\) can be expressed as \[ \kappa = \sum\limits_{\alpha < \operatorname{cf}(\kappa)}^{} \kappa_{\alpha} \] where we have \(\kappa_{\alpha} < \kappa\) for every \(\alpha < \operatorname{cf}(\kappa)\).
By the definition of cofinality, we have \(\kappa = \lim_{\alpha \rightarrow \operatorname{cf}(\kappa)} \beta_{\alpha}\) where \((\beta_{\alpha})_{\alpha < \operatorname{cf}(\kappa)}\) is an increasing sequence. If we look at all the ordinals between \(\beta_i\) and \(\beta_{i+1}\), we can partition \([\kappa]\) as the disjoint union \[ [\kappa] = \bigcup_{\alpha < \operatorname{cf}(\kappa)} \left( [\beta_{\alpha}] \setminus \bigcup\limits_{\alpha' < \alpha}^{} [\beta_{\alpha'}] \right) = \bigcup\limits_{\alpha < \operatorname{cf}(\kappa)}^{} B_{\alpha} \] Each \(B_{\alpha}\) is a subset of \([\beta_{\alpha}]\) and \(\abs{B_{\alpha}} \leq \abs{\beta_{\alpha}} < \kappa\). Since this is a disjoint union, we have \[ \kappa = \abs{\bigcup\limits_{\alpha < \operatorname{cf}(\kappa)}^{} B_{\alpha}} = \sum\limits_{\alpha < \operatorname{cf}(\kappa)}^{} \abs{B_{\alpha}} \] So \(\kappa = \sum\limits_{\alpha < \operatorname{cf}(\kappa)}^{} \abs{B_{\alpha}}\) and each \(\abs{B_{\alpha}} < \kappa\).
We need to show \(\operatorname{cf}(2^{\aleph_{\alpha}}) > \aleph_{\alpha}\). By the lemma, we have that \[ 2^{\aleph_{\alpha}} = \sum\limits_{\beta < \operatorname{cf}(2^{\aleph_{\alpha}})}^{} \kappa_{\beta} \] where every \(\kappa_{\beta} < 2^{\aleph_{\alpha}}\). By König's theorem we have \[ 2^{\aleph_\alpha} = \sum\limits_{\beta < \operatorname{cf}(2^{\aleph_{\alpha}})}^{} \kappa_{\beta} < \prod\limits_{\beta < \operatorname{cf}(2^{\aleph_{\alpha}})}^{} 2^{\aleph_{\alpha}} = \left( 2^{\aleph_{\alpha}} \right)^{\operatorname{cf}(2^{\aleph_{\alpha}})} \]
Suppose for contradiction that we \(\operatorname{cf}(2^{\aleph_{\alpha}}) \leq \aleph_{\alpha}\). Then we continue the inequality chain above as \[ 2^{\aleph_\alpha} = \sum\limits_{\beta < \operatorname{cf}(2^{\aleph_{\alpha}})}^{} \kappa_{\beta} < \prod\limits_{\beta < \operatorname{cf}(2^{\aleph_{\alpha}})}^{} 2^{\aleph_{\alpha}} = \left( 2^{\aleph_{\alpha}} \right)^{\operatorname{cf}(2^{\aleph_{\alpha}})} \leq \left( 2^{\aleph_{\alpha}} \right)^{\aleph_{\alpha}} = 2^{\aleph_{\alpha} \cdot \aleph_{\alpha}} = 2^{\aleph_{\alpha}}. \] We proved \(2^{\aleph_{\alpha}} < 2^{\aleph_{\alpha}}\) which is a contradiction.
Applications of choice
Regular and inaccessible cardinals
- \(\operatorname{cf}(\alpha) \leq \alp
- If \(\alpha\) is not a cardinal (i.e. it is not an initial ordinal), then \(\operatorname{cf}(\alpha) < \alpha\).
- We have \(\operatorname{cf}(\operatorname{cf}(\alpha))\)
For any limit ordinal \(\alpha\), \(\operatorname{
It follows from points 2 and 3 in the cofinality lemma.
An infinite cardinal \(\kappa\) is regular if \(\operatorname{cf}(\kappa) = \kappa\). It is singular if \(\operatorname{cf}(\kappa) < \kappa\).
An infinite cardinal \(\aleph_{\alpha}\) is a successor cardinal if \(\alpha\) is a successor ordinal. It is a limit cardinal if \(\alpha\) is a limit ordinal.
Every infinite successor cardinal is regular.
\noindent Before proceeding to the proof we will have two lemmas. The first one we have already seen.
A cardinal \(\kappa\) is singular if and only if we have \[ \kappa = \sum\limits_{i \in I}^{} \kappa_i \] where \(\abs{I} < \kappa\) and every \(\kappa_i < \kappa\).
\((\implies)\) is immediate from lemma A.
For \((\impliedby)\), suppose \(\kappa = \sum\limits_{i \in I}^{} \kappa_i\), where \(\abs{I}, \kappa_i < \kappa\) . Then \[ \kappa = \abs{I} \cdot \sup_{i \in I} \kappa_i = \max (\abs{I}, \sup_{i \in I} \kappa_i). \] So \(\kappa = \sup_{i \in I} \kappa_i\), using some bijection on \(I\) with an ordinal.
So for some ordinal \(\lambda\) with \(\abs{\lambda} \leq \abs{I} < \kappa\) we have \[ \kappa = \lim_{\alphaa \rightarrow \lambda} k_{i_{\alpha}} \text{ where } \lambda < \kappa. \] Then \(\operatorname{cf}(\kappa) \leq \lambda < \kappa\) so \(\kappa\) is singular.
Suppose for contradiciton that \(\aleph_{\alpha + 1}\) is singular. By lemma B, we have that \[ \aleph_{\alpha + 1} = \sum\limits_{i \in I}^{} \kappa_i \] where \(\abs{I}, \kappa_i < \aleph_{\alpha + 1}\). So \(\abs{I}\) and \(\kappa_i \leq \aleph_{\alpha}\) and \[ \aleph_{\alpha + 1} \leq \sum\limits_{\beta < \aleph_{\alpha}}^{} \aleph_{\alpha} = \aleph_{\alpha} \cdot \aleph_{\alpha} = \aleph_{\alpha}. \]
An infinite cardinal is weakly inaccessible if it is an uncountable, regular limit cardinal.
\noindent We have various large cardinal axioms,
There exists a weakly inaccessible cardinal.
\noindent Note that we can prove neither WIC nor \(\neg\)WIC from the axioms.
Suppose \(\aleph_{\lambda}\) is a weakly inaccessible cardinal. This means \(\lambda\) is a limit cardinal. We show that \(\lambda = \aleph_{\lambda}\). We have \[ \aleph_{\lambda} = \lim_{\alpha \rightarrow \lambda} \aleph_{\alpha} \] as \(\lambda\) is a limit ordinal. We have \(\lambda \geq \operatorname{\aleph_{\lambda}} = \aleph_{\lambda}\). Note that we always have \(\alpha \leq \omega_{\alpha} = \aleph_{\alpha}\), so indeed \(\aleph_{\lambda} = \lambda\).
We had that \(\kappa\) is a limit cardinal if and only if for any \(\aleph_{\alpha} < \kappa \), we also have \(\aleph_{\alpha+ 1} < \kappa\). We can expand the definition slightly.
\(\kappa\) is a strong limit cardinal if for any \(\kappa' < \kappa\), we also have \(2^{\kappa'} < 2^{\kappa}\).
\noindent For instance, \(\aleph_{\omega}\) is a limit cardinal, but we cannot prove it is a strong limit cardinal.
We have the so called beth cardinals: \[ \underbrace{\aleph_0}_{\beth_0}, \underbrace{2^{\aleph_0}}_{\beth_1}, \underbrace{2^{2^{\aleph_0}}}_{\beth_2}, \cdots \text{limit} &= \beth_{\omega} \]
\noindent Every strong limit cardinal is a limit cardinal. If the generalised continuum hypothesis holds, then every limit cardinal is a strong limit.
A cardinal is strongly inaccessible if it is an uncountable, regular, strong limit cardinal.
There exists a strongly inaccessible cardinal.
\noindent
A Grothendieck universe is a set \(\mathcal{U}\) satisfying
- (Transitivity): Whenever \(X \in \mathcal{U}\) is a set and \(x \in X\), it holds that \(x\in \mathcal{U}\).
- (Unordered pair): If \(x, y \in \mathcal{U}\), then \(\left\{ x, y \right\} \in \mathcal{U}\)
- (Powerset): If \(x, y \in \mathcal{U}\), then \(\pot X \in \mathcal{U}\)
- (Unions): If \(I \in \mathcal{U}\) is a set and \((X_i)_{i \in I}\) is a family where every \(X_i \in \mathcal{U}\) is a set, we have \[ \bigcup\limits_{i \in I}^{} X_i \in \mathcal{U}. \]
Some examples of Grothendieck unvierses are
- \(\emptyset\)
- The set of all hereditarily finite sets
\noindent In some sense these are all the examples, as elucidated by the following theorem.
The following are equivalent.
- There exists a Grothendieck universe containing an infinite set.
- SIC, i.e. the existence of a strongly inaccessible cardinal.
To show \((1) \implies (2)\), consider the limit \(\sup \left\{ \alpha \in \Ord \mid \operatorname{vN}(\alpha) \in \mathcal{U} \right\}\). This is a strongly inaccessibly cardinal.
On the other hand, if \(\kappa\) is strongly inaccessible, then \(V_{\kappa}\) is a Grothendieck universe.
\noindent Let us elucidate the \(V_{\kappa}\) from the cumulative hierarchy.
We define the cumulative hierarchy by transfinite recursion. We consider the map \(V_{(-)} : \Ord \rightarrow \Class{Set}\). Then
\begin{align*} V_0 &:= \emptyset \\ V_{\alpha + 1} &:= \pot V_{\alpha} \\ V_{\lambda} := \bigcup\limits_{\alpha < \lambda}^{} V_{\alpha} \text{ for } \lambda \text{ a limit ordinal.} \end{align*}\noindent The full cumulative hierarchy \[ \Class{V} := \bigcup\limits_{\alpha}^{} V_{\alpha} \] is a proper class. In particular, this is the class of all sets that are
- hereditary,
- well-founded.
Note however, that for every \(\alpha\), it is obvious that \(V_{\alpha}\) is a set.
- If we have \(\alpha \leq \beta\), it follows that \(V_{\alpha} \subseteq V_{\beta}\).
- For a finite \(n\), \(V_n\) has the cardinality \(2^{2^{\udots }^{2^0}}\) for \(n\) 2s.
- \(V_{\omega}\) is the set of hereditarily finite sets.
- We have \(\operatorname{vN}(\omega) \in V_{\omega + 1}\) and also \(\operatorname{vN}(\alpha) \in V_{\alpha + 1}\)
if \(X \in V_{\alpha}\), then we have \(\pot X \in V_{\alpha + 1}\).
The perfect set property
We will focus on the substes of \(\R\) and similar structures. Today we will assume only dependent choice, not the full axiom of choice.
The perfect set property is that every uncountable subset of \(\R\) contains a perfect subset.
A polish space is a topological space defined by a metric topology of a complete separable metric.
- \(\R\)
- \(\N^{\N}\) with metric \[ d(f, g) = \begin{cases} 0 & \text{ if } f = g \\ \frac{1}{2^n} & \text{where } n \text{ is the smallest number such that} f(n) \neq g(n) \text{ otherwise} \end{cases} \]
\noindent
What are the possible cardinalities of the open subsets of \(\R\)? We can only have 0 and \(2^{\aleph_0}\). Regarding the closed subsets, we can have any finite number, \(\aleph_0\) and \(2^{\aleph_0}\). We will be able to prove that these are the only options. This means we will prove from the axioms of set theory that any violation of the continuum hypothesis will have to be a nonclosed set.
The only possible cardinalities of closed subsets of \(\R\) are finite, \(\aleph_0\) and \(2^{\aleph_0}\).
\noindent This will follow from two theorems.
In any Polish space \(X\), the collection of closed sets enjoys the perfect set property: every uncountable closed set has a perfect subset.
In a Polish space \(X\), every perfect set has cardinality \(2^{\aleph_0}\).
A perfect set in a Polish space is a subset \(Y\) that is nonempty, closed, and has no isolated points.
Perfect sets in \(\R\) are \(\R\) and \([a, b]\) for \(a < b\).
\(K_0 = [0, 1]\), and we throw out the (open) middle intervals on each step. Then we define \[ K_{\infty} = \bigcap\limits_{n \in \N}^{} K_n. \] This is a decreasing intersection of nonempty compact sets, hence its intersection is a nonempty compact set. We have \[ K_{\infty} = \bigcap\limits_n^{} K_n = \bigcap\limits_n^{} \bigcup\limits_{\overset{w \in \left\{ 0, 1 \right\}^{ * }}{\abs{w} = n} }^{} D_n = \bigcup\limits_{f : \N \rightarrow \left\{ 0, 1 \right\}}^{} \bigcap\limits_{n \in \N}^{} D_{f(0), \dots, f(n - 1)} = \bigcup\limits_{f : \N \rightarrow \left\{ 0, 1 \right\}}^{} \left\{ K_s \right\} \] Hence we have \(\abs{K_{\infty} = 2^{\aleph_0}}\). No \(K_f\) is isolated. Consider \(K_f\) and any \(\varepsilon > 0\). Let \(n\) be such that \(\frac{1}{3^n} < \varepsilon\). Then any \(f'\)with \(f'(m) = f(m)\) for any \(m \leq n\) enjoys that \[ \abs{K_f - K_{f'}} \leq \frac{1}{3^n} < \varepsilon. \]
\noindent We will prove the above theorems for the special case of \(\R\).
Every perfect subset of \(\R\) has cardinality \(2^{\aleph_0}\).
If \(A \subseteq \R\) is perfect and there exists a \(x \in A\) with lower bound \(a < x\), then at least one of the two sets below is perfect: \[ A \cap [a, \infty) \text{ or } A \cap (a, \infty) \]
We construct a family of bounded perfect subsets of \(A\), namely \((A_w)_{w \in \left\{ 0, 1 \right\}}^{ * }\). Then \[ A_{\varepsilon} = \text{ some bounded subset of } A \text{ obtained using lemma}. \] Suppose we have \(A_w\). We define \(a = \min(A_w)\) and \(b = \max(A_w)\). Then we get \[ A_{w0} =\begin{cases} [a, \frac{2}{3} a + \frac{1}{3}b] \cap A_w & \text{ if perfect} [a, \frac{2}{3} a + \frac{1}{3}b) \cap A_w & \text{otherwise} \end{cases} \] and \[ A_{w1} = \begin{cases} [a, \frac{1}{3} a + \frac{2}{3}b] \cap A_w & \text{ if perfect} [a, \frac{1}{3} a + \frac{2}{3}b) \cap A_w & \text{otherwise} \end{cases} \] Define \[ A_{\infty} = \bigcap\limits_{n \in \N}^{} \bigcup\limits_{\overset{\left\{ 0, 1 \right\}^{ * }}{\abs{w} = n}}^{} A_w. \] This works by the same arguments as for the Cantor set \(\abs{A_{\infty}} = 2^{\aleph_0}\).
We will prove this for the special case of \(\R\), i.e. every uncountable closed set \(A \subseteq \R\) contains a perfect subset.
For every closed set \(A\) define \[ A' = A \setminus \left\{ x \in A \mid x \text{ isolated in } A \right\} \] It follows that \(A'\) is again closed.
To construct our perfect subset, let's define
\begin{align*} A_0 &= A \\ A_{n + 1} &= A_n' \\ A_{\omega} &= \bigcap\limits_n^{} A_n \end{align*}But even \(A_{\omega}\) might still have some isolated points. So instead we consider transfinite induction
\begin{align*} A_0 &= A \\ A_{\alpha + 1} &= A_{\alpha}' \\ A_{\lambda} = \bigcap\limits_{\alpha < \lambda}^{} A_a \end{align*}Then we can define \[ A_{\infty} = \bigcap\limits_{a \in \Ord}^{} A_{\alpha}. \]
Let us define \(F_\alpha = A_{\alpha} \setminus A_{\alpha + 1}\), i.e. the isolated points in \(A_{\alpha}\). Likewise (TODO: pluscup) \[ F = \bigcup\limits_{\alpha \in \Ord}^{} F_\alpha. \] Then \(F\) is a countable set (which will let \(A_{\infty}\) remain uncountable). To see this, consider a function \(f : F \rightarrow I\) where \(I = \left\{ (p, q)\in \Q \times \Q \mid p < q \right\}\) where that \(f\) maps \[ y \in F_{\alpha} \mapsto \text{ some interval} (p_y, q_y) \text{ such that } (p_y, q_y) \cap A_\alpha = \left\{ y \right\}. \] This does not require choice since we can always map to the smallest such interval. This function is injective. Hence \(F\) is countable.
If for all \(\alpha < \omega_1\), \(F_{\alpha}\) were nonempty, then \[ \bigcup\limits_{\alpha <\omega_1}^{} F_{\alpha} \] would be uncountable. So \[ \exists \gamma < \omega_1 \text{ such thhat } F_{\gamma} = \emptyset. \] Let \(\gamma\) be the smallest such. But this means that \(A_\gamma' = A_{\gamma}\), in particular \(A_{\gamma}\) has no isolated points. This means for all \(\beta > \gamma\), we still have \(A_{\beta} = A_{\gamma}\).
This means that \(A_{\infty} = A_{\gamma}\) is closed. It has no isolated points. Because we started with an uncountable set \(A\) and removed a countable subset, our \(A \setminus F\) is still uncountable and in particular nonempty.
This means that \(A \setminus F\) is the perfect set we are looking for.
\noindent
Cantor invented the ordinals for this precise argument.
In descriptive set theory we consider well-behaved subsets of Polish spaces.
The Borel hierarchy is \[ \mathbf{\Sigma}_{\alpha}^{0} \text{ and }\mathbf{\Pi}_{\alpha}^{0} \] in particular this is the boldface hierarchy.
\begin{align*} \Sigma_1 &= \mathcal{O}(X) \text{ the set of /open/ subsets of } X \\ \Pi_1 &= \mathcal{A}(X) \text{ the set of /closed/ subsets of } X \\ \end{align*}We define the rest using transfinite recursion.
\begin{align*} \Sigma_{\alpha + 1} &= \left\{ \bigcup\limits_n^{} A_n \mid \text{ for every } n \in \N, A_n \in \Pi_{\alpha} \right\}\\ \Pi_{\alpha + 1} &= \text{ set of all countable intersections of sets in } \Sigma_{\alpha} \end{align*}and the limit states as
\begin{align*} \Sigma_{\lambda} = \left\{ \bigcup\limits_n^{} A_n \mid \text{ for every } n \in \N, A_n \in \Pi_{\alpha_n} \text{ for some } \alpha_n < \lambda \right\} \\ \Sigma_{\lambda} = \left\{ \bigcup\limits_n^{} A_n \mid \text{ for every } n \in \N, A_n \in \Sigma_{\alpha_n} \text{ for some } \alpha_n < \lambda \right\} \end{align*}- \(\Sigma_{\alpha} \subseteq \Pi_{\alpha + 1}\) and \(\Pi_{\alpha} \subseteq \Sigma_{\alpha + 1}\)
- \(\alpha \leq \beta \implies \Sigma_{\alpha} \subseteq \Sigma_{\beta}\) and \(\Pi_{\alpha} \subseteq \Pi_{\beta}\).
- \(\Sigma_{\alpha}\) is closed under countable unions and finite intersections, and \(\Pi_\alpha\) is closed under finite unions and finite intersections.
- \(B \in \Sigma_{\alpha} \iff X \setminus B \in \Pi_{\alpha}\).
- \(\Sigma_{\omega_1} = \Pi_{\omega_1} = \Sigma_{\omega_1 + 1} = \Pi_{\omega_1 + 1} = \Sigma_{\beta} = \Pi_{\beta}\) for \(\beta \geq \omega_1\).
\noindent By the lemma, \(\Sigma_{\omega_1}\) contains every open set and is closed under both countable union, countable intersection, and complement. The Borel subsets \(B\) are then the smallest collection of subsets of \(X\) containing all opens and closed under countable unions, intersections, and complements.
\(B = \Sigma_{\omega_1}\).
- \(B \subseteq \Sigma_{\omega_1}\) by the definition of \(B\)
- \(\Sigma_{\omega_1} \subseteq B\) is proved by taking \(\Sigma_{\alpha} \subseteq B\) by transfinite induction.
The Borel sets enjoy the perfect subset property.
\(\abs{B} = 2^{\aleph_0}\).
Compare this to \(\abs{\pot(\R)} = 2^{2^{\aleph_0}}\).
We prove by transfinite induction that for all \(\alpha\) we have \(\abs{\Sigma_{\alpha}} \leq 2^{\aleph_0}\) and \(\Pi_{\alpha} \leq 2^{\aleph_0}\).
For the step case \(\alpha + 1\), note that we have a surjection \(\Pi_{\alpha}^{\N} \xrightarrow{\cap} \Sigma_{\alpha + 1}\) so \[ \abs{\Sigma_{\alpha + 1}} \leq \Pi_{\alpha^{\N}} \leq (2^{\aleph_0})^{\aleph_0} = 2^{\aleph_0}. \]
Gale-Stewart games
Last week we proved that every uncountable closed set has a perfect subset. We showed as well this likewise applies to Borel sets and in particular the subsets of \(\R\). In general, this holds for subsets of a Polish space. An important Polish space is called a Baire space.
Today we will not assume the axiom of choice automatically.
The Baire space is \(\N^{\N}\) equipped with the metric \[ d(s, s') = \begin{cases} 0 & \text{ if } s = s' \\ 2^{-n} \text{ if } n \text{ is the least number such that } s(n) \neq s'(n). \end{cases} \]
- It holds that \(\N^{\N} \cong \R \setminus \Q\).
- The Baire space is not locally compact.
There exists an uncountable subset of \(\R\) that does not have a perfect subset.
There are \(2^{\aleph_0}\)-many perfect sets. We can enumerate all perfect sets \((P_{\alpha})_{\alpha < 2^{\aleph_0}}\), treating \(2^{\aleph}\) itself as an aleph. Proceed by transfinite recursion. Set \(A_0, B_0 := \emptyset\) and set\(A_{\alpha + 1}, B_{\alpha + 1}\) by choosing \[ a_{\alpha} \neq b_{\alpha} \in P_{\alpha} \setminus \bigcup\limits_{\beta \leq \alpha}^{} (A_{\beta} \cup B_\beta). \] and \[ A_{\lambda} := \bigcup\limits_{\alpha < \lambda}^{} A_{\alpha}, \quad B_{\lambda} :=\limits_{\alpha < \lambda}^{} B_{\alpha}. \] By transfinite induction we prove that \(A_{\alpha} \cap B_{\alpha} = \emptyset\) and \(\abs{A_{\alpha} = \abs{B_{\alpha}} = \abs{\alpha}}\).
Define \[ A = \bigcup\limits_{\alpha < 2^{\aleph_0}}^{} A_{\alpha}. \] A then does not contain any perfect subset because for any \(\alpha < 2^{\aleph_0}\) since \(b_{\alpha} \in P_\alpha\), but we have \(b_{\alpha} \not\in A\).
Consider the perfect subset game. This is a 2 player turn-based game played by I and II.. We start with a given goal set \(X \subseteq \R\) The associated game \(\operatorname{Perf}_X\) is played as follows.
The game starts in the state \(A_0 := \R\). I chooses rationals \(p < q < p' < q'\) such that \[ [p, q'] \subseteq A_{n - 1} \text{ and } \max(q - p, q' - p') < \frac{1}{2^n}. \] II responds by choosing 0 or 1. This determines \[ A_n := \begin{cases} [p, q] \text{ if } II \text{ chooses } 0 \\ [p', q'] \text{ if } II \text{ chooses } 1. \end{cases} \] At time \(\omega\), player I wins if for \(\left\{ x \right\} \:= \bigcap\limits_{n \in \N}^{} A_n\) if we have \(x \in X\) and player II wins if \(x \not\in X\).
Let \(X \subseteq \R\).
- Player I has a winning strategy if and only if \(X\) has a perfect subset.
- Player II has a winning strategy if and only if \(X\) is countable.
Note that (using AC) there exist sets \(X\) such that \(\operatorname{Perf}_X\) is not determined.
Consider the first point's \((\impliedby)\). Suppose \(X\) has a(n initial) perfect subset \(P_0 \subseteq \R\), then \(I\) plays as follows. Given \(A_{n-1} \supseteq P_{n - 1}\). Using the properties from last week we find \(p < q < p' < q'\) such that
- \([p, q]\) contains a perfect subset \(L \subseteq P_{n-1}\),
- \([p', q']\) contains a perect subset \(R \subseteq P_{n-1}\),
and both \(q - p < \frac{1}{2^n}\) and \(q' - p' < \frac{1}{2^n}\). If player II chooses 0, we take \(P_n = L\), if 1, take \(P_n = R\).
Player I wins because \[ \bigcap\limits_n^{} A_n = \bigcap\limits_{n }^{} P_n \subseteq X. \]
Consider the second point's \((\impliedby)\). Suppose \(X = \left\{ x_1, x_2, \dots \right\}\) is a countable set. Player II has the following strategy. At round \(n\), given player I's move \(p < q < p' < q'\), II chooses 0 if \(x_n \not\in [p, q]\) and 1 otherwise.
Now suppose player II has a winning strategy. We want to show \(X\) is countable. Let \(x \in X\) be arbitrary. Player I tries to force the play to \(\left\{ x \right\}\) as follows. Assuming we're at round \(n\)with \(x \in A_{n - 1}\) if possible we choose \(p_n < q_n < p'_n < q_n'\) such that after II's response we have \(x \in A_n\). Otherwise, we give up on \(x\) at position \(p_1 < q_1 < p'_1 < q'_1, \dots, p_{n-1} < q_{n - 1} < p'_{n - 1} < q'_{n - 1}.\)
Suppose we give up on \(x\) and \(y\) at position \(m_1, \dots, m_{n-1}\). This means \(x, y \in A_{n-1}\). Suppose \(x < y\). We choose \(p_n < q_n < p'_n < q'_n\) such that \(x \in [p_n, q_n]\) and \(y \in [p'_n, q'_n]\).
At any position in our play we give up on at most one element of \(X\). For every element \(x \in X\), there exists a position on which we give up on \(x\). \(X\) is then countable because there are only countably many positions.
A Gale-Stewart game is given as follows. Players I and II take turns to choose a natural number. A play is given as a sequence of numbers. \[
\begin{matrix} I: & n_0 & & n_2 & & n_4 & \cdots \\ II: & & n_1 & & n_3 & & \cdots \end{matrix}\] The game is given by specifying a goal set \(G \subseteq \N^{\N}\). Player I wins the play \((n_i)_{i \in \N}\) if \((n_i)_{i \in \N} \in G\) and II wins if \((n_i)_{i \in \N} \not\in G\).
A strategy for a player is a function
\begin{align*} \sigma_\mathrm{I} : &\left\{ w \in \N^{ * } \mid \abs{w} \text{ is even} \right\} \rightarrow \N \\ \sigma_{\mathrm{II}} : & \left\{ w \in \N^{ * } \mid \abs{w} \text{ is odd} \right\} \rightarrow \N \end{align*}A play \((n_i)_{i \in \N}\) is said to follow \(\sigma_\mathrm{I}\) if \[ \forall i .\, n_{2i} = \sigma_{\mathrm{I}} (n_0n_1,\dots, n_{2i - 1}). \] In particular, we have \(n_0 = \sigma_I(\varepsilon)\). A similar definition holds for \(\sigma_{\mathrm{II}}\).
A strategy is a winning strategy for I if for every play \((n_i)_{i \in \N}\) that follows the strategy \(\sigma_\mathrm{I}\), we have \((n_i)_{i \in \N} \in G\). Similar for \(\sigma_{\mathrm{II}}\) but we want \(\not\in G\).
For a game \(G \subseteq \N^{\N}\) it is not the case that both I and II have winning strategies.
G is determined if one of I and II has a winning strategy.
If \(G \subseteq \N^{\N}\) is closed (in the Baire space topology), then \(G\) is determined.
Suppose \(G \subseteq \N^{\N}\) is closed. Define \(\sigma_\mathrm{I}(w)\) for an even-length word as \[ \sigma_\mathrm{I}(w) = \begin{cases} n \\ 0 & \text{otherwise} \end{cases} \] where \(n\) is the smallest number such that II does not have a winning strategy from \(w_n\) if such an \(n\) exists.
We will prove that if II does not have a winning strategy, then \(\sigma_{\mathrm{I}}\) is winning for 1.
Suppose II does not have a winning strategy. Let \(s = s_0s_1s_2\cdots\) be any play following \(\sigma_{\mathrm{I}}\). We need to prove \(s \in G\). Proceed by induction on \(n\) that II does not have a winning strategy from \(s_0\cdots s_{n-1}\).
If \(n = 0\), our prefix word is empty, so II does not have a winning strategy by assumption. If \(n > 0\) and \(n\) is odd, so \(s_0, \dots, s_{n-1}\) is even, it's I's move, so the property holds by definition. If \(n\) is even, it's II's move. Since they don't currently have a winning strategy, this is preserved by any continuation.
There is a sequence of winning plays for I.
\begin{align*} & t^{(0)}\\ & s_0t^{(1)}\\ & s_0 s_1 t^{(2)}\\ & s_0 s_1 s_2t^{(3)}\\ & s_0 s_1 s_2 s_3t^{(4)}\\ & \vdots \end{align*}Then this is a Cauchy sequence in \(G\). Since \(G\) is closed, its limit is also in \(G\).
There is a more modern generalization of the Gale-Stewart theorem.
If \(G\) is a Borel set, then \(G\) is determined.
\noindent
Without the axiom of choice, we can consider the following axiom.
Every subset \(G \subseteq \N^{\N}\) is determined.
Ultrafilters
We will approach the subject of ultrafilters via ring theory.
Let \(R\) be a commutative ring with unit. An ideal is a subset \(I \subseteq R\) such that
- \(0 \in I\)
- \(x, y \in I \implies x + y \in I\)
- If \(x \in I\) and \(y \in R\), then \(x \cdot y \in I\).
\(I\) is prime if:
- \(1 \not\in I\)
- If we have \(p, q \in R\) such that \(p \cdot q \in I\), then we have either \(p \in I\) or \(q \in I\).
\(I\) is proper if \(I \subsetneq R\). This is equivalent to \(1 \not\in I\).
\(I\) is maximal if \(I\) is proper and for any proper \(J\) we have \(I \subseteq J \implies J = i\).
If \(I\) is maximal, then \(I\) is prime.
For any commutative ring \(R\) and proper ideal \(I \subsetneq R\), \(I\) extends to some maximal ideal \(J \supseteq I\).
\noindent In particular, this statement is equivalent to the axiom of choice.
We prove the theorem using Zorn's lemma. Consider the set \(P := \left\{ K \mid K \text{ is a proper ideal and } K \supseteq I \right\}\). Paritally order \(P\) by \(\subseteq\). It is easily checked that the conditions of ZL are satisfied. By Zorn's lemma, \(P\) has a maximal element.
A (bounded) lattice \(P\) is a partially ordered set with finite suprema (joins) written as \(\vee\) (including a least element \(\bot\)), and finite infima (meets) marked as \(\wedge\), including a maximum element \(\top\).
If \(P\) is a lattice, then the following are equivalent.
- \(\forall a, b, c.\, a \land (b \lor c) = (a \land b) \lor (a \land c)\)
- \(\forall a, b, c.\, a \lor (b \land c) = (a \lor b) \land (a \lor c)\)
We call a lattice satisfying these properties a distributive lattice.
A simple nondistributive lattice is given by the Hasse diagram
\begin{tikzpicture} \node (top) at (0,2) {$\top$}; \node (a) at (-1,1) {$a$}; \node (b) at (0,1) {$b$}; \node (c) at (1,1) {$c$}; \node (bot) at (0,0) {$\bot$}; \draw (bot)--(a)--(top); \draw (bot)--(b)--(top); \draw (bot)--(c)--(top); \end{tikzpicture}\noindent
In a distributive lattice \(P\), \(x\) is the complement of \(y\) if \[ x \lor y = \top \quad \text{ and } \quad x \land y = \bot. \]
\noindent Note that under distributivity, complements are unique. If \(x \lor y = \top\) and \(x \land y = \bot\) and \(x \lor z = \top\) and \(x \land z = \bot\), we get that \(y = z\). This justifies considering the complement of an element.
We write \(x^{ce for the unique complement of \(x\)}\) if it exists.
A lattice is boolean if it is distributive and every element has a complement.
\noindent We can also define boolean structures in terms of rings.
A ring \(R\) is boolean if it is commutative, and multiplication is idempotent, i.e. \[ \forall x \in R .\, x \cdot x = x. \]
\noindent There is an equivalence between these concepts.
First suppose \(P\) is a boolean algebra. Then we can define
\begin{align*} 0 &= \bot & a + b &= a \oplus b = (a \land b^{c}) \land (a^c \land b)\\ 1 &= \top & a \cdot b &= a \land b \end{align*}We can check that the right properties hold. This construes \(P\) as a boolean ring.
Now suppose \(P\) is a boolean ring. Define \(a \leq b :\iff a \land b = a\). Then \(P\) is a boolean algebra, with
\begin{align*} \bot &= 0 & a \lor b &= a + b - a \cdot b \\ \top &= 1 & a \land b &= a \cdot b \\ a^c &= 1 - a \end{align*}- These constructions are mutually inverse.
- A function \(f : P \rightarrow Q\) between boolean algebras/rings is a boolean algebra homomorphism if and only if it is a ring homomorphism.
\noindent Note that \(f : P \rightarrow Q\) is a boolean algebra homomorphism if
- \(f\) preserves finite joins (including \(\bot\))
- \(f\) preserves finite meets (including \(\top\))
- \(f\) preserves complements.
Note that any two of these suffice to define the third.
Suppose \(P\) is a booleaan algebra. The following are equivalent for a subset \(I \subseteq P\).
- \(I\) is an ideal with respect to the boolean ring structure on \(P\)
- \(\bot \in I\),
- if \(a, b \in I \implies a \lor b \in I\),
- if \(a \in I, b \in P \implies a \land b \\in I\). Equivalently, \(I\) is down-closed, (\(a \in I, b \in P, b \leq a \implies b \in I\)).
Then \(I\) proper \(\iff \top \not\mid I\) , and \(I\) is prime \(\iff \forall a, b .\, a \land b \in I \implies (a \in I \lor b \in I)\).
If \(P\) is a boolean algebra, then the following are equivalent for an ideal \(I \subseteq P\).
- \(I\) is maximal.
- \(I\) is prime.
- For every element \(a \in P\), either \(a \in I\) or \(a^c \in I\) and \(I\) is proper.
If \(I \subseteq P\) is a proper ideal in a boolean algebra, there exists a prime ideal \(J\) with \(J \supseteq I\).
\noindent This is literally Krull's theorem for boolean rings. We have that AC implies PIT, but not the other way around.
The following are equivalent (in set theory without AC).
- PIT
- Tychonoff's theorem for Hausdorff spaces.
- The completeness theorem for first-order logic
- Compactness
- The Ultrafilter theorem.
Suppose \(P\) is a boolean algebra. A subset \(F \subseteq P\) is a filter if
- \(\top \in F\)
- \(a, b \in F \implies a \land b \in F\)
- \(a \in F, b \in P, a \leq b \implies b \in F\).
A filter is proper if \(\bot \not\in F\). If is maximal if \(F\)is proper and \(F \subseteq G\) where \(G\)is a proper filter, then \(G = F\). It is prime if \(a \lor b \in F \implies (a \in F \lor b \in F)\). It is ultra if \(F\) is proper and \(\forall a \in P\) we have that \(a \in F \lor a^c \in F\).
If \(P\) is a boolean algebra, then the following are equivalent for a filter \(F \subseteq P\).
- \(F\) is maximal.
- \(F\) is prime.
- \(F\) is ultra.
Suppose \(P\) is a boolean algebra. For any proper filter \(F \subseteq P\), there exists an ultrafilter \(U \supseteq F\).
\noindent This is obviously equivalent to PIT.
For every set \(X\), \(\pot X\) ordered by \(\subseteq\) is a boolean algebra.
We say the ultrafilter on \(X\) to mean an ultrafilter on \(\pot X\). For any \(X\) and \(x \in X\) we can also define an ultrafilter \[ [x] := \left\{ S \subseteq X \mid x \in S \right\} \] which we call the principal ultrafilter.
An ultrafilter that is not principal is called non-principal.
- If \(X\) is finite, then every ultrafilter is principal.
- (assuming PIT) If \(X\) is infinite, then \(X\) has some non-principal ultrafilter.
Let us prove 2. We define \(K \subseteq \pot X\) by \[ K = \left\{ S \subseteq X \mid S \text{ is cofinite} \right\}. \] Here \(S\) being cofinite means that \(X \setminus S\) is finite.
There exists an ultrafilter \(U \supseteq K\). We show \(U\) does not contain any singleton, hence is nonprincipal. Suppose for contradiction that \(\left\{ x \right\} \in U\). Then \(X \setminus \left\{ u \right\} \in K \subseteq U\), so \[ (X \setminus \left\{ x \right\}) \cap \left\{ x \right\} \in U. \] This would lead to \(\emptyset \in U\) which leads to a contradiction.
For any \(X\) define \[ \mathcal{U}(X) = \text{ the set of all ultrafilters on } X. \] We have
\begin{align*} X &\rightarrow \mathcal{U}(X) \\ x &\mapsto [x] \end{align*}which we can view, under PIT, that \(\mathcal{U}(X)\) gives a free compact Hausdorff space over \(X\).
measurable cardinals
A filter \(F\) is \(\kappa\)-complete if for any \(A \subseteq F\), we have \(\abs{A} < \kappa \implies \bigcap A \in F\).
- For instance, being \(\aleph_0\)-complete means being closed under finite intersections, which every filter satisfies.
- \(\aleph_1\)-complete means closure under countable intersections. This is also known as \(\sigma\)-completeness.
A measurable cardinal is a cardinal \(\kappa > \aleph_0\) such that there exists a \(\kappa\)-complete nonprincipal ultrafilter on \([\kappa]\).
Suppose \(X\) carries a \(\sigma\)-complete nonprincipal ultrafilter. Then there exists a measurable cardinal \(\kappa \leq \abs{X}\).
Lemma A
Let \(\mathcal{U}\) be an ultrafilter on \(X\).
- Then \(\mathcal{U}\) is \(\kappa\) complete iff \(\mathcal{V}\) is closed under unions of cardinality \(< \kappa\).
- Then \(\mathcal{U}\) is \(\kappa\) complete iff \(\mathcal{V}\) is closed under disjoint unions of cardinality \(< \kappa\).
- If \(\mathcal{U}\) is nonprincipal and \(\kappa\)-complete, then \(\left\{ S \mid S \subseteq X \cap \abs{S} < \kappa \right\} \subseteq \mathcal{V}\)
Suppose we have \(X\) as in the theorem. Let \(\kappa \leq \abs{X}\) be the smallest cardinality such that \([\kappa]\) has a \(\sigma\)-complete nonprincipal ultrafilter \(\mathcal{U}\). We show that \(\mathcal{U}\) is \(\kappa\)-complete.
Suppose, for contradiction, that this is not so. By the second point of lemma A there is some \(\mathcal{A} \subseteq \mathcal{V}\) such that \(\abs{A} < \kappa\) and \(\bigcup \mathcal{A} \in \mathcal{U}\), where \(\mathcal{A}\) is pairwise disjoint. Define \(\mathcal{U} ' \subseteq \pot \mathcal{A}\) by \(\mathcal{U}' := \left\{ \mathcal{S} \subseteq \mathcal{A} \mid \bigcup\limits_{}^{} \mathcal{S} \in \mathcal{U} \right\}\). One can verify that \(\mathcal{U}'\) is a \(\sigma\)-complete non-principal ultrafilter on \(\mathcal{A}\). Then \(\mathcal{A}\) carries a \(\sigma\)-complete nonprincipal ultrafilter \(\abs{\mathcal{A}} < \kappa\), contradicting the smallness of \(\kappa\). Thus \(\kappa > \aleph_0\) by the third point of Lemma A.
Measurable cardinals are regular.
Suppose \(\kappa\) is measurable but not regular. Then \[ \kappa = \lim_{\alpha < \lambda} \beta_{\alpha} \text{ for some } \lambda, \beta_{\alpha} < \kappa. \] We have \[ [\kappa] = \bigcup\limits_{\alpha < \lambda}^{} [\beta_{\alpha}] = \bigcup\limits_{}^{} \left\{ [\beta_{\alpha}] \mid \alpha < \lambda \right\} \] Because \(\beta_{\alpha} < \kappa\), we have that \([\beta_{\alpha}] \in \mathcal{V}\), hence the union is also in \(\mathcal{V}\) because \(\lambda < \kappa\), so we have \([\kappa] \in \mathcal{V}\), contradicting \(\kappa \in \mathcal{U}\)
Measurable cardinals are strong limit cardinals.
Suppose \(\kappa\) is measurable and not a strong limit. Let \(\mathcal{U}\) be a \(\kappa\)-complete nonprincipal ultrafilter on \([\kappa]\). Let \(\lambda < \kappa\) be such that \(2^{\lambda} \geq \kappa\). Let \(X \subseteq [2^{\lambda}]\) be any subset of cardinality \(\kappa\).
For any \(\alpha < \lambda\), we have \[ \left\{ f \in X \mid f(\lambda) = 0 \right\} \cup \left\{ f \in X \mid f(zl) = 1 \right\} = X. \] So exactly one of the two sets belongs to \(\mathcal{U}\). Call this set \(S_{\alpha}\). Then \(\left\{ S_{\alpha} \mid \alpha < \lambda \right\}\) is a subset \(\mathcal{U}\) of cardinality \(\leq \lambda\). By \(\kappa\)-completeness, (since \(\lambda < \kappa\)), we have that \(\bigcap\limits_{}^{} \left\{ S_{\alpha} \mid \alpha < \lambda \right\} \in \mathcal{U}\). But \(\bigcap\limits_{\alpha}^{} S_{\alpha}\) is either empty of a singleton, hence not in \(\mathcal{U}\). This is a contradiction.
A (finite) powerset measure on a set \(X\) is a function \(\mu : \pot X \rightarrow [0, \infty)\) satisfying
- \(\mu(\emptyset) = 0\)
- \(\mu(X) > 0\)
- (countable additivity) For any countable family \(\mathcal{A}\) of pairwise disjoint subsets of \(X\), then \[ \mu \left( \bigcup\limits_{}^{} \mathcal{A} \right) = \sum\limits_{S \in \mathcal{A}}^{} \mu(S). \]
Further, \(\mu\) is nontrivial if \(\forall x \in X\), \(\mu(\left\{ x \right\}) = 0\). \(\mu\) is two-valued if \(\forall S \subseteq X.\, \mu(S) \in \left\{ 0, 1 \right\}\).
- If \(\mu\) is a two-valued powerset measure on \(X\), then \(\mathcal{U}_{\mu} \left\{ S \subseteq X \mid \mu(S) = 1 \right\}\) is a \(\sigma\)-complete ultrafilter on \(X\).
- Conversely, if \(\mathcal{U}\) is a \(\sigma\)-complete ultrafilter on \(X\), then \[ S \mapsto \begin{cases} 1 & \text{if} S \in \mathcal{U} \\ 0 & \text{otherwise} \end{cases} \] is a two-valued measure.
- The maps \(\mu \mapsto \mathcal{U}_{\mu}\) and \(\mathcal{U} \mapsto \mu_{\mathcal{U}}\) are mutually inverse.
- \(\mathcal{U}\) is a nonprincipal \(\sigma\)-complete ultrafilter if and only if \(\mu_{\mathcal{U}}\) is nontrivial.
If a set has some nontrivial two-valued powerset measure, then there exists a measurable cardinal \(\kappa \leq \abs{X}\).
All of this leads us to the point that only very large sets can carry 2-valued nontrivial powerset measures. The cardinality is \(\geq\) a measurable cardinal \(> 2^{\aleph_0}\). What can we say about measures on \(\R\)?
- Without AC: It is consistent to have a translation-invariant powerset measure on \([0, 1)\)
- Vitali: There is no translation-invariant powerset measure on \(\pot [0, 1)\).
We wonder as mathematicians what happens when we drop translation invariance. Can we find a nontrivial powerset measure on \([0, 1)\), or equivalently on \(\R\)?
Real valued measurable cardinal theorem
If a set \(X\) has a nontrivial powerset measure then there exists a real-valued measurable cardinal \(\kappa \leq \abs{X}\).
Real-valued-measurable-cardinals are weakly inaccessible!
Recall
- Every GS game with Borel goal set is determined
- There exist goal sets that are not determined
For example, a non-Borel subset of \(\N^{\N}\) is \[ \left\{ S \in \N^{\N} \mid \forall n \in \N .\, S_n > 0 \text{ and } S \text{ has an infinite subsequence} (u_i)_{i \geq 0} \right\} \] This set is analytic.
A subset \(A \subseteq P\) of a polish space is analytic if it is the image of a Borel set \(B \subseteq \N^{\N}\) under a continuous function \(f : \N^{\N} \rightarrow P\).
If \(A\) and \(A'\) are dosjoint analytic subsets of \(P\), then there exist Borel sets \(B, B'\) such that \(A \subseteq B\) and \(A' \subseteq B'\) and \(B \cap B' = \emptyset\).
If \(A\) and \(A^c\) are analytic, then both \(A\) and \(A^c\) are Borel.
Suppose there exists a measurable cardinal.
Every G-S game with analytic goal is determined.C
ZFC and independence results
We want to formulate ZFG using a formal language in which formulas of the language express properties of the mathematical universe. Formulas \(\phi, \psi, \dots\) are
\begin{align*} \phi &\Coloneqq x = y \mid x \in y \mid S(x) \\ &\mid \phi \land \psi \mid \phi \lor \psi \mid \phi \rightarrow \psi \mid \phi \leftrightarrow \psi \mid \neg \phi \\ &\mid \forall x .\, \phi \mid \exists x .\, \phi \end{align*}where \(S(x)\) expresses that \(S\) is a set. We also interpret
\begin{align*} \forall x \in y .\, \phi &\equiv S(y) \land \forall x .\, x \in y \implies \phi \\ \exists x \in y .\, \phi &\equiv S(y) \land \exists x .\, x \in y \land \phi \\ \reflectbox{S} x .\, \phi &\equiv \exists y .\, S(y) \land \forall x .\, (\phi \leftrightarrow x \in y) \end{align*}where in the end, \(y\)is fresh. Then we can express the axioms as
- (Memb): \(y \in x \rightarrow S(x)\)
- (Ext): \(\forall x \forall y .\, S(x) \land S(y) \land \forall z .\, (z \in x \leftrightarrow z\in y) \rightarrow x = y\)
- (Sep): \(S(x) \rightarrow \reflectbox{S}y .\, (y \in x) \land \phi\)
- (Pairing): \(\reflectbox{S}z .\, (z = x \lor z = y)\)
- (Powerset): \(S(x) \rightarrow \reflectbox{S}y .\, (S(y) \land \forall z \in y .\, z \in x)\)
- (Indexed union): \(S(x) \land (\forall y \in x .\, \reflectbox{S}z .\, \phi) \rightarrow (\reflectbox{S}z .\, \exists y .\, y \in x .\, \phi)\)
- (Infinity): \(\exists x .\, (\exists y \in x .\, .(S(s) \land \neg \exists z .\, z \in y)) \land \forall y \in x .\, (S(y) \land \exists z \in x .\, \forall w .\, (w \in z \leftrightarrow w \in y \lor w = y))\)
The axioms give us a precise notion of formal proof. For any sentence \(\psi\) we can ask, do we have \(\operatorname{Axioms} \vdash \psi\)?
All results in the course are provable from the axioms (assuming additional ones where needed). In particular, the formulation of ZF is simply our axioms with the addition of the assertion of \(S(x)\), as well as adding foundation: \[ (\exists y .\, y \in x) \rightarrow (\exists y \in x .\, \forall z \in x .\, z \not\in y).s \] We take ZFC as adding the axiom of choice.
If \(T\) is a set of axioms, we have a precise mathematical relation \[ T \vdash \phi \text{ meaning }\phi \text{ is provable from } T. \] We therefore also have a precise meaning for \[ T \not\vdash \phi \] i.e. that \(\phi\) is not provable. We can equivalently state this as \[ \operatorname{con}(T + \neg \phi), \] i.e. that \(T\) is consistent with the negation of \(\phi\). There are significant examples of consistency proofs, e.g.
\begin{align*} \operatorname{con}(ZF) &\implies \operatorname{con}(ZFC) & \text{(Gödel)} \\ \operatorname{con}(ZF) &\implies \operatorname{con}(ZF + \neg AC) & \text{(Cohen)} \\ \operatorname{con}(ZF) &\implies \operatorname{con}(ZF + GCH) & \text{(Gödel)} \\ \operatorname{con}(ZF) &\implies \operatorname{con}(ZFC + \neg CH) & \text{(Cohen)} \end{align*}Let's see how these sorts of proofs are carried out, at a very high level. We think about models, i.e. \[ \operatorname{con}(T) \iff \exists M .\, M \vDash T. \] Then to prove \(\operatorname{con}(T_1) \implies \operatorname{con}(T_2)\) we give a construction of a model of \(T_2\) from any given model of \(T_1\).
Gödel's proof of \(\operatorname{con}(ZF) \implies \operatorname{con}(ZFC + GCH)\)
\(\operatorname{con}(ZF) \implies \operatorname{con}(ZFC + GCH)\)
Given a model \(\mathcal{M}\) of ZF, we extract a smaller inner model \(\mathcal{L}\) inside \(\mathcal{M}\) consisting of constructible sets. Constructibility is expressed by a formula: there is a class \(L\) of constructible sets. \(L\) has a definable well-ordering, hence \(\mathcal{L} \vDash AC\).
In general, \(\mathcal{L}\) satisfies Gödel's axiom of constructibility \(V = \mathcal{L}\).
Cohen's proof of \(\operatorname{con}(ZF) \implies \operatorname{con}(ZFC + \neg GH)\)
\(\operatorname{con}(ZF) \implies \operatorname{con}(ZFC + \neg GH)\)
Start off with a countable transitive model of ZFC \(\mathcal{M}\). Force an injection from \((\omega_2)^{\mathcal{M}} \xrightarrow{i} (\pot \N)^{\mathcal{M}}\) into a new model \(\mathcal{M}' := \mathcal{M}[i]\). In \(\mathcal{M}'\) we have \[ \abs{\omega_0} < \abs{\omega_1} < \abs{\omega_2} \leq \abs{(\pot \N)^{\mathcal{M}}} \leq \abs{(\pot \N)^{M'}} \]
Consistency results for inaccessible cardinals
We write WIC for the existence of a weakly inaccessible cardinal and SIC for a strongly inaccessible one.
\begin{align*} \operatorname{con}(ZF) &\implies \operatorname{con(ZF + \neg WIC)} \\ \operatorname{con}(ZF + WIC) &\implies \operatorname{con}(ZF + SIC) \end{align*}There are two forms of motivation for large cardinal axioms and strong principles.
- The intrinsic: one accepts an axiom on account of its own plausibility
- extrinsic: we accept an axiom on account of their good consequences
Last week we saw that MC implies analytic determinacy. We have \[ \operatorname{con}(ZFC + \operatorname{analytic determinacy}) \implies \operatorname{con}(ZFC + MC) \] We write \(\Sigma_1^1\) for analytic sets and \(\Pi_1^1\) for coanalytic sets, i.e. the complements of analytics. Then
\begin{align*} \Sigma_{n+1}^1 \coloneqq \operatorname{projections of } \Pi_n^1 \operatorname{ sets in } \N^{\N} \times X \xrightarrow{\pi_2} X \\ \Pi_{n+1}^1 \coloneqq \operatorname{projections of } \\Sigma_n^1 \operatorname{ sets in } \N^{\N} \times X \xrightarrow{\pi_2} X \\ \end{align*}A set is projective if it is in some \(\Sigma_n^1\) for some \(n\).
Projective determinacy (Martin/Steele)
IWC is the notion that there are infinitely many Woodin cardinals. Every projective G-S game is determined. We have \[ \operatorname{con}(ZFC + PD) \implies \operatorname{con}(ZFC + IWC). \] We can think about projective determinacy as a missing axiom for ZFC.
Other interesting axioms
- Freiling's axiom of symmetry - Wikipedia is supposedly another interesting axiom.
- Grothendieck's axiom - that every set is contained in a Grothendieck universe
- Axiom of broad infinity