Math With Nazia: MathWithNaziaa : Set Theory

Set Theory

Set theory is the mathematical theory of well-determined collections, called sets, of objects that are called members, or elements, of the set. Pure set theory deals exclusively with sets, so the only sets under consideration are those whose members are also sets. The theory of the hereditarily-finite sets, namely those finite sets whose elements are also finite sets, the elements of which are also finite, and so on, is formally equivalent to arithmetic. So, the essence of set theory is the study of infinite sets, and therefore it can be defined as the mathematical theory of the actual—as opposed to potential—infinite.

The notion of set is so simple that it is usually introduced informally, and regarded as self-evident. In set theory, however, as is usual in mathematics, sets are given axiomatically, so their existence and basic properties are postulated by the appropriate formal axioms. The axioms of set theory imply the existence of a set-theoretic universe so rich that all mathematical objects can be construed as sets. Also, the formal language of pure set theory allows one to formalize all mathematical notions and arguments. Thus, set theory has become the standard foundation for mathematics, as every mathematical object can be viewed as a set, and every theorem of mathematics can be logically deduced in the Predicate Calculus from the axioms of set theory.

Both aspects of set theory, namely, as the mathematical science of the infinite, and as the foundation of mathematics, are of philosophical importance.

The origins

Set theory, as a separate mathematical discipline, begins in the work of Georg Cantor. One might say that set theory was born in late 1873, when he made the amazing discovery that the linear continuum, that is, the real line, is not countable, meaning that its points cannot be counted using the natural numbers. So, even though the set of natural numbers and the set of real numbers are both infinite, there are more real numbers than there are natural numbers, which opened the door to the investigation of the different sizes of infinity. See the entry on the early development of set theory for a discussion of the origin of set-theoretic ideas and their use by different mathematicians and philosophers before and around Cantor’s time.

According to Cantor, two sets $A$

and have the same size, or cardinality, if they are bijectable, i.e., the elements of can be put in a one-to-one correspondence with the elements of . Thus, the set of natural numbers and the set of real numbers have different cardinalities. In 1878 Cantor formulated the famous Continuum Hypothesis (CH), which asserts that every infinite set of real numbers is either countable, i.e., it has the same cardinality as , or has the same cardinality as . In other words, there are only two possible sizes of infinite sets of real numbers. The CH is the most famous problem of set theory. Cantor himself devoted much effort to it, and so did many other leading mathematicians of the first half of the twentieth century, such as Hilbert, who listed the CH as the first problem in his celebrated list of 23 unsolved mathematical problems presented in 1900 at the Second International Congress of Mathematicians, in Paris. The attempts to prove the CH led to major discoveries in set theory, such as the theory of constructible sets, and the forcing technique, which showed that the CH can neither be proved nor disproved from the usual axioms of set theory. To this day, the CH remains open.

Early on, some inconsistencies, or paradoxes, arose from a naive use of the notion of set; in particular, from the deceivingly natural assumption that every property determines a set, namely the set of objects that have the property. One example is Russell’s Paradox, also known to Zermelo:

consider the property of sets of not being members of themselves. If the property determines a set, call it A

, then A is a member of itself if and only if A

is not a member of itself.

Thus, some collections, like the collection of all sets, the collection of all ordinals numbers, or the collection of all cardinal numbers, are not sets. Such collections are called proper classes.

In order to avoid the paradoxes and put it on a firm footing, set theory had to be axiomatized. The first axiomatization was due to Zermelo (1908) and it came as a result of the need to spell out the basic set-theoretic principles underlying his proof of Cantor’s Well-Ordering Principle. Zermelo’s axiomatization avoids Russell’s Paradox by means of the Separation axiom, which is formulated as quantifying over properties of sets, and thus it is a second-order statement. Further work by Skolem and Fraenkel led to the formalization of the Separation axiom in terms of formulas of first-order, instead of the informal notion of property, as well as to the introduction of the axiom of Replacement, which is also formulated as an axiom schema for first-order formulas (see next section). The axiom of Replacement is needed for a proper development of the theory of transfinite ordinals and cardinals, using transfinite recursion (see Section 3). It is also needed to prove the existence of such simple sets as the set of hereditarily finite sets, i.e., those finite sets whose elements are finite, the elements of which are also finite, and so on; or to prove basic set-theoretic facts such as that every set is contained in a transitive set, i.e., a set that contains all elements of its elements (see Mathias 2001 for the weaknesses of Zermelo set theory). A further addition, by von Neumann, of the axiom of Foundation, led to the standard axiom system of set theory, known as the Zermelo-Fraenkel axioms plus the Axiom of Choice, or ZFC.

Other axiomatizations of set theory, such as those of von Neumann-Bernays-Gödel (NBG), or Morse-Kelley (MK), allow also for a formal treatment of proper classes.

The axioms of set theory

ZFC is an axiom system formulated in first-order logic with equality and with only one binary relation symbol $\in$

for membership. Thus, we write to express that is a member of the set

The axioms of ZFC

· Extensionality: If two sets $A$

and

· have the same elements, then they are equal.

· Null Set: There exists a set, denoted by $\emptyset$

· and called the empty set, which has no elements.

· Pair: Given any sets $A$

and , there exists a set, denoted by , which contains and as its only elements. In particular, there exists the set which has

· as its only element.

· Power Set: For every set $A$

there exists a set, denoted by and called the power set of , whose elements are all the subsets of

· .

· Union: For every set $A$

, there exists a set, denoted by and called the union of , whose elements are all the elements of the elements of

· .

· Infinity: There exists an infinite set. In particular, there exists a set $Z$

that contains and such that if , then

· .

· Separation: For every set $A$

and every given property, there is a set containing exactly the elements of that have that property. A property is given by a formula

of the first-order language of set theory.

Thus, Separation is not a single axiom but an axiom schema, that is, an infinite list of axioms, one for each formula $φ$

· .

· Replacement: For every given definable function with domain a set $A$

· , there is a set whose elements are all the values of the function.

Replacement is also an axiom schema, as definable functions are given by formulas.

· Foundation: Every non-empty set $A$

contains an -minimal element, that is, an element such that no element of

· belongs to it.

These are the axioms of Zermelo-Fraenkel set theory, or ZF. The axioms of Null Set and Pair follow from the other ZF axioms, so they may be omitted. Also, Replacement implies Separation.

Finally, there is the Axiom of Choice (AC):

· Choice: For every set $A$

of pairwise-disjoint non-empty sets, there exists a set that contains exactly one element from each set in

· .

The AC was, for a long time, a controversial axiom. On the one hand, it is very useful and of wide use in mathematics. On the other hand, it has rather unintuitive consequences, such as the Banach-Tarski Paradox, which says that the unit ball can be partitioned into finitely-many pieces, which can then be rearranged to form two unit balls. The objections to the axiom arise from the fact that it asserts the existence of sets that cannot be explicitly defined. But Gödel’s 1938 proof of its consistency, relative to the consistency of ZF, dispelled any suspicions left about it.

The Axiom of Choice is equivalent, modulo ZF, to the Well-ordering Principle, which asserts that every set can be well-ordered, i.e., it can be linearly ordered so that every non-empty subset has a minimal element.

Although not formally necessary, besides the symbol $\in$

one normally uses for convenience other auxiliary defined symbols. For example, expresses that is a subset of , i.e., every member of is a member of . Other symbols are used to denote sets obtained by performing basic operations, such as , which denotes the union of and , i.e., the set whose elements are those of and ; or , which denotes the intersection of and , i.e., the set whose elements are those common to and . The ordered pair is defined as the set . Thus, two ordered pairs and are equal if and only if and . And the Cartesian product is defined as the set of all ordered pairs such that and . Given any formula , and sets , one can form the set of all those elements of that satisfy the formula . This set is denoted by . In ZF one can easily prove that all these sets exist.

The theory of transfinite ordinals and cardinals

In ZFC one can develop the Cantorian theory of transfinite (i.e., infinite) ordinal and cardinal numbers. Following the definition given by Von Neumann in the early 1920s, the ordinal numbers, or ordinals, for short, are obtained by starting with the empty set and performing two operations: taking the immediate successor, and passing to the limit. Thus, the first ordinal number is $\emptyset$

. Given an ordinal , its immediate successor, denoted by , is the set . And given a non-empty set of ordinals such that for every the successor is also in , one obtains the limit ordinal . One shows that every ordinal is (strictly) well-ordered by , i.e., it is linearly ordered by and there is no infinite

-descending sequence. Also, every well-ordered set is isomorphic to a unique ordinal, called its order-type.

Note that every ordinal is the set of its predecessors. However, the class $O N$

of all ordinals is not a set. Otherwise, would be an ordinal greater than all the ordinals, which is impossible. The first infinite ordinal, which is the set of all finite ordinals, is denoted by the Greek letter omega (). In ZFC, one identifies the finite ordinals with the natural numbers. Thus, , , , etc., hence is just the set

of natural numbers.

One can extend the operations of addition and multiplication of natural numbers to all the ordinals. For example, the ordinal $α + β$

is the order-type of the well-ordering obtained by concatenating a well-ordered set of order-type and a well-ordered set of order-type . The sequence of ordinals, well-ordered by

, starts as follows

0, 1, 2,…, $n$

,…, , , ,…, ,…, , …, ,…, , …,

, …

The ordinals satisfy the principle of transfinite induction: suppose that $C$

is a class of ordinals such that whenever contains all ordinals smaller than some ordinal , then is also in . Then the class contains all ordinals. Using transfinite induction one can prove in ZFC (and one needs the axiom of Replacement) the important principle of transfinite recursion, which says that, given any definable class-function , one can define a class-function such that is the value of the function applied to the function restricted to

. One uses transfinite recursion, for example, in order to define properly the arithmetical operations of addition, product, and exponentiation on the ordinals.

Recall that an infinite set is countable if it is bijectable, i.e., it can be put into a one-to-one correspondence, with $ω$

. All the ordinals displayed above are either finite or countable. But the set of all finite and countable ordinals is also an ordinal, called , and is not countable. Similarly, the set of all ordinals that are bijectable with some ordinal less than or equal to is also an ordinal, called , and is not bijectable with , and so on.

Cardinals

A cardinal is an ordinal that is not bijectable with any smaller ordinal. Thus, every finite ordinal is a cardinal, and $ω$

, , , etc. are also cardinals. The infinite cardinals are represented by the letter aleph (

) of the Hebrew alphabet, and their sequence is indexed by the ordinals. It starts like this

$ℵ_{0}$

, , , …, , , …, , …, , …, , …, , …,

, …

Thus, $ω = ℵ_{0}$

, ,

, etc. For every cardinal there is a bigger one, and the limit of an increasing sequence of cardinals is also a cardinal. Thus, the class of all cardinals is not a set, but a proper class.

An infinite cardinal $κ$

is called regular if it is not the union of less than smaller cardinals. Thus, is regular, and so are all infinite successor cardinals, such as . Non-regular infinite cardinals are called singular. The first singular cardinal is , as it is the union of countably-many smaller cardinals, namely

The cofinality of a cardinal $κ$

, denoted by is the smallest cardinal such that is the union of -many smaller ordinals. Thus,

By the AC (in the form of the Well-Ordering Principle), every set $A$

can be well-ordered, hence it is bijectable with a unique cardinal, called the cardinality of . Given two cardinals and , the sum is defined as the cardinality of the set consisting of the union of any two disjoint sets, one of cardinality and one of cardinality . And the product is defined as the cardinality of the Cartesian product . The operations of sum and product of infinite cardinals are trivial, for if and are infinite cardinals, then

In contrast, cardinal exponentiation is highly non-trivial, for even the value of the simplest non-trivial infinite exponential, namely $2^{ℵ_{0}}$

, is not known and cannot be determined in ZFC (see below). The cardinal is defined as the cardinality of the Cartesian product of copies of ; equivalently, as the cardinality of the set of all functions from into . König’s theorem asserts that , which implies that the cofinality of the cardinal , whatever that cardinal is, must be uncountable. But this is essentially all that ZFC can prove about the value of the exponential

In the case of exponentiation of singular cardinals, ZFC has a lot more to say. In 1989, Shelah proved the remarkable result that if $ℵ_{ω}$

is a strong limit, that is, , for every , then (see Shelah (1994)). The technique developed by Shelah to prove this and similar theorems, in ZFC, is called pcf theory (for possible cofinalities), and has found many applications in other areas of mathematics.

The universe

of all sets

A posteriori, the ZF axioms other than Extensionality—which needs no justification because it just states a defining property of sets—may be justified by their use in building the cumulative hierarchy of sets. Namely, in ZF we define using transfinite recursion the class-function that assigns to each ordinal $α$

the set

, given as follows:

$V_{0} = \emptyset$

· ·

, whenever

· is a limit ordinal.

The Power Set axiom is used to obtain $V_{α + 1}$

from . Replacement and Union allow one to form for a limit ordinal. Indeed, consider the function that assigns to each the set . By Replacement, the collection of all the , for , is a set, hence the Union axiom applied to that set yields . The axiom of Infinity is needed to prove the existence of and hence of the transfinite sequence of ordinals. Finally, the axiom of Foundation is equivalent, assuming the other axioms, to the statement that every set belongs to some , for some ordinal . Thus, ZF proves that the set theoretic universe, denoted by , is the union of all the ,

an ordinal.

The proper class $V$

, together with the relation, satisfies all the ZFC axioms, and is thus a model of ZFC. It is the intended model of ZFC, and one may think of ZFC as providing a description of

, a description however that is highly incomplete, as we shall see below.

One important property of $V$

is the so-called Reflection Principle. Namely, for each formula , ZFC proves that there exists some that reflects it, that is, for every

$φ (a_{1}, \dots, a_{n})$

holds in if and only if holds in

Thus, $V$

cannot be characterized by any sentence, as any sentence that is true in must be also true in some initial segment . In particular, ZFC is not finitely axiomatizable, for otherwise ZFC would prove that, for unboundedly many ordinals ,

is a model of ZFC, contradicting Gödel’s second incompleteness theorem

The Reflection Principle encapsulates the essence of ZF set theory, for as shown by Levy (1960), the axioms of Extensionality, Separation, and Foundation, together with the Reflection Principle, formulated as the axiom schema asserting that each formula is reflected by some set that contains all elements and all subsets of its elements (note that the $V_{α}$

are like this), is equivalent to ZF.

Set theory as the foundation of mathematics

Every mathematical object may be viewed as a set. For example, the natural numbers are identified with the finite ordinals, so $N = ω$

. The set of integers may be defined as the set of equivalence classes of pairs of natural numbers under the equivalence relation if and only if . By identifying every natural number with the equivalence class of the pair , one may extend naturally the operations of sum and product of natural numbers to (see Enderton (1977) for details, and Levy (1979) for a different construction). Further, one may define the rationals as the set of equivalence classes of pairs of integers, where , under the equivalence relation if and only if . Again, the operations and on may be extended naturally to . Moreover, the ordering on the rationals is given by: if and only if there exists such that . The real numbers may be defined as Dedekind cuts of , namely, a real number is given by a pair of non-empty disjoint sets such that , and for every and . One can then extend again the operations of and on , as well as the ordering , to the set of real numbers

Let us emphasize that it is not claimed that, e.g., real numbers are Dedekind cuts of rationals, as they could also be defined using Cauchy sequences, or in other different ways. What is important, from a foundational point of view, is that the set-theoretic version of $R$

, together with the usual algebraic operations, satisfies the categorical axioms that the real numbers satisfy, namely those of a complete ordered field. The metaphysical question of what the real numbers really are is irrelevant here.

Algebraic structures can also be viewed as sets, as any $n$

-ary relation on the elements of a set can be viewed as a set of -tuples of elements of . And any -ary function on , with values on some set , can be seen as the set of -tuples such that is the value of on . Thus, for example, a group is just a triple , where is a non-empty set, is a binary function on that is associative, is an element of such that , for all , and for every there is an element of , denoted by , such that . Also, a topological space is just a set together with a topology on it, i.e., is a subset of containing and

, and closed under arbitrary unions and finite intersections. Any mathematical object whatsoever can always be viewed as a set, or a proper class. The properties of the object can then be expressed in the language of set theory. Any mathematical statement can be formalized into the language of set theory, and any mathematical theorem can be derived, using the calculus of first-order logic, from the axioms of ZFC, or from some extension of ZFC. It is in this sense that set theory provides a foundation for mathematics.

The foundational role of set theory for mathematics, while significant, is by no means the only justification for its study. The ideas and techniques developed within set theory, such as infinite combinatorics, forcing, or the theory of large cardinals, have turned it into a deep and fascinating mathematical theory, worthy of study by itself, and with important applications to practically all areas of mathematics.

Metamathematics

The remarkable fact that virtually all of mathematics can be formalized within ZFC, makes possible a mathematical study of mathematics itself. Thus, any questions about the existence of some mathematical object, or the provability of a conjecture or hypothesis can be given a mathematically precise formulation. This makes metamathematics possible, namely the mathematical study of mathematics itself. So, the question about the provability or unprovability of any given mathematical statement becomes a sensible mathematical question. When faced with an open mathematical problem or conjecture, it makes sense to ask for its provability or unprovability in the ZFC formal system. Unfortunately, the answer may be neither, because ZFC, if consistent, is incomplete.

The incompleteness phenomenon

Gödel’s completeness theorem for first-order logic implies that ZFC is consistent—i.e., no contradiction can be derived from it—if and only if it has a model. A model of ZFC is a pair $(M, E)$

, where is a non-empty set and is a binary relation on such that all the axioms of ZFC are true when interpreted in , i.e., when the variables that appear in the axioms range over elements of , and is interpreted as . Thus, if is a sentence of the language of set theory and one can find a model of ZFC in which holds, then its negation cannot be proved in ZFC. Hence, if one can find a model of and also a model of , then is neither provable nor disprovable in ZFC, in which case we say that

is undecidable in, or independent of, ZFC.

In 1931, Gödel announced his striking incompleteness theorems, which assert that any reasonable formal system for mathematics is necessarily incomplete. In particular, if ZFC is consistent, then there are undecidable propositions in ZFC. Moreover, Gödel’s second incompleteness theorem implies that the formal (arithmetical) statement $C O N (Z F C)$

, which asserts that ZFC is consistent, while true, cannot be proved in ZFC. And neither can its negation. Thus,

is undecidable in ZFC.

If ZFC is consistent, then it cannot prove the existence of a model of ZFC, for otherwise ZFC would prove its own consistency. Thus, a proof of consistency or of undecidability of a given sentence $φ$

is always a relative consistency proof. That is, one assumes that ZFC is consistent, hence it has a model, and then one constructs another model of ZFC where the sentence is true. We shall see several examples in the next sections.

The set theory of the continuum

From Cantor and until about 1940, set theory developed mostly around the study of the continuum, that is, the real line $R$

. The main topic was the study of the so-called regularity properties, as well as other structural properties, of simply-definable sets of real numbers, an area of mathematics that is known as Descriptive Set Theory.

Descriptive Set Theory

Descriptive Set Theory is the study of the properties and structure of definable sets of real numbers and, more generally, of definable subsets of $R^{n}$

and other Polish spaces (i.e., topological spaces that are homeomorphic to a separable complete metric space), such as the Baire space of all functions

, the space of complex numbers, Hilbert space, and separable Banach spaces. The simplest sets of real numbers are the basic open sets (i.e., the open intervals with rational endpoints), and their complements. The sets that are obtained in a countable number of steps by starting from the basic open sets and applying the operations of taking the complement and forming a countable union of previously obtained sets are the Borel sets. All Borel sets are regular, that is, they enjoy all the classical regularity properties. One example of a regularity property is the Lebesgue measurability: a set of reals is Lebesgue measurable if it differs from a Borel set by a null set, namely, a set that can be covered by sets of basic open intervals of arbitrarily-small total length. Thus, trivially, every Borel set is Lebesgue measurable, but sets more complicated than the Borel ones may not be. Other classical regularity properties are the Baire property (a set of reals has the Baire property if it differs from an open set by a meager set, namely, a set that is a countable union of sets that are not dense in any interval), and the perfect set property (a set of reals has the perfect set property if it is either countable or contains a perfect set, namely, a nonempty closed set with no isolated points). In ZFC one can prove that there exist non-regular sets of reals, but the AC is necessary for this (Solovay 1970).

The analytic sets, also called $Σ_{1}^{1}$

, are the continuous images of Borel sets. And the co-analytic, or

, sets are the complements of analytic sets.

Starting from the analytic (or the co-analytic) sets and applying the operations of projection (from the product space $R \times N$

to ) and complementation, one obtains the projective sets. The projective sets form a hierarchy of increasing complexity. For example, if is co-analytic, then the projection is a projective set in the next level of complexity above the co-analytic sets. Those sets are called , and their complements are called

The projective sets come up very naturally in mathematical practice, for it turns out that a set $A$

of reals is projective if and only if it is definable in the structure

That is, there is a first-order formula $φ (x, y_{1}, \dots, y_{n})$

in the language for the structure such that for some

ZFC proves that every analytic set, and therefore every co-analytic set, is Lebesgue measurable and has the Baire property. It also proves that every analytic set has the perfect set property. But the perfect set property for co-analytic sets implies that the first uncountable cardinal, $ℵ_{1}$

, is a large cardinal in the constructible universe

namely a so-called inaccessible cardinal which implies that one cannot prove in ZFC that every co-analytic set has the perfect set property.

The theory of projective sets of complexity greater than co-analytic is completely undetermined by ZFC. For example, in $L$

there is a set that is not Lebesgue measurable and does not have the Baire property, whereas if Martin’s axiom holds. every such set has those regularity properties. There is, however, an axiom, called the axiom of Projective Determinacy, or PD, that is consistent with ZFC, modulo the consistency of some large cardinals (in fact, it follows from the existence of some large cardinals), and implies that all projective sets are regular. Moreover, PD settles essentially all questions about the projective sets. See the entry on large cardinals and determinacy for further details.

Determinacy

A regularity property of sets that subsumes all other classical regularity properties is that of being determined. For simplicity, we shall work with the Baire space $N$

. Recall that the elements of are functions , that is, sequences of natural numbers of length . The space is topologically equivalent (i.e., homeomorphic) to the set of irrational points of . So, since we are interested in the regularity properties of subsets of , and since countable sets, such as the set of rationals, are negligible in terms of those properties, we may as well work with , instead of

Given $A \subseteq N$

, the game associated to , denoted by , has two players, I and II, who play alternatively : I plays , then II plays , then I plays , and so on. So, at stage , player I plays and at stage , player II plays

. We may visualize a run of the game as follows:

I

n_{0}

n_{2}

n_{4}

\dots

n_{2 k}

\dots


$I I$

n_{1}

n_{3}

\dots

\dots

n_{2 k + 1}

\dots

After infinitely many moves, the two players produce an infinite sequence $n_{0}, n_{1}, n_{2}, \dots$

of natural numbers. Player I wins the game if the sequence belongs to

. Otherwise, player II wins.

The game $G_{A}$

is determined if there is a winning strategy for one of the players. A winning strategy for one of the players, say for player II, is a function from the set of finite sequences of natural numbers into , such that if the player plays according to this function, i.e., she plays at the

-th turn, she will always win the game, no matter what the other player does.

We say that a subset $A$

of is determined if and only if the game

is determined.

One can prove in ZFC—and the use of the AC is necessary—that there are non-determined sets. Thus, the Axiom of Determinacy (AD), which asserts that all subsets of $N$

are determined, is incompatible with the AC. But Donald Martin proved, in ZFC, that every Borel set is determined. Further, he showed that if there exists a large cardinal called measurable then even the analytic sets are determined. The axiom of Projective Determinacy (PD) asserts that every projective set is determined. It turns out that PD implies that all projective sets of reals are regular, and Woodin has shown that, in a certain sense, PD settles essentially all questions about the projective sets. Moreover, PD seems to be necessary for this. Another axiom, , asserts that the AD holds in , which is the least transitive class that contains all the ordinals and all the real numbers, and satisfies the ZF axioms So, implies that every set of reals that belongs to is regular. Also, since contains all projective sets, implies PD.

The Continuum Hypothesis

The Continuum Hypothesis (CH), formulated by Cantor in 1878, asserts that every infinite set of real numbers has cardinality either $ℵ_{0}$

or the same cardinality as . Thus, the CH is equivalent to

Cantor proved in 1883 that closed sets of real numbers have the perfect set property, from which it follows that every uncountable closed set of real numbers has the same cardinality as $R$

. Thus, the CH holds for closed sets. More than thirty years later, Pavel Aleksandrov extended the result to all Borel sets, and then Mikhail Suslin to all analytic sets. Thus, all analytic sets satisfy the CH. However, the efforts to prove that co-analytic sets satisfy the CH would not succeed, as this is not provable in ZFC.

In 1938 Gödel proved the consistency of the CH with ZFC. Assuming that ZF is consistent, he built a model of ZFC, known as the constructible universe, in which the CH holds. Thus, the proof shows that if ZF is consistent, then so is ZF together with the AC and the CH. Hence, assuming ZF is consistent, the AC cannot be disproved in ZF and the CH cannot be disproved in ZFC.

See the entry on the continuum hypothesis for the current status of the problem, including the latest results by Woodin.

Gödel’s constructible universe

Gödel’s constructible universe, denoted by $L$

, is defined by transfinite recursion on the ordinals, similarly as , but at successor steps, instead of taking the power set of to obtain , one only takes those subsets of that are definable in , using elements of as parameters. Thus, letting to denote the set of all the subsets of that are definable in the structure by a formula of the language of set theory, using elements of

as parameters of the definition, we let

$L_{0} = \emptyset$

· ·

, whenever

· is a limit ordinal.

Then $L$

is the union of all the , for an ordinal, i.e.,

Gödel showed that $L$

satisfies all the ZFC axioms, and also the CH. In fact, it satisfies the Generalized Continuum Hypothesis (GCH), namely , for every ordinal

The statement $V = L$

, called the axiom of constructibility, asserts that every set belongs to . It holds in

, hence it is consistent with ZFC, and implies both the AC and the GCH.

The proper class $L$

, together with the relation restricted to

, is an inner model of ZFC, that is, a transitive (i.e., it contains all elements of its elements) class that contains all ordinals and satisfies all the ZFC axioms. It is in fact the smallest inner model of ZFC, as any other inner model contains it.

More generally, given any set $A$

, one can build the smallest transitive model of ZF that contains and all the ordinals in a similar manner as , but now starting with the transitive closure of , i.e., the smallest transitive set that contains , instead of . The resulting model, , need not be however a model of the AC. One very important such model is

, the smallest transitive model of ZF that contains all the ordinals and all the real numbers.

The theory of constructible sets owes much to the work of Ronald Jensen. He developed the so-called fine structure theory of $L$

and isolated some combinatorial principles, such as the diamond () and square (), which can be used to carry out complicated constructions of uncountable mathematical objects. Fine structure theory plays also an important role in the analysis of bigger -like models, such as or the inner models for large cardinals

Forcing

In 1963, twenty-five years after Gödel’s proof of the consistency of the CH and the AC, relative to the consistency of ZF, Paul Cohen (1966) proved the consistency of the negation of the CH, and also of the negation of the AC, relative to the consistency of ZF. Thus, if ZF is consistent, then the CH is undecidable in ZFC, and the AC is undecidable in ZF. To achieve this, Cohen devised a new and extremely powerful technique, called forcing, for expanding countable transitive models of ZF.

Since the axiom $V = L$

implies the AC and the CH, any model of the negation of the AC or the CH must violate . So, let’s illustrate the idea of forcing in the case of building a model for the negation of . We start with a transitive model of ZFC, which we may assume, without loss of generality, to be a model of . To violate we need to expand by adding a new set so that, in the expanded model, will be non-constructible. Since all hereditarily-finite sets are constructible, we aim to add an infinite set of natural numbers. The first problem we face is that may contain already all subsets of . Fortunately, by the Löwenheim-Skolem theorem for first-order logic, has a countable elementary submodel . So, since we are only interested in the statements that hold in , and not in itself, we may as well work with instead of , and so we may assume that itself is countable. Then, since is uncountable, there are plenty of subsets of that do not belong to . But, unfortunately, we cannot just pick any infinite subset of that does not belong to and add it to . The reason is that may encode a lot of information, so that when added to , is no longer a model of ZFC, or it is still a model of . To avoid this, one needs to pick with great care. The idea is to pick generic over , meaning that is built from its finite approximations in such a way that it does not have any property that is definable in and can be avoided. For example, by viewing as an infinite sequence of natural numbers in the increasing order, the property of containing only finitely-many even numbers can be avoided, because given any finite approximation to —i.e., any finite increasing sequence of natural numbers—one can always extend it by adding more even numbers, so that at the end of the construction will contain infinitely-many even numbers; while the property of containing the number 7 cannot be avoided, because when a finite approximation to contains the number 7, then it stays there no matter how the construction of proceeds. Since is countable, there are such generic . Then the expanded model , which includes and contains the new set , is called a generic extension of . Since we assumed is a transitive model of , the model is just , where is the supremum of the ordinals of . Then one can show, using the forcing relation between finite approximations to and formulas in the language of set theory expanded with so-called names for sets in the generic extension, that is a model of ZFC and is not constructible in , hence the axiom of constructibility

fails.

In general, a forcing extension of a model $M$

is obtained by adding to a generic subset of some partially ordered set that belongs to . In the above example, would be the set of all finite increasing sequences of natural numbers, seen as finite approximations to the infinite sequence , ordered by ; and would be the set of all finite initial segments of

In the case of the consistency proof of the negation of the CH, one starts from a model $M$

and adds new subsets of , so that in the generic extension the CH fails. In this case one needs to use an appropriate partial ordering so that the of is not collapsed, i.e., it is the same as the of the generic extension, and thus the generic extension will satisfy the sentence that says that there are real numbers.

Saturday, 6 August 2022

MathWithNaziaa : Set Theory

The origins

The axioms of set theory

The axioms of ZFC

The theory of transfinite ordinals and cardinals

Cardinals

The universe

of all sets

Set theory as the foundation of mathematics

Metamathematics

The incompleteness phenomenon

The set theory of the continuum

Descriptive Set Theory

Determinacy

The Continuum Hypothesis

Gödel’s constructible universe

Forcing

No comments:

Post a Comment

Saturday, 6 August 2022

MathWithNaziaa : Set Theory

The origins

The axioms of set theory

The axioms of ZFC

The theory of transfinite ordinals and cardinals

Cardinals

The universe V

of all sets

Set theory as the foundation of mathematics

Metamathematics

The incompleteness phenomenon

The set theory of the continuum

Descriptive Set Theory

Determinacy

The Continuum Hypothesis

Gödel’s constructible universe

Forcing

No comments:

Post a Comment

The universe