Set Theory
Set theory is the mathematical
theory of well-determined collections, called sets, of objects that are
called members, or elements, of the set. Pure set theory deals
exclusively with sets, so the only sets under consideration are those whose
members are also sets. The theory of the hereditarily-finite sets,
namely those finite sets whose elements are also finite sets, the elements of
which are also finite, and so on, is formally equivalent to arithmetic. So, the
essence of set theory is the study of infinite sets, and therefore it can be
defined as the mathematical theory of the actual—as opposed to
potential—infinite.
The notion of set is so simple that
it is usually introduced informally, and regarded as self-evident. In set
theory, however, as is usual in mathematics, sets are given axiomatically, so
their existence and basic properties are postulated by the appropriate formal
axioms. The axioms of set theory imply the existence of a set-theoretic
universe so rich that all mathematical objects can be construed as sets. Also,
the formal language of pure set theory allows one to formalize all mathematical
notions and arguments. Thus, set theory has become the standard foundation for
mathematics, as every mathematical object can be viewed as a set, and every
theorem of mathematics can be logically deduced in the Predicate Calculus from
the axioms of set theory.
Both aspects of set theory, namely,
as the mathematical science of the infinite, and as the foundation of
mathematics, are of philosophical importance.
The origins
Set theory, as a separate mathematical discipline, begins in the work of
Georg Cantor. One might say that set theory was born in late 1873, when he made
the amazing discovery that the linear continuum, that is, the real line, is not
countable, meaning that its points cannot be counted using the natural numbers.
So, even though the set of natural numbers and the set of real numbers are both
infinite, there are more real numbers than there are natural numbers, which
opened the door to the investigation of the different sizes of infinity. See
the entry on the early development of set theory for a discussion of the origin
of set-theoretic ideas and their use by different mathematicians and
philosophers before and around Cantor’s time.
According to Cantor, two sets
and
have the same size, or cardinality, if
they are bijectable, i.e., the elements of
can be put in a one-to-one correspondence with the elements of .
Thus, the set
of natural numbers and the set
of real numbers have different cardinalities. In 1878 Cantor formulated the
famous Continuum Hypothesis (CH), which
asserts that every infinite set of real numbers is either countable, i.e., it
has the same cardinality as ,
or has the same cardinality as .
In other words, there are only two possible sizes of infinite sets of real
numbers. The CH is the most famous problem of set theory. Cantor himself
devoted much effort to it, and so did many other leading mathematicians of the
first half of the twentieth century, such as Hilbert, who listed the CH as the
first problem in his celebrated list of 23 unsolved mathematical problems
presented in 1900 at the Second International Congress of Mathematicians, in
Paris. The attempts to prove the CH led to major discoveries in set theory,
such as the theory of constructible sets, and the forcing technique, which
showed that the CH can neither be proved nor disproved from the usual axioms of
set theory. To this day, the CH remains open.
Early on, some inconsistencies, or
paradoxes, arose from a naive use of the notion of set; in particular, from the
deceivingly natural assumption that every property determines a set, namely the
set of objects that have the property. One example is Russell’s Paradox,
also known to Zermelo:
consider the property of sets of not
being members of themselves. If the property determines a set, call it A
, then A is a member of itself if and only
if A
is not a member of itself.
Thus, some collections, like the
collection of all sets, the collection of all ordinals numbers, or the
collection of all cardinal numbers, are not sets. Such collections are called proper
classes.
In order to avoid the paradoxes and
put it on a firm footing, set theory had to be axiomatized. The first
axiomatization was due to Zermelo (1908) and it came as a result of the need to
spell out the basic set-theoretic principles underlying his proof of Cantor’s
Well-Ordering Principle. Zermelo’s axiomatization avoids Russell’s Paradox by
means of the Separation axiom, which is formulated as quantifying over
properties of sets, and thus it is a second-order statement. Further work by
Skolem and Fraenkel led to the formalization of the Separation axiom in terms
of formulas of first-order, instead of the informal notion of property, as well
as to the introduction of the axiom of Replacement, which is also formulated as
an axiom schema for first-order formulas (see next section). The axiom of
Replacement is needed for a proper development of the theory of transfinite
ordinals and cardinals, using transfinite recursion (see Section 3). It is also needed to prove the
existence of such simple sets as the set of hereditarily finite sets, i.e.,
those finite sets whose elements are finite, the elements of which are also
finite, and so on; or to prove basic set-theoretic facts such as that every set
is contained in a transitive set, i.e., a set that contains all elements of its
elements (see Mathias 2001 for the weaknesses of Zermelo set theory). A further
addition, by von Neumann, of the axiom of Foundation, led to the standard axiom
system of set theory, known as the Zermelo-Fraenkel axioms plus the Axiom of
Choice, or ZFC.
Other axiomatizations of set theory,
such as those of von Neumann-Bernays-Gödel (NBG), or Morse-Kelley (MK), allow
also for a formal treatment of proper classes.
The axioms of set theory
ZFC is an axiom system formulated in first-order logic with equality and with only one binary relation symbol
for membership. Thus, we write to express that is a member of the set
The axioms of ZFC
· Extensionality: If two sets
and
· have the same elements, then they are equal.
· Null Set: There exists a set, denoted by
· and called the empty set, which has
no elements.
· Pair: Given any sets
and , there exists a set, denoted by , which contains and as its only elements. In particular, there exists the set which has
· as its only element.
· Power Set: For every set
there exists a set, denoted by and called the power set of , whose elements are all the subsets of
· .
· Union: For every set
, there exists a set, denoted by and called the union of , whose elements are all the elements of the elements of
· .
· Infinity: There exists an infinite set. In particular, there exists a set
that contains and such that if , then
· .
· Separation: For every set
and every given property, there is a set containing exactly the elements of that have that property. A property is given by a formula
of the first-order language of set theory.
Thus, Separation is not a single axiom but an axiom schema, that is, an infinite list of axioms, one for each formula
· .
· Replacement: For every given definable function with domain a set
· , there is a set whose elements are all the
values of the function.
Replacement is also an axiom schema, as definable functions are given by
formulas.
· Foundation: Every non-empty set
contains an -minimal element, that is, an element such that no element of
·
belongs to it.
These are the axioms of Zermelo-Fraenkel set theory, or ZF. The axioms of
Null Set and Pair follow from the other ZF axioms, so they may be omitted.
Also, Replacement implies Separation.
Finally, there is the Axiom of Choice (AC):
· Choice: For every set
of pairwise-disjoint non-empty sets, there exists a set that contains exactly one element from each set in
·
.
The AC was, for a long time, a controversial axiom. On the one hand, it is
very useful and of wide use in mathematics. On the other hand, it has rather
unintuitive consequences, such as the Banach-Tarski Paradox, which says that
the unit ball can be partitioned into finitely-many pieces, which can then be
rearranged to form two unit balls. The objections to the axiom arise from the
fact that it asserts the existence of sets that cannot be explicitly defined.
But Gödel’s 1938 proof of its consistency, relative to the consistency of ZF,
dispelled any suspicions left about it.
The Axiom of Choice is equivalent, modulo ZF, to the Well-ordering
Principle, which asserts that every set can be well-ordered, i.e., it can
be linearly ordered so that every non-empty subset has a minimal element.
Although not formally necessary, besides the symbol
one normally uses for convenience other auxiliary defined symbols. For
example,
expresses that
is a subset of ,
i.e., every member of
is a member of .
Other symbols are used to denote sets obtained by performing basic operations,
such as ,
which denotes the union of
and ,
i.e., the set whose elements are those of
and ;
or ,
which denotes the intersection of
and ,
i.e., the set whose elements are those common to
and .
The ordered pair
is defined as the set .
Thus, two ordered pairs
and
are equal if and only if
and .
And the Cartesian product
is defined as the set of all ordered pairs
such that
and .
Given any formula ,
and sets ,
one can form the set of all those elements of
that satisfy the formula .
This set is denoted by .
In ZF one can easily prove that all these sets exist.
The theory of transfinite ordinals and cardinals
In ZFC one can develop the Cantorian theory of transfinite (i.e., infinite) ordinal and cardinal numbers. Following the definition given by Von Neumann in the early 1920s, the ordinal numbers, or ordinals, for short, are obtained by starting with the empty set and performing two operations: taking the immediate successor, and passing to the limit. Thus, the first ordinal number is
. Given an ordinal , its immediate successor, denoted by , is the set . And given a non-empty set of ordinals such that for every the successor is also in , one obtains the limit ordinal . One shows that every ordinal is (strictly) well-ordered by , i.e., it is linearly ordered by and there is no infinite
-descending sequence. Also, every well-ordered set is isomorphic to a unique
ordinal, called its order-type.
Note that every ordinal is the set of its predecessors. However, the class
of all ordinals is not a set. Otherwise, would be an ordinal greater than all the ordinals, which is impossible. The first infinite ordinal, which is the set of all finite ordinals, is denoted by the Greek letter omega (). In ZFC, one identifies the finite ordinals with the natural numbers. Thus, , , , etc., hence is just the set
of natural numbers.
One can extend the operations of addition and multiplication of natural numbers to all the ordinals. For example, the ordinal
is the order-type of the well-ordering obtained by concatenating a well-ordered set of order-type and a well-ordered set of order-type . The sequence of ordinals, well-ordered by
, starts as follows
0, 1, 2,…,
,…, , , ,…, ,…, , …, ,…, , …,
, …
The ordinals satisfy the principle of transfinite induction: suppose that
is a class of ordinals such that whenever contains all ordinals smaller than some ordinal , then is also in . Then the class contains all ordinals. Using transfinite induction one can prove in ZFC (and one needs the axiom of Replacement) the important principle of transfinite recursion, which says that, given any definable class-function , one can define a class-function such that is the value of the function applied to the function restricted to
. One uses transfinite recursion, for example, in order to define properly
the arithmetical operations of addition, product, and exponentiation on the
ordinals.
Recall that an infinite set is countable if it is bijectable, i.e., it can be put into a one-to-one correspondence, with
. All the ordinals displayed above are either finite or countable. But the
set of all finite and countable ordinals is also an ordinal, called ,
and is not countable. Similarly, the set of all ordinals that are bijectable
with some ordinal less than or equal to
is also an ordinal, called ,
and is not bijectable with ,
and so on.
Cardinals
A cardinal is an ordinal that is not bijectable with any smaller ordinal. Thus, every finite ordinal is a cardinal, and
, , , etc. are also cardinals. The infinite cardinals are represented by the letter aleph (
) of the Hebrew alphabet, and their sequence is indexed by the ordinals. It
starts like this
, , , …, , , …, , …, , …, , …, , …,
, …
Thus,
, ,
, etc. For every cardinal there is a bigger one, and the limit of an
increasing sequence of cardinals is also a cardinal. Thus, the class of all
cardinals is not a set, but a proper class.
An infinite cardinal
is called regular if it is not the union of less than smaller cardinals. Thus, is regular, and so are all infinite successor cardinals, such as . Non-regular infinite cardinals are called singular. The first singular cardinal is , as it is the union of countably-many smaller cardinals, namely
.
The cofinality of a cardinal
, denoted by is the smallest cardinal such that is the union of -many smaller ordinals. Thus,
.
By the AC (in the form of the Well-Ordering Principle), every set
can be well-ordered, hence it is bijectable with a unique cardinal, called the cardinality of . Given two cardinals and , the sum is defined as the cardinality of the set consisting of the union of any two disjoint sets, one of cardinality and one of cardinality . And the product is defined as the cardinality of the Cartesian product . The operations of sum and product of infinite cardinals are trivial, for if and are infinite cardinals, then
.
In contrast, cardinal exponentiation is highly non-trivial, for even the value of the simplest non-trivial infinite exponential, namely
, is not known and cannot be determined in ZFC (see below). The cardinal is defined as the cardinality of the Cartesian product of copies of ; equivalently, as the cardinality of the set of all functions from into . König’s theorem asserts that , which implies that the cofinality of the cardinal , whatever that cardinal is, must be uncountable. But this is essentially all that ZFC can prove about the value of the exponential
.
In the case of exponentiation of singular cardinals, ZFC has a lot more to say. In 1989, Shelah proved the remarkable result that if
is a strong limit, that is, ,
for every ,
then
(see Shelah (1994)). The technique developed by Shelah to prove this and
similar theorems, in ZFC, is called pcf theory (for possible
cofinalities), and has found many applications in other areas of
mathematics.
The universe
of all sets
A posteriori, the ZF axioms other than Extensionality—which needs no justification because it just states a defining property of sets—may be justified by their use in building the cumulative hierarchy of sets. Namely, in ZF we define using transfinite recursion the class-function that assigns to each ordinal
the set
, given as follows:
· ·
· ·
, whenever
·
is a limit ordinal.
The Power Set axiom is used to obtain
from . Replacement and Union allow one to form for a limit ordinal. Indeed, consider the function that assigns to each the set . By Replacement, the collection of all the , for , is a set, hence the Union axiom applied to that set yields . The axiom of Infinity is needed to prove the existence of and hence of the transfinite sequence of ordinals. Finally, the axiom of Foundation is equivalent, assuming the other axioms, to the statement that every set belongs to some , for some ordinal . Thus, ZF proves that the set theoretic universe, denoted by , is the union of all the ,
an ordinal.
The proper class
, together with the relation, satisfies all the ZFC axioms, and is thus a model of ZFC. It is the intended model of ZFC, and one may think of ZFC as providing a description of
, a description however that is highly incomplete, as we shall see below.
One important property of
is the so-called Reflection Principle. Namely, for each formula , ZFC proves that there exists some that reflects it, that is, for every
,
holds in if and only if holds in
.
Thus,
cannot be characterized by any sentence, as any sentence that is true in must be also true in some initial segment . In particular, ZFC is not finitely axiomatizable, for otherwise ZFC would prove that, for unboundedly many ordinals ,
is a model of ZFC, contradicting Gödel’s second incompleteness theorem
The Reflection Principle encapsulates the essence of ZF set theory, for as shown by Levy (1960), the axioms of Extensionality, Separation, and Foundation, together with the Reflection Principle, formulated as the axiom schema asserting that each formula is reflected by some set that contains all elements and all subsets of its elements (note that the
are like this), is equivalent to ZF.
Set theory as the foundation of mathematics
Every mathematical object may be viewed as a set. For example, the natural numbers are identified with the finite ordinals, so
. The set of integers may be defined as the set of equivalence classes of pairs of natural numbers under the equivalence relation if and only if . By identifying every natural number with the equivalence class of the pair , one may extend naturally the operations of sum and product of natural numbers to (see Enderton (1977) for details, and Levy (1979) for a different construction). Further, one may define the rationals as the set of equivalence classes of pairs of integers, where , under the equivalence relation if and only if . Again, the operations and on may be extended naturally to . Moreover, the ordering on the rationals is given by: if and only if there exists such that . The real numbers may be defined as Dedekind cuts of , namely, a real number is given by a pair of non-empty disjoint sets such that , and for every and . One can then extend again the operations of and on , as well as the ordering , to the set of real numbers
.
Let us emphasize that it is not claimed that, e.g., real numbers are Dedekind cuts of rationals, as they could also be defined using Cauchy sequences, or in other different ways. What is important, from a foundational point of view, is that the set-theoretic version of
, together with the usual algebraic operations, satisfies the categorical
axioms that the real numbers satisfy, namely those of a complete ordered field.
The metaphysical question of what the real numbers really are is irrelevant
here.
Algebraic structures can also be viewed as sets, as any
-ary relation on the elements of a set can be viewed as a set of -tuples of elements of . And any -ary function on , with values on some set , can be seen as the set of -tuples such that is the value of on . Thus, for example, a group is just a triple , where is a non-empty set, is a binary function on that is associative, is an element of such that , for all , and for every there is an element of , denoted by , such that . Also, a topological space is just a set together with a topology on it, i.e., is a subset of containing and
, and closed under arbitrary unions and finite intersections. Any
mathematical object whatsoever can always be viewed as a set, or a proper
class. The properties of the object can then be expressed in the language of
set theory. Any mathematical statement can be formalized into the language of
set theory, and any mathematical theorem can be derived, using the calculus of
first-order logic, from the axioms of ZFC, or from some extension of ZFC. It is
in this sense that set theory provides a foundation for mathematics.
The foundational role of set theory for mathematics, while significant, is
by no means the only justification for its study. The ideas and techniques
developed within set theory, such as infinite combinatorics, forcing, or the
theory of large cardinals, have turned it into a deep and fascinating
mathematical theory, worthy of study by itself, and with important applications
to practically all areas of mathematics.
Metamathematics
The remarkable fact that virtually all of mathematics can be formalized
within ZFC, makes possible a mathematical study of mathematics itself. Thus,
any questions about the existence of some mathematical object, or the
provability of a conjecture or hypothesis can be given a mathematically precise
formulation. This makes metamathematics possible, namely the
mathematical study of mathematics itself. So, the question about the
provability or unprovability of any given mathematical statement becomes a
sensible mathematical question. When faced with an open mathematical problem or
conjecture, it makes sense to ask for its provability or unprovability in the
ZFC formal system. Unfortunately, the answer may be neither, because ZFC, if
consistent, is incomplete.
The incompleteness phenomenon
Gödel’s completeness theorem for first-order logic implies that ZFC is consistent—i.e., no contradiction can be derived from it—if and only if it has a model. A model of ZFC is a pair
, where is a non-empty set and is a binary relation on such that all the axioms of ZFC are true when interpreted in , i.e., when the variables that appear in the axioms range over elements of , and is interpreted as . Thus, if is a sentence of the language of set theory and one can find a model of ZFC in which holds, then its negation cannot be proved in ZFC. Hence, if one can find a model of and also a model of , then is neither provable nor disprovable in ZFC, in which case we say that
is undecidable in, or independent of, ZFC.
In 1931, Gödel announced his striking incompleteness theorems, which assert that any reasonable formal system for mathematics is necessarily incomplete. In particular, if ZFC is consistent, then there are undecidable propositions in ZFC. Moreover, Gödel’s second incompleteness theorem implies that the formal (arithmetical) statement
, which asserts that ZFC is consistent, while true, cannot be proved in ZFC. And neither can its negation. Thus,
is undecidable in ZFC.
If ZFC is consistent, then it cannot prove the existence of a model of ZFC, for otherwise ZFC would prove its own consistency. Thus, a proof of consistency or of undecidability of a given sentence
is always a relative consistency proof. That is, one assumes that
ZFC is consistent, hence it has a model, and then one constructs another model
of ZFC where the sentence
is true. We shall see several examples in the next sections.
The set theory of the continuum
From Cantor and until about 1940, set theory developed mostly around the study of the continuum, that is, the real line
. The main topic was the study of the so-called regularity properties, as
well as other structural properties, of simply-definable sets of real numbers,
an area of mathematics that is known as Descriptive Set Theory.
Descriptive Set Theory
Descriptive Set Theory is the study of the properties and structure of definable sets of real numbers and, more generally, of definable subsets of
and other Polish spaces (i.e., topological spaces that are homeomorphic to a separable complete metric space), such as the Baire space of all functions
, the space of complex numbers, Hilbert space, and separable Banach spaces.
The simplest sets of real numbers are the basic open sets (i.e., the open
intervals with rational endpoints), and their complements. The sets that are
obtained in a countable number of steps by starting from the basic open sets
and applying the operations of taking the complement and forming a countable
union of previously obtained sets are the Borel sets. All Borel sets
are regular, that is, they enjoy all the classical regularity
properties. One example of a regularity property is the Lebesgue
measurability: a set of reals is Lebesgue measurable if it differs from a
Borel set by a null set, namely, a set that can be covered by sets of basic
open intervals of arbitrarily-small total length. Thus, trivially, every Borel
set is Lebesgue measurable, but sets more complicated than the Borel ones may
not be. Other classical regularity properties are the Baire property
(a set of reals has the Baire property if it differs from an open set by a meager
set, namely, a set that is a countable union of sets that are not dense in any
interval), and the perfect set property (a set of reals has the
perfect set property if it is either countable or contains a perfect set,
namely, a nonempty closed set with no isolated points). In ZFC one can prove
that there exist non-regular sets of reals, but the AC is necessary for this
(Solovay 1970).
The analytic sets, also called
, are the continuous images of Borel sets. And the co-analytic, or
, sets are the complements of analytic sets.
Starting from the analytic (or the co-analytic) sets and applying the operations of projection (from the product space
to ) and complementation, one obtains the projective sets. The projective sets form a hierarchy of increasing complexity. For example, if is co-analytic, then the projection is a projective set in the next level of complexity above the co-analytic sets. Those sets are called , and their complements are called
.
The projective sets come up very naturally in mathematical practice, for it turns out that a set
of reals is projective if and only if it is definable in the structure
That is, there is a first-order formula
in the language for the structure such that for some
,
ZFC proves that every analytic set, and therefore every co-analytic set, is Lebesgue measurable and has the Baire property. It also proves that every analytic set has the perfect set property. But the perfect set property for co-analytic sets implies that the first uncountable cardinal,
, is a large cardinal in the constructible universe
namely a so-called inaccessible cardinal which implies that one cannot prove
in ZFC that every co-analytic set has the perfect set property.
The theory of projective sets of complexity greater than co-analytic is completely undetermined by ZFC. For example, in
there is a
set that is not Lebesgue measurable and does not have the Baire property,
whereas if Martin’s axiom holds. every such set has those regularity
properties. There is, however, an axiom, called the axiom of Projective
Determinacy, or PD, that is consistent with ZFC, modulo the consistency of some
large cardinals (in fact, it follows from the existence of some large
cardinals), and implies that all projective sets are regular. Moreover, PD
settles essentially all questions about the projective sets. See the entry on large
cardinals and determinacy for further details.
Determinacy
A regularity property of sets that subsumes all other classical regularity properties is that of being determined. For simplicity, we shall work with the Baire space
. Recall that the elements of are functions , that is, sequences of natural numbers of length . The space is topologically equivalent (i.e., homeomorphic) to the set of irrational points of . So, since we are interested in the regularity properties of subsets of , and since countable sets, such as the set of rationals, are negligible in terms of those properties, we may as well work with , instead of
.
Given
, the game associated to , denoted by , has two players, I and II, who play alternatively : I plays , then II plays , then I plays , and so on. So, at stage , player I plays and at stage , player II plays
. We may visualize a run of the game as follows:
After infinitely many moves, the two players produce an infinite sequence
of natural numbers. Player I wins the game if the sequence belongs to
. Otherwise, player II wins.
The game
is determined if there is a winning strategy for one of the players. A winning strategy for one of the players, say for player II, is a function from the set of finite sequences of natural numbers into , such that if the player plays according to this function, i.e., she plays at the
-th turn, she will always win the game, no matter what the other player
does.
We say that a subset
of is determined if and only if the game
is determined.
One can prove in ZFC—and the use of the AC is necessary—that there are non-determined sets. Thus, the Axiom of Determinacy (AD), which asserts that all subsets of
are determined, is incompatible with the AC. But Donald
Martin proved, in ZFC, that every Borel set is determined. Further, he showed
that if there exists a large cardinal called measurable then even the analytic
sets are determined. The axiom of Projective Determinacy
(PD) asserts that every projective set is determined. It turns out that PD
implies that all projective sets of reals are regular, and Woodin has shown
that, in a certain sense, PD settles essentially all questions about the
projective sets. Moreover, PD seems to be necessary for this. Another axiom, ,
asserts that the AD holds in ,
which is the least transitive class that contains all the ordinals and all the
real numbers, and satisfies the ZF axioms So,
implies that every set of reals that belongs to
is regular. Also, since
contains all projective sets,
implies PD.
The Continuum Hypothesis
The Continuum Hypothesis (CH), formulated by Cantor in 1878, asserts that every infinite set of real numbers has cardinality either
or the same cardinality as . Thus, the CH is equivalent to
.
Cantor proved in 1883 that closed sets of real numbers have the perfect set property, from which it follows that every uncountable closed set of real numbers has the same cardinality as
. Thus, the CH holds for closed sets. More than thirty years later, Pavel
Aleksandrov extended the result to all Borel sets, and then Mikhail Suslin to
all analytic sets. Thus, all analytic sets satisfy the CH. However, the efforts
to prove that co-analytic sets satisfy the CH would not succeed, as this is not
provable in ZFC.
In 1938 Gödel proved the consistency of the CH with ZFC. Assuming that ZF is
consistent, he built a model of ZFC, known as the constructible universe,
in which the CH holds. Thus, the proof shows that if ZF is consistent, then so
is ZF together with the AC and the CH. Hence, assuming ZF is consistent, the AC
cannot be disproved in ZF and the CH cannot be disproved in ZFC.
See the entry on the continuum hypothesis for the current status of the
problem, including the latest results by Woodin.
Gödel’s constructible universe
Gödel’s constructible universe, denoted by
, is defined by transfinite recursion on the ordinals, similarly as , but at successor steps, instead of taking the power set of to obtain , one only takes those subsets of that are definable in , using elements of as parameters. Thus, letting to denote the set of all the subsets of that are definable in the structure by a formula of the language of set theory, using elements of
as parameters of the definition, we let
· ·
· ·
, whenever
·
is a limit ordinal.
Then
is the union of all the , for an ordinal, i.e.,
.
Gödel showed that
satisfies all the ZFC axioms, and also the CH. In fact, it satisfies the Generalized Continuum Hypothesis (GCH), namely , for every ordinal
.
The statement
, called the axiom of constructibility, asserts that every set belongs to . It holds in
, hence it is consistent with ZFC, and implies both the AC and the GCH.
The proper class
, together with the relation restricted to
, is an inner model of ZFC, that is, a transitive (i.e.,
it contains all elements of its elements) class that contains all ordinals and
satisfies all the ZFC axioms. It is in fact the smallest inner model of ZFC, as
any other inner model contains it.
More generally, given any set
, one can build the smallest transitive model of ZF that contains and all the ordinals in a similar manner as , but now starting with the transitive closure of , i.e., the smallest transitive set that contains , instead of . The resulting model, , need not be however a model of the AC. One very important such model is
, the smallest transitive model of ZF that contains all the ordinals and all
the real numbers.
The theory of constructible sets owes much to the work of Ronald Jensen. He developed the so-called fine structure theory of
and isolated some combinatorial principles, such as the
diamond ()
and square (),
which can be used to carry out complicated constructions of uncountable
mathematical objects. Fine structure theory plays also an important role in the
analysis of bigger -like
models, such as
or the inner models for large cardinals
Forcing
In 1963, twenty-five years after Gödel’s proof of the consistency of the CH
and the AC, relative to the consistency of ZF, Paul Cohen (1966) proved the
consistency of the negation of the CH, and also of the negation of the AC,
relative to the consistency of ZF. Thus, if ZF is consistent, then the CH is
undecidable in ZFC, and the AC is undecidable in ZF. To achieve this, Cohen
devised a new and extremely powerful technique, called forcing, for
expanding countable transitive models of ZF.
Since the axiom
implies the AC and the CH, any model of the negation of the AC or the CH must violate . So, let’s illustrate the idea of forcing in the case of building a model for the negation of . We start with a transitive model of ZFC, which we may assume, without loss of generality, to be a model of . To violate we need to expand by adding a new set so that, in the expanded model, will be non-constructible. Since all hereditarily-finite sets are constructible, we aim to add an infinite set of natural numbers. The first problem we face is that may contain already all subsets of . Fortunately, by the Löwenheim-Skolem theorem for first-order logic, has a countable elementary submodel . So, since we are only interested in the statements that hold in , and not in itself, we may as well work with instead of , and so we may assume that itself is countable. Then, since is uncountable, there are plenty of subsets of that do not belong to . But, unfortunately, we cannot just pick any infinite subset of that does not belong to and add it to . The reason is that may encode a lot of information, so that when added to , is no longer a model of ZFC, or it is still a model of . To avoid this, one needs to pick with great care. The idea is to pick generic over , meaning that is built from its finite approximations in such a way that it does not have any property that is definable in and can be avoided. For example, by viewing as an infinite sequence of natural numbers in the increasing order, the property of containing only finitely-many even numbers can be avoided, because given any finite approximation to —i.e., any finite increasing sequence of natural numbers—one can always extend it by adding more even numbers, so that at the end of the construction will contain infinitely-many even numbers; while the property of containing the number 7 cannot be avoided, because when a finite approximation to contains the number 7, then it stays there no matter how the construction of proceeds. Since is countable, there are such generic . Then the expanded model , which includes and contains the new set , is called a generic extension of . Since we assumed is a transitive model of , the model is just , where is the supremum of the ordinals of . Then one can show, using the forcing relation between finite approximations to and formulas in the language of set theory expanded with so-called names for sets in the generic extension, that is a model of ZFC and is not constructible in , hence the axiom of constructibility
fails.
In general, a forcing extension of a model
is obtained by adding to a generic subset of some partially ordered set that belongs to . In the above example, would be the set of all finite increasing sequences of natural numbers, seen as finite approximations to the infinite sequence , ordered by ; and would be the set of all finite initial segments of
.
In the case of the consistency proof of the negation of the CH, one starts from a model
and adds
new subsets of ,
so that in the generic extension the CH fails. In this case one needs to use an
appropriate partial ordering
so that the
of
is not collapsed, i.e., it is the same as
the
of the generic extension, and thus the generic extension
will satisfy the sentence that says that there are
real numbers.
No comments:
Post a Comment