Appendix - Set Theory
A review of a few basic ideas from set theory is provided here. Set theory forms the foundation for almost all of modern mathematics. Indeed, one can say that modern mathematics is the study of sets. We will not make any attempts to define rigorously the notion of a set. Certain elementary definitions are provided here for convenient reference. More definitions will be introduced later on as and when they are required.
Describing a set
Loosely speaking, a set is a collection of elements. These elements are said to belong to the set. Given a set , the statement that an element belongs to is abbreviated as The negation of this statement, that does not belong to , is written as The set consisting of elements is written as
Example
Let us collect together the list of India’s Twenty20 cricket captains so far (as of 2019): (I have listed only the last names since the full names would make the list too long.) We denote the fact that was a captain of the Indian Twenty20 cricket team as Similarly, we denote the fact that was not a captain of the Indian Twenty20 cricket team using the notation Notice how the set-theoretic notation succinctly represents the more verbose statement ‘Tendulkar was not a captain of the Indian Twenty20 cricket team’.
Remark
Whenever you see a mathematical expression, keep in mind that it just summarizes a possibly long, or, complex idea - it is a good idea to read it out fully until you gain enough intuition to think abstractly.
A more powerful notation to represent a set is as follows. Suppose we are given a property that any element belonging to a set may, or may not, satisfy. The set of all elements in that satisfy the property is written as The above statement is read ‘the set of all in for which is true’.
Example
Continuing the earlier example involving the set of all Indian Twenty20 cricket team (as of 2019), notice that only and represented Mumbai, in the domestic matches. Using the notation just introduced, we see that the set consisting of just those Indian Twenty20 captains until 2019 who played for Mumbai as
It is helpful to postulate the existence of a special set called the empty set that contains nothing. The standard notation for the empty set is .
Example
Returning to the set of all Indian Twenty20 captains, it turns out that none of them were born in Mumbai. In set-theoretic notation, we write this as
Finally, we will always work within the confines of a universal set, which contains everything that we are dealing with in a particular context.
Example
In all the earlier examples involving the set of all Indian Twenty20 captains, a convenient choice of universal set is the set of all cricketers who represented India at the international level in cricket matches: Note that the set can be obtained from as follows:
Remark
Note that the choice of universal set is not unique. In the examples considered above, we might just as well have chosen the set of all Indians who were born, say, after 1900, as our universal set. In practice, the universal set is often implicitly understood from the context; it is always a good idea to be aware of it.
Numerical sets
While set theory is very broad in its purview, we will limit our discussion henceforth to studying the properties of special kinds of sets. In particular, we will find the following numerical sets to be very useful:
-
The set of all natural numbers: .
-
The set of all integers: .
-
The set of all rational numbers, which are numbers of the form where and , will be denoted by the symbol .
-
Finally, the set of all real numbers will be denoted by the symbol .
We will sometimes use the symbol to denote the set of non-negative real numbers: .
Remark
In the modern mathematical formalism, all the numerical sets introduced above are constructed from the empty set . Understanding how this is done is beyond the scope of these notes. Any good book on real analysis should, however, provide the necessary details.
All the discussions henceforth will be confined to numerical sets, or sets constructed from numerical sets, unless stated otherwise.
Subsets, inclusion and equality
A set is said to be a subset of another set , written , if, and only if, it is true that whenever , it is also the case that . Formally,
Remark
We will use a variety of logical qualifiers in these notes. The symbol stands for the logical qualifier for all. We also use the logical qualifiers and to represent the notions there exists, and there exists a unique, respectively. The symbol stands for implication. Thus, is read if , or only if . Finally, the symbol stands for equivalence. Thus, is read if, and only if, . We will henceforth abbreviate the phrase if, and only if as iff.
Example
We will use the following subsets of to be very useful: for any ,
The phrase is contained in is also used for . This can alternatively be stated as, , and we say that is a superset of , or that contains . is said to be a strict subset of , written , if is a subset of , and there exists an element such that . Formally,
Example
Note that the following inclusion relation holds among the numerical sets introduced earlier:
Finally, two sets and are said to be equal, written , if both and . In other words, every element in belongs to , and vice versa. Formally, This is an important strategy to prove the equality of two sets.
Example
Let us consider the two sets and For any , note that it is possible to find a unique such that . Thus, we see that . To prove the reverse inclusion, note that for any , , and hence it is true that . We have thus shown that
Families of sets
We will now introduce a useful notation that will serve us well in the subsequent development of set theory. A family of sets indexed by the set is a set whose elements are sets where . In symbols, The set is also called the index set for the family. It is also conventional to represent the family as
Example
Suppose that we have sets defined as follows: Let be the set consisting of the first natural numbers: We can now collect together all the sets into a single set , and index it using : The indexing notation is particularly convenient when dealing with index sets which are subsets of real numbers. For instance, defining the sets we write the set consisting of every such as
An important kind of family of sets is the power set of a given set , consisting of all the subsets of . Note that since for every set , .
Example
Consider the set . The power set of is the set Note that has elements. In general, the power set of a finite set consisting of elements has elements.
Remark
The power set of a given set is strictly larger than that set. Consider the set of all natural numbers. Intuitively, we think of this as an infinite set. Consider the power set of . Based on the comment just made, should have more elements in it than , which by itself has infinite elements. Does it make sense to say that something is larger than infinity? We will unfortunately not have the pleasure of studying this very interesting question in these notes.
Set operations
Given an initial collection of sets, we can construct new sets from the existing ones using a variety of set operations. We will study a few of the more important one here.
Union
The union of two sets and , written as , is defined as the set of elements that belong to either , or to , or to both. Symbolically, Given a family of sets , the union of the family is defined similarly as Two special cases are worth noting here. When , the union is written as When , it is written as
Example
Continuing the previous example involving the family and , we see that Similarly, it is easily checked that
Intersection
The intersection of two sets and , written , is defined as the set of elements that belong to both and . In symbols, The extension of the notation to the case of families of sets is analogous to the case of unions described earlier.
Two sets and are said to be disjoint if they have no element in common, . A collection of sets is said to be pairwise disjoint if every pair of two sets in the collection is disjoint.
Example
With the sets defined as earlier, note that Note also that for every . We therefore say that and are disjoint for every .
Set difference
The set difference of two sets and , denoted as , is defined as the set of all elements of that do not belong to : If a universal set is provided, the complement of a set , written as , is defined as the set difference of and . In symbols, .
Example
For any , we defined the set earlier as . If , then we see that Since , we can compute its complement in as
Cartesian products
A set containing two elements has no natural ordering among its elements. Thus, this set is exactly the same as the set . We define an ordered pair as a collection of two elements such that is the first element, and is the second element. Note that the ordered pair is different from the ordered pair .
Remark
The notion of an ordered pair can be defined entirely in terms of sets. Consider the set . This prescribes a set , and singles out a particular element, in this case. If we define , it is immediately obvious that the set is not the same as the set . While this shows how an order is fundamentally just a special set, it is almost never explicitly written out in practice.
The Cartesian product of two sets and is defined as the set that consists of all ordered pairs such that belongs to and belongs to :
Example
Consider the sets and . The Cartesian product of and in this case is computed as
Remark
The word Cartesian derives from the the name of the French philosopher Rene Descartes. We will capitalize the first letter of the name of a term if that term has its origin in the name of an individual.
The Cartesian product of a finite collection of sets is defined similarly as A similar extension holds for an infinite collection of sets.
The Cartesian product of a set with itself, , is often written as . Similarly, the cartesian product of copies of is written as .
Example
As an important example of the Cartesian product of a set with itself is the set : In particular, the sets , and will play a significant role in these notes.
Binary relations
A binary relation on a set is a subset of . If , it is conventional to represent this as .
Remark
In the example presented earlier in the context of the Cartesian product, and . One possible relation on is the set defined as Note that, in this case, , while it is not true that .
A binary relation is said to be
-
reflexive, if it is true that for each , it is the case that .
-
symmetric, if for every , .
-
antisymmetric if for every , .
-
transitive if for every , .
Example
Given any set , let us consider the relation on the power set of . Note that for any , shows us that the relation is reflexive; shows us that the relation is anti-symmetric; and, shows us that the relation is transitive.
Example
Consider the set . The relation on defined as is not reflexive since . It is, however, symmetric and transitive.
An equivalence class on a set is a binary relation on that is reflexive, symmetric and transitive. An equivalence class is typically represented using symbols like , and is usually written as . Given an equivalence relation on , we define the equivalence class of as the set consisting of all elements in that are related to through , The notation is used if the particular equivalence relation used to define this class is to be emphasized. The collection of all equivalence classes of induced by the equivalence relation is called the factor space of , and is denoted as . Thus,
Example
Consider the following equivalence relation on the set of all natural numbers: Let us first verify that this is an equivalence relation. is reflexive since for any natural number , is divisible by . For any , if , then is divisible by . But this also means that is divisible by . In other words, . Thus, we see that is symmetric. Finally, for , if and , then for some , and for some . Adding these two equations, we see that , and hence that . We therefore see that is also a transitive relation. Together, these three results show that is an equivalence relation on . The equivalence classes of in this case are given by There are only distinct equivalence classes in this case. The factor space is thus seen to be
Equivalence classes offer a power means to partition a set. A partition of a set is a pairwise disjoint family of subsets of such that the . The sets are then said to partition the set. The importance of equivalence relation lies in the fact that it partitions the set on which it is defined into disjoint equivalence classes.
Proof
A proof of the statement that an equivalence class on a set partitions the set into disjoint equivalence classes is provided here. This can be skipped on a first reading.
Consider the set consisting of all the equivalence classes of . It is easy to see that . By construction, . Further, , which gives the reverse inclusion, . Thus, .
We prove next that if and , then . Choose any . This choice is always possible since , and hence . From the definition of an equivalence class, . Using the transitivity property of the equivalence relation, and , and hence . We have thus shown that . By a symmetric argument, it is evident that . This shows that if , then .
Finally, let us show that the equivalence classes that are not disjoint are identical. Consider any two equivalence classes and , where . It is either the case that , or . In the latter case, such that and . But this means that and , and hence, by symmetry and transitivity of the equivalence relation, . Using the argument given earlier, we see that . We have thus shown that is a pairwise disjoint cover of , consisting of subsets of .
Example
In the example considered earlier, notice how the set of equivalence classes is pairwise disjoint, and further that The factor space thus provides a partition the set of natural numbers .
Maps and functions
Given two sets and , a map, or a mapping, from into is a relation between and with the property that for every , there exists a unique such that . In symbols, It is conventional to notate as to make connection with the notation often employed in applications. We will also use the notation to denote the fact that maps to . The set is called the graph of . We will reserve the word function to denote a map of the form .
Example
Consider the sets and . The relation defined as does not represent a map since is related to both and . One the other hand, the relation defined as represents a mapping between and : , , and .
The fact that the map takes an element of and returns an element of is compactly written using the notation . Here, the set is called the domain of , and is called the codomain of . The domain of is sometimes abbreviated as . The set is called the range of . is also called the image of under , and is sometimes abbreviated as .
Remark
It is important to always be aware of the domain and codomain of any map/function. For instance, the two functions , defined as and the function , defined as represent two different functions.
The pre-image, or inverse image, of a subset of under the mapping is the subset of consisting of all those elements that are mapped into through , The notation should not be confused with the inverse map, to be defined later. We also note that if there is no such that . Thus, the inverse image is defined for all subsets of .
Example
Consider the sets , and the map defined as In this case, , and .
Given sets and maps and , the composition map is defined as , for every . The composition of maps is associative. What this means is that if , , and are given maps, then . It thus makes sense to write the composition map as .
Example
Let be the function defined as
Let be the function defined as
Then the composite function is given by
The map is said to be a one-to-one map from into , or injective, or an injection, if for . The map is said to be a map from onto , or surjective, or a surjection, if . The map that is both one-to-one and onto, or, equivalently, both injective and surjective, is said to be a one-to-one correspondence, or bijective, or a bijection from onto . A bijection thus allows us to identify elements of one set with that of another.
Example
We will now illustrate how the choice of the domain and codomain for the same rule can be critical in deciding the nature of the corresponding map:
-
The function defined as is onto, but not one-to-one.
-
The function defined as is one-to-one, but not onto.
-
The function defined as is both one-to-one and onto.
If is a bijection, the inverse map is defined as follows: , where is the unique element in such that .
Remark
It is important to distinguish the inverse map from the inverse image. The latter is a mapping from subsets of to subsets of , and is defined even when the inverse map does not exist.
Given any set , we will use the notation to denote the identity map defined as for every . (Depending on the context, other notations will also be used for the identity map.) We thus see that if is a bijection, then and .