INTERNATIONAL SERIES IN PURE AND APPLIED MATHEMATICS William Ted Martin, E. H. Spanier, G. Springer and P. J. Davis. Consulting Editors AHLFORS: Complex Analysis BucK: Advanced Calculus BUSACKER AND SAATY: Finite Graphs and Networks CHENEY: Introduction to Approximation Theory CHESTER: Techniques in Partial Differential Equations CODDINGTON AND LEVINSON: Theory of Ordinary Differential Equations CONTE AND DE BooR: Elementary Numerical Analysis: An Algorithmic Approach DENNEMEYER: Introduction to Partial Differential Equations and Boundary Value Problems DETTMAN: Mathematical Methods in Physics and Engineering GOLOMB AND SHANKS: Elements of Ordinary Differential Equations GREENSPAN: Introduction to Partial Differential Equations HAMMING: Numerical Methods for Scientists and Engineers HILDEBRAND: Introduction to Numerical Analysis HOUSEHOLDER: The Numerical Treatment of a Single Nonlinear Equation KALMAN, FALB, AND ARBIB: Topics in Mathematical Systems Theory LASS: Vector and Tensor Analysis McCARTY: Topology: An Introduction with Applications to Topological Groups MONK: Introduction to Set Theory MOORE: Elements of Linear Algebra and Matrix Theory MOSTOW AND SAMPSON: Linear Algebra MouRSUND AND DURIS: Elementary Theory and Application of Numerical Analysis PEARL: Matrix Theory and Finite Mathematics PIPES AND HARVILL: Applied Mathematics for Engineers and Physicists RALSTON: A First Course in Numerical Analysis RIT0ER AND RosE: Differential Equations with Applications RITT: Fourier Series RUDIN: Principles of Mathematical Analysis SHAPIRO: Introduction to Abstract Algebra SIMMONS: Differential Equations with Applications and Historical Notes SIMMONS: Introduction to Topology and Modern Analysis SNEDDON: Elements of Partial Differential Equations STRUBLE: Nonlinear Differential Equations McGraw-Hill, Inc. New York St. Louis San Francisco Auckland Bogota Caracas Lisbon London Madrid Mexico City Milan Montreal New Delhi San Juan Singapore Sydney Tokyo Toronto WALTER RUDIN Professor of Mathematics University of Wisconsin-Madison Principles of Mathematical Analysis THIRD EDffiON This book was set in Times New Roman. The editors were A. Anthony Arthur and Shelly Levine Langman; the production supervisor was Leroy A. Young. R.R. Donnelley & Sons Company was printer and binder. This book is printed on acid-free paper. Library of Congress Cataloging in Publication Data Rudin, Walter, date Principles of mathematical analysis. (International series in pure and applied mathematics) Bibliography: p. Includes index. 1. Mathematical analysis. I. Title. QA300.R8 1976 515 75-17903 ISBN 0-07-054235-X PRINCIPLES OF MATHEMATICAL ANALYSIS Copyright © 1964, 1976 by McGraw-Hill, Inc. All rights reserved. Copyright 1953 by McGraw-Hill, Inc. All rights reserved. Printed in the United States of America. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. 28 29 30 DOC/DOC O9 8 7 6 5 4 3 2 1 0 CONTENTS Preface ix Chapter 1 The Real and Complex Number Systems 1 Introduction 1 Ordered Sets 3 Fields 5 The Real Field 8 The Extended Real Number System 11 The Complex Field 12 Euclidean Spaces 16 Appendix 17 Exercises 21 Chapter 2 Basic Topology 24 Finite, Countable, and Uncountable Sets 24 Metric Spaces 30 Compact Sets 36 Perfect Sets 41 vi CONTENTS Connected Sets 42 Exercises 43 Chapter 3 Numerical Sequences and Series 47 Convergent Sequences 47 Subsequences 51 Cauchy Sequences 52 Upper and Lower Limits 55 Some Special Sequences 57 Series 58 Series of Nonnegative Terms 61 The Number e 63 The Root and Ratio Tests 65 Power Series 69 Summation by Parts 70 Absolute Convergence 71 Addition and Multiplication of Series 72 Rearrangements 75 Exercises 78 Chapter 4 Continuity 83 Limits of Functions 83 Continuous Functions 85 Continuity and Compactness 89 Continuity and Connectedness 93 Discontinuities 94 Monotonic Functions 95 Infinite Limits and Limits at Infinity 97 Exercises 98 Chapter S Differentiation 103 The Derivative of a Real Function 103 Mean Value Theorems 107 The Continuity of Derivatives 108 L'Hospital's Rule 109 Derivatives of Higher Order 110 Taylor's Theorem 110 Differentiation of Vector-valued Functions 111 Exercises 114 CONTENTS vii Chapter 6 The Riemann-Stieltjes Integral 120 Definition and Existence of the Integral 120 Properties of the Integral 128 Integration and Differentiation 133 Integration of Vector-valued Functions 135 Rectifiable Curves 136 Exercises 138 Chapter 7 Sequences and Series of Functions. 143 Discussion of Main Problem 143 Uniform Convergence 147 Uniform Convergence and Continuity 149 Uniform Convergence and Integration 151 Uniform Convergence and Differentiation 152 Equicontinuous Families of Functions 154 The Stone-Weierstrass Theorem 159 Exercises 165 Chapter 8 Some Special Functions 172 Power Series 172 The Exponential and Logarithmic Functions 178 The Trigonometric Functions 182 The Algebraic Completeness of the Complex Field 184 Fourier Series 185 The Gamma Function 192 Exercises 196 Chapter 9 Functions of Several Variables 204 Linear Transformations 204 Differentiation 211 The Contraction Principle 220 The Inverse Function Theorem 221 The Implicit Function Theorem 223 The Rank Theorem 228 Determinants 231 Derivatives of Higher Order 235 Differentiation of Integrals 236 Exercises 239 Chapter 10 Integration of Differential Forms 245 Integration 245 .uJ OONTBNTI Primitive Mappings 248 Partitions of Unity 251 Change of Variables 252 Differential Forms 253 Simplexes and Chains 266 Stokes' Theorem 273 Closed Forms and Exact Forms 275 Vector Analysis 280 Exercises 288 Chapter 11 The Lebesgue Theory 300 Set Functions 300 Construction of the Lebesgue Measure 302 Measure Spaces 310 Measurable Functions 310 Simple Functions 313 Integration 314 Comparison with the Riemann Integral 322 Integration of Complex Functions 325 Functions of Class ft'2 325 Exercises 332 Bibliography 335 LJst of Special Sym~I• 337 Index 339 PREFACE This book is intended to serve as a text for the course in analysis that is usually taken by advanced undergraduates or by first-year students who study mathematics. The present edition covers essentially the same topics as the second one, with some additions, a few minor omissions, and considerable rearrangement. I hope that these changes will make the material more accessible amd more attractive to the students who take such a course. Experience has convinced me that it is pedagogically unsound (though logically correct) to start off with the construction of the real numbers from the rational ones. At the beginning, most students simply fail to appreciate the need for doing this. Accordingly, the real number system is introduced as an ordered field with the least-upper-bound property, and a few interesting applications of this property are quickly made. However, Dedekind's construction is not omitted. It is now in an Appendix to Chapter l, where it may be studied and enjoyed whenever the time seems ripe. The material on functions of several variables is almost completely rewritten, with many details filled in, and with more examples and more motivation. The proof of the inverse function theorem-the key item in Chapter 9-is X PREFACE simplified by means of the fixed point theorem about contraction mappings. Differential forms are discussed in much greater detail. Several applications of Stokes' theorem are included. As regards other changes, the chapter on the Riemann-Stieltjes integral has been trimmed a bit, a short do-it-yourself section on the gamma function has been added to Chapter 8, and there is a large number of new exercises, most of them with fairly detailed hints. I have also included several references to articles appearing in the American Mathematical Monthly and in Mathematics Magazine, in the hope that students will develop the habit of looking into the journal literature. Most of these references were kindly supplied by R. B. Burckel. Over the years, many people, students as well as teachers, have sent me corrections, criticisms, and other comments concerning the previous editions of this book. I have appreciated these, and I take this opportunity to express my sincere thanks to all who have written me. WALTER RUDIN 1 THE REAL AND COMPLEX NUMBER SYSTEMS INTRODUCTION A satisfactory discussion of the main concepts of analysis (such as convergence, continuity, differentiation, and integration) must be based on an accurately defined number concept. We shall not, however, enter into any discussion of the axioms that govern the arithmetic of the integers, but assume familiarity with the rational numbers (i.e., the numbers of the form m/n, where m and n are integers and n =I- 0). The rational number system is inadequate for many purposes, both as a field and as an ordered set. (These terms will be defined in Secs. 1.6 and 1.12.) For instance, there is no rational p such that p2 = 2. (We shall prove this presently.) This leads to the introduction of so-called "irrational numbers" which are often written as infinite decimal expansions and are considered to be "approximated" by the corresponding finite decimals. Thus the sequence 1, 1.4, 1.41, 1.414, 1.4142, ... "tends to J2." But unless the irrational number j2 has been clearly defined, the question must arise: Just what is it that this sequence "tends to"? 2 PRINCIPLES OF MATHEMATICAL ANALYSIIJ This sort of question can be answered as soon as the so-called "real number system" is constructed. 1.1 Example We now show that the equation (1) p2 =2 is not satisfied by any rational p. If there were such a p, we could write p = m/n where m and n are integers that are not both even. Let us assume this is done. Then (1) implies (2) = m2 2n 2 , This shows that m2 is even. Hence m is even (if m were odd, m2 would be odd), and so m2 is divisible by 4. It follows that the right side of (2) is divisible by 4, so that n2 is even, which implies that n is even. The assumption that (1) holds thus leads to the conclusion that both m and n are even, contrary to our choice of m and n. Hence (1) is impossible for rational p. We now examine this situation a little more closely. Let A be the set of all positive rationals p such that p2 < 2 and let B consist of all positive rationals p such that p2 > 2. We shall show that A contains no largest number and B con- tains no smallest. More explicitly, for every p in A we can find a rational q in A such that p < q, and for every p in B we can find a rational q in B such that q < p. To do this, we associate with each rational p > 0 the number p2 -2 2p + 2 (3) q=p- p+2 = p+2. Then (4) 2 2 - 2(p2 - 2) q - - (p + 2)2 • If p is in A then p 2 - 2 < 0, (3) shows that q > p, and (4) shows that q2 < 2. Thus q is in A. If pis in B then p2 - 2 > 0, (3) shows that 0 < q < p, and (4) shows that q2 > 2. Thus q is in B. 1.2 Remark The purpose of the above discussion has been to show that the rational number system has certain gaps, in spite of the fact that between any two rationals there is another: If r < s then r < (r + s)/2 < s. The real number system fills these gaps. This is the principal reason for the fundamental role which it plays in analysis. THB UAL AND COMPLBX NUMBER SYSTEMS 3 In order to elucidate its structure, as well as that of the complex numbers, we start with a brief discussion of the general concepts of ordered set and field. Here is some of the standard set-theoretic terminology that will be used throughout this book. 1.3 Deftnltlou If A. is any set (whose elements may be numbers or any other objects), we write x e A. to indicate that x is a member (or an element) of A.. If x is not a member of A., we write: x ¢ A.. The set which contains no element will be called the empty set. If a set has at least one element, it is called nonempty. If A. and B are sets, and if every element of A. is an element of B, we say that A. is a subset of B, and write A. c B, or B :::, A.. If, in addition, there is an element of B which is not in A, then A is said to be a proper subset of B. Note that A. c A. for every set A. If A. c B and B c A., we write A. =B. Otherwise A. ,,= B. 1.4 Deflnldon Throughout Chap. 1, the set of all rational numbers will be denoted by Q. ORDERED SETS 1.5 Deflnltion Let S be a set. An order on Sis a relation, denoted by <, with the foilowing two properties: (i) If x e S and ye S then one and only one of the statements is true. x x in place of x < y. The notation x S y indicates that x < y or x =y, without specifying which of these two is to hold. In other words, x S y is the negation of x > y. 1.6 Deflnidon An ordered set is a set S in which an order is defined. For example, Q is an ordered set if r IX is a lower bound of E. t.9 Examples (a) Consider the sets A. and B of Example 1.1 as subsets of the ordered set Q. The set A. is bounded above. In fact, the upper bounds of A. are exactly the members of B. Since B contains no smallest member, A. has no least upper bound in Q. Similarly, Bis bounded below: The set of all lower bounds of B consists of A. and of all re Q with r S 0. Since A. has no lasgest member, B has no greatest lower bound in Q. (b) If IX= sup E exists, then ex may or may not be a member of E. For instance, let E1 be the set of all r e Q with r < 0. Let E2 be the set of all r e Q with r S 0. Then sup E1 = sup E2 =0, and 0 ¢ Ei, 0 e E2 • (c) Let E consist of all numbers 1/n, where n = 1, 2, 3, .... Then sup E = 1, which is in E, and inf E = 0, which is not in E. t.10 Deflnltlon An ordered set Sis said to have the least-upper-boundproperty if the following is true: If E c: S, Eis not empty, and Eis bounded above, then sup E exists in S. Example 1.9(a) shows that Q does not have the least-upper-bound property. We shall now show that there is a close relation between greatest lower bounds and least upper bounds, and that every ordered set with the least-upperbound property also has the greatest-lower-bound property. ' THE REAL AND COMPLEX NUMBER SYSTEMS 5 1.11 Theorem Suppose Sis an ordered set with the least-upper-bound property, B c S, B is not empty, and B is bounded below. Let L be the set of all lower bounds of B. Then exists in S, and oc = inf B. IX= supL In particular, inf B exists in S. Proof Since B is bounded below, L is not empty. Since L consists of exactly those ye S which satisfy the inequality y ~ x for every x e B, we see that every x E B is an upper bound of L. Thus L is bounded above. Our hypothesis about S implies therefore that L has a supremum in S; call it IX. If y < IX then (see Definition 1.8) y is not an upper bound of L, hence y ¢ B. It follows that IX ~ x for every x E B. Thus oc EL. If IX < f3 then /3 ¢ L, since IX is an upper bound of L. We have shown that IX EL but /3 ¢ L if f3 > IX. In other words, oc is a lower bound of B, but f3 is not if /J > IX. This means that IX= inf B. FIELDS 1.12 Definition A field is a set F with two operations, called addition and multiplication, which satisfy the following so-called "field axioms" (A), (M), and (D): (A) Axioms for addition (Al) If x e F and ye F, then their sum x + y is in F. (A2) Addition is commutative: x + y = y + x for all x, y E F. (A3) Addition is associative: (x + y) + z = x + (y + z) for all x, y, z e F. (A4) F contains an element Osuch that O+ x = x for every x E F. (AS) To every x e F corresponds an element - x E F such that X +(-x) = 0. (M) Axioms for multiplication (Ml) (M2) (M3) (M4) (MS) If x E F and y E F, then their product xy is in F. Multiplication is commutative: xy = yx for all x, y E F. Multiplication is associative: (xy)z = x(yz) for all x, y, z E F. F contains an element 1 =/: 0 such that lx = x for every x e F. If x E F and x =/: 0 then there exists an element 1/x E F such that X' (1/x) = 1. 6 PRINCIPLES OP MATHBMAnCAL ANALYSIS (D) 1be diltrlbutive law holds for all x, y, z e F. x(y + z) =xy + xz 1.13 Remarks (a} One usually writes (in any field) x - y, y-X , x + y + z, xyz, x2, x3, 2x, 3x, ... in place of G), x + (-y}, x • (x + y) + z, (xy)z, xx, xxx, x + x, x + x + x, .. , . (b) The field axioms clearly hold in Q, the set of all rational numbers, if addition and multiplication have their customary meaning. Thus Q is a field. (c) Although it is not our purpose to study fields (or any other algebraic structures) in detail, it is worthwhile to prove that some familiar properties of Q are consequences of the field axioms; once we do this, we will not need to do it again for the real numbers and for the complex numbers. 1.14 Proposidon The axioms for addition imply the following statements. (a) If x + y == x + z then y = z. (b) If x + y = x then y = 0. (c) lfx + y =0 theny = -x. (d) -(-x) = x. Statement (a) is a cancellation law. Note that (b) asserts the uniqueness of the element whose existence is assumed in (A4), and that (c) does the same for (AS). Proof If x + y == x + z, the axioms (A) give y =0 + y =(-x + x) + y = -x + (x + y) = -x + (x + z) == (-x + x) + z = 0 + z = z. This proves (a). Take z = 0 in (a) to obtain (b). Take z = -x in (a) to obtain (c). Since -x + x =0, (c) (with -x in pl~ of x) gives (d). THE REAL AND COMPLEX NUMBER SYSTEMS 7 1.15 Proposition The axioms for multiplication imply the following statements. (a) If x # 0 and xy = xz then y = z. (b) If x # 0 and xy = x then y = 1. (c) If x # 0 and xy = 1 then y = 1/x. (d) If x # 0 then 1/(1/x) = x. The proof is so similar to that of Proposition 1.14 that we omit it. 1.16 Proposition The field axioms imply the following statements.for any x, y, zeF. (a) Ox= 0. (b) If x # 0 and y # 0 then xy # 0. (c) (-x)y = -(xy) = x(-y). (d) (-x)(-y) = xy. Proof Ox+ Ox= (0 + O)x = Ox. Hence l.14(b) implies that Ox= 0, and (a) holds. Next, assume x # 0, y # 0, but xy = 0. Then (a) gives 1 = (;)G)xy=G)G)o=O, a contradiction. Thus (b) holds. The first equality in (c) comes from (-x)y + xy = (-x + x)y = Oy = 0, combined with l.14(c); the other half of (c) is proved in the same way. Finally, (-x)(-y)= -[x(-y)]= -[-(xy)]=xy by (c) and l.14(d). 1.17 Definition An ordered field is a field F which is also an ordered set, such that (i) x + y < x + z if x, y, z E F and y < z, (ii) xy > 0 if x E F, y E F, x > 0, and y > 0. If x > 0, we call x positive; if x < 0, xis negative. For example, Q is an ordered field. All the familiar rules for working with inequalities apply in every ordered field: Multiplication by positive [negative] quantities preserves [reverses] inequalities, no square is negative, etc. The following proposition lists some of these. 8 PRINCIPLES OF MATHEMATICAL ANALYSIS 1.18 Proposition The following statements are true in every ordered field. (a) lf x > 0 then - x < 0, and vice versa. (b) lf x > 0 and y < z then xy xz. (d) If x :/= 0 then x2 > 0. In particular, 1 > 0. (e) /JO< x 0 then O= -x + x > -x + 0, so that -x < 0. If x < 0 then 0 = -x + x < -x + 0, so that -x > 0. This proves (a). (b) Since z > y, we have z - y > y - y = 0, hence x(z - y) > 0, and therefore xz = x(z - y) + xy > 0 + xy = xy. (c) By (a), (b), and Proposition 1.16(c), -[x(z -y)] = (-x)(z -y) > 0, so that x(z - y) < 0, hence xz < xy. (d) If x > 0, part (ii) of Definition 1.17 gives x2 > 0. If x < 0, then -x > 0, hence (-x)2 > 0. But x 2 = (-x)2, by Proposition l.16(d). Since 1 = l2, 1 > 0. (e) lfy > 0 and vs; 0, thenyv s; 0. Buty •(1/y) = 1 > 0. Hence 1/y > 0. Likewise, 1/x > 0. If we muJtiply both sides of the inequality x < y by the positive quantity (1/xXl/y), we obtain 1/y < 1/x. THE REAL FIELD We now state the existence theorem which is the core of this chapter. 1.19 Theorem There exists an ordered.field R which has the least-upper-bound property. Moreover, R contains Q as a subfield. The second statement means that Q c R and that the operations of addition and multiplication in R, when applied to members of Q, coincide with the usual operations on rational numbers; also, the positive rational numbers are positive elements of R. The members of Rare called real numbers. The proof of Theorem 1.19 is rather long and a bit tedious and is therefore presented in an Appendix to Chap. I. The proof actually constructs R from Q. THE REAL AND COMPLEX NUMBER SYSTEMS 9 The next theorem could be extracted from this construction with very little extra effort. However, we prefer to derive it from Theorem 1.19 since this provides a good illustration of what one can do with the least-upper-bound property. 1.20 Theorem (a) If x e R, ye R, and x > 0, then there is a positive integer n such that nx>y. (b) [fx e R,y e R, and x < y, then thereexistsap e Q such that x < p < y. Part (a) is usually referred to as the archimedean property of R. Part (b) may be stated by saying that Q is dense in R: Between any two real numbers there is a rational one. Proof (a) Let A be the set of all nx, where n runs through the positive integers. If (a) were false, then y would be an upper bound of A. But then A has a least upper bound in R. Put a= sup A. Since x > 0, a - x < a, and a - x is not an upper bound of A. Hence a - x < mx for some positive integer m. But then a< (m + l)x e A, which is impossible, since a is an upper bound of A. (b) Since x < y, we have y - x > 0, and (a) furnishes a positive integer n such that n(y-x) > I. Apply (a) again, to obtain positive integers m1 and m2 such that m1 > nx, m1 > -nx. Then -m1 0, it follows that m x<- 0 and every integer n > 0 there is one and only one po~itive real y such that y' = x. ix This number y is written Or xlln, Proof That there is at most one such y is clear, since O< y1 < y2 implies Y': <)1. Let E be the set consisting of all positive real numbers t such that t" 1 + x then t" ~ t > x, so that t rt E. Thus 1 + x is an upper bound of E. Hence Theorem 1.19 implies the existence of y = supE. To prove that y' = x we will show that each of the inequalities y' < x and y' > x leads to a contradiction. The identity b" - a"= (b - a)(b"- 1 + b"-2a + • • • + a"- 1) yields the inequality b" - a"< (b - a)nb"- 1 when O y, this contradicts the fact that y is an upper bound of E. Assume y' > x. Put k-1'-x. nyi-1 Then O< k < y. If t ~ y - k, we conclude that y' - t" s y' - (y- k)" < knyi- 1 = y' - x. Thus t• > x, and t ¢ E. It follows that y - k is an upper bound of E. THE REAL AND COMPLEX NUMBER SYSTEMS 11 But y - k < y, which contradicts the fact that y is the least upper bound of E. Hence y" = x, and the proof is complete. Corollary If a and b are positive real numbers and n is a positive integer, then (ab)lfn = alfnbtfn. Proof = Put IX = a11", /3 b11". Then ab = a."/3" = (a./3)", since multiplication is commutative. [Axiom (M2) in Definition 1.12.J The uniqueness assertion of Theorem 1.21 shows therefore that (ab)tfn = a./3 = alfnblfn, 1.22 Decimals We conclude this section by pointing out the relation between real numbers and decimals. Let x > 0 be real. Let n0 be the largest integer such that n0 ~ x. (Note that the existence of n0 depends on the archimedean property of R.) Having chosen n0 , n1, ... , nk- i, let nk be the largest integer such that n1 nk no + IO + ••• + 10k ~ x. Let E be the set of these numbers (5) (k = 0, 1, 2, ...). Then x = sup E. The decimal expansion of xis (6) Conversely, for any infinite decimal (6) the set E of numbers (5) is bounded above, and (6) is the decimal expansion of sup E. Since we shall never use decimals, we do not enter into a detailed discussion. THE EXTENDED REAL NUMBER SYSTEM 1.23 Definition The extended real number system consists of the real field R and two symbols, + oo and - oo. We preserve the original order in R, and define for every x e R. -oo0thenx·(+oo)= +oo,x·(-oo)= -oo. = - (c) If X < 0 then X • ( + 00) = 00, X • ( - 00) + 00. When it is desired to make the distinction between real numbers on the one hand and the symbols + oo and - oo on the other quite explicit, the former are called.finite. THE COMPLEX FIELD 1.24 Definition A complex number is an ordered pair (a, b) of real numbers. "Ordered" means that (a, b) and (b, a) are regarded as distinct if a-:/= b. Let x = (a, b), y = (c, d) be two complex numbers. We write x = y if and only if a= c and b = d. (Note that this definition is not entirely superfluous; think of equality of rational numbers, represented as quotients of integers.) We define x + y = (a + c, b + d), xy = (ac - bd, ad+ be). 1.25 Theorem These definitions of addition and multiplication turn the set of all complex numbers into a field, with (0, 0) and (1, 0) in the role ofO and 1. Proof We simply verify the field axioms, as listed in Definition 1.12. (Of course, we use the field structure of R.) Let x =(a, b), y = (c, d), z =(e,f). (Al) is clear. ~~ x+y=~+~b+~=~+~d+~=y+~ THE REAL AND COMPLEX NUMBER SYSTEMS 13 (A3) (x + y) + z = (a + c, b + d) + (e,f) = (a + c + e, b + d + f) = (a, b) + (c + e, d + f) = x + (y + z). (A4) x + 0 = (a, b) + (0, 0) = (a, b) = x. (AS) Put -x = (-a, -b). Then x + (-x) = (0, 0) = 0. (MI) is clear. (M2) xy = (ac - bd, ad+ be) = (ca - db, da + cb) = yx. (M3) (xy)z = (ac - bd, ad+ bc)(e,f) = (ace - bde - adf - bcf, acf - bdf + ade + bee) = (a, b)(ce - df, cf+ de) = x(yz). (M4) Ix= (1, 0)(a, b) = (a, b) = x. (MS) If x '# 0 then (a, b) '# (0, 0), which means that at least one of the real numbers a, b is different from 0. Hence a2 + b2 > 0, by Proposition 1.18(d), and we can define Then -1= ( - - a , --b-) . x a2 + b2 a2 + b2 x • -I x = (a, b) ( ~ a b+ a 2 ' a----Y+b-b2) = (I, 0) = 1. (D) x(y + z) = (a, b)(c + e, d + f) = (ac + ae - bd - bf, ad+ af +be+ be) = (ac - bd, ad+ be)+ (ae - bf, af + be) = xy + xz. 1.26 Theorem For any real numbers a and b we have (a, 0) + (b, 0) = (a + b, 0), (a, 0)(b, 0) = (ab, 0). The proof is trivial. Theorem 1.26 shows that the complex numbers of the form (a, 0) have the same arithmetic properties as the corresponding real numbers a. We can therefore identify (a, 0) with a. This identification gives us the real field as a subfield of the complex field. The reader may have noticed that we have defined the complex numbers without any reference to the mysterious square root of -1. We now show that the notation (a, b) is equivalent to the more customary a+ bi. 1.27 Definition i = (0, 1). 14 PRINCIPLES OF MATHEMATICAL ANALYSIS 1.28 Theorem i2 = - 1. Proof i2 = (0, 1)(0, 1) = (-1, 0) = -1. 1.29 Theorem If a and b are real, then (a, b) = a + bi. Proof a+ bi= (a, 0) + (b, 0)(0, 1) = (a, 0) + (0, b) = (a, b). 1.30 Definition If a, b are real and z = a + bi, then the complex number z = a - bi is called the conjugate of z. The numbers a and b are the real part and the imaginary part of z, respectively. We shall occasionally write a= Re(z), b = Im(z). 1.31 Theorem If z and w are complex, then (a) z + w = z + w, (b) zw = z • w, (c) z + z = 2 Re(z), z - z = 2i Im(z), (d) zz is real and positive (except when z = 0). Proof (a), (b), and (c) are quite trivial. To prove (d), write z = a + bi, and note that zz = a2 + b2 • 1.32 Definition If z is a complex number, its absolute value zI I is the non- negative square root of zz; that is, lzl = (zz) 112• The existence (and uniqueness) of Izl follows from Theorem 1.21 and part (d) of Theorem 1.31. Note that when xis real, then x = x, hence lxl =Jx2 . Thus lxl = x ifx~0, lxl = -xifx<0. 1.33 Theorem Let z and w be complex numbers. Then (a) lzl > 0 unless z = 0, IOI = 0, (b) lzl = lzl, (c) lzwl = lzl lwl, (d) IRezl S lzl, (e) lz + wl s lzl + lwl, THE REAL AND COMPLEX NUMBER SYSTEMS 15 Proof (a) and (b) are trivial. Putz= a+ bi, w = c + di, with a, b, c, d real. Then lzwl 2 = (ac -bd)2 +(ad+ bc)2 = (a2 + b2)(c2 + d 2) = lzl 2 1wl 2 or Izw I 2 = (Iz I I w1)2• Now (c) follows from the uniqueness assertion of Theorem 1.21. To prove (d), note that a2 s; a2 + b2, hence P J IaJ = :S: a2 + b2• To prove (e), note that zw is the conjugate of zw, so that zw + zw = 2 Re (zw). Hence Iz + w12 = (z + w)(z + w) = zz + zw + zw + ww = JzJ 2 + 2 Re (zw) + JwJ 2 s: lzl2 +2lzwl + 1w12 = lzl 2 + 2Jzl Jwl + lwJ 2 = (lzl + Jwl)2. Now (e) follows by taking square roots. 1.34 Notation If x1, ••• , x,. are complex numbers, we write x1 + x2 + ••• + x,, = L" x1 • J• 1 We conclude this section with an important inequality, usually known as the Schwarz inequality. 1.35 Theorem If a 1, ... , a. and bi, ... , b,, are complex numbers, then = o Proof Put A I: Ia1 I 2, B = I: Ib1J2, C = I:a1 1 (in all sums in this proof, j runs over the values 1, ... , n). If B = 0, then b1 = ••• = b,, = 0, and the conclusion is trivial. Assume therefore that B > 0. By Theorem 1.31 we have = L 1Ba1 - Cb112 L (Ba1 - Cb1)(Ba1 - Cb1) = B2 L la1l 2 - BCL a1°1 -BCLii1b1 + ICl 2 L lb1l 2 = B2A- BJCl 2 = B(AB - ICl 2). 16 PRINCIPLES OF MATHEMATICAL ANALYSIS Since each term in the first sum is nonnegative, we see that B(AB - ICl 2) :;;:; 0. Since B > 0, it follows that AB - ICl 2 :;;:; 0. This is the desired inequality. EUCLIDEAN SPACES 1.36 Definitions For each positive integer k, let Rk be the set of all ordered k-tuples = X (x1, X 2 , , , . , Xk), where x1, ... , xk are real numbers, called the coordinates of x. The elements of Rk are called points, or vectors, especially when k > 1. We shall denote vectors by boldfaced letters. If y = (y1, ... , Yk) and if ix is a real number, put x + Y = (x1 + Yi, ... , xk + Yk), = IXX (1XX1, , , , , IXXk) so that x + y E Rk and ixx E Rk. This defines addition of vectors, as well as multiplication of a vector by a real number (a scalar). These two operations satisfy the commutative, associative, and distributive laws (the proof is trivial, in view of the analogous laws for the real numbers) and make Rk into a vector space over the real field. The zero element of Rk (sometimes called the origin or the null vector) is the point 0, all of whose coordinates are 0. We also define the so-called "inner product" (or scalar product) of x and y by and the norm of x by k x· y = LX;Y; i= 1 (t Jxl = (x. x)1;2 = xf) 112_ The structure now defined (the vector space Rk with the above inner product and norm) is called euclidean k-space. 1.37 Theorem Suppose x, y, z E Rk, and ix is real. Then (a) lxl :;;:; O; (b) \x\ = 0 if and only ifx = O; (c) Jixxl = lixl Jxl; (d) Ix· y\ ;s; Ix\ IYI; (e) \x + y\ ;s; lxl + \yJ; (f) \x-zl ;s; \x-y\ + ly-z\. THB RIIAL AND OOMPLBX NUMBl!ll SYSTEMS 17 Proof (a), (b), and (c) are obvious, and (d) is an immediate consequence of the Schwarz inequality. By (d) we have Ix + y12 == (x + y) • (x + y) ==x·x+2x·y+y·y ~ lxl 2 +21xl IYI + IYl 2 == (lxl + IYl>2, so that (e) is proved. Finally, (f) follows from (e) if we replace x by x-yandybyy-z. 1.38 Remarks Theorem 1.37 (a), (b), and (f) will allow us (see Chap. 2) to regard R" as a metric space. R1 (the set of all real numbers) is usually called the line, or the real line. Likewise, R2 is called the plane, or the complex plane (compare Definitions 1.24 and 1.36). In these two cases the norm is just the absolute value of the corresponding real or complex number. APPENDIX Theorem 1.19 will be proved in this appendix by constructing R from Q. We shall divide the construction into several steps. Step 1 The members of R will be certain subsets of Q, called cuts. A cut is, by definition, any set IX c Q with the following three properties. (I) IX is not empty, and IX "F- Q. (II) If p e IX, q e Q, and q p, we see that r e l' (since ex1 c y), and therefore y satisfies (III). Thus l' e R. It is clear that oi s; l' for every oi e A. Suppose <> < y. Then there is an s e l' and that s ¢ <>. Since s e l', s e oi for some ex e A. Hence<>< ex, and<> is not an upper bound of A. This gives the desired result: l' = sup A. Step 4 If oc e Rand pe R we define oc + pto be the set of all sums r + s, where re ex and s e /J. We define O* to be the set of all negative rational numbers. It is clear that O* is a cut. We verify that the axioms for addition (see Definition 1.12) hold in R, with O* playing the role of0. (Al) We have to show that oi + p is a cut. It is clear that ex+ /J is a nonempty subset of Q. Take r' ¢ oc, s' ¢ p. Then r' + s' > r + s for all choices of re ex, s e p. Thus r' + s' ¢ oi + p. It follows that ex+ /J has property (I). Pickpeex+/J. Thenp=r+s, with reoc, sep. lfq r. Then p < t +sand t + s e oi + p. Thus (111) holds. (A2) oc + pis the set of all r + s, with r e oc, s e p. By the same definition, p + oc is the set of all s + r. Since r + s =s + r for all re Q, s e Q, we have oi + /J = {J + ex. (A3) As above, this follows from the associative law in Q. (A4) If re oc ands e O*, then r + s < r, hence r + s e oi. Thus ex+ O* c oc. To obtain the opposite inclusion, pick p e oi, and pick r e ex, r > p. Then THE REAL AND COMPLEX NUMBER SYSTEMS 19 p - reO*, and p = r +(p-r)eoc + O*. Thus oc c::: oc + O*. We conclude that oc + O* = oc. (AS) Fix oc e R. Let Pbe the set of all p with the following property: There exists r > 0 such that - p - r ¢ oc. In other words, some rational number smaller than - p fails to be in oc. We show that pe Rand that oc + P= O*. Ifs¢ IX and p = -s - 1, then -p - 1 ¢ IX, hence p e {J. So fJ is not empty. If q e cc, then -q ¢ {J. So fJ-,,. Q. Hence p satisfies (I). Pick p e {J, and pick r > 0, so that -p - r ¢ cc. If q -p - r, hence -q - r ¢ IX. Thus q e {J, and (II) holds. Put t=p+(r/2). Then t>p, and -t-(r/2)= -p-r(JIX, so that te{J. Hence p satisfies (III). We have proved that Pe R. If r e IX and s e p, then -s ¢ oc, hence r < -s, r + s < 0. Thus cc+ /Jc::: O*. To prove the opposite inclusion, pick v e O*, put w = -v/2. Then w > 0, and there is an integer n such that nw e oc but (n + l)w f: IX, (Note that this depends on the fact that Q has the archimedean property I) Put = p -(n + 2)w. Then pep, since -p - w¢ IX, and v = nw + p e cc+ p. Thus O* c::: IX + p. We conclude that IX+ P=O*. This p will of course be denoted by - IX. Step 5 Having proved that the addition defined in Step 4 satisfies Axioms (A) of Definition 1.12, it follows that Proposition 1.14 is valid in R, and we can prove one of the requirements of Definition 1.17: /foe, P, y eR and P< y, then oc + P< oc + y. Indeed, it is obvious from the definition of + in R that IX + p c::: IX + y; if we had ix + /J = ix + y, the cancellation law (Proposition 1.14) would imply P= y. It also follows that 01 > O* if and only if - IX < O*. Step 6 Multiplication is a little more bothersome than addition in the present context, since products of negative rationals are positive. For this reason we confine ourselves first to R+, the set of all ix e R with IX> O*. If IX e R + and Pe R +, we define 1XP to be the set of all p such that p ::;; rs for some choice of re IX, s e p, r > 0, s > 0. We define 1* to be the set of all q < 1. 20 PRINCIPLES OF MATHBMATICAL ANALYSIS Then the axioms (M) and (D) of Definition 1.12 hold, with R+ in place ofF, and with 1* in the role of 1. The proofs are so similar to the ones given in detail in Step 4 that we omit them. Note, in particular, that the second requirement of Definition 1.17 holds: If IX> 0* and p > O* then 1X/3 > O*. Step 7 We complete the definition of multiplication by setting 1XO* = O*IX = O*, and by setting ( -1X)(-P) IX/3= { -[(-IX)/31 -[c,:. (-/3)] if IX < O*, /J < O*, ifo < O*, p > 0*, if c,: > O*, /3 < O*. The products on the right were defined in Step 6. Having proved (in Step 6) that the axioms (M) hold in R+, it is now perfectly simple to prove them in R, by repeated application of the identity y = -( -y) which is part of Proposition 1.14. (See Step 5.) The proof of the distributive law IX{P + y) = IX/3 + IXY breaks into cases. For instance, suppose IX> O*, /3 < O*, /3 + y > O*. Then y = (P + y) + ( -/3), and (since we already know that the distributive law holds in R+) IXY = IX{P + y) + IX ' ( - /3). But IX· (-/3) = -(1X/3). Thus IX/3 + IXY = 1X(/3 + y). The other cases are handled in the same way. We have now completed the proof that R is an ordered field with the least- upper-bound property. Step 8 We associate with each re Q the set r* which consists of all p e Q such that p < r. It is clear that each r* is a cut; that is, r* e R. The,;e cuts satisfy the following relations: (a) r* + s* =(r + s)*, (b) r*s* = (rs)*, (c) r* < s* if and only if r < s. To prove (a), choose per* + s*. Then p = u + v, where u < r, v < s. Hence p < r + s, which says that p e (r + s)*. THE REAL AND COMPLEX NUMBER SYSTEMS 21 Conversely, suppose p e (r + s)*. Then p < r + s. Choose t so that 2t = r + s - p, put r' = r - t, s' =s - t. Then r' er*, s' es*, andp = r' + s', so thatp er*+ s*. This proves (a). The proof of (b) is similar. If r < s then res*, but r ¢r*; hence r* < s*. If r* < s*, then there is apes* such that p ¢ r*. Hence rs p < s, so that r < s. This proves (c). Step 9 We saw in Step 8 that the replacement of the rational numbers r by the corresponding "rational cuts" r* e R preserves sums, products, and order. This fact may be expressed by saying that the ordered field Q is isomorphic to the ordered field Q* whose elements are the rational cuts. Of course, r* is by no means the same as r, but the properties we are concerned with (arithmetic and order) are the same in the two fields. It is this identification of Q with Q* which allows us to regard Q as a subfield of R. The second part of Theorem 1.19 is to be understood in terms of this identification. Note that the same phenomenon occurs when the real numbers are regarded as a subfield of the complex field, and it also occurs at a much more elementary level, when the integers are identified with a certain subset of Q. It is a fact, which we will not prove here, that any two orderedfields with the least-upper-bound property are isomorphic. The first part of Theorem 1.19 therefore characterizes the real field R completely. The books by Landau and Thurston cited in the Bibliography are entirely devoted to number systems. Chapter 1 of Knopp's book contains a more leisurely description of how R can be obtained from Q. Another construction, in which each real number is defined to be an equivalence class of Cauchy sequences of rational numbers (see Chap. 3), is carried out in Sec. 5 of the book by Hewitt and Stromberg. The cuts in Q which we used here were invented by Dedekind. The construction of R from Q by means of Cauchy sequences is due to Cantor. Both Cantor and Dedekind published their constructions in 1872. EXERCISES Unless the contrary is explicitly stated, all numbers that are mentioned in these exercises are understood to be real. * 1. If r is rational (r 0) and x is irrational, prove that r +x and rx are irrational. l2 PRINCIPLES OF MATHEMATICAL ANALYSIS 2. Prove that there is no rational number whose square is 12. 3. Prove Proposition 1.15. 4. Let Ebe a nonempty subset of an ordered set; suppose IX is a lower bound of E and /3 is an upper bound of E. Prove that IX '5:. {3. 5. Let A be a nonempty set of real numbers which is bounded below. Let -A be the set of all numbers -x, where x EA. Prove that inf A= -sup(-A). 6. Fix b > 1. (a) If m, n, p, q are integers, n > 0, q > 0, and r = m/n = p/q, prove that (b"')ll• = (b')1'•. Hence it makes sense to define b' = (b"')1'"· (b) Prove that hr+•= b•b• if rands are rational. (c) If x is real, define B(x) to be the set of all numbers b', where t is rational and t '5:. x. Prove that b' =supB(r) when r is rational. Hence it makes sense to define b" =supB(x) for every real x. (d) Prove that bH' b"b' for all real x and y. 7. Fix b > 1, y > 0, and prove that there is a unique real x such that b" = y, by completing the following outline. (This xis called the logarithm ofy to the base b.) (a) For any positive integer n, b• - 1 :2: n(b - 1). (b) Hence b - 1 :2: n(b11" - 1). (c) If t > 1 and n > (b - 1)/(t - 1), then b11• < t. (d) If w is such that bw y, then bw-<1 1•> > y for sufficiently large n. (/) Let A be the set of all w such that bw < y, and show that x = sup A satisfies b"=y. (g) Prove that this x is unique. 8. Prove that no order can be defined in the complex field that turns it into an ordered field. Hint: -1 is a square. 9. Suppose z = a + bi, w = c + di. Define z < w if a < c, and also if a = c but b < d. Prove that this turns the set of all complex numbers into an ordered set. (This type of order relation is called a dictionary order, or lexicographic order, for obvious reasons.) Does this ordered set have the least-upper-bound property? 10. Suppose z = a + bi, w = u + iv, and THE REAL AND COMPLEX NUMBER SYSTEMS 23 z Prove that z2 = w if v Oand that (z)2 = w if v ~ 0. Conclude that every complex number (with one exception!) has two complex square roots. z 11. If z is a complex number, prove that there exists an r Oand a complex number w with Iwl = 1 such that z = rw. Are wand r always uniquely determined by z? 12. If zi, ... , z. are complex, prove that \z,+z2+"·+z.l ~!zi! + \z2\ +"·+Jz.l. 13. If x, y are complex, prove that !lxl - !YI!~ \x-y\. 14. If z is a complex number such that \zl = 1, that is, such that zz= 1, compute \1 +z\2+ ll -z\2. 15. Under what conditions does equality hold in the Schwarz inequality? 16. Supposekz3,x,yERt, lx-yl =d>O,andr>O. Prove: (a) If 2r > d, there are infinitely many z E Rt such that \z-x\ = \z-y\ =r. (b) If 2r = d, there is exactly one such z. (c) If 2r < d, there is no such z. How must these statements be modified if k is 2 or 1? 17. Prove that Ix+ Yl 2 + lx-y\ 2 = 2\x1 2 + 2IYl 2 if xERt and ye Rt. Interpret this geometrically, as a statement about parallelograms. z 18. If k 2 and x E Rt, prove that there exists y E Rt such that y # 0 but x •y = O. Is this also true if k = 1? 19. Suppose a E Rt, b E Rt. Find c E Rt and r > 0 such that \x-a! =2\x-b\ if and only if Ix - c I = r. (Solution: 3c =4b-a, 3r = 2lb-al .) 20. With reference to the Appendix, suppose that property (III) were omitted from the definition of a cut. Keep the same definitions of order and addition. Show that the resulting ordered set has the least-upper-bound property, that addition satisfies axioms (Al) to (A4) (with a slightly different zero-element!) but that (A5) fails. 2 BASIC TOPOLOGY FINITE, COUNTABLE, AND UNCOUNTABLE SETS We begin this section with a definition of the function concept. 2.1 Definition Consider two sets A and B, whose elements may be any objects whatsoever, and suppose that with each element x of A there is associated, in some manner, an element of B, which we denote by f(x). Then/ is said to be a function from A to B (or a mapping of A into B). The set A is called the domain off (we also say f is defined on A), and the elements f(x) are called the values off. The set of all values offis called the range off. 2.2 Definition Let A and B be two sets and let f be a mapping of A into B. If E c A,f(E) is defined to be the set of all elements f(x), for x e E. We call f(E) the image of E under/. In this notation, f(A) is the range of/. It is clear that/(A) c B. Iff(A) = B, we say that/maps A onto B. (Note that, according to this usage, onto is more specific than into.) If E c B,f- 1(E) denotes the set of all x EA such that f(x) e E. We call 1- 1 (E) the inverse image of E under f. If ye B,f- 1(Y) is the set of all x e A BASIC TOPOLOOY 25 = such that /(x) y. If, for each ye B,/-1(y) consists of at most one element of A, then f is said to be a 1-1 (one-to-one) mapping of A into B. This may also be expressed as follows: f is a 1-1 mapping of A into B provided that /(x1):;: /(x2) whenever x 1 :;, x 2 , Xie A, x 2 eA. (The notation x1 :;, x2 means that x1 and x2 are distinct elements; other- = wise we write Xi X2 .) 2.3 Definidon If there exists a 1-1 mapping of A onto B, we say that A and B can be put in 1-1 correspondence, or that A and B have the same cardinal number, or, briefly, that A and B are equivalent, and we write A~ B. This relation clearly has the following properties: It is reflexive: A ~ A. It is symmetric: If A~ B, then B ~ A. It is transitive: If A ~Band B ~ C, then A ~ C. Any relation with these three properties is called an equivalence relation. 2.4 Definidon For any positive integer n, let Jn be the set whose elements are the integers l, 2, ... , n; let J be the set consisting of all positive integers. For any set A, we say: (a) A is.finite if A~ Jn for some n (the empty set is also considered to be finite). ~ (b) A is infinite if A is not finite. (c) A is countable if A J. (d) A is uncountable if A is neither finite nor countable. (e) A is at most countable if A is finite or countable. ~ Countable sets are sometimes called enumerable, or denumerable. For two finite sets A and B, we evidently have A B if and only ifA and B contain the same number of elements. For infinite sets, however, the idea of "having the same number of elements" becomes quite vague, whereas the notion of 1-1 correspondence retains its clarity. 2.5 Example Let A be the set of all integers. Then A is countable. For, consider the following arrangement of the sets A and J: A: 0, 1, -1,2, -2,3, -3, ... J: 1, 2, 3, 4, 5, 6, 7, ... 26 PRINCIPLES OP MATHEMATICAL ANALYSJS We can, in this example, even give an explicit formula for a function/ from J to A which sets up a 1-1 correspondence: /(•)_Jl-i n---1 2 (n even), (n odd). 2.6 Remark A finite set cannot be equivalent to one of its proper subsets. That this is, however, possible for infinite sets, is shown by Example 2.5, in which J is a proper subset of A. In fact, we could replace Definition 2.4(b) by the statement: A is infinite if A is equivalent to one of its proper subsets. 2.7 Definidon By a sequence, we mean a function/ defined on the set J of all positive integers. If/(n) = Xn, for n e J, it is customary to denote the sequence /by the symbol {xn}, or sometimes by x1, x2 , x 3 , .... The values off, that is, the elements Xn, are called the terms of the sequence. If A is a set and if Xn e A for all n eJ, then{xn} is said to be a sequence in A, or asequenceofelementsofA. Note that the terms x1, x2 , x3 , ... of a sequence need not be distinct. Since every countable set is the range of a 1-1 function defined on J, we may regard every countable set as the range of a sequence of distinct terms. Speaking more loosely, we may say that the elements of any countable set can be "arranged in a sequence." Sometimes it is convenient to replace J in this definition by the set of all nonnegative integers, i.e., to start with Orather than with 1. 2.8 Theorem Every infinite subset ofa countable set A is countable. Proof Suppose E c A, and E is infinite. Arrange the elements x of A in a sequence {xn} of distinct elements. Construct a sequence {n1} as follows: Let n1 be the smallest positive integer such that Xn, e E. Having chosen ni, •.• , n1 _ 1 (k = 2, 3, 4, ...), let n1c be the smallest integer greater than n1c-i such that x,.,. e E. Putting/(k) = Xn,. (k = 1, 2, 3, ...), we obtain a 1-1 correspondence between E and J. The theorem shows that, roughly speaking, countable sets represent the "smallest" infinity: No uncountable set can be a subset of a countable set. 2.9 Definition Let A and O be sets, and suppose that with each element « of A there is associated a subset of O which we denote by E«. BASIC TOPOLOGY 27 The set whose elements are the sets E,. will be denoted by {E,.}. Instead of speaking of sets of sets, we shall sometimes speak of a collection of sets, or a family of sets. The union of the sets E,. is defined to be the set S such that x E S if and only if x e E,. for at least one a: e A. We use the notation (I) S= LJ E,.. «&A If A consists of the integers I, 2, ... , n, one usually writes n (2) S= LJ Em m•l or (3) If A is the set of all positive integers, the usual notation is IX) (4) S= LJ Em. m=- 1 The symbol oo in (4) merely indicates that the union of a countable collection of sets is taken, and should not be confused with the symbols + oo, - oo, introduced in Definition 1.23. The intersection of the sets E,. is defined to be the set P such that x e P if and only if x e E,. for every a: e A. We use the notation (5) P= nE,., «eA or (6) n n P = Em = E1 n E2 n ••• n E~, m=l or (7) as for unions. If An Bis not empty, we say that A and B intersect; otherwise they are disjoint. 2.10 Examples (a) Suppose E1 consists of 1, 2, 3 and E2 consists of 2, 3, 4. Then E1 u E2 consists of 1, 2, 3, 4, whereas E1 n E2 consists of 2, 3. 28 PRINCIPLES OF MATHEMATICAL ANALYSIS (b) Let A be the set of real numbers x such that 0 < x s; 1. For every x e A, let E:r be the set of real numbers y such that O< y < x. Then (i) E:r c:: Es if and only if 0 < x S z S 1; (ii) (iii) UE:r =E1 ; :re.t nEx is empty; X&A (i) and (ii) are clear. To prove (iii), we note that for every y > 0, y ti Ex if X < y. Hence y 'nx&A Ex. 2.11 Remarks Many properties of unions and intersections are quite similar to those of sums and products; in fact, the words sum and product were some- n. times used in this connection, and the symbols l: and II were written in place of Uand The commutative and· associative laws are trivial: (8) AuB=BuA; A riB=BnA. (9) (A u B) u C = A u (Bu C); (A n B) n C = A n (B ri C). Thus the omission of parentheses in (3) and (6) is justified. The distributive law also holds: (10) A n (B u C) =(A ri B) u (A ri C). To prove this, let the left and right members of (10) be denoted by E and F, respectively. Suppose x e E. Then x e A and x e B u C, that is, x e B or x e C (pos- sibly both). Hence x e A n B or x e A n C, so that x e F. Thus E c:: F. Next, suppose x e F. Then x e A n B or x e A ri C. That is, x e A, and x e B u C. Hence x e A n (B u C), so that F c:: E. It follows that E = F. We list a few more relations which are easily verified: (11) Ac:: Au B, (12) An B c:: A. If 0 denotes the empty set, then (13) If Ac B, then (14) A u0 =A, A n0 =0. AuB=B, A. n B=A. BASIC TOPOLOGY 29 2.12 Theorem Let {E.}, n = 1, '),, 3, ... , be a sequence of countable sets, andput (15) Then Sis countable. uCl) s = E•. n= 1 Proof Let every set En be arranged in a sequence {x.k}, k = 1, 2, 3, ... , and consider the infinite array (16) in which the elements of En form the nth row. The array contains all elements of S. As indicated by the arrows, these elements can be arranged in a sequence ( 17) If any two of the sets E. have elements in common, these will appear more than once in (17). Hence there is a subset T of the set of all positive integers such that S ~ T, which shows that S is at most countable (Theorem 2.8). Since E1 c S, and E1 is infinite, S is infinite, and thus countable. Corollary Suppose A is at most countable, and, for every !Y. EA, Ba is at most countable. Put Then Tis at most countable. T = LJ Ba, a EA. For Tis equivalent to a subset of (15). 2.13 Theorem Let A be a countable set, and let Bn be the set of all n-tuples (a1, ... , an), v,.•here ak E A (k = 1, ... , n), and the elements a1 , ..• , a. need not be distinct. Then B. is countable. Proof That B1 is countable is evident, since B1 = A. Suppose Bn-t is countable (n = 2, 3, 4, , ..). The elements of B. are of the form (18) (b, a) (b E Bn-1, a EA). For every fixed b, the set of pairs (b, a) is equivalent to A, and hence countable. Thus Bn is the union of a countable set of countable sets. By Theorem 2.12, B" is countable. The theorem follows by induction. 30 PlUNCJPLES OF MATHEMATICAL ANALYSIS Corollary The set ofall rational numbers is countable. Proof We apply Theorem 2.13, with n = 2, noting that every rational r is of the form b/a, where a and b are integers. The set of pairs (a, b), and therefore the set of fractions b/a, is countable. In fact, even the set of all algebraic numbers is countable (see Exercise 2). That not all infinite sets are, however, countable, is shown by the next theorem. 2.14 Theorem Let A be the set ofall sequences whose elements are the digits 0 and 1. This set A is uncountable. The elements of A are sequences like I, 0, 0, I, 0, I, I, I, .... Proof Let E be a countable subset of A, and let E consist of the sequences s1, s2 , s3 , .... We construct a sequences as follows. If the nth digit in sn is 1, we let the nth digit of s be 0, and vice versa. Then the sequence s differs from every member of E in at least one place; hence s ¢ E. But clearly s e A, so that Eis a proper subset of A. We have shown that every countable subset of A is a proper subset of A. It follows that A is uncountable (for otherwise A would be a proper subset of A, which is absurd). The idea of the above proof was first used by Cantor, and is called Cantor's diagonal process; for, if the sequences s1, s2 , s3 , ... are placed in an array like (16), it is the elements on the diagonal which are involved in the construction of the new sequence. Readers who are familiar with the binary representation of the real numbers (base 2 instead of 10) will notice that Theorem 2.14 implies that the set of all real numbers is uncountable. We shall give a second proof of this fact in Theorem 2.43. METRIC SPACES 2.15 Definition A set X, whose elements we shall call points, is said to be a metric space if with any two points p and q of X there is associated a real number d(p, q), called the distance from p to q, such that (a) d(p, q) > 0 if p,;, q; d(p, p) = O; (b) d(p, q) = d(q,p); (c) d(p, q) ~ d(p, r) + d(r, q), for any re X. Any function with these three properties is called a distance function, or a metric. BASIC TOPOLOGY 31 2.16 Examples The most important examples of metric spaces, from our standpoint, are the euclidean spaces R'k, especially R1 (the real line) and R2 (the complex plane); the distance in Rt is defined by (19) d(x, y) = Ix - y I (x, ye R"). By Theorem 1.37, the conditions of Definition 2.15 are satisfied by (19). It is important to observe that every subset Y of a metric space Xis a metric space in its own right, with the same distance function. For it is clear that if conditions (a) to (c) of Definition 2.15 hold for p, q, re X, they also hold if we restrict p, q, r to lie in Y. Thus every subset of a euclidean space is a metric space. Other examples are the spaces (l(K) and !i72(µ), which are discussed in Chaps. 7 and 11, respectively. 2.17 Definition By the segment (a, b) we mean the set of all real numbers x such that a < x < b. By the interval [a, b] we mean the set of all real numbers x such that a~ x ~ b. Occasionally we shall also encounter "half-open intervals" [a, b) and (a, b]; the first consists of all x such that a ~ x < b, the second of all x such that a 0, the open (or closed) ball B with center at x and radiu~ r is defined to be the set of all y e R" such that Iy - x I < , (or Iy - x I ~ r). We call a set E c R" convex if AX + (1 - A)Y e E whenever x e E, y e E, and O< i < I. For example, balls are convex. For if IY - xi < ,, lz - xi < ,, and 0 < i < 1, we have IAY+(l-i)z-xl = IA(y-x)+(l-A)(z-x)I s;ily-xj +(1-i)jz-xl 0, The number r is called the radius of N,(p). (b) A point p is a limit point of the set E if every neighborhood of p contains a point q :/: p such that q e E. (c) If p e E and p is not a limit point of E, then p is called an isolated point of E. (d) E is closed if every limit point of E is a point of E. (e) A point p is an interior point of E if there is a neighborhood N of p such that N c: E. (f) E is open if every point of E is an interior point of E. (g) The complement of E (denoted by E") is the set of all points p e X such that p ¢ E. (h) E is perfect if E is closed and if every point of E is a limit point of E. (i) E is bounded if there is a real number M and a point q e X such that d(p, q) < M for all p e E. U) E is dense in X if every point of X is a limit point of E, or a point of E (or both). Let us note that in R1 neighborhoods are segments, whereas in R2 neighborhoods are interiors of circles. 2.19 Theorem Every neighborhood is an open set. = Proof Consider a neighborhood E N,(p), and let q be any point of E. Then there is a positive real number h such that = d(p, q) r - h. For all points s such that d(q, s) < h, we have then d(p, s) ~ d(p, q) + d(q, s) < r - h + h = r, so that s e E. Thus q is an interior point of E. 2.20 Theorem If p is a limit point of a set E, then every neighborhood of p contains in.finitely many points ofE. Proof Suppose there is a neighborhood N of p which contains only a finite number of points of E. Let qi, ... , q" be those points of N n E, which are distinct from p, and put r = min d(p, q,,.) 1 smsn BASIC TOPOLOGY 33 [we use this notation to denote the smallest of the numbers d(p, q1), ... , d(p, q,,)]. The minimum of a finite set of positive numbers is clearly posi- tive, so that r > 0. The neighborhood N,.(p) contains no point q of E such that q "F p, so that p is not a limit point of E. This contradiction establishes the theorem. Corollary A.finite point set has no limit points. 2.21 Examples Let us consider the following subsets of R2 : (a) The set of all complex z such that Iz I < 1. (b) The set of all complex z such that IzI :S 1. (c) A nonempty finite set. (d) The set of all integers. (e) The set consisting of the numbers 1/n (n = 1, 2, 3, ...). Let us note that this set E has a limit point (namely, z =0) but that no point of E is a limit point of E; we wish to stress the difference between having a lilnit point and containing one. (f) The set of all complex numbers (that is, R2). (g) The segment (a, b). Let us note that (d), (e), (g) can be regarded also as subsets of R1• Some properties of these sets are tabulated below: Closed Open Perfect Bounded (a) No Yes No Yes (b) Yes No Yes Yes (c) Yes No No Yes (d) Yes No No No (e) No No No Yes (f) Yes Yes Yes No (g) No No Yes In (g), we left the second entry blank. The reason is that the segment (a, b) is not open if we regard it as a subset of R2, but it is an open subset of R1• (V r 0 2.22 Theorem Let {E.} be a (finite or infinite) collection ofsets E•. Then (20) E. = (E:). 'u. Proof Let A. and B be the left and right members of (20). If x e A, then X E.' hence X ' E. for any cc, hence Xe E: for every cc, so that X en E!. Thus A. c: B. 34 PRINCIPLES OP MATHEMATICAL ANALYSIS Conversely, if x e B, then x e E; for every 01:, hence x ¢ E,,, for any 01:, hence x ¢ U. E,,,, so that x e ( U. E,,,)':. Thus B c: A. It follows that A =B. 2.23 Theorem A set Eis open ifand only ifits complement is closed. Proof First, suppose Ee is closed. Choose x e E. Then x ¢ C, and x is not a limit point of Ee. Hence there exists a neighborhood N of x such that Ee r. N is empty, that is, N c: E. Thus x is an interior point of E, and E is open. Next, suppose E is open. Let x be a limit point of Ee. Then every neighborhood of x contains a point of Ee, so that xis not an interior point of E. Since E is open, this means that x e C. It follows that Ee is closed. Corollary A set Fis closed ifand only ifits complement is open. 2.24 Theorem (a) For any collection {G,,,} ofopen sets, U. G,,, is open. n. (b) For any collection {F,,,} ofclosed sets, F,,, is closed. m. (c) For anyfinite collection Gi, ... , Gn ofopen sets, 1 G1 is open. (d) For any finite collection Fi, ... , Fn of closed sets, Ui. 1 F1 is closed. Proof Put G =U. G,,,. If x e G, then x e G,,, for some 01:. Since x is an interior point of G,,,, x is also an interior point of G, and G is open. This proves (a). By Theorem 2.22, (21) and F; is open, by Theorem 2.23. Hence (a) implies that (21) is open so n. that F,,, is closed. m. Next, put H = 1 G1• For any x e H, there exist neighborhoods N 1 of x, with radii r,. such that N 1 c: G1 (i = 1, ... , n). Put r = min (ri, ... , rn), and let N be the neighborhood of x of radius r. Then N c: G1 for i = l, ... , n, so that N c: H, and H is open. By taking complements, (d) follows from (c): (lJ F,)c = (\(Ft). l•l 1•1 BASIC TOPOLOGY 35 2.25 Examples In parts (c) and (d) of the preceding theorem, the finiteness of 1, 1) the collections is essential. For let Gn be the segment (- (n = 1, 2, 3, ...). Then Gn is an open subset of R1• Put G = n:°= 1 Gn. Then G consists of a single point (namely, x =0) and is therefore not an open subset of R1. Thus the intersection of an infinite collection of open sets need not be open. Similarly, the union of an infinite collection of closed sets need not be closed. 2.26 Deftnidon If X is a metric space, if E c X, and if E' denotes the set of all limit points of E in X, then the closure of E is the set E = E u E'. 2.27 Theorem If Xis a metric space and E c: X, then (a) E is closed, (b) E =E ifand only ifEis closed, (c) E c Ffor every closed set F c X such that E c F. By (a) and (c), E Is the smallest closed subset of X that contains E. Proof (a) Ifp e X and p ¢Ethen pis neither a point of E nor a limit point of E. Hence p has a neighborhood which does not intersect E. The complement of E is therefore open. Hence E is closed. (b) If E =E, (a) implies that Eis closed. If Eis closed, then E' c:: E = [by Definitions 2.18(d) and 2.26], hence E E. (c) If Fis closed and F=> E, then F=> F', hence F=> E'. Thus F=> E. 2.28 Theorem Let Ebe a nonempty set ofreal numbers which is bounded above. Let y =sup E. Then ye E. Hence ye E ifEis closed. Compare this with the examples in Sec. 1.9. Proof If ye Ethen ye E. Assume y ¢ E. For every h > 0 there exists then a point x e E such that y - h < x < y, for otherwise y - h would be an upper bound of E. Thus y is a limit point of E. Hence ye E. 2.29 Remark Suppose E c: Y c X, where Xis a metric space. To say that E is an open subset of X means that to each point p e E there is associated a positive number r such that the conditions d(p, q) < r, q e X imply that q e E. But we have already observed (Sec. 2.16) that Y is also a metric space, so that our definitions may equally well be made within Y. To be quite explicit, let us say that E is open relative to Y if to each p e E there is associated an r > 0 such that q e E whenever d(p, q) < r and q e Y. Example 2.2l(g) showed that a set 36 PRINCIPLES OP MATHEMATICAL ANALY81S may be open relative to Y without being an open subset of X. However, there is a simple relation between these concepts, which we now state. 2.30 Theorem Suppose Y c X. A subset E of Y is open relative to Y if and = only ifE Y n G for some open subset G of X. Proof Suppose Eis open relative to Y. To each p e E there is a positive number rP such that the conditions d(p, q) < rP, q e Y imply that q e E. Let VP be the set of all q e X such that d(p, q) < rp, and define u G= VP. peE Then G is an open subset of X, by Theorems 2.19 and 2.24. Since p e VP for all p e E, it is clear that E c G n Y. By our choice of VP, we have VP n Y c E for every p e E, so that G n Y c E. Thus E = G n Y, and one half of the theorem is proved. Conversely, if G is open in X and E = G rs Y, every p e E has a neighborhood VP c G. Then VP rs Y c E, so that Eis open relative to Y. COMPACT SETS 2.31 Definition By an open cover of a set E in a metric space X we mean a collection {G11} of open subsets of X such that E c U G 11 11 • 2.32 Definition A subset K of a metric space X is said to be compact if every open cover of K contains a.finite subcover. More explicitly, the requirement is that if {G11} is an open cover of K, then there are finitely many indices a:1, ... , «n such that KC G111 U • • • U Glln • The notion of compactness is of great importance in analysis, especially in connection with continuity (Chap. 4). It is clear that every finite set is compact. The existence of a large class of infinite compact sets in ~ will follow from Theorem 2.41. We observed earlier (in Sec. 2.29) that if E c Y c X, then E may be open relative to Y without being open relative to X. The property of being open thus depends on the space in which E is embedded. The same is true of the property of being closed. Compactness, however, behaves better, as we shall now see. To formulate the next theorem, let us say, temporarily, that K is compact relative to X if the requirements of Definition 2.32 are met. BASIC TOPOLOGY 37 2.33 Theorem Suppose K c:: Y c:: X. Then K is compact relative to X if and only ifK is compact relative to Y. By virtue of this theorem we are able, in many situations, to regard compact sets as metric spaces in their own right, without paying any attention to any embedding space. In particular, although it makes little sense to talk of open spaces, or of closed spaces (every metric space Xis an open subset of itself, and is a closed subset of itself), it does make sense to talk of compact metric spaces. Proof Suppose K is compact relative to X, and let {V11} be a collection of sets, open relative to Y, such that Kc:: U« V11 • By theorem 2.30, there are sets Gu open relative to X, such that V11 = Y n G,., for all a; and since K is compact relative to X, we have (22) Kc:: G,.1 u ••• u G,." for some choice of finitely many indices ai, ... , °'n. Since Kc:: Y, (22) implies (23) K c:: V«. u ••• u VGI". This proves that K is compact relative to Y. Conversely, suppose K is compact relative to Y, let {G,.} be a col- lection of open subsets of X which covers K, and put V11 = Y n G,.. Then (23) will hold for some choice of a1, ... , an; and since V,. c:: G11 , (23) implies (22). This completes the proof. 2.34 Theorem Compact subsets ofmetric spaces are closed. Proof Let K be a compact subset of a metric space X. We shall prove that the complement of K is an open subset of X. Suppose p e X, p ¢ K. If q e K, let Vq and Wq be neighborhoods ofp and q, respectively, of radius less than ½d(p, q) [see Definition 2.18(a)J. Since K is compact, there are finitely many points q1, ... , qn in K such that KC Wq, u ••• u Wq" = w. If V = Vq, n • • • n Vq", then V is a neighborhood of p which does not intersect W. Hence V c Kc, so that p is an interior point of Kc. The theorem follows. 2.35 Theorem Closed subsets ofcompact sets are compact. Proof Suppose F c:: Kc X, Fis closed (relative to X), and K is compact. r Let {V,.} be an open cover of F. If is adjoined to {V11}, we obtain an 38 PRINCIPLES OF MATHEMATICAL ANALYSIS n open cover of K. Since K is compact, there is a finite subcollection Cll of n which covers K, and hence F. If P is a member of Cll, we may remove it from Cl) and still retain an open cover of F. We have thus shown that a finite subcollection of {V11} covers F. Corollary If Fis closed and K is compact, then F n K is compact. Proof Theorems 2.24(b) and 2.34 show that F n K is closed; since F n K c: K, Theorem 2.35 shows that F n K is compact. n 2.36 Theorem If{K11} is a collection ofcompact subsets ofa metric space X such that the intersection of every finite subcollection of {K..} is nonempty, then K.. is nonempty. Proof Fix a member K1 of {K..} and put G11 = K;. Assume that no point of K1 belongs to every-K11 • Then the sets G11 form an open cover of K1 ; °'" and since K1 is compact, there are finitely many indices a:1, ... , such that K1 c: G111 u ••• u G11,.. But this means that K1 n K111 n • • • n K11,. is empty, in contradiction to our hypothesis. Corollary If {Kn} is a sequence of nonempty compact sets such that Kn=> Kn+t (n = l, 2, 3, ...), then ni Kn is not empty. 2.37 neorem If E is an infinite subset of a compact set K, then E has a limit point inK. Proof If no point of K were a limit point of E, then each q e K would have a neighborhood Vq which contains at most one point of E (namely, q, if q e E). It is clear that no finite subcollection of {Vq} can cover E; and the same is true of K, since E c K. This contradicts the compactness of K. 2.38 Theorem If {In} is a sequence of intervals in R1, such that In=> In+t (n =1, 2, 3, ...), then ni In is not empty. Proof If In = [an, bn], let E be the set of all an. Then E is nonempty and bounded above (by b1). Let x be the sup of E. If m and n are positive integers, then 0,. S am+n S bm+n Sbm, so that x s b,,. for each m. Since it is obvious that a. :s; x, we see that x e I,,. form = l, 2, 3, .... BASIC TOPOLOGY 39 2.39 Theorem Let k be a positive integer. If Vn} is a sequence of k-cells such that In::::, In+ 1(n = 1, 2, 3, ...), then ni In is not empty. Proof Let In consist of all points x = (xi, ... , x,.) such that = an,J ~ Xj ::S: bn,J (l ~j ~ k; n 1, 2, 3, ...), and put In,J = [an,J, bn,Jl• For each j, the sequence {In,J} satisfies the hypotheses of Theorem 2.38. Hence there are real numbers xj(l ~j ~ k) such that (1 ~j ~ k; n = 1, 2, 3, ...). Setting x• == (xT, ... , x:), we see that x• e In for n = 1, 2, 3, . . . . The theorem follows. 2.40 Theorem Every k-cell is compact. Proof Let I be a k-cell, consisting of all points x = (x1, .•• , x,.) such that a1 ~x1 ~ b1 (1 ~j ~ k). Put {t o= (b1 - a1)2r2. o, Then Ix - y I ~ if x e I, y e I. Suppose, to get a contradiction, that there exists an open cover {Gil} of I which contains no finite subcover of I. Put c1 = (a1 + b1)/2. The intervals [a1 , c1] and [c1 , b1] then determine 2" k-cells Q1 whose union is I. At least one of these sets Q;, call it I1, cannot be covered by any finite subcollection of {Gil} (otherwise I could be so covered). We next subdivide I1and continue the process. We obtain a sequence Vn} with the following properties: (a) I ::::> I1 ::::, I2 ::::, I3 ::::, • • • ; (b) In is not covered by any finite subcollection of {Gil}; (c) ifx e In and ye In, then Ix - YI ~ r"o. By (a) and Theorem 2.39, there is a point x• which lies in every In. For some ex, x• e Gil. Since Gil is open, there exists r > 0 such that Iy - x• I < r implies that ye Gil. If n is so large that r"o < r (there is such an n, for otherwise 2" ::S: o/r for all positive integers n, which is absurd since R is archimedean), then (c) implies that Inc Gil, which contradicts (b). This completes the proof. The equivalence of (a) and (b) in the next theorem is known as the HeineBorel theorem. 40 PRINCIPLES OP MATHEMATICAL ANALYSIS 2,41 neorem If a set E in R" has one of the following three properties, then it has the other two: (a) Eis closed and bounded. (b) Eis compact. (c) Every infinite subset ofE has a limit point in E. Proof If (a) holds, then E c: / for some k-cell /, and (b) follows from Theorems 2.40 and 2.35. Theorem 2.37 shows that (b) implies (c). It remains to be shown that (c) implies (a). If E is not bounded, then E contains points x" with (n = 1, 2, 3, ...). The set S consisting of these points Xn is infinite and clearly has no limit point in R", hence has none in E. Thus (c) implies that Eis bounded. If E is not closed, then there is a point x0 e R" which is a limit point of E but not a point of E. For n = 1, 2, 3, ... , there are points Xn e E such that Ix" - x0 I < 1/n. Let S be the set of these points x". Then S is infinite (otherwise Ix" - x0 I would have a constant positive value, for infinitely many n), S has x0 as a limit point, and S has no other limit point in R". For if ye R", y ¢ x0 , then lxn - YI ~ lxo -yl - lxn - Xol ~ 1 lxo-Yl-,;~ l 2lxo-YI for all but finitely many n; this shows that y is not a limit point of S (Theorem 2.20). Thus S has no limit point in E; hence E must be closed if (c) holds. We should remark, at this point, that (b) and (c) are equivalent in any metric space (Exercise 26) but that (a) does not, in general, imply (b) and (c). Examples are furnished by Exercise 16 and by the space !i'2, which is discussed in Chap. 11. 2,42 Theorem (Weierstrass) Every bounded infinite subset of R" has a limit point in R". Proof Being bounded, the set E in question is a subset of a k-cell / c: R". By Theorem 2.40, / is compact, and so E has a limit point in /, by Theorem 2.37. BASIC TOPOLOGY 41 PERFECT SETS 2.43 Theorem Let P be a nonempty perfect set in Rt.. Then Pis uncountable. Proof Since P has limit points, P must be infinite. Suppose P is countable, and denote the points of P by xi, x2, x 3, . •. . We shall construct a sequence {Vn} of neighborhoods, as follows. Let Vi be any neighborhood of x1• If V1 consists of all y e ~ such that Iy - x1 I < r, the closure Vi of V1 is the set of all y e Rk such that IY -xii S: r. Suppose Vn has been constructed, so that Vn n Pis not empty. Since every point of Pis a limit point of P, there is a neighborhood Vn+t such that (i) V..+ 1 c: Vn, (ii) xn¢ V..+i, (iii) Vn+i nP is not empty. By (iii), Vn+1 satisfies our induction hypothesis, and the construction can proceed. Put Kn = Yn n P. Since Yn is closed and bounded, V,. is compact. Since xn ¢ Kn+ i, no point of P lies in nf Kn. Since Kn c:. P, this implies that nf Kn is empty. But each Kn is nonempty, by (iii), and Kn=> Kn+i, by (i); this contradicts the Corollary to Theorem 2.36. Corollary Every interval [a, b] (a< b) is uncountable. In particular, the set of all real numbers is uncountable. 2.44 The Cantor set The set which we are now going to construct shows that there exist perfect sets in R1 which contain no segment. Let E0 be the interval [O, I]. Remove the segment (¼, ¼), and let E1 be the union of the intervals [O, ¼] [¼, 1]. Remove the middle thirds of these intervals, and let E2 be the union of the intervals [o, -H [¾, ¾1, [f, H [¾, 11. Continuing in this way, we obtain a sequence of compact sets En, such that (a) E1 => E2 :::, E3 => • • • ; (b) En is the union of 2n intervals, each of length rn. The set is called the Cantor set. P is clearly compact, and Theorem 2.36 shows that P is not empty. 42 PRINCIPLES OF MATHEMATICAL ANALYSIS No segment of the form (24) where k and m are positive integers, has a point in common with P. Since every segment (ci:, /3) contains a segment of the form (24), if 3-m <{3---C,( 6 P contains no segment. To show that Pis perfect, it is enough to show that P contains no isolated point. Let x e P, and let S be any segment containing x. Let In be that interval of En which contains x. Choose n large enough, so that In c S. Let xn be an endpoint of In, such that Xn '# x. It follows from the construction of P that xn e P. Hence x is a limit point of P, and P is perfect. One of the most interesting properties of the Cantor set is that it provides us with an example of an uncountable set of measure zero (the concept of measure will be discussed in Chap. 11). CONNECTED SETS 2.45 Definition Two subsets A and B of a metric space X are said to be separated if both A n B and An Bare empty, i.e., if no point of A lies in the closure of Band no point of B lies in the closure of A. A set E c X is said to be connected if E is not a union of two nonempty separated sets. 2.46 Remark Separated sets are of course disjoint, but disjoint sets need not be separated. For example, the interval [0, 1] and the segment (1, 2) are not separated, since l is a limit point of (l, 2). However, the segments (0, 1) and (1, 2) are separated. The connected subsets of the line have a particularly simple structure: 2.47 Theorem A subset E of the real line R1 is connected ifand only ifit has the following property: If x e E, ye E, and x < z < y, then z e E. I Proof If there exist x e E, y e E, and some z e (x, y) such that z ¢ E, then E = A,. u B,. where A,. =En (-oo, z), B,. =En (z, oo). BASIC TOPOLOGY 43 Since x e A. and ye B., A and Bare nonempty. Since A. c:: ( - oo, z) and B11 c:: (z, oo), they are separated. Hence E is not connected. To prove the converse, suppose E is not connected. Then there are nonempty separated sets A and B such that A u B = E. Pick x e A, ye B, and assume (without loss of generality) that x < y. Define z = sup (An [x, y]). By Theorem 2.28, z e A; hence z ¢ B. In particular, x ~ z < y. If z ¢ A, it follows that x < z < y and z ¢ E. If z e A, then z ¢ B, hence there exists z1 such that z < z1 < y and z1 ¢ B. Then x < z1 < y and z1 ¢ E. EXERCISES 1. Prove that the empty set is a subset of every set. 2. A complex number z is said to be algebraic if there are integers a0 , ••• , a., not all zero, such that aoz" + a1z•- 1 + ··· + an-1Z +a.= 0. Prove that the set of all algebraic numbers is countable. Hint: For every positive integer N there are only finitely many equations with n+ laol +lad+···+ la.I =N. 3, Prove that there exist real numbers which are not algebraic. 4, Is the set of all irrational real numbers countable? 5. Construct a bounded set of real numbers with exactly three limit points. 6, Let E' be the set of all limit points of a set E. Prove that E' is closed. Prove that E and £ have the same limit points. (Recall that£ =Eu E'.) Do E and E' always have the same limit points? 7. Let Ai, A1, A3, ... be subsets of a metric space. (a) If B. = Ur-1 A,, prove that B. = Ur-1 A,, for n = 1, 2, 3, .... (b) If B = Ul°-1 A,, prove that B :::> Ui!.1 A,. Show, by an example, that this inclusion can be proper. 8. Is every point of every open set E c R2 a limit point of E? Answer the same question for closed sets in R2• 9. Let E" denote the set of all interior points of a set E. [See Definition 2.18(e); E0 is called the interior of E.] {a) Prove that E" is always open. (b) Prove that Eis open if and only if E0 = E. (c) If G c E and G is open, prove that G c £ 0 • {d) Prove that the complement of E 0 is the closure of the complement of E. (e) Do E and £ always have the same interiors? (/) Do E and E0 always have the same closures? 44 PRINCIPLES OF MATHEMATICAL ANALYSIS 10. Let X be an infinite set. For p e X and q e X, define d(p,q)= {~ (ifp ¢ q) (ifp =q). Prove that this is a metric. Which subsets of the resulting metric space are open? Which are closed? Which are compact? 11, For x e R1 and ye R1, define d1(x, y) = (x - y) 2, dz(X, y) = VIX - YI, da(x,y) = lxz -yzl, d4(X, y) = Ix - 2yj, ds(x,y) = lx-yl 1 +lx-yl. Determine, for each of these, whether it is a metric or not. 12. Let Kc: R1 consist of 0 and the numbers 1/n, for n = 1, 2, 3, . . . . Prove that K is compact directly from the definition (without using the Heine-Borel theorem). 13. Construct a compact set of real numbers whose limit points form a countable set. 14. Give an example of an open cover of the segment (0, 1) which has no finite sub- cover. 15. Show that Theorem 2.36 and its Corollary become false (in R1, for example) if the word "compact" is replaced by "closed" or by "bounded." 16. Regard Q, the set of all rational numbers, as a metric space, with d(p, q) = IP - q I, Let Ebe the set of all p e Q such that 2 0, define A to be the set of all q e X for which d(p, q) < 8, define B similarly, with > in place of <. Prove that A and Bare separated. (d) Prove that every connected metric space with at least two points is uncount• able. Hint: Use (c). 20. Are closures and interiors of connected sets always connected? (Look at subsets of R2.) 21. Let A and B be separated subsets of some Rt, suppose a e A, be B, and define p(t) = (1 - t)a + tb for teR1• Put Ao =p- 1(A), Bo =p- 1(B). [Thus teA 0 if and only ifp(t)eA.] BASIC TOPOLOGY 45 (a) Prove that Ao and Bo are separated subsets of R1• (b) Prove that there exists toe (0, 1) such that p(to) ¢ A u B. (c) Prove that every convex subset of Rt is connected. 22. A metric space is called separable if it contains a countable dense subset. Show that Rt is separable. Hint: Consider the set of points which have only rational coordinates. 23. A collection {V«} of open subsets of X is said to be a base for X if the following is true: For every x e X and every open set G c X such that x e G, we have x e V« c G for some ex, In other words, every open set in X is the union of a subcollection of {V«l• Prove that every separable metric space has a countable base. Hint: Take all neighborhoods with rational radius and center in some countable dense subset of X. 24. Let X be a metric space in which every infinite subset has a limit point. Prove that Xis separable. Hint: Fix 8 > 0, and pick x1 e X. Having chosen Xi, .•. , x1 e X, choose x1 + 1 e X, if possible, so that d(x,, x1+ 1)~ 8 for i = 1, ... ,j. Show that this process must stop after a finite number of steps, and that X can therefore be covered by finitely many neighborhoods of radius 8. Take 8 = 1/n (n = 1, 2, 3, ...), and consider the centers of the corresponding neighborhoods. 25. Prove that every compact metric space K has a countable base, and thclt K is therefore separable. Hint: For every positive integer n, there are finitely many neighborhoods of radius 1/n whose union covers K. 26. Let X be a metric space in which every infinite subset has a limit point. Prove that X is compact. Hint: By Exercises 23 and 24, X has a countable base. It follows that every open cover of Xhas a countable subcover {G.}, n = 1, 2, 3, .... Ifno finite subcollection of {G.} covers X, then the complement F. of G1 u ••• u G. is nonempty for each n, but () F. is empty. If Eis a set which contains a point from each F., consider a limit point of E, and obtain a contradiction. 27. Define a point p in a metric space X to be a condensation point of a set E c X if every neighborhood of p contains uncountably many points of E. Suppose E c Rt, Eis uncountable, and let P be the set of all condensation points of E. Prove that P is perfect and that at most countably many points of E are not in P. In other words, show that pc r. E is at most countable. Hint: Let {V.} be a countable base of Rt, let W be the union of those v. for which E r. v. is at most countable, and show that P = we. 28. Prove that every closed set in a separable metric space is the union of a (possibly empty) perfect set and a set which is at most countable. (Corollary: Every countable closed set in Rt has isolated points.) Hint: Use Exercise 27. 29. Prove that every open set in R1 is the union of an at most countable collection of disjoint segments. Hint: Use Exercise 22. 46 PRINCIPLES OF MATHEMATICAL ANALYSIS 30. Imitate the proof of Theorem 2.43 to obtain the following result: u If Rk = rFn' where each Fn is a closed subset of Rt, then at least one Fn has a nonempty interior. Equivalent statement: If Gn is a dense open subset of Rk, for n = l, 2, 3, ... , nrc. then is not empty (in fact, it is dense in Rk). (This is a special case of Baire's theorem; see Exercise 22, Chap. 3, for the general case.) 3 NUMERICAL SEQUENCES AND SERIES As the title indicates, this chapter will deal primarily with sequences and series of complex numbers. The basic facts about convergence, however, are just as easily explained in a more general setting. The first three sections will therefore be concerned with sequences in euclidean spaces, or even in metric spaces. CONVERGENT SEQUENCES 3.1 Definition A sequence {Pn} in a metric space Xis said to converge if there is a point p e X with the following property: For every s > 0 there is an integer N such that n ~ N implies that d(pn, p) < e. (Here d denotes the distance in X.) In this case we also say that {Pn} converges to p, or that p is the limit of {Pn} [see Theorem 3.2(b)], and we write Pn ➔ p, or lim Pn =p. n ➔ oo If {Pn} does not converge, it is said to diverge. 48 PRINCIPLES OF MATHEMATICAL ANALYSIS It might be well to point out that our definition of "convergent sequence" depends not only on {Pn} but also on X; for instance, the sequence {1/n} con- verges in R1 (to 0), but fails to converge in the set of all positive real numbers [with d(x, y) = Ix - y I]. In cases of possible ambiguity, we can be more precise and specify "convergent in X" rather than "convergent." We recall that the set of all points Pn (n = 1, 2, 3, ...) is the range of {Pn}. The range of a sequence may be a finite set, or it may be infinite. The sequence {Pn} is said to be bounded if its range is bounded. As examples, consider the following sequences of complex numbers (that is, X = R2): (a) If sn = 1/n, then limn.. 00 sn = 0; the range is infinite, and the sequence is bounded. (b) lf sn = n2, the sequence {sn} is unbounded, is divergent, and has infinite range. (c) If Sn = 1 + [( - 1)"/n], the sequence {sn} converges to I, is bounded, and has infinite range. (d) If Sn= i", the sequence {sn} is divergent, is bounded, and has finite range. (e) Ifs"= 1 (n = 1, 2, 3, ...), then {sn} converges to 1, is bounded, and has finite range. • We now summarize some important properties of convergent sequences in metric spaces. 3.2 Theorem Let {Pn} be a sequence in a metric space X. (a) {Pn} converges top e X if and only if every neighborhood ofp contains Pn for all but finitely many n. (b) If p e X, p' e X, and if{Pn} converges top and top', then p' = p. (c) If {Pn} converges, then {Pn} is bounded. (d) If E c X and ifp is a limit point ofE, then there is a sequence {Pn} in E such that p = lim Pn . n ➔ co Proof (a) Suppose Pn ➔ p and let V be a neighborhood of p. For some e > 0, the conditions d(q, p) < e, q e X imply q e V. Corresponding to this e, there exists N such that n ~ N implies d(pn, p) < e. Thus n ~ N implies Pn e V. Conversely, suppose every neighborhood of p contains all but finitely many of the Pn . Fix e > 0, and let V be the set of all q e X such that d(p, q) < e. By assumption, there exists N (corresponding to this V) such that Pn e V if n ~ N. Thus d(Pn,P) < e if n ~ N; hence Pn ➔p. NUMERICAL SEQUENCES AND SERIES 49 (b) Let e > 0 be given. There exist integers N, N' such that n~N implies d(p., p) < 2e' n~N' implies d(pn, , p) < 2e - Hence if n ~ max (N, N'), we have d(p, p') :::;; d(p, p.) + d(p., p') < e. Since e was arbitrary, we conclude that d(p, p') = 0. (c) Suppose P. - p. There is an integer N such that n > N implies d(p., p) < 1. Put r = max {l, d(Pi, p), ... , d(pN, p)}. Then d(p.,p) ::S: r for n = 1, 2, 3, .... (d) For each positive integer n, there is a point p. EE such that d(p., p) < 1/n. Given e > 0, choose N so that Ne > 1. If n > N, it follows that d(p., p) < e. Hence Pn - p. This completes the proof. For sequences in Rk we can study the relation between convergence, on the one hand, and the algebraic operations on the other. We first consider sequences of complex numbers. 3.3 Theorem Suppose {s.}, {t.} are complex sequences, and limn ➔ oo s. = s, limn- 00 tn = t. Then (a) lim(s.+tn)=s+t; • (b) Jim cs.= cs, lim (c + sn) = c + s,for any number c; n-+oo n ➔ oo (c) Jim s.t. = st; n ➔ oo (d) lim .!_=~,provided s. # 0 (n = 1, 2, 3, ...), ands# 0. n ➔ oo Sn S Proof (a) Given e > 0, there exist integers N1, N 2 such that n ~ N 1 implies n ~ N 2 implies 50 PRINCIPLES OF MATHEMATICAL ANALYSIS If N = max (Ni, N 2), then n :2:: N implies l(s,.+tn)-(s+t)I S lsn-sl + ltn-tl 0, there are integers Ni, N 2 such that J;, n :2:: N1 implies lsn - sl < J;. n :2:: N 2 implies Itn - ti < If we take N = max (Ni, N 2 ), n :2:: N implies l(sn-s)(t,.-t)I 0, there is an integer N > m such that n :2:: N implies lsn - sl < ½lsl 2s. Hence, for n :2:: N, I 1-S1n - -11 s = - - Sn - SI sn3 0 there corresponds an integer N such that n c?: N implies Jk8 101:J,h - 01:JI < (1 SjS k). Hence n c?: N implies ={f lx,.-xl l01:1,n-01:Jl 2}112 < 8 , J=1 so that Xn ➔ x. This proves (a). Part (b) follows from (a) and Theorem 3.3. SUBSEQUENCES 3.5 Definition Given a sequence {pn}, consider a sequence {nk} of positive integers, such that n1 < n2 < n3 < •••. Then the sequence {p,.,} is called a subsequence of {p,.}. If {p,.1} converges, its limit is called a subsequential limit o f {p,.}. It is clear that {p,.} converges to p if and only if every subsequence of {Pn} converges top. We leave the details of the proof to the reader. 3.6 Theorem (a) If {p,.} is a sequence in a compact metric space X, then some subsequence of{Pn} converges to a point of X. (b) Every bounded sequence in R' contains a convergent subsequence. 52 PRINCIPLES OP MATHEMAnCAL ANALYSIS Proof (a) Let Ebe the range of {Pn}. If E is finite then there is a p e E and a sequence {n1} with n1 < n2 < n3 < ••·, such that Pn, -Pna - ••• -p. The subsequence {p,.1} so obtained converges evidently to p. If E is infinite, Theorem 2.37 shows that E has a limit point p e X. Choose n1 so that d(p, Pn.) < 1. Having chosen ni, ... , n,-i, we see from Theorem 2.20 that there is an integer n1 > n1_ 1 such that d(P,Pn,) < 1/i. Then {p,.,} converges to p. (b) This follows from (a), since Theorem 2.41 implies that every bounded subset of ,RA Jies in a compact subset of ,RA, 3.7 Theorem The subsequential limits of a sequence {p,.} in a metric space X form a closed subset of X. Proof Let E* be the set of all subsequential limits of {Pn} and let q be a limit point of E*. We have to show that q e E*. Choose n1 so that Pni r/t- q. (If no such n1 exists, then E* has only one point, and there is nothing to prove.) Put 6 =d(q, p,..). Suppose n1, ... , n1_ 1 are chosen. Since q is a limit point of E*, there is an x e E* r'6. with d(x, q) < i-16. Since x e E*, there is an n1 > n1_ 1 such that d(x,p,.1) < Thus d(q,pn,) s 21 - 16 = for i 1, 2, 3, .... This says that {p,.,} converges to q. Hence q e E*. CAUCHY SEQUENCES 3.8 Definition A sequence {p,,} in a metric space X is said to be a Cauchy sequence if for every e > 0 there is an integer N such that d(p,., p,,,) < e if n ~ N andm~N. In our discussion of Cauchy sequences, as well as in other situations which will arise later, the following geometric concept will be useful. 3,9 Definition Let E be a nonempty subset of a metric space X, and let S be the set of all real numbers of the form ld(p, q), with ,p e E and q e E. the sup of S is called the diameter of E. NUMERICAL SEQUENCES AND SERIES 53 If(pn}is a sequence in X and if EN consists of the pointspN, PN+t,PN+ 2 , ••• , it is clear from the two preceding definitions that (Pn} is a Cauchy sequence if and only if lim diam EN = 0. 3.10 Theorem (a) If E is the closure of a set E in a metric space X, then diam E = diam E. (b) If Kn is a sequence of compact sets in X such that Kn::::, Kn+l (n = 1, 2, 3, ...) and if lim diam Kn = 0, n n ➔ ao then j Kn consists of exactly one point. Proof (a) Since E c E, it is clear that diam E ::,;; diam E. Fix e > 0, and choose p e E, q e E. By the definition of E, there are points p', q', in E such that d(p, p') < e, d(q, q') < e. Hence d(p, q) :s; d(p, p') + d(p' q') + d(q', q) < 2e + d(p', q') :s; 2e -+ diam E. It follows that diam E :s; 2e + diam E, n and since e was arbitrary, (a) is proved. (b) Put K = j Kn. By Theorem 2.36, K is not empty. If K contains more than one point, then diam K > 0. But for each n, Kn ::::, K, so that diam Kn ~ diam K. This contradicts the assumption that diam Kn ➔ 0. 3.11 Theorem (a) In any metric space X, every convergent sequence is a Cauchy sequence. (b) If Xis a compact metric space and if{Pn} is a Cauchy sequence in X, then {Pn} converges to some point of X. (c) In Rk, every Cauchy sequence converges. Note: The difference between the definition of convergence and the definition of a Cauchy sequence is that the limit is explicitly involved in the former, but not in the latter. Thus Theorem 3.ll(b) may enable us 54 PRINCIPLES OF MATHEMATICAL ANALYSIS to decide whether or not a given sequence converges without knowledge of the limit to which it may converge. The fact (contained in Theorem 3.11) that a sequence converges in Rk if and only if it is a Cauchy sequence is usually called the Cauchy criterion for convergence. Proof (a) If Pn -+ p and if e > 0, there is an integer N such that d(p, Pn) < e for all n ~ N. Hence d(pn, Pm) ~ d(Pn, p) + d(p, Pm) < 2e as soon as n ~ N and m ~ N. Thus {pn} is a Cauchy sequence. (b) Let {Pn} be a Cauchy sequence in the compact space X. For N = l, 2, 3, ... , let E 11 be the set consisting of PN, PN+t, PN+2, ... . Then (3) lim diam EN= 0, N-+~ by Definition 3.9 and Theorem 3.lO(a). Being a closed subset of the compact space X, each EN is compact (Theorem 2.35). Also EN::::> EN+ 1, so that EN::::> EN+1• Theorem 3.l0(b) shows now that there is a unique p e X which lies in every EN. Let e > 0 be given. By (3) there is an integer NO such that diam EN< e if N ~ N0 . Since p e EN, it follows that d(p, q) < e for every q e EN, hence for every q e EN. In other words, d(p, Pn) < e if n ~ NO • This says precisely that Pn ➔ p. (c) Let {xn} be a Cauchy sequence in Rk. Define EN as in (b), with x1 in place of Pi. For some N, diam EN < 1. The range of {xn} is the union of EN and the finite set {x1, ... , xN_ 1}. Hence {x,,} is bounded. Since every bounded subset of K has compact closure in K (Theorem 2.41), (c) follows from (b). 3.12 Definition A metric space in which every Cauchy sequence converges is said to be complete. Thus Theorem 3.11 says that all compact metric spaces and all Euclidean spaces are complete. Theorem 3.11 implies also that every closed subset E of a complete metric space X is complete. (Every Cauchy sequence in Eis a Cauchy sequence in X, hence it converges to some p e X, and actually p e E since Eis closed.) An example of a metric space which is not complete is the space of an rational numbers, with d(x, y) = Ix - y I, NUMERICAL SEQUENCES AND SERIES 55 Theorem 3.2(c) and example (d) of Definition 3.1 show that convergent sequences are bounded, but that bounded sequences in R" need not converge. However, there is one important case in which convergence is equivalent to boundedness; this happens for monotonic sequences in R1. 3.13 Definition A sequence {sn} of real numbers is said to be (a) monotonically increasing if Sn s; Sn+i (n = 1, 2, 3, ...); (b) monotonically decreasing if Sn~ Sn+i (n = 1, 2, 3, ...). The class of monotonic sequences consists of the increasing and the decreasing sequences. 3.14 Theorem Suppose {sn} is monotonic. Then {s.} converges if and only if it is bounded. Proof Suppose Sn s; Sn+i (the proof is analogous in the other case). Let E be the range of {sn}. If {sn} is bounded, let s be the least upper bound of E. Then s. s; s (n = 1, 2, 3, ...). For every s > 0, there is an integer N such that S- 8 s*, there is an integer N such that n:?: N implies Sn< x. Moreover, s* is the only number with the properties (a) and (b). Of course, an analogous result is true for s•. Proof (a) Ifs* = +oo, then Eis not bounded above; hence {sn} is not bounded above, and there is a subsequence {snk} such that snk ➔ + oo. Ifs* is real, then Eis bounded above, and at least one subsequential limit exists, so that (a) follows from Theorems 3.7 and 2.28. If s• = - oo, then E contains only one element, namely - oo, and there is no subsequential limit. Hence, for any real M, Sn > M for at most a finite number of values of n, so that sn--+ - oo. This establishes (a) in all cases. (b) Suppose there is a number x > s* such that Sn ~ x for infinitely many values of n. In that case, there is a number y e E such that y ~ x > s*, contradicting the definition of s*. Thuss* satisfies (a) and (b). To show the uniqueness, suppose there are two numbers, p and q, which satisfy (a) and (b), and supposep < q. Choose x such thatp < x < q. Since p satisfies (b), we have Sn < x for n ~ N. But then q cannot satisfy (a). NUMERICAL SEQUENCES AND SBRIBS 57 3.18 •Examples (a) Let {sn} be a sequence containing all rationals. Then every real number is a subsequential limit, and lim sup S11 = + 00, n..,.00 lim inf Sn= -oo. n..,.00 (b) Let Sn = ( -1 ") /[1 + (1/n)]. Then Jim sups"= 1, 11 ➔ 00 Jim inf Sn = - 1. n-+00 (c) For a real-valued sequence {sn}, lim Sn= s if and only if n-+00 Jim sup Sn = lim inf Sn = S. We close this section with a theorem which is useful, and whose proof is quite trivial: 3.19 Theorem Ifs,. :s; t11 for n ~ N, where N is fixed, then Jim inf s,. :s; Jim inf tn, ,.... 00 ,. ... 00 Jim sup Sn =S; Jim sup tn. n-+oo n-+oo SOME SPECIAL SEQUENCES We shall now compute the limits of some sequences which occur frequently. The proofs will all be based on the following remark: If O:s; x11 :s; s11 for n ~ N, where N is some fixed number, and if Sn ➔ 0, then Xn ➔ 0. 3.20 Theorem (a) If p > 0, then lim 1., = 0. ..... oon (b) If p > 0, then lim ::fp = 1. ,. ... 00 (c) Jim :jn = 1. n..,.00 (d) If p > 0 and oc is real, then lim (l 11-+00 ,f" ) + P 11 = 0. (e) Iflxl <1,thenlimx"=0. ,. .... 00 58 PRINCIPLES OF MATHEMATICAL ANALYSIS Proof (a) Take n > (1/e)11". (Note that the archimedean property of the real number system is used here.) (b) If p > 1, put Xn = ::/P - 1. Then Xn > 0, and, by the binomial theorem, so that 0 «, k > 0. For n > 2k, (l )" + P > '") k = n(n u P - 1) .. • (n k! k + 1) k nky. p > 2kk! Hence 0 n,. 2kk! m-k < (1 +pt< p" n Since« - k < 0, n"'-" -+O, by (a). (e) Take « = 0 in (d). (n > 2k). SERIES In the remainder of this chapter, all sequences and series under consideration will be complex-valued, unless the contrary is explicitly stated. Extensions of some of the theorems which follow, to series with terms in R", are mentioned in Exercise 15. NUMERICAL SEQUENCES AND SERIES 59 3.21 Definition Given a sequence {a.}, we use the notation n=p to denote the sum aP + ap+t + ••• + aq. With {a.} we associate a sequence {sn}, where For {sn} we also use the symbolic expression or, more concisely, a1 + a2 + a3 + ••• (4) The symbol (4) we call an in.finite series, or just a seriPs. The numbers s. are called the partial sums of the series. If {s.} converges to s, we say that the series converges, and write L00 a.= s. n= l The number s is called the sum of the series; but it should be clearly under- stood that s is the limit of a sequence of sums, and is not obtained simply by addition. If {s.} diverges, the series is said to diverge. Sometimes, for convenience of notation, we shall consider series of the form 00 (5) La .. n=O And frequently, when there is no possible ambiguity, or when the distinction is immaterial, we shall simply write I:an in place of (4) or (5). It is clear that every theorem about sequences can be stated in terms of series (putting a1 = s1, and a. = s. - s. _1 for n > 1), and vice versa. But it is nevertheless useful to consider both concepts. The Cauchy criterion (Theorem 3.11) can be restated in the following form: 3,22 Theorem I:a. converges if and only iffor every B > 0 there is an integer N such that (6) ifm~n~N. 60 Pll.INCIPLES OF MATHEMATICAL ANALYSIS In particular, by taking m =n, (6) becomes In other words: (n ~ N). 3.23 Theorem /JI.an converges, then limn-+ao an= 0. The condition a,, ➔ 0 is not, however, sufficient to ensure convergence of I.a•. For instance, the series r -1 00 n=l n diverges; for the proof we refer to Theorem 3.28. Theorem 3.14, concerning monotonic sequences, also has an immediate counterpart for series. 3.24 Theorem A series of nonnegative1 terms converges if and only if its partial sums form a bounded sequence. We now tum to a convergence test of a different nature, the so-called "comparison test." 3.25 Theorem (a) If Ia. I S Cn for n ~ N0 , where N0 is some fixed integer, and if "£c,, converges, then Ian converges. (b) If a,,~ dn ~ 0 for n ~ N0 , and if "£d,, diverges, then I.a,, diverges. Note that (b) applies only to series of nonnegative terms a,,. Proof Given e > 0, there exists N ~ N 0 such that m ~ n ~ N implies L"' c" ~ e, k=n by the Cauchy criterion. Hence IJn°"' s,,tla1:I ~J/"~e, and (a) follows. Next, (b) follows from (a), for if Ian converges, so must Id,, [note that (b) also follows from Theorem 3.24]. 1 The expression " nonnegative" always refers to real numbers. NUMERICAL SEQUENCES AND SERIES 61 The comparison test is a very useful one; to use it efficiently, we have to become familiar with a number of series of nonnegative terms whose convergence or divergence is known. SERIES OF NONNEGATIVE TERMS The simplest of all is perhaps the geometric series. 3.26 Theorem //0 s; x < 1, then I00 x " = 1 -- · n=O 1- X If x ~ 1, the series diverges. Proof If x ~ 1, L n 1-x"+l s n = /c=O J ! =1 -- - X · The result follows if we let n--+ oo. For x = 1, we get 1+1+1+"', which evidently diverges. In many cases which occur in applications, the terms of the series decrease monotonically. The following theorem of Cauchy is therefore of particular interest. The striking feature of the theorem is that a rather "thin" subsequence of {an} determines the convergence or divergence of I:an. Li~= 3.27 Theorem Suppose a1 ~ a2 ~ a3 ~ • • • ~ 0. Then the series 1 an con- verges if and only if the series (7) converges. 00 L 2"a2k = a1 + 2a2 + 4a4 + 8a8 + ••• /c=O Proof By Theorem 3.24, it suffices to consider boundedness of the partial sums. Let Sn = a1 + a2 + ••• + an ' t1c = a1 + 2a2 + ••• + 21ca2k. 62 PRINCIPLES OF MATHEMATICAL ANALYSIS For n < 2k, Sn s a1 + (a2 + a 3) + ••• + (a2k + ••• + a2 k+i- 1) s a1 + 2a2 + ••• + 2ka2k so that (8) On the other hand, if n > 2\ so that sn ~ a1 + a2 + (a3 + a4 ) + ••• + (a2k- 1 +1 + ••• + a2k) ~ ½a1 + a2 + 2a4 + ••• + 2k- 1a2k = ½tk' (9) By (8) and (9), the sequences {s"} and {tk} are either both bounded or both unbounded. This completes the proof. L - 3.28 Theorem I converges if p > I and diverges if p s 1. nP Proof If p s 0, divergence follows from Theorem 3.23. If p > 0, Theorem 3.27 is applicable, and we are led to the series I = I "' 2k . k1p "' 2(1 - p)k_ k=O 2 k=O Now, 21 -p < 1 if and only if 1 - p < 0, and the result follows by com- parison with the geometric series (take x = 21 - P in Theorem 3.26). As a further application of Theorem 3.27, we prove: 3.29 Theorem If p > I, (10) "' I J2n(logn)P converges,· ifp s I, the series diverges. Remark "log n" denotes the logarithm of n to the base e (compare Exercise 7, Chap. 1); the number e will be defined in a moment (see Definition 3.30). We let the series start with n = 2, since log 1 = 0. NUMERICAL SEQUENCES AND SERIES 63 Proof The monotonicity of the logarithmic function (which will be discussed in more detail in Chap. 8) implies that {log n} increases. Hence {l/n log n} decreases, and we can apply Theorem 3.27 to (10); this leads us to the series 00 1 00 1 1 1 00 (11) !"" • 1"'f 21(log 21)P =1~1 (k log 2)P = (log 2)P1~1 kP' and Theorem 3.29 follows from Theorem 3.28. This procedure may evidently be continued. For instance, 00 1 (12) n~3 n log n log log n diverges, whereas 00 1 (13) n"'f3 n log n(log log n)2 converges. We may now observe that the terms of the series (12) differ very little from those of (13). Still, one diverges, the other converges. If we continue the process which led us from Theorem 3.28 to Theorem 3.29, and then to (12) and (13), we get pairs of convergent and divergent series whose terms differ even less than those of (12) and (13). One might thus be led to the conjecture that there is a limiting situation of some sort, a "boundary" with all convergent series on one side, all divergent series on the other side-at least as far as series with monotonic coefficients are concerned. This notion of "boundary" is of course quite vague. The point we wish to make is this: No matter how we make this notion precise, the conjecture is false. Exercises ll(b) and 12(b) may serve as illustrations. We do not wish to go any deeper into this aspect of convergence theory, and refer the reader to Knapp's "Theory and Application of Infinite Series," Chap. IX, particularly Sec. 41. THE NUMBER e f 3.30 Definition e = ..!_• 11-on! Here n! = 1 • 2 • 3 .. •n if n ~ l, and O! = 1. 64 PRINCIPLES OF MATHEMATICAL ANALYSIS Since 1 1 1 s• =1+11+ ·2 -+ 1·2-·-3-+···1+·2-..-·-n 1 I 1 < 1 + 1 +2 - 2+2 - + " · +2·-- < 1 3 ' the series converges, and the definition makes sense. In fact, the series converges very rapidly and allows us to compute e with great accuracy. It is of interest to note that e can also be defined by means of another limit process; the proof provides a good illustration of operations with limits: 3.31 Theorem lim (1 + !)" = e. n ➔ oo n Proof Let n 1 s. = L k'' k=O • By the binomial theorem, Hence t. :$; s., so that (14) lim sup tn :$; e, n ➔ oo by Theorem 3.19. Next, if n ;:=: m, t. ;:=: 1 + I + -21! ( 1 - -I) n + ••• + m-1! ( 1 - -I) ••• ( 1 - n - m --1) · n Let n ➔ ex:,, keeping m fixed. We get so that 11. m m. f n ➔ cfJ t. ;:=: 1 + 1 + 1 21 • + .. • + 1 m1 . , n ➔ OO Letting m ➔ oo, we finally get (15) e :$; lim inf t• . n ➔ oo The theorem follows from (14) and (15). NUMERICAL SEQUENCES AND SERIES 65 The rapidity with which the series L _!_ converges can be estimated as nl follows: If Sn has the same meaning as above, we have 1 1 1 e - Sn = (n + 1) ! + (n + 2) I + (n + 3) ! + ••• so that (16) < -1- - {1 + -1- + - -1- + · · ·}=1- (n+l)! n+l (n+1)2 n!n O < e - s n 1, tan diverges,· (c) if ix = 1, the test gives no information. 66 PRINCIPLES OF MATHEMATICAL ANALYSIS Proof If ex < 1, we can choose P so that ex < p < 1, and an integer N such that vTciJ

1, then, again by Theorem 3.17, there is a sequence {nk} such that Hence Ia" I > 1 for infinitely many values of n, so that the condition an ➔ O, necessary for convergence of "£a", does not hold (Theorem 3.23). To prove (c), we consider the series For each of these series ex= 1, but the first diverges, the second converges. 3.34 Theorem (Ratio Test) The series "£a" I (a) converges 1i/11•m sup l-an+l < 1, n ➔ oo an I !~ ~ (b) diverges if 0:: 1 1for all n n0 , where n0 is some fixed integer. Proof If condition (a) holds, we can find p < 1, and an integer N, such that for n ~ N. In particular, laN+il < PlaNI, IaN +2 I < PI aN +1 I < P2 IaN I, NUMERICAL SEQUENCES AND SERIES 67 That is, lanl < laNIP-N' pn for n ~ N, and (a) follows from the comparison test, since r.p• converges. If Ian+ 1 1 ~ Ia. I for n ~ n0 , it is easily seen that the condition a. ➔ 0 does not hold, and (b) follows. Note: The knowledge that lim a.+ 1/a. = 1 implies nothing about the convergence of r.a•. The series 'f.1/n and 'f.1/n2 demonstrate this. 3.35 Examples (a) Consider the series 11 1 1 1 1 1 1 2+ 3+ 22 + 32 + 23 + 33 + 24 + 34 + for which lim inf an+l = lim (~)" = 0, n ➔ oo an n ➔ oo 3 h•m sup :n:/-a. -_ h•m 2~n --1;;_ - ✓_ 1-• n ➔ oo •""oo 2 2 I1•mn ➔su pa.-+-1 a. = n1➔1. moo -1 (3- )" 2 2 = + oo. The root test indicates convergence; the ratio test does not apply. (b) The same is true for the series I 11 1 1 1 1 2 + 1 + 8 + 4 + 32 + 16 + 128 + 64 + ••• ' where but I1.m m. fa-.+-1 =-1, n ➔ oo a. 8 = lim sup an+l 2, n➔ oo a. 68 PRINCIPLES OF MATHEMATICAL ANALYSIS 3.36 Remarks The ratio test is frequently easier to apply than the root test, since it is usually easier to compute ratios than nth roots. However, the root test has wider scope. More precisely: Whenever ihe ratio test shows convergence, the root test does too; whenever the root test is inconclusive, the ratio test is too. This is a consequence of Theorem 3.37, and is illustrated by the above examples. Neither of the two tests is subtle with regard to divergence. Both deduce divergence from the fact that an does not tend to zero as n-+ oo. 3.37 Theorem For any sequence {en} ofpositive numbers, It•m t'nf -Cn-+l :S: lim inf VnCen, n-+oo Cn n-ooo v h'm sup nCen :S: h'm suCpn+-l· n ➔ oo n ➔ oo Cn Proof We shall prove the second inequality; the ~roof of the first is quite similar. Put = oi I1,m supC-n+-l• n-+oo Cn If oi = + oo, there is nothing to prove. If oi is finite, choose P> oi. There is an integer N such that for n ~ N. In particular, for any p > 0, (k = 0, 1, ... , p - 1). Multiplying these inequalities, we obtain CN+,, :S: P"cN, or (n ~N). Hence so that (18) lim sup ::Jc,. :S: P, 11-000 NUMERICAL SEQUENCES AND SERIES 69 by Theorem 3.20(b). Since (18) is true for every p > ix, we have lim sup :fc,. s; ix. n ➔ oo POWER SERIES 3.38 Definition Given a sequence {c,.} of complex numbers, the series (19) L00 c,.z" 11=0 is called a power series. The numbers c,. are called the coefficients of the series; z is a complex number. In general, the series will converge or diverge, depending on the choice of z. More specifically, with every power series there is associated a circle, the circle of convergence, such that (19) converges if z is in the interior of the circle and diverges if z is in the exterior (to cover all cases, we have to consider the plane as the interior of a circle of infinite radius, and a point as a circle of radius zero). The behavior on the circle of convergence is much more varied and cannot be described so simply. 3.39 Theorem Given the power series !cnz", put a =lim sup .Ylc:"I, 1 R=-· (X (//a = 0, R = + oo; if a = + oo, R = 0.) Then 2::c,. z" converges if Iz I < R, and diverges if izl > R. Proof Put a,.= c,.z", and apply the root test: limsupy,.l;a-,.!= izl limsup,v,.1-!c,.I =Rlzl· n ➔ co n-+oo Note: R is called the radius of convergence of Ic,. z". 3.40 Examples (a) The series In" z" has R = 0. L (b) The series ~ n. has R = + oo. (In this case the ratio test is easier to apply than the root test.) 70 PRINCIPLES OF MATHEMATICAL .ANALYSIS (c) The series :Ezn has R = 1. If lzl = 1, the series diverges, since {zN} does not tend to Oas n ➔ co. L (d) The series !'.'. has R = 1. It diverges if z = 1. It converges for all n other z with lzl = 1. (The last assertion will be proved in Theorem 3.44.) L (e) The series ~ n has R = 1. It converges for all z with lzl = 1, by the comparison test, since Izn/n2 j = 1/n2. SUMMATION BY PARTS 3.41 Theorem Given two sequences {an}, {b,.}, put ifn~0;putA_ 1 =0. Then, i/0-5.p-5.q, we have (20) Proof and the last expression on the right is clearly equal to the right side of (20). Formula (20), the so-called "partial summation formula," is useful in the investigation of series of the form :Ean bn, particularly when {bn} is monotonic. We shall now give applications. 3.42 Theorem Suppose (a) the partial sums An of:Eanform a bounded sequence,· (b) ho ~ bi ~ b2 ~ .. • ; (c) lim bn = 0. n ➔ oo Then :Ean b,. converges. NUMERICAL SEQUENCES AND SERIES 71 Proof Choose M such that IA. I ~ M for all n. Given e > 0, there is an integer N such that bN ~ (e/2M). For N ~ p ~ q, we have I 1-1 I Zq:a.b. n=p qI-A1.Cb. - n=p b.+1)+Aqbq - Ap-lbp,I I ~ M Iq-1 J}b• - b,,+ 1) + bq + bPj = 2MbP ~ 2MbN ~ e. Convergence now follows from the Cauchy criterion. We note that the first inequality in the above chain depends of course on the fact that b. - b.+t ~ 0. 3.43 Theorem Suppose (a) led~ ic2I ~ ic3I ~ .. ·; (b) C2m-l ~ 0, C2m ~ 0 (m = 1, 2, 3, ...); (c) lim. ➔ 00 c. = 0. Then I:c. converges. Series for which (b) holds are called "alternating series"; the theorem was known to Leibnitz. Proof Apply Theorem 3.42, with a.= (-1)"+ 1, b. = Ic. I, 3.44 Theorem Suppose the radius of convergence of :Ee. z" is 1, and suppose = c0 ~ c1 ~ c2 ~ • • ·, lim. ➔ 00 c. 0. Then :Ec.z" converges at every point on the circle Iz I = 1, except possibly at z = I. Proof Put a.= z", b. = c11 • The hypotheses of Theorem 3.42 are then satisfied, since if Iz I = 1, z =1= 1. ABSOLUTE CONVERGENCE The series I:an is said to converge absolutely if the series I: Ian I converges. 3.45 Theorem lfI:a. converges absolutely, then I:a. converges. 72 PRINCIPLES OF MATHEMATICAL ANALYSIS Proof The assertion follows from the inequality ~ IJnakl ktlakl, plus the Cauchy criterion. 3.46 Remarks For series of positive terms, absolute convergence is the same as convergence. If I.an converges, but I, Ian I diverges, we say that I.an converges non- absolutely. For instance, the series I(-- 1n )" converges nonabsolutely (Theorem 3.43). The comparison test, as well as the root and ratio tests, is really a test for absolute convergence, and therefore cannot give any information about nonabsolutely convergent series. Summation by parts can sometimes be used to handle the latter. In particular, power series converge absolutely in the interior of the circle of convergence. We shall see that we may operate with absolutely convergent series very much as with finite sums. We may multiply them term by term and we may change the order in which the additions are carried out, without affecting the sum of the series. But for nonabsolutely convergent series this is no longer true, and more care has to be taken when dealing with them. ADDITION AND MULTIPLICATION OF SERIES 3.47 Theorem If I.an = A, and r.b" = B, then r.(an + b") = A + B, and I.can = cA, for any fixed c. Proof Let Then n L An + Bn = (ak + bk). k=O Since limn ➔ oo An= A and limn ➔ oo Bn = B, we see that The proof of the second assertion is even simpler. NUMERICAL SEQUENCES AND SERIES 73 Thus two convergent series may be added term by term, and the resulting series converges to the sum of the two series. The situation becomes more complicated when we consider multiplication of two series. To begin with, we have to define the product. This can be done in several ways; we shall consider the so-called "Cauchy product." 3.48 Definition Given ta. and r.b., we put n L c. = akbn-k k=O (n=0,1,2, ...) and call r.c. the product of the two given series. This definition may be motivated as follows. If we take two power series ta. z" and th. z", multiply them term by term, and collect terms containing the same power of z, we get ro oo 'Lna z" • 'Lnb z" = (aO + aI z + a2 z2 + .. ·)(b0 + b1z + b2 z2 + ••·) n=O n=O = a0 b0 + (a0 b1 + a1b0 )z + (a0 b2 + a1b1 + a2 b0 )z2 + ••• = Co + C1Z + Cz z2 + ''' , Setting z = 1, we arrive at the above definition. 3.49 Example If and An ...... A, B• ...... B, then it is not at all clear that {C.} will converge to AB, c. since we do not have = A. Bn . The dependence of {C.} on {A.} and {B.} is quite a complicated one (see the proof of Theorem 3.50). We shall now show that the product of two convergent series may actually diverge. The series I (-1)" = 1--1 +-1 _ _1 + ... n=oJn + 1 J2 J3 J4 converges (Theorem 3.43). We form the product of this series with itself and obtain 74 PRINCIPLES OF MATHEMATICAL ANALYSIS so that Since n 1 Cn=(-ltL--:=====• k=0 J(n - k + l)(k + 1) (n-k+l)(k+l)=(i+tf-(i-kf :S;(i+1f. we have I C n I f :.?: k.=£..on- 2 _ 2(n + 1) -+-2- - "n-+-2- , so that the condition en ➔ 0, which is necessary for the convergence of ten, is not satisfied. In view of the next theorem, due to Mertens, we note that we have here considered the product of two nonabsolutely convergent series. 3.50 Theorem Suppose (a) (b) (c) (d) Then 00 L an converges absolutely, n•0 00 }:an =A, n=0 00 Lb,. =B, n=0 n Cn = L akbn-k (n = 0, 1, 2, ...). k=0 00 L Cn =AB. n=0 That is, the product of two convergent series converges, and to the right value, if at least one of the two series converges absolutely. Proof Put Then en= aobo + (aob1 + a1bo) + ''' + (aobn + a1bn-l + ''' + a,.bo) = a0Bn + a1 Bn-i + ••• + an Bo = ao(B + Pn) + a1(B + Pn-1) + ••• + anCB + Po) = AnB + aoPn + a1Pn-1 + •••+ anPo NUMERICAL SEQUENCES AND SERIES 75 Put Yn = aoP. + a1P.-1 +.••+a.Po • c. We wish to show that ➔ AB. Since A. B ➔ AB, it suffices to show that (21) Put [It is here that we use (a).] Let e > 0 be given. By (c), P. ➔ o. Hence we can choose N such that IP. I ~ e for n ~ N, in which case IY.I ~ lfJoa.+ .. ·+fJNan-NI + IPN+1an-N-1+·"+fJ.aol ~ IPoa. + .. •+ fJNa.-NI + B(X. Keeping N fixed, and letting n ➔ oo, we get Jim sup IY. I ~ B(X, since ak ➔ 0 ask ➔ oo. Since e is arbitrary, (21) follows. Another question which may be asked is whether the series re., if convergent, must have the sum AB. Abel showed that the answer is in the affirmative. 3.51 Theorem If the series ra., l:b., re. converge to A, B, C, and c. = a0 b0 + ••• + a. b0 , then C = AB. Here no assumption is made concerning absolute convergence. We shall give a simple proof (which depends on the continuity of power series) after Theorem 8.2. REARRANGEMENTS 3.52 Definition Let {kn}, n = 1, 2, 3, ... , be a sequence in which every positive integer appears once and only once (that is, {kn} is a 1-1 function from J onto J, in the notation of Definition 2.2). Putting (n = 1, 2, 3, ...), we say that I:a~ is a rearrangement of I:an . 76 PRINCIPLES OF MATHEMATICAL ANALYSIS If {s.}, {s~} are the sequences of partial sums of La., La~, it is easily seen that, in general, these two sequences consist of entirely different numbers. We are thus led to the problem of determining under what conditions all rearrangements of a convergent series will converge and whether the sums are necessarily the same. 3.53 Example Consider the convergent series (22) and one of its rearrangements (23) 1 +t-½+t+t-¼+½+fr-¾+ ... in which two positive terms are always followed by one negative. If s is the sum of (22), then Since s s3= i, n-->oo so that (23) certainly does not converge to s [we leave it to the reader to verify that (23) does, however, converge]. This example illustrates the following theorem, due to Riemann. 3.54 Theorem Let La. be a series of real numbers which converges, but not absolutely. Suppose P - 00 S Ct S S 00. Then there exists a rearrangement La~ with partial sums s~ such that (24) lim inf s~ = et, lim sups~= p. n ➔ oo n ➔ oo Proof Let (n = 1, 2, 3, ...). NUMERICAL SEQUENCES AND SERIES 77 Then Pn - q" = an, Pn + qn = Ian I, Pn ;;:: 0, qn ~ 0. The series r.pn, r,qn must both diverge. For if both were convergent, then !,(pn + qn} = !:Ian I would converge, contrary to hypothesis. Since L L L L N N N N an = (pn - qn) = Pn - qn • n=l n=1 n=1 n=l divergence of r.pn and convergence of r.qn (or vice versa) implies divergence of !:an , again contrary to hypothesis. Now let Pi, P 2 , P3 , •.. denote the nonnegative terms of r.an, in the order in which they occur, and let Q1, Q2 , Q3 , ••• be the absolute values of the negative terms of r.an, also in their original order. The series r,pn, r.Qn differ from r.pn, r.qn only by zero terms, and are therefore divergent. We shall construct sequences {mn}, {kn}, such that the series (25} P1 +•••+Pm, - Q1 - ••• - Qk, +Pm,+1 + •·• + pm2 - Qk, +1 - •• • - Qk2 + •••, which clearly is a rearrangement of r.an, satisfies (24). Choose real-valued sequences {ot:n}, {Pn} such that ot:n ➔ ot:, /3n ➔ /3, 0. Let mi, k1 be the smallest integers such that Pi+•••+ Pm,> P1, Pi + •· • + Pm1 - Ql - • • • - Qk, < /32, P1 +•••+Pm, - Q1 - ••• - Qk, +Pm,+1 + ··• +Pm2 - Qk,+1 - ••• - Qk2 < ot:2; and continue in this way. This is possible since r,pn and r.Qn diverge. If Xn, Yn denote the partial sums of (25) whose last terms are Pmn, -Q""' then Since Pn ➔ O and Qn ➔ Oas n ➔ oo, we see that Xn ➔ P, Yn ➔ ot:. Finally, it is clear that no number less than ot: or greater than /3 can be a subsequential limit of the partial sums of (25). 78 PRINCIPLES OF MATHEMATICAL ANALYSIS 3.55 Theorem lf'E.an is a series ofcomplex numbers which converges absolutely, then every rearrangement of'I.an converges, and they all converge to the same sum. Proof Let ta; be a rearrangement, with partial sums s; . Given e > 0, there exists an integer N such that m ~ n ~ N implies m (26) Ela,l:s;e. l=n Now choose p such that the integers 1, 2, ... , N are all contained in the set k1, k2 , •.• , kP (we use the notation of Definition 3.52). Then if n > p, the numbers a1, ... , aN will cancel in the difference Sn - s~, so that s; Is" - I :s; B, by (26). Hence {s;} converges to the same sum as {s"}. EXERCISES 1. Prove that convergence of {s.} implies convergence of {ls.I}. Is the converse true? 2. Calculate lim (vn2 + n - n). ft ➔ OO 3. If S1 = vi, and (n = 1, 2, 3, ...), prove that {s.} converges, and thats.< 2 for n = 1, 2, 3, ... . 4. Find the upper and lower limits of the sequence {s.} defined by 5. For any two real sequences {a.}, {h.}, prove that Jim sup (a. + h.) ~ Jim sup a. + Jim sup h., n ➔ OO n ➔ OO provided the sum on the right is not of the form co - co. 6. Investigate the behavior (convergence or divergence) of !:a. if (a) a. =Vn + 1- v'n; (b) a_.-v'-n-+-l--v-n,, n (c) a.= (~n - 1)•; 1 (d) a. = 1 + z•' for complex values of z. 7. Prove that the convergence of !:a. implies the convergence of if a.;,::o. ~v'a. ""' n ' NUMERICAL SEQUENCES AND SERIES 79 8. If l:a. converges, and if {b.} is monotonic and bounded, prove that l:a.b. converges. 9. Find the radius of convergence of each of the following power series: 2• (b) I; ;;j z•, (c) I: 2• n2 z•, 10. Suppose that the coefficients of the power series I;a. z" are integers, infinitely many of which are distinct from zero. Prove that the radius of convergence is at most 1. 11. Suppose a. > 0, s. = 01 + ···+ a., and l:a. diverges. (a) Prove that I: 1 ~•a.diverges. (b) Prove that 0N+1 +•••+QN+t ~l-.!..!!_ SN+1 SN+k SN+k and deduce that I :~ s. diverges. (c) Prove that and deduce that "~t"'2sa.. converges. (d) What can be said about I: a. 1 +na. and I: 1 +0n"2a. ? 12. Suppose a. > 0 and l:a. converges. Put ....., r. = I: a., . (a) Prove that if m< n, and deduce that I:~ diverges. r. 80 PRINCIPLES OF MATHEMATICAL ANALYSIS (b) Prove that and deduce that I:,a;.-converges. -v r. 13. Prove that the Cauchy product of two absolutely convergent series converges absolutely. 14. If {sn} is a complex sequence, define its arithmetic means a. by (n =0, 1, 2, ...). (a) If Jim s. = s, prove that lim a. = s. (b) Construct a sequence {s.} which does not converge, although Jim an= 0. (c) Can it happen thats.> 0 for all n and that Jim sup Sn= oo, although Jim a.= 0? (d) Put a.= s. - Sn-1, for n ~ 1. Show that Sn - a. = 1 n -+ 1 L• kat. k=l Assume that Jim (na.) = 0 and that {a.} converges. Prove that {s.} converges. [This gives a converse of (a), but under the additional assumption that na.- 0.) (e) Derive the last conclusion from a weaker hypothesis: Assume M < oo, Ina. I-;;;, M for all n, and lim a. = a. Prove that Jim Sn = a, by completing the following outline: If m 0 and associate with each n the integer m that satisfies n-e m-::;;,1+e v;;, and define x 2, x3, x4, ••• , by the recursion formula ~= Xn +t (Xn + :.) . (a) Prove that {x.} decreases monotonically and that Jim x. = v;;, (b) Put e. = x. - v;, and show that so that, setting f3 = 2v;;, (ir· e.+1 1. Take Xi > v;, and define or:+ Xn or:-x! X n +1i.= ..'L.x-. = x . + 1 +x-.. (a) Prove that Xi > X3 > Xs > •••. (b) Prove that X2 < X4 < x 6 < ••• . (c) Prove that Jim x. = v;, (d) Compare the rapidity of convergence of this process with the one described in Exercise 16. 18. Replace the recursion formula of Exercise 16 by p-1 or: Xn+i =--x.+-x.-p+i p p where p is a fixed positive integer, and describe the behavior of the resulting sequences {x.}. 19. Associate to each sequence a= {or:.}, in which or:. is O or 2, the real number I: x(a) = "' 3tXn•. n=i Prove that the set of all x(a) is precisely the Cantor set described in Sec. 2.44. 82 PR.INCIPLJ!S OP MAmBMATICAL ANALYSIS 20. Suppose {p.} is a Cauchy sequence in a metric space X, and some subsequence {p.,} converges to a point p e X. Prove that the full sequence {p.} converges top. 21. Prove the following analogue of Theorem 3.lO(b): If {E.} is a sequence of closed nonempty and bounded sets in a complete metric space X, if E. => Ea+i. and if .....Jim diam E. =0, n then f E. consists of exactly one point. n 22. Suppose Xis a nonempty complete metric space, and {G.} is a sequence of dense open subsets of X. Prove Baire's theorem, namely, that fG. is not empty. (In fact, it 1s dense in X.) Hint: Find a shrinking sequence of neighbor- hoods E. such that E. c: G., and apply Exercise 21. 23. Suppose {p.} and {q.} are Cauchy sequences in a metric space X. Show that the sequence {d(p.,q.)} converges. Hint: For any m, n, d(p., q.) ~d(p.,p,.) + d(p,,.,q,.) + d(q,., q.); it follows that Id(p.' q.) - d(p,. ' q,,.) I is small if m and n are large. 24. Let X be a metric space. (a) Call two Cauchy sequences {p.}, {q.} in X equivalent if .....Jim d(p. , q.) = O. Prove that this is an equivalence relation. (b) Let X* be the set of all equivalence classes so obtained. If P e x•, Q e x•, {p,.} e P, {q.} e Q, define ..... t:i.(P, Q) = limd(p.,q.); by Exercise 23, this limit exists. Show that the number t:i.(P, Q) is unchanged if {p.} and {q.} are replaced by equivalent sequences, and hence that t:i. is a distance function in X*. (c) Prove that the resulting metric space X* is complete. (d) For each p e X, there is a Cauchy sequence all of whose terms are p; let P, be the element of X* which contains this sequence. Prove that t:i.(P,, P4) =d(p,q) for all p, q e X. In other words, the mapping

0 there exists a 6 > 0 such that (2) dr(/(x), q) < s for all points x e E for which (3) 0 < dx(x,p) < 6. The symbols dx and dr refer to the distances in X and Y, respectively. If X and/or Y are replaced by the real line, the complex plane, or by some euclidean space~. the distances dx, dr are ofcourse replaced by absolute values, or by norms of differences (see Sec. 2.16). It should be noted that p e X, but that p need not be a point of E in the above definition. Moreover, even if p e E, we may very well have f(p) 'F lim,. ➔,f(x). We can recast this definition in terms of limits of sequences: 4.2 Theorem Let X, Y, E, f, and p be as in Definition 4.1. Then (4) ,l.i..m., /(x) =q if and only if (5) lim f(pn) =q for every sequence {Pn} in E such that (6) Pn 'F P, lim Pn =p. n ➔ ao Proof Suppose (4) holds. Choose {Pn} in E satisfying (6). Let s > 0 be given. Then there exists /, > 0 such that dy(f(x), q) < s if x e E and O< dx(x, p) < 6. Also, there exists N such that n > N implies 0 < dx(pn ,p) < /J. Thus, for n > N, we have dr(/(pJ, q) < s, which shows that (5) holds. Conversely, suppose (4) is false. Then there exists some s > 0 such that for every /J > 0 there exists a point x e E (depending on /J), for which drif(x), q) ~ s but O< dx(x, p) < 6. Taking/," = 1/n (n = 1, 2, 3, ...), we thus find a sequence in E satisfying (6) for which (5) is false. Corollary Iff has a limit at p, this limit is unique. This follows from Theorems 3.2(b) and 4.2. CONTINUITY 85 4.3 Definition Suppose we have two complex functions, f and g, both defined on E. By f + g we mean the function which assigns to each point x of E the number f(x) + g(x). Similarly we define the difference f - g, the product Jg, and the quotient/jg of the two functions, with the understanding that the quotient is defined only at those points x of E at which g(x) ,;,. 0. Iff assigns to each point x of E the same number c, then f is said to be a constant function, or simply a constant, and we write f = c. If f and g are real functions, and if f(x) ~ g(x) for every x e E, we shall sometimes write/~ g, for brevity. Similarly, if f and g map E into Ji', we define f + g and f • g by (f + g)(x) = f(x) + g(x), (f • g)(x) = f(x) • g(x); and if l is a real number, (M)(x) = lf(x). 4.4 Theorem Suppose E c: X, a metric space, p is a limit point of E, f and g are complex functions on E, and lim /(x) =A, x-+p lim g(x) = B. x-+p Then (a) lim (f +g)(x) = A + B; x➔p (b) lim (fg)(x) =AB; x-+p (c) xli➔m, (!.)(x) g = ~, B if B #- 0. Proof In view of Theorem 4.2, these assertions follow immediately from the analogous properties of sequences (Theorem 3.3). Remark If f and g map E into Rk, then (a) remains true, and (b) becomes (b') lim (f • g)(x) = A • B. x-+p (Compare Theorem 3.4.) CONTINUOUS FUNCTIONS 4.5 Definition Suppose X and Y are metric spaces, E c: X, p e E, and f maps E into Y. Then f is said to be continuous at p if for every e > 0 there exists a '5 > 0 such that dr(/(x),f(p)) < e for all points x e E for which dx(x, p) < '5. If/is continuous at every point of E, then/is said to be continuous on E. It should be noted that f has to be defined at the point p in order to be continuous at p. (Compare this with the remark following Definition 4.1.) 86 PRINCIPLES OF MATHEMATICAL ANALYSIS If p is an isolated point of E, then our definition implies that every function / which has E as its domain of definition is continuous at p. For, no matter which e > 0 we choose, we can pick J > 0 so that the only point x e E for which dx(x, p) 0 be given. Since g is continuous at f(p), there exists 1'/ > 0 such that dz(g(y), g(f(p))) < e if d1(y,f(p)) < 11 and y ef(E). Since/ is continuous at p, there exists b > 0 such that dr(f(x),f(p)) < 1'/ if dx(x, p) 0 such that y E V if d y(f(p), y) < e; and since f is continuous at p, there exists o > 0 such that dy(f(x),f(p)) < e if dx(x, p) < o. Thus x Ef- 1(V) as soon as dx(x, p) < o. Conversely, suppose f- 1(V) is open in X for every open set Vin Y. Fix p EX and e > 0, let V be the set of ally E Y such that dy(y,f(p)) < e. o Then Vis open; hencef- 1(V) is open; hence there exists > 0 such that x E/- 1(V)as soon as dx(P, x) < o. But if x E f- 1(V), then f(x) EV, so that dy(f(x),f(p)) < e. This completes the proof. Corollary A mapping f of a metric space X into a metric space Y is continuous if and only iff- 1( C) is closed in X for every closed set C in Y. This follows from the theorem, since a set is closed if and only if its com- u- plement is open, and since/- 1(Ec) = 1(E)Y for every EC Y. We now turn to complex-valued and vector-valued functions, and to functions defined on subsets of Rk. 4.9 Theorem Let f and g be complex continuous functions on a metric space X. Thenf + g,fg, andf/g are continuous on X. In the last case, we must of course assume that g(x) #- 0, for all x E X. Proof At isolated points of X there is nothing to prove. At limit points, the statement follows from Theorems 4.4 and 4.6. 4.10 Theorem (a) Let / 1 , .h ... , be real functions on a metric space X, and let f be the mapping of X into Rk defined by (7) f(x) = (fi(x), ... ,/',ix)) (XE X); then f is continuous ifand only ifeach ofthe functionsJi., ... ,I,. is continuous. (b) If f and g are continuous mappings of X into Rk, then f + g and f • g are continuous on X. The functions Ji., ... , I,. are called the components off. Note that f + g is a mapping into Rk, whereas r•g is a real function on X. 88 PRINCIPLES OF MATHEMATICAL ANALYSIS Proof Part (a) follows from the inequalities ={ f 2}\ J.tj(x) -1.,(y)J :S: Jf(x) - f(y)J 1/,(x) - /i<.v)J 1=1 for j = l, ... , k. Part (b) follows from (a) and Theorem 4.9. 4.11 Examples If Xi, .•. , x,. are the coordinates of the point x e R", the functions 4>, defined by (8) are continuous on Rk, since the inequality 14>,(x) - i(y) I :S: Ix - YI shows that we may take D= e in Definition 4.5. The functions q,1 are sometimes called the coordinate functions. Repeated application of Theorem 4.9 then shows that every monomial (9) where ni, ... , nk are nonnegative integers, is continuous on R". The same is true of constant multiples of (9), since constants are evidently continuous. It follows that every polynomial P, given by (10) (x e R"), is continuous on R". Here the coefficients cni•"nk are complex numbers, n1, ... , n" are nonnegative integers, and the sum in (10) has finitely many terms. Furthermore, every rational function in Xi, ... , xk, that is, every quotient of two polynomials of the form (10), is continuous on Rk wherever the denominator is different from zero. From the triangle inequality one sees easily that (11) Hence the mapping x-+ Ix I is a continuous real function on R". If now f is a continuous mapping from a metric space X into Rk, and if 4> is defined on X by setting q,(p) = If(p) J, it follows, by Theorem 4.7, that 4> is a continuous real function on X. 4.12 Remark We defined the notion of continuity for functions defined on a subset E of a metric space X. However, the complement of E in X plays no role whatever in this definition (note that the situation was somewhat different for limits of functions). Accordingly, we lose nothing of interest by discarding the complement of the domain off This means that we may just as well talk only about continuous mappings of one metric space into another, rather than CONTINUITY 89 of mappings of subsets. This simplifies statements and proofs of some theorems. We have already made use of this principle in Theorems 4.8 to 4.10, and will continue to do so in the following section on compactness. CONTINUITY AND COMPACTNESS 4,13 Definition A mapping f of a set E into R" is said to be bounded if there is a real number M such that If(x) I ~ M for all x e E. 4.14 Theorem Suppose f is a continuous mapping of a compact metric space X into a metric space Y. Thenf(X) is compact. Proof Let {V,.} be an open cover of/(X). Since/is continuous, Theorem 4.8 shows that each of the sets /- 1(V,.) is open. Since Xis compact, there are finitely many indices, say oci, ... , °'n, such that (12) Xc.f- 1(V,.) u ... u/- 1(V,."). Since/(f- 1(£)) c. E for every E c. Y, (12) implies that (13) f(X) c. v ..l u .. • u v ..n. This completes the proof. Note: We have used the relation /(r- 1(£)) c. E, valid for E c. Y. If E c. X, then/- 1(/(E)) => E; equality need not hold in either case. We shall now deduce some consequences of Theorem 4.14. 4.15 Theorem // f is a continuous mapping of a compact metric space X into R", then f(X) is closed and bounded. Thus, f is bounded. This follows from Theorem 2.41. The result is particularly important when.f is real: 4.16 Theorem Suppose f is a continuous real function on a compact metric space X, and (14) M= sup/(p), m = inf /(p). peX peX Then there exist points p, q e X such thatf(p) = M andf(q) = m. The notation in (14) means that M is the least upper bound of the set of all numbersJ(p), where p ranges over X, and that m is the greatest lower bound of this set of numbers. 90 PRINCIPLES OF MATHEMATICAL ANALYSIS The conclusion may also be stated as follows: There exist points p and q in X such that f(q) Sf(x) Sf(p) for all x e X; that is,/ attains its maximum (at p) and its minimum (at q). Proof By Theorem 4.1 S, f ( X) is a closed and bounded set of real numbers; hence/(X) contains M== sup/(X) and m =inff(X), by Theorem 2.28. 4.17 Theorem Suppose f is a continuous 1-1 mapping of a compact metric space X onto a metric space Y. Then the inverse mapping 1- 1 defined on Y by (xe X) is a continuous mapping of Y onto X. Proof Applying Theorem 4.8 to 1- 1 in place of/, we see that it suffices to prove that/(V) is an open set in Y for every open set Vin X. Fix such a set V. The complement vc of Vis closed in X, hence compact (Theorem 2.35); hence/(V1 is a compact subset of Y (Theorem 4.14) and so is closed in Y (Theorem 2.34). Since/is one-to-one and onto, /{V) is the complement off(Vc). Hence/(V) is open. 4.18 Definition Let/be a mapping of a metric space X into a metric space Y. We say that/is uniformly continuous on X if for every e > 0 there exists b > 0 such that (15) dy(f(p),f(q)) < e for all p and q in X for which dx(p, q) < b. Let us consider the differences between the concepts of continuity and of uniform continuity. First, uniform continuity is a property of a function on a set, whereas continuity can be defined at a single point. To ask whether a given function is uniformly continuous at a certain point is meaningless. Second, if / is continuous on X, then it is possible to find, for each e > 0 and for each pointp of X, a number 8 > 0 having the property specified in Definition 4.5. This 8 depends on e and on p. If/is, however, uniformly continuous on X, then it is possible, for each e > 0, to find one number b > 0 which will do for all points pof X. Evidently, every uniformly continuous function is continuous. That the two concepts are equivalent on compact sets follows from the next theorem. CONTINUITY 91 4.19 Theorem Let f be a continuous mapping of a compact metric space X into a metric space Y. Then f is uniformly continuous on X. Proof Let e > 0 be given. Since f is continuous, we can associate to each point p e X a positive number cp(p) such that (16) q e X, dx(p, q) < cp(p) implies dy(f(p), f(q)) < 26° Let J(p) be the set of all q e X for which (17) dx(P, q) < ½cf>(p). Since p e J(p), the collection of all sets J(p) is an open cover of X; and since Xis compact, there is a finite set of points Pi, ... , p. in X, such that (18) We put (I 9) Then [; > 0. (This is one point where the finiteness of the covering, inherent in the definition of compactness, is essential. The minimum of a finite set of positive numbers is positive, whereas the inf of an infinite set of positive numbers may very well be 0.) Now let q and p be points of X, such that dx(P, q) < <>. By (18), there is an integer m, 1 ~ m ~ n, such that p e J (Pm); hence (20) dx(P, Pm) < ½¢(Pm), and we also have dx(q, Pm) ~ dx(P, q) + dx(P, Pm) < <> + ½¢(Pm) ~ ¢(pm). Finally, (16) shows that therefore dy(f(p),f(q)) ~ dy(f(p),f(pm)) + dy(f(q),/(Pm)) < 6. This completes the proof. An alternative proof is sketched in Exercise 10. We now proceed to show that compactness is essential in the hypotheses of Theorems 4.14, 4.15, 4.16, and 4.19. 4.20 Theorem Let E be a noncompact set in R1. Then (a) there exists a continuous function on E which is not bounded,· (b) there exists a continuous and bounded function on E which has no maximum. If, in addition, E is bounded, then