+ All Categories
Home > Documents > ɷshawns sagan integral and differential calculus

ɷshawns sagan integral and differential calculus

Date post: 08-Apr-2016
Category:
Upload: quiasma
View: 366 times
Download: 32 times
Share this document with a friend
Description:
ɷshawns sagan integral and differential calculus
Popular Tags:
341
Transcript
Page 1: ɷshawns sagan integral and differential calculus
Page 2: ɷshawns sagan integral and differential calculus

INTEGRAL

AND DIFFERENTIAL

CALCULUS:

an intuitive approach

Page 3: ɷshawns sagan integral and differential calculus
Page 4: ɷshawns sagan integral and differential calculus

INTEGRAL

AND DIFFERENTIALCALCULUS:

an intuitive approach

HANS SAGAN

Professor of Mathematics andHead of the Department of MathematicsUniversity of Idaho Moscow, Idaho

JOHN WILEY & SONS, INC. NEW YORK LONDON

Page 5: ɷshawns sagan integral and differential calculus

Copyright © 1962 by John Wiley & Sons, Inc.

All Rights ReservedThis book or any part thereof must notbe reproduced in any form without the

written permission of the publisher.

Library of Congress Catalog Card Number: 62-17469Printed in the United States of America

Page 6: ɷshawns sagan integral and differential calculus

PREFACE

This book is designed as a one-semester course for students of varyingbackgrounds. Those who wish to take this course with a reasonablechance to succeed should be familiar with elementary algebraic techniquesas provided by a good high school preparation or a one-semester coursecustomarily called College Algebra. Chapter I introduces as much analyticgeometry as is needed in the sequel. Certain parts, e.g., the section on thequadratic equation (Sect. 9) can be omitted if this is warranted by thestudents' background. Appendix III is devoted to an introduction totrigonometry. This Appendix III should be taken up between Chapter IIand Chapter III, if needed.

This book is intended for students who take the second semester of aone-year terminal course in mathematics as well as high school teachersof a specific category who attend refresher courses or summer institutes.This "specific category" embraces high school teachers who never had aformal course in calculus, or had a standard calculus course and eitherdid not derive much profit from it or quickly forgot most of it, and arenow called upon to teach the calculus in high school.

Lastly, this text is designed for a calculus course as offered by manyhigh schools as part of an accelerated sequence for students of exceptionalability.

Once upon a time there was a young lady who enrolled in one of theuniversities in the Northwest. Because of her poor high school record inalgebra, she was put into a refresher course in this subject. She wasstruggling along, just barely keeping from drowning, until suddenly,halfway through the course, her features lit up and her eyes sparkled withastonished enlightenment as she exclaimed: "Why didn't they tell mebefore that those letters stand for numbers?"

When writing this book, I was thinking of this young lady's plight andmade an effort to stay in as close contact with numbers as possible. I fre-quently resorted to experimental methods at the expense of mathematicalrigor to provide for a practical understanding of the limit processes that arebasic for a good comprehension of the calculus.

v

Page 7: ɷshawns sagan integral and differential calculus

Vi PREFACE

The calculus is developed here in a fashion as it could have happened-and to some extent did happen-historically. Physical and geometricapplications are interwoven with the text to provide sufficient motivationfor the introduction of new mathematical concepts.

This treatment is not cluttered up with technical details and tricks.The reader will not learn to differentiate

itanV7r/2- x2lx2 1000 1' a

g cot(z+ 1+x2)2+elog I log I +a

nor to integrate

5(ex236

+ sin 1/x)15 (e2x2 +sin21/_x

+ 2ex2

sinV

X +V 3

(2xex2 +1

cos 1/x f dx.21/x /

Instead, it is the aim of this book to provide for a thorough understandingof the limit of a sum process, the limit of the difference quotient, and thepossible practical applications of the calculus.

The sections on the chain rule (differentiation of a function of a function,Chap. III, Sect. 4) and the inverted chain rule (integration by substitution,Chap. III, Sect. 6), which deal with some more formal aspects of thecalculus, are entirely independent of the main text and may or may notbe included in the course. A bare minimum of formulas is developed,but great emphasis is placed on such items as the trapezoidal rule andSimpson's rule, which promote a very practical understanding of thelimit process on which the definition of the definite integral is based.

Chapter II deals with integration. The concept of an area is developedin a semirigorous manner. The main purpose of the introductory sections,1 and 2, is to make students aware of the fact that area is not somethingthat "is" but, rather, an artificial concept which has to be defined in amanner that will meet our intuitive demands. A thorough discussion ofthe definite integral precedes the introduction of the derivative in thisbook because it was felt that it is easier to convince students of the necessityfor measuring areas, rather than slopes. Another reason is a very practicalone: if the derivative is discussed first and then the indefinite integral isintroduced as the antiderivative and, subsequently, the definite integral isdefined in terms of the indefinite integral, students will hardly pay much

Page 8: ɷshawns sagan integral and differential calculus

PREFACE Vii

attention to what might be said later about the definite integral as a limitof a sum, being already in possession of a very simple routine for evaluatingit. This would seem very unfortunate indeed, as the concept of the limitof a sum is really the key to most important applications of the definiteintegral.

At the end of Chapter II, the integrand is characterized as the rate ofchange of the definite integral with a variable upper limit. Thus a con-tinuous transition from Chapter II to Chapter III is provided at theexpense of the fundamental theorem of the calculus which loses itscharacter as a theorem under such treatment.

Chapter III deals with the derivative and its geometric and physicalinterpretations. The basic theme concerning the discrepancy betweenphysical reality and its mathematical description, which was alreadyintroduced in Chapter I, is now carried to a crescendo in the sections onmotion and freely falling bodies. The ideas put forth here are those oflogical positivism, presented in a simplified and personalized form.

Chapter 1V finally deals with volumes as far as this is practical withouthaving to introduce any more essentially new ideas beyond those thathave been already developed in Chapters II and III.

There are certain sections and portions of sections that can be omittedwithout seriously jeopardizing the continuity of the development. Thesesections are clearly set apart from the main text by solid triangles whichare set at the beginning and the end of each such portion. This does notmean, of course, that these sections should be omitted. On the contrary,they ought to be studied if this is feasible under the given circumstances,because most of these specially designated sections serve to round out thetreatment or open up new avenues of thought that should stimulate thebetter students to deeper thinking and inspire them to further studies inmathematics.

Many problems are listed at the end of every section. The answers tomost of the even-numbered problems are supplied in the back of the book.Some of these problems complement the text and serve to help the readerfamiliarize himself with the new notions and techniques that are introduced.Some problems supplement the text in exploring certain aspects of thematerial in greater depth than the main text. Still other problems lead thereader away from the text in a pursuit of sidelines which are only looselyconnected with the material that is studied. There are many more supple-mentary problems supplied at the end of every chapter.

It is my belief that students have to be in possession of facts before theycan make any attempt to fit them into a beautifully constructed deductivesystem. It was my aim to present in this book the facts, or some factsanyway, but I tried to give the reader occasionally a fleeting glimpse of

Page 9: ɷshawns sagan integral and differential calculus

Viii PREFACE

the deductive system by leading him through some simple deductivearguments.

I hope that my book will promote interest in mathematics amongnoncommitted students as well as assist teachers in giving stimulatingpresentations of the calculus on the elementary level.

Moscow, Idaho HANS SAGAN

August 1962

Page 10: ɷshawns sagan integral and differential calculus

ACKNOWLEDGMENTS

I wish to express my gratitude to those who helped me in the preparationof this book. Specifically I thank my former colleague, Dr. Antony E.Labarre, Jr. (now at Fresno State College), for his attentive reading of mymanuscript, and Mr. V. M. Sakhare, one of my graduate students, forworking all the problems, supplying the answers, and calling my attentionto many rough spots in the manuscript. Finally, I wish to thank mypublishers for their help and cooperation in preparing this book.

H. S.

ix

Page 11: ɷshawns sagan integral and differential calculus
Page 12: ɷshawns sagan integral and differential calculus

CONTENTS

I FUNCTIONS

1. Representation of Functions by Tables, 12. Representation of Functions by Graphs, 43. Mathematical Representation of Functions, 134. Straight Line, 195. The General Line Equation, 236. Intersection of Two Lines, 277. Distance between Two Points, 318. The Parabola, 359. The Quadratic Equation, 39

10. Tangent to a Parabola, 4511. Interval Notation, 5112. More Examples of Functions-Continuity, 5313. Uniform Continuity, 60

Supplementary Problems I, 69

II AREAS

1. Area of Rectangles, 732. Area of Triangles, 783. Series and Sequences, 804. Manipulations with Limits, 905. The Area of a Circle, 976. Circumference of a Circle, 1047. Radian Measure, 1078. Area under a Curve, 1109. The Definite Integral, 118

10. The Trapezoidal Rule, 12411. Integration by a Limit of a Sum Process, 13012. Finite Sums, 135

1

73

xi

Page 13: ɷshawns sagan integral and differential calculus

CHAPTER I

FUNCTIONS

1. REPRESENTATION OF FUNCTIONS BY TABLES

The so-called exact sciences engage in a quantitative analysis of nature.There are basically two types of quantities that play a significant role inscientific systems: those that change their value, the so-called variables,and those that do not change their value, the so-called constants. Ourattention in this treatment will be devoted primarily to a study of variables.In order to reach some understanding of this concept, let us discuss a fewexamples.

It is an experimentally established fact that the boiling point of water,i.e., the temperature at which water starts boiling, depends on the atmos-pheric pressure under which the water is brought to a boil. Every travelerknows that it takes 6 minutes to prepare a soft boiled egg in Bozeman,Montana, while it takes only 2 minutes to accomplish the same result inRedding, California. The realization of this phenomenon made men en-vision the pressure cooker which is in our days a gruesome reality thatreduces the great variety of potato dishes to something which is hardly dis-tinguishable from mashed potatoes. We are not about to introduce theAmerican menu as our first example of a variable. So, let us return to thepoint at which we embarked on this culinary discussion: the boiling pointof water. It can be experimentally established, as we mentioned above,that the boiling point of water changes with the atmospheric pressure.Specifically, the entries in the left column of Table 1.1 indicate the differentvalues of the atmospheric pressure under which the experiment was carriedout and the entries in the right column give the corresponding temperaturesat which boiling occurs.

We recognize in this example two physical quantities as variables, i.e.,quantities that change their value: the atmospheric pressure and the boilingtemperature of water. We observe at the same time that these two variablesplay a clearly distinct role because if we choose freely any pressure weplease, the boiling point of the water is completely determined by the choicewe make. In other words: even though the boiling temperature of water is

1

Page 14: ɷshawns sagan integral and differential calculus

2 CHAP. I. FUNCTIONS

Table I.1Water starts boiling at

Under an atmospheric a temperature of degreespressure in mm mercury Celsius (centigrades)

9.209 10

17.53 20

31.824 30

55.32 40

92.51 50

149.38 60

233.7 70

355.1 80

525.8 90

760 100

a variable, its value is determined by the value of the variable that representsthe pressure. We express this situation mathematically by stating that theboiling point of water is a function of the atmospheric pressure. The vari-able to which we can assign values freely (to some extent) we call theindependent variable (here the atmospheric pressure). The other variable,the value of which is determined by the value of the independent variable,we call for obvious reasons the dependent variable (here the boiling tem-perature).

Table 1.2

For an elevation above The following atmospheric pressuresealevel in m in mm mercury at 0°C is found

0 760

2947 525.8

6087 355.1

9433 233.7

13012 149.38

Of course, the concept of dependent and independent variable is a ratherrelative one. Thus, the atmospheric pressure appears to be a dependentvariable if we venture to measure it at different elevations. Specifically, weobtain the results in Table 1.2. Here the elevation plays the role of anindependent variable while the atmospheric pressure emerges as the depend-ent variable: the atmospheric pressure is a function of the elevation.

Combining Tables 1.1 and 1.2, we see that we may consider the boiling'point of water as a function of the elevation, as given in Table 1.3, andeliminate the pressure entirely.

Page 15: ɷshawns sagan integral and differential calculus

1.1. REPRESENTATION BY TABLES 3

Table 1.3

Boiling point ofElevation in m water in 'C

0 100

2947 90

6087 80

9433 70

13012 60

Of course, we could go on now to consider the elevation as a function ofthe time, supposing we are sitting in a rocket that is shot straight upward.The application of such an analysis is quite obvious if we get the idea thatwe must have a boiled egg 6 minutes after launching time.

Table 1.4

Pressure in mm mercury Volume in cm3

50 152

100 76

200 38

300 25.3500 15.2600 12.7700 10.8

800 9.5900 8.4

1000 7.6

Other examples of dependent and independent variables are easily found.Let us consider a cylinder that contains some gas and is closed tightly by apiston (see Fig. I.1). Clearly, if we increase the pressure on the piston, thenthe volume of the enclosed gas will become smaller, and vice versa.

Using crude experimental methods, we will find a relationship for somegas as revealed in Table 1.4.

Fig. 1.1

Page 16: ɷshawns sagan integral and differential calculus

4 CHAP. I. FUNCTIONS

Here the volume of the enclosed gas appears as a function of the pressurewhich is applied to the piston. If we are interested in knowing how muchpressure must be applied in order to compress the gas to a given volume,we have only to interchange the two columns in Table 1.4 and consider thepressure as a function of the volume. Again, we can see that whether avariable is dependent or independent depends largely on the point of view.

2. REPRESENTATION OF FUNCTIONS BY GRAPHS

A common means of representing the relation between two variablequantities, if such a relation exists, is the graph. Let us return to Table I.1for the purpose of an introductory discussion. This table contains tenpairs of values, one of which represents a certain atmospheric pressure, theother one the corresponding boiling temperature of water: (9.209, 10),(17.53, 20), (31.824, 30), (55.32, 40), (92.51, 50), (149.38, 60), (233.7, 70),(355.1, 80), (525.8, 90) and (760, 100). Our aim is to give a geometricrepresentation of this relationship.

However, before we can endeavor to present pairs of numbers geometri-cally, we first have to settle a much simpler problem, namely: how do werepresent a single numerical value geometrically? The answer is simplygiven by the ruler with an engraved scale or the thermometer. We considera line (see Fig. 1.2) and choose one point on this line quite arbitrarily. Wecall this point 0 and let it represent the number 0. Next we choose onemore point which shall lie to the right of 0, but can otherwise be chosenquite arbitrarily, and call it 1. This point shall represent the number 1.The distance between the point 0 and the point 1 we call unit distance.Clearly, the number 2 will now be represented by a point one unit distanceto the right of 1, etc. It is really quite clear how we have to proceed tolocate the points which are supposed to represent all the positive integers.

Negative integers, as suggested by the scale of the thermometer, will berepresented by points to the left of 0. (Historically, the concept of the lineq f numbers preceded the scale of the thermometer; however, although fewstudents are acquainted with the line of numbers, it can be assumed thateverybody has seen a thermometer at least once.) Specifically, the repre-sentative of the number -1 will be a point one unit to the left of 0, therepresentative of -2 one unit to the left of -1, etc.

Thus we have attained a geometric interpretation of all positive andnegative integers. How do we now represent fractions? Clearly, the repre-sentative of z will be a point halfway between 0 and 1, the representative

I I I I I I I I I 1 1

-4 -3 -2 -1 0 i 1 23

3 4

Fig. 1.2

Page 17: ɷshawns sagan integral and differential calculus

1.2. REPRESENTATION BY GRAPHS 5

Fig. 1.3

of a will be a point between 2 and 3 such that its distance from 2 is one halfof its distance from 3, and, finally, the point representing 171 will be locatedbetween I and 2 so that its distance from 1 is ; of the unit distance. Thesefew examples clearly indicate how we have to proceed in finding the repre-sentatives of rational numbers* (fractions).

It is quite easy to construct those points which are supposed to representfractions by the following device. Let us again consider the number 1 .In the following argument we refer to Fig. 1.3. We draw a line through thepoint representing 1 at an acute angle with the line of numbers. Then weproceed to mark 7 points on this line at equal distances, starting with thepoint I (equidistant points). We call these points Pl, P., P3, , P;. Wejoin the last point P7 and the point representing the number 2 with a straightline and then draw a line parallel to the line through 2 and P7 through thepoint P4. This line will intersect the line of numbers in the point whichrepresents the number ;?, i.e., the point ; of a unit to the right of 1. Thiscan be seen quite easily by considering the two similar triangles (1, 2, P;)and (1, 11, P4)

The problem of locating points that represent irrational numberst(numbers which are not fractions) is not so simple. While it is quite easyto construct the representative of (see Fig. I.4), it is not so clear howand if one can construct the representative of U_7 or, to make matters

In

* a is a rational number if, and only if, it can be represented in the form a = -,n

where m and n are positive or negative integers. (n 0).f A number b is irrational if, and only if, it is not possible to write it in the form

mb where in, n are positive or negative integers. (n 0).=

Page 18: ɷshawns sagan integral and differential calculus

6 CHAP. I. FUNCTIONS

N

I 1

0 1 2

Fig. 1.4

worse, the representative of 7T. We wish to state here only that while allsuch numbers have a uniquely determined place on the line of numbers, itis generally impossible to locate them by a construction process using ruler,compass, and even more sophisticated instruments.

We will base the following discussion on the assumption that everynumber has a point representing it on the line of numbers, and for everypoint on the line of numbers there is a number which is represented by thispoint, without entering a discussion of the justification of this assumption.(This question is quite delicate and complicated. There have been professorswho embarked on such a discussion at the beginning of their calculuscourse and have still been talking about it at the end of the school year.)This correspondence between points and numbers is a one-to-one corre-spondence-as we express it mathematically-inasmuch as it is unique bothways.

Now that we have settled the problem of representing numbers geometri-cally, we can proceed to the more complicated problem of representingpairs of numbers geometrically. The first thought that comes to mind, ofcourse, is to draw two lines of numbers parallel to each other whereby thecorrespondence of pairs is expressed by lines joining the two points that

Pressure in mm Hg100 200 300 400 500 600 700 800

10 20 30 40 50 60 70 80 90 100

Temperature in °C

Fig. 1.5

Page 19: ɷshawns sagan integral and differential calculus

[.2. REPRESENTATION BY GRAPHS 7

correspond to each other according to our table. Figure 1.5 illustrates thismethod for the case of Table 1.1.

This representation, however, is quite awkward. A much better-not tosay, ideal-representation can be obtained by drawing the two lines ofnumbers so that they intersect at a right angle, preferably such that the twopoints representing 0 on the two lines coincide (see Fig. 1.6). Now all wehave to do is to agree that the first number in the pair (a, b) is to be repre-sented by a point on the horizontal line and that the second number in thepair (a, b) is to be represented by a point on the vertical line and the pair(a, b) itself is represented by this point in the plane which, if projected verti-cally onto the horizontal line, has the representative of a as an image; andif projected horizontally onto the vertical line, has the representative of b asan image. Clearly, by this method we obtain a one-to-one correspondencebetween all possible ordered pairs of numbers and all possible points in theplane. To every ordered pair of numbers there corresponds one, and onlyone, point in the plane; and to every point in the plane there correspondsone, and only one, ordered pair of numbers.

Applying this method to a graphic representation of Tables I.1, 1.2, 1.3,and 1.4, we obtain by appropriate choice of the units and the intersectionpoint of the two lines of numbers, the graphs which are given in Figs. 1.7,1.8, 1.9 and 1.10.

The system which consists of two lines of numbers that intersect eachother at a right angle is called a coordinate system or more accurately, aright coordinate system. (This is to distinguish it from coordinate systemswhere the lines do not intersect at a right angle and which are of significancein some areas of mathematics.)

II

-------- ?P(ab)I

I I I

-3 -2 -1

3b

2

1

I I i I I

1 2 a 3 4

-1

-2

III -3- IV

Fig. 1.6

Page 20: ɷshawns sagan integral and differential calculus

8 CHAP. I. FUNCTIONS

oc

11

11

I-

11

11

- mm HgI I I I I I I I

100 200 300 400 500 600 700 800

Fig. 1.7

The two lines themselves, we call the coordinate axes. Specifically, inFig. 1.7, we talk about the pressure axis and the temperature axis; in Fig.1.8, about the elevation axis and the pressure axis; in Fig. 1.9, about theelevation axis and the temperature axis; and finally, in Fig. I.10, about thepressure axis and the volume axis.

In a mathematical analysis in which no specific physical significance isattributed to the coordinate axes, we generally call the horizontal axis thex-axis and the vertical axis the y-axis. The two numbers in the pair (a, b)are called the coordinates of the point which represents this pair. Specifi-cally, we talk about pressure coordinate, temperature coordinate, volume

mm Hg

500

100

0

11

I I I C I

1000 5000 10,000

11

--m

Fig. 1.8

Page 21: ɷshawns sagan integral and differential calculus

1.2. REPRESENTATION BY GRAPHS 9

100

oc

11

50

10

0I I I

1000 5000

11

11

11

I

10,000

Fig. 1.9

coordinate, etc., and in general: x-coordinate and y-coordinate. The x-coordinate is also frequently called the abscissa and the y-coordinate theordinate.

This section of the plane, located between the positive x-axis and thepositive y-axis, we call the first quadrant of the coordinate system and weproceed to enumerate the remaining quadrants in the counterclockwisedirection as indicated by roman numerals in Fig. 1.6. Thus the sectionbetween negative x-axis and positive y-axis is the second quadrant, etc....

Thus far we have not gained any advantage from representing the func-tions as given in Tables 1.1 to 1.4 by points in coordinate systems as repre-sented in Figs. 1.7 to 1.10. On the contrary, finding for a given value of the

cm3

150

100

50

10

11

I $ ammHg1000

Fig. I.10

Page 22: ɷshawns sagan integral and differential calculus

10

e`.

CHAP. I. FUNCTIONS

independent variable the corresponding value of the dependent variableseems to be more complicated if we utilize the graphs rather than the tables.However, suppose we want to know the value of the dependent variablefor a value of the independent variable which is not listed in the tables.Then the tables do not provide for an immediate enlightenment. Neitherdo the graphs in Figs. 1.7 to 1.10 in their present form, for that matter. Butif we join the points in these graphs by a curve, as we have done in Figs.1.11 to I.14, then all we have to do in order to find the value of the dependentvariable for some value of the independent variable is to erect a verticalline at the point which represents the given value of the independent variable,intersect this line with the curve, and draw a line through this intersection

mm Hg

500

100

Fig. 1.12

Page 23: ɷshawns sagan integral and differential calculus

I.2. REPRESENTATION BY GRAPHS 11

eC

100

50

10

I I

1000 5000

Fig. 1.13

I

10,000m

point parallel to the horizontal axis. The intersection point of this linewith the vertical axis is then the representative of the value of the corre-sponding dependent variable. (See dotted lines in Figs. 1.11 to I.14.)

Now this procedure seems to be very simple and tempting, indeed-toocompelling, really, to be acceptable. Of course, there is a great logicaldifficulty involved in this process which we camouflaged with our persuasiveargument. What does it really mean to join the points in the graphs ofFigs. 1.11 to 1.14 by curves? Are we justified in doing this? The answer is:no and yes. From a logical standpoint, it is no. From a practical stand-point it is yes. So let us enlarge on this somewhat mystifying answer.

cm3

150

100

50

I i I

100 500t-

1000mm Hg

Fig. 1.14

Page 24: ɷshawns sagan integral and differential calculus

12 CHAP. 1. FUNCTIONS

Y

>- x

Fig. 1.15

Clearly, an atmospheric pressure, which appears in Table I.1 as an inde-pendent variable, may assume any value (within reasonable limits, if wewant to remain in contact with reality). It is also clear that water will boilunder any given pressure at some temperature. So, for any given value ofthe independent variable, the dependent variable has to have some value.Now the only logically satisfactory way of finding this value is, of course,to carry out an experiment. But, no matter how many experiments we maycarry out, all we will ever obtain is a table with finitely many entries or agraph such as the one in Fig. 1.9 with finitely many points.

We now have to make the assumption that, if we should carry out furtherexperiments, the results would be such that they fit the pairs of valueswhich are obtained simply by joining the points of the graph by a curve.There is, of course, some ambiguity in this statement, because we did notspecify the character of the curve. For example, we demonstrated in Fig.1.15 that the same points can be joined by a curve in a number of differentways and the results we obtain from determining the values of the functionbetween measured points are consequently quite different. Since only onevalue can be right, the others have to be wrong. But how do we know thata certain curve will yield the right results while others do not? Well, wedon't. (See also remarks on this subject in Chapter III, Sections 10and 14.)

As the reader can already see, the whole problem boils down to whetherwe wish to be practical or not. If we do obtain-by chance-a curve whichproves to be reliable in predicting results of experiments, then we have wona great deal. If the curve does not permit any reliable predictions-whichwe have to assume after it has "misfired" a number of times- then we havenot lost anything (except for possible material losses), because we stillhave the table into which the results from all our measurements areentered.

Page 25: ɷshawns sagan integral and differential calculus

1.3. MATHEMATICAL REPRESENTATION 13

Problems 1.1-1.8

I.I. Represent the following pairs of numbers by points in a right coordinatesystem :

P(3, 5), P(-1, 4), P (-7, 2) , P(2 4) , P(12, 8), P(-3, -2), P(4, -18).

1.2. Construct the points on the line of numbers which represent the followingrational numbers:

1 4 19 17 3 5 9 16

5 ' 9 ' 5' 7'11'12'23'311.3. Construct the points on the line of numbers which represent the following

irrational numbers:V5, V13, V45, V T4-5.

1.4. Represent the following table by a graph and join the points by a curve.

x y

-5 2

-4 1

-3 1

-2 0

-1 -10 -11 -22 -33 -34 -45 -6

1.5. Interchange x and y in the table in Problem 1.4 and represent the functionwhich is thus obtained by a graph, again joining the points by a curve.

1.6. Let (a, b) be some point on the x-axis. What can you say about a andabout b ?

1.7. Let (a, b) be some point on the y-axis. What can you say about a andabout b?

1.8. Let (a, b) be some point on the line which bisects the right angle betweenx-axis and y-axis. What is the relation between a and b?

3. MATHEMATICAL REPRESENTATION OF FUNCTIONS

Once we have transferred the data from a table to a coordinate system,joined the points in the coordinate system by a curve, and made peace with

our conscience over the latter step, we pose the question: Isn't there amore compact, easily accessible form which would enable us to store ourdata, the ones which we obtained honestly by toiling with apparatus and

Page 26: ɷshawns sagan integral and differential calculus

14 CHAP. 1. FUNCTIONS

measuring instruments, as well the points we boldly obtained by an arbi-trary stroke of our pencil in joining the data points by a curve? There is

indeed-or we wouldn't have posed this question in the first place.For the following argument, let us consider an independent variable x

and a dependent variable y. The relation between these two variables,namely, the fact that y is a function of x, we indicate by writing

y = f(x)(Read: "y is a function of x" or "y is of of x".)

Now, suppose that the functional relationship between x and y is suchthat the value of y, which corresponds to any given x, is obtained bydoubling the value of x, or, as we may write,

y = 2x.

This is a (very) special example of a functional relationship between twovariables, expressed mathematically. So let us express this same functionalrelationship by a graph.

Table 1.5x y

-3 -6-2 -4-1 -20 0

1 2

2 4

3 6

First of all, even if this appears as a step backward, let us set up anarrangement of corresponding values as presented in Table I.S. Next wetransfer these pairs of values into a right coordinate system (see Fig. I.16)and obtain seven points. Let us now join these points by a curve. If theconstruction of the points in the coordinate system was reasonably accurate,and if we do not engage in any escapades upon joining these points by acurve, the result will look very much like a straight line as indicated by thesolid line in Fig. 1.16. Of course, we could also produce something thatmay look like the broken line in Fig. I.16, but we will have to reject thisnightmare on the basis of the subsequent argument:

Let (x1, yl) be a point on the straight line joining the points (1, 2) and(2, 4)-see Fig. 1.16. Then we have, because of a well known theorem onsimilar triangles,

yl-2_4-2xl-1 2-1

Page 27: ɷshawns sagan integral and differential calculus

1.3. MATHEMATICAL REPRESENTATION 15

and from this

Hence,yl = 2x1,

i.e., the point with the coordinates (x1, y) which by hypothesis lies on thestraight line joining the points (1, 2) and (2, 4) is such that its coordinatessatisfy the functional relation y = 2x.

The same argument applies, of course, to any point lying on a straightline that joins any two points in Fig. I.16.

However, we have not yet settled the question as to whether these are allthe points that are represented by y = 2x. In other words: are therepoints whose coordinates satisfy y = 2x but which do not lie on the straightline in Fig. 1.16? Suppose that there is such a point with coordinates (x,g). Then this point lies either above the line or below the line (see Fig.1.17). Since all points on the line are such that their distance from the x-axisis twice their distance from the y-axis (the ratio of vertical side over hori-zontal side in any triangle that is formed by the line, the x-axis and a verticalline through any point is 2) it follows that either 9 is greater than 2x or y

Page 28: ɷshawns sagan integral and differential calculus

16 CHAP. I. FUNCTIONS

is less than 2x as opposed to our assumption that (x, y) satisfies the func-tional relation 2x. Thus we see that the straight line as drawn in Fig.1.16 is the geometric representation of the mathematically formulatedfunctional relation y = 2x.

As an other example, let us.consider the functional relation

y=x2.

Again we can easily obtain the corresponding values given in Table 1.6. Ifwe represent these pairs of values in a coordinate system, we obtain thepoints indicated in Fig. I.18. We are greatly tempted to join these points

Table 1.6

x y

-2 4_3 9

2 4

-1 1

0 01 i2 41 1

s2 42 4

3 9

by a curve, as indicated by a solid line in Fig. 1.18, even though it is not aseasy to justify this action as it was in the preceding example. As a matterof fact, it will be some time until we will see a justification of this process.

In order to return to our original problem of representing a finite numberof discrete measure data by a curve from which we can (rightly or wrongly)predict results of experiments which have not yet (and probably never willbe) carried out on the basis of the results of experiments that have beencarried out, let us restate our position: We have finitely many points in acoordinate system. We wish to join these points by a curve which in turnwe wish to express in a mathematically compact formula. It seems indi-cated-in order to achieve some progress toward this goal-that we havefirst to study mathematical relationships (mathematical functions) andtheir geometric representations and then, once we are familiar with the typesof curves that are obtained from mathematical formulas, try to fit the appro-priate mathematical relationship with its known geometric representationto a given situation.

This seems to be quite a straightforward program. We wish to mentionthat the more we will penetrate into this matter, the more it will reveal its

Page 29: ɷshawns sagan integral and differential calculus

1.3. MATHEMATICAL REPRESENTATION

Y

xx

Fig. 1.17 Fig. 1.18

17

complexity, and at the end we will be happy to have solved only a very fewof the problems that are involved in this process.

Before we close this section, we wish to present two more examples ofmathematical functions to destroy the incorrect mental image of a mathe-matical function as something that has to be expressed by a formula thatcan be written as an equality between two expressions in x and y, beforethis image has a chance to implant itself in the students' unspoiled minds.

The following is a mathematical function:

1 for all rational values of x(0 for all irrational values of X.

Given any value x, this relation enables us to find the corresponding valuey. For x = 27, the corresponding y-value is 1. For x = lf, the corre-sponding y-value is also 1. For x = tip, we have y = 0, because is

irrational. For x = I7 we have y = 0 for the same reason, etc. All

we have to know is whether a given x is rational or irrational.* Then thecorresponding y-value can easily be found. Since every number is either

* Unfortunately, we do not know this of all numbers.

Page 30: ɷshawns sagan integral and differential calculus

18 CHAP. 1. FUNCTIONS

Y

I I

-3 -2 -11l1 2

-x

Fig. 1.19

rational or irrational, this function is defined for all values of x, i.e., it hasa y-value for every value of x.

Here is another example of a function:

0 for all x which are less than -31 for x = -3

y= 1 for all x that are greater than -3 but less than 0Oforx=02x for all x which are positive.

In this case, we can even supply a geometric representation of this function(graph) as indicated in Fig. 1.19. (The student may investigate the possi-bility of representing by a graph the function which we mentioned beforethis one.. If it is not possible, then why?)

Problems 1.9-1.18

1.9. Represent y = 2x + 1 by a graph.I.10. Represent y = x2 - I by a graph.1.11. Given a straight line through the points Pl(l, 4) and P2(2, 8). Try to find

a mathematical representation of this straight line.1.12. Sketch the function:

-1 for all negative values of x_ Oforx=0

Y

- x for all values of x between 0 and 55 for x = 5 and all values of x greater than 5.

Page 31: ɷshawns sagan integral and differential calculus

1.4. STRAIGHT LINE 19

1.13. Sketch the following functions:

(a) y =1

x for all x r 0.

(b) Y =1

1 -xforallx r 1.

(c) y = x2 + 2x + 1 for all x.

1.14. Sketch the function y = Vx. (Hint: Tabulate the values for x = 0,1, 4, 9, 16, 25, and join the data points by a curve.)

I.15. Let f(x) = x2 + 4. What is f(1), f(2), f(-3), f(a) ?

1.16. Given f(x) =x

+ 1 , x r -L. What is f(0), f(1), f(t), f(t2), f I I,

f(it)? \ 111

1.17. G iven f (x) = x2. Find f (x + 1), f (x + h), f (x - h).

fix. Find 1(x + h) -fl x)(x)Simplify1.18. Given.f(x) = .f(x + h) and the

latter.h

4. STRAIGHT LINE

We have seen in the preceding section that the functiony = 2x

represents a straight line in a right coordinate system. In this section wewill establish the mathematical representation of a straight line in general.

First, however, let us consider an example. We know from plane geom-etry that a straight line is uniquely determined by two points. Let us con-sider the two points P1(l, 1) and P2(3, 5) (see Fig. 1.20) and let P(x, y) be athird point on the line L which is determined by P1 and P2. It is our aimto find a mathematical condition for the coordinates x and y of P whichwill guarantee that P lies on L. This condition is then the mathematicalrepresentation of L, since it will yield for any x a corresponding y-valuesuch that (x, y) are the coordinates of a point P on L.

We consider in Fig. 1.20 the two similar triangles Q(P1Q2P2) and

A(P1, Q, P). Since the ratio P2Q2 is the same as PQ we obtain in view ofQ2P1 QPl

P2Q2 = 4Q2P1 = 2PQ=y- 1QP1=x- 1

the relation

y-1-4=2x-1 2

Page 32: ɷshawns sagan integral and differential calculus

20

Y

Fig. 1.20

CHAP. I. FUNCTIONS

So we see that whenever P(x, y) lies on the line L, its coordinates (x, y) alsosatisfy the relation (1.1). We see at the same time that, whenever (x, y)satisfy the relation (1.1), they are also coordinates of a point P on theline L.

From (1.1) follows immediately

y-1=2(x=1)or(I.1a) y=2x- 1.

We can easily interpret the coefficients on the right side of (I.1 a). Clearly,-1 is the y-value that is obtained if we let x = 0, i.e., the line L intersectsthe y-axis at y = - 1. We call this point the y-intercept. The coefficient2 of x is also of great importance. We remember that 2 was obtained as

the ratio P2Q2 , i.e., "rise over run" which is the same for all right trianglesQ2P1

that have two sides parallel to the two coordinate axes and the third sidelying on L (see Fig. 1.20). This ratio is called the slope of the line L becauseit is a measure of the rise relative to the run, or, in other words, it tells byhow much y increases as x increases by one unit. (If the point P2 in Fig.1.20 would be higher up, e.g., have the y-coordinate 6, then the slope would

Page 33: ɷshawns sagan integral and differential calculus

1.4. STRAIGHT LINE 21

amount to 2 which is greater than 2, i.e., the rise is higher for the same runand, consequently, the slope is greater. However, if the point P2 in Fig.1.20 would be farther down, say, have the y-coordinate 4, then the slopewould be 3 which is smaller than 2, i.e., the rise is smaller for the same run.)

We are now ready to discuss the general case. Let us consider twopoints P1(xl, y1) and P2(x2, y2) where we assume that x1 x2. This assump-tion simply means that P1 and P2 do not lie on a vertical line, a case whichwe have to exclude from our considerations for the time being.

Let P(x, y) be a third point on the line L which is determined by P1 andP2 (see Fig. 1.21). Again we consider the two similar triangles A(P1Q2P2)and Q(P1QP), and we see that if P is a point on L, then

P2Q2 _ PQ

Q2P1 QP1Since

PQ=y-ylQP1=x-x1

y-yl-y2-y1x-x.1 x2-x1

Y2 - Yi

x2 - x1

P2Q2-y2-y1Q2P1 = x2 - X1

- ----------------P2

[Y-Y1-Iy2 -Y1

Pl -----x2- -----------4Q

xxl X2

Fig. 1.21

Page 34: ɷshawns sagan integral and differential calculus

22 CHAP. I. FUNCTIONS

This relation is the mathematical representation of a straight line, which isnot vertical, through the points P1(x1, y1) and P2(x2, Y2)- It is customarilyreferred to as the point point formula because it is obtained by making useof the coordinates of two points.

Let us take, for example, the points P1(l, 7) and P2(3, 5); Then theequation of the line through these two points is given according to (1.3) by

y-7 3-1(x-1)

from which we obtain, after a few manipulations,

y=-x+8.We see that the y-intercept is 8, i.e., the line intersects the y-axis at y = 8;and the slope is -1, i.e., the rise is -1 per run 1. That means: if x increasesby one unit, then y increases by minus one unit-which is just a complicatedway of saying that y decreases by one unit.

Now suppose we know one point of the line, P1(x1, yi), and its slope m.Of course, we could immediately find another point P2, since we know thaty increases by m units as x increases by one unit. Hence, the coordinatesof such another point on the line would be x2 = x1 + I and y2 = yi + m.Once we have these coordinates, we can use (1.3) to set up the line equation.However, this is really quite unnecessary because we know anyway that

m_lI2-hix2 - xl

and can thus write the line equation in terms of the coordinates of onepoint and the slope m.

(1.4) y-yi=m(x-xi),which we call the point-slope formula.

Let us now consider the point Pl(3, 4) and find the equation of a linethrough this point with the slope 3. With m = 3 we obtain from (1.4)

y-4= a(x-3)and then, after simplifications,

y=sx-1.Problems 1.19-1.32

1.19. Find the equation of the line through the two given points.

(a) (1, 4), (3, -.1) (b) (5, 8), (2, 1)

(c) (5, 9), (-1, 5) (d) (1, 0), (0, 8)

Page 35: ɷshawns sagan integral and differential calculus

1.5. GENERAL LINE EQUATION 23

1.20. Graph the lines in Problem 1.19 all in the same coordinate system indifferent colors.

1.21. Find the equation of the line through the given point with the givenslope.

(a) (4, 1), m = -7 (b) (-1, -5), m = 12

(c) 5, m = -2 (d) (I5, -3lm = -3

1.22. Graph the lines in Problem I.21 all inthe same coordinate system indifferent colors.

1.23. Sketch the lines given by:

(a) y = ix - 3 (b) y = -llx + l(c) y=12x (d) y = - 6x + 3

1.24. Prove: a horizontal line has the slope 0.1.25. Prove: the equation y = c (c is a constant) represents a horizontal line.1.26. Find the equation of all nonvertical lines through the point (1, 4).1.27. Pick out the one line from the lines in Problem 1.26 which passes through

the point (-1, 12).1.28. Find the equation of all lines with the slope 2.1.29. Pick out the one line from the lines in Problem 1.28 which passes through

the point (3, 4).1.30. Pick out the one line from the lines in Problem 1.26 which has the

slope -,2-,.1.31. Do the three given points lie on one line?

(a) (1, 2), (2, 3), (3, 5) (b) (3, 4), (4, 6), (5, 8)

(c) (6, 4), (8, 3), (10, 2) (d) (1, -1), (2, -4), (3, -7)

1.32. Show that a line with slope in1 is perpendicular to a line with slope M. if,and only if, m1m2 = -1.

5. THE GENERAL LINE EQUATION

If we take the point-point formula (1.3) from the preceding section andshift all terms except y to the right side, we obtain

y_y2-yx-Y2-Y1 +ylwhere

x2 - x1 x2 - x1

Y2-yl=m and -Y2-y1x1 +yl=cx2 - x1 x2 - x1

are constant numbers that are determined by the coordinates of the pointsP1 and P2.

Page 36: ɷshawns sagan integral and differential calculus

24

Thus we see that a function of the type

(1.5) y=mx+2,

CHAP. I. FUNCTIONS

where m, c are constants, represents a straight line. However, we cannotsay that every straight line is representable in the form (1.5) because weremember that we assumed in our derivation of the point-point formulaand the subsequent derivation of the point-slope-formula that the twopoints P1 and P2 do not lie on a vertical line (i.e., a line parallel to the y-axis):x1 0 x2. We can see now quite easily why we were compelled to make suchan assumption. If a line is vertical, then x2 - x1 = 0 and the denominatorin

m = y2 - yl

x2 - xl

becomes zero. Since it is the content of the first mathematical command-ment that division by zero is prohibited, we had better not do this. There is,however, a way to get around this difficulty and we will now show how anall-embracing line equation can be obtained-i.e., an equation which issuch that every line can be represented in its form.

Again we start with ([.3), multiply through by the denominator x.2 - X1,assuming, of course, that it is not 0, and obtain

(x2 - x1)(y - yl) = (y2 - yl)(x - x1).

If we shift all terms that contain neither x nor y to the right side and changethe sign, then we obtain

(1.6) (Y2 - y1)x - (x2 - x1)y = (Y2 - y1)x1 - (x2 - x1)y1

If we simply consider this equation, forgetting conveniently how weobtained it, then we see now that it does not matter whether x2 - x1 is 0or not. That (1.6) is indeed the representation of a straight vertical line forx2 = x1 can be seen directly:

Let x2 = x1 in (1.6). Then (1.6) reduces to

(y2 - y1)x = (y2 - y1)x1

and after cancellation by y2 - y1 we obtain

x=x1

which means: take all points which have xl as abscissa (x-coordinate).These points form, indeed, a vertical line at distance xl from the y-axis.

Since

Y2 - Y1 = A, - (x2 - x1) = B, (y2 - y1)x1 - (x2 - x1)g1 = C

Page 37: ɷshawns sagan integral and differential calculus

1.5. GENERAL LINE EQUATION 25

in (1.6) are constants which are determined by the coordinates of the twopoints which determine the line, we can see that

(1.7) Ax+By=Cwhere A, B, C are constants, is the general representation of a straight line.

If B 0, we can write this equation in the form (1.5) with m = -

Bc

and

= CCand if B = 0, then we obtain a vertical line x = at the distance

A from the y-axis.

Let us now demonstrate how we can determine the constants A, B, C in(1.7) such that (1.7) represents a straight line through two given points Pland P2. For this purpose we will discuss three examples which are charac-teristic of what can possibly happen in this process.

(1) Let P,(l, 3), P2(2, 4). We have to find A, B, C such that x = 1, y = 3as well as x = 2, y = 4 satisfy equation (1.7), i.e.,

A + 3B = C,2A + 4B = C.

These are two linear equations in three unknowns, A, B, and C. So itwould appear that we have either too many unknowns or too few equations,depending on how we look at it. However, we can see quite easily that(1.7) will not change if we multiply both sides of this equation by a nonzeroconstant, i.e., if A, B, C are a set of solutions of our two equations, so arenA, nB, nC where n is any number that is different from zero. In otherwords, the solutions A, B, C of our two equations are determined but for acommon multiple. Now, if we subtract the second equation from the firstequation, we obtain

-A - B=Oor A = -B. If we now substitute -B for A in the first equation, we obtain

-B+3B=Cor C = 2B. We see that we could determine A and C in terms of B. Thusthe equation of the line through the two given points becomes

-Bx + By = 2B.

We multiply both sides of this equation by B and obtain

-x+y=2 or y=x+2.

Page 38: ɷshawns sagan integral and differential calculus

26 CHAP. I. FUNCTIONS

(2) Let PI(4, 3), P2(4, 7). Then we obtain the equations

4A+3B=C,4A+7B= C.

Again, if we subtract the second equation from the first equation, we obtain

-4B = 0

i.e., B = 0. Now, if we let B = 0 in the first equation, we obtain

4A=C

i.e., A = 4 . Thus our line equation appears in the form

CX = C.

4

If we multiply both sides by C , we obtain

x=4

which is a vertical line at a distance 4 from the y-axis.

(3) Let us now consider the case where Pl coincides with P2 and seewhether our algebraic apparatus breaks down. Let P,(l, 4), P2(l, 4). Thenwe obtain

A+4B=CA+4B=C

and we see that we really have only one equation from which we obtainthe solution C = A + 4B. Thus the equation of our line becomes

Ax + By = A + 4B.

Division by B and renaming of the ratioA

= m yields

mx+y=m+4where m is any number. Thus we see that we have obtained infinitely manylines, namely, all lines through the point P(1, 4) except for the vertical line.In order to obtain thevertical line throughP(1,4), we cannot divide throughby B in the above equation, but rather divide through by A. Since B = 0,we see that we obtain the vertical line x = I.

Page 39: ɷshawns sagan integral and differential calculus

1.6. INTERSECTION OF LINES 27

Problems 1.33-I.38

1.33. Two nonvertical lines are parallel if, and only if, they have the same slope.Prove that the two lines given by

Ax+By=CDx + Ey = F

are parallel if, and only if, A = nD, B = nE for some number n.1.34. What can you say about two lines which are given by the two equations

in problem 1.33 with A = nD, B = nE, C = nF?1.35. Find the equation of the line through the two points

(a) (4, 1), (7, 1) (b) (6, -3), (6, 15)(c) (3, 3), (6, -9) (d) (12, 1),(-4, -7)

1.36. A triangle is given by the three points A(4, 1), B(6, -3), C(1, 8). Findthe equations of the sides.

1.37. A parallelogram is determined by the three vertices (1,4), (5, 1) and(6, -4) with the understanding that the fourth vertex is opposite the point (1, 4).Find the equations of the four sides.

1.38. Graph the lines

(a) 3x + 4y = 12 (b) -x + 6y = 3(c)5x+5y=1 (d)x+y=1.

6. INTERSECTION OF TWO LINES*

If two lines in the plane are given by the two equations

Alx + Bly = ClA2x+B2y=C2

we can see by geometric inspection that there are three possibilities:

(1) The two lines intersect,(2) The two lines are parallel and distinct,(3) The two lines coincide, i.e., both equations represent the same line.

To see how these possibilities find their algebraic expression, let us considerthree examples.

(1) Given the two linesx + 3y = 22x - y = 1.

* All sections starting with and ending with A can be omitted without jeopardizingthe continuity of the course.

Page 40: ɷshawns sagan integral and differential calculus

28 CHAP. I. FUNCTIONS

If these two lines have an intersection point, then there exists a pair ofvalues (x, y), the coordinates ,of the intersection point, such that thesevalues satisfy both line equations. In order to find these values, providedthey exist, we proceed as follows. We multiply the first equation by -2and add it to the second equation

-2x+2x-6y-y=-4+1

7y=3 or y=3.

Now, if we substitute this value into one of the two equations, say, thesecond one, we obtain

2x-3=1 orx=10=5

7 14 7

We see that the values x and y = indeed satisfy both equations, andwe conclude that P(;, ;) lies on both lines, i.e., it is the intersection pointof the two lines.

(2) Considerx+2y=1

2x+4y=5.If we proceed as before, i.e., multiply the first equation by -2 and add itto the second equation, we obtain

-2x + 2x - 4y + 4y = -2+5or

0=3

which is clearly nonsense. This can only mean one thing: there is no pointwith coordinates (x, y) such that these values x,y satisfy both equations, i.e.,the two lines do not have an intersection point; the two lines are parallel.

(3) Takex+ y=1

3x+3y=3and proceed as before: multiply the first equation by -3 and add it to thesecond equation. Thus we obtain

-3x + 3x - 3y + 3y = -3 + 3

0=0

Page 41: ɷshawns sagan integral and differential calculus

1.6. INTERSECTION OF LINES 29

which is clearly satisfied for all values of y. Of course, we could expectsuch a thing because, if we look at the two equations, we can see that theyboth represent the same line. (The one equation is obtained from the otherone by multiplication by 3, and if we multiply both sides of an equation bythe same nonzero constant, we do not change the equation.)

Now that we have discussed examples of all the possibilities, let us pro-ceed to discuss the situation in general terms.

We have the two line equations

(I.8}Aix + Bly = C1

A2x + B2y = C2.

Again, if we wish to solve the two equations by elimination of one unknown,we have to multiply, e.g., the first equation by -A2 and the second equa-tion by A1, after which we obtain

-A1A2x - A2B1y = -A2C1A1A2x + A1B2y = A1C2.

Now, if we add the two equations, we obtain

(1.9) (A1B2 - A2B1)y = A1C2 - A2C1

and from this

(I.10) Y = A1C2 -A2C1

A1B2 - A2B1

In order to find x, we could substitute this value for y into one of the twoequations (1.8) and then solve for x. However, we prefer to do it independ-ently of the result which we have already obtained and proceed similarlyas before: we multiply the first equation by B2 and the second equation by-B1:

A1B2x + B1B2y = B2C1

-A2Blx - B1B2y = -B1C2.

Adding these two equations yields

(I.11) (A1B2 - A2B1)x = B2C1 - B1C2

and from this we obtain

(I.12) X = B2C1 - B1C2

A1B2 - A2B1

We see that in (I.10), as well as in (I.12), we have the denominator

D = A1B2 - A2B1.

Page 42: ɷshawns sagan integral and differential calculus

30 CHAP. I. FUNCTIONS

The results in (I.10) and (I.12) are senseful only if

(I.13) D = A1B2 - A2B1 :A 0.

Thus we can state: The system of two linear equations in two unknowns(1.8) has a solution if (1.13) is satisfied. In this case, the solution is unique.

Now let us discuss the case where

(1.14) D=A1B2-A2B1=0.It follows from (1.14) that

Al=B1A2 B2

If we denote this ratio by n, we have

Al=n -=nA2 B2

and, consequently,Al = nA2, B1 = nB2,

i.e., the coefficients A1,B1 of the first equation are a multiple of the coeffi-cients A2,B2 of the second equation. In this case, (1.9) and (I.11) read

0 = A1C2 - A2C1,

0 = B2C1 - B1C2.These equations are senseless, unless the right sides vanish by themselves.

Now, if the right sides vanish by themselves, we have

A1C2 = A2C1,

B2C1 = B1C2and it follows that

Al=Cland

BI=CIA2 C2 B2 C2

SinceAl

= n andB1

= n, it follows that also1

= n, or C1 = nC2.A2 B2 C2

Thus we can state: The system (1.8) has no solution (the two lines areparallel) if A1,B1 are a multiple of A2,B2 (A1 = nA2, B1 = nB2), butC1 nC2. However, if Al = nA2, B1 = nB2 and C1 = nC2, then the twoequations have infinitely many solutions (the two lines are identical).

Problems 1.39-1.47

1.39. Find the intersection point of the lines given below. If there is none,state whether the lines are parallel or coincide.

(a) x + 4y = 1, 2x - 5y = I, sketch.(b) x + 9y = 3, -2x - 18y = 5.

Page 43: ɷshawns sagan integral and differential calculus

1.7. DISTANCE BETWEEN POINTS

(c) 2x + 411 = 8, x + 2y = 4, sketch.(d) 3x - 2y = 1, y = 5.

(e) 4x + 7y = 8, x = 10, sketch.(f)x=5, x=18.(o) x = 12, y = - ],sketch.

31

1.40. What quantities of silver 76 per cent pure and 82 per cent pure must bemixed together to obtain 20 pounds of silver (a) 80 per cent pure? (b) 70 per centpure? (c) 82 per cent pure?

1.41. Two cars left towns 400 miles apart and traveled toward each other. Theaverage speed of the larger car is 25 mph slower than the average speed of thesmaller. If they meet after 5 hours, what was the average speed of the smallercar? Al A2

1.42. We denote the expression A1B2 - A2B1 byB1

B2I and call it a determi-

nant. Evaluate the following determinants:

(a) 1 0

2 3

(b) 3 5

7 9

(c) 2 4

-1 -2

1.43. Evaluate the determinants

3 5

2 1

3 +5x 52+x I

(d) a b

no nb

3 5 + 3x

2 1 + 2x

(e) 1 0

27 0

where x is any number. The result will puzzle you. Try to explain.1.44. Show that

Al + nA2 A2

B1 + nB2 B2

Al A2

B1 B.

by expanding both determinants according to their definition in Problem 1.42.1.45. Evaluate the determinants

100000 999991 180000 799991

I -73581 -73579 1 52 261

by making use of the result in Problem 1.44.1.46. Find the equation of a line which passes through the intersection point

of the two lines x + y = 1 and 2x - 3y = -1 and through the point P(1, 7).1.47. Find the equation of all lines which pass through the intersection point

of the two lines x - 2y = 1, 2x - 3y = 4. (Hint: If you multiply the one lineequation by some constant number k and add it to the other line equation, youobtain a linear equation in x and y. Clearly, this is a line equation. Of whatline?)

7. DISTANCE BETWEEN TWO POINTS

We consider two points Pl and P2 with the coordinates P1(x1, y1) andP2(x2, Y2). In order to find the distance dd1P2 between these two points, we

Page 44: ɷshawns sagan integral and differential calculus

32

y2

CHAP. I. FUNCTIONS

P2

yA

y1

x1

Fig. 1.22

X2x

consider the triangle Q(P1QP2) in Fig. 1.22. We obtain directly from thetheorem of Pythagoras that

(x2 - x1)2 + (y2 - y1)2 = d2

and hence

(I.15)* d = 1/(x2 - x1)2 + (Y2 - yl)2

where the square root is to be understood as positive. (It is quite senselessto talk about negative distances.)

Let us discuss three applications of the distance formula (1.15). First, wewant to find all points (x, y) which are equidistant (have the same distance)from two given points, namely, Pl(l, 1) and P2(3, 4). According to (I.15),the distance from the unknown point P(x, y) to P1(1, 1) is

dppl = 1/(x - 1)2 + (y - 1)2

and from P(x, y) to P2(3, 4)

dpp2 = 1/(x - 3)2 + (y - 4)2.

These two distances have to be equal: dppl = dpp2; hence, it follows thaton squaring both expressions we have

(x- 1)2+(y- 1)2 = (x - 3)2 + (y - 4)2

which simplifies after a few manipulations to

4x+6y=23.* This formula may be considered as the definition of the distance of two points in

a plane and it can be subsequently shown that it fits the intuitive notion of a distancewhich we developed in the past.

Page 45: ɷshawns sagan integral and differential calculus

1.7. DISTANCE BETWEEN POINTS 33

Since this is the equation of a straight line, we can say that all pointswhich are equidistant from the two given points lie on a straight line, orwe could formulate it as follows : the locus of all points that are equidistantfrom the two given points is a straight line. (It is easy to see that this is notonly true for this special case but for the locus of all points equidistantfrom any two given points.)

As a next illustration of the distance formula, let us find the locus of allpoints that are equidistant from one given point, namely the point with thecoordinates (p, q) assuming the fixed distance to be r. We know from geom-etry that this locus is a circle with center at (p, q) and radius r.

It follows from (I.15) that

1I(x - p)2 + (y - q)2 = ror, after squaring both sides,

(I.16) (x p)2 + (y - q)2 = r2,

which is the customary form in which we ordinarily state the equation of acircle with center at (p, q) and radius r. In the case the origin is the centerof the circle, we put p = 0 and q = 0 to get

x2+y2=r2.Finally, let us find the locus of all points which have the same distance

from a given point Pl and a given line L.Let the point Pi have the coordinates Pl(0, p) and let y = -p be the

equation of the line L (see Fig. 1.23).The concept of distance of a point from a line is quite ambiguous. In

order to make our problem meaningful, we have to state what we mean by

y

dPPlP(xy),-

P1

iy

IP

L

Fig. 1.23

Page 46: ɷshawns sagan integral and differential calculus

34 CHAP. I. FUNCTIONS

that. The mathematician understands under distance of a point from a linethe shortest possible distance from the point to the line, and the reader caneasily convince himself that the shortest distance from a point to a line isthe one which is measured along a straight line through the given pointperpendicular to the line. In the light of this definition, it is obvious fromFig. 1.23 that the distance from a point P(x, y) to L is given by

dPL=y+P.From (1.15) we obtain for the distance from P to Pl

dPPI = (x - 0)2 + (y - p)2.

We wish to find the locus of all points for which

dPL = dPPI,

which is clearly equivalent to dPL = dPPI.Thus, we set

(y + p)2 = x2 + (y - P)2

and from this, if we solve for y, we have

(I.17) y = 4x2.

P

The graph of this function, as indicated in Fig. 1.23 by a solid line, iscalled a parabola or, more precisely, a quadratic parabola.

Problems 1.48-1.58

1.48. Find the distance between the following two points:(a) (- 3, 1), (1, -1) (b) (3, 7), (3, 9)(c) (12, 5), (97, 5) (d) (-1, -5), (-7, -4)

1.49. State the equation of a circle with center at (-1, 3) and radius 2.1.50. Find the equation of the circle which passes through the three points

A(2, 1), B(1, 2), C(2, 23 + 1

1.51. Find the locus of all points equidistant from the two points P(1, 3) andQ(4, 9).

1.52. Find the locus of all points which have the same distance from the pointPI(0, 4) and the line L: y

1.53. Find the locus of all points which have the same distance from the point(p, 0) and the line L: x = -p.

1.54. Find the locus of all points which have the same distance from the point(1, 1) and the line x + y = 12.

1.55. The three points A(l, 0), B(3, 6) and C(2, 9) determine a triangle. Whatare the lengths of the three sides?

Page 47: ɷshawns sagan integral and differential calculus

1.8. PARABOLA 35

1.56. Let the three points in Problem 1.55 determine a parallelogram with theunderstanding that the fourth point is opposite A. What is the length of thelonger diagonal?

1.57. What is the distance of a point P(9, 9) from a point (x, y) on the peripheryof the circle x2 + y2 = 1 ?

1.58. In equation (1.16) express y as a function of x. What do you obtain ifthe square root is taken (a) positive? (b) negative? (c) positive and negative?

8. THE PARABOLA

We consider the parabola

(1.18) y = x2

(see Fig. 1.24).Let us now investigate what happens to (1.18) if we leave the parabola

where it is but consider a new x, y-coordinate system as indicated in Fig.1.24 by broken lines. Let the x- and x-axis be parallel to each other b unitsapart and let the y- and y-axis be parallel to each other a units apart.Clearly, we can see from Fig. 1.24 that, if P(x, y) is any point in the planewith the x, y-coordinates x and y and the Y, 9-coordinates Y and P, thefollowing relations hold between x, y and x, y:

(1.19)x + ay=y+b.

These relations hold for any point in the plane and, in particular, for allthe points on the parabola (1.18). In order to obtain the equation of thisparabola in the new x, 9-coordinate system, we have to substitute for x

Fig. 1.24

Page 48: ɷshawns sagan integral and differential calculus

36 CHAP. I. FUNCTIONS

Y

x

Fig. I.25

and y in (I.18) the corresponding expressions in x, y according to (I.19).The result is

y+b=(x+ a)2or

y=x2+2ax+a2-b.Let us introduce new constants A and B for a and b as follows:

(1.20) 2a = A, a2 - b = B.

Then, we can write the equation of the parabola in the new x, 9-coordinatesystem as follows:

(1.21) 9 =x2+Ax+B.

Retracing our steps, we see that any equation of the type (I.21) representsa parabola y = x2 in a suitably chosen x, y-coordinate system with theorigin at x = -a and g = -b, where a and b can be determined from(1.20) in terms of the coefficients A and B.

Let us now go one step further. Assume that Fig. 1.24 is drawn on arubber pad and let us stretch that pad in the y-direction with the under-standing that the x-axis remains at rest (see Fig. 1.25). The result is aparabola-like curve which is represented by a broken line in Fig. 1.25.What is the equation of this curve, which we will again call a parabola?To arrive at an answer, we observe that in stretching the plane in the y-direction, we change the measure units on the y-axis. Suppose we stretchthe pad in the ratio 1: 2 (i.e., a point that previously had a distance d fromthe x-axis has a distance 2d from the x-axis after the completion of the

Page 49: ɷshawns sagan integral and differential calculus

I.8. PARABOLA 37

stretching process). Consequently, what was one unit before-on the y-axis-is now 2 units, and if we stretch the pad in the ratio l :s2 (we writes2 to emphasize that this quantity is to be positive) then what was one unitbefore now becomes s2 units. (We observe that, if s2 is larger than 1, wehave a genuine stretching process. On the other hand, if s2 is less than 1,we actually have a shrinking process.) If we call the new y-axis y (inciden-tally, it coincides with the old y-axis), then we have the followingrelation between the measure unitsonyandy t(x,5=sty)(1.22) y = sty

(see also Fig. 1.26). (s y)If we substitute (1.22) into (I.21),

we obtain s

12y=x2+Ax+B

and if we let

s2 = a, s2A = (4, s2B = y, we have

(1.23) y = axe + fix + y. Fig. 1.26

Again we see that, if we retrace our steps, every equation of the form (1.23)represents a (stretched) parabola.

Let us consider the example

y=4y2+16x+4.We have, in view of s2 = 4, 4A = 16 and 4B = 4:

s2 = 4, A = 4, B = 1

and in view of (1.20)a=2,b=a2-B=4- 1 =3.

Thus, this parabola is obtained from y = x2 by carrying out the transfor-mation

x = x + 2y=y+3

and successively stretching in the ratio 1:4, i.e., the transformation y = 49(see Fig. 1.27).

Let us now consider such a stretched parabola

(1.24) y = axe -l- 9x + y

Page 50: ɷshawns sagan integral and differential calculus

38

and a straight line

(1.25)

Fig. 1.27

y=ax+b.

CHAP. 1. FUNCTIONS

We observe that the line in (1.25) is certainly not perpendicular to the x-axis(see Section 5 of this chapter), i.e., not parallel to the symmetry axis of theparabola.

We will now investigate all the possible intersections of the parabola(1.24) with the straight line (1.25). For this purpose, we have to let theequations (1.24) and (1.25) coexist, i.e., find all values of x and y for whichboth equations are satisfied. Since the y-values have to be equal, we obtain

ax+b=ax2+ fix + yor

0Cxe+(fi-a)x+y-b=0.This is a quadratic equation of the type

axe+bx+c=0(where a, b, c are new constants and have no relation to the ones usedbefore), the solution of which will be discussed in the following section.

Page 51: ɷshawns sagan integral and differential calculus

1.9. THE QUADRATIC EQUATION 39

Problems 1.59-1.67

1.59. Given y = x2. Introduce new coordinates x,y according tox=x+3y=y-4

and write the parabola equation in terms of x and y.1.60. Given y = x2 + 6x - 4. Introduce a new coordinate system x,9 such

that the equation of this parabola in the new coordinate system becomes 9 = 2.1.61. Same as in Problem 1.60 for y = -x2 + 4x + 1 (9 = -0).1.62. Given y = x,2 + 2x - 2. Stretch the y-axis in the ratio 1:4 and write

the equation of the stretched parabola. Sketch the parabola before and afterthe stretching process.

1.63. Reduce y = 9x2 - 36x + 4 to type form, i.e., to the form y = x2 insome 5,y-coordinate system.

1.64. G:ven y = x2 - 4x - 1. (a) Carry out the translation x = x + 1,y = 9 - 2 and then stretch the 9-axis in the ratio 1 :2. (b) Carry out the opera-tions in (a) in the opposite order and compare your results.

1.65. Given y = x2. Consider the transformation x = x + a, y = y + b.Draw the translated parabolas for a = 0, b = 1, 2, 3, 4 in red and for b = 0,a = 1, 2, 3, 4 in blue.

1.66. Given the parabola y = x2 - 2x + 1 and the straight line x + y = 2.Find the quadratic equation, the solution of which will yield the x-coordinatesof the intersection points.

1.67. Same as in Problem 1.66 for the y-coordinates of the intersection points.

9. THE QUADRATIC EQUATION*

It is our aim to find all values of x for which the quadratic equation(1.26) ax2 + bx + c = 0is satisfied. We can assume, of course, that a 0, because if it were equalto zero, we would not have a quadratic equation in the first place.

If the term bx is not present, i.e., if b = 0, the solution of equation (1.26)will not cause any difficulties at all. For example, if we have

4x2-9=0,we obtain after some simple manipulations

x2 = 9

4

and, consequently,3x= ± /_= f-.

4 2

* The reader who is already familiar with the solution of the quadratic equation mayomit this section.

Page 52: ɷshawns sagan integral and differential calculus

40

Thus the two solutions of our equation are

xi =9 3

If we have the equation

CHAP. I. FUNCTIONS

x2 + 1 = 0,

after shifting the absolute term to the right we obtain

x2 = - 1.

There is no real number which is such that its square is a negative number(if we multiply a positive number by itself, we obtain a positive number asa result and if we multiply a negative number by itself, we obtain again apositive number as a result), i.e., the equation x2 + I = 0 has no solution inthe domain of real numbers.

This, however, should not disturb us at all. After all, at the time whenwe worked only with the natural numbers 1, 2, 3, , adding and sub-tracting them, we eventually came across a subtraction of the type

4-7

which, as we know now, does not yield a natural number (positive integer).Well, when we were confronted with this problem we did not hesitate amoment to introduce the negative numbers, boldly writing

4 - 7 = -3,

whatever that might mean. Now we are doing essentially the same thing inthe present case. We invent new numbers which have the property thattheir square is negative. Specifically, we call

(1.27)

the imaginary unit and denote it by i for obvious reasons. * (The electricalengineers denote it by j for reasons of their own.) Thus, according to wellestablished rules and in view of (1.27), V-17 becomes

V-17=tip-11/17=i1V17and

V-4=1/-11/4=i2.* If you want a "simple" interpretation of this abstract concept, here it is: A line with

the slope i is perpendicular to itself, because, as we have seen in Problem 1.32, two linesare perpendicular to each other if, and only if, the product of their slopes is -1. Now,if the line has the slope i, then i i = - I and here you are.

Page 53: ɷshawns sagan integral and differential calculus

..9. THE QUADRATIC EQUATION 41

All the numbers of the type ib, where b is real, are called imaginarylumbers. If we add a real number a to an imaginary number ib, we obtainL complex number

a + ib

No further simplification is possible for the same reason as in adding uptomatoes and 4 potatoes. This yields 3 tomatoes and 4 potatoes and notpomatoes or something like that.)Now let us return to our quadratic equation (1.26)

axe+bx+c=0vhere we now assume that a 0, b 0 0. In this case, a simple extraction)f a square root will not lead us anywhere unless we tamper with the leftide a little. This is accomplished by a process which is called completinghe square. For example, if we have

1.28) 4x2+6x-4=0ve can first write it in the form

4(x2+2x-1)=0.ow x2 + ;x are the first two terms in the expansion of

+ 41x' 6.x+32=x2''2 9

We see that this expression differs from the one in (1.28), after the factoris taken out, only in the absolute terms. Instead of -1 we obtained s,

.e., an excess of i e. Hence(x+)2_

4 16

vill yield the desired result. Indeed, if we evaluate this expression, wePbtain

(x+)2_=x2+*x+_Th=x2+*x1.25 61'huswe can write the left side of (1.28) as

4L(x+4)2-16J.

This is the process which we call completing the square. Now, as we canasily see, this process enables us to solve the quadratic equation (1.28) inhe following manner. First, we can write it after division by 4 in the form

x2+;x- I =0.

Page 54: ɷshawns sagan integral and differential calculus

42 CHAP. 1. FUNCTIONS

We next complete the square on the left side

2

1x +

43 25

1 160,

shift the absolute term to the right side, and obtain

(x+32=2541 16

Once we have the equation in this form, we can extract the square root onboth sides and obtain

x + 4 = ± 25

J16

Thus,3 5 1 3 5x1= -4+4= -2, x2= -4-4= -2

are the two solutions of the quadratic equation (1.28).Let us now carry out the same process with the general quadratic equation

(1.26). First, we factor out the coefficient of x2:

a(x2+bx+e)=0.

a a

Next, we divide by a and complete the square as follows

((

ll2-\x+ bl

x2 +bx+ b2a a 4a2

Thus we see that2 2

x2+bx+c= x+ b - b +c.a a ` 2a 4a2 a

Hence, we can write equation (1.26) in the form

\x+ b/2 b22

C.2a 4a a

It is easy to extract the square root on both sides:

x+ b b 2 -C2a 4a2 a

i.e.,

Page 55: ɷshawns sagan integral and differential calculus

1.9. THE QUADRATIC EQUATION 43

x

Fig. 1.28

For this we obtain, after a few simple algebraic manipulations,

-b±N/b2-4acx =2a

Thus we have the following two solutions of the quadratic equation (1.26):

(1.29) x1_-b+Vb2-4ac x2 _-b-,Jb2-4ac

2a 2a

(1.29) is called the quadratic formula.We can see immediately from the quadratic formula:

(1) If b2 - 4ac is positive, then the quadratic equation (1.26) has two realand distinct roots (solutions), i.e., the para-bola y = axe + bx + c intersects the x-axisin two distinct points (see Fig. 1.28). If thequadratic equation arises out of a problemconcerning the intersection of a parabolaand a line, as we discussed it at the end ofthe preceding section, then there are twodistinct intersection points of parabola andline in this case (see Fig. 1.29).

Fig. 1.29

(2) If b2 - 4ac is negative, then Vb2 - 4ac is imaginary and the twosolutions of equation (1.26) are complex numbers. (Since they differ onlyin the sign of the imaginary part, we call them conjugate complex numbers.)In this case, the parabola y = axe + bx + c does not have any real inter-section points with the x-axis (see Fig. 1.30), or the parabola and the line ofsection 8 do not intersect at all (see Fig. I.31).

(3) Finally, if h2 - 4ac = 0, then equation (1.26) has one real solution

only, namely, x = - 2 . In this case, the parabola y = axe + bx + c

has only one intersection point with the x-axis and that is only possible if

Page 56: ɷshawns sagan integral and differential calculus

44 CHAP. I. FUNCTIONS

Fig. I.30

Fig. 1.31

;7vFig. 1.32

Fir. T.33

x

Page 57: ɷshawns sagan integral and differential calculus

I.10. TANGENT TO A PARABOLA 45

the parabola is located entirely on one side of the x-axis, touching the axis

at this particular point xb= -

2a(see Fig. 1.32). In this case we call the

x-axis tangent line to the parabola, or, if the quadratic equation arises froman intersection of a parabola with a line, the line will be tangent to theparabola (see Fig. 1.33). We will discuss this case in some detail in the nextsection.

Problems 1.68-1.74

1.68. Complete the square:

(a) x2 - 8x + 12 (b) x2 + 9x - 1(c) 16x2 + 4x + 18 (d) 3x2 - 5x + 13

1.69. a + ib is a complex number, if a and b are real. a is called the real partand b is called the imaginary part. A sum (difference) of two complex numbersis a complex number with the sum (difference) of the real parts as real part andthe sum (difference) of the imaginary parts as the imaginary part. Find:

(a) (3 + i2) + (4 - i6) (b) (-2 + i9) - (4 - i7)(c) (0 - ij) - (4 + i7) (d) (4 + i) + (4 - i)

1.70. Solve the quadratic equations which you obtained in Problems 1.66 and1.67 and discuss the geometric significance of the solutions.

1.71. Solve the following quadratic equations by the quadratic formula:

(a) x2 - 5x + 7 = 0 (b) 5x2 + 6x + 7 = 0(c) x2-17x+1 =0 (d)4x2-16x+1 =0

1.72. Given a circle with center in (1, 2) and radius 3. Find the intersectionpoints of this circle with the line y = 3x - 1.

1.73. Find the intersection points of the circle with center in (3, -1) andradius 3 with the unit circle (circle of radius 1) with the center at the origin.

1.74. Given the quadratic equation

x2+bx+I = 0.(a) For what values of b are the solutions real? (b) For what values of b does theequation only have one solution? (c) For what values of b are the solutionscomplex ?

10. TANGENT TO A PARABOLA

We consider a (stretched) quadratic parabola

(1.30) y= axe+bx+cand a straight line

(1.31) Y = Ax + B.

Page 58: ɷshawns sagan integral and differential calculus

46 CHAP. L FUNCTIONS

y

x

Fig. 1.34

In general, the line (1.31) will intersect the parabola (1.30) in two points, ifit intersects at all. The x-coordinates of the intersection points are foundif we let (1.30) and (1.31) co-exist as follows:

axe+bx+c=Ax+B.This is a quadratic equation which, after rearrangement of terms, can bewritten in the form

(1.32) axe+(b-A)x+c-B=0and has, according to (I.29), the solutions

(1.33)-(b-A)±J(b-A)2-4a(c-B)

2a

Figure 1.34 illustrates the situation for the case where the solutions (1.33)are real and different.

Now let us keep the intersection point P1 of the parabola with the linewhich has the x-coordinate x = x1 fixed and move the other intersectionpoint P2 along the parabolic arc towards P1, i.e., let the x-coordinate of thesecond intersection point x2 approach x1, thex-coordinate of the intersectionpoint P1. During this process, the line through Pl and P2 will turn asindicated in Fig. 1.35 where we have depicted the situation for severalpositions of P2.

We see that the closer x2 is to x1, the closer the line through P1 and P.2 willresemble what we consider to be the tangent line to the parabola at thepoint P1. The tangent line is drawn in Fig. 1.35 as a bold line.

Page 59: ɷshawns sagan integral and differential calculus

1.10. TANGENT TO A PARABOLA 47

Let us now investigate what happens algebraically in formula (1.33), ifwe turn the line through Pl and P2 in this manner through suitable changesof A and B. Because the two solutions xl and x2 of the quadratic equation(I.32) move closer together, their difference becomes smaller. Since we havefrom (1.33)

xl =2a '

-(b-A)-J(b-A)2-4a(c-B)X2

2awe obtain for the difference

(1.34) x1 - x2 =,/(b - A)2 - 4a(c - B)

a

This difference can only become smaller if the expression under the squareroot becomes smaller, or the denominator a becomes larger, or a combi-nation thereof. That a becomes larger can be excluded on the grounds thatthe parabola remains the same during the entire process and, therefore, ais a fixed number.

Hence, we can say the smaller(b - A)2 - 4a(c - B),

the closer the line through Pl and P2 will resemble the tangent line to theparabola at the point Pl and, consequently, it appears that we will obtain

Fig. 1.35

Page 60: ɷshawns sagan integral and differential calculus

48 CHAP. I. FUNCTIONS

the tangent line itself, if P, and P2 coincide, i.e., if

(b-A)2-4a(c-B)=0.This leads to the condition: The line y` = Ax + B is tangent line to the

parabola y = axe + bx + c if

(1.35) (b - A)2 = 4a(c - B).

Let us consider an example: Given the parabola

y=x2 (a=l,b=0,c=0)and the line

y = Ax - 1 (B = -1).

Let us determine the slope A of the line such that the line becomes tangentto the parabola. From (1.35)

(0 - A)2 = 4(0 + 1)or

A=±2.Thus it would appear that

y = 2x - 1as well as

y = -2x - 1

are tangent to the parabola y = x2. This is indeed the case, as illustrated inFig. 1.36.

Utilizing condition (1.35), we are now able to find the slope of the tangentline to the parabola (1.30) at any point with the x-coordinate x0. It followsfrom (1.30) that

(1.36) P0(xo, axo2 + bxo + c)

is a point on the parabola (1.30) for any x0. Now, using the point-slopeformula (1.4), we consider all lines through this point and obtain theequation

y-axo2-bxo-c=A(x- x0)or

(1.37) y = Ax + axo 2 + (b - A)xo + c

as the equation of all lines through the point (1.36). We note that

B = axo 2 + (b - A)xo + c

and obtain in view of (1.35) the following condition on A:

(b - A)2 = 4a[c - axo2 - (b - A)xo - c]

Page 61: ɷshawns sagan integral and differential calculus

1.10. TANGENT TO A PARABOLA 49

y

Fig. 1.36

or, after some algebraic manipulations,

A2 - (2b + 4axo)A + b2 + 4a2xo2 + 4abxo = 0.

The solution of this quadratic equation is obtained from

2b + 4axo ± ,/4b2 + 16abxo + 16a2x 2 - 4b2 - 16a2xo2 - 16abxoA1,2 = 2

Since the expression under the square root vanishes, we have

(1.38) A = 2axo + b.

Thus we can say: The tangent line to the parabola y = axe + bx + c atthe point with the x-coordinate x0 has the slope 2axo + b. (In Chapter III,Section 2, we will obtain a more general result which will contain this oneas a special case.)

As an example, let us consider

y=2x2-4x+3

Page 62: ɷshawns sagan integral and differential calculus

50 CHAP. I. FUNCTIONS

and the point on the parabola with the x-coordinate x0 = 2. Then for theslope of the tangent line at this point we have, according to (1.38),

A=8-4=4.The y-value corresponding to x0 = 2 is found to be. yo = 3. Hence we findthe equation of the tangent line from the point-slope formula to be

y - 3 = 4(x - 2),and from this

Problems I.75-I.84

y = 4x - 5.

1.75. Is it possible that a line y = Ax + B crosses a parabola y = ax 2 + bx +c in one point only? (Hint: Assume that the line crosses the parabola in thepoint with the x-coordinate x0, but is not the tangent line to the parabola at thispoint, i.e., has a slope which is different from 2axo + b. Is it then possible thatno other intersection takes place?

1.76. Find the equation of the tangent line to the parabola at the indicatedpoint.

(a) y = 4x2 - 6x + 3, (1, 1) (b) y = -2x2 + 7, (2, -1)(c) y = x2 + 12x + 3, (-1,?) (d) y = -3x2 + 3x - 3, (2, ?)

1.77. Find the tangent line(s) to the parabola y = axe + bx + c whichpass(es) through a point P0 that does not lie on the parabola:

(a)a=1, b=3, c=-1, PO(l,-2)(b) a = -2, b = 1, c = -1, PO(3, 12)

(c) a = 1, b = 2, c = 1, Pa(-4, -10)(d) a = -4, b = -1, c = 1, PO(2, 7)

1.78. Find the equation of the tangent line to the parabola y = x2 + 4x - 3which is parallel to the line y = 5x + 12.

1.79. Same as in Problem 1.78 for y = 2x2 - 3x + 2 and y = 13x - 4.1.80. Determine a such that the quadratic equation

axe+ax-4=0has one solution only.

1.81. Determine b such that the quadratic equation

x2+bx + b = 0has one solution only.

1.82. At what point does the parabola

y=2x2+3x-4have a horizontal tangent line? (A horizontal line has the slope 0.)

1.83. Show that the x-axis is tangent line to the parabola

y=x2-4x+4.1.84. Show that a parabola always has one, and only one, horizontal tangent

line.

Page 63: ɷshawns sagan integral and differential calculus

1.11. INTERVAL NOTATION

11. INTERVAL NOTATION

51

In many cases we will not study a function which is given by a mathe-matical formula for the entire domain of x-values. Sometimes, as in the

example y = x , it will be necessary to exclude certain values from the

x-domain. At other times, how a function behaves on only a certain partof the x-range will be of interest to us and what the function does elsewherewill be immaterial. In order to state in a simple and precise manner forwhat values of x a function is to be studied, we will introduce in thissection a special notation.

First, let us introduce two new symbols. a < b is read "a is less than b"and means that the point representing a on the line of numbers is to theleft of the point representing b on the line of numbers. a < b is read "a isless than or equal to b" and means that the point representing a on the lineof numbers is either to the left of the point representing b on the line ofnumbers or coincides with the latter.

For example,x<3

indicates that all values of x which are less than 3 are considered, while

x <2

signifies that all values of x which are either less than 2, or at most equal to2, are to be considered.

Similarly, we use the symbols a > b and a > b, which stand for "agreater than b" and "a greater than or equal to b" respectively.

Thus, for example, to characterize all the values of x which are between- I and 4, but neither equal to -1 nor to 4, we write

-1 <x<4.If we wish to include the numbers -1 and 4 in our consideration, we write

-1 <x <4.We call such a section of the x-axis, which is terminated by two points,

an interval.Specifically, we call a < x < b an open interval. Here x may range over

all values that are between a and b, but must not assume the values a andb themselves. a < x < b is called a closed interval. Here x is allowed toassume the values a and b in addition to all values between these. Finally,a <_ x < b and a < x < b are called semiclosed or semiopen intervals forobvious reasons.

Page 64: ɷshawns sagan integral and differential calculus

52 CHAP. I. FUNCTIONS

a<x<b a<x<b

Fig. 1.37

a<x<b

Graphically, we represent intervals as demonstrated in Fig. 1.37. Inter-vals are not necessarily bounded on one of their ends. We then speak ofhalf-lines. For example, x > I denotes that part of the x-axis that beginsat x = I and proceeds to the right.

As an example, let us find the interval within which the function

y=x2-x-2=(x+ 1)(x-2)is negative, i.e., y < 0. We can see immediately that y vanishes at x = - Iand at x = 2. At x = 0, we have y = -1. So it appears obvious that

y<0forallxin -1 <x<2(y <0forallxin -1 <x <2).

As a second example, let us find the intervals in which the function

1Y

(x - 3)x

is defined (meaningful). Clearly, at x = 0 and at x = 3 the denominatorvanishes and the function is, therefore, not defined at these points. How-ever, for all values of x which are located in any one of the intervals

the function is defined.

Problems 1.85-1.92

x<0,0<x<3,x>3

1.85. Represent the following intervals graphically:(a) -2 < x < 7(c) -4 <x < -1(e) -3 < x

(b)5<x<7(d)4<x<18(f)x<-5

I.86. Find the interval(s) in which the function

y =x2 -x -2is positive.

1.87. For which intervals does the function

J = -V(x - 1)(x - 2)(x - 3)have real values?

1.88. For which intervals is the function y = x2 - x - 2 defined?

1.89. Shade the region in the x,y-plane for which 1 < x < 4 and -1 < y < 3.1.90. For what values of x is x2 < x true?

Page 65: ɷshawns sagan integral and differential calculus

1.12. MORE EXAMPLES OF FUNCTIONS-CONTINUITY

1.91. Given the functiony = x3.

For which values of x does the inequality

1<y<8hold?

1.92. Find the interval which is common to the intervals

-1<x<3 and 1<x<29.

12. MORE EXAMPLES OF FUNCTIONS-CONTINUITY

53

Let us discuss in this section some functions which are somewhat out ofthe ordinary.

First, we discuss the function

(1.39) y = 1x1 .

(Read: "Absolute value of x".) Let us shortly explain the meaning of the"absolute value symbol." Any number can be represented as a point on theline of numbers, as we explained in Section 2. This point has a certainunique distance from the origin and this distance-which as a distance is apositive number-is what we call the absolute value of a number.

For example,141 = 4, 19.0311 = 9.031,

however,1-31 = 3, 1-0.5611=0.561,etc. -

As a rule, we may remember that, if a number is positive, it is equal to itsabsolute value. On the other hand, if a number is negative, its absolutevalue is obtained by changing the sign from - to + or, as we can state it,by multiplying the number by -1.

Thus, if x > 0, then lxl = x; but, if x < 0, then 1x1 = -x. To illustratethe latter case, let x = -2 < 0. Then 1-21 = -(-2) = 2.

Now we are ready to discuss the function in (1.39). We have

_ xifx>0y=1xl -i-x ifx<0.

Thus, for all x > 0, the function is given by

y=x,x>0which is represented by the half-line which bisects the first quadrant of theright coordinate system (see Fig. 1.38).

For all x < 0 we havey= -x,x<0

Page 66: ɷshawns sagan integral and differential calculus

54

Fig. 1.38

CHAP. 1. FUNCTIONS

x

which is represented by the half-line which bisects the second quadrant ofthe right coordinate system (see Fig. 1.38). Thus we see that the functiony = JxJ is represented by a bent line with a sharp corner at the origin.

Next, let us consider the function

(1.40)

We were strictly brought up in the belief that division by 0 is prohibited.So we have to exclude the value x = 0 from our discussion, since o wouldbe quite meaningless. However, we can still discuss this function for allvalues x 0 which includes values that are very close to 0, and as a matterof fact, as close to 0 as we please to choose them. For example, if x = 100then, y = 100. If we take x = 1ooooooojooo000002 then, y =10000000000000000 etc. We see that the closer the value of x comes to0, the larger the corresponding value of y will be. We express this simplyby stating that as x approaches 0 (x 0), the corresponding value of yapproaches infinity (y cc). Note that the symbol co means neither morenor less than what it means on the lens of a photographic camera. It justexpresses numbers (distances) which are larger than any number (distance)which we would care to state (measure).

Rememberoo is not a number.

It is just a symbol that helps us abbreviate the awkward statement "largerthan any number one cares to state."

The graph of the function (1.40) is found in Fig. 1.39. We can see thatthe curve representing this function comes closer and closer to the y-axis.It will never reach the y-axis, however, because reaching the y-axis would

mean that the function y = 1 has a value for x = 0 which is not the case.X

Page 67: ɷshawns sagan integral and differential calculus

1.12. MORE EXAMPLES OF FUNCTIONS-CONTINUITY 55

We call a line that is approached in such a manner an asymptote: the y-axis

is an asymptote of y = - . By the same token, we can state that the x-axisx

is an asymptote, since it is approached by the curve in a similar manner,i.e., as x --> oo, then y -+ 0 (and as x -+ - co, then y -> 0).

Next, let us consider the function

(1.41) y = [x], x > 0.

(Read: "Largest integer less than or equal to x.") As the note in parenthesesindicates, the bracket symbol signifies that we have to take the largestinteger which is smaller than or equal to x. Thus,

[0.3] = 0, [4.2] = 4, [5] = 5, [9.9865] = 9, etc.

If we wish to draw the graph of this function for all x > 0, we have to dis-cuss the function for every interval which is terminated by two consecutiveintegers. If 0 < x < 1, then we have clearly

y=0,0 <x<1.In the interval 1 < x < 2, we obtain

y = 1, 1 <x<2and for 2 < x < 3 we have

y=2,2<x<3,etc.

y

x

Fig. 1.39

Page 68: ɷshawns sagan integral and differential calculus

56 CHAP. I. FUNCTIONS

The reader can see that we obtain for y = [x] the following definition ifwe wish to avoid the brackets symbol:

0for0<x<11for 1 <x<22 for 2 <x<33 for 3 <x<44for4<x<55 for 5 <x<66 for 6 <x<7

y= 7for7 <x<8

n for n < x < n + I

which is to be continued ad infinitum. Now we can see clearly that thefunction defined in (1.41) is represented by the graph in Fig. 1.40. Since thisgraph resembles the steps in a staircase, we call the function y = [x] thestep function.

Next, we discuss the function

(1.42) y. _

1for0<x<1- lforl < x<2

1for2<x<3-lfor3 <x<4

1for 4<x<5-1 for 5 <x<6

It is obvious how the definition of this function is to be continued. Italways has the value I in every interval of length I that is bounded belowby an even integer and has the value -1 in every interval of length 1 thatis bounded below by an odd integer. In general, we have

Y -lfor2n+1 <x<2n+21 for2n <x<2n+ 1

where n assumes all nonnegative integral values.

Page 69: ɷshawns sagan integral and differential calculus

1.12. MORE EXAMPLES OF FUNCTIONS-CONTINUITY

Y

5

4

3

2

1

Fig. 1.40

57

-x

The graph of this function is given in Fig. 1.41. We can see that thisfunction exhibits a peculiar behavior inasmuch as it repeats itself every 2units. We call such a function a periodic function. The function in (1.42),in particular, is a function of period 2.

We can describe such a function very economically by making use of thesymbol f (x) as follows :

(1.42a) y =f(x) =1 for 0 < x < 1

-1 for 1 < x < 2

with the additional information that

(1.43) f (x + 2) = f (x) for all x > 0,

i.e., if we know the value off(x) for some x, then we know that it has thesame value at x + 2. Since we know the values of the function for all

Fig. 1.41

Page 70: ɷshawns sagan integral and differential calculus

58

Y

i

2

i

2

Fig. 1.42

1 2_> x

values of x in 0 :s-:, x < 2 from (1.42a), we also know it for all values of xwhich are greater than 2 because of (1.43). It is left to the reader to extendthe definition of this function for negative values of x such that the perio-dicity is preserved.

Finally, let us consider one more periodic function, namely,

I x for 0 <x<:(1.44) y =Ax) _ [-x+1where

(1.45)

for z<x<1

.f (x + 1) = f (x).

This is a function with period 1. In order to graph this function, we firstfocus our attention on the interval 0 < x < 1 where we easily obtain thegraph as indicated by a solid line in Fig. 1.42. Then we simply repeat thegraph to the left and the right as demanded by the periodicity 1 of ourfunction. This is indicated by a broken line in Fig. 1.42. 4

The functions represented in Figs. 1.39, 1.40, and 1.41 exhibit a property(or properties) which we have not encountered before in dealing withstraight lines and parabolas. It is easier to discuss the phenomenon we havein mind from a negative standpoint by pointing out what properties thesefunctions do not have rather than explaining directly what they do have.

Consider a river between the point of origin and the termination point.A river cannot suddenly cease to exist and continue at some other distantpoint. (It can go underground for a while but that does not mean that itceases to exist.) And this "river property" is what those functions in Figs.1.39, 1.40, and 1.41 do not have. The branch of the function in Fig. 1.39to the left of the y-axis as well as the branch to the right of the y-axis couldconceivably represent rivers. However, the entirety of this function could

not because the water cannot cross the y-axis along the graph of y = - .

CHAP. I. FUNCTIONS

Page 71: ɷshawns sagan integral and differential calculus

1.12. MORE EXAMPLES OF FUNCTIONS-CONTINUITY 59

The point x = 0 is called a point of discontinuity of our function. Like-wise, the points represented by integral values of x are points of discon-tinuity of the functions represented in Figs. 1.40 and 1.41. If a functiondoes not have discontinuities, then we call it continuous.

Let us suppose that we are in possession of a microscope of infinite mag-nifying power and we focus this microscope on a point of the graph of afunction. Then, if the image in the viewer turns out to be an uninterruptedline through this point, the function is continuous at the point. Otherwise,the point is one of discontinuity. Of course, there are no microscopes ofinfinite magnifying power and, even if there were, there are no means ofdrawing a "continuous" curve that would stand up to such a microscopicinvestigation.

So the question of whether a function is continuous or not cannot besettled by an examination of its graph. What we have to do is examine themathematical function itself after having translated the intuitive conceptof the "river property" into rigorous mathematical language. We will notindulge in such a discussion in our treatment. The functions which we willdeal with in future are continuous or, if they have points of discontinuity,these can be easily noticed.

We will, however, introduce in the next section the stronger concept ofuniform continuity (which is equivalent to continuity in every point of aclosed interval) of which we will make use in Chapter II, Section 8, in alimited way.

Problems 1.93-1.103

1.93. Sketch the function y = x - 11.94. Sketch the function y = x13.1.95. Show that Ix + yl = kxi + jyj, if x > 0 and y >_ 0 or x < 0 and y <- 0;

and that Ix +yI <jxj +Iyl ifeitherx >0,y <0orx <0, y >0.1.96. Sketch the function y = [IxI] for all x (within reason).1.97. Show that [x - }] < x. For what values of x does the equal sign hold?1.98. Sketch the function

y =f(x) =(1 for 0 <x <0forz <x<1

where f (x + 1) = f (x).1.99. Graph the function

y = f(x) = x for -1 <- x < lwhere f (x + 2) = f (x).

1.100. Graph the function

y = x-1

Page 72: ɷshawns sagan integral and differential calculus

60 CHAP. 1. FUNCTIONS

1.101. Graph the function

1.102. Graph the function

1

y= 1 -x2

y=[x]-IxI

1.103. Graph the functions y =TX

, y = X, and y = xi in the same coordinate

system in different colors. Discuss the behavior of these functions relative toeach other.

13. UNIFORM CONTINUITY*

The functions defined in (1.40), (1.41), and (1.42) in the preceding sectionshow some characteristic features that set them apart from the functionswhich we discussed on previous occasions, i.e., linear functions and parab-olas. What makes these functions so special is that once we start drawingthe curves which represent them, it is not possible to do so without havingto interrupt the process at certain points [e.g., the points where the functionsin (1.41) and (1.42) jumped from one level to another, or where the functionin (1.40) "jumped" from - oo to + co]. In other words, these functionsare not representable by "continuous" graphs, if we accept for the momentthe unsophisticated everyday meaning of the term "continuous."

If we attempt to make a study embracing all functions from an elemen-tary standpoint, using simple and elementary techniques, we encounteralmost insurmountable difficulties. It turns out that a great simplificationcan be obtained if we exclude all these functions that jump, either througha finite distance or through an infinite distance, or show some other erraticbehavior of which we will exhibit examples later. In order to eliminatesuch functions, we have to establish a criterion which will enable us to dis-tinguish in a formal mathematical fashion between "well-behaved" and"not-so-well-behaved" functions. The criterion which.would seem obvious,namely, that we should be able to draw the graph of the function with anuninterrupted stroke of the pencil, is, unfortunately, inadequate becauseit would eliminate functions which exhibit neither of the frowned uponphenomena but still defy graph drawing as is shown in Appendix 11.

In order to arrive at some workable criterion, let us consider three func-tions, each one in a closed interval of finite length a < x < b, as representedin (a), (b), and (c) of Fig. 1.43. A function of the type as represented in Fig.1.43(a) we consider acceptable. Functions as depicted in (b) and (c) we willreject in the future. Now, what is it that distinguishes the function in Fig.

* The reader may omit this section for the time being and take it up at a later timewhen he has reached greater mathematical maturity.

Page 73: ɷshawns sagan integral and differential calculus

1.13. UNIFORM CONTINUITY

y

>- x

(a)

y

(b)

y

Fig. 1.43

(c)

61

I.43(a) from those in (b) and (c) ? Clearly, if we choose any two x valuesx1, x2 in a < x < b which are very close together, then we can see that thecorresponding y-values f (x1), f (x2) are also quite close together in case ofFig. I.43(a). However, if we look at (b) and (c) of Fig. 1.43 we can see thatit is possible to choose two x-values such that the corresponding y-valuesare far apart, no matter how close the two x-values are.

This is really the whole situation in a nutshell. All we have to do now isto formulate this idea mathematically so that it is accessible to formal ma-nipulations. Let us consider for this purpose an enlargement of Fig. 1.43(a)as presented in Fig. 1.44.

Page 74: ɷshawns sagan integral and differential calculus

62 CHAP. I. FUNCTIONS

Let us choose a positive numbers (read: epsilon) quite arbitrarily. Now

draw a net of horizontal parallel lines which are 2 units apart, as indicated

in Fig. 1.44. We embark on a journey on our curve, starting at the pointa, f(a) and proceeding to the right. On this journey, we will have to crosssome of the parallel lines one at a time. Whenever we do, we mark theintersection point. If we project these intersection points vertically ontothe x-axis, we obtain the points that are designated by al, a2, a3, 0.1 11 in

Fig. 1.44. We can see now that whenever we pick out two x-values, x1 andx2, which are both located in an interval terminated by two consecutivepoints OCk, ak+1(k = 0, 1, 2, - , 11 if we agree to let a = ao and b = 012),then the corresponding y-values, f(x1) andf(x2), differ from each other by

than less 2 , because the portion of the curve between ak and ak±1 remains

entirely between two adjacent horizontal lines which are a distance 2 apart.

Let us denote the lengths of the subintervals which are terminated by thepoints ak by ak (read: delta k) :

61 = a1 - ao, S2 = a2 - d1, ... , 612 = 0'12 - al l

E/2{E/2,

E/2{E/2{

E/2{E/2{

E/2{

E/2{E/2{

E/2{ Tr1 1

f(x2) - f(xl)I I IIIII

I. i '

II , I

IlIsl si ! 1111111

H I 2;1 1 1 1 1 1

°I ts ;St -I

a4

x

Fig. 1.44

Page 75: ɷshawns sagan integral and differential calculus

1.13. UNIFORM CONTINUITY 63

and let us pick from these 8k's the smallest one and denote it by 8£

8E = minimum (81, 62, , 812).

We can see now that the following is true.If x1, x2 are two points on the x-axis which are less than 6E apart:

-8E<X1-X2<8$,

then the corresponding values of the function f(xl) and f(x2) cannot befarther apart than s :

-8 <f(XI) -f(x2) < 8,for the following reason.

Either x1 and x2 both lie in the same interval ak < x < ak+1 (k =0, 1, 2, 3, , or 11) or they don't. If they do, then both the correspond-ing values of the function lie in the same horizontal strip formed by two

adjacent parallel lines, 2 apart.

On the other hand, if x1 and x2 do not lie in the same interval ak < x <ak+l, then suppose x1 lies in ak < x < ak+l. It follows that x2 will either liein ak_1 < x < ak or in ak+1 < x < ak+2 It cannot lie in any other intervalbecause it is less than 8E apart from x1 and the length of either adjacentinterval is at least 8E.

But then the value of the function corresponding to x1 will lie in one stripof width s/2 and the value of the function corresponding to x2 will lie eitherin the strip above or below-if not in the same strip. Now, the maximumvertical distance between any two points in adjacent horizontal strips iscertainly less than s, and our assertion is proved.

Before we go any further, let us consider an example. Let

y=f(x)=x2in the interval 0 < x < 1 (see Fig. 1.45). Let s = 4. We can see that ourfunction shows the steepest increase at the end of the interval. At x = 1,the function has the value 1. Now, where does it have a value that is

I(= 2) smaller than 1, i.e., where does the function have the value -1g?

Clearly, the positive solution of a = x2, which is x = 1/$ is the answer.Thus it follows that the values of our function for any two x-values between

and I differ by less than 8. But we see at the same time that the differ-ence between any two values of our function is less than whenever the two

x-values are less than 1 apart. This is the smallest bk which we called6E (in our case 6,14).

Page 76: ɷshawns sagan integral and differential calculus

64 CHAP. I. FUNCTIONS

Y

1

Fig. 1.45

What we have done up to now will not get us very far. For example,consider the function

y= 1 [x] in 0<x<102000

which is drawn in Fig. 1.46 with a considerably enlarged y-scale and takes = i 0 o Obviously, two values of this function are certainly not fartherapart than . o- as long as the x-values for which these values are taken arenot farther apart than I and yet, this function is one of those which we wishto reject. But now we arrive at the crux of the matter: While it is stillpossible to find a Ss that will work with e = -C6-0 in the entire interval, it

Y

101'2000 F

52000

t

II I

II

IIIr- I I

I IIII I

l iT.

L1 2 3 4 5 61 7 8 9 10X2

Xj

Fig. 1.46

3 X

Page 77: ɷshawns sagan integral and differential calculus

1.13. UNIFORM CONTINUITY 65

will be impossible to find one if we choose s = 10 0 0 0. For then, no matterhow small we would choose be, there are always pairs of x-values less than6e apart such that the corresponding y-values of the function differ by morethan s = i I0000So

in order finally to get rid of stepfunctions and similar manifestationsof sophisticated mathematical minds, we will have to require that no matterhow small we choose our E, it shall always be possible to find a be which issuch that

whenever

- e < f (x1) - f (x2) < E

- be < x1 - x2 < 8e.

Hence, we put forth the following definition, using absolute value symbolsinstead of the awkward two-sided inequalities :

We call a function y = f (x) uniformly continuous in the interval a < x < bif it is possible to find for any e > 0, no matter how small, a number 8Ewhich is such that

wheneverI./ (xl) -f(X2)1 < e

1X1 - x21 < Se.

(The prefix "uniformly" in front of "continuous" is used to distinguishthis property from the ordinary continuity which is a point property ratherthan an interval property and was mentioned at the conclusion of the pre-ceding section.)

In the light of this definition, we can see right away that the functiony = x2 is uniformly continuous in 0 < x <- 1 (or, as a matter of fact, inany closed interval). To demonstrate this, we will show that it is possibleto find for any E > 0, no matter how small, a be such that

I f (xl) - f(X2)1 = I x12 - x221 < E

as long as Ixl - x2I < be.Now,

J X12 - x221 = I (xl - x2)(x1 + x2)1 < 2 I x1 - x2I

since xl < 1, x2 < 1.Thus we can see that whenever

EIx1-x21 <2

then Ix12 - x221 < e, no matter how small we have chosen our e.On the other hand, if we consider the function

y = [x]

Page 78: ɷshawns sagan integral and differential calculus

66 CHAP. I. FUNCTIONS

(see Fig. 1.40), we see that if we choose, for example, e = 2i then no matterhow small we choose 6s, we will always find two points xl and x2 closerthan 6. such that the corresponding values of the function are more thanJ apart. Clearly, all we have to do is choose xl on one side of a jump andx2 on the other side of the jump, and we have it.

Finally, it is almost needless to say that the function

y 1-xin 0 < x < 2, which is the type depicted in Fig. 1.43(c), is certainly not uni-formly continuous in 0 < x < 2. How could it be? Choose xl on the leftof x = 1 and x2 to the right of x = 1 as close together as you please, andyou will find that the corresponding values of the function will be fartherapart than you would care to measure, instead of being less than a smalldistance e apart.

Before we close this section let us mention a few other types of functionswe rid ourselves of through restricting our considerations to uniformlycontinuous functions in accordance with the definition which we presentedin this section.

Take, for example, the function

_ Iforx=0y-_f(x)

0for x; 0

in -1 < x < 1(see Fig. 1.47). Let a = J and take xl = 0. No matter howclose to 0 the value x2 is chosen, the difference of the two correspondingvalues of the function will be I and not less than 2. (A discontinuity of thetype as exhibited by this function at x = 0 is called a removable discontinuitybecause this function can be made uniformly continuous in the interval-1 < x < 1-or in any interval including the origin, for that matter-byredefining its value at x = 0 as f (0) = 0.)

y

I

1

x

Fig. 1.47

Page 79: ɷshawns sagan integral and differential calculus

[.13. UNIFORM CONTINUITY

I 1

67

Fig. 1.48

Page 80: ɷshawns sagan integral and differential calculus

68 CHAP. I. FUNCTIONS

Finally, we consider a function that is graphically represented by a poly-gon that oscillates between y = -1 and y = 1, assuming the value y = 1

at the points x = 1,1 , 1, 1, 1, 1 I , ... , and the value y = -13 5 7 9 11 2n + 1

at the points x =2, 4' 6' 8' ' 2n

..... in the interval 0 < x < I (see

Fig. 1.48). This function is not uniformly continuous, either. Take any eand a Se no matter how small, there will always be two points x1, x2 in thevicinity of x = 0 such that the corresponding values of the function willdiffer by more than e, provided e < 2.

In order to connect the concept of uniform continuity discussed in thissection with the concept of continuity mentioned in the preceding section,let us summarize the situation as follows: if a function is uniformly con-tinuous in a closed interval a < x < b, then it is continuous at every pointof the interval. However, if a function is only continuous at every point ofan open interval a < x < b, then it is not necessarily uniformly continuousin the closed interval a < x < b. The function y = 1/x illustrated this veryclearly. This function is continuous at every point of 0 < x < 1, i.e., hasno discontinuities in 0 < x < 1. However, it is not uniformly continuousin 0 < x < 1 because of its discontinuity at the point x = 0.

We wish to stress that uniform continuity is an interval property, whereasplain continuity is merely a point property. We will not make use of theconcept of continuity in this treatment, except for the fact that we will notconsider functions which have discontinuities or, if they do have discon-tinuities, they will be easy to spot. The concept of uniform continuity,.however, will be used in Chapter II, Section 8, though only in a limited way.

Problems 1.104-1.108

1.104. Given y = x2 + 7. Let s = . Find a SE such that I f(xl) - f(x2)I < sfor all Ixl - x2l < 6e for all x in the interval 1 < x < 2.

1.105. Given y = x2 + 2x + 5. Show that this function is uniformly con-tinuous in the interval -1 <- x < 1.

1.106. State closed intervals within which the function

1Y

(x - 1)(x + 2)is uniformly continuous.

1.107. Is the function x2-1for x 0 1

4 for x = 1continuous at the point x = 1 ?

I.108. If the function in problem 1.107 is not continuous at x = 1, thenredefine the function at x = I so that it becomes continuous. 4

Page 81: ɷshawns sagan integral and differential calculus

SUPPLEMENTARY PROBLEMS I

Supplementary Problems I

3.1. Sketch the function

y =

xfor -oo <x <01 for x = 0x2+2for0<x<13for1 <x<2X2 +2for2 <x < co

69

3.2. Given x1 f (x + h) - .f (x)f() =

7x. Find h and simplify.

3.3. Givenf(x) = x4. Findf(x + h) and expand.3.4. Given y2 = x. Sketch the function. How many y-values correspond to

each x-value?4.1. Find the equations of all lines through the point P(l, -2).4.2. Find the equations of all lines with the slope m = '-.4.3. Which line meets the specifications of problems 4.1 and 4.2 simul-

taneously ?4.4. Given the line L: y = 3x + 1. Find the equation of the line perpendic-

ular to L which passes through the point P(1, 3). (See Problem I.32.)4.5. Find the equations of all lines which are perpendicular to the line

y=-3x+3.4.6. From the lines in Problem 4.5, pick out the one which passes through the

origin.5.1. Find the equation of the line which is perpendicular to the line y = 3 and

passes through the point P(1, 4).5.2. Given 3x + 4y = 1 and 6x + 8y = F. Assign such a value to F that

(a) the two lines are parallel and intersect the y-axis at points 3 units apart,(b) the two lines coincide.

5.3. Sketch the lines (a) x + y = 1, (b) x + y = 2, (c) x + y = 3. Whatfeature is common to all these lines?

5.4. Find the equation of the line through the two points (a) P1(1, 3),P2(1, 5)and (b) P1(4, 7), P2(13, 7).

5.5. Given two vertices A(0, 0) and B(4, 0) of an isosceles triangle, the twoequal sides of which terminate in the third vertex C. Find the equation of theline through B and C. Note that C is not uniquely determined.

6.1. Find the equation of the line which passes through the intersection pointof the two lines x + y = 2 and 3x + 2y = 5 and the intersection point of thetwo lines x - 4y = 8 and x + y = 3.

6.2. Find the equations of all lines which pass through the intersection pointof the lines x = 4 and y = 1.

6.3. Given the lines x + 2y = 3 and x - By = 1. Choose B so that the twolines intersect at the point P(l, 1).

Page 82: ɷshawns sagan integral and differential calculus

70 CHAP. I. FUNCTIONS

6.4. Evaluate the determinants

(a)

6.5.

1 9375

1 1

all a12 a13

a21 a22 a23

a31 a32 a33

(b) 0 1

-1 10827

(c) 526 2

1052 4

is called a determinant of the third order.

The following rules are valid:

and

1 0 0

0 a22 a23

0 a32 a33

all a12 + nail

a21 a22 + na21

a13

a23

a31 a32 + na31 a33

a22 a23

a32 a33

all a12 a13

a21 a22 a23

a31 a32 a33

for any positive or negative n. Utilizing these rules, evaluate the determinant

1 4 0

0 3 2

0 9 8

(Hint: Let n = -4.)7.1. Find the locus of all points P(x, y) which are such that the sum of the

distances from P to Fl(-c, 0) and from P to F2(c, 0) is constant: dprl + dpF2 =2a where a is a constant. In the final result, substitute for c a constant b accordingto a2 - b2 = c2 and simplify. (This locus is an ellipse.)

7.2. Find the locus of all points P(x, y) which are such that the difference of itsdistances from Fl(-c, 0) and F2(c, 0) is constant: dprl - dpr2 = 2a where ais a constant. In the final result substitute for c a constant b according toa2 + b2 = c2. (This locus is a hyperbola.)

7.3. Sketch the graph of the locus in Problem 7.1.7.4. Sketch the graph of the locus in Problem 7.2.7.5. Given a point P(4, 2) and a line L: x + y = 1. Find the shortest distance

from P to L by intersecting a line perpendicular to L through P with L andcomputing the distance from P to the intersection point.

7.6. Given the circle x2 + y2 = 1 and a point P(0, 1) on the circumference ofthe circle. Find another point Q(x, y) on the circumference of the circle at adistance j from P.

8.1. Given y = x2. Introduce a new coordinate system according to x =x + 4, g = y - 3 and find the equation of the parabola in the new coordinatesystem.

8.2. Introduce a new coordinate system so that the parabola y = 4x2 + 16x -64 assumes the form g = x2.

Page 83: ɷshawns sagan integral and differential calculus

SUPPLEMENTARY PROBLEMS I 71

8.3. Find the intersection points of the parabola y = x2 + 2 with the liney = x + 3. Introduce a new coordinate system according to x = x + 3,g = y - 2. Find the coordinates of the intersection points of the line with theparabola in the new coordinate system by two different methods.

8.4. Given two lines y = 3x + 4 and 9 = 3x - 1 in two different coordinatesystems. Find a and b in x = x + a, y = y + b so that the two lines coincidein the x, g-coordinate system.

8.5. Find the quadratic equation which has the x-coordinates of the inter-section points of the parabola y = x2 with the circle x2 + y2 = 4 as solutions.

9.1. Show that the sum of the two solutions of a quadratic equation is alwaysreal, whether the solutions are real or not.

9.2. Given the quadratic equation ax2 + bx + c = 0. Show that, if x1, x2 aretwo (real or complex) numbers such that -a(x1 + x2) = b and axlx2 = c, thenx1 and x2 are solutions of the quadratic equation.

9.3. Multiplication of complex numbers is defined as follows:

(a + ib)(c + id) = ac - bd + i(bc + ad).

Show that the product of two conjugate complex numbers is real.9.4. Fractions with complex numerator and/or denominator are manipulated

like ordinary fractions. Multiply numerator and denominator in the followingfractions by a suitable complex number so that the denominator becomes real:

(a) 1 - i2 (b) 3 - i () i

1 +i 4-i3 c 1 -i59.5. Given the parabola y = x2 + 1 and the straight line y = mx - 2.

Determine the slope m < co so that the line has only one intersection pointwith the parabola.

10.1. A triangle has one vertex at A(-1, 0) and the other two vertices at thepoints at which the two lines through A are tangent to the parabola y = x2.

Find the lengths of the three sides of the triangle.10.2. Given the parabola y = x2 + 3x + 1. Find the values of x for which

the tangent to this parabola has the slopes 1, 2, 3, , 10. Call these valuesx1i x2, x3, , x10. Find xk+1 - x1 for all k = 1, 2, 3, , 9.

10.3. Show by geometric inspection that y = x2 + bx + c assumes thesmallest value at that point at which the slope of the tangent line is zero.

10.4. Given two parabolas y = axe + bx + c and y =Ax2 + Bx + C. Showthat, if both parabolas intersect the x-axis in the same two points, then a = nA,b = nB, c = nC for some constant n.

10.5. Consider the two parabolas in Problem 10.4. Show that the one canbe transformed into the other one via a transformation of the type g = ay.Find a.

10.6. Consider the two parabolas in Problem 10.4. Show that the slope of thetangent line to the one parabola is at any given point a multiple of the slope ofthe tangent line to the other parabola.

11.1. For what values of x does the function y = V 1 - x2 have real values?(Use inequalities.)

11.2. Same as in Problem 11.1 for the function y = 'V'x2 - 1.

Page 84: ɷshawns sagan integral and differential calculus

72 CHAP. I. FUNCTIONS

11.3. For what values ofx is the function y = x(x - 1)(x + 2)(x + 3) positive?11.4. Represent the following regions geometrically:

(a) -1 <x<1,-1 <y<1 (b) -2<x-1 <2,y>0(c) x2 + y2 < 1 (d) x2 < 4, -1 < y < 3

12.1. Sketch the function y = Ix + 21.12.2. Sketch the function Iyl = Ixl12.3. Sketch the function y = [x - [x]].

12.4. Sketch the function y =x2

12.5. Sketch the function y = x2 for 0 < x < 1, f(x + 1) = f(x).13.1. Given f (x) = 4x2 - 1. Find a S£ so that I f (xl) - f (x2)I < s for all

Ix1 -x21 <6,in0 <x <2.13.2. Show that the function y = Ixl is uniformly continuous in the interval

-1 <x < 1.13.3. Show by inspection that the function

y=x for-1 <x<1

{-x+2for1 <x<3where f (x + 4) = f (x), has no discontinuities.

Page 85: ɷshawns sagan integral and differential calculus

CHAPTER II

AREAS

Sections 1 and 2 of this chapter deal with areas of rectangles and triangles.It is the purpose of these sections to establish the familiar formulas for thearea of a rectangle of dimensions a and b as the product ab and for the areaof a triangle of base c and height h as zch. The following principle isstressed throughout these sections and frequently is made use of in thesequel : Congruent regions have the same area, and i f one region R1 is entirelycontained in another region R2, then the area of R1 is smaller than the areaof R2.

1. AREA OF RECTANGLES

In this section we will enter a preliminary discussion of what will eventu-ally lead us to the concept of an area measure of planar regions which arebounded by closed curves.

However, before we start working toward this end, we have to clarifywhat such an area measure is supposed to accomplish. So suppose weapply a coat of (pink) paint to a certain planar region and measure thequantity of paint that is required to accomplish this task. Clearly, con-gruent regions will require the same amount of paint to be covered withone coat. But suppose we now paint two regions, one of which can beentirely submerged in the other one. Then it turns out that we need morepaint for the containing region than the one which can be submerged in it.In such a case we will say that the "area" (area in quotes shall stand for theintuitive concept of area) of the one region is larger than the other one,while in the first case, where the same amount of paint was required, wewill say that the "areas" are equal (congruent regions have the same area).Moreover, if we have two regions, one of which requires an amount p ofpaint and the other an amount q of paint, then it is intuitively clear that thecombined region will require an amount of p + q of paint.

So it appears that an area measure which is sensibly defined should beconsistent with the "paint measure" as we discussed it here. It seems that

73

Page 86: ɷshawns sagan integral and differential calculus

74 CHAP. II. AREAS

the system of paint measure is logically satisfactory although quite imprac-tical, since in order to make a statement about the sizes of two regions withrespect to each other, we would have to apply a coat of paint first to eachof them. This is certainly very impractical, even though this task could besomewhat simplified by introducing a specific test region and defining theamount of paint required forthis particular region as 1 ppu (pink-paint unit).

In addition to the properties of the paint measure which we discussedabove, let us state two properties which are quite trivial but neverthelessimportant for what is to follow. Any region will require a positive amountof paint and any degenerate region (either a point or a curve) will requireno paint at all.

In order to arrive at a satisfactory mathematical definition of an areameasure, we will proceed essentially along the same lines as is done inestablishing a length measure. We will try to establish a correlationbetween regions and positive numbers which has, in terms of our examplewith the paint, the following properties: Two regions for which the sameamount of paint is required shall correspond to the same number; if oneregion requires less paint than another, then the number corresponding tothe first region shall be less than the number corresponding to the secondregion, and finally if the number a corresponds to the region which requiresthe amount p of paint and the number b corresponds to the region whichrequires the amount q of paint, then the number a + b shall correspond tothe combined region that requires the amount p + q of paint.

In order to accomplish our goal, i.e., the establishment of a satisfactoryarea measure, we will first discuss a region with a very simple boundary,namely, a rectangular region (see Fig. II.1). A rectangle is uniquely deter-mined by its dimensions a and b, i.e., given the dimensions a and b, thenall the rectangles with these given dimensions are congruent.

Thus we rightfully expect that the area of a rectangle will also dependonly on a and b, and we write for the area

A[r(a, b)] . . . Area of rectangle with dimensions a and b.

b

a

Fig. 11.1

Page 87: ɷshawns sagan integral and differential calculus

11. 1. AREA OF RECTANGLES 75

Clearly, the size (i.e., the amount of ppu's required) will depend on thelength of a and the length of b. Suppose a is fixed; the larger b, the largerthe size of the rectangle. Likewise, let b be fixed, then the larger a, thelarger the size of the rectangle. So we see it is certainly sensible to postulatethat the area measure should be an increasing function of a, if b is fixed andan increasing function of b if a is fixed. The simplest increasing functionwhich we are acquainted with is the linear function. Therefore, we try thefollowing definition

(11. 1) A[r(a, b)] = f(b) a

(11.2) A[r(a, b)] = g(a) b

where the underline indicates that the dimension so designated is to beconsidered as fixed and where f (b) and g(a) are proportionality factors, thevalues of which depend on b and a, respectively. If (11.1) and (11.2) bothgive a measure for the same area, then the quantities on the right side haveto be equal: J '(b) a = g(a) b.

Let us divide this equation by the product ab. Then we obtain

f(b) = g(a)b a

The left side depends on b only, the right side depends on a only; hence,if this is to be a true equation, both sides have to be equal to the same con-stant C which is independent of a and b but can otherwise be chosen quitearbitrarily.

Thus we obtain fromf(b) = C, g(a) = C

b a

that

(11.3) f (b) = Cb and g(a) = Ca.

If we substitute these expressions into (11.1) and (11.2), we obtain in bothcases

(11.4) A[r(a, b)] = Cab.

The constant C is quite arbitrary and we can choose its value any way wewish. However, its value will be uniquely determined if we choose an areameasure unit. It is customary to choose a square of sidelength 1 unit (rec-tangle with a = 1, b = 1) as measure unit for areas and call it square unit.(If the measure unit for length is 1 inch, then the measure unit for area is

1 square inch (1 sq in. or 1 in2); if the measure unit for length is 1 cm,then the measure unit for area would be 1 sq cm (or 1 cm2), etc....)

Page 88: ɷshawns sagan integral and differential calculus

76 CHAP. H. AREAS

It now follows from (11.4) that1 = A[r(1, 1)] = C.1.1

and from this, C = 1.However, if we choose a square of sidelength 1 ft as unit for area measure,

and still measure length in inches, then we have

1 = A[r(a, b)] = C.12.12

and, consequently, C = x &4.So we see that C plays in a sense the role of a

conversion factor that converts square inches into square feet, etc....For reasons of simplicity, we will henceforth assume that C = 1, i.e.,

the measure unit for areas will always be a square of sidelength 1 unit.Let us now discuss whether this definition of area measure as introduced

by (11.4) has all the properties which we listed as desirable in the beginningof this section and which we can state now as follows-as far as rectanglesare concerned:

(1) A[r(a, b)] > 0 if a > 0, b > 0,(2) A[r(a, b)] = 0 if a = 0 or b = 0 or both,(3) A[r(a, b) & r(c, d)] = A[r(a, b)] + A[r(c, d)]

where & shall mean "combined with," if this combination yields again arectangle, and finally(4) If r(a, b) is contained in r(c, d), then A[r(a, b)] < A[r(c, d)].

To these conditions we may add the condition which determines the areameasure unit

(5) A[r(1, 1)] = 1.

Properties (1) and (2) are clearly satisfied by (11.4) since the product oftwo positive numbers is again positive and a product is zero if, and onlyif, at least one factor is zero.

Property (3) requires some discussion. First, we can see easily that thecombined region of the two rectangles r(a, b) and r(c, d) is a rectangle againonly if they have one side in common, say, b = c. Thus we can piece thetwo rectangles together and obtain a rectangle with the dimensions b, a + d(see Fig. 11.2) which has, according to (11.4) with C = 1, the area

A[r(b, a + d)] = b(a + d) = ab + bd.

Since A [r(a, b)] = ab and A [r(b, d)] = bd, we see thus that

(11.5) A[r(b, a + d)] = A[r(a, b) & r(b, d)] = A[r(a, b)] + A[r(b, d)].

Finally, let us check on property (4). That this condition is satisfied canbe checked directly but we will show here how this property can be deducedfrom (1) and (3).

Page 89: ɷshawns sagan integral and differential calculus

H.1. AREA OF RECTANGLES

b=c

Fig. 11.2

c

Fig. II.3

d

77

Suppose r(a, b) is contained in r(c, d) (see Fig. 11.3). Consider the rec-tangles with dimensions a, d and c - a, d. From (11.5),

A[r(c, d)] = A[r(a, d) & r(c - a, d)] = A[r(a, d)] + A[r(c - a, d)]where the area of r(a, d) is, again in view of (11.5),

A[r(a, d)] = A[r(a, b)] + A[r(a, d - b)].Thus we obtain

A[r(c, d)] = A[r(a, b)] + A[r(a, d - b)] + A[r(c - a, b)].

Since all the terms on the right side are positive in view of (1), it followsthat

A[r(c, d)] > A[r(a, b)].

Problems 11.1-11.7

1.1. Consider the rectangles r(a, b) and r(c, d). Let a < c, b <_ d or a < c,b < d and show directly that

A[r(a, b)] < A[r(c, d)].

11.2. Let the units for length measurement be inches and the units of areameasurement be square meters. Find the value of the conversion factor C.Same for meters and square yards.

11.3. Prove that A[2r(a, b)] = 2A[r(a, b)] where 2r(a, b) = r(a, b) & r(a, b).11.4. Prove that A[r(ca, b)] = cA[r(a, b)] and interpret geometrically.11.5. Prove that

A[r(ca, fib) & r(yc, 5d)] = afA[r(a, b)] + y6A[r(c, d)]

provided as = yc or fib = yc or fib = bd or as = bd.11.6. Given a rectangle with the side a fixed and the side b variable. The area

is a function f(b) of b. What is f(x.)?11.7. The side a of a rectangle with fixed dimension b is a function f (A) of

its area A. What isf(x)? 4

Page 90: ɷshawns sagan integral and differential calculus

78 CHAP. H. AREAS

.2. AREA OF TRIANGLES

A configuration that receives considerable attention in plane geometry,as well as in social life, is the triangle, i.e., a figure that consists of threestraight lines, each two of which meet in a point (vertex) (see Fig. 11.4).

In this section we will try to generalize the concept of an area measuresuch that it becomes applicable to triangles. For this purpose we first con-sider a rectangle with dimensions a and b, and join two opposite verticesby a straight line (diagonal) (see Fig. 11.5). We thus obtain two congruenttriangles. We remember that one of our demands of an area measure isthat two congruent figures must have the same area. Further, we requirethat the sum of the areas of two regions shall be equal to the area of thecombined region.

Thus we have to consider the areas of the two triangles in Fig. 11.5 asequal and assign to them a value such that the sum of the two areas is equalto the area of the rectangle with dimensions a and b. Since

A[r(a, b)] = ab,

we obtain for the area of either one of the two triangles

A(triangle) = lab.

Note that we really did not have any choice in defining the area of a tri-angle, if we want to be consistent.

The triangle considered above is a very special one inasmuch as two ofits sides are perpendicular to each other. We call such a triangle a righttriangle. We will now proceed to free ourselves from this restriction andconsider a general triangle which may or may not be a right triangle. InFig. 11.6 we have drawn such a triangle. We denote the vertices by capitalletters, starting with the lower left vertex and proceeding in the enumeration

b

Fig. 11.4

V_

a

Fig. 11.5

Page 91: ɷshawns sagan integral and differential calculus

11.2. AREA OF TRIANGLES 75

Fig. 11.6

counterclockwise. We denote each side by the small letter correspondingto the vertex opposite it. Now we draw a line through C parallel to c, anda line through B parallel to b. These two lines will meet at a point D. Thefigure (ABDC) is called a parallelogram because it consists of two pairs ofparallel lines. (Clearly, if the triangle we began with had had a right angleat A, then (ABDC) would have turned out to be a rectangle, which indeedis a special case of a parallelogram.)

We consider the line through C and D continued to the left and drawtwo lines through A and B which are perpendicular to c. The result is arectangle ABD'C' which has the area ch, if we use h to denote the length ofthe side AC' (or BD'). The two triangles ACC' and BDD' are congruent(right) triangles, and hence have the same area. In order to obtain fromthe parallelogram region (ABDC) the rectangular region (ABD'C'), wehave to add the triangular region (ACC') and take the triangular region(BDD') away, i.e., add and subtract at the same time regions of equal area.Thus it appears that the parallelogram (ABDC) has the same area as therectangle (ABD'C'), namely, ch. Therefore, it follows that the triangle(ABC), being obviously congruent to the triangle (BDC), has the area

(11.6) A(triangle) = J-ch.

We call c the base and h the height (altitude) of the triangle, forming themental image of the base lying flat and the two sides b, a towering over it.

Having arrived at an area measure for triangles, we can now generalizethis concept immediately to any region which is encompassed by a closedpolygon (iro2,v' = many, yovv' = angle), i.e., a boundary with many angles.What we really mean by this expression is a boundary that consists of manyline segments, as illustrated in Fig. 11.7. Such a closed polygon can besubdivided into a number of triangles (in more than one way-see Fig.11.8) and the area of every triangle can be computed by formula (11.6). Wedefine the area of the polygon to be the sum of the areas of the componentfigures, and then arrive at the area of the polygonal region by adding atthe areas of the component triangles. It can be shown that we will arrive

Page 92: ɷshawns sagan integral and differential calculus

80

Fig. 11.7

Fig. 11.8

CHAP. If. AREAS

at the same result in whatever way we subdivide the polygonal region intotriangles.

Problems 11.8-11.14

11.8. Find the area of the irregular pentagon (ABCDE) where the verticeshave the coordinates A(2, 0), B(2, 1), C(5, 1), D(2, 3), E(0, 2).

11.9. Find the area of the equilateral triangle of side length 1.11.10. Find the area of the regular hexagon of side length 1.II.11. Find the area of the irregular octagon with the vertices A(l, 0), B(l, 2),

C(4, 2), D(5, 4), E(4, 5), F(1, 5), G(l, 4), H(0, 3), by partitioning it into trianglesin at least two different ways.

11.12. Find the area of a trapezoid. (A trapezoid is a quadrangle, two sidesof which are parallel to each other. Let the dimensions of these two sides bea and b, and denote their distance apart by h.)

11.13. Given a triangle with fixed base b and variable height h. The area is a

function f(h) of h. What is f(x), f(t2), f

11.14. Given a triangle of fixed height h and variable base b. b is a functionf (A) of the area. What is f (.x) ?

3. SERIES AND SEQUENCES

Let us consider a geometrical configuration which consists of a square ofsidelength I with an isosceles right triangle of sidelength 1 surmounted on

Page 93: ɷshawns sagan integral and differential calculus

11.3. SERIES AND SEQUENCES 81

it, as indicated in Fig. 11.9. We proceed to inscribe a sequence of squaresin this triangle, also shown in Fig. 11.9. The sidelength of each square ishalf the sidelength of the preceding square. We can see that no matter howmany squares we inscribe, there is still an isosceles triangle left in which wecan continue the process. In other words, there is no last square or, as wemay say, there are infinitely many squares which can thus be inscribed inthe triangle under consideration-or, for that matter, in any triangle. Letus denote the area of the basic square of sidelength 1 by ao, the area of thenext smaller square by a1, the area of the next one by a2, etc....

We have infinitely many squares at our disposal. We can never drawthem all, but we can draw as many as we please. If we combine any numberof such consecutive squares, we obtain a region which is encompassed by apolygon. The area of this region is equal to the sum of the areas of itssquare components. Thus, the area of the region which is obtained by

AA

Fig. 11.9

Page 94: ɷshawns sagan integral and differential calculus

82 CHAP. H. AREAS

a combination of the squares with areas ao, a1, a2, , an is given by

ao+a1+a2+...+a,

This sum can be found whether we consider 5 squares (n = 5) or 100million squares (n = 100000000). (In the latter case, it may take a littlelonger.)

We are now about to take a very bold step. Let us pose the question:Is it reasonable to assign an area measure to the region consisting of thetotality of all squares with the areas ao a1, a2, ? We remember thatwe demanded in Section 1 of this chapter that the area measure be suchthat if a region is contained in another region, then its area measure shallbe smaller than the area measure of the region in which it is contained.Clearly, the region consisting of as many consecutive squares as we careto consider is contained in the region bounded by the bold polygon in Fig.11.9. If we keep inscribing in this region additional squares, we will neverleave the region, no matter how far we go, as is quite obvious from the con-struction process.

Thus, if we consider the totality of all the squares as a region at all, it willcertainly be sensible to assign to it an area measure which is smaller thanthe area measure of the circumscribed region, which in turn is 1. Thismeans that

ao+a1+a2+... <2

where the three dots on the left of this inequality indicate that this sum-mation is to be continued ad infinitum.

Admitting that a region may consist of infinitely many portions, theareas of which add up to the area of the total region, really involves ageneralization of our concept of area measure as follows:

If a region R consists of infinitely many regions Ro, R1, R2, which donot overlap, and if there exists a region R which has a finite area measuresuch that it contains all the infinitely many non-overlapping regions Rk(k = 0, 1, 2, 3, - - -), then the area measure A(R) of the region R shall bedefined as follows:

A(R) = A(Ro) + A(RI) + A(R2) + A(R3) + .. .

where the A(Rk) (k = 0, 1, 2, 3, - - -) are the area measures of the regionsRk.

We call this property the total additivity of the area measure.

Page 95: ɷshawns sagan integral and differential calculus

11.3. SERIES AND SEQUENCES 83

Now let us return to the evaluation of our sum a0 + a1 + a2 + - -

Since it is always awkward to write out a number of terms in order toindicate a summation process, we introduce an abbreviating notation. Theword sum starts with an S, so we could write Sak for ao + a1 + a2 +and indicate somehow that the summation is to be extended from k = 0 toco. This is not the way it is done, however, since too many people read theroman script and mathematicians do not want just anybody to read whatthey produce. Therefore, we use the Greek equivalent of S, namely E (read:sigma) to indicate a summation as follows :

The finite sum s, = ao + a1 + a2 + - + a is denoted as follows

a0+a1+a2+...+ akk=0

and if the summation is to be extended to infinity as encountered above, weindicate this by writing

ao+a1+a2+...= z akk=0

We call k the summation subscript and note that it is in a way a dummy,because it can be replaced by any other letter without changing the sum atall-provided it is replaced in all instances in which it occurs. So, instead

00 00

00

of 57 ak, we could also write I a1 or I a; and it would still represent thek=0 i=0 j=0

same sum. The numbers below and above the E symbol we call the sum-mation limits for obvious reasons.

In this new notation, our problem is to evaluate the sum

CO

I akk=0

Inspection of Fig. 11.9 shows that the region which is bounded by the boldpolygon consists of the region which is made up of all the squares togetherwith the shaded region which, in turn, consists of infinitely many isoscelesright triangles. To this shaded region we can also assign an area measureand justify this by the same argument we employed in assigning an areameasure to the region consisting of infinitely many squares. If b1, b2,

b3, - - are the areas of the triangles, then

k=1

Page 96: ɷshawns sagan integral and differential calculus

84 CHAP. II. AREAS

will be the area measure of the shaded region and, if our generalization ofthe area measure to infinitely many regions is senseful at all, the result hasto be consistent with all the basic properties of the area measure-inparticular, with the property that the area of a region which is obtained bythe combination of two nonoverlapping regions is equal to the sum of theareas of the two regions. Since the area of the polygonal region in Fig. 11.9

is, as we previously remarked, a we have

(11.7)

We observe that

00

001 ak+ I bk=3k=0 k=1 2

bk= lakfor all k= 1,2,3, .

Hence it appears that

00

0c) 00-21I bk = L. jak = 1 akk=1 k=1 k=1

and we can write (11.7) as

and, consequently,

(11.8)

°°

00

3

I aa0+ I a/ + kk=1

=-k=1 2

3 °° 3a0 +2 I1ak=2.

Since ao is the area of the square with side length 1, we have ao = 1 and weobtain from (11.8)

CO1

ak =k=1 3

and therefore°°

co

4(II.9) a0 + - I1 ak = Io ak = 3

This result can just as well be obtained without reference to geometricinspection and area measure. We note that the side lengths of the squares

1 1 1

which are involved in our process are 1,2

,4

,8

, . The sidelength of the1

(n + 1)th square is clearly2

n (note that the side length of the first square is

Page 97: ɷshawns sagan integral and differential calculus

11.3. SERIES AND SEQUENCES

obtained for n = 0). Hence, we obtain for the areas

ao = 1

1a1=4

1 1

a2 =16 42

a364 43

1 1a4 = =

256 44

1 1_an

__(2n)2 - 4n

85

Thus our problem boils down to the purely arithmetical problem of evalu-ating the sum

llk

(11.10) 1+4+42+43+44+...=k=o

m

1 4/Let us evaluate the finite sums sn for several values of n. We have

so=a0= 1s1=ao+at=4=1.25s2=ao+a1+a2=1.3125s3=ao+a1+a2+ a3 = 1.328 ...s4 = ao+a1 +a2+a3+a4= 1.332...

We observe that this sequence of values so, s1, s2, s3, comes closer to thevalue s = 1.3333 of the sum, the larger n is. We call these sums sn the

partial sums of 7 ak which, in turn, is called an infinite series (or simplyk=0

series). Specifically, we call sn the nth partial sum, and it appears quite

obvious that the value of the series i ak, provided such a value exists, isk=0

Page 98: ɷshawns sagan integral and differential calculus

86 CHAP. II. AREAS

the value which is approached by the sequence of partial sums as n growsbeyond bounds or, as we say, as n tends to infinity.*

We call the value that is approached by the elements of a sequences,,, as ntends to infinity, the limit of the sequence and employ the followingnotation

lim Sn = S.n-+oo

As we observed earlier, this limit is the value of the series of which the sk'sare partial sums:

00

(II.11) 'Y ak = lim sn=0 n- 00

whereThus far we really have not established that lim sn = 3. All we have

done is evaluate a few partial sums and observe the trend of these partialsums towards 3. To find this limit in a more systematic manner, let usconsider the more general problem of adding up the infinite series

00I qk

k=0

which is clearly a generalization of the case we just considered where q =.In evaluating the partial sums, we can see that, provided q 0 1, we have

the following:

S0 = 11-q 2-q)

(1=

s1=1 (1 -q) 1 -q

s2=1+q+q 2= 1-q +q2= 1-qq q

s3=1+q+g2+g3= 1-q +q3 qq

-q_ 4 5

s4= 1 +q +q2+g3+g4= -q +q4= -qq - q4

etc.... The reader can already see how the pattern develops and make theintelligent conjecture that

1 - qn+

(11.12) sn = 1 + q + q2 + .. + qn = =L1 .1-q ) q

* Customarily, the value of an infinite series is defined as the value that is approachedby the partial sums as n tends to infinity.

Page 99: ɷshawns sagan integral and differential calculus

11.3. SERIES AND SEQUENCES

In order to verify this relation, let us multiply both sides by (1 - q):

(1 - q)(1 + q + q2 + q3 + ... + q71-1 + qn) = 1 - qn+l.

If we carry out the multiplication on the left side, we obtain

1 +q+g2+q3+...+qn

-q-q2-q3-...-qn-qn+l= 1 -qn+1

87

and we see that (11.12) is indeed correct.Thus we have now a closed representation of the nth partial sum of the

series q1, which is called the geometric series, namely00

1 -q n+1Sn= 1 -q

and we can now undertake to investigate its limit. If we consider

n+1lim sn = lim 1

q

n- oo n- co 1 - q

we see that the only term on the right which depends on n at all is qn+1 Soit appears reasonable to expect that once we know the limit of qn+l as ntends to infinity, we will also know the limit of sn.

Let us examine increasing powers of q for several values of q. Letq = 1:then we obtain for the first, second, third, fourth, and fifth powers of q:

0.25, 0.0625, 0.015625, 0.00390625, 0.00097656,

Let q = J: then we obtain for the first, second, , fifth powers of q:

Let q = i-9o : then we obtain for q1, q2, ... , qs:

0.9, 0.81, 0.729, 0.6561, 0.59049,

For q = 2, we obtain2, 4, 8, 16,

For q = 1.1, we have

1.1,1.21,1.331,1.4641,1.61051,

and finally, let us consider q = 1. Then we obtain

1, 1, 1, 1, 1,

Page 100: ɷshawns sagan integral and differential calculus

88 CHAP. II. AREAS

We can see quite clearly that in the first three cases with q = 4, 3, A- thepowers of q decrease as the exponent increases. In the two cases whereq = 2 and q = 1.1 we see that the powers ofq increase steadily as n increasesand, in the last case, we see that all powers of q have the same value 1. Wecan also see that, if q is very small, the powers of q decrease very rapidly(see the case with q = 4). If q is close to 1 but still less than 1, the powersof q still decrease but not so rapidly anymore. q = I appears to be theturning point, i.e., for all q which are less than 1, the sequence q" decreasesand for all values of q which are larger than 1, the sequence of the powers ofq increases.

If q is some positive number which is less than 1, then we can write it as1

r, where r is some number that is larger than 1, and we have

Since r is larger than 1, r" will increase beyond bound as n tends to infinityand we obtain

(11.13) lim q" = lim 1 = 0 for 0 < q < 1 (r > 1).n-m n-+oo r"

As the reader can easily convince himself, this relation holds true also if qis a number between -1 and 0.

In view of (11.13) we can see that

1 -(11.14) lira s" = lim

qn+l

=1

n- 00 n-00 1 - q 1 -qfor all q which are in the interval -1 < q < 1 or, as we may state, for allq for which Iql < 1.Thus, in view of (11.11)

00

(II.15) 57 qk = 1 for IRI < 1.k=o 1 - q

It is quite clear from the above discussion that the series in (11. 15) is quitesenseless forlgj > 1.

Now, if we substitute q = in (11.15), we see that

= o \4/k 3

confirming the value we arrived at earlier by geometric inspection.

Page 101: ɷshawns sagan integral and differential calculus

11.3. SERIES AND SEQUENCES 89

Let us summarize the content of this section:We considered sums with infinitely many terms

0057 ak= a0+a1+a2+a3+...k=0

which are called infinite series. If such a sum has a value at all, then itsvalue is given by the limit

CO

I ak = 11m Snk=0 n- CO

of its partial sum sn = ao + a1 + a2 + + an.In particular, we considered the case where ak = qk for Iqi < 1. This

particular series is called the geometric series. The limit of its partial sums

Sn =1-qn+1

1-qturned out to be 1 1 for all Iqi < 1. Thus we could conclude that

q00

I qk =1 for lql < 1.

k=0 1 -qProblems 11.15-11.24

11.15. Write the following sums using the sigma notation:

(a)

(b) 123 +s6

(c) 0.1 + 0.01 + 0.001 + + 0.000000000000000000001/(d) 1 - q + q2 - q3 + ... + (-1)7'q'

(e) 1+ 4 + A + 116 + + 316 + ..

+ 7 - 84 +...(f) 1 - 1x2 x3 x4

(h) 1 - x2 + x4 _ x6 + x811.16. Evaluate the following sums :

s 4 1

(a) I k(b)k=1 k=1

5 3

(c) k2 (d) 1 (kg - k)k=1 k1

w cp 2k-1(e) 1 (0.9)k (f)

k=03k

k=0

Page 102: ɷshawns sagan integral and differential calculus

90 CHAP. 11. AREAS

11.17. Find the limits of the following sequences:(a) sn n + 1 (b) sn = n2 2

1

n 2n2

n(n1)

(d) sn_ n(n - l)(n - 1)

(C) . =2n2

11.18. What is the limit of the sequence 0.3, 0.33, 0.333, 0.3333, ?

11.19. Evaluate the sum in Problem II.15(g) for x = 0.1, using the first fourterms only. What do you think is an estimate of the error which you committedby chopping off the series after four terms?

11.20. Given(-1)nn

sn 2n+3Find so, s1i s2, sy, and s,. What is the limit as n tends to infinity ?

n 21

11.21. Show that I Cat; = C at.ko k=o

n L n11.22. Show that (ak + bk) _ ak + bk.

0 k0 k0n n-1

11.23. Find 1 and 1.k=1 k.1

II.24. Write

1 1 x4

as an infinite series.

4. MANIPULATIONS WITH LIMITS

IxI < 1

Let us evaluate the following limit

limn2 + 2n

n-oo 3n2 - 4

First, we divide numerator and denominator of the quotient under thelimit sign by n2, thus leaving the value of the fraction unchanged but makingit more easily accessible to our investigation:

21+2

limn + 2n = lim nn-+co 3n2 - 4 n-+m 3 4

n2

Now we can see quite easily that the expression2

in the numerator andn

the expression4

in the denominator approach zero as n tends to infinityn

Page 103: ɷshawns sagan integral and differential calculus

11.4. LIMITS

and it appears that the entire fraction approaches the values :

limn-+cc

91

If we write down symbolically what we have actually done in evaluatingthis limit, we obtain

limn- co

1 + 2lim (1 + 2) lim 1 + lim 2

n _ lim n _ n-_oo n-oc n

3 - 4 lim (3 - 4) lim 3 - lim 4n n n-oo n-+oo n

We evaluated each one of the four limits by itself and then put the resultstogether again in the indicated order. But is such a procedure permissible?This question, fortunately, can be answered in the affirmative under certainslight restrictions. Since we introduced the concept of a limit from a purelyintuitive standpoint, we are not able to analyze this problem in any detail,but it will certainly appear quite reasonable and "intuitively clear" to thereader that the following rules, which govern the manipulation with limits,hold true.

The limit of a sum of two sequences is equal to the sum of the limits ofthe two sequences, provided that both limits exist. If lim a.n = A andlim bn = B, then

n-CO

(11.16) lim (a,,, + bn) = A + B.n- CO

This is the rule which we actually applied in our problem to evaluate thelimit of the numerator. Let us consider still another example.

Let a,, = n + I and bn =1

. Clearly, lim an = limn + I= 1 and

n n n-.co n-.co n

lim b., = Iim 1 = 0. Hence we obtain in view of (II.16)n-.oo n-co

n- co n- oo n n

n

lim(an+bn)=lim(n+1 +1) =1+0.

This can be checked directly as follows:

b -n+l+1 - n+2an + n -

n n n

and hence, limn + 2 = 1.n-.oo n

Page 104: ɷshawns sagan integral and differential calculus

92 CHAP. II. AREAS

A rule which is analogous to (11. 16) holds for the limit of the differenceof two sequences. The limit of the difference offtHwo sequences is equal to thedifference of the limits of the two sequences, provided that both limits exist.If lim an = A and Jim bn = B, then

n-.co n-ao

(11.17) lim (an - bn) = A - B.n- ao

The reader will notice that this rule was used to evaluate the limit of thedenominator in our problem. So let us work another problem which willdemonstrate that the condition that both limits have to exist is quite essen-tial. Suppose we have

a.n=n+1and bn=n.Then neither lim an, nor lim bn exist, i.e., lira an = lira (n + 1) = o0

n- CO n-.oo 'n-CO n-COand lim b.n = lim n = oo. If we would nevertheless apply our rule, we

n-+c0 11-o0

would obtainlim (an - bn) = oo - 00

%_00

which is quite meaningless. However, we see that an - b.n = n + 1 - n =I and we have consequently

lim (an - bn) = 1.n- 00

Now we have to discuss two more rules which deal with the limits ofproducts and quotients.

The limit of'a product is equal to the product of the limits, provided thatboth limits exist. If lim a = A and lim bn = B, then

n-oo n-oo

(11.18) lim (anbn) = AB.n- 00

As an example, let us consider_ n b+3

an n+n n

Then urn an = 11im n

+1 = 1 and lira b.n = lim n + 3

= 1 and accord-)I-cc n- cc n

ing to (If.18)

1 n n+3b

= 1n-oo n

which can also be checked directly.Rules (11. 16) and (I1.18) actually hold for a sum of any number of terms

or a product of any number of factors.

im an .n =1im

Page 105: ɷshawns sagan integral and differential calculus

II.4. LIMITS 93

Finally, we have: The limit of'a quotient is equal to the quotient of thelimits, provided that both limits exist, and the elements of the sequence in thedenominator are different from zero and tend to a limit which is different fromzero. If lim an = A and lim bn = B 0- 0, where bn 0 for all n, then

n-.03 n- oD

(II.19) lim an = `4n-oo bn B

This rule was finally used in our example above to express the limit as thequotient of the limits.

We wish to point out to the reader that we cheated a little in the examplesthat served to illustrate our rules. Specifically, we assumed without furtherdiscussion that

limn + 1 = 1, lim n = 1 and limn + 3 = 1.n n n

Let us apply the appropriate rules toward the evaluation of these limitsafter dividing numerator and denominator by n.

In the first case we have

n + 1lira =

lim (1+1 )

nn ,-.co 1 lim 1 n- 00 n-+acn_00

We see that the whole problem boils down to the evaluation of the limit

of the sequence 1 as n tends to infinity. (The same thing occurs in the twon

other cases, as the reader can easily see for himself.)If we had based the concept of a limit on a rigorous mathematical foun-

dation, we would have had no difficulty in proving that this limit is zero.We introduced the limit from a purely intuitive standpoint however, andthus have to accept at faith that

(11.20) lim 1 = 0n-+oo n

which should be plausible and will certainly not strain the imagination. Itis not possible, no matter how hard we may try, to obtain this result by amanipulation of the four rules which we have stated in the preceding dis-

cussion.Our results enable us to establish a simple recipe for the evaluation of

limits of quotients with polynomials of n in numerator and denominator(for definition of polynomial, see Appendix 1). Consider, for example,

n4-3n2+4n+7limn-oo 2n4 + 4n2 + 3n - 10037

1+1lim

n = = lim 1 + lim 1

Page 106: ɷshawns sagan integral and differential calculus

94 CHAP. II. AREAS

Let us divide numerator and denominator by n4 to obtain

1 -3 + 4 + 7

limn-+oo

n2 no n4

4 +3 100372+n2n3- n4

We see that all terms in numerator and denominator tend to zero as n tendsto infinity except for the coefficients of n4, and we therefore have the limit

i.e., the ratio of the coefficients of n4.It is now easy to see that this rule applies in general : If we have a quotient

of two polynomials in n of the same degree, say, k, then the limit is equal tothe ratio of the coefficients of nk. The validity of this rule can even beextended to the case where the degree of the numerator polynomial isdifferent from the degree of the denominator polynomial, as we will see inthe following two examples.

First we considerlim

n2 + 3n - 1n-aon3+4n2+n

The coefficient of n3 in the numerator is 0 and the coefficient of n3 in thedenominator is 1. Hence, if our rule applies, the limit will be i = 0 whichis indeed the case as this direct check will show:

1 3 1

n3+4n2+nlim - lim - = 0.n2 + 3n - I n n2 n3 0

Finally, we consider

noo 4 1 1+ +n n-

limn4-3n2+4n

n n3+1The coefficient of n4 in the numerator is 1, in the denominator it is 0. Hence,by our recipe, the limit would be o, i.e., does not exist, which is indeed thecase as a direct check shows.

In the preceding discussion we accepted on faith the rather obvious fact

that 1 tends to zero as n tends to infinity. It is equally obvious that

n has to tend to infinity as 1 tends to zero. Thus we can certainly writen

the limit which we considered in the beginning of this section, namely,

limn2 + 2n

n_oo 3n2 -4

Page 107: ɷshawns sagan integral and differential calculus

II.4. LIMITS

which turned out to be equivalent to

in the form

lira

n~0 3 - 4(12

n)

95

Now, could we not avoid this awkward notation by writing x instead of

with the understanding that x has to approach zero, namely,n

lim1 + 2x

x~0 3 - 4x2

The answer is yes, although there are some reservations to it. If we look atthis limit in terms of x, conveniently forgetting its past history, then thequestion arises as to how x is to approach zero. It could approach zero as1 1 1 ± n3-n

as n - oo, or asn2- as n - co, or as

4n - 3n2 +as n -+ oo, or,

10n4

as a matter of fact, in any other manner. Now, if the result we obtain is thesame, no matter how x approaches zero, there is no problem. Fortunately,this is the case in our example and in all other examples that are likely toarise in this treatment.

However, there are cases where the manner in which x approaches zerodoes make a difference as far as the limit is concerned. Take, for example,the pathological function

f (x) -1 for all rational values of x0 for all irrational values of x

and considerlim f (x).

If we let x approach zero as 1 where n co, and since it follows fromn

the definition of this function that f (1) = 1 for all integers n, we obtainn

1 as the limit. On the other hand, if we let x approach zero as wheren

n -* oo, then f 0 because-Vi

is irrational for all integers n andn n

Page 108: ɷshawns sagan integral and differential calculus

96 CHAP. II. AREAS

we obtain 0 as the limit. In such a case we say that the limit does not existbecause the value obtained clearly depends on the selected sequencethrough which we let x go to zero.

In general, we say that

lim f (x)s-O

for some given function f'(x) exists if, and only if,

Jim f (X")n- CO

has the same value, no matter what sequence x1, x2, x3, with lim xn = 0n- o0

we select. This criterion is, of course, anything but practical because it ishumanly impossible to check all possible sequences that tend to zero.Although, there is a more practical alternative, we need not go into thismatter any further because all the limits of this nature that we will encounterin this treatment exist, i.e., are independent of the sequence selected and

can thus be evaluated by letting x run through the sequenceI with n -> oo.n

This means that the rules which we have discussed in the first portion ofthis section also apply to limits of the type ]im f (x), as far we will encounterthem in this treatment. Z-0

We warn the reader that the argument which we put forth here is by nomeans a justification of this procedure, but it will suffice for our purpose.

Problems 11.25-11.30

11.25. Evaluate the following limits:

2n2+1(a) ,11,E

8642 - n2 (b) n-,n12n3

(c) JIM9n5 - 6n + 1 l 1

(d) lim n3 - 3n2 + 4n + 125

4,L-00 11 + 1000000

(e) JIMn3 + 4

n-.oo n(n2 1)

n-. o0

(f) limn-.CO

4n - 3n

n -4

11.26. Which of the following limits exists?

(a) Jimn

(b)

4n2+1

Jim (n2 + 1)V'n

n-co n3 + 4n + 3

Jimn5 + 4n2 - 1

n_00 Vn(n4 + 3)

n-.Dons-3n2+4n+1n4 + 4n

JIM(e) n-.,, n3 - 12 (d)

Page 109: ɷshawns sagan integral and differential calculus

11.5. AREA OF A CIRCLE

11.27. Evaluate :

(a)

(c)

lirax2 +4x

x_0 3x2 - I2x4 + 3

lim 4x-o x - 1

97

(b) lim x3 + 2x + 12

(d)

x-.o x2 - 6

lim 2x2 + 4x + 4x->p (x + 1)2

11.28. Find the following limits, provided they exist, using rule (11.17) wher-ever it is applicable:

(a) Jimn _ n2 + 3 1 4n

) (b) lim--

2 + 1n-0 + 1 2n n_>oo \n n + 122 1)n - n

(c) lim d l+

2( imoon 4+n-00)

21-00\n - 2

- 13n2\ )H.29. Show by rule (11.18) that

lira Can = C lira ann-oo n-.m

provided that lira an exists.n-oo

11.30. Suppose that lim an = A and lim b,, = B ; 0. Show that if Jim an =71-CO n-00 n-,,, b,,

1, then A = B. (Hint: Assume that A r B and show that this leads to acontradiction. It then follows that it is not possible that A B; hence, thecontrary has to be true, i.e., A = B.)

5. THE AREA OF A CIRCLE*

We proceed now to define the area of a circle. All we have at our dis-posal is the definition of the area of a rectangle and the consequent formulasfor areas of regions which are bounded by straight line segments (polygons).Clearly, we cannot give an arbitrary definition of the area of a circle becausethat might have the consequence that we will get stuck with two incon-sistent concepts of area. In order to arrive at a satisfactory definition ofthe area of a circle we proceed according to a method which was firstdeveloped by Archimedes of Syracuse (287-212 B.C.), the greatest mathe-matician of ancient times. His method is really the basis of what we callnowadays the Integral Calculus.

The basic idea of Archimedes' method is a generalization of postulate(4), p. 76, for the area measure which, generalized to closed (not necessarilyrectangular) regions, says that whenever one region is entirely containedin another region, the area-measure of the enclosed region is smaller thanthe area measure of the enclosing region.

On the basis of this idea we assign to the circle an area measure that islarger than the area measure of all possible inscribed polygons and smaller

* The reader who is not familiar with trigonometry is referred to Appendix III whichhe should study at this time before continuing with the regular text.

Page 110: ɷshawns sagan integral and differential calculus

98 CHAP. II. AREAS

than the area measure of all possible circumscribed polygons. In practice,we obtain increasingly better "approximations" to what ultimately will becalled the area of the circle, if we consider a sequence of inscribed regularpolygons with an increasing number of vertices and a sequence of circum-scribed regular polygons with an increasing number of vertices.

Suppose we have a circle of radius r (see Fig. II.10) and consider an in-scribed and a circumscribed square. Let A4 be the area of the circumscribedsquare and a4 the area of the inscribed square. If we denote the area of thecircle (i.e., the number which is larger than the area of all inscribed polygonsand smaller than the area of all circumscribed polygons, if such a numberexists at all) by A, then the inequality

a4<A<A4holds.

If, instead of squares, we consider now an inscribed regular octagon anda circumscribed regular octagon (see dotted lines in Fig. 11.10) with theareas as and A8, respectively, we see by inspection that A8 < A4 and a8 > a4.At the same time A8 > A and a8 < A. Hence, it follows that

a4<a8<A<A8<A4.Thus, it appears that a8 and A8 are better approximations to A than A4 anda4. The reader can already see from this short account that it seems that acontinuation of this process will yield increasingly better approximationsas the number of vertices of the polygons becomes larger. At the same timeit is clear that there is no end to this process, i.e., no matter how manyvertices a polygon has, there is always one with more vertices that will yielda still better approximation. The following question, however, remains

Fig. U.10

Page 111: ɷshawns sagan integral and differential calculus

11.5. AREA OF A CIRCLE 99

Fig. U.11

unsettled : does the difference between the areas of two correspondingpolygons-the one inscribed and the other one circumscribed-becomeless than any finite positive number, no matter how small, or does it not asn becomes large? We will see at the end of this section that this differencedoes, indeed, tend to zero.

In order to carry out the indicated process numerically, we have first toderive formulas for the areas of the inscribed and circumscribed polygons.These areas can be expressed in terms of the radius of the circle and thenumber of vertices. First, we note that every regular polygon with nvertices can be subdivided into n congruent isosceles triangles (see Fig.11.11).

For the following discussion we refer to Fig. 11.12. The triangle OAB ispart of an inscribed regular polygon. Its area a is given by

rha =-.2

Clearly,

h = sin a, h = r sin a.r

Hence

(11.21) a = 2r2 sin a

where a is the angle between the two equal sides measured in degrees.The triangle OCD is part of a circumscribed regular polygon. Its area

A. is given byA _ rb

2

Page 112: ɷshawns sagan integral and differential calculus

100 CHAP. II. AREAS

Fig. 11.12

Now,

Hence

(11.22)

b = tan b = 2r tan fl.2r

A,: = r2 tan 9

where j9 is half the angle between the two equal sides.Suppose we now consider regular polygons with n vertices. Then the

angle a at the center of each isosceles triangular component of the inscribedregular polygon is given by

360a=-n

and the angle 9, which is half the angle at the center of each triangularcomponent of the circumscribed polygon, is given by

180

n

Thus we obtain for the area a,, of the inscribed regular polygon with nvertices, according to (11.21)

(II.23) an=nr2

sin360-

n

and for the area A,, of the circumscribed regular polygon with n vertices,according to (11.22),

(11.24) An = nr- tan 180

n

Page 113: ɷshawns sagan integral and differential calculus

11.5. AREA OF A CIRCLE

Since

101

an<A<A.,we arrive at the important inequality

(11.25) 2r2n sin360

< A < r2n tan 180

n n

Let us now evaluate this inequality for various values of n, using five-place tables for the values of the trigonometric functions. *Letting n = 4,

2r2 < A < 4r2Letting n = 16,

3.061442 < A < 3.18252r2

Letting n = 32,3.12144r2 < A < 3.15168r2.

We observe that the lower bound and the upper bound agree with eachother in the first decimal place. Hence, the area of a circle of radius r,accurate to one decimal place, is given by 3.1r2. Now, let n = 128; then

3.14112r2 < A < 3.1424r2.

We see that the approximations agree in the first two decimal places.Hence, the area accurate to two decimal places is given by 3.14r2.

There is no use going any further if we use five-place tables, because thefifth place is already inaccurate (rounded off) and becomes, after multipli-cation by a number greater than 100, the third decimal place. Indeed, if wewould evaluate (11.25) for n = 256, using five-place tables, we would notobtain a better result than the one we obtained already for n = 128.

We observe that inasmuch as the area of the circle is smaller than thearea of the circumscribed polygon and larger than the area of the inscribedpolygon, we get an approximation to the area of the circle which is betterthan the one given by either upper or lower bound, if we take the arith-metical mean of upper and lower bound. We obtain

for n = 4: A - 3r2,forn = 16: A-3.12198r2,for n = 32: A = 3.13656r2,

for n = 128: A - 3.14176r2.However, we can not tell directly from these results, how much betterapproximations they really are.

* Standard Mathematical Tables, CRC, 12th ed. p. 93. Observe that these values ofthe trigonometric functions which we use can be found without knowledge of the lengthof the unit circle (see Appendix III, Section 4).

Page 114: ɷshawns sagan integral and differential calculus

102 CHAP. If. AREAS

Now let us shortly recapitulate what we have done: We squeezed thearea of the circle in between an upper and a lower bound. We still do notknow what the area of the circle is. All we can tell is that it is larger thanthe lower bound and smaller than the upper bound. It is intuitively clearthat upper and lower bound come closer to each other as n becomes large.However, all we can say for sure is that the lower bound never exceeds acertain finite number because the inscribed polygons have an area which issmaller than the area of any circumscribed polygon and the area of thecircumscribed polygons is always greater than a certain finite positivenumber which is different from zero, because it is always greater than thearea of any inscribed polygon which is always greater than zero for n >_ 3.

In order to show that upper and lower bound both approach the samenumber as n becomes large, i.e., the gap between them approaches zero,we have to show that the quotient of upper and lower bound approaches1. This means, obviously, that numerator and denominator both approachthe same limit* (see Problem 11.30). Now,

fr2n sin360

sin360

Since

Q = an =

(see Appendix III, formula (AIII, 13), we have

sin360 = 2 sin 360

cos360 = 2 sin 180

cos180- - - - -

n 2n 2n n

and, consequently,

Since

n _I n

n

. 180

180 sin ntan

n 180cos -

n

n

* This argument is valid if the numerator sequence as well as the denominatorsequence have a limit and the limit of the denominator sequence is different from zero.This is here the case. The numerator sequence is an increasing sequence which isbounded above and the denominator sequence is a decreasing sequence which is boundedbelow (away from zero). According to a famous theorem of Bolzano, such sequenceshave limits and the limit in the latter case is different from zero (see Problem 11.35).

2 sin 180 cos180

Q. = 2n 180 ntan -

A,r2n tan

180 2tan

180

n n

sin 2a = 2 sin a cos a

Page 115: ɷshawns sagan integral and differential calculus

II.5. AREA OF A CIRCLE

we obtain

Q. =sin 180 cos2180

n n = cos2180- .sin 180 n

nNow, as n -> oo we see that

Jim Qn = Jim cost 180 .

n- oo n_oo nBecause

it appears that

and we have

lim180 = 0,

n-oo n

lim cos 180 = cos 0 = 1n-0 oo n

lira Qn = lira cos 180limcos 180 = 1.

n-oo 111-+00 12 n-co n

103

Thus we see that An and an both approach the same limit and this limitis some finite number because an is finite for all n and An is never 0.

With the notation

(11.26) lim n sin 360 =7rn-0co 2 n

we can write

(11.27) A = r2ir

for the area of a circle with radius r, where 7r = 3.14 .

Problems 11.31-11.35

11.31. Find upper and lower bounds for the area of a circle with radius 1(unit circle), using regular polygons with 6, 12, 24, 96 vertices. Use five-placetables.

11.32. Find sin a for the following values of oc:

45°,450 4504 ' 8

(Use half-angle formulas, Appendix III, Section 4.)11.33. Same as in Problem 11.32 for tan a.

11.34. Evaluate the ratio A where an is the area of a regular inscribed polygonAn

and An is the area of a regular circumscribed polygon with n vertices for n = 4,n = 16, n = 32, n = 64, n = 96, n = 128. (Note that the radius of the circleis immaterial for this computation.)

Page 116: ɷshawns sagan integral and differential calculus

104 CHAP. II. AREAS

11.35. Suppose you have infinitely many numbers and they are arranged sothat ao < al < a2 < < an < . Suppose you know that an < A for all n.Illustrate with examples that the sequence an approaches a limit which is lessthan or equal to A.

6. CIRCUMFERENCE OF A CIRCLE

We will now employ the Archimedean method which we developed inthe preceding section to find a suitable definition for the length of the cir-cumference of a circle. As the concept of area in the preceding section wasbased on the concept of area of a polygonal region, the concept of lengthwill be based on the concept of length of a line segment and, in turn, thelength of a polygon.

Again we will consider inscribed polygons with the idea in mind that thelarger the number of vertices, the closer the length of the polygon will beto what we will ultimately call the length of the circumference of the circle.

Let us refer to Fig. I1.13 for the following developments. We see that

b . a-=sin -.2r 2

Hence,

b = 2r sina-2

and, consequently, the length In of an inscribed regular polygon with nvertices is given by

In = 2rn sin 180

n

Fig. 11.13

Page 117: ɷshawns sagan integral and differential calculus

11.6. CIRCUMFERENCE OF CIRCLE 105

since a = 360/n is the central angle subtended by one side of a regularpolygon with n vertices.From

sin 2j1 = 2 sin 3 cos ,8

it follows with /3 = 180/n that

sin360

= 2 sin 180 cos180

and, consequently,n n n

360sin -180 nsin - _

n

Hence,

2 cos180

n

360sin

nIn=rn180'

n?3.

cos -n

We have seen in the preceding section [see (11.26)] that

lim 11 sin360

n-+oo2 n

exists and have denoted its value by 7r.Hence,

. 360sin -lim 1.n = 2rlim n n = 2rn- oo n-co 2 180

cos -

lim n sin 360

?,-co 2 n = 2r IT =2r7r.lim cos

180 cos 0n n-cc n

Therefore, we can define the circumference C of a circle as

(11.28) C = 2r7r

where r is the radius and IT = 3.14 .

This definition is, of course, only justified if the perimeter L,,, of thecircumscribed polygons tends to the same limit as the perimeter 1n of theinscribed polygons as n becomes large. This is indeed the case as can beseen as follows.

We see from Fig. If. 14 thata- = tan - .

Hence2r 2

a = 2r tan #2

Page 118: ɷshawns sagan integral and differential calculus

106 CHAP. II. AREAS

Fig. 11.14

and the perimeter Ln of the circumscribed regular polygon with n verticesis given by

Ln = 2rn tan 180

n

since # = 360/n for a regular polygon with n vertices.We have seen in the preceding section that

Hence,

limn tan 180 = limn

sin360

= 7r.W-00 n n-+ao 2 n

lim 1n = lim Ln = 2rir.n_00 n- co

Problems II.36-11.40

11.36. Consider inscribed and circumscribed regular polygons of a unit circle.Find 13, L3, 14, L4, 16, L6, 136, L36 and evaluate the differences

Lk-1kfor k=3,4,6,36.

11.37. Given a circle with radius r. Express the area A of the circle as afunction f (C) of its circumference. What is f (x) ?

11.38. Express the circumference C of a circle as a function f (A) of its area.11.39. Find approximations for Tr from the values 1k and Lk in Problem 11.36

and compare them with the approximations which we found in Section 5.

11.40. Find approximations of 7r by evaluating n tan 180 for n = 90 and n =180. Use five-place tables. n

Page 119: ɷshawns sagan integral and differential calculus

11.7. RADIAN MEASURE 107

7. RADIAN MEASURE

Now that we know it is senseful to assign a length measure to the cir-cumference of a circle, we are in a position to introduce a new measure forangles which proves to be more adequate for most mathematical investi-gations than the degree measure.

For the following discussion we refer to Fig. 11.15. If the point P moveson the circumference of the circle with radius 1 (unit circle) from T in thecounterclockwise (positive) direction through a distance x, then the segmentOP will sweep an angle of a°. It is obvious that there is a one to one corre-spondence between a° and x, i.e., with every angle a there corresponds onedistance x measured on the circumference of the unit circle and with everydistance x there corresponds one angle a provided that 0 < a < 360°.Hence the distance x along the circumference of the unit circle is suitablefor the measurement of angles. We call this new measurement of anglesthe radian measure. Since the length of the unit circle is 2ir, as we have seenin the preceding section, we see that an angle of 360° corresponds to theradian measure 2ir radians, an angle of 180 degrees corresponds to 7r radians,and angle of 90 degrees to 7r/2 radians, etc.

In general, we have

360x 180xa= _2ir 7r

Fig. 11.15

Page 120: ɷshawns sagan integral and differential calculus

108 CHAP. II. AREAS

tan x

Fig. 11.16 Fig. 11.17

for the conversion of x radians into a degrees and

Trax=-180

for the conversion of a degrees into x radians.The relation between circumference of a circle and its area with 7r as the

connecting link as developed in Sections 5 and 6 will enable us to find aformula for the area of a circular sector which is such that its circularboundary has the length x, where 0 < x < 27r. Now, the entire circle oflength 2rlr has the area r27r. Hence, a circular sector (see Fig. 11.16),

2

which has a circular boundary of length 1, has the area 2 =r

and, con-

sequently, a circular sector with a circular boundary of length x has thearea

(11.29) Sx=xr2

This result will now enable us to derive an important and interesting limitrelation, namely,

(11.30) limsin x = 1.

x-.0 X

To show this, we refer to the unit circle in Fig. 11. 17 and observe that thearea a of the triangle OAB is smaller than the area d of the circular sectorOCB which, in turn, is smaller than the area A of the triangle OCD:

(11.31) a<d<Awhere we assume that 0 < x <7r/2. [Since x is allowed to assume thevalue 0, we have to include the equal signs in the inequality (11.31).]

Page 121: ɷshawns sagan integral and differential calculus

11.7. RADIAN MEASURE 109

Now,1.a = - sin x cos x2

(11.32)

A=1tanx2

and if we substitute for a, d, A in (11.31) according to (11.32), we have

- sin x cos x < - 5 - tan x.2 2

Multiplying this inequality by 2 and dividing it by sin x, yields

cos x < x <sin x cos x

Now, we let x -> 0 and obtain

1 < lim x< 1

x-o sin xsince cos x -> 1 as x 0.

XThus as x - 0 is less than or equal to I and at the same time greater

sin xthan or equal to 1. This is only possible if

limx = 1.

x-o sin x

It follows that the same has to be true for the reciprocal value and thusproves relation (11.30) (see Problem 11.47).

We wish to observe that in most Calculus texts this fundamental formula,which is derived on the basis of the knowledge of the relation betweencircumference and area of a circle, is then used to derive a certain relation(differentiation of the sine function) which, in turn, is used to establish anintegration formula which ultimately is used to find the area and circum-ference of a circle. This is a beautiful example of a vicious circle.

Problems 11.41-11.47

11.41. (a) Convert the following angles in degrees into radians: 270, 225, 60,30, 22.5, 135, 345 57r 77r

(b) Convert the following radian measures into degree measures: W' T'37r or llzr 5a 57r

T'-F2'-6 '3'3'

Page 122: ɷshawns sagan integral and differential calculus

110 CHAP. H. AREAS

11.42. Evaluatesinx for x = 1,

4 , 6 , 16 , 64 .Use tables.

11.43. Suppose you want the value of sin x to five decimal places. Use tablesto determine for what values of x we can substitute x for sin x without noticingthe difference in the fifth decimal place. Do the same for four and three decimalplaces.

11.44. Find limtan x

z-.01 sin x

cosx x

11.45. Find

x

limsin2 x

.

.-0 x2

11.46. Find lim sin 2xz-0 x

(Hint: Note that tan x =cossin

xx and hence,

tan xa

sin11.47. Prove that if lim

x= 1, then lim

-2 = 1.z-0 sin x z-.0

Hint: Note thatsinx

=1

(x x

\ sin x

8. AREA UNDER A CURVE

We will now generalize the idea of Section 5 to define the area of a regionwhich is bounded on one side by a curve, represented by a function y =f(x), and on the remaining sides by the x-axis and the vertical lines at x = aand x = b (see Fig. 1I.18). We will assume for the following thatf(x) > 0in the interval a < x < b and that y = f(x) does not have any discontin-uities. We defined the area of a circle in Section 5 as the limit of a sum ofareas of triangles. To chop up the area in Fig. I1.18 into triangles would behighly impractical. Since the only other polygonal region of which wealready know the area is the rectangle, we will try to make use of rec-tangles for our purpose. This is really quite simple. We subdivide theinterval a < x < b into n equal subintervals by introducing the divisionpoints

X1, x2, x3, ... xtt-1with the understanding that

x1-a=x2-x1=x3-x2=... =b-x7,_1 = Ax

where Ax = b - a . To obtain a unified notation, let us setn

a = x0, b = x,,.

Page 123: ɷshawns sagan integral and differential calculus

11.8. AREA UNDER A CURVE 111

Y

2

Fig. 11.18

Now we locate in every subinterval xk_1 < x < xk (k = 1, 2, - - n) thevalue k for which the functionf(x) assumes the smallest value:

f ($k) = min f (x) in xk_1 < x < xk.

We erect the ordinates at all division points Xk and at all points k anddraw horizontal segments through for all k = 1, 2, 3, , n. Wethus obtain a collection of rectangles which are indicated by the shadedareas in Fig. 11. 18. We will denote the sum of these areas by Sn where nindicates the number of subintervals. We have

(11.33) Sn =.f AX +.f AX + ... + x

n

= Ox Iflkk=1

If Aa,b stands for the yet to be defined area measure of the region that isbounded by y =f(x), the x-axis and the lines x = a and x = b, then it isintuitively clear that

S,z < A..bLet us take, for example,

f(x)=x(1 -x)+ 1,a=0,b= 1

and let n = 4, hence Ox = 1.

Page 124: ɷshawns sagan integral and differential calculus

112

0 x1=4 x2=i x3=4 1

1 52 3 E4

Fig. 11.19

CHAP. It. AREAS

We obtain (see Fig. 1I.19) 1 = 0, 2 = 4, 3 = f, 4 = 1, and, consequently,

S4 4(1+16+16+1)=1.093

It is obvious from Fig. 11.19 that S4 is smaller than the area we areinterested in. The difference is indicated by the dotted region. Suppose weincrease the number of subintervals to n = 8 for the purpose of decreasingthis difference. Then we obtain (see Fig. 11.20)

L38171+19+79+79+19+71+11.132---8(I+ 64 16 64 64 16 64

Clearly,$8 > $4

but still$8<Aab.

Again the difference is indicated by the dotted region in Fig. 11.20.Now let us go a step further and double the number of subintervals

Page 125: ɷshawns sagan integral and differential calculus

11.8. AREA UNDER A CURVE 113

again, taking n = 16 and, consequently, Ax = i?a (see Fig. 11.21). Weobtain

1 271 71 295 19 311 79 319 319Sls

= 16 1+

256 + 64 + 256 + 16 + 256 + 64 + 256 + 256

+79 311 19 295 71 271

64 + 256 + 16 + 256 + 64 + 256-F 1 = 1.150

and we can see again that$8 <!S16<Aa.b

(We will see later, on p. 141, that this area Aa,b is found to bee = 1.16).We can see from Figs. I1.19, 20, and 21 that the discrepancy between the

sum of the areas of the inscribed rectangles and the yet to be defined areaof the region under the curve y = f(x) (error) as indicated by the dottedregions becomes smaller as the number n of subintervals increases. It isintuitively clear that this error can still be made smaller by further increasingthe number of subintervals. This is our motivation to define the area underthe curve y = f (x) if f (x) has no discontinuities in a < x < b betweenx = a and x = b as

(11.34)n

Aa,b = lim I {/J (Sk) AX

n.-om k=1

Fig. 11.20

Page 126: ɷshawns sagan integral and differential calculus

114 CHAP. II. AREAS

Fig. 11.21

where the limit is to be understood in the following sense: The number ofsubintervals has to increase beyond bound (n -> oc) while at any given

stage of the process all subintervals have to have the same lengthb a

It is clear that as n - oo, the length of each subinterval b a 0.n

As in Section 5, we have to show now that this definition is senseful. Wewill proceed for this purpose as in Section 5 where we first considered thesum of the areas of the inscribed triangles and then the sum of the areasof the circumscribed triangles, showing that the difference of the twoapproaches zero as n - oo. In order to obtain the sum of the areas of thecircumscribed rectangles in our case, we have to locate all points 77k in xk-1< x < xk for which the function f(x) assumes its largest value:

f (??k) = max f (x) in xk_1 < x < xk.

Clearly, the rectangles with the widths Ax = xk - xk_1 and the heightsf(771) are circumscribed rectangles (see Fig. 11.22). We denote the sum ofthe areas of all the circumscribed rectangles by 5,,:

n

Sn =f(771)L x +J (?72) AX + ... +f(77n)AX = AX If (770-k=1

Page 127: ɷshawns sagan integral and differential calculus

11.8. AREA UNDER A CURVE 115

Clearly,S. Aa.b

for all n.Now we consider the difference (see Fig. 11.23) 9,, - Sn and obtain

S. - 'Sn = If (771) - f(')] Ox + If (112) - f (e2)] Oxn

.i.. ... + If(77-) AX = AxI If (77k) -f(Sk)]

= b - a.1 U (77k) - f ($k)]

n k=1

k=1

It can be shown that this difference approaches zero, if f(x) is uniformlycontinuous in a < x < b. (This is the case, if f (x) is continuous in everypoint of the interval a < x < b.)

From this and the fact that both limits exist, it follows that

lim Sn = lim Sn = Aa,bn- CO n-co

We call this limit, which serves to define the area measure of the regionencompassed by y = 0, x = a, x = b, and y = f(x), the definite integral of

f (x) between the limits a and b :

urn zx f() =lirAX jf(77k) = f(x)dx.n- 00 k=1 n-+co k=1 a

y

Fig. 11.22

Page 128: ɷshawns sagan integral and differential calculus

116 CHAP. If. AREAS

--xa X1 X2 X3 . . . . xn_1 b

Fig. II.23

Jo-If f (x) is uniformly continuous* in a < x < b, then there exists for anyarbitrary smalls > 0 a 6, soft(hat

{/l " J I < s

provided thatIrk - kI < 6e

(see Chapter I, Section 13).Now let us choose an arbitrary small s > 0. Then 6 is determined by

this s [and the function f (x)] and we take n sufficiently large so that

(11.35)Ax=b-a<SE.

n

Since 77k and k are both located in the same subinterval of length Ax, wehave

Ink - 41 < Axand because of (11.35)

Irk ski<a.Hence,

<s[we can omit the absolute value signs because from the definition off (?7k)and follows that f(7lk) > and we obtain

nb0< n-S,to - aI CJ (?7k) f(4)]

n k=1

<b-a.1 s b - ans (b - a)sn k=1 n

* See Chapter I, Section 13.

Page 129: ɷshawns sagan integral and differential calculus

11.8. AREA UNDER A CURVE 117

where e can be chosen as small as we please, provided we choose n suffi-

ciently large. (Clearly, e = e 1 = en. See Problem 11.23 ) .k=1 k=1

This means that we can make the difference Sn - Sn as small as we pleaseif we choose n large enough, and this is obviously equivalent with the state-ment

(11.36)

Since

lim(S.-Sn)=0.n- oo

Sn < Aa,b < ,finwe obtain

0 GAa.b - Sn S Sn - Ls.and as n co, in view of (11.36)

lim (A,,,b - 5,,) = lim (S - Sn) = 0.n- oo n-00

It is quite obvious that S. is an increasing sequence, i.e., Sn becomeslarger if we add additional division points (increase n). Furthermore, itseems intuitively clear that Sn is always less than or at most equal to anyone of the 9, ,,-no matter how large n is. Therefore, we conclude again-aswe did already in Section 5-that lim S. exists. Now, lim Aa,b exists, of

n-+oo

course, because Aa,b is a constant number. Hence, by our rule according towhich the limit of a difference of two sequences is equal to the difference ofthe limits of the two sequences, provided that both limits exist (see Section4), we obtain

lim (A,,,b - Sn) = lim Aa,b - lira Sn = Aa,b - lira Snn-ao n- co n-+ ao

and since

lim (Aa, b - Sn) = 0,n- oo

we haveAa.b = lim Sn. 4

n-.oo

Problems 11.48-11.53

11.48. f(x) = x(1 - x) + 1, a = 0, b = 1. Find En for n = 4, 8, and 16.Find S, - Sn for n = 4, 8, and 16, using the values for Sn which were found inthe text.

11.49.gn

2Sn is a better approximation to the area than either 3,,, or L5,,.

Evaluate this average for n = 4, 8, and 16 for the function in Problem 11.48.11.50. Find Sn and 9n for n = 4, 8, 16 for the function f(x) = l/x, a = 1,

b = 2. (Aa,b =0.69315)

Page 130: ɷshawns sagan integral and differential calculus

118 CHAP. II. AREAS

11.51. Same as in Problem 11.49 for the function in Problem 11.50.

11.52. Same as in Problem 11.48 for the function f (x) = '%/1 - x2, a = 0,b = 1. The area bounded by this curve, the x-axis, and the lines x = 0 andx = 1 is a quadrant of a unit circle. Multiply your results for S and 5, by 4and compare them with the results which we obtained for 7f in Section 5 wherewe used regular polygons to approximate the area of a circle.

11.53. Take again f(x) = a = 0, b = 1. Instead of dividing theinterval into equal subintervals, use the following division points:

1 _1 _3 1 _5 11 3xl = $,x2 4'x3 x4

_2'x5xs

_16'x7 =4,

13 7 29 15 31X 8 = 16, xs = g , xlo 32' x11

_16' x12

_32

Compare your results with the results of Problem 11.52 as well as with theresults for 77 of Section 5.

9. THE DEFINITE INTEGRAL

In the preceding section, we defined the area under a curve y = f (x)between x = a and x = b and bounded below by the x-axis to be the limitof a sum $ (and, in view of (11.36), the limit of S. as well).

Since S is the sum of inscribed rectangles and thus its value always belowthe value which is to be assigned to the region under consideration, wecall it a lower sum. S,,, for analogous reasons, is called an upper sum.The definition of area which we gave in the preceding section (and the sub-sequent proof we presented to establish the existence of the limit of Sn)only applies to a very special case, namely, to the case where f (x) is uni-formly continuous in a < x < b and f (x) Z 0 in the entire interval.

Let us postpone a discussion of the possibility off (x) being negative inat least some portions of the interval for later and focus our attention onthe assumption of uniform continuity. It is quite obvious that we canassign an area measure to a region which is bounded above by the stepfunction

y = [x]

and on the sides by x = 1, x = 5 and below by the x-axis as depicted inFig. 11.24 (see also Chapter I, Section 12, Fig. 1.40). Even though this func-tion is certainly not uniformly continuous in 1 < x < 5 (there are threejumps each of saltus 1 in the interval), it follows quite easily, since theshaded area in Fig. 11.24 consists of four rectangles, that the area measureto be assigned to this region is 10.

In order to cover this case-and even others- by a general definition,we proceed as outlined in the following. First of all, we consider the func-tion y = f(x) of which we do not know anything except that it is uniquely

Page 131: ɷshawns sagan integral and differential calculus

11.9. THE DEFINITE INTEGRAL 119

Fig. 11.24

defined on the interval a < x < b, i.e., with every x in the interval therecorresponds one value y. Then we subdivide the interval a < x < b into nequal or unequal subintervals and compute the corresponding upper andlower sums. (A function may not always have a largest and a smallestvalue in each subinterval. In that case, we take, instead of the largest value,the smallest number which is larger than or equal to the values of the func-tion in the subinterval, and instead of the smallest value of the function,we take the largest number which is smaller or equal to the values of thefunction in the subinterval.)

We do this for all possible subdivisions and then let n --o- oo with theunderstanding that the length of each subinterval approaches zero in thisprocess. If upper and lower sums for all possible subdivisions tend to thesame limit, then we call this common limit the definite integral of f (x)between the limits x = a and x = b and write

n

(11.37) lim l f ( k) i1 xk = f f (x) dxn-00 k=1 Ja

where Oxk = xk - xk_l. We call a the lower integration limit and b theupper integration limit. The integral sign is really a degenerated (or hybrid,depending on the point of view) S which signifies that a summation processis involved in obtaining the integral, even though not an ordinary sum-mation process.

The function f (x) under the integral sign is called the integrand and dxplays essentially the role of a taillight. In the case that f (x) > 0, the integralin (11.37) defines the area under the curve y = f (x) between x = a, x = b

Page 132: ɷshawns sagan integral and differential calculus

120 CHAP. II. AREAS

and above y = 0. Specifically, we can show that if f (x) is uniformly con-tinuous in a < x < b, then the definite integral as defined in (11.37) existsand its value is to be found as the limit of a lower sum using a subdivisionof equal subintervals, because the limits of all such sums are equal in caseof the existence of the integral. Thus, the definition we gave in the pre-ceding section appears as a special case of the definition (11.37) of thissection.JO-We wish to point out that the definition (11.37) is not only generalenough to cover cases where the function f (x) has finitely many discon-tinuities in the interval of integration but also certain cases where thefunction has infinitely many discontinuities. We will give here two ex-amples, in one of which the integral exists, and in the other of which it doesnot exist.

Let us consider the function

f(x) =

1forI <x <1

0forI <x<13 2

1for1 <x<14 3

0for1<x<15 4

1for1 <x<16 5

The reader can easily see how the definition of this function is to be con-tinued (see Fig. 11.25).

We can see that this function has infinitely many jumps of saltus 1 in theinterval 0 < x < 1. Still, it can be shown that the integral as defined in(11.37) exists and has the value

f(x) dx = 0.69315.. .fi

However, if we consider the function

f (x) _1 for x irrational0 for x rational

Page 133: ɷshawns sagan integral and differential calculus

11.9. THE DEFINITE INTEGRAL 121

y

Fig. 11.25

then we see easily that the integral from 0 to 1 (or over any other intervalfor that matter) does not exist. Since

min f (x) = 0

in any subinterval (because any subinterval, no matter how small, containsrational points), and

max f (x) = 1

in any subinterval (because any interval contains irrational points), we have

Sn=0forallnand

9,,, = 1 for all n.

Thus, lim Sn lim S,, i.e., not all limits of upper and lower sums areequal and, consequently, the integral does not exist, since one of the basicconditions is not met.

Now we turn to the question of what happens if f (x) is negative in theentire integration interval or in parts thereof.

Let us refer to Fig. 11.26. We wish to find the area which is enclosed byy = f(x), x = a, x = b and the x-axis. The function y = f(x) in Fig. 11.26

Page 134: ɷshawns sagan integral and differential calculus

122 CHAP. II. AREAS

intersects the x-axis in two points, namely, at x = c and at x = d. In theinterval c 5 x 5 d we have f (x) < 0. Clearly, we obtain the area underthe curve y = f(x) between a and c and d and ('b by integration as follows

Aa,c = f(x) dx, Ad,b = J f(x) dx.d

What happens between c and d? Clearly, the values off(x) in this intervalare negative. Hence, in order to obtain the heights of the rectangles inupper and lower sum, we have to take the absolute values off (x). Specifi-cally, we take

min I f (x) = I f I in xk_1 < x < xkand

max I f (x) I = I f (,qk) l in xk_1 < x < xkand obtain

Ac,d = lim I f( k)I Ax = f lf(x)I dx,dn-,co k=1 c

i.e., instead of integration the function f(x) in the interval in which thefunction f (x) is negative, we integrate the absolute value of the function.

Thus we obtain in case

f(x) >Oina <x <c,f(x) <Oinc <x <dandf(x) >0 in d <x <bthe following formula for the area

Aa,b =f f(x) dx + f dlf(x)I dx +d

f(x) dx.a c d

Y

Fig. II.26

Page 135: ɷshawns sagan integral and differential calculus

[1.9. THE DEFINITE INTEGRAL 123

Fig. II.27

Suppose we have the function

y=x(x - 1)(x-2)(see Fig. 11.27) and we wish to find the area which is enclosed by this curveand the x-axis. Clearly, we have to take

lAo,2 = fo

- 1)(x - 2) dx + f," jx(x - 1)(x - 2)I dxJo

l=f - 1)(x - 2) dx - I x(x - 1)(x - 2) dx.0 1

The reader can easily generalize this idea to the case where f (x) has anyfinite number of sign changes in the integration interval.

Problems II.54-II.59

11.54. Show that

f2x(x - 1)(x - 2) dx = 0.

II.55. Show by geometric inspection that

b c b

f(x) dx = f f(x) dx +f f(x) dxa ¢ c

where f (x) > 0 in a < x < b and c is some number between a and b.II.56. Show that the formula in Problem 11.55 is also true if a -< b <- c and

f(x)>0in aSx<c.II.57. Show that the formula in Problem 11.55 is true regardless of the sign

of f (x).

Page 136: ɷshawns sagan integral and differential calculus

124 CHAP. II. AREAS

Cf (x) dx = Cl f (x) dx where C is a constant. Assume11.58. Show thatJ

b

b

thatf(x) is such that the definition of the integral in (11.34) applies, and use thisdefinition to prove the above formula.

11.59. Consider the function y =f(x) which is represented in Fig. 1.48(Chapter I, Section 13). Find the area of the region which is bounded above byy = f(x), below by y = -1 and on the sides by x = 1/2n and x = 1. This areaAn depends on n. Does lim A,, exist? If so, what is this limit and what does itrepresent? n~x

10. THE TRAPEZOIDAL RULE

We have seen in the preceding sections that the area under a curve can beapproximated by lower sums or upper sums, depending on whether wewant an approximation from below or above. The error at each subintervalis indicated in Fig. II.28 by the shaded area in case we take the lower sum,and by the dotted area in case we use the upper sum, as an approximation.Even though this error becomes small and ultimately vanishes as Ax =xk+l - xx. -> 0 for all k, in case the function is well behaved in the intervalin which the area is sought, there is no reason why we shouldn't look for abetter approximation, i.e., an approximation that yields for the same sub-division a smaller error than either the upper or the lower sum.

Let us consider Fig. II.29. If we join the beginning point Pl of the curvein the interval xk < x < xk+i with its endpoint P2 by a straight line andapproximate the area under the curve by the shaded trapezoid, then we canexpect to obtain in general a more accurate result than the one obtainablefrom an approximation by an upper or a lower sum. Of course, there arecases where upper or lower sums do yield better results for a given sub-division than the trapezoidal approximation would yield. We depict twosuch cases in Fig. 11.30.

Y

Xk xk+1x

Fig. 11.28

Page 137: ɷshawns sagan integral and differential calculus

11. 10. TRAPEZOIDAL RULE 125

y

Fig. 11.29

In Fig. 11.30(a) the shaded area represents the error that is committedif the trapezoidal approximation is used, and the dotted area represents theerror resulting from an approximation by the lower sum (inscribed rec-tangle). In Fig. II.30(b) the dotted area represents the error due to an uppersum approximation and the shaded area again indicates the error producedby a trapezoidal approximation. It is obvious that, in both cases, thedotted area is smaller than the shaded area, i.e., the error due to the trape-zoidal approximation is larger than the error committed by the lower andupper sum approximations, respectively. However, the introduction ofadditional division points will, in general, reverse this situation.

y y

xxk Xk+1 Xk k+1

(a) (b)

Fig. 11.30

Page 138: ɷshawns sagan integral and differential calculus

126 CHAP. If. AREA;

Fig. 11.31

Let us now derive a general formula for the approximation of an areaunder a curve by a sum of areas of trapezoids.

First, we note that the area of the trapezoid ABCD in Fig. 11.31 is given by

(11.38) Area of trapezoid = ba + C

2

(see Problem 11.12). This formula is quite obvious if we notice that theshaded triangles in Fig. 11.31 are congruent.

Now we consider a function y = f(x) of which we assume that it has nodiscontinuities in a S x S b (see Fig. 11.32). We divide the intervala< x s b into n equal subintervals of length

Ox=b -an

Page 139: ɷshawns sagan integral and differential calculus

11.10. TRAPEZOIDAL RULE 127

and call the division points x1i x2, x3, , xn_1, putting x0 = a, x,, = b.Clearly,

xk=a+kb a fork=0,1,2, n.n

At each one of these division points we erect the ordinate

Yk = f (xk),

join each pair of adjacent points (xk, yk), (xk+1, yk+1) for all k = 0, 1, 2,n - 1, by a straight line and compute the area of every one of the

trapezoids thus obtained (see Fig. 11.33). Clearly, we obtain for the areaAk of the (k + 1)th trapezoid according to (11.38)

Ak=Axyk+1+yk2

for all k = 0, 1, 2, , n - 1, where we agree that f (a) = f (xo) = yo andf(b) =f(xn) = y,.

Now we sum all these areas and obtain

Ox(2 2 2

Ax(Zo+y1+y2+...+yn-1+ 2")

Since Ax = b - a , we obtain the following approximation for the arean

Fie. 11.33

Page 140: ɷshawns sagan integral and differential calculus

128 CHAP. II. AREAS

under the curve y = f (x) between x = a and x = b :

Jf(x)_

(I1.39) dx - b a (yo + 2y1 + 2y2 + ....+ 2yn-1 + yn).

This formula is called the trapezoidal rule for quite obvious reasons.In Section 8 we approximated the area under the curve

y=x(1-x)+1between x = 0 and x = 1 by a lower sum with 16 subintervals andobtained $16 = 1.150 . Let us now use the trapezoidal rule with thesame number of subintervals for the same problem.

We have1 1 _3 _1 5xo0,x116,x28,x3

16'x4 4'x5_`16

3 7 1 9 5 _11x6 8 , x7

16 ' x$ 2 ' x9 16 ' x1o8 ' x11

16

_3 13 _7 _15x12 4,x13 16,x14 8,x15 16,x16= 1

and, accordingly,

271 284 295 304 311YO 1,y1=256y2-256'y3=256'y4=256y5=256

__ 316 319 320z for k = 0 1 2 16.Y6

256 ' y' - 256 ' YS - 256 ',k - y16-k , , ,

Hence, we obtainrl

Jo

[x(1 - x) + 1] dx

1271+ 284 + 295 + 304 +311 + 316 + 319 + 320

321+2 +319+316+311 +304+295+284+271 + 1

256

= 1.16601.

We will see in Section 13, Problem 11.81, that the true value of thisintegral is found to be 1.16. Thus we see that the approximation 1.16601which we just obtained is considerably better than the approximation1.150 as obtained from the lower sum in Section 8.

Let us now find as another application of the trapezoidal rule an approxi-mation to the value of 7r, i.e., the area of the unit circle. The unit circlewith the center in the origin has the equation

y=1'1 -x2.

Page 141: ɷshawns sagan integral and differential calculus

II.10. TRAPEZOIDAL RULE 129

We consider one quadrant of this circle above the x-axis between x = 0and x = 1 and choose n = 32.

We have

and obtain accordingly the following values for yk.

1/0 = 1 21/9 = 1.91927 21/17 = 1.69443 2y25 = 1.24844

2y1 = 1.99902 21/10 = 1.89984 21/18 = 1.65359 2y26 = 1.16592

2y2 = 1.99609 2y11 = 1.87812 2y19 = 1.60930 2y27 = 1.07347

21/3 = 1.99119 2y12 = 1.85405 21/20 = 1.56125 21/28 = 0.96825

2y4 = 1.98431 2y13 = 1.82752 2y21 = 1.50908 21/29 = 0.84779

21/5 = 1.97543 21/14 = 1.79844 21/22 = 1.49217 2y30 = 0.69597

21/6 = 1.96453 21/15 = 1.76666 21/23 = 1.39054 2y31 = 0.35903

21/7 = 1.95156 2y16 = 1.73205 21/24 = 1.32288 1/32 = 0.

21/s = 1.94849

Hence, we obtain with (11.39)

7r 1

4=J Ji - x2 dx_ 64 50.06768 = 0.78231

0

and, therefore,7r ti 3.12924.

This result deviates from the true value of IT = 3.1415 by 0.0123which is not as accurate as we might have expected it to be. The reason isthat the trapezoidal approximations to the circle do not yield very goodresults in the neighborhood of the point (1, 0).

Problems 11.60-11.64('

11.60. Find an approximation to J x10 dx by using the trapezoidal rule with0

n = 4 and compare it to the result that is obtained from an approximation bythe lower sum S4. (True value of the integral is

II.61. A function f (x) is said to be monotonically increasing if, for any pairof values x1 < x2, the relation f(xl) < f(x2) holds. Show that the trapezoidalrule applied to a monotonically increasing function is obtained by taking thearithmetic mean of lower sum and upper sum. (Hint: Note that, if f (X) ismonotonically increasing, then max f (x) = 1/k+1 in Xk < X < xk+l, min f (x) _1/i in xk < x < xk+l.)

1'1II.62. Find an approximation to J x2 dx by using the trapezoidal rule. Start

0with n = 2 and continue doubling the number of subintervals until no morechanges of the result in the third decimal place can be expected.

Page 142: ɷshawns sagan integral and differential calculus

130 CHAP. IT. AREAS

11.63. Prove that the trapezoidal rule yields the exact result if applied to theintegration of all linear functions, i.e., functions of the type y = ax + b.

ri11.64. Use the trapezoidal rule to find an approximation ton = 4J V! - x2 dx

0

taking 16 subintervals between 0 and j and 32 subintervals between j and 1.Compare your result with the one obtained in this section and the one obtainedin Section 5.

11. INTEGRATION BY A LIMIT OF A SUM PROCESS

We will devote this section to the discussion of two very simple examples,in which we will compute the area of a rectangle and a triangle by an inte-gration process, i.e., by the evaluation of the appropriate limits of lowersums, and thus demonstrate how the formulas for the area of these basicgeometric configurations now appear as special cases of our general defini-tion of area as given in (11.34) (Section 8). (It would be awkward, indeed,if this were not so. After all, the general definition of area as given inSection 8 was based on the formula for the area of a rectangle, and theformula for the area of a triangle, in turn, was developed from the formulafor the area of a rectangle-see Section 2.)

First, let us consider a rectangle with the dimensions a and b. We chooseour coordinate system so that the rectangle takes the position as indicatedin Fig. 11.34. We can interpret this problem as the problem of finding thearea of a region which is bounded above by y = b (the line parallel tothe x-axis at distance b from the x-axis) between x = 0, x = a and abovethe x-axis.If we subdivide the interval 0 < x < a into n subintervals of length

Ax =a

, we obtain in every one of these subintervalsn

and

yn

min f (x) =f(k) = b

maxf(x) = f(Y7k) = b,

y=b

-> x

Fig. 11.34

Page 143: ɷshawns sagan integral and differential calculus

11. 11. INTEGRATION BY LIMIT PROCESS 131

(a, b)

a

Fig. 11.35

i.e., f f (Y7k) = b and, consequently,

Sn=gnbAx=ball=ab.k=1 n k=1

Therefore,

(11.40)f0a

b dx =limSn =limSn =limab = ab,n- oo n-. n- 00

as was to be expected.Next, we consider a right triangle where the two sides forming the right

angle have the dimensions a and b. We choose our coordinate system insuch a way that the triangle will take the position as indicated in Fig. II.35.The line through (0, 0) and (a, b), forming the hypothenuse of the triangle,has the equation

Thus we face the problem of finding the area under the curve y =b

xa

between x = 0, x = a and above the x-axis. A subdivision of the interval0 < x < a into n equal subintervals will yield an undersum (lower sum),as indicated for n = 6 in Fig. 11.36. In general, we have clearly

a 2a n-11=o, 2=Th =-,..., n= a

and, consequently,n n n

J 0, .fb

, f 2b ... , f (n) _ (n - 1)bn. n n

Page 144: ɷshawns sagan integral and differential calculus

132

Thus with Ox = a we haven

CHAP. II. AREAS

sn-a[,+b+2b+3b+..,+(n- 1)b

nLL n n n n

=ab[1+2+3+....+(n-1)].

n

In order to discover the limit of this expression as n -* oo, we have toevaluate the sum

1+2+3+...+(n-1)

for any n and represent this sum in terms of n. For this purpose, we rewritethe sum in the more elaborate form

1 1).

If we add the first and the last term, we obtain n. If we add the second andthe next to the last term, we obtain n. If we add the third term and theterm preceding the next to the last, we obtain n again, etc....

Now, if n is odd, we obtain exactly n 2 1 such sums, i.e, the sum of this

series is given byn(n-1)

.1 2-F--. .(n-1)

y

2

Fig. 11.36

Page 145: ɷshawns sagan integral and differential calculus

II.11. INTEGRATION BY LIMIT PROCESS 133

Suppose n is even. Then we obtainn

2 2 such sums and the term 2 in

the middle is left out. Thus we obtain altogether

1+2+3+...+.(n-1)=n2 2n +2=n(n2

1)

which is the same formula as the one obtained above for odd n. Thus wecan state that

1+2+3+...+(n-1)=n(n-1)2

for any integer n.Hence, we obtain

Sn_abn(n-1)abn2-n=ab(1- 1)

_n2 2 2 n2 2 n

and we see easily that

ab(11.41) f" b

x dx lim S lImab(1 1) ab .

lIm (11)

p a n-fcc n 2 n 2

as we expected to obtain.Next we consider the general case where the triangle is not necessarily

a right triangle. We choose our coordinate system so that the origin coin-cides with the vertex A at which the triangle has an obtuse angle, if it hasan obtuse angle at all (note that a triangle has at most one obtuse angle).If there is no obtuse angle, then we let the origin coincide with just anyvertex.

Further, we choose the x-axis so that it coincides with the height of thetriangle through A (see Fig. 11.37). With the notation introduced in Fig.11.37, we have p + q = a. The equation of the line through AC is given by

yPxwhere h = b by the theorem of Pythagoras and the equation ofthe line through AB is given by

Therefore, the area of the triangle (ABC) is given by

It h

fo hxdx+ fo J- hxJ dx.

Page 146: ɷshawns sagan integral and differential calculus

134 CHAP. If. AREAS

Y

Fig. 11.37

In view of (11.41), where we now write h instead of a and p instead of b incase of the first integral, and h instead of a and q instead of b in case of thesecond integral, the area of the triangle (ABC) is given by

hp + hq = h(p + q) = ha

2 2 2 2

Since a is the base of our triangle, we thus obtain the well known formula

Area of triangle = 2 base x height.

Problems 11.65-11.69

11.65. Evaluate the following sums:

12 10

(a) I k (b) kk=1 k=210000 100

(c) 1 k (d) 1 (k +k

k1 k-111.66. Find the area of a right triangle with sides 3, 4, 5 by a limit of a sum

process.11.67. Find the area of a triangle with sides 3, 5, 7 by a limit of a sum process.

Page 147: ɷshawns sagan integral and differential calculus

1I.12. FINITE SUMS 135

11.68. Find the area of a region bounded by y = [x], the x-axis, x = 0 andx = 10 by a limit of a sum process.

11.69. Consider f (x) = x2 in the interval 0 <_ x < 1. Find S., for the areaunder y = x2, between x = 0, x = 1 and above the x-axis. Use equal subintervalsand evaluate the lower sum for n = 4, 8, 32. Make a conjecture about the limit.

12. FINITE SUMS

In the preceding section we found thatn-1 n(n -k=1 2

by a method which C. F. Gauss (1777-1855) purportedly invented in gradeschool when his incompetent teacher required the class to add up all thenumbers from 1 to 100 in the hope that this would keep the class busy fora while. It is almost needless to say that Gauss had the last laugh.

In the following section we will need a formula for the sumn-1

k2=1+4+9+...+(n-1)2.k=1

In order to evaluate this sum, we consider the following two sums, whichappear to pose a more complicated problem (and do, as a matter of fact,if we were out to evaluate them), namely,

n-1'10=1+8+27+...+(n-1)3

andk=1

n-11(k-1)3=0+1+8+27+...+(n-2)3.

k=1Clearly,

n-1 n-1

(11.42) k3 - (k - 1)3 = (n - 1)3.k=1 k=1

On the other handn-1 n-1 n-1

(11.43) 1 k3 - I (k - 1)3 = 1 [k3 - (k - 1)3]k=1 k=1 k=1

n-1(k3-k3+3k2-3k+1)

k=1

n-1_ (3k2 - 3k + 1).

k=1Since

n-1 n-1 n-1 n-1

(3k2-3k+1)=3k2-31k+I1k=1 k=1 k=1 k=1

Page 148: ɷshawns sagan integral and differential calculus

136 CHAP. II. AREAS

andn-1 n(n - 1)

n-1

I k = I1=(n-1),k=1 2 k=1

we have in view of (11.42) and (11.43)

(See Problem 11.23),

3n

k2 - 3n(n -

1) + (n - 1) = (n -i)3

Ik=1 2

and, consequently, after shifting of terms and dividing by 3,

n-1 33 1 11

k=1

) + n(n- )-n-(n-3

r6 3

=(n-1)[2(n- 1)2+3n-21

L 6 J

=(n- 1)r2n2-4n+2+3n-216LL

= (n - 1)n(2n - 1)6

Let us list all the summation formulas which we obtained so far for laterreference purposes:

n-1

(11.44) 57.1=(n-1)k=1

n-1 n(n - 1)(11.45) _7 k =

k=1 2

(11.46) -1 2 = (n - 1)n(2n - 1)k=1 6

Problems II.70-11.74

11.70. Evaluate

101 k 1000

(a) A (b) A(k + 1)

51 100

I k2 (d) I k2k=1 k=1

11.71. Find a formula for k3. [Hint: Consider the sums 10 andn-1 k=1 k=1A (k - 1)4 and proceed as in the text.]

b:=1

11.72. Find a formula for1+3+5+ +(2m+1).

[Hint: Observethat1+3 +2m =2(1 +2+3++m).]

Page 149: ɷshawns sagan integral and differential calculus

11. 13. AREA UNDER PARABOLA

11.73. Evaluate

11.74. Evaluate

n (n)2 +n

(2a)2 + ... +n

((nn lea I2

8r37r 11-1 8r3,T

n n

n -+ oo.

13. AREA UNDER A PARABOLIC ARC

137

Let us consider the problem of finding the area A which is bounded by anopen rectangle with a parabolic arc being the supplementary boundary.

First, we consider the problem of finding the area under the parabola

y=x2between x = 0, x = a and above the x-axis (see Fig. II.38). We subdivide

the interval 0 < x < a into n equal subintervals of length a . Then then

coordinates of the division points are

a 2a 3a (n - 1)axl __, x2=-1 x3=_I ...,xn-1=n n n n

and for the sake of uniformity we put

0 = x0 and a=xn.y

x1 X3X2

Fig. 11.38

Page 150: ɷshawns sagan integral and differential calculus

138

We note thatmin f (x) = f (xk) in xk C x C xk+l

and obtain, therefore, for the lower sum in view of

/2 2

f(xk) = {J (kn) =,l2

the following expression

sn=0+a( +_ - +n n)2 n (na)2

CHAP. II. AREAS

a (n - 1)a 12n 11 fl

a3 a3 (n - 1)n(2n - 1)=n3[1+22+32+...+(n-1)2]=n36

3

= 6n3(2n3 - 3n2 + n)

or, as we may write,

Since

we have

Sn3 (1 2n +

2n1

2

13 )=1lim(1- +n- co 2n 2n2

aa3

(11.47) A= x2 dx =30

It follows from (11.47) right away that

f x2dx b3

3

which represents the area under the parabola y = x2, between x = 0,x = b, and above the x-axis.

Hence, we obtain for the area under the parabola y = x2 between x = a,x = b, and above the x-axis

(11.48) x2dx 0 a3=

a 3

(see Fig. 11.39).This formula, together with the formulas (11.40) and (11.41) of Section

11, will enable us to find the area under a parabolic are

(11.49) y = Ax2 + Bx + C

Page 151: ɷshawns sagan integral and differential calculus

11. 13. AREA UNDER PARABOLA 139

Y

x = a

Fig. 11.39

x=b

between any point x = a and any other point x = b, if we also assumethat the function in (11.49) is positive between a and b. (Otherwise, wewould have to divide the integration interval as we have demonstrated inSection 9.)

We need for this purpose a property of the definite integral which can beeasily established, namely, that

(11.50) J [f(x) + g(x)] dx =J f(x) dx + g(x) dx.b b b

a a a

We will prove this relation for monotonically increasing functions f (x)and g(x) only, but we wish to point out that it is true in general-providedthat the two integrals on the right exist.

If f (x) and g(x) are monotonically increasing (for definition of mono-tonically increasing functions, see Problem 11.61), it is clear that

min [f(x) + g(x)] = f(xk) + g(xk) [= min f (x) + min g(x)]

in any interval xk < x < xk+1. Therefore,

n--`1 n-1/

n-1sn = Ox G [f (xk) + g(xk)] = Ox .1 f (xk) + Ax Y_ g(xk)

k=0 k=0 k=0

If the limits of both sums on the right side of this relation exist as n -> cc,which we will assume, then invoking our rule according to which the limit

Page 152: ɷshawns sagan integral and differential calculus

140 CHAP. If. AREAf

of a sum of two sequences is equal to the sum of the limits of the twcsequences, provided that both limits exist (see Section 4), we have that

n-1 n-1 b b

lim Sn = lim Ax I f (xk) + I'M Ox I g(xk) = f f (x) dx + f g(x) dx.n-+oo n-oo k=0 k=0 Ja Ja

On the other hand,

lim Sn = f b[f (x) + g(x)] d xn'm Ja

and (11.50) follows.Now, if we wish to integrate the function in (11.49), we make use of the

relation (11.50) as follows:

f Jb(Ax2 + Bx + C) dx = f Ax2 dx + f t(Bx + C) dx.a a a

We apply (11.50) again to the last integral on the right and obtain

b b b b

(Ax2+Bx+C)dx=J Ax2dx+ f Bxdx+rCdx.a a a Ja

We have seen in Problem 11.58 that

f bkf(x) dx =kb

f(x) dxa a

where k is a constant. Hence

f(Ax2 + Bx + C) dx = A fbx2 dx + B f bx dx + C f.bdx..' a a a

From (11.48) it follows that

3

From (11.40) and (11.41) in Section 11 it follows that

b62 a2 L

f xdx= and f dx=b - aa 2 a

(see also Problems 11.75 and 11.76).Thus we finally arrive at the formula

(I1.51)b 3

f (Ax2+Bx+C)dx=Ab -a3 + Bb 2 -a 2 +C(b-a).

a 3 2

b

x2 dx =b3 - a3

f.

Page 153: ɷshawns sagan integral and differential calculus

11. 13. AREA UNDER PARABOLA 141

Problems 11.75-11.83

11.75. We have seen in Section 11 that

fJ

b dx = ab.0

b b

Derive from this formula that f dx = b and, in turn, dx = b - a.0 aJ

abx dx = a11.76. We have seen in Section 11 that

I . Derive from thisa 2

result that 0bx dx = b2

2 and, in turn, b x dx =b2 - a2

2 .a

11.77. Evaluate :

('1 0

(a) J (4x2 + 6x + 3) dx (b) I (x2 - 12x + 1) dx0 g

fo

(c) (3x2 + 2x + 4) dx (d)I101(x2

- 500) dx.loo

11.78. Find the area of the region between the curve y = f(x), the x-axis, andx=a,x=b:(a) f(x) = x2 - 1, a = -2, b = 2(b) f (x) = x2 - 3x + 2, a = 0, b = 4(c) f (X) = x2 + 3x + 2, a = -5, b = -1(d) f(x) = x2 - x, a = -1, b = 1

Note that these functions change sign in the integration interval.11.79. Determine A, B, C so that the parabola

y=Axe+Bx+Cpasses through the points (1, 1), (2, 2) and (3, 5).

11.80. Find the area under the parabola in Problem 11.79 between x = 1 andx=3.

Cl11.81. Find J [x(1 - x) + 1] dx and compare the result with the values which

0we obtained for the undersums in Section 8 and from the trapezoidal rule inSection 11.

11.82. Find the area of the region which is encompassed by the parabolay = x2 and the line y = x. (Sketch the region.)

11.83. Find the area of the region which is encompassed by the two parabolas

y=x2and y=4x2+1.(Sketch the region.)

Page 154: ɷshawns sagan integral and differential calculus

142 CHAP. II. AREAS

14. SIMPSON'S RULE

In Section 10 we derived the so-called trapezoidal rule which yields fora given subdivision an approximate value to the definite integral. Wearrived at this rule by approximating the function y = f (x) to be integratedby a polygon and then, in turn, by approximating the area under the curvey = f (x) by a sum of areas of trapezoids.

It seems quite obvious that we can obtain better approximations to thearea, if we use a closer approximation to the curve than the one furnishedby a polygon. This is exactly what we are going to do now.

Again we subdivide the interval a < x < b by division points x1, x2,

x3, , x,,_1 into n equal subintervals of length Ax =b

n a but we willn

assume now that n is an even number for reasons which will soon becomeobvious. Now we erect the ordinates at all division points (see Fig. 11.40).The ordinates at division points with an even subscript we represent by asolid line and the ordinates at division points with an odd subscript werepresent by a broken line.

Now, instead of replacing the curve within each subinterval by a straightline, we will replace the curve between each two consecutive division pointswith an even subscript x2k, x2k+2 by a parabola y = Axe + Bx + C whichpasses also through the point [x2k+l,f(x2k+1)). It seems quite apparent thata much better approximation is obtained in this manner. Now all we have

to do is find the equation of the 2 parabolas involved, apply formula

Y

-x

Fig. H.40

Page 155: ɷshawns sagan integral and differential calculus

11.14. SIMPSON'S RULE 143

(11.51) from Section 13 to find the areas under those parabolas, and formthe sum of all these areas.

Again, we let a = x0, b = x,, to unify the notation. Further, we let

f(xk) = y7,, k = 0, 1, 2, 3, ... , n.

Now let us consider the subinterval x2k < x < x2k+2 (see Fig. II.41). Wehave to fit a parabola y = Ax2 + Bx + C through the three points

(x2k, y2k), (x2k+1, Y2kt1), (x2k+2+ Y2k+2)

and then find the area under the parabolic are. We note that the area underthis parabolic arc depends only on the distance Ax between consecutivedivision points and the lengths of y2k, Y27,t1 and y2k+2 It is quite immaterialwhere x2k+1 is located relative to the origin. We can simplify our problemto some extent if we assume that x2k+1 = 0, i.e., shift the entire configuration

in Fig. 11.41 by x27,+1 units to the left (see Fig. 11.42). Since Ax =b - a

we have now x2k = -Ax = - b a and x27,+2 = Ax =b a Thus we

n nobtain for the area under the parabolic arc y = Ax2 + Bx + C betweenX2k and x2k+2 in view of (11.51),

(11.52)Ox

(Ax2 + Bx + C) dx = A(Ax)3 3(-Ax)3 + B (6.x)2 -( -0x)2

J-oX+ C[Ax - (-Ax)]

3= 2A (Ox) + 2CAx.3

We note that B has dropped out entirely due to the shift of x2k+1 into theorigin of the coordinate system and thus only A and C are left to be deter.mined so that y = Ax2 + Bx + C passes through the three points (-Ax,y2k), (0, Y2k+1), (Ax, y2k+2)

Y2k = A(-Ax)2 + B(-Ax) + C

y2k+1 = C

Y2k+2 = A(0x)2 + B(Ax) + C.

Addition of the first and the last equation yields

(11.53) Y2k+2 + Y2k = 2A(0x)2 + 2C.

Page 156: ɷshawns sagan integral and differential calculus

144 CHAP. II. AREAS

x 2k x2k+1

Fig. 11.41

From the second equation we have

(11.54)

and hence, in view of (11.53),

(11.55)

C = Y2k+1

X2k+2

A = ?/2k+2 + ?12k - 2Y2k+12(Ax)2

Fig. 11.42

Page 157: ɷshawns sagan integral and differential calculus

11.14. SIMPSON'S RULE 145

If we substitute the values for A and C from (11.55) and (11.54) into (11.52),we see that the area A2k under the parabolic are between x2k and x2k+2 is

(11.56)

A2k = Y2k+2 + 1/2k - 2y2k+1 2( x)3 + 2y2k+1Ax2(0x)2 3

= (Y27,+2 + y2k - 2y2k+1 + 2y2k+1 r Ax = Y2k+2 + 4y2k+1 + y2kAX3 / 3

Now we sum all these areas A0, A2, A4, An-2 and obtainn-12 OxL,o A2k = 3 [(Yo + 4y1 + y2) + (Y2 + 4y3 + Y4) + (y4 + 4y5 + y6)

+ ... + (yn-4 + 4yn-3 + yn-2) + (yn-2 + 4yn-1 + yn)]Ax= 3 (yo+4y1+21/2+4y3+2y4+4y5+2ys+...+2yn-4

+ 4yn-3 + 2yn-2 + 41/n-1 + 1/n).Thus we obtain the following approximation to the definite integral of thefunction f (x) between a and b, if we also substitute for Ax its valueAx=b -a

n

('b(11.57) J .f(x)dx-b3na(yo-f 41/1-F 2y2+4y3-f 2y4+...

a

+ 2yn.-2 + 4yn-1 + yn)This is called Simpson's rule.

This rule is easy to remember: the first and the last ordinate yo and ynare each taken once, all other even-numbered ordinates are taken twice andall the odd-numbered ordinates are taken four times.

In order to obtain some insight into the effectiveness of this formula, wewill try it out in computing an approximation of the area of one quadrant

of the unit circle, which will yield an approximation to 4 . We will see

that the approximation which we will thus obtain is better than the one wearrived at by using the trapezoidal rule in Section 11, but not as good asthe one obtained in Section 5.

The unit circle has the equation

y= 1 -x2.We subdivide the interval 0 < x S 1 into 32 subintervals and compute thevalues of yk by making use of square root tables. *

* Standard Mathematical Tables, CRC, 12th ed. pp. 210-229.

Page 158: ɷshawns sagan integral and differential calculus

146 CHAP. II. AREAS

We obtain :

1/o = 1 32y1 = 31.98437

32Y2 = 31.93744 321/3 = 31.85906

32y4 = 31.74902 321/5 = 31.60696

321/6 = 31.43247 32y7 = 31.22499

321/8 = 30.98387 32y9 = 30.70831

32y10 = 30.39737 32y11 = 30.04996

321/12 = 29.66479 321/13 = 29.24038

32y14 = 28.77499 321/15 = 28.26569

321/16 = 27.71281 321/17 = 27.11088

32y18 = 26.45751 32y19 = 25.74879

32Y20 = 24.97999 321/21 = 24.14539

321/22 = 23.87467 321/23 = 22.24860

321/24 = 21.16601 32y25 = 19.97498

321/26 = 18.65470 321/27 = 17.17556

32Y28 = 15.49193 321/29 = 13.56466

321/30 = 11.13553 32y31 = 5.74456

1/32 = 0

Thus,k=15 k=15

2 .1 Y2k = 24.02363, 4 1 y2k+1 = 50.10675k=1 k=0

and consequently, after elementary operations

7r ti 3.13043

as compared to the value 3.12981 which was obtained by the trapezoidalrule using the same number of subintervals.

Problems 11.84-11.87

11.84. Use Simpson's rule to find

f1

(2x2+3x+1)dx.

Take n = 2, 4, 8. How do you always obtain the same result?11.85. Use Simpson's rule to find an approximation for

dx.

Page 159: ɷshawns sagan integral and differential calculus

U. 15. INTEGRAL WITH VARIABLE LIMIT 147

an approximation to11.86. Use Simpson's rule to findjsinxdx.

Take n = 4. (For values of sin x, see Appendix III, Section 2.)11.87. Use the trapezoidal rule for the integral in Problem 11.86 with the same

number of division points. The true value of this integral is 2. Compare theresults.

15. DEFINITE INTEGRAL WITH A VARIABLE UPPER LIMIT

We have seen in Section 13 that the area under a parabolic arc y = x2between x = a and x = b is given by

fbx2 dx = b3 - a3 .

a 3 3

Let a = 0 in the following discussion for reasons of convenience. Now letus consider various values of b and evaluate this area in every instance.

First, let b = 0, then we obtain

Next we let b = J. Thenf

0

0

x2dx=0.

0i24f x2 dx 1

For b = 1, we obtain f x2dx = 1 .Jo 3

2

Forb=2, f x2dx=3.0

3

Forb=3, f x2dx=9.0

9

x2dx=243.For b = 9, fo

12

For b = 12, f x2 dx = 576, etc.° b

We see that the value ofJ

x2 dx depends on the value which we assign torb 0

b, or as we can say: J x2 dx is a function of b. This is also quite clear from0

looking at Fig. II.43, where we have drawn three vertical lines at three

Page 160: ɷshawns sagan integral and differential calculus

148

Y

0 bl b2 b3

Fig. 11.43

CHAP. H. AREAS

different points bl, b2, and b3. Clearly, the areas from 0 to b1, b2i b3 are alldifferent. b

We express the fact that f x2 dx is a function of b in the customary form0

fb

0

x2 dx = F(b).

Before we continue, let us note that it is really immaterial what letter web

use to designate the integration variable. Instead of x2 dx we mightfob rb

just as well write I u2 du orJ

t2 dt or use any other letter, for that matter.Jo o

since the integration variable does not explicitly occur in the final result.(The integration variable is a "dummy" just as the summation subscriptin a sum.) In general, we can see that

Ib

$f(u)b

Pf(x) dx = du = (t) dt =...

If we use u now instead of x for the integration variable, we have

fb

u2I

du = F(b)0

Since it is customary to denote a variable by x, we write x instead of b anchave

u2 du = F(x),fx

o

Page 161: ɷshawns sagan integral and differential calculus

II. 15. INTEGRAL WITH VARIABLE LIMIT 149

i.e., the area under the parabolic are y = x2 between the fixed lower limit0 and the variable upper limit x is a function of x. (Note that we havedenoted the integration variable by u so that we do not get it mixed up withthe upper limit.)

In general, if we have some function y = f'(x) for which the integralexists, we can say that the area under the curve y = f(x) between a fixedlower limit a and a variable upper limit x is a function of x:

rx

af (u) du = F(x).

We will now investigate what happens to the area F(x) as x increases.For example, in the case of the area under the parabolic arc which we dis-cussed initially [f(x) = x2], we see that if x increases from 0 to 1, then thearea increases from 0 to 3. If x increases from 1 to 2, then F(x) increasesfrom 3 to 3. If x increases from 2 to 3, then F(x) increases from s to 9,etc. We see that (at least in this case) even though x increases in alllisted instances by the same amount, the corresponding increase of the areaF(x) differs from case to case.

We will now try to establish a general formula which will enable us tomeasure this increase for any x. For this purpose we consider a secondpoint (see Fig. 11.44) h units to the right of x which has the x-coordinatex + h and consider the areas between a and x

f xf (u) du = F(x)

a x x+hFig. 11.44

Page 162: ɷshawns sagan integral and differential calculus

150

and between a and x + hfx+h

a

CHAP. II. AREAS

Then, the corresponding increase (change) in area is given by

F(x + h) - F(x)(per h units of x-axis).

Thus, the approximate increase of the area per unit increase of x is givenby

(II.58)F(x + h) - F(x)

h

The numerator of this expression is represented by the shaded region inFig. 11.44. If h is very small, we can approximate this shaded region by atrapezoid as indicated in Fig. 11.44. The area of this trapezoid is given by

(11.59) Ax,x+h(trapezoid) = hf (x + h) +f(x)

2

Hence, we can state that the change of area per unit increase of x is approxi-mately given by

(11.60)F(x + h) - F(x) ,f (x + h) -+- .f (x)

h 2

Clearly, this approximation becomes more accurate the smaller h isbecause the error we make in approximating the curve y =.f(x) by astraight line segment in the interval between x and x + h seems to decreaseas h becomes smaller and it is intuitively clear that the error tends to zeroas h tends to zero-if the functionf(x) is sufficiently well behaved.

Therefore, we define the rate of change of the area F(x) with respect to xas

limF(x + h) - F(x)

h-.0 hWe observe that

limf(x + h) +.f(x) _.f(x) +f(x) =.f(x)h-0 2 2

provided that lim f (x + h) = f (x), which we will assume, and in view ofh-.0

our observation according to which the error in (11.60) tends to zero as htends"to zero, we obtain

(11.61) limF(x + h) - F(x) = AX)

h-+0 h

Page 163: ɷshawns sagan integral and differential calculus

11.15. INTEGRAL WITH VARIABLE LIMIT 151

for the rate q f change of the area under y = f (x) as a function of the upperintegration limit.

We call the rate of change

lim F(x + h) - F(x)h-o h

provided it exists, the derivative of the function F(x) with respect to x anddenote it by

(11.62) F'(x) = lim F(x + h) - F(x)h-o h

We can now state our result in (11.62) in the following words: The rate ofchange of the area F(x) under the curve y = f(x) between a fixed lowerlimit a and a variable upper limit x is given byf(x), or, if

F(x)= J'a

f (u) du,

then

F'(x) = f (x).

This relationship ties up the operation of integration between a fixedlower limit and a variable upper limit with a limit operation as defined in(11.62). This limit operation is called differentiation. If we integrate afunction f (x) between a fixed lower limit and a variable upper limit x, weobtain a function F(x), and if we differentiate this function F(x) again, wecome back to. f(x). We therefore call the two operations inverse to eachother, in the same way that we call squaring and extracting the square rootinverse operations. (If we square the number 2, we obtain 4; then if weextract the positive square root of 4, we obtain 2 again, the number withwhich we started.)

Now, suppose we have a function f (x). Then the area under the curvewhich represents this function between 0 and a variable upper limit x isgiven by

A0,x = f" f (u) du = F(x)0

where

Let x = a, then

let x = b, then

F'(x) = f (x).

AO ,a =ff(u) du = F(a);

Ao,b = f b f (u) du = F(b)0

Page 164: ɷshawns sagan integral and differential calculus

152 CHAP. If. AREAS

and hence

Aa,b = Ao,b - AO,a = o f (u) du - foa f (u) du = F(b) - F(a)Jo J

or

(11.63) f f (u) du = F(b) - F(a),a

where F'(x) = f (x).The reader may get the impression that we have not been quite consistent

with our notation. Originally, we called

j'f(u) du = F(x)

and now we used px

Jo

f (u) du = F(x).

It seems that these two functions, both of which we called F(x), cannot bethe same. Still, a simple argument will reveal that they are "almost" thesame in the following sense:

Since the area under y = (fx) between 0 and x can be obtained by addingto the area between 0 and a the area between a and x (where 0 < a < x),we have

because

J xf(u) du =fa

f (u) du + fx f (u) du0 0 a

Jf(u) du = Ao,a and fa" f (u) du = A,,,x.0

Now, Jf(u) du is some constant number (the area under y = f(x)

between r0 and a). Therefore, the function which is represented by the

integralJ

zf (u) A which we called F(x) at first differs from the function

represented byJ

xf (u) du, which we called now F(x) by some constanta 0f(u) du only. So, if we want to be fussy, we could writenumber C = Io

fof(u) du = F(x) + C

for some constant number. Thenfpa

0

f (u) du = F(a) + C

f}f(u)du=F(b)+Co

Page 165: ɷshawns sagan integral and differential calculus

11.15. INTEGRAL WITH VARIABLE LIMIT 153

and, consequently, we obtain again formula (11.63):

f bf(u) du =fO bf(u) du - f af(u) dua 0

= F(b) + C - F(a) - C = F(b) - F(a).

So we see that our little inconsistency did not have any harmful con-sequences at all. We will dwell on this problem in more detail in ChapterIII, Section 5.

We recall now that we have already obtained the following integrationformulas (see I1.48 and Problems 11.75 and 11.76):

Jdu = b - a

f 2

u du =b a2

2 - 2a

fbut dub3

- a3 .a 3 3

Thus we find in the case of f(x) = 1 that F(b) = b, F(a) = a, i.e., F(x) _

0

du = x and we see thatIx

(x)' = I.x x2

Forf(x) = x, we have F(x) =f udu = -2

and, consequently,0

(x2),

(2-x

and, finally, for f (x) = x2, we have F(x) = fox u2 du 3 and, therefore,

xs

3/ = x2.

The next chapter will be almost exclusively devoted to a study of thisnew operation of differentiation which we introduced in this section, andthere we will find an entirely different method for the derivation of thethree differentiation formulas which we listed just now.

Problems 11.88-11.92

11.88. Find F(x) = $f(u) du for the following functions:

(a) f (X) = x2 + 3x (b) P x) = - 3x2 + 4(c) f (x) = 2x2 + 3x - 1 (d) f (x) = x2 + 13x + 7(e) f (x) = -4x + 2 (f) f (x) = (x + 3)2

Page 166: ɷshawns sagan integral and differential calculus

154 CHAP. II. AREAS

II.89. Using the three differentiation formulas which are listed at the end ofthis section and some imagination, find F'(x) for the following functions:

2

(a) F(x) = 2x (b) F(x) = 4 + x

(c) F(x) = -2x3 + 2x2

(e) F(x) = 4x3 - x

(d) F(x) = x3 + 3x2 - 4x

2x3 5x2(f) F(x) = 3 +_2 - 7x

II.90. The area of a right isosceles triangle of side length x is given by

xA(x) = u du.

0

Justify this formula. Find the rate of change of this area as x = 4.11.91. Same as in Problem 11.90 for the area under the parabolic arc y = x2

between 0 and x:x

A(x) = J u2 du.0

b

II.92. G(x) = f g(u) du is a function of the variable lower limit x. Express

G(x) as a function of a variable upper limit r and find the rate of change of G(x)with respect to x. (Hint: Note that a switch of integration limits means thatIthe widths of the inscribed rectangles in the lower sum are now measured fromleft to right, i.e., the sign is changed.)

Supplementary Problems II

I.I. The circumference of a square of side length a is given by C = 4a.Express the area A of the square as a function f (C) of the circumference. Whatis f(x)?

1.2. Express the circumference C of a square of side length a as a functiong(A) of its area. What is g(x)? What is f [g(x)] where f is the function definedin Problem 1.1 ?

1.3. Assume that the area measure of a rectangle of dimensions a and b isgiven by

A[r(a, b)] = ab2.

Demonstrate with the rectangles r(2, 3) and r(2, 5) that condition 4 (p. 76) isstill satisfied. However, condition 3 is not satisfied any more.

1.4. Given a function F(x) and a function F(y) where x and y are independentvariables that are in no relation to each other. Show that if F(x) = F(y) is tohold for any pair of values (x, y), then F(x) = F(y) = constant. (Hint: Assumethe opposite to be true, i.e., F(x) and F(y) are not constant and show that thisleads to a contradiction.)

2.1. Let a, b, c be the lengths of the three sides of a triangle. Show by inspec-tion that a + b > c and a + c > b and b + c > a.

2.2. What happens if in Problem 2.1 a + b = c?

Page 167: ɷshawns sagan integral and differential calculus

II. SUPPLEMENTARY PROBLEMS 155

2.3. The perimeter of an equilateral triangle of side length a is given by C =3a. Express the area A of the equilateral triangle as a function f (C) of the per-imeter. What is f (X) ?

2.4. Express the perimeter C of an equilateral triangle of side length a as afunction g(A) of its area. What is g(x)? Find g[f(x)] where f is the functiondefined in Problem 2.3.

2.5. Find the area of the quadrangle with vertices at A(0, 0), B(1, 0), C(3, 5),and D(1, 3) in two different ways by considering two different partitions of thequadrangle into triangles.

3.1. Write down the general term of the following sequences:

(a) 1, -2,4, -9,16, -25, (b) 1, 4' 9' 16' 25

(c) (d) 1, -4' 16'64'256...

3.2. Write the numbers as the limit of a sequence.3.3. What is the limit of the sequence 0.9, 0.99, 0.999, 0.9999, ?

k3.4. Evaluate the sum 00

99

A (T-60)n n n

3.5. Simplify k2 - 2 k + y 1.k=1 k=1 k=1

3.6. Write 1x

x2 as an infinite series.

4.1. Evaluate the following limits

(a) limrte+5n+6

n-00n2 -

2

non(c) n-o 2n(Vn - 1)

4.2. Findn2(a) lim I

n- (n - 1)2

n-.oo

(c) lim [4n2 - 16n2

n-+oo L (4n - 1)2

(b) lim2n3

+

12)s 1

n-00 (n-

(d) lim4n3 1

n-oo2n4-+ 4

n(b) lim ('fin - _.,1-00Vn+1

(d) lim 1 12n - 4n + 1

n-34.3. Find

2 / _ 2

(a) li FO 4x2 ± 1 (b) lim lx2 + 11

x3+3x2-4x+1 (x+l)(x-1)(c)

zi o (1 - x)3(d) lim

Z-0 X2 - 4

4.4. Let an = 2n + 1 and assume that lim an = 1. Choose a sequence bnn - 1 n-co bn

(bn 0 an for all n) for which the stated limit relation holds.

Page 168: ɷshawns sagan integral and differential calculus

156 CHAP. II. AREAS

1

4.5. Choose a sequence cn (ci T c,, for i : k) so that lim6c

= 1.

4.6. Let91-+00

1 when x = 1 for all integers nf (X) = n n

0 for all other values of x.

Does lim f(x) exist? If so, what is its value?x-O

4.7. Evaluate

(a) lim(a + x)2 - a2

1 1

(b) limx

I h xh-. 0z-O x

(c) lim(z - k)3 - z3

k-.O k

n

Vx+ h -

'Vx

(d) limh-o h

5.1. Show that if it is true that for all elements of a sequence an the followinginequality holds a _< a -< A for some numbers a < A, then there exists atleast one number L in a < L < A so that there are infinitely many elements ofthe sequence a,, in any neighborhood of L, no matter how small this neighbor-hood is. (Hint: There are infinitely many points representing the elements ofthe sequence in the interval a < x < A. If you cut this interval in half, then

there are infinitely many points in a < x < a 2 A , or in a2

A < x < A,

or in both. Now, repeat this argument for the one of these two intervals thatcontains infinitely many points, etc....)

5.2. Show that if an < a,)+i for all n in the sequence in Problem 5.1, thenthere can be only one such point L with the property that infinitely manyelements of the sequence are located in any neighborhood of L, no matter howsmall this neighborhood is.

5.3. Consider the inequality (11.25) on p. 101 for n = 2. Is it still valid?5.4. Same as in Problem 5.3 for n = 1.5.5. A circular ring is a region that is bounded by two concentric circles of

different radii. Find the area of a circular ring of inner radius 1 and outerradius 2.

5.6. Consider a line segment of length 2 which is tangent to the circle ofradius R and center at the origin at the point (0, R) so that the point of tangencycuts the segment into two parts of equal length. Now consider a circular ringof inner radius R and an outer radius that is chosen so that the outer circlepasses through the endpoints of the line segment. Make a sketch. Show thatthe area of the circular ring thus obtained is independent of R, i.e., no matterhow large R is, this area is always the same.

6.1. Find the area of a regular heptagon in terms of the radius of the circum-scribed circle.

6.2. Find the area of a regular octagon in terms of the radius of the circum-scribed circle.

Page 169: ɷshawns sagan integral and differential calculus

II. SUPPLEMENTARY PROBLEMS 157

. Express the6.3. An unenlightened State Legislature once defined 7T as22

error they committed in per cent of the true value to four significant figures.6.4. The radius of the earth is approximately 4000 miles. Based on this

approximation, find the length of the equator, assuming that the earth is asphere.

7.1. Find sin 12 . [Hint: Use half-angle formula, Appendix III, formula(AIII.8).]

7.2. Find sin 12 . (Hint: Note that 12 = 3 + 4 and use the formula which

was developed in Problem AIII.14.\

7.3. Derive the relation (11.30) considering the area of the circular sector ofradius 1 and curved boundary of length x, the triangle OCB and the triangleOCD (see Fig. 11.17).

8.1. Find an approximation to the area under the curve y = 1 + x2 betweenx = 0 and x = 1 by taking S8.

8.2. Same as in Problem 8.1 taking S8.8.3. Find a better approximation to the area in Problem 8.1 by taking the

arithmetic mean of S8 and S.-9.1. Given the function

xfor0 <x <1f(x) -

Find I " f (x) dx by geometric inspection.0

9.2. Same as in Problem 9.1 for the function f (X) 1 - x2 between thelimits x = 0 and x = 1.

4

9.3. Show by geometric inspection that f (x) dx = 0 where f (x) is thefofunction represented in Fig. I.41.

9.4. Let0 for 0 <x <3

f(x) = {not defined for 3 < x < 3.00000000000011 for 3.0000000000001 <- x < 5.

5

es f(x) dx exist?fo

Do

9.5. Let 0 < g(x) < f (x) in a -< x < b. Interpret the formula

(x) dxIrb[f(x) - g(x)] dx =J bf(x) dx -Igb

a a

geometrically. Is this a true formula?

Esin

Does it also hold if f (x) = g(x) for all x ?

10.1. Find an approximation to x dx by the trapezoidal rule, using

n = 6 subintervals. (For values of sin x, see Table AIII.2 in Appendix III,Section 2.)

Page 170: ɷshawns sagan integral and differential calculus

158 CHAP. II. AREAS

10.2. Same as in Problem 10.1 for fnsin2 x dx.

r610.3. Find an approximation to J V'x3 - 1 dx by the trapezoidal rule using

n = 5 subintervals. 1

10.4. Find an approximation to f5

2x21+ 1 dx by the trapezoidal rule usingn = 6 subintervals. J

11.1. Givenx+1for0<x<1f(x)-t-x+3for1<x<2.

Find 19f (x) dx by a limit of a sum process.0

I

b

1.2. Show that dx = b - a.1

11.3. Find the area of a region which is bounded by y = x, y =

2

and x = 1by a limit of a sum process.

11.4. Find the area of the region bounded by y = 1, y = -x, y = x by alimit of a sum process.

12.1. Show that3 5k2 =Ik- I

k=1 k=112.2. Show that

12.3. Show that

2 5

3,1 k2 =5 k.k=1 k=1

ak = L. ak-vk=0 k=vn-1 n-2

12.4. Show that k2 = I (k + 1)2. (Hint: Use the result of Problem 12.3.)k k=0

12.5. Simplify

and evaluate.13.1. Evaluate

n-1 n-1 n-1jk2 +n57k+2n2jlk=1 k=1 k=1

1f(a) (x2 + x + 1) dx (b) J (3x2 + 2x + 1) dx1

o

(c) f (4x2 - 100) dx (d) J0003x2

dxJ 5 1

13.2. Find the area of the region between y = f (x), the x-axis, x = a, andx = b:

(a) f (X) = x2 - 4, a = 2, b = 3 (b) f (x) = x2 + 2x + 1, a = -1, b = 0(c) f (x) = 2x2 - 50, a = 5, b = 6 (d) f (X) = x2 - 9, a = 2, b = 4.

Page 171: ɷshawns sagan integral and differential calculus

II. SUPPLEMENTARY PROBLEMS 159

13.3. Find the area of the region bounded by y = 4x2 + 1, the x-axis, x = 0,and x = 4 and compare the value with the approximative value obtained fromthe trapezoidal rule with 8 subintervals. Express the error in per cent of the truevalue.

13.4. Find the area of the region encompassed by

y = Jx2 and y = 5 ox.13.5. Find the area of the region encompassed by

y=x2 +1andy=-x2+2.14.1. Show by inspection that

/'h h(a) ( x3 dx = 0 (b)

hx2 dx = 2J x2 dx

hJ

o

(c) x dx 0 (d) Jfh

rh dx = 2

I

hdx = 2hh 0

14.2. Show that('h hf (Ax3 +Bx2 + Cx + D) dx = f (Bx2 + D) dx

h h

(Hint: Use the results of Problem 14.1.)14.3. Show that Simpson's rule yields the exact result for an integral of the

type

Erh

J

(Ax3 + Bx2 + Cx + D) dxh

even though a curve of third order is approximated by a quadratic parabola inSimpson's rule. (Hint: Make use of the result in Problem 14.2.)

14.4. Use Simpson's rule to obtain an approximation for

fsin2 x dx

using n = 6 subintervals. Compare your result with the approximation obtainedin Problem 10.2 by using the trapezoidal rule and with the exact value which is

found to be 2 . (Try to establish 2 as the exact value of this integral by geo-

metric inspection.)14.5. Use Simpson's rule to find an approximation for the integral

f2dxx

for n = 16 subintervals.

15.1. Find F(x) = fxf (u) du for the following functions

0

(a) f (x) = x2 - 2x + 1 f ( x )

(c) f (x) = (2x - 3)2 (d) f(x) = x - 1

Page 172: ɷshawns sagan integral and differential calculus

160 CHAP. II. AREAS

15.2. Find F'(x) for the following functions

(a) F(x) = x3 + 4x + 1290 (b) F(x) = (x - 1)2(x + 1)2 4 -

(c) F(x) =3

+ 2 + x + 1 (d) F(x) = x - 26

15.3. The area under y = x2 + 2x + 3 above the x-axis between x = 0 and .xis given by

A(x) = J (x2 + 2x + 3) dx.0

Find the rate of change of A(x) at x = 1, 3, 7.15.4. Same as in Problem 15.3 for the area under the curve

15.5. Show thaty =x2-x + 10.

ff(u)du + I f (U) du = 0.z

(Hint: See also Problem 11.92.)

Page 173: ɷshawns sagan integral and differential calculus

CHAPTER III

RATES

1. THE DERIVATIVE

In the preceding section, we accidentally came across the limit of thequotient

Mr 1) F(x + h) - F(x)h

as h - 0. In this section we will attempt to attribute some geometricsignificance to this limit quite independently of its meaning in the contextof the preceding section. We will henceforth refer to the quotient in (111. 1)as the d fference quotient of the function F(x) because it is the quotient ofdifferences: the numerator is the difference of two values of the functionF(x), namely, F(x + h) and F(x), and the denominator is the difference oftwo values of x, namely, x + h and x itself.

Let us consider the functiony = F(x)

which is represented by its graph in Fig. 111. 1 (we assume that it has agraph), and let us try to give a geometric representation to the differencequotient (111. 1) and, if possible, also to its limit ash - 0. For this purpose,we consider a fixed point (x, 0) on the x-axis and some other point at adistance h from x, with the abscissa x + h (see Fig. 111. 1). Then we erectthe corresponding ordinates which terminate in the points denoted by Pand R. We can see now that the difference quotient (III.1) represents theratio of the distances RQ to QP, i.e., the slope of the line segment from P toR. We call the segment PR a chord or a secant (Latin: secare = to cut) ofthe curve y = F(x). We can thus state

F(x + h) - F(x)h is the slope of the chord joining P[x, F(x)] and

R[x + h, F(x + h)].

Thus far everything has gone quite smoothly. Now let us proceed to themore complicated task of trying to interpret the meaning of the limit of

161

Page 174: ɷshawns sagan integral and differential calculus

162

y

CHAP. III. RATES

F(x + h) - F(x)

xx x+h

Fig. III.1

(111. 1) as h - 0. In Fig. 111.2 we have drawn the chords for progressivelysmaller values of h, for positive as well as negative values of h. In all cases,we drew the continuation of the chord to both sides beyond the curve indotted lines. We can see that the smaller h is, the closer the position of thechord is to what we would intuitively call the tangent line to the curve atthe point P[x, F(x)] and which is drawn in a bold line.

So far we have not defined what we mean by tangent line to a curveexcept for the case of a quadratic parabola (see Chapter 1, Section 10). Weremember that in case of a parabola, we considered first a line which inter-sects the parabola in two distinct points and then we let the one point slideinto the other one and called the line which was thus obtained, the tangentline to the parabola. But this is exactly what we are doing now in a moregeneral form. More general, insofar as we consider now any curve y =F(x), and not necessarily a quadratic parabola. We give the definition:The tangent line to the curve y = F(x) at the point P[x, F(x)] is the linewhich passes through this point P and has a slope which is equal to thelimit of the difference quotient (11I.1) as h - 0 (from either side), providedthis limit exists. (If the limit does not exist, then the tangent line does notexist either.)

Thus :

(111.2) limF(x -{- h) - F(x)

is the slope of the tangent line to y = F(x)h-0 h

at the point P[x, F(x)], by definition.

Page 175: ɷshawns sagan integral and differential calculus

III.1. DERIVATIVE 163

In the preceding section, we introduced the term derivative for this limitand denoted it by F'(x). Thus we can state in this terminology:

The slope of the tangent line to the curve y = F(x) at the point P[x, F(x)]is given by its derivative F'(x).

To illustrate how the nonexistence of the derivative at a certain point hasthe nonexistence of the tangent line at this point as a consequence, andvice versa, let us consider the example

y = Ixi.

We have seen in Chapter I, Section 12, that this function is represented bythe graph in Fig. 111.3. We see by inspection that this curve does not havea tangent line at x = 0 and, indeed, it can be demonstrated that the deriva-tive fails to exist at this point. If we denote the derivative of F(x) = jxi atthe point x = 0 by F'(0), we have

F'(0)=limIx+hl-1xIl =limjj.h-0 h atx=o n-.o h

Now, we have to distinguish between positive and negative values of h, i.e.,we have to distinguish between an approach to the origin from the rightand from the left:

If h > 0, then hi = h and

F'(0) = lim h = lim 1 = 1.h-o h h-o

-x

Fig. 111.2

Page 176: ɷshawns sagan integral and differential calculus

164 CHAP. III. RATES

Y

x

Fig. III.3

On the other hand, if h < 0, then Jhi = -h and

F'(0)=limh=lim(-1)= -1.h-o h n-.o

Thus we see that the limit does not exist because we obtain differentvalues, depending on whether we approach the origin from the right orfrom the left. This is geometrically quite obvious. If we approach theorigin from the right, all chords (which incidentally coincide with the curveitself) have the slope 1 and if we approach from the left, all chords havethe slope - 1. At the origin itself, there is no tangent line, as pointed outearlier.

Before we consider further examples, let us introduce some technicalterms. The process by which the derivative of a function is found is calledderentiation process, as we mentioned in the preceding section, the corre-sponding verb being "to differentiate." (The derivative is sometimesreferred to as the differential quotient.) Specifically: we differentiate the

function F(x) with respect to x, meaning that we find the derivative F'(x).This terminology finds its counterpart in the following, often very useful,notation:

F'(x)

(Read: "de of of ex over de ex.")However, this notation is to be handled with great care. The d's in

dFdcx)are symbols and not numbers. So, do not attempt to cancel the d's,

unless you want to wipe out the harvest of many centuries of mathematicalthought and effort.

Page 177: ɷshawns sagan integral and differential calculus

111. 1. DERIVATIVE 165

Because the values of F(x) are customarily denoted by y: y = F(x), we

also use dx to denote the derivative of the function y = F(x) with respect

to x. Summarizing, we list all the customary notations for the derivative

F'(x) =dF(x) = d F(x) = dy = y'.dx dx dx

If we have, for example, the function w = G(z), then the derivative of Gwith respect to z can be designated by either one of the following symbols:

G'(z) = dG(z) = d G(z) = dw = w'.dz dz dz

Let us now consider a few simple examples. First, we take the function

y=Cwhere C is a constant number. This function is represented by a horizontalline C units away from the x-axis (see Fig. III.4). Clearly, the tangent lineto this curve coincides with the curve itself and has the slope zero. Thiscan also be seen from the formal standpoint by taking the derivative (or,as we may say, by differentiating C with respect to x) :

dC=limC-C=limo=0,dx r-o h 4-o

i.e.,

(III.3) = 0.

In words: The derivative of a constant is zero.Next, we take the function

y=x.This function is a straight line with slope 1. The tangent line coincides withthe graph and, hence, the derivative has to be 1. This is indeed the case

y=c

x

Fig. III.4

Page 178: ɷshawns sagan integral and differential calculus

166 CHAP. 111. RATES

as we can verify directly:

dx=limx-{-h-x=limb=liml=l,dx h h h-.o

i.e.,dx1.

(111.4)=

d x

(Again, we wish to point out that this result cannot be obtained simply bycancellation of dx in numerator and denominator, even if it may lookappealing. d is a symbol, hence dx is a symbol and not a number.)

Finally, we consider the quadratic parabolay=x2.

We find for its derivative

dx2 = lim (x + h)2- x2 = limx2 + 2hx + h2 - x2 = lira 2hx + h2

dx h- o h h- 0 h h-.o h

= lim (2x + h) = 2x,h-0

dx2-=2x.dx

(Now, try to get this result by cancellations without any hokus-pokus!)We remind the reader that the result in (111.5) is in agreement with the

result which we obtained in Chapter 1, Section 10, where we discussed thetangent to the parabola. We also point out that the formulas (111.4) and(111.5) seem to be in agreement with the differentiation formulas which welisted at the end of the preceding section.

Problems III.1-III.4

III.1. Given F(x) = x2. EvaluateF(x + h) - F(x)

hat the point x = 1 for the

following values of h: h = 1, 0.5, 0.2, 0.1, 0.01, 0.001. Observe that the sequenceof values thus obtained tends to the limit 2 in agreement with (111.5).

III.2. Do the same as in Problem 111.1 with the function F(x) = x3. Make aguess about the limit as h -- 0 and compare your result with the formula

(c) = x2 from Chapter II, Section 15.3

I11.3. Simplify the difference quotients for the following functions F(x) to apoint where the factor h in numerator and denominator cancels out:

(a) F(x) = x3 (b) F(x) = -,Ix

(c) F(x) = z (d) F(x) = (1 - x)2

Page 179: ɷshawns sagan integral and differential calculus

111.2. DIFFERENTIATION RULES 167

111.4. Consider the difference quotient of the function y = sin x. By a skilfulmanipulation of the identity in Problem AIII.14 of Appendix III, Section 4, itcan be shown that

sins - sin # = 2 cos « 2 # sin «2

#

Let a = x + h, # = x and simplify the difference quotient until you can write

it as a product of sin h/2h12with some other function.

2. DIFFERENTIATION RULES

We will devote this section to a few basic rules which serve to facilitateto some extent the differentiation process of functions.

First, let us consider the function

y = 4x2.

According to the definition of the derivative given in (111.2), we obtain

' = lim 4(x + h)2 - 4x2 = lim 4x2 + 8hx + 4h2 - 4x2Y1

h-0 h h

h h-+0

In the preceding section we saw that

dx2 = 2x

and we see now thatdx

d(4x2) = 4 2xdx

2

which appears to be simply 4dx

That it is true, in general, that a multiplicative constant remains un-changed upon differentiation, can be seen quite easily. Let us consider

y = cf (x)

where c is a constant and f (x) is a differentiable function, i.e., a functionwhose derivative exists. We obtain from the definition of the derivative

y' = lim cf (x + h) - cf (x) = lim cf (x + h) - f (x)h-0 11 h-0 h

= c lim f (x + h) - f (x) _.cf'(x)

h

Page 180: ɷshawns sagan integral and differential calculus

168

Thus we can state

(111.6)d [cf (x)] = c df (x)

dx dx

CHAP. III. RATES

or in words: a multiplicative constant remains unchanged upon differenti-

ation.Next, we consider the example

y=3x2+2x-5.This function is the sum of the three functions

y1=3x2,y2=2x,ya=-5

and we know from the preceding section and (111.6) that

yl' = 6x,

Y2 = 2

ys = 0

[see (111.3, 4, and 5)].Now, if it were true that

d

dx(yi + y2 + Y3) = yip + y2' + yap'

then we would have right away

y'=6x+2+0.That this is, indeed, the case can be seen by a direct check:

d 3(x+h)2+2(x+h)-5-3x2-2x+5Y =-(yi+y2+ya)=1im

dx h-o h

=lim3x2+6xh+3h2+2x+2h-5-3x2-2x+5

h-o It

=lim6xh+3h2-+ -2h=lim(6x+3h+2)=6.z+2.h-0 h h-0

(We wish to point out that this result also follows from the formula (1.38)which we developed for the slope of the tangent line to a quadratic pa-rabola.)

Again, we can easily see that the derivative of a sum of functions can beobtained by differentiation of each term in the sum. Consider

y =fl(X) +f2(x)

Page 181: ɷshawns sagan integral and differential calculus

111.2. DIFFERENTIATION RULES 169

where fl(x) and f2(x) shall be differentiable functions. Then

[.fi(x) + f2(x)] = limfl(x + h) +f2(x + h) -.fi(x) -f2(x)y' = dx h-0 h

= iim (fl(x + h) -f1(x) +f2(x + h) -f2(x))h-ro h h

= limfi(x + h) -fl(x) + lim f2(x + h) -f2(x)=f1'(x) +f2 (x),

h h

or in words : the derivative of a sum of two functions is the sum of the deriva-tives of the two functions, provided that the derivatives of the two functionsexist.

This latter qualification is quite necessary as we can see from a verysimple example. Take

f1(x) __ 1 for all irrational values of x10 for all rational values of x

Then,

and, consequently,

f2(x) - 10 for all irrational values of x1 for all rational values of x.

fl(X) +f2(x) = 1

d Cf1(x) + f2(x)J = ddx) = 0.

However, neither the derivative of f1(x) nor the derivative off2(x) exists.We can show this quite easily (even though it should be obvious) : Letus try to find the derivative of f1(x) at a rational point x. Then, f1(x + h) =f1(h) because if x is rational, then x + h is rational if h is rational, andirrational if h is irrational. Then we have

f1'(x) = lim f1(x + h) - fi(x) = lim fi(h) - 0 = lim f1(h)h-o h h-0 h h

and we see that, while the denominator approaches zero, the numeratoroscillates between 0 and 1. Clearly, the limit cannot exist. A similar argu-ment shows that the limit fails to exist at every irrational point and ananalogous argument holds for f2(x).

The requirement that both limits

fl(x + h) - f1(x) f2(x + h) -.f2(x)(111.7) lim , lim

h-+0 h h-.0 h

Page 182: ɷshawns sagan integral and differential calculus

170 CHAP. III. RATES

have to exist enters our argument at the point where we break the limit

lim fi(x + h) + f2(x + h) -.fi(x) - f2(x)h-.o h

into the sum of the two limits (111.7). Such a procedure is only permissibleif both limits exist (as we pointed out when discussing limits of sequences,Chapter II, Section 4).

Again, we can illustrate this by a simple example. Take

lim1- - 1-

h-o h h

Clearly,

lim 1h-'o h

does not exist, i.e., is not a finite number. Hence,

h-0 h h h-.o h h-.o h

because the limit on the left side does exist:

lim-1)=limo=0.h-ro

nhh h-o

The student might claim that "of course, if we split this limit into twolimits, then we obtain co - oo and this is 0 anyway, or isn't it?" No, itisn't. Just try it with

lim(-h- -)=limb-1

h2 h2

which clearly does not exist. Still, if you split this limit into a sum of twolimits, you obtain oo - ob.

Let us return to our problem at hand. We have shown thus far that thederivative of a sum of two functions is the sum of the derivatives of the twofunctions. We can generalize this statement easily to a sum of three func-tions, and, as a matter of fact, to a sum of any finite number of functions.

Take

y =fl(X) -I- f2(x) +f3(x)and assume that the derivatives fi'(x) (i = 1, 2, 3) exist. Then, we combinethe two functionsf&) +f (x) to a function F(x):

and haveF(x) =f2(X) + f3(x)

y =fi(x) + F(x),

Page 183: ɷshawns sagan integral and differential calculus

111.2. DIFFERENTIATION RULES 171

i.e., a sum of two functions to which we can apply the rule which we havealready established :

y' = f1'(x) + F'(x).

Now, what is F'(x)? Again, F(x) is the sum of two functions and, hence,accessible to our rule and we obtain

F'(x) = f2 (x) + f3'(x).

Thus, combining our partial results, we have

y' = f1'(x) + fz (x) +f3'(x)

In general, one can show that the derivative of a sum of n functions is equalto the sum of the derivatives of the n functions, provided the derivatives ofthe n functions exist:

dxIlfk(x) = I fk'(x)

(For the case of n = 4 and 5, see Problems 111.5 and 6.)Now, we can really live it up and find the derivative of

y = 25x2 + 8x - 786935420

right away: We obtain, observing the differentiation formulas of the pre-ceding section and the two rules of this section,

y'=25.2x+8.1 -0=50x+8.(We remind the reader that we could have obtained this result as early asChapter I, Section 10, when we discussed the slope of the tangent line to aquadratic parabola.)

Problems III.5-III.10

111.5. Suppose fz'(x) (i = 1, 2, 3, 4) exist. /Show that

{

TX[fl(x) +f2(x) +f3(X) +f4(X)] =.1'(x) + f2 (x) +J 3 (X) +f4 (X).

111.6. Same as in Problem III.5 with i = 1, 2, 3, 4, 5.(For the sum of five functions.)

III.7. Differentiate:(a) y = 4x2 - 3x + 4 (b) y = -x2 + 3x + 1(c) y = (x + 10)2 (d) y = (x - l)(x + 1)

111.8. Find the equation of the tangent line to the parabola

y=2x2-3x+12at the point P(2, 14).

Page 184: ɷshawns sagan integral and differential calculus

172 CHAP. III. RATES

111.9. At which point does the curve

y=4x2-3x-2have a tangent line with slope 6?

111.10. At which point does the curve

y = 4x2 - 16x + 12

have a horizontal tangent line? (A horizontal line has the slope 0.)

3. DIFFERENTIATION OF SOME SIMPLE FUNCTIONS

We are now going to find the derivative of some simple functions by thelimit process which serves as the definition of the derivative.

First, let us consider

We obtainy=x3.

d(x3)=lim(x+h)3-x3=limx3+3x2h+3xh2+h3-x3

dx h-0 h h-0 h

=lim3x2h+3xh2-}-h3=lim(3x2+3xh+h2)=3x2.h-0 h h-'0

Hence,

(111.8)d(x3)=3x2.dx

Now, if we inspect the following table which is compiled from the results(1I[.3), (111.4), (111.5) of Section 1 and (111.8) of this section,

y

1

x

x2

x3

y'

0

1

2x

3x2

we can make an intelligent guess as to the derivatives of other powers of x,such as

d(x4)3, dx ) = 5x4, etc.

(See Problems 111.11 and 12.)It would appear that in taking the derivative of

y=xn

Page 185: ɷshawns sagan integral and differential calculus

111.3. DIFFERENTIATION OF SIMPLE FUNCTIONS 173

where n is a positive integer or zero, we multiply the function by the expo-nent and diminish the exponent by 1:

(111.9) d(x') = nxn-1

dx

(For a proof of this formula, see Problem 111.18.)If this were'true also for negative exponents, then we would obtain for

n = -1

d(x_1

d

) - x = -1 x2 = - 1dx dx

We can check directly that this is indeed the case:

d (!) 1 -1 x-(x+h)

x2

= lim x+ h x = 1im

x(x + h) = lim -hdx h h h-+o hx(x + h)

=lim -1 = -1 =- 1

h-to x(x + h) x(x + 0) x2Thus,

(III.10)

d1

x 1

dx x2

Let us now live dangerously and consider the exponent n = 1. If (111.9)also happens to be true for rational exponents, then we could expect that

d(x112) = d...J = 1 x (1il) = 1

dx dx 2 2,/x

Again, a direct check will show that this is indeed the case:

d,/x=1im x+h--N/x=limV3+h-N/x ,/x-{-h+V_X_dx h h h

h 1= lim = Jim = limh-'o h(,/x + h + ,/ X) h-o h(,/'x + h + /x) h-.o Jx + h + IX

1 1\/x+0+ Jx\x.Thus,

(111.11)d. /x = 1

d x 2,/x

Page 186: ɷshawns sagan integral and differential calculus

174 CHAP. III. RATES

Similarly, we compute

1 1 J - /x+h/ - x + h Jx = lim VxJx + h = lim Jx - VT + h

dx h-0 h h-o h h-o h Jx Jx + h

= lim Jx x + h. /i + Jx + hJ, -.o h V -X,/ x-+h V-X + V x + h

1imx-(x+h)=

iL-.o h./x,/x + h(Jx + x + h)

= lim -hh-o hvx,.Jx + h)

= 1im -1h-0 lx-,I+ h(,Jx + ,Ix + h)

-1 -1,l xJx + O(Jx + /x + 0) 2Jxs

Thus,

(111.12)

d1 =- 1

dx 2,/X3

which can also be obtained from a direct application of (111.9) as the readercan easily verify.

We wish to state without proof that (111.9) is universally true for anyreal exponent, i.e., for any positive or negative number, be it rational orirrational. The proof of (111.9) for positive integers is quite simple (seeProblem 111. 18). It is not difficult for negative integers, either, and can befound in most introductory treatments of the Calculus. There are no realdifficulties to establish it for positive or negative fractions, which is alsodone in most Calculus books. It is, however, quite difficult and requiressome background knowledge to establish the validity of (111.9) for irrationalnumbers. This is usually accomplished in a course that is customarilycalled "Advanced Calculus."

Problems 111.11-111.29

111.11. Find the derivative of y = x4 by the limit process and compare theresult with the result obtained by a direct application of (111.9).

111.12. Same as in Problem 111. 11 for y = xsIII.13. Find the derivative of y = x'5 by the limit process.

Page 187: ɷshawns sagan integral and differential calculus

111.3. DIFFERENTIATION OF SIMPLE FUNCTIONS 175

111.14. Find the derivative of y = x + 1X

111.15. Find the derivative of1 - X(a) y = 7x5 - 3x4 + 5x3 - x2 + 2x + 7 =(b) yx2

(c)y=4x2-1+ 1

(d) y-x 2

=x 1+x111.16. Demonstrate that

an - bn= an-1 + an-2b + an-3b2 + ... + abn-2 + bn-1.a-b

(Hint: Multiply both sides by (a - b) and show that all terms on the right sideexcept an and -bn cancel out.)

I11.17. Let a = x + h and b = x in Problem 111. 16 and write the formula interms of x and h.

111.18. Show thatd(xn) = nxn-1

dx

for any positive integer n. (Hint: Transform the difference quotient accordingto the formula you obtained in Problem 111.17.)

111.19. Find the derivative ofy = 1/x2 + y12,

where y1 is a constant, by the limit process.111.20. Find the derivative of

y = ''(x - x2)2 + 1/22

where x2, y2 are constants, by the limit process.111.21. Take y = f (x). Then the derivative is denoted by y' = f'(x). f(x) is,

in general, a function of x. Now we differentiatef'(x) again with respect to x anddenote the result by

y° = [f'(x)Y =7(x)and call it the second derivative off with respect to x. Similarly, we define thethird derivative as the derivative of the second derivative, etc.

(a) Find the 4th derivative of y = x4.(b) Find the 5th derivative of y = x4.(c) Find the nth derivative of y = xn.

111.22. Find the equation of the tangent line to the parabola

y =x2 +2x +4at the point with the abscissa 3.

111.23. Find the equation of the tangent line to the curve

1y=x+ -xat the point with the abscissa x = 1.

Page 188: ɷshawns sagan integral and differential calculus

176 CHAP. III. RATE,

111.24. Find the intersection point of the tangent lines to the curve y =x3 - 3x2 + 5 at the points with the abscissae x = 1 and x = 3.

111.25. At what points does the curve y = x3 - 5x + 3 have horizontal tangent lines?

111.26. At what points does the curve y = x + x have horizontal tangenlines?

111.27. In which interval(s) does the curve y = x3 + 4x2 - 2x + 2 havepositive slope? (The slope of a curve at a point is defined as the slope of thetangent line to the curve at this point.)

111.28. Find all points at which the curve y = 3x5 - 25x3 + 60x + 3 hashorizontal tangent line. (Hint: Note that an equation of the type ax4 + bx2 +c = 0 can be solved for x2 by the quadratic formula. Subsequently, x can befound by extraction of a square root.)

111.29. Given y = sin x. Find y'. (Hint: Use the result of Problem III.4.;

4. THE CHAIN-RULE

Suppose we wish to differentiate the function

(I11.13) y = (1 + x2)365

This can be done, in principle, by the rules which we derived in the precedingtwo sections for the differentiation of polynomials, because, if we expandthe right side of (111.13) according to the binomial formula, we obtain apolynomial of the 730th degree which contains 366 terms.

Before we dismiss this problem as solved, however, let us stop and thinkfor a moment. If we actually expand the right side of (111.13) we wouldhave to-according to a rough estimate-process about 36000 digits. If weallow 10 seconds for the processing of every digit (computation, writing,etc.), we have to keep working for about 4 days. It will take at leastanother 4 days to differentiate this polynomial, because every one of thedigits has to be processed again, so that we come up with the (conservative)estimate of 8 days of continuous and rather uninspiring work required forthe differentiation of (111. 13). Nobody could possibly want this result thatbadly.

We will develop in this section a method that will enable us to achievethis result, and similar ones, in a matter of seconds. Let us consider theauxiliary function

Then (111.13) can be written asy = u365

Page 189: ɷshawns sagan integral and differential calculus

III.4. CHAIN RULE 177

and we see immediately, that, according to (111.9),

dy= 365u364

duwhile

du- = 2x.dx

Now, if dy, du, and dx could be handled like numbers, everything wouldbe quite easy. We could cancel

dy skt = dy

pa dx dxand obtain as result

dy = 365u3642x = 365(1 + x2)3642x = 730x(1 + x2)364_

dx

This procedure is indeed permissible, even though it cannot be justifiedsimply by cancellation of terms which, as we know, are not numbers, butrather mysterious symbols. In order to establish the permissibility of thisprocess, we may argue as follows: We consider

y = AU)

where we assume that, f'(u) = dduu) exists and

u = g(x)

where we assume that g(x) = ddxx) exists.

Thus, y will ultimately appear as a function of x:

y =.f(g(x)).

We speak in such a case of a function of a function.Now, by the definition of the derivative, (111.2), we have

d y = 1imf(g(x + h)) - f(&))dx h

What is g(x + h) ? We have g(x) = u. Thus, if x changes by the smallamount h, then the value of g(x) will change by some amount k:

g(x+h)=u+kand it is quite obvious that k -+ 0 as h -> 0 [unless the function g(x) has adiscontinuity at this particular point, which we can exclude since we assumedthat g'(x) exists (and how could a function have a tangent line at a pointwhere it is not even continuous ?)].

Page 190: ɷshawns sagan integral and differential calculus

178 CHAP. III. RATES

Hence, we can write

dy = limf(u + k) -f(u)dx h

(k-'o)

The expression under the limit sign will not change its value if we multiplyit by

u+k-it -u(x+h)-u(x)k k

Thendy

dxlim

f(u + k) - f(u) u(x + h) - u(x)

h kh-+0

limf (u + k) -f(u) u(x + h) - u(x)

h-'0 k h(k-'0)

We assumed thatf'(u) as well as g'(x) exist. Hence, both fractions underthe limit sign approach a limit and we can use our rule according to whichthe limit of a product is equal to the product of the limit, provided thatboth limits exist and we obtain

(111.14) dy = limf (u + k) - f(u) lim g(x + h) - g(x)dx k-0 k h

= f'(u)g'(x) = dy du

du dx

In words: The derivative of a function f of a junction g of x with respect tox is obtained by differentiation off with respect to the function g and thenmultiplication by the derivative of g with respect to x (the so-called innerderivative).

This is called the chain rule.Now we can return to our original time-consuming problem: We have,

according to (111.14),

d11 + x2)365 = du365 d(1 + x2) = 365u3642x = 730(1 + x2)364.

dx du dx

We see that (111.14) provides, indeed, for a substantial simplification overthe process which we outlined in the beginning of this section.

In practice, we do not actually make a substitution by u at all. Onedifferentiates (1 + x2)365 "with respect to (1 + x2)" and then multiplies theresult by the "inner derivative" of (1 + x2), namely 2x, and that is that.

To demonstrate the wide range of applications of the chain rule, let usconsider the following example. Take

y = x3 - 3x2 + 16.

Page 191: ɷshawns sagan integral and differential calculus

III.4. CHAIN RULE

According to (III.11),d, Fu 1

du

Hence, we obtain with the chain rule

179

d'1x3- 3x2+ 16=1 (3x2- 6x).

dx 2, x3-3x2+16

The reader will certainly appreciate this shortcut after he has tried to findthe same derivative by going through the lengthy and awkward limit pro-cess, as we have done in some instances in Section 3 of this chapter.

Problems 111.30-111.33

111.30. Write the following rule symbolically and justify it: The derivative of asquare root of an expression is equal to one half the derivative of the expression,divided by the square root.

111.31. Differentiate by the chain rule:

(a) y = x21- 4 [see (I11.12)]

(b) y = x4 - 3x2 + 4x (see Problem 111. 13)

(c) y = v'x + 1(x + 1) (Find first the derivative of uM by the limitprocess.)

(d) y= x - 7 + x 1 7 (see Problem 111.14.)

III.32. Differentiate by the chain rule

(a) y = sin (x2) (b) y = sin (1 - 3x + x3)

(c) y = 'Vsin x (d) y = sin2 x

(see Problem 111.29.)III.33. The chain rule can be generalized to functions of functions of functions

of .... Specifically, if y = f (u), u = g(v), v = h(x), then

dy _ dy du dvdx du dv dx

Using this rule, find the derivative of

y = \/l + (x2 - 2x + 4) 12

(Hint: Let y = V _u, u = 1 + v12, v = x2 - 2x + 4.) .4

Page 192: ɷshawns sagan integral and differential calculus

180 CHAP. III. RATES

5. INTEGRATION BY ANTIDIFFERENTIATION

We have seen in Chapter II, Section 15, that if

f (u) du = F(x),

for some constant lower integration limit a, then

F'(x) = f (x).

For example, in Chapter II, Section 13, we found that with f'(x) = x2,

f Lug du =33 - a3

.

aNow let b = x and obtain

x s3r it2 du =

xX

aa F(x)

a 3 3

and we know from the preceding section that

() d x3_a3llF x =dx(3 3I - xJ

We can see that with every functionf(x) for which the integral exists andevery lower integration limit a within the domain in which the integralexists, there corresponds a function F(x) such that

ff(u) du = F(x)

where F'(x) = f(x). However, if we do not specify the lower integrationlimit a, then there are clearly infinitely many functions F(x) with the prop-erty that F'(x) =f(x), and all these functions differ from each other onlyby an additive constant. Certainly, if we take F(x) and F(x) + C, where Cis an arbitrary constant, then

d [F(x) + C] = dF(x) + dC = dF(x)dx dx dx dx

dand if F'(x) = f(x), then it is also true that

dx[F(x) + C] = f(x).

Take, for example, the two functions

x2 + 2x + I and x2 + 2x + 1896.

We obtain from both functions the same derivative, 2x + 2.If a function F(x) is in relation

(111.15) F'(x) = f (x)

Page 193: ɷshawns sagan integral and differential calculus

111.5. ANTIDIFFERENTIATION 181

to a functionf(x), then we call F(x) the antiderivative off(x). The reasonfor this terminology is obvious : f (x) is the derivative of F(x), i.e., f (x) isfound from F(x) by differentiation. So, in order to get back to F(x) fromf(x) we have to do the opposite of what we have done in getting from F(x)to f (x), namely, antidifferentiate.

Since there correspond with f(x) infinitely many functions that differfrom each other by an additive constant only, if we do not specify in

Ja'X

f (u) du = F(x)

the constant lower integration limit, we also call F(x) + C the indefiniteintegral off (x) and write

(111.16) f f(x) dx = F(x) + C

where C is an arbitrary constant.Here it is expressed explicitly that the antiderivative is determined except

for an additive constant, which we call integration constant. Therefore, therelation (111. 16) is to be preferred to (I11.15), even though both statementsexpress the same thing.

For example,3$(x2_4x+3)dx___2x2+3x+C

is equivalent to(33

-2x2+3x+C) =x2-4x+3,

where C is an arbitrary constant.If we consider the problem of finding the equation of a curve which has

at any point with the abscissa x the slope y' = 3x2 - 2x + I and passesthrough the point P(0, 1), we proceed as follows.

First, we find the antiderivative (the indefinite integral) of y', namely, y:

y=J(3x2-2x+1)dx=x3-x2+x+C

and then we determine the value of the integration constant C so that y = 1when x = 0, obtaining C = 1. Thus,

y=x3-x2+x+ 1is the solution to our problem.

Page 194: ɷshawns sagan integral and differential calculus

182

Now we can relate the antidifferentiation process to the process of evalu-ation of a definite integral as follows. Suppose we want to find

fJ bf(x) dx.a

First, we find by antidifferentiation the indefinite integral off (x) :

f f (x) dx = F(x) + C.

We know thatfa

f( x) dx F(a) + C

rb

a

for some suitable lower integration limit a and some constant C. Then,

fabf(x)dx=abf(x)dx-faf(x)dx=F(b)+C-F(a)-C

f(x) dx = F(b) + C

CHAP. III. RATES

= F(b) - F(a)

as we have already seen in Chapter II, Section 15.If F(x) is the antiderivative of f (x), it is also customary to write

frb

a

f(x) dx = F(x)b

a

meaning, that F(x) is to be taken at the upper integration limit x = a andfrom this F(x) taken at the lower integration limit x = b is to be subtracted.For example,

J'(3x2 - 2x + 1) dx = (x3 - x2 + x)2

6-1=5.=i

We can state now all the differentiation formulas which we have developedthus far as antidifferentiation formulas, or integration formulas, as they arecustomarily called. We have seen in Section 1 that

dC-0,dx

i.e., C is the antiderivative of 0 (C is an arbitrary constant) :

(111.17) Jo dx = C.

Fromdx = 1dx

Page 195: ɷshawns sagan integral and differential calculus

111.5. ANTIDIFFERENTIATION 183

follows now

and from

Ji dx = fdx = x + C

d x2- = 2xdx

it follows after division by 2 that

fx dx = 1x2 + C, etc.J 2

In general, we obtain from (111.9), Section 3,

dxn = nxn-1

dx

after division by n and writing m + I instead of n (n is an integer, hencem + 1 is also an integer)

m+1(111.18) xm dx =

x + C.m+ 1

This formula is loaded with danger because, if m = -1, we obtainx° I

0- on the right side which is quite senseless. So we have to demand

that in formula (111.18)m - 1.

Otherwise we can say about (111. 18) the same thing that we stated aboutthe differentiation formula (111.9), namely, that it is valid for all realnumbers m (except, of course, for m = -1).

This is funny in a way, because m = -1 corresponds to n = 0 and thedifferentiation formula (111.9) is valid for n = 0. On the other hand, we seefrom (111.9) that no matter what value of n we might choose, we cannot

possibly obtain 1 as the derivative of a power of x. Hence, 1 does notx x

appear to have an antiderivative among the powers of x.It can be shown, however, that there is a function, call it 1(x), which has

1- as its derivative or, as we may state, is such thatx

J1 dx=1(x)+C,x>0.X

We choose the constant C such that

(III.19) f 1 du = I(x).1 U

Page 196: ɷshawns sagan integral and differential calculus

184 CHAP. III. RATES

It can be shown that the function defined in (111. 19) has the properties of

a logarithm, namely,1(xy) = 1(x) + 1(y),

I (x) = 1(x) - 1(y),y

1(x") = al(x).

1(x) is called the natural logarithm. The numerical values of 1(x) can befound to any degree of accuracy from (111.19) by any one of the approxi-mation formulas for the evaluation of definite integrals. We found, forexample, in Problem 11.50 that

1(2) f21du=0.6

1u

Problems I11.34-III.41

I1I.34. Find the antiderivative of

(a) y' = x2 + 4x - 1

(c) y'=x4+x2+1

(e) y' = 1x

111.35. Find

(b) y' = x3 - 3x2 + 2x + I

(d) y' = 7x5 + 3x3 - 2x2 + 12x + 3

(f) y'=-x2+3x

(a) J(x3 - 4x2 + 4x + 7) dx (b) J(x4 + x3 + x2 + x + 1) dx

(c) J(4x3 - 3x2 + 2x + 1) dx

(d) J(6x5 + 5x4 + 4x3 + 3x2 + 2x + 1) dx

(e)J

x3 (f)J

x2

111.36. Find the equation for the curve which has for any x the slope

y'=x2+3x+4and passes through the point P(1, 3).

111.37. Utilizing the antidifferentiation process, evaluate the following definiteintegrals:

(a) f(x3_3x2+1)dx (b) f5

(x2 - 4x + 5) dx

(c)J

2(x4 + x + 3) dx (d) f (x3 - 4x) dxi 1

Page 197: ɷshawns sagan integral and differential calculus

111.6. INVERTED CHAIN RULE 185

111.38. Find an approximation to 1(2) using Simpson's rule and 32 subintervals.(1(2) = 0.6931 )

111.39. Same as in Problem 111.38 for 1(3). (1(3) = 1.0986111.40. Find an approximation to l(6). (Hint: 1(6) = 1(2.3) = 1(2) + 1(3).

For 1(2) and 1(3) use the approximations found in the two preceding problems.(1(6) = 1.7917 )

111.41. Take F(x) = x2 + 2. Clearly, F'(x) = 2x. Show that it is impossibleto find a real number a such that

I x2x dx _ x2 +2.a

Why does this not contradict our statement preceding and including formula(111.16)?

6. THE INVERTED CHAIN-RULE(Substitution Method)

We have seen in Section 4 that if y = f (u) and u = g(x), then y = f(g(x))and

dy_dy_dudx du dx

[see (111.14)].Hence, if we have an integral like

we see that with u = 1 + x2 and consequently du/dx = 2x, the integrand isof the form

1 du

2../u dx

where1 d= [see (III.11)]. Thus we can see that Vu with u = g(x)

2-Vu duis the function, which has, according to the chain rule,

1 duas its

derivative: 2\/u dx

d u(x) _ 1 du

dx 2,/u dx'

or, as we may write in view of (111.15) and (111. 16),

1 dudx= /u(x)+C2.,/-u dx

Page 198: ɷshawns sagan integral and differential calculus

186 CHAP. III. RATES

which becomes, in our example with u = 1 + x2,

5_xdx =5 1 2x dx = 1u(+ C = J1 + x2 + C.

1+x2 2.,/u

We see immediately that this integration method is applicable, wheneverwe have an integral of the form f f'(u)u'(x) dx and we obtain

(111.20) f f'(u)u'(x) dx = f [u(x)] + C.

In practice, we apply this rule (which is obviously an inversion of thechain rule) as follows.

We have the integral

Jf'(u)u'(x) dx

and carry out the substitution u = u(x). Then du/dx = u'(x) and we have

f f'(u)u'(x) dx =f f'(u) dx dx =f f'(u) du = f(u) + C.

Then, we substitute u = u(x) back again and obtain

f f'(u)u'(x) dx = f [u(x)] + C.

Let us work some more problems. Consider

dx

(1 + x)2

We substitute

and obtaindu

= 1 or, as we may write, dx = du.dx

Hence,

f (1+x)2f u2 uC 1+x f C

(in view of (111.10).Next consider

Let

fx2 dx

J (1 + x3)2

u = 1 + x3. Then dx = 3x2 and, consequently, x2 dx = s du

Page 199: ɷshawns sagan integral and differential calculus

111.7. DIFFERENTIAL

and we obtain

J

x+2

dx3)2 3 J u2 = -u -{- C = - 1 + x3 C.(l

Finally, let us consider

5x(1 + x2)1° dx.

Let u = I + x2, then du/dx = 2x and x dx = i A.Hence

fx(1 + x2)1° dx - 1 ful' du = 1 - + C - 1 (1 + x2)11 + C.2 2 11 22

Problems 111.42-111.45

111.42. Integrate

(a)

1

(c) 1(1 + x)2dx

I11.43. Integrate

(a) fx2vhl + 4x3 dx

(b) 5(1 - x)780 dx

(d) 5[x2 + (1 - x)150] dx

(b) J'4x(l - 7x2)12 dx

(c) 5(1 - 4x3) /x - x4 dx (d) J 1 (1 - 1 dx72 X x

M.44. Integrate

fsin x cos x dx

(Hint: Let u = sin x).TII.45. Integrate

f IT - cost x cos x dx.

7. THE DIFFERENTIAL

Suppose we wish to evaluate

187

V'16.098,

at least approximately, without getting involved in too much numericalcomputation. Clearly, the value of this square root will be a value that isquite close to 4 since 4 = 16 and 16.098 is very close to 16 (and this isreally the intuitive meaning of the statement that y = \/x is continuousat x = 16).

Page 200: ɷshawns sagan integral and differential calculus

188 CHAP. III. RATES

Let us write this square root as follows

V' 16.098 = V16 + 0.098.

We know the value of and we want to know the value of the squareroot for a value very close to 16. We can state this problem in a moregeneral form as follows.

Suppose we know the value of Vx- for a certain value of x and we wishto find the value of

V +- h

where h is a small number. So the question arises: How much bigger is1/x -+h than '\Ix? The amount by which the square root increases as xincreases by the amount h is given by

Let us divide this difference by h :

Jx+h - Jxh

and we see that we have here the difference quotient of the function 1/x.

We know that the difference quotient approaches the derivative y' = dVxdx

as h -> 0. So we expect that for small values of h, the difference quotientwill be very close to the derivative:

Jx -+h - Jx dJxtih dx

Since

dJx

[see (111. 11)], we havedx 2,/-x

Jx h - .\/x ti I

and henceh 2Jx

XJx+h-Jx 1ivh.Substituting x = 16 and h = 0.098,

,/T6 . 998 - /T-6 0.0982 J16

Page 201: ɷshawns sagan integral and differential calculus

111.7. DIFFERENTIAL

and hence

.16.098 -v 4 + 8 0.098 = 4.01225

while the exact value accurate to five decimal places is found to be

16.098=4.01223

189

as we can find by cumbersome manipulations. We see that the approxi-mation which we obtained is really quite good.

Now, in general, we have the problem of finding the value of a functionin the neighborhood of a value for which the function is known (or can befound easily) :

Given y = f(x). Find an approximation tof(x + h) for small values ofh. We proceed as in the preceding example: We consider

f(x + h) -f(x)h

and argue that for small values of h, this difference quotient will not differmuch from the derivative f'(x):

f(x + h) - f(x) =f'(x)

h

and we obtain the approximation formula

(111.21) f(x + h) - f(x) + f'(x)h.

The correction term f'(x)h is called the differential of the functionf(x).It furnishes an approximation of the amount by which the function in-creases as x increases by the small amount h.

Formula (111.21) is easily accessible to geometric interpretation. Werefer to Fig. 111.5: We have drawn the tangent line T to the curve y =f(x)at the point [x,f(x)]. The slope of the tangent line at this point is f'(x)(see (111.2)), which we can write as

.f '(x) =hf'(x)

h

Since the run between x and x + h is h, we see that hf'(x), the correctionterm, represents the rise of the tangent line at x + h and we see that thebold segment denoted by E represents the error which we commit in usingformula (111.21) for the approximation to the increase of the function. Wecan easily see from Fig. 111.5 that the smaller h is, the better the approxi-mation. i.e., the smaller is the error E.

Page 202: ɷshawns sagan integral and differential calculus

190

Y

Fig. II1.5

Problems I1I.46-III.53

III.46. Find approximations to(a) -1.01 (b) V'24.97(c) V'9.104 (d) V48

111.47. Find approximations to(a) (9.031)2 (b) (10.0114)2

(c) (100.236)2 (d) (4.0029)2

CHAP. III. RATES

111.48. Explain why values larger than the true values are obtained in Problem111.46 and values smaller than the true values are obtained in Problem 111.47.

III.49. Find an approximation to sin 13 + 240) .(According to the right

d sin xanswer to Problem 111.29, dx = cos x.)

111.50. A 5-ft square steel plate is subjected to heat, whereupon the edgesexpand by 0.0341 ft. Find an approximation to the area of the expanded plate.

III.51. The 36-sq-ft area of a square plate expands and reaches an area of36.037 sq ft during the process. Find an approximation to the side of the ex-panded plate.

111.52. The radius of a circular disk of radius 1 ft expands by 0.0138 ft. Findan approximation to the area of the expanded disk.

111.53. A circular disk has initially the area of 97r sq in. The disk shrinks, whenexposed to cold, to an area of 8 .869ir sq in. Find an approximation to the amountby which the radius became reduced.

Page 203: ɷshawns sagan integral and differential calculus

111.8. NEWTON'S METHOD 191

8. NEWTON'S METHOD

In Chapter I, Section 9, we developed a formula which enables us to solvethe quadratic equation

axe+bx+c=0.We have not discussed the solution of cubic equations

ax3+bx2+cx+d=0or, for that matter, any other equation

(111.22) f (X) = 0,

where f(x) stands for some function of x, not necessarily a polynomial.Let us mention that there exists an algebraic formula for the solution of acubic equation and there is also an algebraic formula for the solution of aquartic equation

ax4+bx3+cx2+dx+e=0.

These formulas are so complicated that it is much better to stay awayfrom them, no matter how badly one needs a solution. There are no alge-braic formulas for the solutions of general algebraic equations of the 5thand higher order [equations of the type (111.22) with f (x) representing apolynomial of the 5th or higher order with rational coefficients]. As amatter of fact, it can be shown that such formulas cannot exist. So it is notjust a matter of not having found such formulas up to now. Nevertheless,there are methods which yield the solutions of special equations of higherorder than the 4th and there are also various methods for solving equationsof the type (111.22) for special functions f(x) which are not polynomials.But there is no practical method which would allow us to deal with thegeneral equation (111.22) without specifying whatf(x) is.

However, there are methods which allow us to solve approximately ageneral equation of the type (111.22) to any degree of desired accuracy, ifwe impose some restrictions on f(x). The method which we will discusshere was designed by Sir Isaac Newton (1643-1727) and bears his name.

We will have to assume for the following that the derivative. f'(x) existseverywhere. Let us consider the graph of y = f (x) (see Fig. 111.6). Solvingequation (111.22) is equivalent to the problem of finding the intersectionpoint(s) of the graph of y =f(x) with the x-axis y = 0. To find this inter-section point, we proceed as follows: We pick just any point x0 (see Fig.111.6), substitute it intof(x) and obtain a value

yo =,f(xo)

Page 204: ɷshawns sagan integral and differential calculus

192

Fig. III.6

CHAP. Ill. RATES

Now there are two possibilities: Either yo = 0, then we are through,because xo is then a solution of (111.22); or yo 0 0. In the latter case, welocate the point P0(xo, yo) on the graph of y = f (x) and determine thetangent line To at this point by the point-slope formula [we know the co-ordinates of the point PO and the slope of To, namely, f'(xo)]. Next, weintersect the tangent line To with the x-axis and find the x-coordinate of theintersection point. We call it x1. Again, we evaluate the function f'(x) forx = x1 and obtain a value

y1 =,f(x1)If yl = 0, then x = x1 is a solution of (111.22); or, more realistically,

yl 0 0. In the latter case, we locate the point P1(x1, y1) on the graph ofy =f(x) and determine the tangent line T1 at P1. This tangent line willintersect the x-axis at some point x = x2 and we repeat the process, i.e.,locate the point P2(x2i y2) and intersect the tangent line T2 at P2 with thex-axis to obtain the value x = x3, etc.

In this manner we generate (we hope!) a sequence of values x0, x1, x2,x3, - - which tends to the actual intersection point of y = f(x) with y = 0and which is the solution of (111.22).

Before we write down a general formula which condenses this process,let us work an example. We wish to solve

f(x)=x2-2=0approximately, which means that we want to find numerical approximations

to 1/2 because x = /2 is a solution of this equation.

Page 205: ɷshawns sagan integral and differential calculus

111.8. NEWTON'S METHOD 193

Let us start with just any value-for example, x0 = 2 (see Fig. 111.7).We obtain PO(2, 2). The slope of y = x2 - 2 is given by y' = 2x. Hencethe tangent line To at P0 has the slope 4 and, therefore, the equation ofTo (by the point-slope formula) is

y-2=4(x-2)or

y=4x-6.We intersect To with y = 0 by solving

4x - 6 = 0with the solution

xl =3= 1.5.

Now we locate the point P1(2, 1). The slope of T1 at P1 is 2 3 andconsequently

Ti: y=3x- 1 .

The intersection point of Tl with y = 0 is found to be

x2=12=1.416.

Next, we repeat the process. We obtain P2(12-, y !T) and

T2:y=17x-577

6 144

Fig. III.7

Page 206: ɷshawns sagan integral and differential calculus

194

and, consequently,x3 = 577

= 1.4142156408

CHAP. Ill. RATES

The value of i accurate to five decimal places with the sixth decimalplace rounded off is found to-be

1/2 = 1.414214.

So we see that our third approximation, x3, is already very close to the truevalue.

In order to develop a general formula for this process, let us note thatthe procedure is the same at every step of our iteration. So we can pick outany step for our general discussion, say, the (k + I)th step. After k steps,we obtain the approximation x1:

{yk = J (xk)

and determine the equation of the tangent line at the point Pk(xk, yl;). Theslope of y = f(x) at x = xk is given by

Mk =.f'(xk)

Thus we obtain from the point-slope formula

y - yk =f'(xk)(x - XA-)or

Tk: y =f'(xk)x -J '(xk)xk +

If we intersect this tangent line with the x-axis y = 0, we arrive at theequation

J '(xk)x - J '(xk)xk + Yk = 0.

The solution of this equation yields the next approximation xk-F1:

xk+l - f'(xk)xk - Yk = X _ {. yk

f '(xk)k

f (x0or, if we write again f (x7,) instead of yk, we have

(III.23) xk+1=xk-'

k=0,1,2,3,f(xk)

This method works most of the time quite well, inasmuch as the sequencexo, x1i x2, x3, approaches the solution (or a solution) off(x) = 0.

However, it could happen, for example, that we start out with a value xusuch that after a number of steps a point at which the slope of y = f (x) iszero is reached (see Fig. 111.8), just to mention a simple case where the

Page 207: ɷshawns sagan integral and differential calculus

111.9. MAXIMA AND MINIMA

Fig. 111.8

195

method does not work. In Fig. 111.8, we unfortunately have f'(x2) = 0and, consequently, x3 = - oo and the process falls apart.

Problems 111.54-111.57

111.54. Find an approximation to 1/3 by Newton's method. Take xo = 2 andtake three steps. (Hint: 1/3 is the solution of x2 - 3 = 0.)

111.55. Find approximate solutions ofx3 + 7x2 - 5x - 35 = 0

by Newton's method.(a) take xo = 2 (b) take xo = -2 (c) take xo = 6.

Take two steps in each case. If you are puzzled by the results, try to explainwhat puzzles you.

111.56. Find an approximation to ti/-7 by Newton's method. Take xo = 2 anduse three steps. Compare your result with the value v'7 = 1.912931 which isfound in tables and with the approximation which you obtain for ;'7 = 3J8 - 1

by the method of Section 7. (Remember: dx x'-3 = 3x I=/).

111.57. Find an approximation to 1/6 by Newton's method. Start with xo =and go as far as you think is needed to obtain the result accurate to three decimalplaces.

9. MAXIMA AND MINIMA

Let us consider the function y = f(x) and let us assume that the deriva-tivef'(x) exists for all values of x, i.e., the curve representing the functionhas a tangent line at every point.

Page 208: ɷshawns sagan integral and differential calculus

196 CHAP. III. RATES

Xj X2 X3 X4

Fig. 11.9

For the following discussion we refer to Fig. 111.9. We observe that thevalue of the function at x = x, is greater than the values which the functionassumes in a neighborhood of the point x = x,. The same thing can besaid about the point x = x3. We call the value of the function at such apoint a relative maximum value, and give the following definition.

y =f(x) is said to assume a relative maximum at the point x = x,, ifthere exists some neighborhood* of x = x,, i.e., some intervals x, - 61x < x, and x, < x < x, + 62 such that the values of the function for all a:in these intervals are smaller than the value of the function at x = x,:

f(xl) = rel. max.f(x) if f(x,) >f(x) for all x in x, - 61 < x < x, andx,<x<x,+62forsomeb,,62.

The reason for calling such a value a relative maximum is quite obvious.The function as represented in Fig. 111.9 assumes much greater values thanf(x,) for large values of x, but f(x,) is still a maximum, relative to (com-pared with) values in a neighborhood of x,.

Similarly, the function y =f(x) is said to assume a relative minimum atthe point x = x2, if there exists some neighborhood of x = x2, i.e., someintervals x2 - 61 < x < x2 and x2 < x < x2 + 62 such that the values ofthe function for all x in these intervals are greater than the value of thefunction at x= x2:

f(x2) = rel. min. f (x) if f (x2) < f (X) for all x in x2 - b, < x < x2 andx2 < x < x2 + 62 for some 61, 62.

* The term neighborhood is used here in the naive sense. Technically, we are dealinghere with what is called a deleted neighborhood.

Page 209: ɷshawns sagan integral and differential calculus

111.9. MAXIMA AND MINIMA 197

We can see that the function depicted in Fig. 111.9 has relative minimumvalues at x2 and x4 and relative maximum values at xl and x3.

The question which now arises is this: If a function is not given by itsgraph, but rather by its mathematical representation, how do we find thevalues of x for which the function assumes a relative maximum or a relativeminimum-or, if we lump the two together, a relative extreme value?

Recall that we assumed that the derivative (tangent line) exists every-where. Again we consult Fig. 111.9 and see that the tangent line to thecurve at points which represent relative extreme values is horizontal. If itwere not horizontal, then the slope of the tangent line would be eitherpositive or negative, indicating that the function is either climbing fromsmaller to larger values or descending from larger to smaller values at thispoint.

Thus we can state: If y = f(x) is a function which everywhere has aderivative, and if y = f(x) has a relative extreme value at the point x = xo,then it is necessary that the tangent line at this point be horizontal, i.e.,

(111.24) f'(xo) = 0.

First, let us point out that it is quite essential to assume that the tangentline exists everywhere. Consider the function

y = 1XI

(see Fig. I11.10) which has a minimum value at x = 0. Observe that thederivative at this point is not zero; as a matter of fact, it does not evenexist, as we have seen in Section 1.

Second, we wish to point out that condition (111.24) is by no meanssufficient, i.e., if the derivative at a certain point is zero, that does notguarantee that the function assumes at this point a relative extreme value.Consider

Clearly, the derivativey=x3.

y/ = 3x2

vanishes at the point x = 0; still, the function does not assume a relativeextreme value at this point (see Fig. III.11).

Thus all we can say is: If an everywhere differentiable function hasextreme values at all, then it assumes these extreme values at points forwhich the derivative of the function vanishes. Therefore, in order to findextreme values we have to find first the values for which the derivativevanishes, and then investigate each such point individually to see whetherit really yields a relative maximum or a relative minimum.

Page 210: ɷshawns sagan integral and differential calculus

198

Fig. III.10

CHAP. 111. RATES

Y

Fig. III.11

To illustrate the procedure, let us consider some problems. First, weconsider the function

y = 2x3 + 3x2 - 12x - 6.Its derivative is

y'= 6x2+6x- 12.

In order to find all points for which the derivative vanishes, we have tosolve the equation

6x2+6x-12=0.

The solutions of this equation are

x1=1,x2=-2.

Hence, if there are extreme values at all (note that the derivative of ourfunction exists everywhere), these extreme values can only be assumed forthese values x1 = 1, x2 = -2.

Let us evaluate our function for these two values. We have

ylx=1=2+3-12-6=-13YIx=-2=-16+12-1-24-6= 14.

Offhand, we might say that 14 is the relative maximum and -13 is therelative minimum of our function because 14 is greater than -13. Eventhough this is true in this particular case, we warn the reader not to offersuch faulty argumentation (see the next problem discussed in this section),because it could easily lead to wrong conclusions.

Page 211: ɷshawns sagan integral and differential calculus

111.9. MAXIMA AND MINIMA 199

x

Fig. 111.12

In order to graph our function, we first draw a graph representing thederivative y'. This is a quadratic parabola (see Fig. 111.12). We see thatthe derivative is positive for all values x < -2 and all values x > I andit is negative for all x which are in the interval -2 < x < 1. We tabulatethis information together with the fact that the derivative vanishes at x = 1,-2 in Fig. 111. 13, and draw underneath the table straight lines with slopesthat are indicated by the table.

We then obtain a polygon which has a very rough resemblance to thegraph we seek. It seems to indicate specifically that the value at x = -2 isindeed the relative maximum value, and the value at x = 1 is the relativeminimum value of the function. The information we thus gained aboutour function, together with the construction of a few points, will yield itsgraphic representation as given in Fig. 111.14.

Next we consider the following function

y=x+-.x

x x<-2 x=-2 -2<x<1 x=1 I x>1

y + 0 - 0 +

Fig. 111.13.

Page 212: ɷshawns sagan integral and differential calculus

200 CHAP. III. RATES

Fig. Ill.14

y

----------_~l----+----+------------~x

I I I I I I

7\ Fig. m.IS

Page 213: ɷshawns sagan integral and differential calculus

111.9. MAXIMA AND MINIMA 201

The derivative is

y'=1- 1

X

(see Problem Ill. 14). The values for which the derivative vanishes are thesolutions of the equation

1- 2=0X

or, after multiplication by x2 0,

x2- 1 =0.

The solutions of this equation are x12 = ± 1.The corresponding values of the function at these points are :

ylx=1=1+1=2y1,1=-1-1=-2.

So, offhand, we would state that 2 is the maximum and -2 is the mini-mum value. Wrong! From the graph, which is given in Fig. 111. 15, we seethat it is exactly the other way around.

It may sound crazy to say that -2 is the maximum while 2 is the mini-mum value. But, remember, we are dealing with relative extreme valuesand we see that -2 is indeed larger than any value of the function in theneighborhood of x = -1, while 2 is smaller than any value of the functionin the neighborhood of x = 1.

The graph of this function, by the way, can easily be obtained from thegraph of y = x and the graph of y = I /x by addition of correspondingordinates, as indicated in Fig. Ill. 16.

Finally, we wish to discuss the celebrated example of the open square boxwith maximum volume which is to be found in most Calculus books eversince the differential calculus was invented. The problem is : Given asquare sheet of cardboard of sidelength 6. This sheet is to be folded intoan open box with square bottom such that it contains the largest possiblevolume.

In order to formulate the problem mathematically, let us refer to Fig.Ill. 17. We fold the cardboard along the dotted lines at a distance x fromthe edges. We thus obtain an open box with square bottom. The side-length of the square bottom is 6 - 2x and the height of the box is x. Thusthe volume is given by the following expression, which is a function of x:

V(x) = (6 - 2x)2x = 4x3 - 24x2 + 36x.

Page 214: ɷshawns sagan integral and differential calculus

202

Y

CHAP. III. RATES

x1

Fig. 11I.16

Now we have to choose the distance x of the fold from the edge such thatthe volume of the box becomes as large as possible. This means that wehave to choose x such that V(x) assumes a maximum. The derivative ofV(x) is given by

V'(x) = 12x2 - 48x + 36.

6

6

Fig. 111.17

Page 215: ɷshawns sagan integral and differential calculus

111.9. MAXIMA AND MINIMA 203

To find the values x for which this derivative vanishes, we have to solve thequadratic equation

or, after division by 12,

The solutions are

Hence,

12x2 - 48x + 36 = 0

x2-4x+3=0.

4±,/16-124±22 2

xl = 3, x2 = 1.

If we choose x1 = 3, then this means that we fold the square sheet in themiddle and obtain thus a "box" of volume zero. This is certainly thesmallest box we can possibly obtain. However, if we consider the othersolution x2 = 1, a simple argument will reveal that this leads to the largestbox obtainable.

Problems III.58-III.68

111.58. Find all extreme values of the functions(a) y = x3 - 3x2 + 3x - 1 (b) y = x3 - 3x + 2(c) y = 2x3 - 9x2 + 12x -4 (d) y = 2x3 - 3x2 - 36x + 12(e) y = 3x5 - 50x3 + 135x + 27 (f) y = 3x4 + 4x3 - 12x2 +4

and sketch the graphs of the functions.III.59. Find all extreme values of the function

y=2x_ I

and sketch the function.111.60. Find the extreme value of the function

1

J x2+I(For differentiation, use limit process.)

111.61. Show that the extreme value of the function in Problem III.60 is anabsolute extreme value; i.e., if it is a maximum, then it is the largest value that thefunction can possibly attain; and if it is a minimum value, then it is the smallestvalue that the function can possibly attain.

111.62. Find a necessary condition for the minimum of the function y =' /x2 + y12 + '(x - x2)2 + y22, where x2, y1, y2 are constants. (For differentia-tion, see Problems Ill.19 and 111.20.)

111.63. Find the extreme values of the function

_ 1

Y = , 2 + .1

and sketch the function. (For differentiation of 1/x3, use limit process.)

Page 216: ɷshawns sagan integral and differential calculus

204 CHAP. III. RATES

111.64. Given two numbers such that their sum is equal to a given number 2a.Choose the two numbers such that their product is a maximum. [Hint: Givenx + y = 2a. Choose x and y such that xy becomes a maximum. Since y =2a - x, we have xy = x(2a - x) which is to be maximized.]

111.65. A horsetrader wishes to encompass the largest possible rectangularcorral with a fence 240 yards long. What are the dimensions of the corral?

111.66. A collection is taken up in a country club with the object of purchasinga fence. The money raised will buy a fence of a total length of 1.6 miles. Whatare the dimensions of the largest possible rectangular golf course, if it is to befenced in on three sides with a straight river forming the supplementary bound-ary ?

111.67. A right triangle with given perimeter c is to be made as large as possible.What are its dimensions?

111.68. Sketch and discuss the function V(x) which represents the volume of theopen rectangular box in the example which we worked at the end of this section.

10. MOTION

Suppose a point moves along a straight line starting from a positionwhich is marked by s = 0, and reaches after t seconds the position

(111.25) s = s(t)

units away from the origin s = 0. We will agree to use negative values of sto indicate a position on the left (or below) the starting point and positivevalues of s to indicate a position to the right (or above) the starting point.Formula (111.25) expresses the position of the moving point as a functionof the time t. We call (111.25) the law of motion of this particular point.The structure of the function s(t) will depend on the type of motion it issupposed to represent.

For example, s=tdescribes a motion which is such that during any time interval of unitlength, the point will move through a distance of unit length. If we agreeto measure time in seconds and the distance in feet, then, the moving pointdescribed by the above formula will move 1 foot per second. If we have amotion of the type where the point covers equal distances in equal timeintervals (in our case, the point covers in time intervals of I second, dis-tances of I foot, in intervals of 3 seconds distances of 3 feet, etc., no matterat what particular time relative to the starting time of the motion we maycheck), we say that the point proceeds with a constant velocity where wedefine velocity as the ratio of distance divided by the time interval duringwhich the distance was covered:

(111.26) Velocity = DistanceTime

Page 217: ɷshawns sagan integral and differential calculus

111.10. MOTION 205

Thus, if a point travels through 7 feet every 12 seconds, then its velocitywill be ; ft/sec. Clearly, it follows from (111.26) that the dimension of avelocity has to be ft/sec in order to make the dimensions on both sides of(111.26) agree.

Before we proceed to more general cases, let us discuss one more ex-ample. Let

s = at+bft,

where a, b are constant numbers, be the formula describing the motion of aparticular point. Then the distance covered after to seconds is given byso = ato + b and the distance covered after t1 seconds (t1 > to) is given bys1 = at1 + b. Hence, the distance covered in the time interval from to tot1 is given by

s1-so=at1+b-ato-b=a(t1-to)and, according to (111.26), the velocity is

v = a(tl - to) = a ft/sec.tl - to

We see from this derivation that no matter what checkpoints to and tl wemay choose, we will always obtain a for the velocity. This shows that everymotion that is described by a linear function in t is a motion with constantvelocity. Later, we will also show the converse, i.e., that every motion withconstant velocity is represented by a linear function in t.

Our definition (111.26) of a velocity breaks down, however, the momentwe consider motions which are described by functions which are not linearin t. Let us consider the case

s=t2-3.Suppose we are interested in the velocity after 3 seconds. We have s(3) = 6.In order to apply (111.26) we have to know also the distance at a differenttime t1, which is s(t1) = t12 - 3. On invoking (111.26), we have

v-ti2-3-6-tie-9-t-1-3

1ti-3 tl-3which is a function of t1, i.e., depends on the second checkpoint. This isclearly nonsense, because the velocity after 3 seconds cannot possiblydepend on the behavior of the moving point tl - 3 seconds later, where t1is chosen quite arbitrarily.

We see from this example that we have to redefine velocity for motionswhich are not described by linear functions. Such a definition, in order tobe senseful, has to be such that the definition (111.26) follows as a special

Page 218: ɷshawns sagan integral and differential calculus

206 CHAP. III. RATES

case from this new, more general definition, or else we would be stuck withtwo different concepts of velocity which certainly is not a desirable situ-ation.

Suppose we have a point Po moving according to

s = s(t)

where the function s(t) may be any function that could conceivably describea motion. (A function of the type

t-1fort<0s(t) =

tfort>-0would certainly have to be excluded since it is hard to imagine a movingpoint-in the framework of classical physics-which suddenly covers att = 0 a distance of 1 foot in 0 seconds.)

It is our aim to give a definition of the velocity at a certain time to. Inorder to arrive at a reasonable definition, we will make use of (111.26) whichis applicable to motions with constant velocity. For this purpose we sub-stitute for the motion of Po as described in (111.25) another motion of apoint P1, which, although not the same, still comes very close to the motionof Po. We subdivide the time interval in which the motion is considered(0 < t < T, where T could also be infinity) into subintervals 0 < t < t1,tl < t < t2, t2 < t < t3, - - - , tk_1 < t < tk, - - - , and replace our movingpoint Po with another point P1 that moves as follows:

It covers the distance from s(0) to s(t1) in tl seconds by moving with aconstant velocity

s(t1) - s(0)tl

the distance from s(t1) to s(t2) in t2 - t1 seconds by moving with a constantvelocity

s(t2) - s(t1)

t2 - tl

the distance from s(tk_1) to s(tk) in tk - tk_l seconds with the constantvelocity

S(tk) - S(tk-1) , etc.tk- tk-1

The procedure which we followed here is interpreted graphically in Fig.111. 18, where the solid line in the space-time-coordinate system representsthe motion of the point Po and the broken line the motion of the point Pl.

Page 219: ɷshawns sagan integral and differential calculus

111.10. MOTION

S

A

%1//

i S(t7)

lt )s s

s(4) S(t) I,-S00-S(t)

ff s

¢ - Sl2)

tj t2 t3 to t5 t6 t7

Fig. III.18

207

t

We now imagine both points (Po which moves according to s = s(t) andP1 moving as just defined) to start at s = 0 at the same time t = 0. Clearly,both points will arrive at s(tl) at the same time, they will both arrive ats(t2) at the same time, , they will both arrive at s(tk) at the same time,etc. The only difference between the two motions is that, in between anytwo checkpoints, the two moving points are not necessarily abreast. Ingeneral, one of them will lag behind the other.

However, it is intuitively quite clear, that the smaller we choose the timeintervals tk_1 < t < tk, the smaller is the difference in the distance betweenthe two moving points in between checkpoints and we obtain a better andbetter approximation to the original motion as described by s = s(t), thecloser together we select the checkpoints. The velocity of Po at some time towill then be very closely approximated by the velocity of the second pointP1 in the time interval tk_I < t < tk which contains to. We can even simplifythe situation by choosing to as one of the division points, e.g.,

Then the velocity of P. at to will be approximately equal to the velocity ofthe other point P1 in the interval tk-1 < t < tk or the interval tk < t <tk+l, namely,

S(tk) - s(tk-1) or S(tk+1) - S(tk)tk - tk-1 tk+1 - tk

Page 220: ɷshawns sagan integral and differential calculus

208 CHAP. 111. RATES

and the approximation will be better, the closer t7._1 or tA.+1 are taken to to.This idea gives rise to the following definition of the velocity of Po at thetime to:

s(to + h) - s(to)(111.27) v(to) = lim

h

h is positive or negative.If we compare (111.27) with the definition of the derivative of a function

F(x) as given in Section 1,

F'(x) = lim F(x + h) - F(x)A- o h

we see that (111.27) is equivalent to

(111.28) v(t) = s'(t),

where we again write t instead of to. In words: The velocity is defined asthe derivative of the distance with respect to the time.

We can see immediately that the definition (111.26) is a special case of(111.28), since for s = at + b, we obtain s' = a. We wish to make quiteclear that (111.28) does not follow from (II1.26). The argument which weput forth leading us from (111.26) to ([11.28) was not a derivation, but rathera heuristic argument which did show us the way in our search for a reason-able definition of velocity.

The acceleration of a motion is defined as the "velocity of the velocity,"i.e., the rate with which the velocity changes as the time increases. Again,for a motion where the velocity increases at a constant rate (i.e., if thevelocity is a linear function of the time), the acceleration is defined as

Acceleration = Velocity increase

Time increase

If the quotient on the right is not independent of t, then we run into thesame difficulties as before and we have to search for a more general defini-tion which will apply to any motion. If we go through the same argumentas before for velocity and time instead of distance and time, we finallyarrive at the definition

(111.29) a(t) = v'(t) = [s'(t)]' = s"(t).

Since the acceleration as derivative of the velocity thus appears as the deriv-ative of the derivative of the distance, we call it the second derivative of thedistance and denote it as in (111.29) by s"(t).

We have seen in Section 5 that F'(x) = f (x) is equivalent to

ff(x) dx = F(x) + C

Page 221: ɷshawns sagan integral and differential calculus

111.10. MOTION 209

where C is an arbitrary constant. Hence, we can write (III.28) and (111.29)in the form

and

Hence,

and

fv(t) dt = s(t) + C

Ja(t) dt = v(t) + C.

ror

J

sot

v(u) du = s(t) - s(0)

a(u) du = v(t) - v(0).

From this it follows that

(111.30) s(t) = fotv(u) du + s(0)

and

(111.31) v(t) = f ta(u) day + v(0).0

Therefore, if we want to establish the formula of a motion uniquely,knowing the velocity, we have to prescribe the initial position (initial dis-tance) s(0), i.e., the distance from s = 0 which the point occupies when themotion starts. Similarly, if we want to establish the formula for the velocityuniquely if the acceleration of the motion is known, we have to prescribethe initial velocity v(0), i.e., the velocity with which the motion starts out.[If v(0) = 0, we say that the motion starts from rest.] Thus, if we know theacceleration of a motion and wish to obtain the distance formula, we haveto know the initial velocity and the initial distance in order to arrive at aunique result.

Mathematically this means that, if we know the second derivative of afunction (acceleration) and we want to find the function itself (distance),we have to prescribe a point through which the function is to pass and alsothe slope with which it is to pass through this point.

It follows readily from (111.30): if

then

v = c (constant),

ts = r c du + s(0) = ct + s(0),

0

i.e.. s is a linear function of the time.

Page 222: ɷshawns sagan integral and differential calculus

210 CHAP. III. RATES

s

rI I I

I I

ttj t2 t3 t4 t,5

Fig. III.19

Formulas (111.30) and ([11.31) are very important in the analysis ofphysical motion, while the formulas (111.28) and (11I.29) are merely ofacademic value, inasmuch as they enable us to establish the notion of avelocity and acceleration conceptually. The reason for the practical un-importance of formulas (111.28) and (111.29) in the analysis of motion shallbe discussed now.

In order to make use of (111.28), we have to know the distance as a func-tion of the time:

s = s(t).

How do we acquire knowledge of this function? Clearly, we know it fromexperiments, if this function is to describe a motion which is actually takingplace. Now, what do we really do when we establish such a formula experi-mentally, i.e., by carrying out measurements? We observe a certainmotion (e.g., freely-falling body) and measure either the distance after thelapse of certain time intervals terminating in t1, t2, t3, , or the elapsedtime when the moving point reaches certain distances s1, s2, s3, .

In either case, we will obtain a discrete and finite number of data whichcan be plotted in an s, t-diagram, as illustrated in Fig. 111. 19. In order tocompress this information (which may consist of 1 million points) into apractical and compact mathematical form, we join the points which repre-sent the measured data by a "smooth" curve, preferably one which can berepresented by a simple mathematical function [e.g., the data (0, 0), (1, 1),(2, 4), where the first coordinate represents the time and the second co-ordinate represents the distance, can be represented by s = t2.] In thisway we arrive at a representation s = s(t).

First, let us mention that this representation is certainly not unique. (Inthe example which we just mentioned in the brackets, s = Jt3 - -t2 + t

Page 223: ɷshawns sagan integral and differential calculus

111.10. MOTION 211

seems to serve the purpose just as well, inasmuch as it assumes the requiredvalues at t = 0, 1, 2.) Of course, we can arrive at unique results by mathe-matical trickery, i.e., by making additional hypotheses about the curvewhich is to join the given points, or by prescribing a certain method bywhich this function is to be secured. This means, however, that an elementis brought into the picture which does not rightly belong here, namely,mathematical assumptions and hypotheses. Mathematics has nothing todo with motion nor has motion anything to do with mathematics. Mathe-matics is very useful in describing physical phenomena; however, one hasto distinguish very clearly between mathematical assumptions that havephysical motivation and mathematical ad hoc assumptions which lackmotivation entirely but serve only to make everything come out nicely.

Now, in order to find the velocity of a motion for which the mathematicalformula s = s(t) was found, we have to differentiate this function. All wereally know about this function are ,its values at certain discrete pointstl, t2i t3, , tk. We do not know anything about the values of the functionin between. Of course, the formula s = s(t) will yield values for s for everyvalue of t, but we have no right whatever to expect the point really to bethere. Even if, in order to obtain more complete information, we increasethe number of checkpoints and decrease at the same time the lengths of thetime intervals between checkpoints, we cannot resolve our problem. Whatwas said before about the behavior of the function s(t) between consecutivecheckpoints can now be said in regard to the smaller intervals between thenew checkpoints, and we face basically the same problem.

Figure 111.20 illustrates the unreliability of the information which weobtain by taking the derivative of such an experimentally established func-tion. The same data are joined in three different ways by curves in solid,broken, and dotted lines. In all three cases the derivative at the point to isrepresented by the tangent line. We can see that the derivative can reallyhave any value between - co and oo, but we have no way of deciding whichone it ought to be.

The situation may become still clearer if we consider an example takenfrom "life" itself. Suppose a little green man from outer space comes toEarth and observes a pig standing behind a picket fence. All he really seesare a number of sections of the pig's body and, having had no previousexperience with Earth pigs, he will be unable to complete the picture of thepig mentally, provided he has a mind, because "almost anything" could bein between, for all the little green man knows.

This brings out another point we wish to make. We, in the same posi-tion, would have no difficulties whatever in imagining the hidden parts ofthe pig's body, because we have previous experience in this matter. Thesame point holds true for the physicist who has had previous experience

Page 224: ɷshawns sagan integral and differential calculus

212 CHAP. III. RATES

I I I >ttk t0 tk+1

Fig. 111.20

with motions. He is in a position to make additional hypotheses in regardto what can happen in between checkpoints and thus arrive at a satisfactoryrepresentation of the motion by a mathematical function that will stand upto differentiation and may yield useful results. However, in both cases-thepig as well as the motion-we have to be prepared for surprises, becausesomething could be in between after all which surpasses both our imagi-nation and previous experience.

So much about the unreliability of any results that are obtained fromformulas (111.28) and (111.29). The situation is quite different in regard tothe integral formulas (111.30) and (111.31). Here it does not matter so muchhow the function behaves in between checkpoints. The reader can see forhimself that all the areas under the three different curves in Fig. 111.20 arealmost the same.

Before closing this section, let us work two examples. Suppose we knowthat a certain motion follows the law

s=3t2+t+5ft.Then the initial distance is found for t = 0, s(0) = 5 ft, the velocity isfound according to (111.28) as

t, = 6t + 1 ft/sec

where the initial velocity, obtained for t = 0 is given by v(0) = I ft/secand, finally, the acceleration is found from (1I1.29) to be

a = 6 ft/sec-sec

Page 225: ɷshawns sagan integral and differential calculus

111.11. FREELY FALLING BODIES 213

Now, suppose we have a motion of which we know that it has the accel-eration

a = 3 ft/sec-sec

We know in addition that the initial velocity is v(0) = 12 ft/sec and theinitial distance is given by s(0) = 14 ft. Then we obtain from (111.31)

and from (111.30)

Problems 111.69-111.76

v = 3t + 12 ft/sec

s =3 t2+12t+14ft.

111.69. S = t2 + 3t + 4. Find v and a as functions of t.I11.70. s = 4t2 - 12t + 3. Find v at t = 3.111.71. What are v(0) and s(0) in Problem 111.69?111.72. s = 12t2 + 4t + 1. Find v(0).111.73. a = -2, find v and s if v(0) = 12, s(0) = 3. At what time t is v = 0?111.74. Given v = 12t + 3. Find a and s, if s(0) = 12.111.75. The functions s = t2 and s = t3 - t2 + t both join the same three

points (0, 0), (1, 1) and (2, 4). Compare the values of the derivatives of thesetwo functions at t = 0, i, 1, and 2.

111.76. a = -32 ft/sec-sec, v(0) = vo, s(0) = s0. Find the equation of motion.

11. FREELY FALLING BODIES

It is our experience that whenever we release an object (e.g., pencil, ball,typewriter) which has been suspended in some reasonable neighborhoodof the surface of the earth, it embarks on a straight motion directed down-wards and only comes to rest when it hits an obstacle in its path (e.g., table,gymnasium floor, surface of the earth). Although this phenomenon hasnot been observed on all the bodies that exist, it is nevertheless a fact thatall the bodies which have been subjected to such treatment have neverbehaved otherwise, i.e., none ever fell upwards, or sideways, or stayedunsupported in free space.

Although there is no logical reason whatever that all droppable but notyet dropped objects shall behave in the same way, men, even in primitiveand unsophisticated environments will-and as a matter of fact did-con-clude, on the basis of the available evidence, that it is a general physicalphenomenon that all bodies relatively near the earth's surface have thetendency to fall down if unsupported. We wish to point out that we neverwill be able to prove a law of such general character because a proof

Page 226: ɷshawns sagan integral and differential calculus

214 CHAP. 111. RATES

(physical proof, that is) would involve the verification of this law forall-and we really mean all-bodies, which is of course quite impossible.It would be very easy (logically, that is), on the other hand, to disprovethis law simply by demonstrating that it does not hold in one particularinstance. So, until somebody finds a body that falls sideways or upwardsor does anything but fall downwards along a straight line, we are compelledto accept this law as a reasonable working hypothesis.

Once we have accepted the working hypothesis that bodies, if unsup-ported, fall down, our inquiring mind pushes us further to investigate thenature of the descent. It is a widely spread custom in the entire civilizedand scientific minded world among children who dwell in city quarters toengage in a pastime which really should not be encouraged, however, doeshave merit in molding the innocent little minds of the perpetrators of thegame. We are speaking of a game which involves the dropping of a paperbag filled with water (the bigger the bag, the more gratifying the result!)from a window onto the street, preferably onto a bald, bare-headed passer-by. Everybody with a scientific mind whoever engaged in this game willknow that the higher the point of release, the more gratifying the effect,because the impact velocity (velocity at the moment of the hit) increases withthe height through which the paper bag has fallen after being released, andtherein lies a definite advantage of the twelfth floor dwellers over the thirdfloor dwellers.

While we can be reasonably sure that the Italian scientist, Galilei Galileo(1564-1642), did not engage in throwing or dropping water-filled paperbags (paper bags being a product of our present highly overdevelopedcivilization), he nevertheless recognized in carrying out a great number ofexperiments that the velocity increases as the distance through which theobject falls increases. He went further than that and determined experi-mentally the law of freely falling bodies, according to which the distance tseconds after the release is proportional to the square of the elapsed timeinterval t, the proportionality factor having approximately the value 16 ina foot-second measure system.

Thus we write

(11 I.32) h = 16t2 ft. (t in seconds).

Ever since this formula was established, it has been subjected to a greatnumber of tests ranging from simple high school experiments with thefalling machine to very sophisticated and complicated scientific scrutini-zation.

According to the formulas which we developed in the preceding section,disregarding for the moment what we said in connection with the differ-entiation of experimentally established functions, we obtain for the velocity

Page 227: ɷshawns sagan integral and differential calculus

111.11. FREELY FALLING BODIES

of the freely falling body

(111.33) v = dh= 32t ft/sec

dtand for the acceleration

(111.34) a = dt = 32 ft/sec-sec.

215

Now, if we could establish by an independent argument the fact that theacceleration of the freely falling body is 32 ft/sec-sec, then we could arriveat (111.32) via a logically satisfactory method by successive integrations of(111.34), using the formulas (111.30) and (111.31) of the preceding section.

This question-why an acceleration of 32 ft/sec-sec, or why an acceler-ation at all-leads us to a very deep problem, one which the physicistalways faces after having investigated a physical phenomenon from a quan-titative standpoint: the problem of searching for the cause.

What causes an object to embark on an accelerated downward motiontowards the earth? Nowadays, every high school student will smuglysupply the answer: gravity. This answer, however, is really no answer atall as long as it is not embedded in a more general system and explainablein more general terms. Sir Isaac Newton, the English physicist (1643-1727)asked himself this question many times, together with many other-notobviously related-questions such as why does the moon orbit around theearth and why do the planets orbit around the sun, and why specifically,along elliptical orbits as quantitatively described by Johannes Kepler?

After a great deal of thought and experimental work, Newton came upwith a new working hypothesis, namely: Between any two masses thereexists a force of attraction which causes the two masses to move towardseach other.

The first new and important concept that enters our discussion here isthe concept of a force. What is a force? It is something which is notdirectly accessible to measurements. It can only be explained and measuredin terms of its effects. We all have an intuitive notion of force. For thesake of the following argument, let us use this intuitive notion and set it inquotes. To set a mass motion, we have to exert a certain amount of "force."If the mass increases and we wish to cause the same effect as before, wehave to exert a greater "force." If the mass remains the same and we wishto impart a. greater acceleration, we also have to exert a greater "force."Thus, Newton was led to define force as jointly proportional to the massm and the acceleration a of the motion on which the mass embarks uponbeing subjected to the force in question:

(111.35) f = ma lb.ft/secsec.

Page 228: ɷshawns sagan integral and differential calculus

216 CHAP. III. RATES

in

r

Fig. III.21

9Now that the concept of force is clarified, we can go along with Newton insetting up a hypothesis about the magnitude of the force of attractionbetween two masses.

Newton conjectured (see also Fig. 111.21)

(111.36)f=Gm4

1.-

where M and m are the masses of the two objects involved, r is their centraldistance (distance from the center of gravity of the one mass to the centerof gravity of the other mass*), and where G represents the force of attrac-tion between two unit masses at a distance I unit apart. (This is a physicalconstant which has to be established experimentally. Its value depends, ofcourse, on the measure units which are chosen.)

We wish to point out that this law enables us to explain a great numberof physical phenomena, in the mechanics of heavenly bodies as well as insimple phenomena such as the freely falling body in a neighborhood of thesurface of the earth. It explains (in terms of this new and mysterious forceof attraction) why objects fall downwards and why planets move the waythey do. It"does not explain why bodies attract each other. The answer,"because they like each other" is just as good as any other answer. (111.36)can never be verified (established as the "ultimate truth"-whatever thatmight be-) just as we can never verify that it is true for all bodies that theyfall down. All we can do in regard to its truth is to show that it is false byfinding a phenomenon which is not consistent with this law. (See someremarks towards the end of this section about Einstein's gravitationaltheory.)

We will not go into a general philosophical discussion here because thatwould lead us far afield and go definitely beyond the scope of this treat-ment. All we wish to do here is to explain Galilei's law for freely fallingbodies (111.32) in terms of Newton's gravitational law (111.36) and therebydemonstrate how physical theories with a limited scope are embedded in

* The center of gravity is explained in Chapter IV, Section 5. For the time being,however, let us assume that the masses in and M are homogeneous spheres. Thentheir center of gravity is simply their geometric center.

Page 229: ɷshawns sagan integral and differential calculus

III. 11. FREELY FALLING BODIES 217

theories of much wider scope and how, in turn, many simple physical phe-nomena are explained in terms of general hypotheses.

We have an object of mass m at a perpendicular distance d from thesurface of the earth and release it from this initial position. According toNewton's law, the object of mass m and the earth of mass M attract eachother (disregarding the air resistance) with a force

(111.37)112Mf=G

(R + d)2

where R is the radius of the earth (see Fig. 111.22). On the other hand wehave, from (111.35),

f = am

where a is the acceleration of the downward motion of the object of massm. Hence it follows, from (111.35) and (111.37) after division by m, that

M(111.38) a = G

(R + d)2

We see from this formula that the acceleration a depends on the distance dof the falling object from the surface of the earth, and since this distancechanges as the object proceeds in its motion, the acceleration appearsultimately as a function of the time t and cannot possibly be constant, aswe assumed in the beginning. However, if d is small in comparison withR (e.g., 23 feet as compared to 4000 miles), which is the case for all practicalpurposes, the change in d from its initial value to its terminating value 0

Fig. H1.22

Page 230: ɷshawns sagan integral and differential calculus

218 CHAP. 111. RATES

will be so insignificant that its effect on a will not be detectible by the usualmeasuring procedures. Therefore, we neglect d entirely and use

(111.39)

for the acceleration of freely falling bodies which are released in a reason-able neighborhood of the surface of the earth. This formula will now besubjected to numerical evaluation. We find in tables*

G = 6.670 x 10-8 dyne (dimension of force in c-g-s-system)fi

Mass density of earth = 5.522 g/cm3

Mean radius of earth = 6371 km = 6371 x 105 cm

Thus we obtain for the mass of the earth

mass = mass density x volume

5.522 x 47rR3

3

4 x 5.5227r(6371 X 105)3

3

and consequently

a _ 6.670 x 10-8 x 4 x 5.5221x(6371 x 105)3= 982.7 cm/sec-sec

3(6371 x 105)2(Tr - 3.142).

Since 1 cm = 0.03281 ft, we obtain

a = 32.24 ft/sec-sec

to four significant figures. We point out that this value is arrived at byusing a mean value for the radius of the earth. At a geographic latitude of45° we obtain 32.17 ft/sec-sec. We see that the value of 32 as obtained byexperiments is, indeed, explainable in terms of Newton's general gravita-tional theory.

Now we can reverse our steps and start with

a = 32 ft/sec-sec,

* Standard Mathematical Tables, 12th ed. CRC, 1959, p. 16.t Note that the value of G can be determined without reference to freely falling

bodies. Cavendish (1731-1810) was the first to determine G by measuring the force ofattraction between two masses of known magnitude.

Page 231: ɷshawns sagan integral and differential calculus

111.11. FREELY FALLING BODIES 219

use formulas (111.30) and (111.31) with the understanding that the object isreleased from rest, i.e., initial velocity v(0) = 0, and the distance is measuredfrom the point of release downwards, i.e., s(0) = 0, and we arrive at

v =f 32 du + v(0) = 32t,0

e

s= J32htd u+s(0)=16t2,0

which is identical with (111.32).We also wish to point out the fact that the velocity of a falling body is

independent of its mass, a fact which may puzzle the layman and seem tocontradict sound reasoning, whatever that may be, yet finds a beautifulexplanation through this derivation.

That Newton's law explains the motion of a freely falling body in asufficiently small neighborhood of the earth (d very small compared to R)is, of course, not sufficient evidence to establish the validity of this law, andnothing really is, as we already know. However, that Newton could alsoexplain in terms of his law the planetary orbits around the sun along ellip-tical paths with the sun in one of the foci and even verify by computationthe experimentally established dimensions of these ellipses is quite impres-sive and made us accept Newton's law as a very successful workinghypothesis.

It should be mentioned, however, that Leverrier observed in 1845 that themajor axis of the elliptical orbit of the planet Mercury turns in the directionof the motion of the Mercury by 43" per century. This fact is notexplainable in terms of Newton's theory. Here we have, so to speak, theobject that falls sideways. The gravitational theory that was developed byAlbert Einstein at the beginning of this century explains this phenomenon.Inasmuch as it also explains all the other phenomena which thus far havebeen explained in terms of Newton's theory, it is to be substituted forNewton's theory.

Po. Specifically, it follows from Einstein's theory that the angle of rotationof the major axis of the elliptical orbit of a planet amounts to

67ra

a(l - e)where a is the gravitational radius of the sun, a the semi-major axis of theellipse, and s the eccentricity of the ellipse. Indeed, this formula yields 43"per century for the planet Mercury. A

Einstein's theory, which is now generally accepted, has stood up to teststhis far. Our past experience, however, indicates that someday something

Page 232: ɷshawns sagan integral and differential calculus

220 CHAP. III. RATES

may come up which will make the revision of our gravitational conceptnecessary and put a new theory in place of Einstein's theory.

Now let us return from our philosophical excursion into space back toearth and some practical problems.

In order to utilize the fact concerning the acceleration that a free bodyattains in the neighborhood of the earth's surface, we will introduce a co-ordinate system with a horizontal t-axis and a vertical h-axis, where h = 0shall represent the position on the surface of the earth and the height ismeasured vertically upward. Then a downward motion as enforced bygravity will decrease the height and we will have to attach a minus sign tothe acceleration :

a = - 32 ft/sec-sec.

Suppose an object is thrown vertically upward with an initial velocity ofv(0) = 64 ft/sec (if we were to throw it downward, the initial velocity wouldhave to be assumed negative) from an initial height (cliff, steeple. etc.) ofh(0) = 80 ft. Then, according to formulas (111.30) and (111.3 1) in the pre-ceding section,

tv= - f 32 du+v(0)= -32t + 64

0

andt

h =f (-32u + 64) du + h(0) _ -16t2 + 64t + 80.0

This formula represents the height t seconds after the motion is started.If we wish to know at what time the object will strike the ground, we

have to solve h = 0 for t. This yields the quadratic equation

-16t2 + 64t + 80 = 0with the two solutions

t1.2=-1,5.Clearly, the solution t1 = -1 has to be disregarded, because this valuerefers to a time before the motion was started* and we obtain t = 5 for theduration of the motion (time the object was in the air). If we want to findthe impact velocity, i.e., the velocity with which the object strikes theground, we have to substitute this value we found for t into the formula forthe velocity and obtain

v(5) = -32.5 + 64 = -96 ft/sec.

* The negative solution t = -1 has the following significance: if the motion hadbeen started 1 second earlier from the ground with an initial velocity the magnitudeof which is equal to the impact velocity, then it would have reached after I second(at t = 0) the height h = 80 and would have at this instance the velocity 64 ft/sec.

Page 233: ɷshawns sagan integral and differential calculus

111.11. FREELY FALLING BODIES 221

If we, finally, want to know how high the object climbed, we have to findout first at what time the object turned, i.e., reversed its motion. Clearly,at this point the velocity changes from a positive (upward) velocity to anegative (downward) velocity, and since the function representing thevelocity is continuous, it has to be zero at the time at which the object turnsback. Therefore, we have to solve the equation v = 0, i.e.,

-32t + 64 = 0

and obtain t = 2, i.e., after 2 seconds in the air, the object reached thehighest point. If we substitute this value into the formula for h, we have

Itllla` = -16.4 + 64.2 + 80 = 144 ft.

The same result can be obtained by a purely mathematical argument,utilizing the theory of maxima and minima. We wish to find the maximumvalue of the function

h(t) = -16t2 + 64t + 80.

We know from Section 9 that, if h(t) has a maximum at a certain point, itis necessary that

Now,h'(t) = 0.

h'(t) = r(t) = -32t + 64 = 0and here we are again.

In general, the formula for the vertical motion of an object in the gravi-tational field of the earth is

h = -16t2 + v(0)t + h(0),

as the reader can easily verify by application of (111.30) and (111.31) toa = -32 ft/sec-sec.

Problems 111.77-111.83

111.77. Evaluate formula (111.38) for

(a) d = 20 feet (b) d = i mile (c) d = 200 miles

and compare your results with the value for a which is obtained in the text. Usethe same values for mass density and radius of the earth as in the text and observethat only four significant digits are given.

111.78. The neutral point between earth and moon is the point where an objectis not subject to any gravitational forces, i.e., where the force of attraction exertedby the moon is equal to the force of attraction exerted by the earth. Meancentral distance between earth and moon = 60.3 radius of earth. Mass of moon_ o mass of earth. Find the distance of the neutral point from the surface ofthe earth.

Page 234: ɷshawns sagan integral and differential calculus

222 CHAP. III. RATES

111.79. An object is thrown upward with an initial velocity of 32 ft/sec froman initial height of 64 feet. How high does it go?

IH.80. An object is thrown downward with an initial velocity of 64 ft/sec. froman initial height of 160 feet. Find the impact velocity.

111.81. We wish to throw an object vertically upward from the surface levelof the earth so that it reaches a height of 320 feet. What initial velocity do wehave to impart?

111.82. A projectile is to be shot to the moon. In order for it to reach themoon, it has to pass the neutral point. Therefore, its initial velocity has to begreater than the one which will make it reach the neutral point. Find thesmallest possible initial velocity that will accomplish the purpose. (Note that weassume here that there is no air resistance, that a does not change its value duringthe entire process, and that earth as well as moon are throughout the entire tripof the projectile considered to be at rest relatively to each other.)

111.83. Suppose you are lurking in a window on the twenty-first floor (189 feetabove the street) with a paper bag full of water and you see the universitypresident (who is 6 feet tall) approach at a rate of two yard-steps per second.(a) How far from the vertical projection of the posed paperbag onto the streethas the victim to be when you let go in order to achieve a perfect hit? (b) What isthe impact velocity? (c) Verify your results by carrying out the experiment.(d) Apologize to the victim.

12. SNELL'S LAW OF REFRACTION

We will study in this section a very important application of the theoryof maximum and minimum values to the propagation of light. Our investi-gation is based on a hypothesis originated by Pierre Fermat (1608-1665).Known now as Fermat's principle, it states that light, in traveling from onepoint P to another point Q, chooses a path along which it can cover thedistance in the shortest possible time. This law is subject to certain modi-fications which we will skip for the moment, but which we will discussbriefly at the end of this section.

In addition to this principle, we make use of the experimental fact thatthe velocity of light in a homogeneous isotropic medium* is constant andis different in media of different density. The velocity of light depends onthe density of the medium, just as the velocity of a boat depends on thedensity of the water through which it travels. For example, a boat whichis capable of traveling at 30 knots in clear water cannot come near thisvelocity in the Salt Lake in Utah.

If both points P and Q, which are to be joined by a light ray, are locatedin the same medium and there is no obstacle between them, there is noproblem because the velocity of light is constant within the same medium,

* Homo; eneous means that the physical properties of the medium are independentof the point selected, and isotropic means that the physical properties are independentof the direction selected.

Page 235: ɷshawns sagan integral and differential calculus

111. 12. SNELL'S LAW 223

Fig. M.23

and light will choose the shortest geometric path, joining these two points,namely, the straight line. (Observe that as far as Fermat's principle isconcerned, it makes no difference whatever whether we consider light as acorpuscular motion or a wave motion or both.)

Suppose, however, that P is located in a medium in which the lightvelocity is u ft/sec and Q is located in a medium in which the light velocityis v ft/sec. Let us assume further that the two media are separated by aplane surface. The following experiment will demonstrate what will happenin this case.

We fill a tank with water which is dyed green (or any other convenientcolor) and place a thin-beamed light source (e.g., a focused flashlight)somewhere above the water surface so that the beam of light hits the watersurface at an acute angle (see Fig. 111.23). Then we black out all lightexcept that coming from our light source and generate smoke or dust in thevicinity of the water surface. We will observe that the lightbeam, whichunder the stated conditions should be clearly visible, will travel along astraight line from the source to the water surface and along a straight linefrom the water surface to the bottom of the tank. Upon entering thewater, however, the beam will experience what is called a refraction, i.e., itwill show a break as indicated in Fig. II1.23.

Let us now analyze this situation mathematically in terms of Fermat'sprinciple. We choose our coordinate system so that the x-axis lies in thewater surface and x- and y-axes lie in the same plane as the light beam. Letus place the point P (light source) on the y-axis, at a distance yl from thewater surface (see Fig. 111.24). The point Q lies in the plane determined byP and x-axis and has coordinates (x2, y2) where Y2 < 0.

The beam of light as represented by the broken line PRQ will enter thewater at a point R(x, 0) where x is such that the time required to reach Qis a minimum. The angle between the incoming ray and the perpendicularto the water surface is denoted by a and the angle between the outgoing

Page 236: ɷshawns sagan integral and differential calculus

224

Fig. III.24

CHAP. III. RATES

x

ray and the perpendicular is j9. Let u be the velocity of light above thewater surface and let v be the velocity of light in the water. If we call thedistance PR = dl and the distance RQ = d2, then the time required to gofrom P to Q is given by

T=dl+d2,

u v

because it follows from (111.26) that

DistanceTime =Velocity

for motions with constant velocity.From the distance formula we obtain

andd2 = (x - x2)2 + Y22.

Hence, our problem is to find x such that

T(x) _x2

+ J12 + V (x -x2)2 + Y22 minimum.u V

Page 237: ɷshawns sagan integral and differential calculus

111.12. SNELL'S LAW 225

A necessary condition for x to make T(x) a minimum is that its firstderivative vanish:

(111.40) T'(x) = 0

We have, according to Problems 111.19 and 111.20,

d /x2 +y12 = x =

xdx Vx2 + y12 dl

and

V(x-x2 )2-{- y22= x-X2 x2-xd x '(x - x2)2 + Y22 d2

Since

and

we obtain for (111.40)

and from this

x- = sin ad1

x2 - x

d2= stn 15,

sing sin fl

it v

(IIL41)sin a it

sin F v

i.e., the ratio of the velocities is equal to the ratio of the sines of the anglesof incoming and outgoing ray with the perpendicular. This is Snell'sThe ratio in (111.41) is called the index of refraction.

Formula (111.41) yields an indirect method of testing Fermat's principle.For example, if the light velocities u and v in two media are determinedexperimentally, then the ratio u/v can be computed and an experiment, suchas that outlined in the beginning of this section where the angles a and ,8 aremeasured, will substantiate the predicted result. If we are reasonablysatisfied with the validity of this law, we can use it to find the light velocityin a certain medium if we know the light velocity in air by carrying out thetank experiment and measuring the refraction angles a and #. For example,we know that the velocity of light in air is

u(air) = 2.998 x 1010 cm/sec.

We wish to determine the velocity of light in water by carrying out thetank experiment. We measure the angles as

a = 40°48', 9 = 30°.

Page 238: ɷshawns sagan integral and differential calculus

226 CHAP. III. RATES

Fig. 111.25

Hence, sin a = 0.665 - f and sin 9 = Thus, the refraction index turnsout to be sin a_4

sin fi 3

With u = 2.998 x 1010 cm/sec, it follows that v (velocity in water) is

v = 3 x 2.998 x 1010= 2.248 x 1010 cm/sec.

In order to be in a better position to judge this difference in velocities, weassume the velocity of air as approximately 186,000 miles per second. Thenwe obtain for the velocity of light in water 139,000 miles per second, amarked difference.

We will conclude this section with a short discussion of an experimentwhich, upon superficial examination, seems to contradict Fermat's mini-mum principle. We place a lightsource at a point A and a parabolic mirroropposite A as indicated in Fig. 111.25. We assume the light source iscapable of emitting light in all possible directions. By inserting a mirrorprobe at the focus F of the parabolic mirror, we will observe that lightarrives at this point.

It can be further established, by blacking out all other light, that it arrivesthere along a straight line directly from A, as we expect in view of Fermat's

Page 239: ɷshawns sagan integral and differential calculus

II1.13. THE CYCLOID 227

principle. However, if we black out this particular ray, we will observe, byturning the mirror in F, that light still arrives at F and we can trace thislight by skilful experimentation to the parabolic mirror and from therealong a line parallel to the axis of the mirror to the source A. The velocityof the light along both paths is the same. So it would follow that here lightchooses the path ABF which does not yield a minimum for the time re-quired to reach F from A, since the minimum clearly is rendered by thedirect path from A to F. However, it can be shown that the path ABF doesyield a minimum as compared to other paths which reach F via reflectionin the mirror (see Problems 111.85 and 86). We see that we have here thesame situation as we encountered earlier in dealing with ordinary extremevalue problems where the value of x that satisfies the vanishing conditionfor the first derivative does not necessarily yield a minimum; and if it doesyield a minimum, then it is usually a relative minimum, i.e., it is a smallestvalue compared to values obtained in a neighborhood, as indicated by thebroken lines in Fig. 111.25.

Problems 111.84-111.87

111.84. Find the derivatives of

y= \/(x-1)2+(x2- a)2and

y = Vx2 + (x2 - 4)2by the limit process.

111.85. Given a parabola y = x2 which has its focus at (0, ¢). Join the point(1, a) where a > 1, with a point (x, y) on the right branch of the parabola andthis point, in turn, with the focus. Observe that y = x2 holds for points on theparabola, and express the length of this path as a function of x.

111.86. Show that x = I makes the derivative of the function in Problem111.85 vanish. Interpret your result and make a sketch. (For differentiation, usethe results in Problem 111.84.)

111.87. Assume the velocity of light in air as 186,000 miles/sec. Find the velo-city of light in ethyl ether, glycerin, and bromine. The refraction indices are1.351, 1.474, 1.654, respectively.

13. THE CYCLOID

We will devote this section to the discussion of a remarkable curve whichis quite simple in nature, but which possesses rather startling properties.Even though it would be far beyond the scope of this book to support bymathematical arguments all the statements which we are about to make, westill think that a narrative account should be of interest to the reader andprove quite stimulating.

Page 240: ɷshawns sagan integral and differential calculus

228

Fig. III.26

CHAP. III. RATES

The curve which we are about to discuss is the so-called cycloid. Thiscurve can be generated by a simple process, namely, by rolling a circle alonga line and observing a fixed point on the circumference of the circle through-out this process. The curve which is traced by this point when the circlerolls along is the cycloid. It is represented in Fig. 111.26. We will notindulge in trying to establish a mathematical representation of this curvefor this is not quite simple. Instead we enter right away a discussion of the"startling" properties of this curve.

John Bernoulli, a famous Swiss mathematician who lived and worked inthe late 17th and early 18th centuries, posed the following problem "to themathematicians of the world to give their consideration" : Given two pointsPz and P2 in a vertical plane but not so that the one is vertically below theother. These two points are to be joined by a curve y = f (x) which is suchthat the time required for a mass point under the influence of gravity toslide from Pi to P2 (without friction) along this curve shall be a minimum(see Fig. 111.27). If we assume that the motion along the curve y = f(x) isfrictionless, the only force exerted upon the sliding masspoint is the gravi-tational force which will impart the constant acceleration a = 32 ft/sec-secin the vertical downward direction upon the masspoint. (We neglect airresistance, of course.) Hence, we can apply our law of freely falling bodies

x

Y

Fig. III.27

Page 241: ɷshawns sagan integral and differential calculus

111.13. THE CYCLOID 229

and we obtain, with the understanding that the y-axis points downwardas indicated in Fig. 111.27,

y = 16t2and for the velocity

v=dy=32t.dt

These two formulas allow us to compute the velocity v in terms of thevertical distance y and we obtain

(111.42) V = 8\/y,

i.e., the velocity of the vertical projection of the motion is directly propor-tional to the square root of the vertical distance through which the mass-point has moved.

We will now outline how Bernoulli obtained a solution to this problem.The means which he employed are not quite legitimate; however, thesolution he arrived at is correct and can now be verified by more modernmathematical methods.

Instead of considering the gravitational "medium" as one that changescontinuously as the height changes, Bernoulli considered it as consistingof a number of layers in each of which the velocity v is considered asconstant (see Fig. 111.28) and replaced the curve along which the masspointis supposed to slide by a polygon with vertices in the boundary planesbetween two adjacent layers.

Let us investigate now the boundary plane between the kth and the(k + 1)th layer. This plane goes through the point x = 0, y = yk on they-axis and is perpendicular to the y-axis. Throughout the kth layer wedenote the velocity which is assumed to be constant by rk, and throughoutthe (k + 1)th layer by vk+l.

x

Y

Fig. 1 1.28

Page 242: ɷshawns sagan integral and differential calculus

230

In view of (111.42), we have

and

vk = 8Vyk

CHAP. 111. RATES

Vk+1 = 8 `' yk+l

Now Bernoulli makes the ingenious, but hardly justifyable, hypothesis thatthe sliding masspoint will behave when entering a new layer like a lightraywhen penetrating through the border plane between two media, i.e., heinvokes Snell's law (111.41) sin a_u

sink v'where u is the velocity of light in one medium and v is the velocity of lightin the other medium. a is the angle of the incoming lightray with the per-pendicular and j is the angle of the outgoing lightray with the perpendicularto the border plane.

Let us apply this law to the situation at hand: we denote the angle of theincoming path by a7; and the angle of the outgoing path by flk+1 (see Fig.111.29). We note that

13k+1 = ak+1

i.e., the angle of the outgoing path is the same as the angle of the incomingpath at the next following border plane at y = yk+1

According to Snell's law

(I11.43)sin ak; sin ak = 8 Jyk =sin fl.. sin ak+1 yk+1 % Yk+1

Hence, we can conclude that the sine of the angle of the incoming path withthe perpendicular is proportional to the square root of the vertical distancefrom the level of departure:

sin OCk = C1/yk

yk

V = Vk+1

y

Fig. III.29

Page 243: ɷshawns sagan integral and differential calculus

111.13. THE CYCLOID 231

Y

Y

Fig. M.30

where C is a proportionality constant, the value of which depends on thecoordinates of the endpoint P2.

Now we put forth the traditional argument of the integral and differentialcalculus: we let the number of gravitational layers between the level ofdeparture and the level of arrival increase and, at the same time, we lettheir thickness decrease to zero. This leads to the conclusion that the curvey = f(x) which is the solution of our problem, has to intersect the perpen-dicular at any point at an angle a which satisfies the relation

(111.44) sin a = C1/y.

(see Fig. 111.30). The angle between a curve and a line is defined as theangle between the tangent line to the curve and the line at the intersectionpoint. On the other hand, if # is the angle of the tangent line with thehorizontal, then the tangent of this angle is equal to the derivative at thispoint (rise/run).

Since

(3=a+90we obtain

y' = tan (a + 90) = sin (a + 90)cos (a + 90)

Since sine and cosine are complementary functions (see Appendix 111,Section 3) and since the sine is odd and the cosine even, it follows that

cos (a + 90) = cos (90 - (-a)) = sin (-a) = - sin a,

sin (a + 90) = sin (90 - (-a)) = cos (-a) = cos a.

Page 244: ɷshawns sagan integral and differential calculus

232 CHAP. III. RATES

Hence, if we also observe that sine a + cost a = 1, we have

cos a -,/1 - sine ay = - _ -sin a sin a

From (111.44) we obtain now

(III45)C2y

CJy

i.e., the curve y = f(x) which provides the path of shortest descent is to besuch that its derivative is related to the function itself according to (111.45).Such a relation is called a differential equation, because it is an equation foran unknown function that also contains the derivative (differential quotient)of the unknown function.

With our present knowledge it would be impossible to solve this equationfor y = ft x). However, with the background usually provided by an intro-ductory course in differential equations it is merely a routine matter to showthat the cycloid is the solution of this problem.

Since the cycloid emerges as the curve of shortest descent, it also iscalled the brachistochrone, which is a combination of the two Greek wordsjpaxccToc for shortest and xpovos for time.

But the cycloid has another property which makes it a remarkable curveindeed. In order to discuss this other property, let us build a model: Wedraw a cycloid on a board (j inch or ¢ inch thick) by letting a circle roll ona ruler and tracing the path of a fixed point on its circumference. Then wecut the board along the cycloid and mount it on another board as indicatedin Fig. 111.31. This board is supported in the back so that it assumes aninclined position as indicated in Fig. 111.31. Now we take two steelballs,place one in the position marked by A in Fig. 111.31 and the other in a posi-tion marked by B and hold them in these positions with two electromagnets

Fig. IU.31

Page 245: ɷshawns sagan integral and differential calculus

111.14. METHOD OF LEAST SQUARES

Fig. 111.32

233

which are operated by the same switch. Now we are ready to take bets ona handicap race: which one will arrive first at the bottom 0? You may notbelieve it, unless you really carry out the experiment: both will arrive at0 simultaneously-so, the house cannot possibly loose! No matter fromwhich position we release a masspoint on a cycloid, the time required forthe descent is always the same. So the cycloid not only is the curve of short-est descent (brachistochrone), it is also the curve of equal descent (tauto-chrone) as was first discovered by the Dutch physicist Ch. Huygens(1629-1695).

The model which we built can now also be used to illustrate the brach-istochrone property. The portion which we cut out initially can be used tosimulate two curves which are not cycloids and fitted into one-half of themodel, as indicated in Fig. 111.32. Then it will appear that whenever westart two steelballs at the same vertical distance from the bottom, the oneon the cycloid will always arrive at 0 first. The students are urged to reallybuild such a model. This is one of the simplest and still most impressivemathematical models there is.

14. THE METHOD OF LEAST SQUARES

Suppose a physicist or engineer carries out an experiment that involvestwo variables: an independent variable x (e.g., the time) and a dependentvariable y (e.g., temperature, velocity, etc.) Suppose he knows that withinthe framework of the theory that applies to this particular experiment, therelation between x and y is a linear one:

y= ax + b,

but he does not know the values of the constants a and b which are to bedetermined from the experiment. No experimental setup is ideal and nomeasurement is perfectly accurate. So, eventually, he will come up with anumber of data which he tabulates. This table may look somewhat likeTable III. I.

If we plot the pairs of corresponding numbers (x, y) from Table 111. 1in a Cartesian coordinate system, we obtain the points in Fig. 111.33. Due

Page 246: ɷshawns sagan integral and differential calculus

234 CHAP. 111. RATES

Table 111.1

x J

0 -41 -32 -33 -14 0

5 1

6 2

7 2

to the imperfection of the experiment and the inaccuracies of measure-ments, these points do not lie on a straight line, but with some good willwe can state that they cluster about a straight line.

The question which arises now is: How do we fit a straight line throughthese points which will fit the points best and represent the "true" relation-ship between x and y as closely as possible? One possibility is to pick outthose points which do lie on a straight line and join them by this line (seesolid line in Fig. 111.33). This means, of course, that we accept those pointsthat happen to lie on a straight line as accurate data and reject all the otherpoints as inaccurate.

The reader can easily see that there is no justification whatever for sucha procedure because the data selected are probably just as inaccurate as theones which we rejected. For example, consider the case where 4 of the 8

Fig. 111.33

Page 247: ɷshawns sagan integral and differential calculus

111.14. METHOD OF LEAST SQUARES 235

points lie on one line and the remaining 4 points lie on another line. Whichone do we accept ? Flip a coin ? Most likely the true data, which we wouldobtain only under ideal circumstances making accurate measurementspossible, will lie somewhere in between the ones which we actually obtained.Now we are back to our question: How do we choose a line such that thepoints representing the data cluster about the line as "closely" as possible?

Let us magnify Fig. 111.33 somewhat and consider the line L which passesbetween the given points as indicated (see Fig. 111.34). As a measure of thedeviation of the measured data from the line L we may consider the differ-ences between the measured y-value yk that corresponds to xk and the y-value that is obtained if we substitute Xk into the line equation

(111.46) y = ax + b.

We obtain with x1 = 0, x2 = 1, x3 = 2, etc.

d1=yl-ax1-b= -4 - bd2=y2-axe-b=-3-a-b

d8=y8-ax8-b=2-7a-b.In general, we write

dk=yk - axk - b.

Clearly, these quantities dk are positive or negative, depending on whetherthe kth measure point lies above or below the line L, and it is also clear thatIdki represents the vertical distance of the kth measure point from the lineL. We may think naively that we can obtain the best fit if we choose thecoefficients a and b such that the sum of all the dk's is zero. That this isreally a very poor procedure of obtaining a fitting line can be seen fromFig. 111.35, where the sum of the dk's is zero, but the line L does not fit thepoints at all.

However, if we consider the sum of the distances I dkI instead of the sumof the dk's, we will certainly obtain a good fit when we try to make this sumas small as possible through proper choice of a and b. Thus our problemcan be formulated as follows.

Choose a and b in y = ax + b such that

becomes a minimum. This procedure would yield a quite satisfactoryresult. It has, however, the disadvantage that absolute values are difficultto handle in computational processes. So we do the next best thing and

Page 248: ɷshawns sagan integral and differential calculus

236 CHAP. III. RATES

require that the sum of the squares of the distances shall become a mini-mum: d12 + d22 + d32 -F . . . + dg2 -* minimum

(observe that Jdkl 2 = dk2) through proper choice of a and b. Now, in orderto formulate our problem mathematically in greater generality, let us getaway from our example for the time being, and assume that we havemeasured n values Yl, y2, y3, . ' ' , yn which correspond to the n valuesx1i x2, x3, , xn of. the independent variable. Then, we have to choose aand b so that

(II1.47) d12 + d22 + d32 + ... + dn2 = dk2k=1

becomes a minimum. Since

dk=yk - axk - b,we obtain for (111.47)

(111.48) (yi - ax1 - b)2 + (y2 - axe - b)2+ (y3 - ax3 - b)2 + ... + (yn - axn - b)2

and our problem is, if we also use the more sophisticated sum notation, thefollowing one.

Y

Fig. M.34

Page 249: ɷshawns sagan integral and differential calculus

Ill. 14. METHOD OF LEAST SQUARES 237

5 Idr

dl

d3d2

Fig. 1II.35

Choose the coefficients a and b such that

d7

(111.49) .L, (yk - axk - b)2k=1

becomes a minimum. Clearly, (111.49) is a function of two variables : aand b.

So far we have dealt only with functions of one variable and also onlywith the minimum of functions of one variable. We will see in the nextsection, however, that it is quite easy to find a necessary condition for theminimum of a function of two variables in terms of the theory which wedeveloped for the minima (and maxima) of a function of one variable.

Problems 111.88-111.92

III.88. Expand the expression in (111.49) for the following data: (0, 1), (1, 1),(2, 3), (3, 4), (4, 4), (5, 5), (6, 7), (7, 9).

III.89. Find b in y = x + b such that (111.49) for the data in Problem 111.88becomes a minimum. Observe that this is a minimum problem for a function ofone variable, b, only.

I11.90. Given the data (0, 1), (4, 3), (8, 5). Find the equation of all lines forwhich d1 + d2 + d3 = 0.

111.91. Evaluate the expression in (111.49) for the data in Table 111.1 with

(a) a = 1, b = -4 (b) a 191b --A12A

Which pair of coefficients yields the better fit?

Page 250: ɷshawns sagan integral and differential calculus

238 CHAP. III. RATES

111.92. Given the function of two variables z = x2 + y2 + 1. Assume thatthe minimum is attained for y = 0 and some x. For what x does the functionattain the minimum under this assumption on y? What is the value of the func-tion for the x which you found? Is it the minimum of the function?

15. MINIMUM OF FUNCTIONS OF TWO VARIABLES

We consider the function

(111.50) z = AX, y)

where x and y are independent variables and z is a dependent variable.(111. 50) is to be understood in the following sense: For every pair of values(x, y) which may be restricted to a region or may range over the entire x, y-plane, there corresponds a value z. As an example you may consider a thinrectangular plate which is heated at the edges. The coordinates x and ydetermine a point on the plate and the corresponding dependent variablez may stand for the temperature at this point.

Let us assume that the function f (x, y) assumes its smallest value at thepoint Po(xo, yo) :

minf(x, y) = f(xo, yo).

Now: What condition(s) has (have) to hold at this point?Suppose we substitute for y the value yo in (111.50). Then we obtain a

function of one variable, namely x, z = f (x, yo) = F(x), which, accordingto our assumption, assumes its minimum value at x = x0.

If the function z = F(x) has everywhere a derivative, then it is necessaryaccording to the theory which we developed in section 9 that

(III.51) F'(xo) = 0.

So we see that the derivative of f (x, y) with respect to x has to vanish atPo(xo, yo) iff(x, y) assumes its minimum value at this point.

If we would denote the derivative off (x, y) with respect to x by a prime,as we used to in dealing with functions of one variable, we would run intodifficulties in case we should forget how this derivative is to be understood.To avoid any ambiguity in notation, we therefore denote the derivative ofa function of two variables with respect to one of the variables by a sub-script as follows :

d

dxf (x, y) = MX, y),

df(x, y) =A(x, Y) -

Page 251: ɷshawns sagan integral and differential calculus

111.15. MINIMUM OF FUNCTIONS OF TWO VARIABLES 239

Thus, we can write the condition (111. 51) in the form

(111.52) ff(xo, yo) = 0.

Now we substitute for x the value x0 in (111.50) and repeat our argument:We obtain thus a function of one variable only, namely, y

z = f (xo, y) = G(y)

of which we know that it assumes its minimum for y = yo. Hence, weobtain the necessary condition that

(111.53) G'(yo) = 0,

or, written in terms off and the new notation which we introduced :

(111.54) yo) = 0.

Thus we can state: If f(x, y) assumes its minimum value at the pointP0(xo, yo), and if its derivatives with respect to x and with respect to y existeverywhere, then it is necessary that

(111.55) .(x0, yo) = 0, ffv(xo, yo) = 0.

So, in order to find a point where the function f(x, y) could have a mini-mum, we write down equations (111.55), solve for x and y (which is simplein case these equations turn out to be linear), and try to establish afterwards,by an independent argument, whether the function has indeed a minimumat this point. We will not enter such a discussion here because this issomewhat involved and quite unnecessary for our purpose. The functionswe are going to deal with will be such that we can see by inspection thatthey can only have one minimum and it will turn out that the equations(111.55) will have only one solution. It then seems reasonable to assumethat this solution will yield the minimum value.

As a first application, let us consider the simple example

z=x2"+2.

We know that the smallest value this function can possibly assume is thevalue 0, and we can also see that the value 0 is obtained for x = 0, y = 0.This is indeed the solution of the equations (f II.55) which read in this case

2x = 02y=0.

Page 252: ɷshawns sagan integral and differential calculus

240 CHAP. III. RATES

As another application, let us consider a problem of the nature discussedin the preceding section, but let us simplify it to three data:

We try to fit the line y = ax + b through these points such that (111.49)becomes a minimum. (111.49) turns out to be the following function of thevariables a and b:

f(a, b) _ (yx - ax, - b)2 + (y2 - ax2 - b)2 + (y3 - ax3 - b)2

=(1 - b)2 + (2 - a - b)2 + (2 - 2a - b)2= 3b2 + 6ab + 5a2 - 12a - 10b + 9.

Hencefa(a,b)=6b+ l0a- 12=0fb(a,b)=6b+6a- 10=0.

The solution of this system of two linear equations in two unknowns is

a=1, b=6.2

Y

2

(0, 1)

(1. 2). (2 2)

1 2

Fig. M.36

-x

Page 253: ɷshawns sagan integral and differential calculus

111.16. METHOD OF LEAST SQUARES 241

is the one which fits best the points representing the given empirical data inthe sense that the sum of the squares of the deviations dk assumes a mini-mum. We therefore call this method the "method of least squares" andsay that the line is fitted best "in the sense of the least square principle".

In Fig. 111.36 we have drawn the line which we obtained as solution toour problem and have indicated also the points representing the given data.

Problems 111.93-111.96

111.93. Determine the points where the function z = 16x2 + 4xy + y2 + 1could possibly have a minimum.

111.94. Suppose we know that z =,f (x, y) has a maximum at the pointP1(x1i y1). Derive a necessary condition.

111.95. Suppose we know that the function z =./* (s, t, u, v, w) has a minimumat the point (so, to, uo, vo, wo). Derive necessary conditions.

111.96. Fit a line y = ax + b through the points (1, 2), (3, 3), (5, 3), (7, 4)in the sense of the least square principle. t16. THE METHOD OF LEAST SQUARES (continued)

We are now fully prepared to deal with and solve the problem which weposed in section 14, namely, to find two constants a and b such that thefunction

n(111.56) f (a, b) = I (yk - axk - b)2

k=1

assumes a minimum value, where the xk and yk (k = 1, 2, 3, , n) aregiven numbers.

We know from the discussion in the preceding section that iff(a, b) hasa minimum value for a certain pair (a, b), then it is necessary that the de-rivatives off with respect to a and with respect to b vanish for these valuesa, b :

(III.57a) ,,a, b) = 0(III.57b) fb(a, b) = 0.

In computing f, and ft, we remember that the derivative of a sum is equal tothe sum of the derivatives (see Section 2). We note also that every term ofthe sum (111.56) is of the form

(111.58) (yk - axk - b)2= yk2 -l- a2xk2 + b2 - 2axkyk - 2byk -l- 2abxk,

k = 1,2,3, ..,n.So all we have to do is differentiate the general term (111.58) and then takethe sum of these derivatives from k = 1 to k = n. We obtain for the

Page 254: ɷshawns sagan integral and differential calculus

242 CHAP. 111. RATES

derivative of (111.58) with respect to a:

d

da(yk - axk - b)2 = 2axk2 - 2xkyk + 2bxk = 2(axk2 - XkYk + bxk).

Thus (III.57a) now readsn

2 G (axk2 - xkyk + bxk) = 0.k=1

If we divide by 2 and split up the sum into its components, we obtainn

91(111.59) a I xk2 + b xk - L xkyk = 0.

k=1 k1 k=1

Similarly, we obtain

d ax b)2 = 2b - 22 2ax

and, consequently, we have for (III.57b)n

2 1 (axk - yk + b) = 0k=1

or

n n n(111.60) a xk - yk + b I = 0.

k=1 1;=1 k=1

(111.59) and (111.60) constitute a system of two linear equations in twounknowns, a and b, which has, in general, a unique solution (see ChapterI, Section 6).

If we want to solve this system, we have to evaluate first the coefficientsit it it 9L

I xk, I yk, I xkyk and ]E xk2. This can most efficiently be done by thek=1 k=1 k=1 k=1following scheme, where we use again the data from Table III.I in Sec-tion 14:

xk ykxk

XkYk

0 -4 0 0

1 -3 1 -32 -3 4 -63 -1 9 -34 0 16 0

5 1 25 5

6 2 36 12

7 2 49 14

7L

28 -6 140 19k=1

Page 255: ɷshawns sagan integral and differential calculus

111.16. METHOD OF LEAST SQUARES 243

Here the numerals set in roman type represent the given data and areinserted in the table first; the numerals set in italics are then computed interms of the given data.

Thus, in our example, (111.59) and (111.60) become:

140a + 28b - 19 = 0

28a+8b+6=0If we multiply and combine these equations as indicated, we find that

20a =-21

and, consequently,

b=-49.12

Thus the line20 49

Jx

21 12

will fit the given data best in the sense of the least square principle (seeFig. 111.37).

The computation that was involved in solving this problem was quitesimple and straightforward. Still, if we could accomplish some furthersimplifications, it would certainly help in case a great many data are givenand/or the data involve large numbers or numbers with a great manydecimal places.

Fig. III.37

Page 256: ɷshawns sagan integral and differential calculus

244 CHAP. III. RATES

Looking at (111.59) and (111.60), we see that the solution could be found

very easily if the terms which are multiplied by -7 xk were not present.n k=1

Now, what would that entail? If I xk = 0, then that means that the x-k=1

values are arranged symmetrically with respect to the origin, if they areequidistant to begin with. If this is not the case, then it can easily bearranged simply by shifting the y-axis until the new x-values xk are such

9L

that I xk = 0. We take the arithmetic mean of all the xkk=1

x = x1 +, x2 +. x3 + ... + xn

n

and introduce the new coordinates

Then, we obtain

xk=xk - x.

n n n

xk= (xk-x)= xk - x L 11 k=1 k=1 k=1

x1+x2..1. x3+... + xn- x1-f- x2+ xa+....+xnn= 0.

n

Hence, the equations (111.59) and (111.60) written in terms of the xk appearin the form

n n`fia .1 xk2 - L,. xkyk = 0

k=1 k=1

nb - yk=0k=1

and we obtain immediately the solution

xkyk 1 n

(III.61) a=km b1 yk.2 re k=1

xk

Hence,k=1

y = ax+b

is the line that fits the data best in the sense of the least square principlein the new x, y-coordinate system and, consequently,

y=a(x-x)+b=ax +(b-ax)is the solution of our original problem.

Page 257: ɷshawns sagan integral and differential calculus

111.16. METHOD OF LEAST SQUARES 245

To illustrate the superiority of this procedure over the previous one, letus solve the same problem again by the new method. We use the followingscheme :

xk Jk xk = xk - -'Z :E7, 2

0 -4 - 7 49 28

2 4 2

1 -3 -5 25 15

2 4 2

2 -3 _ 3 9 92 4 2

3 -1 - 1 1 1

2 4 2

4 0 21

0

5 1 3 9 3

2 4 2

6 25 25 10

2 4 2

7 27 49 14

2 4 2

168 80 28 728 -6 0

4 2 8 2

Thus, in accordance with (111.61),

20a b

3

21' 4and, consequently,

20 _ 3 20 7 3 20 49y=-x --=- x-- -- = -x --.21 4 21 2 4 21 12

We wish to point out one additional advantage of this method. In com-puting the numbers in the column headed by xk2 in the previous procedure,we had to carry out 8 squaring processes. Now, because of the symmetryof the xk data, we only have to carry out 4 squaring processes in column 4and then copy these values in reverse order. (Of course this is not trueanymore if the xk are not equidistant to begin with.)

Page 258: ɷshawns sagan integral and differential calculus

246 CHAP. M. RATES

Problems 111.97-111.102

111.97. Fit a line through the points (0, 1), (1, 0), (2, 0), (3, -,), (4, -*),(5, -1), (6, -2), (7, -3) in the sense of the least square principle.

111.98. Same as in Problem IIl.97 with the points (-2, 0), (-1, 0), (0, 0),(1, 1), (2, 1), (3, 2), (4, 1.5), (5, 2), (6, 4), (7, 4), (8, 4).

111.99. Fit a parabola y = axe + bx + c through the points (0, 0), (1, 1),(2, 3), (3, 8), (4, 17) in the sense of the least square principle. Note that

l'(a, b, c) (yk - axk2 - bxk - c)2 is now a function of three variables.k=1

111.100. Fit a curve y = a cos x + b sin x through the points (0, 0.7), (ir/6, 0.6),(,r/4, 1.1), (ir/3, 0.8), (Ir/2, 0.6), (27r/3, 0.6), (3ir/4, 0.1), (5ir/6, -0.4), (jr, -0.8) inthe sense of the least square principle. (For the true values of sine and cosinefor the listed arguments see the tables in Sections 2 and 3 of Appendix III.)

111.101. Prove that the line y = ax + b which fits the points (xk, yk) (k =1, 2, 3, , n) in the sense of the least square principle passes through the origin

21

if 57.xk, = 0 and the yk are symmetrical with respect to the x-axis.k=1111.102. The population of the United States is listed as follows

year population in millions

1810 7

1840 17

1870 39

1900 76

1930 122

1960 160

Assume that the population increases quadratically and fit a parabolay = axe + bx + c through these data in the sense of the least square principle.Under this assumption, (a) find the rate of increase in 1945; (b) estimate thepopulation in 1970.

Supplementary Problems III

1.1. Given F(x) = 1/x. EvaluateF(x +

h- F(x)

at the point x = 1 for the

following values of h: h = 1, 0.5, 0.2, 0.1, 0.01, 0.001. Can you guess what thelimit will be ash 0?

1.2. Consider the difference quotient for the function y = cos x. By a skilfulmanipulation of the identity in Problem AIII.13 of Appendix 111, Section 4, it canbe shown that

Cos a - cos 9 _ -2 sin (a 2 ) Sin l a2

Let a = x + h, fl = x and simplify the difference quotient until it appears as a

product one factor of which issin (h/2)

h/2

Page 259: ɷshawns sagan integral and differential calculus

III. SUPPLEMENTARY PROBLEMS 247

1.3. Let .F(x) = IxI. Sketch the function y = F'(x).1.4. Simplify the difference quotients of the following functions to a point at

which the factors h in numerator and denominator cancel out:

(a) F(x) = x3 - x (b) F(x)

1.5. Given the functionx2 forx>0

F(x.)x3forx <0.

-xx

FindF(x + hh - F(x) and - F(x - hh - F(x) and evaluate these quotients at

x = 0 for h = 1, 0.5, 0.1, 0.01, 0.001, 0.0001. Do both tend to the same limit?

2.1. Differentiate

(a) y = (1 + x)3(1 - x)(b) y =

x3 - 641 -x2 x-4

2.2. Find the equation of the tangent line to the parabola

y=x2-3x+1at the point (4, 5).

2.3. Given the functions(2 for rational x

.fi(x) 0 for irrational x= (0 for rational x

f2(x) 2 for irrational xf3(x) = -2x for all x.

Find the derivative of F(x) = fl(x) +f2(x) +f3(X)-2.4. At which point does the curve

y =x2 - 2x + 5have a horizontal tangent line?

2.5. The general term an of the sequence al, a2, a3, is defined as follows:a is the slope of the tangent line to the parabola y = x2 + x + 4 at the pointx = 1 +n 2 Give an explicit representation of a,,. Find lim a,,. Can you

check your result by an independent method?3.1. Differentiate

xs x5x4 x3 x21

(a) F(x) = x7 + x(b) F(x) + 5 + 4 + + 2

+ x +

3.2. Givenx3+1forx<0

F(x) = 1x4 + 1 for x > 0.

Find F'(x) and F"(x) at x = 0. What about F"(x) at x = 0?3.3. Find the equations of the tangent lines to the curve y = x

points at which the slope of the tangent lines is 1.+ ' at those

x

Page 260: ɷshawns sagan integral and differential calculus

248 CHAP. III. RATES

3.4. Given the parabola y = x2 - 4x + 2. Find the equations of the tangentlines to the parabola which pass through the point (1, -1).

3.5. Given y = x3 - 3x + 4. Find those values of x for which y' = 0.Denote these values by Xk (k = 1, 2, ?) and evaluate y' at x = Xk for all k.

3.6. Given y = cos x. Find y'. (Hint: See Problem 1.2.)4.1. Differentiate:

(a) y = (4x2 - 6x + 7)1008 (b) y = 9x3 - v'x + 1

+1

(c) y1

= (d)y=x \/x+3x x2+44.2. Differentiate:

(a) y = cos (x3) (b) y = cos4 x

(c) y = sin3 (xs) (d) y = Vsin3 x

4.3. Show thatdx(Pla) p P-= 1

- X17dx q

where p and q are positive or negative integers. (Hint: Consider yQ = xP

instead of y = x"N. Thend a

= dy4 dx = qy7-1 dx . On the other hand,

W

= px7-1. Since yq = x1, we have dx = dxn , i.e., gyq-1dy

= pxT1.)

5.1. Find the antiderivative of

(a) y' = 3x2 - 4x + 5 (b) y' = 4x3 + 12x2

(c) y' = -12x4 + 23x + 7 (d) y' =12 1- - sx x

5.2. Find

f f (1

(a) J Tx (b) J\x2 - x2 I dx

(c) 5x4 dx (d) 5(1 + x2)(1 - x2) dx

5.3. Utilizing the antidifferentiation process, evaluate the following definiteintegrals .

r1

r(1 -X2 ) dx(a) I (b)

Jcos x dx

J 1 0

1000.001

(c) f (4x + 1) dx (d) sin x dxf1000 ir

x g

x2 dx = 3 + C.a

Page 261: ɷshawns sagan integral and differential calculus

III. SUPPLEMENTARY PROBLEMS 249

Show that for any value C (between - co and co) there exists a value a whichsatisfies this relation. Compare this result with that of Problem 111.41. Try toexplain the difference between the two problems.

r35.5. Is

J(Vx + 3) dx meaningful? Explain.

i6.1. Show that

Jf (u)u (x) dx = F(u(x)) + C

if dF = f(u) -

6.2. Integrate

(a) J (1 + /3 + 47) dx f (1 xs )(b) J ` -F x4 - 1 dx

1-sin x dx5(3cc

(c) (d) - x2 Vx3 + 4) dx

6.3. If y = f (u), u = g(v), v = h(x), then dy dy du dvdx =

cfu TV TXShow that

Jf'(u)u'(v)v'(x) dx = f(u(v(x))) + C.

6.4. Evaluate :

(a) f 7T/2sin x cos x dxJIT/4 sin x dx

.l 0 (b) cost x

(c) ro3 d + x (d) jo x v'x2 _+9 dx

7.1. Given y = xM. Find an approximation to y' at x = 4.01. (Find thederivative y' of x%% by the limit process.)

7.2. Find approximations for

(a) (2.98)4. (b) V'120.87

(c) (1.0001)' (d) V63.9917.3. Height and base of an isosceles triangle both increase from a length of 2 ft

to a length of 2.023 ft. Find an approximation for the increased area.7.4. The area of a square decreases from 4m2 to 3.976m2. Find an approxima-

tion for the length of the perimeter of the shrunken square, assuming that all sidesshrink at the same rate.

7.5. Find an approximation for the increase of the function y = x2 betweenx = 0.5 and x = 0.52 and between x = 4 and x = 4.02. Find also the exactincrease and express the errors in both cases in per cent of the exact increase.

8.1. Find an approximate solution of

x3 + 7x2 - 5x - 35 = 0

Page 262: ɷshawns sagan integral and differential calculus

250 CHAP. 111. RATES

by Newton's method. Start with xo = -6 and take three steps. (See alsoProblem 111.55.)

8.2. Specialize Formula (111.23) for the function f (x) = x2 - a.8.3. Apply the formula found in Problem 8.2 to a = 5; start with xo = 2,

and take three steps. You thus obtain an approximation for what?8.4. Find approximate solutions. of

x2-3x+I =0by Newton's method. Start with xo = z and with xo = 2. Take three steps ineach case. Solve the equation by the quadratic formula, isolate the approxima-tions for the square root involved, and compare the two approximations witheach other.

8.5. Find an approximation for the solution x of the equation

udu -3=0by Newton's method.

9.1. Find all extreme values of the functions

(a) y = x3 + 3x2 - 105x + 1 (b) y = 4x3 + 6x2 - 9x -49.2. A rectangle with the largest possible area is to be inscribed in a circle of

radius 1. What are its dimensions?9.3. Find all extreme values of the function

y = sin x + cos x

and sketch the function. (For differentiation of sine and cosine, see Problem111.29 and supplementary Problem 3.6.)

9.4. Given the function

x2

- 2x +2forx> 1f(x) x3 - 3x2 + 3x for x < 1.

Find the minimum value of this function.9.5. Sketch the function

1 .y = -sinx.x

(Hint: The extreme values of sin x are I and -1. Wherever sin x = 1, we havey = 1 /x; and wherever sin x = -1, we have y = - I /x.)

10.1. An object slides down an inclined plane, starting from rest at the top,with an acceleration of 12 ft/sect. The length of the inclined plane is 18 ft. Whendoes it hit the bottom?

10.2. An object moves back and forth along a straight line according to theformula

s = 3 cos t + sin t

(Harmonic Motion). For what values of t does it pass through the point s = 0 ?How far does it go beyond the point s = 0?

10.3. A car starts from rest with an acceleration of 20 ft/sect. As soon as itreaches a velocity of 60 mi/h it will travel with this velocity for 3 min and then

Page 263: ɷshawns sagan integral and differential calculus

III. SUPPLEMENTARY PROBLEMS 251

slow down to a velocity of 45 mi/h within minute. It will then travel at thevelocity of 45 mi/h for 5 minutes and then slow down to a stop within 1 minute.(a) Find a = a(t). (b) Find v = v(t). (c) Find s = s(t), assuming that the carstarts from s = 0.

10.4. How far did the car in Problem 10.3 travel?10.5. Sketch the functions a = a(t), v = v(t) and s = s(t) from Problem 10.3.11.1. A stone is thrown vertically upward from a window 24 ft above the

ground with an initial velocity of 16 ft/sec. (a) How high did it go? (b) Whendid it pass the window on its way down? (c) When did it hit the ground?(d) What was the impact velocity?

11.2. What initial velocity do we have to impart to an object if we want tothrow it 82 ft vertically upward?

11.3. The weight of an object on the surface of a heavenly body is defined asthe force of attraction between the heavenly body and the object. Suppose, anobject weighs 1 lb on the surface of the earth. How much does it weigh on thesurface of the moon? (Mass of moon so of mass of earth, radius of moon2200 mi.)

11.4. Find the acceleration of a freely falling body in a reasonable neighbor-hood of the surface of the moon. (For dimensions of moon, see Problem 11.3.)

11.5. A projectile is to be shot from the moon to the earth. Find the necessaryinitial velocity to shoot it beyond the neutral point. (See also Problem 111.82and Supplementary Problem 11.4.)

12.1. A light ray is emitted from the point P(0, 1 in.) and proceeds within thex, y-plane through air until it hits a water surface y = 0 at an angle of 60° withthe perpendicular. It then proceeds through water until it hits the surface of alayer of glycerin at y = -1 in. If the layer of glycerin is 1 inch thick, at what pointdoes the light ray leave the glycerin layer? (Velocity of light in air = 186,000mi/sec, in water = 139,000 mi/sec, in glycerin = 126,000 mi/sec.)

12.2. If a light ray penetrates the boundary of a medium, part of it is refractedinto the new medium and part of it is reflected such that the angle of the reflectedray with the perpendicular is equal to the angle of the incoming ray with theperpendicular. Consider that part of the light ray that is reflected on the lowerboundary of the glycerin layer in Problem 12.1. At what point does it emergefrom the water layer into the air?

13.1. Consider the cycloid in Fig. 111.26 (p. 228). After the circle of radius Rrolled from its initial position through a distance d, the line joining P with thecenter of the circle will have turned through an angle of a = dIR radians.Express the coordinates x and y of the point P for any position of the generatingcircle as functions of the angle «.

13.2. Same as in Problem 13.1 for the case where the point P does not lie onthe periphery of the circle but at a distance a < R from the center of the circle.Sketch the curve.

13.3. Same as in Problem 13.2 for the case a > R. Make a sketch.14.1. Find a in y = ax so that the expression (11.49) becomes a minimum for

the data in Problem 111.88 (p. 237).14.2. Given the data (0, 2), (3, 3), (6, 5), (9, 5). Find the equation of the line(s)

for which dl + d2 + d3 + d4 = 0. (For definition of the dg's see p. 235).

Page 264: ɷshawns sagan integral and differential calculus

252 CHAP. M. RATES

15.1. At what point(s) could the function of three variables

w = x2 + y2 + 10z2

possibly have an extreme value? (See also problem 111.95.)16.1. Find a line which fits the points (0, -4), (1, -2), (2, 1), (3, 2), (4, 5)

according to the least square principle.16.2. Same as in Problem 1I1.102 (p. 246) with an assumed population of 170

millions in 1960. Compare the results with those of Problem 111.102.

Page 265: ɷshawns sagan integral and differential calculus

CHAPTER IV

VOLUMES

1. VOLUME OF A PRISM

We consider a right parallelepiped. A right parallelepiped is a box (seeFig. IV. 1). Suppose the dimensions of the box are a, b, and c as indicatedin Fig. IV. 1. Our aim is to establish a measure of the volume of this solid,or in simple words : How much pink paint does the box hold? Again, weproceed in a way very similar to the one which we took in establishing thearea measure of a plane rectangle.

Clearly, among many boxes with the same base (the shaded rectangle inFig. IV.1 is called the base), i.e., with the same dimensions a and b, the onewith the largest height c will hold the greatest amount of paint, i.e., has tobe assigned the largest volume measure. On the other hand, if we havemany boxes with the same height c but different bases (with different areameasures), then it is clear that the one with the largest base will have thegreatest volume. This leads us to define the volume of a rectangular box(right parallelepiped) as jointly proportional to the area of its base and thelength of its height.

Since

we are led to the definitionArea of base = ab,

(IV. 1) Volume of right parallelepiped = abc

if we agree to choose a box of the dimensions a = 1, b = 1, c = 1 as theunit of volume measurement, thereby making the proportionality constantequal to one. (If the sides are measured in inches, then the volume isaccordingly measured in cubic inches-cu in.; if the sides are measuredin feet, then the volume is measured in cubic feet-cu ft; and if the sidesare measured in meters, then the volume is measured in cubic meters-m3.)

We can see easily that the volume measure of a rectangular box, asdefined in (IV.1), has analogous properties to the area measure, namely:it is always positive unless at least one of the sides has the length zero-in

253

Page 266: ɷshawns sagan integral and differential calculus

254 CHAP. IV. VOLUMES

Fig. IV.1

which case we don't have a box at all; the volume measure is additive, i.e.,if we fit two boxes together to make one box, then the volume of the newlycreated box equals the sum of the volumes of the two components. Fromthis property we can again deduce that whenever one box is contained inanother box, the volume measure of the submerged box is smaller than thevolume measure of the containing box.

We proceed now to a generalization of this concept to a solid, as depictedin Fig. IV.2. The base of this solid is a planar region encompassed by apolygon and the lateral surface consists of a number of rectangles perpen-dicular to the base which are all of the same height h. Such a solid is calleda prism.

Before we attempt to deal with this general case, let us first consider thecase where the base is a right triangle. Such a prism can be obtained from

Fig. IV.2

Page 267: ɷshawns sagan integral and differential calculus

IV.2. VOLUME OF CYLINDER 255

Fig. IV.3

a right parallelepiped simply by cutting the latter in two, as indicated inFig. IV.3. Hence, we obtain for its volume

V = 2abh.

Now, to obtain a prism which has a general triangle as its base, we gothrough essentially the same argument as in Chapter II, Section 2, p. 79and obtain the following formula for the triangular prism:

Volume of triangular prism = Jabh

where a is one side and b the corresponding height in the base triangle.Dissecting the base of a general prism into triangles, we are led to the

following formula for the volume measure of a prism:

Volume of prism = Area of base x height,

where the area of the base can be computed in terms of areas of triangles.

Problems IV.1-IV.4

IV.1. The base of a prism of height 12 is a right triangle with the sides 3 and 7.Find the volume.

IV.2. The base of a prism of height 10 is a regular octagon which is inscribedinto a circle of radius 6. Find the volume.

IV.3. The base of a prism of height h is a regular hexagon which is inscribedinto a circle of radius r. Express r as a function of the volume V.

IV.4. The base of a prism of height h is a regular polygon with n verticesinscribed into a circle of radius r. Find the volume.

2. VOLUME OF A RIGHT CIRCULAR CYLINDER

What is commonly referred to as a cylinder is what mathematicians moreprecisely call a right circular cylinder (see Fig. IV.4). Top and bottom arecircles of radius r which lie in parallel planes and are joined together by a

Page 268: ɷshawns sagan integral and differential calculus

256 CHAP. IV. VOLUMES

Fig. IV.4

sheet of height h which is perpendicular to the planes in which top andbottom lie.

How do we define the volume of such a right circular cylinder in order tobe consistent with the volume definitions which we have given so far?Now the volume is to be greater than the volume of any solid which liesentirely within the cylinder. If we inscribe a regular polygon into thecircular base, we obtain a prism which is contained in the cylinder (seeFig. IV.5). The volume of this prism, according to the formula which wedeveloped in the preceding section, is given by

V(prism) = hB,,

where Bn represents the area of the regular polygon with n vertices inscribed

Fig. 1V.5

Page 269: ɷshawns sagan integral and differential calculus

IV.2. VOLUME OF CYLINDER 257

Cut

Fig. IV.6

in a circle of radius r. As we let the number of vertices increase, the prismwill fit with increasing snugness into the cylinder, and we are led to thedefinition

Since

and

lim hBn = h lim Bnn-co n- co

llm B,, = r7r,n-00

as we have seen in Chapter II, Section 5, we are led to the definition

(IV.2) Volume of right circular cylinder = r27rh.

Before we close this section, let us settle the question as to the lateralsurface area of a cylinder. We cut through the lateral surface along a lineperpendicular to the base and unroll it (see Fig. IV.6). We then obtain arectangular sheet of dimensions h and 2rir, since 2rIr is the circumferenceof the circular base. Hence,

Lateral surface area of cylinder = 2rirh.

Problems IV.5-IV.7

IV.5. Given a right circular cylinder of height h = 4 and a base of radiusr = 2. (a) Find the volume of the cylinder. (b) Find the lateral surface area ofthe cylinder. (c) Find the total surface area of the cylinder.

IV.6. Given a cylinder of height h and a base of radius r = 2. (a) Express thevolume Vas a function f (h) of the height. (b) Express the volume V as a functionof the lateral surface area S: V = F(S). What is F(S)?

IV.7. Given a prism of height 1, the base of which is a regular polygon of 106vertices inscribed into a circle of radius r = 2. Find an approximation to thevolume.

Volume of right circular cylinder = lim hB,,.n-CO

Page 270: ɷshawns sagan integral and differential calculus

258 CHAP. IV. VOLUMES

3. VOLUME OF A RIGHT CIRCULAR CONE

We consider a right circular cone as represented in Fig. IV.7. The baseis a circle of radius r. A point V, the vertex of the cone, is h units verticallyabove the center of the circle. Now, take a line from V to a point P on thecircumference of the circular base and let P wander around the full circum-ference. Then the line VP will generate the lateral surface of a right circularcone of base radius r and height h.

In order to determine the volume of the cone we have to reduce thisproblem to the volume determination of solids with which we have alreadydealt. We chop the cone up by a number of planes which are parallel tothe base (see Fig. IV.8) and try to determine the volume of each slice. Eachslice has a shape as indicated in Fig. IV.9. Such a thing is called a conicalfrustum (frustrated cone) and its volume determination is at least as frus-trating as the volume determination of the entire cone itself. Therefore,we assume that the slices are very thin and approximate each slice by acylindrical disk (see Fig. IV. 10) with the same base.

If we put these cylindrical disks together again, we obtain what is indi-cated in Fig. IV. 11, which is by no means a cone, no matter how thin we

Fig. IV.8

Fig. IV.9 Fig. IV.10

Page 271: ɷshawns sagan integral and differential calculus

IV.3. VOLUME OF CONE 259

(k + 1)thdisk

h

kh

Fr------- 3--Fig. IV.11 Fig. IV.12

h

make the slices to begin with. However, we can see that, if we let the heightof each cylindrical slice go to zero and at the same time increase the numberof slices to infinity, then the solids of the type illustrated in Fig. IV. 11 willapproximate the right circular cone from the outside with increasingaccuracy. We are thus led to define the volume of the cone as the limit ofthe volume of that pile of cylindrical slices as their number increases beyondbound and their individual thicknesses tend to zero.

To carry out this limit process in detail, we refer to the cross section inFig. IV.12. The base radius Xk of the (k + 1)th cylindrical disk is obtainedfrom Xk _ r

h kh h

n

(note that BLV and B'L'V are similar triangles). Hence,

xk=Y h - kh).h n,

Thus the volume of the (k + 1)th cylindrical disx is given by

Vk= r2 2 h-kli2

rh.h( n / r2

If we sum up all n disks, we obtain

V(cone)r27r I1 (h - khl2

r2 r1(h2

2h2k

+

h2k2

nh k=o n nh k=o n n2r2i n-1 2h2 n-1 h2 n-1--(h211--Ik+-1k2nh k=o n k=0 n2k=0

Page 272: ɷshawns sagan integral and differential calculus

260 CHAP. IV. VOLUMES

We have seen in Chapter II, Section 12, that

n1 l1 = n, ik = n(n - 1)* ik2 = (n - 1)n(2n - 1)*57

k=0 k=0 2 k=0 6

Using these results,

V(cone) -r27r(nh2 - 20 n(n - 1) h2 .(n - 1)n(2n - 1)+ 2 '

Since

nh n 2 n 0

= r27rh 1 - n(n - 1) + n(n - 1)(2n - 1))( n2 6n3

r2TTh (1 - n2 - n + 2n3 - 3,t2 + 12)

n2 6n3

=r27rh(1-1+n+32n+6n2)

lim 1 = 0 and lim 1 = 0,n-'o 11 n-'ao it2

we obtain for the volume of the right circular cone2

(IV.2) Vcone = liym r27rh (n +3 2n + 612) r 3 h

Problems IV.8-IV.12

IV.8. Given a right circular cone of height 3 and base radius 1. (a) Find itsvolume. (b) Approximate the volume by taking 6 inscribed cylindrical disks and6 circumscribed cylindrical disks. What is the error?

IV.9. Find a formula for the area of the lateral surface of the right circularcone of height h and base radius r. (Hint: Cut the surface along a line from V toa point on the circumference of the base and unroll it.)

IV.10. Given a right circular cone of base radius 2 and height It. Express thevolume V as a function of the lateral surface S.

IV.11. What is the total surface area of the cone in Problem IV.8 ?IV.12. A cone has the volume V = 121T. Choose the dimensions r and h so

that the lateral surface area is a minimum.

4. THE LAW OF THE LEVER"Give me a place to stand and I willmove the Earth." (ARCHIMEDES)

Let us consider a teeter-totter, or, if you prefer a more sophisticatedterm, a lever. The point of support of the lever, F, is called the fulcrum(see Fig. IV. 13), and the parts sticking out on both sides of the fulcrum are

* Note that it is immaterial here whether we extend these sums from k = I to n - 1or from k=0ton-1.

Page 273: ɷshawns sagan integral and differential calculus

IV.4. LAW OF THE LEVER

f1

d1

Fd2

Fig. IV.13

261

called the lever arms. We will now discuss the question as to how to keepa lever in balance (equilibrium). Small children usually have a vaguenotion of the answer. If a child plays teeter-totter with its little sister orbrother, it will find out pretty soon that for the best effect it has to sit closerto the fulcrum than a lighter child.

Suppose the latter is capable of exerting a downward directed forcethrough its weight of magnitudef1 and is placed at a distance d1 from thefulcrum (see Fig. IV. 13). Now, the problem for the older and presumablybigger kin who is capable of exerting a downward directed force f2 throughits weight, is where to sit? In other words: At what distance d2 from thefulcrum is it supposed to exert the force f2 in order to keep the lever inbalance.

Archimedes (287-212 B.C.) discovered the complete answer to this prob-lem in the form of the law of the lever which states: The lever is in balanceif, and only if,

(IV.3) ff1d1 = f2d2,

i.e., force times length of lever arm must yield the same product on bothsides. Archimedes immediately recognized the far reaching consequencesof his discovery as evidenced by his statement which we quoted above. Ina less bombastic tone, we could say that a small child is capable of liftinga concert grand piano, provided it uses a lever and places the fulcrumstrategically, i.e., close to the piano, while exerting its little force at a suffi-ciently large distance from the fulcrum on the other lever arm.

It is more practical for most applications to state the law of the lever interms of masses rather than forces. From (111.35) on p. 215 we have

f = mawhere f stands for force, m for mass, and a for acceleration. A mass underthe influence of gravity is accelerated by the amount a - 32 ft/sec-sec.Hence, if the force f1 is exerted by the weight of a mass m1 and the force f2is exerted by the weight of a mass m2, then

f.{'i = m1a

J 2 = m2a.

Page 274: ɷshawns sagan integral and differential calculus

262 CHAP. IV. VOLUMES

M d

FA ml m2 m3 m4 Mn

Fig. IV.14

Substitution of these quantities into (IV.3) yields

m1ad1 = m2ad2.

We can divide by a and obtain the law of the lever in terms of masses:

(IV.4) m1d1 = m2d2,

i.e., a mass m1 at distance d1 from the fulcrum balances a mass m2 at dis-tance d2 from the fulcrum (on the other lever arm) if, and only if, theproducts of the respective masses with their distance from the fulcrum areequal.

Suppose that we wish to balance a mass m1 = M with the unit massm2 = 1. Then, according to (IV.4),

Md1 = d2,

i.e., the unit mass has to be placed at a distance from the fulcrum which isM times the distance of the mass M from the fulcrum.

To facilitate further references, let us introduce the concept of moment.The product of a mass times its distance from a given point is called themoment of this mass with respect to the given point. Using this new term,we can state the law of the lever as follows: The lever is in balance if, andonly if, the moments on both sides are equal.

Now, suppose that we have more than one mass, namely, masses ofmagnitudes m1, m2, , m,, at distances d1, d2, , d,, from the fulcrumon one lever arm and a mass M at a distance d on the other lever arm (seeFig. IV. 14). Then the moment,uk of the mass Mk with respect to the fulcrumis given by

ak=mkdk(k= 1,2,3,...,n)and the total moment of all masses on the right side of F is given by

(IV.5) /fi1 =P1+P2+Y3+... +JUn

= m1d1 + m2d2 + m3d3 + ... + m,d,a = mkdk.k=1

The moment of the mass M on the left side of the fulcrum is given by

(IV.6) ,ui = Md.

To keep the lever in balance, we have to have ,u,, = ,ul, i.e.,n

(IV.7) Md = I mkdk.k=1

Page 275: ɷshawns sagan integral and differential calculus

IV.4. LAW OF THE LEVER 263

Suppose that

Mynk,k=1

i.e., the mass M on the left equals the sum of all masses on the right, thenit follows from (IV.7) that

n n

d I ink = I mkdkk=1 k=1

and, consequently,

(IV.8)

n

' 17'kdkd = k=1n

G inkk=1

This formula can be interpreted as follows: If we replace the massesm1, m2i m3, , mn at distances d1, d2, d3, , do by a single mass M whichis equal in magnitude to the sum of all masses mk, and if we place thismass Mat a distance d from the fulcrum as given by (IV.8), then it will exertthe same moment with respect to the fulcrum as the initial mass distributiondid.

This distance d has another important significance. Suppose we wish torelocate the fulcrum so that the given mass distribution m1, m2i m3, , m n

balances the lever with respect to the new fulcrum. We shift the fulcrumF through a distance d' into the position F' (see Fig. IV. 15). Then weobtain for the moments ,uk with respect to the new fulcrum F'

y1, = m1(dl - d'),u2' = m2(d2 - d')

/zn' = mn(dn - d').

If dk - d' < 0, then Mk is on the left side of the new fulcrum F' and ifdk - d' > 0, then Mk is on the right side of the new fulcrum F'. Now wewish to determine d' so that the total moment on the left of F' is the sameas the total moment on the right of F'. This is equivalent to determiningd' such that the algebraic sum of all the moments /k' is zero.

F -<-d'- F'ml m2 m3 m4 Mn

Fig. IV.15

Page 276: ɷshawns sagan integral and differential calculus

264 CHAP. IV. VOLUMES

F m m m m m m m

0 1 2 3 4 5 n-1 n

Fig. IV.16

This leads to the condition

n n n nIN' = I lnk(dk - d') = I mkdk - d' J mk = 0k=1 k=1 k=1 k=1

and we obtain

d' _

k=1

i.e., d' = d as given in (IV.8). This property attributes a new significanceto the quantity d. The point at distance d from the fulcrum F is the pointabout which the given mass distribution is in equilibrium. We call thispoint the mass center or centroid: The mass center is the point about whicha (linear) distribution of masses is in equilibrium.

Suppose we have n masses of equal magnitude placed at equal distances.Then we can represent these locations as points on the line of numbers asindicated in Fig. IV. 16 and obtain for the mass center according to (IV.8)

n nIkln mJk

d = k=1 = k=1 = m(n + 1)n = 1 (n + 1)n 71

57 m m'F 1nm 2

k=1 k=1

which is also the geometric center of the points 1, 2, 3, , n.

Problems IV.13-IV.19

IV.13. A mass weighing 1000 lb is placed at a distance 2 ft from the fulcrum.(a) At what distance do we have to place a mass weighing 3 lb at the other leverarm to keep the lever in balance? (b) What is the weight of a mass which isplaced at a distance 16 ft from the fulcrum on the other lever arm, if it is to keepthe lever in balance?

IV.14. In a decimal scale, an object is kept in balance by a weight which isonly one-tenth of the weight of the object. What is the ratio of the lengths of thelever arms?

IV.15. The masses 2, 4, 3, 5, 9, 4, 6, 1, 9 at distances 1, 2, 5, 4, 5, 7, 125, 9 areto be replaced by a mass of magnitude equal to the sum of the given masses. Atwhat distance is this mass to be placed from the fulcrum to exert the samemoment?

IV.16. Given the mass distribution of Problem IV.15. The masses are to bereplaced by a mass of magnitude which is one-half the sum of the magnitude ofthe given masses. (a) Make an intelligent guess as to where to place this new

Page 277: ɷshawns sagan integral and differential calculus

IV.5. MASS CENTER 265

mass in order to exert the same moment. (b) Compute the distance at which thismass has to be placed in order to exert the same moment. (c) Explain why yourguess was wrong-if it was wrong.

IV.17. Find the mass center of the masses 3, 3, 5, 4, 4, 7, 5, 4, 5, 7, 7, 5 which are1, 1, 3, 3, 4, 4, 4, 5, 3, 2, 1 units apart.

IV.18. Use geometric reasoning to find the mass center of a rectangle, i.e., thepoint about which the rectangle is in balance if supported at this point by the tipof a pencil.

IV.19. Same as in Problem IV. 18 for a triangle.

5. MASS CENTER OF REGIONS AND SOLIDS

Suppose we have a rectangular region that is covered uniformly withmass of constant mass density. (If we cut out any portion of the region,determine its mass, divide it by its area, and obtain the same value for theratio no matter which region we may have selected, we say that the regionis of constant mass density.)

It is quite clear that the mass center of such a rectangular region coincideswith its geometric center. This can be seen as follows. If we cut out such arectangular region, it will balance on the edge of a ruler, if the ruler supportsit along a line which is parallel to one pair of sides and cuts the rectangleinto two equal portions. There are two such lines and they intersect in thegeometric center of the rectangle. Hence, we should be able to balance arectangle at the tip of a pencil, if the rectangle is supported at its geometriccenter.

We are going to make use of this intuitive result to establish formulasthat will allow us to find the mass center of regions which are not every-where bounded by straight lines.

Suppose we have a region of constant mass density p (read: rho) whichis bounded by the x-axis, x = a, x = b and the curve y = f (x) (see Fig.IV.17). We will approximate this region by n inscribed rectangles as wedid when determining the area of such regions (Chapter II, Sections 8 and9). We remind the reader that the inscribed rectangles are obtained bydivision of the interval a S x < b into n subintervals by division pointsx1, x2i x3, , x,1, which we assume for reasons of convenience to beequidistant:

xk -xk_1=Ax for all k= 1, 2, - , n.

We will also assume, as usual, that xo = a and x,, = b in order to have aunified notation. The height of each one of the inscribed rectangles is theminimum value of the function in the corresponding subinterval. Wedenote this minimum value by f ( k) :

f min f (x) in xk_1 < x < xk.

Page 278: ɷshawns sagan integral and differential calculus

266 CHAP. IV. VOLUMES

Fig. IV.17

It follows that the area AAk of the kth rectangle is

AAk = f

and consequently, we obtain for its mass Amk

Amk = Pf (ek)Ax.

Let us assume that the entire mass of this rectangle is concentrated at itsmass center, i.e., at its geometric center, the coordinates of which are givenby

(IV.9) xk = xk 2xk-1 yk -f (k)

If Ax is sufficiently small, which we assume to be the case, we can say thatxk "almost" coincides with k. It seems intuitively clear that the error whichwe make with this assumption disappears if we let Ax approach zero,which we will do in the end, anyway. Hence, we obtain for the momentAMk,,, of the point (4, yk) of mass Amk with respect to the y-axis*

(IV.10) AMk,. = Amk k = Pf ( k) kAx.

Thus the total moment of the n rectangles with respect to the y-axis is thesum of all moments in (IV.10) and this is

n) = P I Ax.ML

fin` t tf(Sk)Sk

k=1

* This is to be understood as "with respect to the intersection point of the y-axiswith the horizontal line through the point (fix, ?k)"

Page 279: ɷshawns sagan integral and differential calculus

IV.5. MASS CENTER

We observe that

(IV. 11) min [xf(x)] in xk_1 < x < Xk.

267

So if we take an undersum Sn (see Chapter II, Section 9) of the functionF(x) = xf(x) with the same subdivision as we have used for determinationof the moment and denote the minimum of F(x) in xk_1 < x < xk by

we havemin F(x) = min [xf (x)] = Skf in xk_1 < X < xk

n n

P$n = P I Ox = p j kf Ox < p j Ox = Mn)k=1 k=1 )L=1

If Sn denotes the corresponding upper sum of the function F(x) with thesame subdivision, then we can see likewise that

SinceA MY(n)

blira Sn = Jim Sn = f F(x) dx = f xf (x) dxn-00 a Ja

(see Chapter II, Section 8) we have, in view of

p$ < My(n) < pS,,that

(IV.12) M,, = p f bxf(x) dx,a

where M denotes the moment of the region under consideration withrespect to the y-axis.

We defined in the preceding section the coordinates of the mass centerfor a concrete mass distribution by dividing the moment by the total mass.We, therefore, define in generalization of this idea the x-coordinate of themass center of our region by

p f b bxf(x) dx rxf(x) dx(IV.13) Jab =Jab

p f f(x) dx f f(x) dxa a

rb(Note that the total mass of the region is its area I f(x) dx times the mass

adensity p, and that the factor p which appears in numerator and denomi-nator cancels out in case of a region of constant mass density.)

Page 280: ɷshawns sagan integral and differential calculus

268 CHAP. IV. VOLUMES

We proceed similarly to establish a formula for the y-coordinate of themass center of the given region. We note that the y-coordinate of the masscenter of the kth inscribed rectangle is given by

yk = z f (k)

[see (IV.9)]. Hence, if we think of the entire mass of the kth rectangle tobe concentrated at this point, the moment of this rectangle with respect tothe x-axis * is

Mk, = p2f (Ek)t x

and, consequently, we are led by an analogous line of reasoning to thefollowing formula for the moment Mx of the given region with respect tothe x-axis :

Mx = f f 2(x) dx.a

Therefore the y-coordinate of the mass center is to be defined as

2 fof2(x)dx

(IV.14) y =v

f f(x) dxx

noting again, that p cancels out.Let us now apply formulas (IV.13) and (IV.14) to the problem of finding

the mass center of an isosceles triangle of constant mass density p (see Fig.IV.18). Clearly, y = 0 because the triangle will balance on the x-axis, i.e.,the mass center has to lie on the x-axis.

The mass of the whole triangle is obviously given by

m = prh.

In order to find the moment of the triangle with respect to the y-axis, wehave to take into account that we have as much of the triangle above thex-axis as we have below the x-axis (just as in the case of a fat woman who istrying to hide behind a bamboo rod). This requires the introduction of the

factor 2 in formula (IV.12). We note further that f(x) = rx in our case.Hence, we obtain

Jo

it 2rh3 2rh2-p3h

=p30

We remind the reader that the vertical bar in the above line indicatesthat the indefinite integral of x2 is to be taken between the limits 0 and h.(See p. 182.)

* See footnote on p. 266.

Page 281: ɷshawns sagan integral and differential calculus

IV.5. MASS CENTER

Y

Thus we have because of

that

Fig. IV.18

p2rh2 2h

3 prh 3,

269

i.e., the mass center lies at I of the height from the vertex.The idea which we pursued in this section can easily be generalized to the

determination of the mass center of solids, and is particularly simple incase of a right circular cylinder of constant mass density (mass per unitvolume is constant).

Intuitively, it is clear that the mass center of a right circular cylinder ofconstant mass density lies at its geometric center, i.e., the point on the axiswhich is halfway between top and bottom. Nevertheless, let us go throughthe motions of a limit of a sum process in order to indicate how this ideacan be generalized later on to more complicated solids.

We consider a right circular cylinder of height h and base radius r andchoose our coordinate system as indicated in Fig. IV.19. We slice the

cylinder into n cylindrical disks of heighth

= Ax and assume that n isn

very large, i.e., Ox is very small. If p is the mass density of the cylinder,then the mass of any one of the slices is given by

Mk = pr2Tr/x.

Page 282: ɷshawns sagan integral and differential calculus

270 CHAP. IV. VOLUMES

Fig. IV.19

If xk is the distance of the bottom of the (k + 1)th slice from the originand Ox is very small, we can assume that the distance of the mass center ofthe (k + 1)th slice from the origin is also Xk, except for an error which isnegligible in the light of the later limit process with Ax -- 0. Thus we havefor the moment of the (k + 1)th slice with respect to the origin

Mk+l = pr27TxkAx.

If we now add all the moments of all these cylindrical disks and take thelimit as n -> co and simultaneously Ax -+ 0, we obtain

n-1 b 2

M = urn I pr27rxk Ax = pr'-7r f x dx* = pr27rn-+co k=0 a 2

Since the volume V of the cylinder is given by

V = r27rh

(see Section 2) it follows that its mass is

m = pr27rh

and, hence, the ratio of moment to mass becomes

pr2irh2 _ hx-2r27rhp 2

kpr27Th2

0 2

as we anticipated.

* Note that xk = min x in x4, G x < xk+1

Page 283: ɷshawns sagan integral and differential calculus

IV.6. ARCHIMEDEAN SCALE 271

Problems IV.20-IV.24

IV.20. Find the mass center of a triangle of constant mass density which is

bounded by x= 0, =O ,and a

IV.21. Find the mass center of a region of constant mass density which isbounded by y = 0, x = 0, x = l' y = x2.

IV.22. Same as in Problem IV.21 for the region bounded by x = 0, y = x2,y=1.

IV.23. Find the mass center of a right circular cone of height h and base radiusr. (Hint: Let the vertex of the cone coincide with the origin and let the x-axiscontain the axis of the cone.)

IV.24. Find the mass center of the solid of constant mass density which isgenerated by rotating the area bounded by y = x2 and the line y = I about they-axis. (Such a solid is called a paraboloid of revolution.) Hint: Slice theparaboloid into horizontal cylindrical disks and apply limit of a sum process.

6. THE ARCHIMEDEAN SCALE

We will demonstrate in this section the ingeneous process by whichArchimedes (287-212 B.c.) arrived at the formula for the volume of a sphere.We will need for this purpose the formulas for the volume of the rightcircular cylinder and the right circular cone, which we derived in Sections2 and 3 and which we reiterate:

V(cylinder) = b2hlrb2h,7r

V(cone) =3

In both cases b stands for the radius of the base and h for the height.Archimedes made the great discovery that a cylinder balances a sphere

and a cone on a lever. More specifically, he found that if we place a rightcircular cylinder of base radius 2r and height 2r on one side of the leversuch that the axis of the cylinder coincides with the lever arm and the cyl-inder terminates at the fulcrum, and suspend a sphere of radius r and acone of base radius 2r and height 2r at a point 2r units from the fulcrum onthe other lever arm, then the lever is in equilibrium, provided all threesolids are made of the same homogeneous material of constant massdensity (which we can assume to be I without loss of generality). (See Fig.I V.20.)

We will now give a "proof" of this very astonishing fact before we pro-ceed to investigate its consequences for the volume determination of thesphere in terms of the volumes of cone and cylinder. The "proof" which is

Page 284: ɷshawns sagan integral and differential calculus

272 CHAP. IV. VOLUMES

Fig. IV.20

supplied here is basically the one which was given by Archimedes himself,except for the fact that we use the convenience of our modern notation. *

In the following we refer to the schematic sketch in Fig. IV.21. We placethe cylinder on the left side of the fulcrum and draw the contours of coneand sphere on the right side with the understanding that the force exertedon the lever arm by cone and sphere is to act upon the point P, 2r units tothe right of the fulcrum. We will prove our proposition by demonstratingin the true spirit of Archimedes' integration method that any thin cylindricalslice of the cylinder at distance x from the fulcrum on the left will balancecorresponding slices of the same thickness cut out from cone and sphereat a distance x to the right from the fulcrum and suspended at the point P.

We obtain for the mass of the cylindrical disk (slice) on the left

Am (cylinder) = 4r2zrAx

where Ax is the thickness of the slice, since the mass density is assumed tobe 1, which has mass = volume as a consequence. Therefore, the momentof the cylindrical disk placed at the distance x from the fulcrum is given by

(IV. 15) AM (cylinder) = 4r2TrLx X.

The radius of a circular cross section of the cone at x is x, since the anglebetween any generating line of this cone with its axis is 45°. Hence, if we

* We put proof in quotes because, as the reader will see, this "proof" is anything butrigorous. Archimedes, who ordinarily gave rigorous mathmatical proofs for all hisstatements, was well aware of the heuristic nature of this method, which we presentin the following, and pointed this out quite emphatically.

Page 285: ɷshawns sagan integral and differential calculus

IV.6. ARCHIMEDEAN SCALE 273

approximate the volume of the disk which is cut out from the cone andwhich really is a conical frustum by a cylindrical disk of equal radius andheight (which is a good approximation if Ax is very small), we obtain

Am (cone) -- x27rix

and, consequently, we obtain for the moment, if this mass is placed at P,

(IV.16) OM (cone)- x27rLx2r.

Finally, we have to consider a slice from the sphere which is cut out atx. Since the equation of the circle in Fig. IV.21 [with radius r and center at

(r, 0)] is (x - r)2 + y2 = r2,

the square of the radius of the circular cross section at x is

y2 = r2 - (.x - r)2 = 2xr - x2and, consequently,

dm (sphere) (2xr - x2)7rAx,

if we assume again that the volume of the slice from the sphere is approxi-mated by a cylindrical disk of radius -\/2xr - x2 and height Ax.

Therefore,

(IV.17) OM (sphere) - (2xr - x2)7rLx2r.

From (IV. 16) and (IV. 17) we have

OM (cone) + OM (sphere) - 27rrx20x + 27rr(2xr - x2)ix

= 27rr(x20x + 2xrLx - x2Lx)

= 47rr2xhx = OM (cone) + AM (sphere),

Fig. IV.21

Page 286: ɷshawns sagan integral and differential calculus

274 CHAP. IV. VOLUMES

where the bars indicate an approximation to the moments which becomesmore accurate the smaller Ox is.

Thus we have in view of (IV. 15)

(IV.18) OM (cylinder) = AM (cone) + OM (sphere).

If we slice up the solids on both sides of the fulcrum by passing planesthrough a number of points which are arranged symmetrically with respectto the fulcrum, we see that (IV. 18) will hold for any triple of correspondingslices and on adding up all the equations of which (IV. 18) is typical, we seethat

M (cylinder) = M (cone) + M (sphere).

If we increase the number of cross sections and at the same time let thelength of every distance between two cross sections (-Ax) tend to zero, it isintuitively clear that

lim M (cone) = M (cone), Iim A? (sphere) = M (sphere)Ax-0 4x-0

and we finally obtain

(IV.19) M (cylinder) = M (cone) + M (sphere)

as we proposed to prove.We know that

m (cylinder) = 4r2zr2r = 8r37r

and that this cylinder has its mass center at a distance r from the fulcrum(where its geometric center is located; see Section 5). Hence,

M (cylinder) = 8r 47F.

The mass of the cone is given by

m (cone) _4r2Tr2r _ 8r3Tr

3 3

Hence its moment, assuming suspension at P, is given by

M (cone) = 16r4v

3

The mass of the sphere is equal to its volume V (sphere) because the massdensity is assumed to be 1. Hence, its moment, again assuming suspensionat P, is given by

M (sphere) = V (sphere) 2r.

Page 287: ɷshawns sagan integral and differential calculus

IV.6. ARCHIMEDEAN SCALE 275

Thus, on substituting these values into (IV.19), we have

8r4Tr = 16r41r +3 2rV (sphere)

and from this we obtain immediately

(IV.20)4ArV (sphere) =

The preceding proof of formula (IV.19) is lacking in mathematical rigor,inasmuch as we went through a mental limit process, the outcome of whichwas guided by our intuition.

Since the concept of a limit was developed to some extent in Chapters IIand III, we are here in a position to give a more rigorous proof of (IV.19).However, this proof will really make the significance of (IV.19) appearquestionable (unless cylinders really wish to play teeter-totter with spheresand cones instead of always playing with other cylinders), because (IV.19)was only of interest to us as long as it served to derive the volume formulafor the sphere. In the subsequent proof the formula for the volume of thesphere will turn up as a marginal result simultaneously with the recognitionof (IV.19) as a true relation.

We subdivide the interval 0 < x < 2r into n equal subintervals of length2r2r . Then the division points x1, x2, x3, , xn_1 will have the coordinates

2r 2r2 2r3 2rk _ 2r(n - 1)x1= ,x2=n

,x3=n 11

,...,xn-1-n11

The same subdivision is to be carried out on the left side of the fulcrumF(x = 0).

Thus, according to (IV.15) and Ox =2r

, we obtain at the point Xkn

2OM (cylinder) - 4r2irxkOx = 4r2ir

2rk 2r = 16zrkn n n

according to (IV.16),

OM (cone) = xk27TL x2r =

and, according to (IV.17),

4r2k2rr 2r 2r = 16r4k2ir

n2 11 n3

k)LIM (sphere) - (2xkr - xk2)irzx2r = 2rir

2r- 2 2rk r - 4r22

2

n

(n n

_ 16r4irk _ 16r4irk2

n2 113

Page 288: ɷshawns sagan integral and differential calculus

276 CHAP. IV. VOLUMES

and we have, according to (IV. 18),

16r47rk - 2r(8r3k27r + 8r37rk - 8r37rk2

n2 n3 n2 173

where we factored out 2r for reasons of convenience. The first and the lastterm in parentheses cancel each other and we have here merely an identity.However, we will not yet carry out this cancellation. All these equationshold for k = 0, 1.

If we think of writing all these equations out, then adding them, we seethat

16r47r n-1 n-12

8r37r n1 8r37r n12(IV.21) k = 2r(8r3ir- I k2 + k - 3 k .

2 )n2 k=1 n3 k=1 17 k=1 n k=1

We observed in Chapter II, Section 12 that

nik= n(n- 1) - 112-nk=1 2 2

k2 = (n - 1)n(2n - 1) = 2n3 - 3n2 + nk=1 6 6

Thus we obtain for (IV.21)

I6r47r(112 - nl112 2

- 2rr8n3

r37r(2173 - 3n2 + n) + 8r37r(n2 - n) - 8r37r(2113 -3 112 + n

L 6 112 2 l13 6

or, as we may write,

8r47r (1 - 1)n

11= 2r8r37r(1 - 3 -I- 1) + 4r37r(1 - 1) - 8r-7r(, - 3 + 1 )J3 2n 2n2 \ 17 3 2n 2112

and we see, on passing to the limit as n -+ oo, that

807T = 2r(8 3 3 7r + 4r37r - 8 3

3 77)

which is merely an identity. At the same time we see that the first term inparentheses yields the formula for the volume of the cone and the tworemaining terms yield the volume of the sphere.

Before closing this section we wish to point out that a very impressivemodel can be built to demonstrate the validity of the relation (IV. 18) veryconvincingly. First, we need a lever, preferably a homogeneous steel rodwhich is balanced at its center, as demonstrated in Fig. IV.22. Then we cut

Page 289: ɷshawns sagan integral and differential calculus

IV.7. VOLUME OF A SPHERE 277

Fig. IV.22

out a circular disk of radius 2 units from some homogeneous material, theheavier the better (we suggest the use of platinum by generously endowedprivate schools and scrap metal by state institutions), and drill a hole in itscenter so that it can slide freely along the steel rod. This disk shall representone of the slices of the cylinder. Then we proceed to cut out two morecircular disks from the same material representing the corresponding slicesof cone and sphere.

We observe that if the radius of the slice which is supposed to come fromthe cone is x, then the radius of the corresponding slice from the sphere has

to be 2x - x2 (observe that r = 1, since the radius of the cylinder waschosen to be 2r = 2). Thus, 2 corresponding slices would have, for example,

radii 1, 1 or1/23 or 12/3

etc. If more than one pair of disks is produced,

then it is recommended that corresponding pairs be painted in the samecolor, using different colors for different pairs.

Then the corresponding two disks are slid onto the rod into a position2 units away from the fulcrum. The third disk is to be placed on the otherside of the lever and its position is to be adjusted so that the lever balances.If the dimensions of the model are reasonably accurate and the lever is inperfect balance, the distance of the third disk from the fulcrum should thenturn out to be equal to the radius of the disk representing the slice from thecone.

7. VOLUME OF A SPHERE BY INTEGRATION

A sphere can be generated by a circle which rotates about one of its di-ameters (see Fig. IV.23). We will use this idea to establish a formula by

Page 290: ɷshawns sagan integral and differential calculus

278 CHAP. IV. VOLUMES

Fig. 1V.23

x

which the volume of a sphere can be found by an integration process. Itdoes not increase the complexity of the problem at all if we consider themore general problem of finding the volume of a solid, the lateral surfaceof which is generated by a rotating curve. So let us consider just any curvey =f(x) and rotate the section of the curve about the x-axis which liesbetween x = a and x = b (see Fig. IV.24).

Y

Fig. 1V.24

Page 291: ɷshawns sagan integral and differential calculus

IV.7. VOLUME OF A SPHERE 279

This process generates a surface of revolution which, together with theplanes at x = a and x = b that are perpendicular to the x-axis, forms theboundary of a solid, a so-called solid of revolution. We will devote thissection to the determination of the volume of such a solid, which in thespecial case that the rotating curve is a semicircle, will be a sphere.

Again, true to tradition, we cut up the solid by n - 1 planes which areperpendicular to the x-axis and are equally spaced at distances Ax =b a The (k + 1)th slice can be approximated by a cylindrical disk (if

nOx is very small) of height Ax. The base radius of this cylindrical disk isf (x) taken at some x in the interval xk < x < xk+1. If we want an inscribeddisk, we choose that value of x for which f (x) has a minimum in the intervalXk < x < xk+l; and if we want a circumscribed disk, we take that valueof x for which f(x) has a maximum. Let us take k+1 such that fminf(x) in xk < x < xk+1 Then the volume of such an inscribed cylind-rical disk is given by

Vk = vf2(sk)Ax, k = 1, 2, n

and we obtain for the volume of a solid which consists of all n inscribedcylindrical disks

n(IV.22) Vn = 17 1 f 2(ek)Lx.

k=1

In order to arrive at a formula for the volume of the solid of revolutionwith which we started, we have to let n > co and simultaneously Ox -k 0.

In order to facilitate the comparison of the present case with the defini-tion of the definite integral in Chapter II, Section 9, we let f 2(x) = F(x)for the moment. Then (IV.22) will read

n.

Y. = 7r jk=1

and we obtain in the limit, according to (11.37),n U

lim Vn = lim '71 Tr F(x) dx,n-oo n-ao 7.=1 1a

if F(x) is uniformly continuous in a < x < b. (Note that iff(x) > 0, whichwe will assume, then f 2(x) will assume its minimum at the same point atwhich f (z) assumes its minimum.)

Now, if we replace F(x) again with f 2(x), we obtain for the volume of thesolid of revolution that is generated by the rotation of the curve y = f(x)between x = a and x = b about the x-axis

(IV.23) V = 7r f f 2(x) dx.a

Page 292: ɷshawns sagan integral and differential calculus

280 CHAP. IV. VOLUMES

If we wish to apply this formula to the determination of the volume of asphere of radius R, we observe that the sphere is generated by rotation of asemicircle of radius R about its diameter, as we mentioned in the intro-duction to this section. We remember the equation of a circle with itscenter in the origin and radius R:

x2 + y2 = R2.

From this we obtain the upper semicircle

Hence,

and we have

y=VR2-x2.f2(x)=y2=R2-x2

R RV (sphere) = Tr (R2 - x2) dx = ITR2f dx - Tr f R x2 dxJ R R J R

_?rR2x R -itx3 R

-2rR3-7rR3 7TR3

-R 3 -R 3 3

4-rR3

3

We may use the same formula (IV.23) to check a partial result of ProblemIV.24. There it was required, among other things, to find the volume of aparaboloid which is generated by rotating the parabola y = x2 about they-axis and truncating at y = 1, by carrying out the limit of a sum process(see Fig. IV.25).

In order to adapt our formula to this situation, we have to make a fewchanges. We observe that now the y-axis is the axis of rotation instead ofthe x-axis. This indicates that all we did with respect to x in the derivationof formula (IV.23) we now have to carry out with respect to y. This isaccomplished simply by interchanging x and y in formula (IV.23). Specifi-cally, we note that instead of representing the rotating curve as a function

Y

Fig. IV.25

Page 293: ɷshawns sagan integral and differential calculus

IV.8. VOLUME OF A TORUS 281

of x: y = f(x), we now have to represent the rotating curve as a function ofy:x = g(y). Then formula (IV.23) will read

b(IV.24) V= ITJ g2(y) dya

where a and b are now limits for y. In our particular case, we have

x=g(y)=1/y,a=0,b=1.Therefore,

i 7Ty2 1 7rV= rJ0

ydy 2 o=2*

Problems IV.25-IV.30

IV.25. The curve y = 1 between x = 1 and x = 2 is rotated about the x-x

axis. Find the volume of the solid thus generated. (Hint: Remember that)

dx x2

IV.26. The curve y = x2 between x = 1 and x = 3 is rotated about the y-axis.Find the volume of the solid thus generated.

IV.27. Same as in Problem IV.26 if the curve is rotated about the x-axis.x2 2 2

IV.28. The upper branch of the curvea2

+ b2 = 1 rotates about thex-axis. Find the volume of the solid thus generated. (Hint: Note that for x = 0,y = ±b and for y = 0, x = -a. Sketch the curve and determine the integra-tion limits. In your result let a = b = R. After this substitution you shouldobtain the formula for the volume of a sphere, if your work was correct.)

IV.29. The right branch of the curve x2 - y2 = 1 between y = -1 and y = 1is rotated about the y-axis. Find the volume of the solid thus generated.

IV.30. Same as in IV.29 by cutting the solid into slices and applying the limitof a sum process. (The solid of Problems IV.29 and IV.30 is a hyperboloid ofrevolution of one sheet.)

8. VOLUME OF A TORUS

A torus is a solid of revolution which is generated by a circle rotatingabout an axis which lies in the same plane as the circle, but does not passthrough the circle. In short, it is the mathematical refinement of a dough-nut.

In Fig. IV.26, the circle of radius a, with center at (b, 0) where b > a,rotates about the y-axis and thus generates a torus. We will devote thissection to the development of a simple formula that will enable us to findthe volume of a torus and similar solids of revolution.

Page 294: ɷshawns sagan integral and differential calculus

282

Fig. IV.26

CHAP. IV. VOLUMES

In general, let us assume that we have some planar region which isbounded above by the graph of the function y = f (x), where f (x) > 0,below by the x-axis, and on the sides by the lines x = a and x = b, wherewe assume that 0 < a < b, i.e., the region lies entirely to the right of they-axis (see Fig. IV.27). We let this region rotate about the y-axis and gen-erate thus a solid of revolution.

Again we subdivide the interval a < x < b into n equal subintervals of

length Ox = b a and approximate the area under the curve in each sub-n

interval by an inscribed rectangle of height f min f (x) in xk < x :5,-

Fig. IV.27

Page 295: ɷshawns sagan integral and differential calculus

IV.8. VOLUME OF A TORUS 283

xk+l. This inscribed rectangle with base in Xk < x < xk+l will generate,upon rotation about the y-axis, a cylindrical shell of inner radius xk andouter radius xk+l and height f The volume of this cylindrical shell isobtained by subtraction of the volume of the inner cylinder of base radiusXk from the volume of the outer cylinder of base radius xk+l:

V (cylindrical shell) = 7Tx2

+1 - xk2)f(k+1)

= 7r(xk+1 + xk)(xk+1 - xk)f (k+1)-Since xk+l - xk = Ax, we have

V (cylindrical shell) = 7r(xk+l + xk) f(k+l)Lx

Now, if Ax is very small, both Xk and xk+l can be approximated by k+lwhich is a value somewhere between xk and xk+l and we obtain

V (cylindrical shell) - 27rk+lf (k+1)Ax.

Now we sum up all the volumes of the n cylindrical shells between a and band obtain

n-1{

n{ gV,,, - 27r .1 k+1J (k+1) Ax = 27r .1 kJ (S k)Ax

k=0 k=1

which will tend to the volume of the solid of revolution that is underinvestigation, if we let n -- oo (and simultaneously Ox -> 0). We have seenin Section 5 that

nJim I ek f (k) D.x = f xf(x) dx.n-oo k=1 Ja

Hence, we obtain for the volume

(IV.25) V = 27r ,, xf (x) dx.a

If we compare this formula with formula (IV.13) of Section 5, we seethat

xV

2w f f (x) dxa

where x is the x-coordinate of the centroid of the region of constant mass

Page 296: ɷshawns sagan integral and differential calculus

284 CHAP. IV. VOLUMES

x

Fig. IV.28

density which is rotated. Thus we arrive at the following simple formulafor V:

(IV.26) V = 2TrxA

where A is the area of the region which is rotated.This formula can be interpreted as follows: The volume of a solid of

revolution is obtained by multiplying the area of the to be rotated regionby the length of the path through which the centroid travels during rotation.(Note that 2irx is the length of the circle of radius x.)

Now let us apply formula (IV.26) to the volume determination of thetorus. We have A = a2irand

b

(the centroid of a circle of constant mass density is its geometric center).Hence, V (torus) = 2ir2a2b.

We can use formula (IV.26) backwards to find, for example, the centroidof a semicircle without difficulties. We place the semicircle in a position asindicated in Fig. IV.28. If this semicircle is rotated about the y-axis, weobtain a sphere with the volume

V 4a3ir

3

The area of the semicircle is given by

Aa27r=-.

2

Page 297: ɷshawns sagan integral and differential calculus

IV.9. SIMPSON'S RULE AND WINE BARRELS

Hence we obtain from (IV.26)

4a3Tr Tra2= 2Trx -and, consequently,

Problems IV.31-IV.34

3 2

4ax=-.3Tr

285

IV.31. A triangle with vertices at (1, 0), (2, 1), and (2, -1) is rotated about they-axis. Find the volume of the solid which is thus generated.

IV.32. The ellipse (x - 2)2 + 4 = I is rotated about the y-axis. Find the

volume of the solid thus generated.IV.33. The circle (x - 4)2 + (y - 1)2 = 1 is rotated about the line y = x.

Find the volume of the solid thus generated.

IV.34. The semicircle x = 3 + V4 - y2 is rotated about the line x = -1.Find the volume of the solid thus generated.

9. SIMPSON'S RULE AND WINE BARRELS

In Problem IV.28 it was required to find the volume of the ellipsoid ofrevolution which is generated by the curve

xs(IV.27)

a2 +bz

2 = 1

rotated about the x-axis (see Fig. IV.29).

Fig. 1V.29

Page 298: ɷshawns sagan integral and differential calculus

286 CHAP. IV. VOLUMES

This was done by employing formula (IV.23) with the integration limits-a and a:

V = 7r fa

f 2(x) dx.a

It follows from (IV.27) that

f 2(x) = y2 = b2 (1 - a2)

Hence

fa(IV.28) V = 1r b2 (1 - x22)

dx.aa

The integrand is a quadratic function of x. We remember that in Simp-son's rule (see Chapter II, Section 14) the integrand is approximated by aquadratic function of x (parabola) in order to find an approximate valueof the integral. Consequently, if the integrand is quadratic in the first place,we expect to obtain a precise result and not only an approximation fromSimpson's rule. Thus let us apply Simpson's rule to find the integral in(IV.28), using the minimum number of subintervals, namely, n = 2:

-a

+ + bwhere yo = F(a), y1 = F

a

2, and y2 = F(b).

x2In our case, F(x) =f2(X) = b2 1 -

a2, the lower integration limit is

= 0 and we obtain-a and the upper integration limit is a. Hence, a2

b

yo=0, yl=b2, Y2=0.Therefore

V=7r

a

6

a (4b2) = 4rr3 b2

which is indeed the result that should have been obtained in Problem IV.28by integration.

Suppose we cut off the solid in Fig. IV.29 with two planes which areperpendicular to the x-axis, each at the distance c < a from the origin (seeFig. IV.30). The solid thus obtained resembles a wine barrel very closely.Again we use Simpson's rule to determine the volume of such a wine barrel.

We have now -c as the lower integration limit, c as the upper integrationlimit, and, consequently,

C2 C2

y0=b2(1 -a2), y1=b2, y2= b2(1-a2).

IF(x) dx- b3n

a(yo 4y1 Y2),

Page 299: ɷshawns sagan integral and differential calculus

IV.9. SIMPSON'S RULE AND WINE BARRELS 287

Fig. IV.30

Hence, we obtain

V= 7T 26 ZL(1 - a2!) + 4 + (1 - a2)1 = 7r3b2(6 _ a22)

for the volume of a wine barrel with dimensions as indicated in Fig. IV.30.It may be of interest to note that the great astronomer Johannes Kepler

(1571-1630) who investigated in his "Stereometria doliorum" in 1612the volume of 92 solids, mostly solids of revolution, proved that theAustrian wine barrels, for a given amount of material to manufacture, con-tain the largest possible volume. On this happy note let us conclude ourdiscourse of the Calculus.

Problems IV.35-IV.38

IV.35. The hyperbola x2 - 4y2 = I between y = -2 and = 2 is rotatedabout the y-axis. Find the volume of the solid thus generated by Simpson'srule.

IV.36. The curve y = x3/2 between x = 0 and x = 1 is rotated about the x-axis.(a) Find the volume by integration. (b) Find the volume by Simpson's rule.(c) Why does Simpson's rule yield the precise result, even though the integrand isnow a cubic function of x?

IV.37. Find the volume of a sphere of radius R by Simpson's rule.x2 12

IV.38. Let the ellipsea2

+b2

= 1 be rotated about the -axis. Find the

resulting volume by Simpson's rule.

Page 300: ɷshawns sagan integral and differential calculus

288 CHAP. IV. VOLUMES

Supplementary Problems IV

1.1. The base of a prism of height x2 is a regular heptagon which is inscribedin a circle of radius 2x. Find the rate of change of the volume with respect to xatx=1.

2.1. The outside wall of a pipe 10 ft long is a circular cylinder of radius 3,and the inside wall is a prism, the base of which is a regular hexagon inscribed ina circle of radius 2.5 ft. The pipe is manufactured of a material that weighs100 g per cm3. Find the weight of the pipe.

2.2. The largest possible prism of rectangular base is to be inscribed in a rightcircular cylinder of height 10 and radius 2. What are its dimensions?

2.3. A cylindrical can is to be manufactured so that it will hold 50 cu in. whilethe least possible amount of material is used. What are its dimensions?

2.4. The material for the lateral surface of a cylindrical can costs 9$ per sq ft,and the material for top and bottom costs 15$ per sq ft. What are the dimensionsof the cheapest can that holds 10 cu in.?

2.5. Same as in Problem 2.4, if we take into account the waste of material,assuming that top and bottom are cut out of square sheets of dimension 2r where ris the radius of the base of the cylindrical can.

3.1. Approximate the volume of a cone of radius r and height h by the sum ofthe volumes of n inscribed cylindrical disks.

3.2. Approximate the volume of a cone of radius r and height h by the arith-metic mean of the approximation obtained from circumscribed cylindrical disks(see p. 260) and the approximation obtained from inscribed cylindrical disks(see Supplementary Problem 3.1). Find the absolute error committed by thisapproximation. How much better is this approximation compared to the one onp. 260 and the one in Supplementary Problem 3.1 ?

3.3. A right circular cylinder is to be inscribed in a right circular cone of radiusr and height h, so that its volume is a maximum. What are its dimensions?

3.4. Find the dimensions of a right circular cone that has a volume of 12 cu in.and the smallest possible lateral surface area.

3.5. Same as in Problem 3.4 so that the total surface area is a minimum.4.1. A circular cone of radius 2 in. and height 5 in. which is made of a material

that weighs I oz per cu in. is suspended at a point 6 in. from the fulcrum on a leverarm. Find the force required to keep the lever in balance, if it is to act at a point10 in. from the fulcrum on the other lever arm.

4.2. Masses of equal magnitude are placed at the vertices of a parallelogram.Use intuitive reasoning to find the mass center.

4.3. Same as in Problem 4.2 if the masses are placed at the vertices of a tri-angle.

5.1. Find the mass center of the region of constant mass density that is bounded

by y24x= -3

+ 4 and the y-axis.

5.2. Find the mass center of the region of constant mass density that is boundedby y = -x2 + 6x and y = 1.

Page 301: ɷshawns sagan integral and differential calculus

IV. SUPPLEMENTARY PROBLEMS 289

5.3. Find the mass center of the region of constant mass density that is boundedx

by y =2

+ 4, x = -3y - 5, x = -2, x = 4. (Make a sketch of the region.)

5.4. Find the y-coordinate of the mass center of the region of constant massdensity that is bounded by the curve y = sin x between x = 0 and x = it andthe x-axis. (Hint: See Chapter II, Supplementary Problem 14.4.)

6.1. A right circular cone of radius 2 and height 2 is suspended at a distance2 from the fulcrum on a lever arm. The cone is to be balanced by a right circularcylinder that is slid onto the other lever arm so that its axis coincides with thelever arm and one face goes through the fulcrum. Choose radius r and height rof the cylinder so that the lever is in balance. (Assume that cone and cylinderare both made of the same material of constant mass density.)

7.1. Find the volume of a right circular cone of radius r and height h byintegration.

7.2. The exterior surface of a container is a right circular cylinder that isgenerated by rotation of the line x = 4 between y = 0 and y = 10 about they-axis. The interior surface is a paraboloid generated by rotation of the parabolay = x2 + 1 about the y-axis. Assume that the material of the container weighs12 oz per cu in. What is the weight of the container?

8.1. A triangle with vertices at A(2, 1), B(3, -2), C(2, 4) is rotated about the

line y = -. Find the volume of the solid thus generated.

8.2. The semicircle y = v' 1 - x2 is rotated about the line y = -x - 4. Findthe volume of the solid thus generated.

9.1. The curve y3 = x2 between y = 0 and y = 1 is rotated about the y-axis.Find the volume of the solid thus generated by Simpson's rule.

Page 302: ɷshawns sagan integral and differential calculus
Page 303: ɷshawns sagan integral and differential calculus

APPENDIX

Al. POLYNOMIALS

A polynomial is, as the name suggests, something which contains manyterms. In mathematics, we specifically call an expression a polynomial ifit is a sum of "many" powers of x which, in turn, are multiplied by certainconstants.

Thus, for example,

is a polynomial, andx2 + 3x - 4

3x4-4x3+2x-7is also a polynomial. In the first example, the highest power of x thatappears is the second. We call it, therefore, a polynomial of the seconddegree. For the same reason we call the second polynomial a polynomialof the fourth degree. In general, we call

P(x) = aoxn + alxn-1 + a2xn-2 + ... + an-1x + an

a polynomial of the nth degree because n, which has to be a positive integer,is the largest exponent which occurs in this expression. The symbols ao, a1,a2, , an-1, an-which are called the coefficients of the polynomial-standfor constant numbers.

If we substitute for x a certain value, then the polynomial P(x) willassume a certain value. It is a function of x. If we wish to find those valuesof x for which the polynomial P(x) assumes the value 0, we have to solvethe equation

aoxn + alxn-1 + a2xn-2 + + an-1x + an = 0.

If there is a value (or values) of x for which this equation is satisfied, wecall this value (these values) a root (roots) of the equation (/ is a root ofthe equation x2 - 2 = 0). If the coefficients ao, a1, , an are rational

291

Page 304: ɷshawns sagan integral and differential calculus

292 APPENDIX

numbers, then the above equation is called an algebraic equation of the nth

degree. The values of x for which this equation has a solution-the rootsin our new terminology-are called the zeros of the polynomial P(x).

It can be shown that an algebraic equation of the nth degree has n realor imaginary roots x1, x2, , x,,, some of which or all of which might beequal. For example, x2 - 2x + 1 = 0 has the two equal roots xl = 1,x2 = 1, while x3 + x = 0 has one real root xl = 0 and two complex rootsx2=iandx3=-i.

If x1, x2, , x are the n zeros of the polynomial P(x) of the nth degree,then P(x) can be written in the form

a0xn + alxn-1 + a2x71-2 + ... + an-lx + an

= a0(x - x1)(x - x2) ... (x - xn).

For example, x1 = 0, x2 = 1, x3 = 3 are the zeros of

P(x) = x3 - 4x2 + 3x

and it can easily be verified that

x3 - 4x2 + 3x = (x - 0)(x - 1)(x - 3).

Let us consider a polynomial of even degree which contains only evenpowers of x: let n = 2v where v is some positive integer and let

P(x) = a0x2v + a2x2v-2 + a4x2v-4 + ... + a2v-2x2 + a2v.

If we substitute -x for x, we obtain

P(-x) = ao(-x)2v + a2(-x)2v-2 + a4(-x)2v-4 + ... + a2v-2(-x)2 + a2v

= aox2v + a2x2v-2 + a4x2v-4 + ... + a2v-2x2 + a2v = P(X)-

So we see that if n = 2v and only even powers of x are present, then

P(-x) = P(x).

For this reason, we generally call a functionf(x) which has this property:

an even function. f(- X) = f(x)

On the other hand, if we consider a polynomial of odd degree whichcontains only odd powers of x, namely, n = 2,u - 1 and

P(x) = alx2u-1 + a3x2u-3 + a5x2u-5 + ... + a2µ-3x3+ a2µ-1x

and substitute -x for x, then we obtain

P(-x) = al(-x)2,,-1 + a3(-x)2F,-3 + a5(- x)2µ-5 + .. .

+ a2u-3(-x)3 + a2u-l(-x)= -alx2u-1 - a3x2u-3 -, a,x2u-5

p2µ-3x3 - a2u-1x= -P(x),

Page 305: ɷshawns sagan integral and differential calculus

Al. POLYNOMIALS 293

y

Fig. All Fig. A.I.2

i.e., if P(x) is of odd degree and there are only odd powers of x present,then P(-x) = -P(x).

For this reason, we generally call a function with this property:

an odd function. f(- X) = -f(x)As the reader can easily convince himself, even functions have a graph

which is symmetrical with respect to the y-axis and odd functions have agraph which is symmetrical with respect to the origin of the coordinatesystem-if they have a graph at all (see Figs. AI.1 and 2).

Problems AI.1-AI.7

AI.1. Write down the most general polynomial of the(a) 0th degree (b) first degree

(c) seventh degree

AI.2. Show that a polynomial of the first degree with rational coefficients hasone, and only one, zero which is rational.

AI.3. Show that a polynomial of the second degree with rational coefficientshas at most two real zeros. What can you say about the real zeros of a poly-nomial of the third degree?

AI.4. Show that the function y = (1 for x rational l is even.0 for x irrational;AI.5. Find a polynomial of the fourth degree that vanishes at the points

xl = 1, x2 = 2, x3 = 3, x4 = 4. Does this problem permit a unique answer?AI.6. Find a polynomial of the third degree that vanishes at the points

xl = -1, x2 = 2, x3 = 3 and assumes the value 12 for x = 1.AI.7. Write the polynomial x4 - 4x3 + 3x2 + 4x - 4 as a product of linear

factors (x - Xk) where Xk are the zeros of this polynomial.

Page 306: ɷshawns sagan integral and differential calculus

294 APPENDIX

All. A UNIFORMLY CONTINUOUS FUNCTION THATDEFIES GRAPHICAL REPRESENTATION

It turns out that there are functions which are uniformly continuous in aclosed interval but defy graphical representation. By this we mean: Thereare uniformly continuous functions which are such that it is impossible todraw a curve representing them, i.e., they do not have a graph.

While a study of these functions requires a great amount of mathematicalsophistication, we will try, nevertheless, to outline the generation of such afunction. In the process, the reader will have to accept a few things onfaith, because we are not able to establish rigorously every statement wemake with the rather limited techniques that are at our disposal in thistreatment.

Let us consider Fig. AII.l. We see a broken line and a dotted line. Thebroken line forms the two equal sides in an isosceles triangle with height1 and base 1. The dotted line together with portions of the broken lineforms the four sides of two isosceles triangles of height 1 and base z. Letus call the function that is represented by the broken line sl(x) and the func-tion which is represented by the dotted and portions of the broken lines2(x). Now we consider the function y2 = s1(x) + s2(x), which is obtainedfrom s1(x) and s2(x) by addition of corresponding ordinates. The result isdepicted by a solid line in Fig. All. I.

We next transfer the solid line from Fig. AII.1 to Fig. AII.2 as a brokenline. The dotted line in Fig. All.2 is related to s2(x), as s2(x) is related tos1(x). We denote the function which is represented by the dotted line bys3(x). [s3(x) is formed from the eight equal sides of four isosceles trianglesof height 4 and base }r.]

Again, we add s3(x) to the function y2 = s1(x) + s2(x) which is repre-sented by the broken line and obtain a function, namely, y3 = s1(x) +s2(x) + s3(x), which is represented by the solid line in Fig. AII.2.

The solid line from Fig. AII.2 is now transferred to Fig. AIl.3 as a brokenline and we proceed as before, i.e., take s4(x) (the equal sides of eightisosceles triangles of height - and base J) and add it to the function repre-sented by the broken line, etc. After five steps we obtain the function whichis represented by a solid line in Fig. AII.5, namely, the functiony6 = S1(x) + s2(x) + S3(x) + s4(x) + S5(x) + S6(X)-

At every step we obtain a uniformly continuous function, because everyfunction is obtained as the sum of two uniformly continuous functions,and it is intuitively clear that the sum of two uniformly continuous func-tions is again a uniformly continuous function. Now it can be shown thatif we continue this process of adding functions s7(x), s6(x), without everending it, we arrive at a uniformly continuous function in 0 x < 1. At

Page 307: ɷshawns sagan integral and differential calculus

Y

Fig. AII.1

Y3

=sl

(x)

+S2

(x)

+S3

(X)

Fig. AII.2

x

Page 308: ɷshawns sagan integral and differential calculus

Fig.

AII

.3

Y5 = 51(x) + 52(x) + 531x) + 54(x) + 56(x)

151(

x) +

52(

x) +

$3(x

) +

s4(x

)I

c-55

(x)

A A A A A

A A

A A A A A

A A

AA

rr

rr

rr

r\ r

\ r\

\,I \

r\ r

\ r'.

rr

0 1 16

1

Fig. AII.4

Page 309: ɷshawns sagan integral and differential calculus

All. CONTINUOUS FUNCTION WITHOUT GRAPH 297

yA

Y6 = sl(x) + s2(x) + s3(x) + s4(x) + s50) + s6(x)

,Iss(x)

O

32

Fig. AII.5

the same time, it is quite obvious that it is humanly impossible to graph thisfunction (animals have not been very successful with it, either) because thiswould require drawing a polygon with infinitely many vertices.

We will show below that all the values of this function are less than 2.We will not be able to show, however, that the function obtained by thisinfinite process indicated above is a uniformly continuous function. But,as we pointed out already, how could it ever become discontinuous, if onl)uniformly continuous functions are added in the process? (Note that sucl.an argument is quite dangerous and may lead to wrong conclusions. How-ever, in the present case it does apply. In general, it is not permissible tcgeneralize from a result that can be obtained by finitely many operationsto a result that is obtained from infinitely many operations of the samenature.)

Now let us show that none of the values of this function exceeds the

Page 310: ɷshawns sagan integral and differential calculus

298 APPENDIX

value 2. For this purpose we need the results of Chapter II, Section 3 onSeries and Sequences.

Our function, which we will call Y(x), is composed as follows:

Y(x) = S1(x) + S2(x) + S3(x) + ... =1 Sk(x)1

We know from the construction of the functions sk(x) that

s1(x) < 1

S2(X) <-1

(*) S3(X) -<4

2n-1

for all x in 0 < x S 1. Thus, if yn(x) denotes the nth partial sum of the

00series .1 sk(x) .k=1

yn(x) = S1(x) + s2(x) + s3(x) + ... + Sn(x)

(and gn(x) denotes the nth partial sum of the geometric seriesk=1 2)

(l n1 1 1 1- `2 r

g(x) = 1 + + + + = = 2r1 - (i)fl]

1--2

we have in view of (*) that

yn(x) < gn(x) = 2 Cl - (21Y]

for all n. If we take the limit as n -+ oo, we obtain

nlim yn(x) = Y(x) < lim 2 [1 - (!)] = 2,

n +oo n- m 2

Y(x) < 2for all xin 0<x<1.

Page 311: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 299

AIII. CIRCULAR FUNCTIONS

1. The Sine Function

Let us consider a unit circle (circle with radius 1) with its center at theorigin of a right coordinate system (see Fig. AIII.1). We assume that apoint P is moving along the circumference of this circle in the positive(counterclockwise) direction. The location of this point is uniquely deter-mined by the angle a subtended by the line through 0 and P and the positivex-axis, which we agree to measure in the positive direction.

Suppose that the sun is located at the "end" of the positive x-axis. Thenthe sunlight will come in from the right in parallel rays, as indicated by thearrows in Fig. AIII.1, and will project a shadow of the point P onto they-axis. Let us use Q to denote the shadow (or projection) of P. Q is at acertain distance y from the origin and this distance will change as the pointP moves on the periphery of the unit circle. We assume, for the sake ofour argument, that the very moment that the point P crosses the y-axis atthe point (0, 1), the sun switches from the "endpoint" of the positive x-axisto the "endpoint" of the negative x-axis.

We can see quite clearly that the shadow Q will now return to the origin,as P keeps moving in the positive direction, will cross the x-axis at theorigin, and move along the negative y-axis until it reaches the point (0, -1).At that instant, we switch the sun back to the "endpoint" of the positivex-axis (it would be more practical to have two suns at our disposal which

x

Fig. AM.1

Page 312: ɷshawns sagan integral and differential calculus

300 APPENDIX

could be blacked out alternatingly) and the shadow Q will move back tothe origin as P moves back into the position T where it started. If P keepson moving, then the same process will repeat itself again and again-"untilthe circle is worn out."

There are many very interesting aspects to this simple experiment. Firstof all, we recognize that the distance of the shadow Q from the origin 0depends on the angle a between OP and the positive x-axis. (We reiteratethat this angle is measured from the positive x-axis in the positive direction).This means that the distance y = 6-Q--is a function of a. While a can assumeall possible values (If P starts out from T and moves in the positive direc-tion, then a will increase from 0 to co, and if P starts out from T and movesin the negative-clockwise-direction, then a will decrease from 0 to - co),the distance y = OQ can only assume values between -1 and 1 if we agreeto attach a minus sign to this distance to signify when Q is below the x-axis.

This function is of such paramount importance in mathematics that wedesignate it by a special name. We call it the sine function (from latin"sinus") and abbreviate it by writing

y=singwhere a is the angle between OP and the positive x-axis (a is positive whenmeasured in the counterclockwise direction and negative when measuredin the clockwise direction), and y is the distance of P's shadow Q from theorigin (y is positive when Q is in the upper half-plane and negative whenQ is in the lower half-plane).

What we have formulated above in words about the fact that the sinefunction cannot assume values above 1 and below -1, we can now expressin terms of an inequality:

-1 <sina <1or, using the absolute value, symbol,

(AIII.1) Isin al < 1

for all a.We also mentioned that after P has gone around once, the entire process

will repeat itself or, in other words, if P is in a certain position, then thedistance of Q from the origin, or, if you prefer, the value of the sinefunction, will not reveal how often the point P went around the unit circlebefore it took this particular position. The important fact is this: Sinceone full circular motion of P corresponds to OP sweeping a full angle of360°, we can say, whether OP subtends an angle of a° or an angle of a° +k 360°, where k is any positive or negative integer, the value of the sinefunction will be the same in all these instances:(A111.2) sin (a + k - 360) = sin a

Page 313: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 301

for any a and all integers k. We express this property verbally by sayingthat the sine function has the period of 360°.

We observe further that if OP sweeps a certain angle a in the positivedirection and sweeps the same angle in the negative direction (-a), thenthe distance of its shadow from the origin will in both cases be the same,the only difference being, that the two projections will be on opposite sidesof the origin, which is expressed by a sign difference of the values of thesine function :

(AIII.3) sin (-a) = -sin cc.

We note that this very property is exhibited by polynomials which con-tain only odd powers of x. We call the sine function for this reason an oddfunction. (See also Appendix Al.)

Problems AIII.1-AIII.4

AIII.1. Express the following sine functions in terms of a positive or negativeangle which in absolute value is between 0 and 180°:

sin 675°, sin 1079°, sin (-361'), sin 7963 °, sin (-979°)AIII.2. Using formulas (AIII.2) and (AIII.3), show that

(a) sin a = -sin (720 - a)(b) sin (-a) = -sin (a + 1080)(c) sin (cc + 360) = -sin (360 - a)

AIII.3. For what values of a does sin cc have the value(a) 0, (b) 1 (c) -1?

AIII.4. Show that - forallx>0.x- x -x2. Some Values of the Sine Function

In this section we will evaluate the sine function for some arguments thatcan be expressed as simple fractions of 360°. For this purpose we recalla simple theorem from plane geometry concerning similar triangles.

If we consider the triangle L (ABC) (Fig. AIII.2) and the triangle/(AB'C') where B', C' are the intersection points of the lines throughA, B and A, C with a line parallel to BC, then we see that these two tri-angles are similar, because corresponding angles are equal. According toa theorem on similar triangles, the ratio

AB AB'

BC B'C'

Page 314: ɷshawns sagan integral and differential calculus

302 APPENDIX

Fig. AIII.2

is invariant, i.e., does not change its value whatever line B'C' we may take,as long as it is parallel to BC.

Let us apply this result to the triangles L (ORP) and L(OR'P') in Fig.AIII.3. First, we realize that

sing=OQ=RP.Next, we see that

RP R'P'

OP OP'

for any points R', P' that lie on a line perpendicular to the x-axis with R' onthe x-axis and P' on the line through 0, P. We see further on that OP = 1

Fig. AIII.3

Page 315: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 303

Fig. AIII.4

since we are dealing with a unit circle. Thus it follows that

R'P'sin a =OP,

Opposite sidesin a =Hypotenuse

in a right triangle with the angle a° which is formed by the hypotenuse andthe side which is not the opposite side, the so-called adjacent side.

We will now try to find the value of sin 45°. In Fig. AIII.4, the triangleA(ABC) has an angle of 45° at the vertex A, a right angle of 90° at thevertex B, and hence, an angle of 45° at the vertex C. (The sum of the threeangles in a plane triangle is a stretched angle of 180°.) Thus, A(ABC) is anisosceles triangle with the sides AB and BC equal. Therefore, we obtainfrom the theorem of Pythagoras

CBC2+BC2=-8/2BC.

Thus, we have

sin 45° = BC - BC - 1 = /2AC ,/2 BC /2 2

Now let us find the value of sin 30°. If a right triangle has one angle of30°, then the other angle has to be

180 - 30 - 90 = 60°.

Such a triangle A(ABC) is represented in Fig. AIIL5. If C' is the sym-metric image of C with respect to the line through A and B, we see that thetriangle A(AC'C) is an equilateral triangle (all three angles are equal, henceall sides are equal).

Page 316: ɷshawns sagan integral and differential calculus

304 APPENDIX

Consequently,

and we obtain

Since

BCAC

2

sin 30° =BC = AC = 1

AC 2AC 2

2

,CB =VAC'-BC'= AC'-`4C = 2 AC,

we have also from Fig. AIII.5 that

V3AC

AB 2 - J3sin 60° = = = - -

AC AC 2

Finally, we note that

sin 0° = 0 and sin 90° = 1.

Thus we obtain Table AIII.1.

Table AIII.1

0 30 45 60 90

1 V2 V3sin u 0 1

2 2 2

0'

Fig. AIII.5

Page 317: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 305

90° (-270°)

135° (-225°) 45° (-315°)

30° (-330°)

° °--- 0 (-360)=360

210° (-150°) _ _ -- 330° (-30°)

225° 135°)-(- 315° (-45°)

270° (-90°)

Fig. AIII.6

From Table AIII.1 and Fig. AIII.6 we construct immediately Table AII1.2.

Table AIII.2M

sins

0 30

20

45

2

60

N/3

2

90

I

120

2

135 1 150

v22 2

180

0

210

2

225

-',/22

240

-'v32270

-1

300

-V32315

2330

2

360

0

In view of formula (AIII.2) in Section 1, we can add or subtract anymultiple of 360° from the argument a in the first line of Table All 1.2 withoutchanging the values of sin a in the second line.

3. Other Circular Functions

Let us refer to Fig. AIII.7. Again we consider a unit circle with itscenter at the origin and a point P moving in the positive direction on thecircumference of the unit circle, starting out at the point T. If we nowconsider the projection R of the point P onto the x-axis (by a parallel lightcoming from above and below, respectively), we see that the distance ORis also a function of the angle a between OP and the positive x-axis. Thetriangle A(0RP) has a right angle at R. The angle at 0 is a', hence, theangle at P is 90 - e. Two angles whose sum is a right angle are called

Page 318: ɷshawns sagan integral and differential calculus

306 APPENDIX

Fig. AIII.7

complementary angles. We see that in view of OP = 1,

OR = sin (90 - a).OP

Since 90 - a is the complementary angle of a, we call sin (90 - a) thecomplementary sine of a and denote it by cos a (read "cosine (X"):

(AIII.4) cos a = sin (90 - a).

In view of the theorem of Pythagoras, we have

OR2 + RP2 = 1

and consequently we obtain the important identity

(AIII.5) sin2 a + cost a. = 1

for all values of a.From Fig. AIII.8 we can see that

(AIII.6) cos (-a) = cos a,i.e., the cosine is an even function (see Appendix AI).

Finally, we define a third function by the ratio

sin atan a

cos a

(Read: "tangent a"). If S is the intersection point of the line through OP(see Fig. AIII.9) with the line perpendicular to the x-axis through T, wesee that in view of

sin a_ TScos a OT

Page 319: ɷshawns sagan integral and differential calculus

Alli. CIRCULAR FUNCTIONS 307

Fig. AIII.8

and OT = 1, the function tan a can be represented by the length of thesegment TS. This representation also serves to explain the term "tangent."

The functions sin a, cos a, tan a are called circular functions in view oftheir definition in terms of the unit circle. The customary trade name ofthese functions is trigonometric functions (yovv = angle) suggesting theirrelationship to the measurement of triangles. In many treatments, thesefunctions are indeed introduced as the different ratios of two sides in a right

Fig. AIIL9

Page 320: ɷshawns sagan integral and differential calculus

308 APPENDIX

triangle and then generalized to angles > 90°. We have started with thegeneral definitions.

We wish to point out that there are three more functions of this type onthe market, namely, cot a (cotangent a), sec a (secant a), and csc a (co-secant a). We will not say another word about these functions after havingmentioned them, because they will not do us any good anyway, and as amatter of fact, they hardly do anybody any good but a great effort is usuallymade by textbook authors to introduce them and establish all their relation-ships with the three functions we introduced in our treatment.

Really, as the student can clearly see, only the sine function is actuallyessential, because cosine and tangent can easily be expressed in terms ofthe sine,

sin acos a. = sin (90 - a), tan a =

sin (90 - a)

However, it is sometimes practical to be in possession of an abbreviatednotation for these two descendents of the sine function.

From cos a = sin (90 - a) and tan a =sin a

and from Table AIII.2cos a

we obtain the entries in Table AIII.3 as presented below:

Table AIII.3x 0 30 45 60 90 120 135 150 180 210 225 240 270 300 315 330 360

'/3 V2 1 i 'L3 V3 1 1 V2sx 10

- - - -1 - _0

i2 2 2 2 2 2 2 2

tanx 0 1 1 'V3 0 1 1 V3 co -1 01/3 \/3 '73 :73

Problems AIII.5-AIII.10

AIII.5. Compute the entries in Table A1II.3 from Table AIII.2 and thesin arelations cos a = sin (90 - a) and tan a.=cos a

AIII.6. Check the identity sine a + cos2 a = 1 for the following values of «:

0, 45, 60, 135, 210, -150, -30, -225.

AIII.7. From sin2 a + cost a = I it follows that cos a = V'1 - sin2 a. Forwhat values of a do we have to extract the positive square root and for whatvalues of a the negative square root?

AIII.8. For what values of « is (a) tan a > 0, (b) tan a < 0?AIII.9. Show that cos (a + 360 k) = cos a.AM.10. Show that tan (a + 180 k) = tan a.

Page 321: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 309

4. Some Identity Relations Between Circular Functions

We will derive in this section a few important identity relations betweencircular functions. The significance of an identity relation, or simplyidentity is that it holds true for any value of the variable (within reason).Thus,

(1 +x)2= I +2x+x2,(1+x)(1-x)=1-x2

are algebraic identities.We already know some identities involving circular functions, namely,

cos a= sin (90-a)and

sin atan a =COS cc

While these two identities are quite trivial, since they really serve to definethe cosine and the tangent function, the relation

sin 2 a + COS2 a = 1,

derived in Section 3, which is also an identity, is not so trivial.We are now out to derive some identities which will enable us to express

circular functions of a certain argument in terms of circular functions oftwice the argument and half the argument, respectively. We refer for thispurpose to Fig. AIII.10. We consider an angle of a° subtended by OA andOB. The coordinates of the points A and B are clearly

A(1, 0), B(cos a, sin a).

Then we consider the corresponding negative angle of -a.° between OA andOC. The coordinates of C are clearly

C(cos (-a), sin (-a)).

Since cos a is an even function and sin a is an odd function, we have

and, therefore,cos (-a) = cos a, sin (-a) = -sin cc

C(cos a, -sin a).

Finally, we consider the angle of 2u° between OA and OD. The co-ordinates of D are given by

D(cos 2a, sin 2a).

We observe that the triangles Q(OCB) and L(OAD) are congruent,because both have an angle of 2a° at 0 and, in both cases, the two sides

Page 322: ɷshawns sagan integral and differential calculus

310 APPENDIX

Fig. AIII.10

emanating from 0 have the length I (remember, we consider a unit circle).Hence, it follows that the remaining sides have to be equal:

CB=ADand from this it follows that

(AIII.7) CB2 = AD2.

We obtain from the distance formula (see Chapter I, Section 7)

CB2 = (cos (x - cos a)2 + (sin a - (-sin a))2 = 4 sin2 a.and

AD2 = (cos 2a - 1)2 + (sin 2a - 0)2 = 2 - 2 cos 2a.Thus we have, in view of (AIII.7) :

4sin 2..=2-2cos2aand from this we obtain

(AIII.8) sine a =1 - cos 2a.

'

the so-called half-angle formula for the sine.We obtain a similar formula for the cosine, if we utilize the theorem of

Pythagoras, sin2 a. + cost a = 1. Then

cost a= 1 - sin2 a = 1 - 1 - cos 2a

i.e.,

(AIII.9) cost a = 1 + cos 2a2

the so-called half-angle formula for the cosine.

2

Page 323: ɷshawns sagan integral and differential calculus

All]. CIRCULAR FUNCTIONS 311

These two formulas enable us to find the value of sin a and cos a in

terms of the value of cos 2a. For example, we know cos 450 _ A/2

Then, by using (AIII.8), we obtain 2

1-2X45 2 2-V2sin" - _ =

and hence2 2 4

sin 22.5° 1/2

where we have to take the positive square root, because 22.5° is an angleless than 180° and we know that the sine is positive for all angles between0° and 180°.

From (AIII.9) we obtainJ2

45 1 + 2 2 +cos" - _ =

and consequently2 2 4

cos 22.5° = 11 2 +

Let us play with this formula a little more. Using (AIII.9) again, weobtain

1 -F 2 -1-245 2 -{- V2 -{-/2=cos

4 2

=4

and hence45-U2

cos4

-and we obtain in general

-{--12+`%2

(AI1I.10) cos25=1, 2+ 12 2+V2+---+

where we have n + 1 nested square roots on the right side.

Let us live dangerously and let n -+ oo. Then 25 --> 0 and consequently

cos 25 -* 1. But what happens on the right side? We obtain infinitely

many nested square roots. Does that make any sense? If it is indeedpermissible in (AIII.10) to let n - - oo, then we have to accept as a con-sequence that

I=J2+V2+2 _+- -

Page 324: ɷshawns sagan integral and differential calculus

312 APPENDIX

or

(AIII.11)

That this is indeed senseful, can be checked by an entirely differentprocess. If the nested square root stands for some number at all, it willhave to be the number 2, for:

let

1=J2+V2+ 2+Then

a2=2+ J2+V2+V2+... =2+A

since it is immaterial if we have infinitely many nested square roots oroo - 1 many square roots on the right side. Now we have a quadraticequation for 2

with the solutions22-2-2=0

1+.J1+8 1+3X1,2 -

2- 2

Clearly, we have to reject the possibility that 22 = -1, if we extractpositive square roots only. The other solution 11 = 2 appears to supportour bold statement in (AIII.11).

Let us return to our task at hand. We will try to utilize formulas (AIII.8)and (AIII.9) to express cos 2a and sin 2a in terms of cos a and sin a.

We write (AIII.8) and (AIII.9) in the form

2 sine a = I - cos 2a

2 cos' a. = 1 + cos 2a

and after subtraction of the first equation from the second equation,division by 2 and switching of sides, we arrive at

(AIII.12) cos 2a = cos2 a - sin2 a,

the so-called double-angle formula for the cosine.From the theorem of Pythagoras, we obtain

sin 2a = 1/1 - cos2 2a = 1/1 - (cos2 a -sine a)2.

Page 325: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 31.

We replace cos' a by I - sin2 a and obtain

sin 2a = 1 - (1_-2 sine a)2 = V'_4-sine a_-4 sin4 a.

= 2 1/sin2 a = 21/sin2 a 1/cos2 v. = 2 sin a cos v.,i.e.,

(AIII.13) sin 2a = 2 sin cc cos cc,

the so-called double-angle formula for the sine.

Problems AIII.11-AIII.19

AIII.11. Show that

AII1.12. Prove thatcos 2a = 2 COS2 a - 1.

cos (cc + 13) = cos a cos a - sin « sin (3

for any a, 13. Hint: In Fig. AIII.10, choose D such that the angle betweer.OB and OD is 13 and C such that the angle between OC and OA is -j9. Therproceed as in the derivation of formula (AIII.8). Note that the angle at 0 in botl:triangles p(OCB) and L(OAD) is now a + P.

AIII.13. Use the result of Problem AIII.12 to show that

cos (a - 13) = cos a cos j9 + sin a sin 13

(Observe that the cosine is even and the sine is odd.)AIII.14. Use the results of Problems AIII.12 and AIII.13 to show that

sin (a + 5) = sin «cos 13 f cos a sin 13.

(Observe that sin (a + 13) = cos (90 - a - 13) = cos [(90 - a) - 13].)

AIII.15. Find sin Zn and discuss the consequences as n -* co. (Note thasin 0 = 0.)

A11I.16. Show that1 - cos 2a sin 2a.tang = _

sin 2a 1 + cos 2a

AIII.17. Find sin 75°. (Observe that 75 = 45 + 30.)AIII.18. Find sin 15°.AIII.19. Find cos 15° from the formula in Problem AIII.13. (Observe that

15 = 45 - 30.)

5. Graphs of the Circular Functions*

We will now endeavor to represent the function

y = sin x

* This section is to be read after the reader has studied Chapter II, Section 7 onRadian Measure.

Page 326: ɷshawns sagan integral and differential calculus

314

Fig. AIIL11

APPENDIX

by a graph, where we measure the argument x in radians (see Chapter II,Section 7). We consider a right coordinate system and choose scales onthe x-axis and y-axis. (The units do not necessarily have to be the same. Itis most practical to mark the points 1 and -1 on the y-axis and pointscorresponding to multiples and fractions of it on the x-axis.) Then we drawa circle of radius 1 y-unit with its center at the origin (see Fig. A111. 11).According to (AIII.1) (Section 1), the graph of the sine function must liewithin the strip of the plane determined by the two broken lines y = I andy = -1. We note that we can construct angles of it/3 radians and anglesof 7T/2 radians. Consequently, we can construct angles which are eithermultiples of 7r/3 and ?r/2 or the 2nth part of angles of ¶r/3 and 7r/2 radians.Of course, we can also construct line segments of lengths that correspondto these arcs in terms of the given segment r units long on the x-axis.

For x = 2 + 2k7r the sine function has the value 1 and for x = 2 +

2kir the value -1. For x = 0 + 2kir and x = 7r + 2kir, i.e., for all mul-tiples of ir, the sine function will have the value 0 and since it also experi-ences a sign change at these points, it will cross the x-axis. The values ofthe sine function corresponding to angles which can be constructed, e.g.,nrr n7r 'r and

7rfor all integers n, can be found by a construc-3' 2,3.2-' 2.2n

tion process which is indicated in Fig. AIII.1 1 by a broken line. After asufficient number of points are constructed this way, we can see intuitivelythat the sine function has to have a graph as indicated in Fig. AIII.12.

So far we have explained the name "cosine" in terms of "sine" and thename "tangent." This leaves an explanation of the name "sine" un-accounted for. In a Latin dictionary we can find under the entry "sinus"

Page 327: ɷshawns sagan integral and differential calculus

AIII. CIRCULAR FUNCTIONS 315

Fig. AIII.12

the following translations: curvature, bent, roundness, arc, bolster, bay, andbosom. It is left to the reader to pick the most appropriate description fromthis list for the curve in Fig. AIII.12.

The graph of y = cos x can easily be obtained from the graph of y =sin x as follows : The broken line in Fig. AIII.13 represents the graph of y =sin (-x) = -sin x which is obtained from the graph of y = sin x by reflec-tion in the x-axis.

We wish to obtain

cos x sin (2 - x) =sin x + 21 = sin [_(x - 72)] .

If we translate the graph of y = sin (-x) through it/2 units to the right,

i.e., replace the argument -x by - (x - 2), then we obtain exactly what

we want, as indicated by a solid line in Fig. AIII.13.Before we conclude this section, let us draw a picture of the tangent

function. In view of the definition of the tangent as the ratio of sine tocosine, we see that it will vanish whenever the sine (numerator) vanishesand it will increase beyond bound whenever the cosine (denominator)approaches zero. The easiest way to avail ourselves of a graph of the tangentfunction is to draw the graph of y = sin x (in broken line) and the graph

y = COS x

xK . a 3ir 9,r\ 57r%2 2 zsinx

Fig. AIII.13

Page 328: ɷshawns sagan integral and differential calculus

316 APPENDIX

Fig. AIII.14

of y = cos x (in dotted line) in the same coordinate system, and then"divide" the broken line by the dotted line. This will yield a graph con-sisting of infinitely many branches, as indicated by a solid line in Fig.AIII.14, the graph of y = tan x.

Problems AIII.20-AIII.24

AIII.20. Sketch the following functions.

(a) y = sin (x + 2) (b) y = cos (- - x)

(c) y = tan (x21

(d) y = sin (a61

AIII.21. Sketch the following functions. \

(a) y = sin 2x (b) y = cos.4x(c) y = sin Trx (d) y = tan zx

AIII.22. What is the period in terms of radian measure of the followingfunctions ?

(a) y = sin x (b) y = sin 2x(c) y = cos 2x (d) y = sin nx, n integer(e) y = tan ,rx (f) y = cos 2-nx

AIII.23. Graph the function

y = cos 2x + 3 sin 2xAIII.24. Graph the function

y = sin x + sin (xx - I + sin (x - 43 1

Page 329: ɷshawns sagan integral and differential calculus

Alll. CIRCULAR FUNCTIONS

Supplementary Problems AIII

1.1. Simplify

sin (a + 1080), sin (360 - (x), sin (372), sin (2162).

1.2. For what values of a is

sin l- + kal = 1

a true equation? (k = f 1 , ±2, ±3, ) /1.3. Same as in 1.2 for

2.1. Evaluate

(a) sin 30 sin 60sin z 45

(b) sin 120 + sin 270

sin 135

sin (ir + ka) = 0.

317

(c) sin 210(sin 45 + sin 405)2.2. A right triangle has a hypotenuse of length 6 and an angle of 30°. Find

the remaining sides.3.1. Show that the following relations are true for all values of a:

a(a) 1 - cos a= tan a cos a

sin a

(b) sin a cos a = tan all - sin2 a)

(c) tang a + 1 =COS2

3.2. Simplify

tan (a + 2160), cos (720 - 2a), cos (a - 90), tan (cc

3.3. The cotangent is defined as follows

cot a =

Show that tan a cot a = 1.

COS a

sin a

3.4. Show that cot2 a =1 - sin2 a

sine a

4.1. Show that the following relations are identities:

(a) cost a =1

2

(stntan 2

2aa + 1

- 90)

(b) sin 2a cos a = 2 s:n all - sin 2 a)(c) tan all + cos 2a) = 1 - cost 2a

Page 330: ɷshawns sagan integral and differential calculus

318 APPENDIX

4.2. Simplify

cos 30 cos 45 + sin 30 sin 45(a)

'V1 + cos 30

(b) sin a cos 60 - cos a sin 60 + sin (a + 300)sin (a - 60)

(Hint: See Problems A111.13 and A111.14.)4.3. Show that

tan (a + ) = tan « + tan1 -tanatant1'

5.1. Sketch the function pt = sin irx + cos ;.

5.2. Sketch the function ?i = Itanx

5.3. Sketch the function J = Icos x

5.4. Sketch the function y = Isin x

Page 331: ɷshawns sagan integral and differential calculus

ANSWERS TO EVEN-NUMBERED PROBLEMS

CHAPTER I1.2. Similar to the problem illustrated in Fig. 1.3.1.6. b=0 1.8. a=b1.10. A curve similar to the one in Fig., 1.18 which intersects the x-axis at xand x = - I and the y-axis at y = -1.

1 1 I t 11.16. 2't+l't2+1't+l+1

h 11.18. Vx+h, - _

h(Vx+h+Vx)Vx+h+V-x

1.24. For any two points on a horizontal line the y-coordinates are the same.1.26. y=inx-in+4 1.28. y=2x+h,barbitrary1.30. p ax 19

C1.32. Refer to Fig. Al : AD = nil,

AC1

mn

in_; now,CD AC ACt)ll_ =-=AC BC CB

Y

S

Fig. Al319

Page 332: ɷshawns sagan integral and differential calculus

320 ANSWERS

1.34. The lines coincide.1.36. 2x +y=9, llx + 5y = 51, 7x + 3y=311.40. x = pounds of silver 76 % pure and y = pounds of silver 82 % pure.

(a) x =3 , y =404

(b) impossible (c) x = 0, y = 20

I.42. 3, -8,0,0,0 I.46. 32x-3y=111.48. 21/5, 2, 85, V37 1.50. (x - 1)2 + (y - 1)2 = .11.52. y = x2 1.54. x2 + y2 - 2xy + 20x + 20y = 140

1.56.1.58.

V229That part of a circle which lies above the horizontal line through (p, q); that

part which lies below this line; the entire circle1.60.1.62.1.64.1.66.

1.68.

2 =x+3,y=y+13y'=4y,x=x,g =4x2+8x-8

x2-x-=02

(x-4)24,(x+2) -84,16 x +8)-+64J'3L\x6/

+.131

611±V5 3±V5

1.70. X = , y = . Two distinct intersection points

1.72.

1.74.1.76.

2 2

l2-9V10(1 +31010,2+9100)' 1 -3V10 ,

10

If b is greater than 2 or less\\

than -2, for b = ±2, if b is between -2 and 2y=2x-1,y=-8x+15,y=10x+2,y=-9x+9

131.78. y=5x-

4I.80.a=-16 1.82.

(_43, - 41

1.84. The x-coordinate of this point is a solution of a linear equation which has onesolution only.1.86. x>2,x<-1 I.88.x<-1,-1<x<2,x>21.90. 0<x<1 1.92. 1 <x<31.96. Same as in Fig. 1.40 of the text continued symmetrically with respect to they-axis to the left.1.102. Graph resembles the teeth of a saw.

1.104. SE = 116

1.106. a<x<bwhere a<b<-2ora> -2andb<lorl <a<bI.108. f(1) = 2

CHAPTER II

11.2. (0.0254001)2, (1.093611)211.4. A[r(a, b)] = ab, A [r(ca, b)] = (ca)b = c(ab) = cA[r(a, b)]

11.6. f(x) = ax 11.8. 6 111.10.3V3

2

2x11.12. a 2 bh 11.14. f(x) =-

h

II.16. 36, 12 , 55, 30, 10, 2 11.18. 3

Page 333: ɷshawns sagan integral and differential calculus

TO EVEN-NUMBERED PROBLEMS

11.20. s1 2 1 4o=0>sz=-5,sz=7,sa= -3>s4=11,nolimit.

11.22.

321

71

.(ak+bk)=(ao+ho)+(a1+b,)+...+(a.+bn)=ao+a1+...

k=1n n

+an+bo+b1+ +bnak+ bkI k=1

11.24. 1 1 x 4 = 1 +x4 +xS+x12+...

11.26. 0, 0, does not exist, does not exist

11.28. 2 , -4, 4, does not exist

II.32. 0.70711, 0.19509, 0.09802 II.38. f(x) = 2\/7r'A1I.40. n = 90: Tr = 3.1428 zz = 180: 7r- 3.1428II.42. 0.8415, 0.9002, 0.9548, 0.9737, 0.998511.44. 1 11.46. 2

1 1 111.48. S4-3 = ,S8-$8=i6>S16-318=32II.50. S,=0.75951, S4=0.63451, S,=0.72537, 5,=0.66287, 9,,=0.70902,S16 = 0.67776II.52. 4S4 = 2.4957, 4S4 = 3.4957, 4.58 = 2.8033, 4S8 = 3.3033, 4516 = 2.9982,4.516 = 3.248611.54. Inspecting the geometric representation of the integrand will reveal that thearea above the x-axis is the same as the area below the x-axis.

/'b n n ('b11.58. J Cf(x) dx = Jim Cf Ox = C Jim I f Ax = C J f (x) dx

a n-co k=1 n-.co k=1 a11.60. By trapezoidal rule: 0.13, while S4 = 0.01611.62. $ =0.375,S4=0.343,58=0.3359,516=0.33311.66. 6 11.68. 4511.70. 1717, 501500, 45526, 338350 11.72. (m + 1)2

20irr3 17 4111.74.

311.78. 4,

3,

3, 1

11.80. 3 11.82. 16

11.84. The approximating parabola for a parabola is the parabola itself.11.86. 2.004

x3 3x2 2x3 3x2 x3 13x2 (x + 3)311.88. + , -x3 + 4x, + - x, + + 7x, -2x2 + 2x,

3 2 3 2 3 2 3

11.90. A'(4) = 4 11.92. G'(x) = -g(x)

CHAPTER III

111.2. 3x2 + 3xh + h2; 7, 4.75, 3.64, 3.31, 3.0301, 3.003001.h

sin -2

111.4. cos x+2 . h

2

Page 334: ɷshawns sagan integral and differential calculus

322 ANSWERS

III.6. Put F1(x) =f1(z) +f2(x) + f3(x) + f,(x) and F (x) = fs(x). The rest is

obvious.111.8. y=5x+4 III.10. (2, -4)III.12. Jim (5x4 + 10x3h + 10x2h2 + 5xh3 + h4) = 5x4

I11.14. y' = 1 - x2

III.18.de

= urn ((x + h)"-1 + (x + h)"-2x + ... + x1-1] = nx"-1

dx h-o

111.20.X X. 111.22. y=8x-5 III.24.

7,-1x2)2 + y22 3

111.26. (1,2),(-1, -2) III.28. x = ±1, ±2

111.30.dV'u_zu'dx Vu

111.32. (a) 2x cos (x2) (b) 3(x2 - 1) cos (I - 3x + x3)cos x

(c)2 sin x

(d) 2 sin x cos x.

x3 x4 x5 x3111.34.

3+2x2-x+C, 4 -x3+x2+x+C, 5 +

3+x+C,

111.36.

111.42.

7x5 3x4 2x3 12 3x2

6+ 4 3{ 6x2 3x i

x2 +C

x3 3x2 17y + +4x-

32

6

(a) a(V1 + x)2 + C

(c) 1 +x+C

781'51 +C

(b) (1 )

x3 (1 - x)151(d)

3 151+ C

sine x111.44. 2 + C

111.46. 1.005, 4.997, 3.0173, 6.9286111.48. In Problem 111.46 the curves are convex from above, and in Problem 111.47the curves are concave from above.111.50. 25.341 111.52. 1.0276,r 1II.54. x3 = 1.73205

111.56. x3 = 1.9129. By method of Section 5, 'Y7 = 1.9167111.58. (a) No extreme values. In (b) to (f) the extreme values are at x = 1, -1;1, 2; -2, 3; -3, -1, 1, 3; 0, 1, -2.111.60. Extreme value (maximum) is 1 which is assumed at x = 0.

111.62. / x+

x-x20. III.64.x=a,y=a

x2 +yl2 v(x - xn)2 + y22111.66. 2x + y = 1.6, x 0.4, y = 0.8 111.70. v(3) = 12111.72. v(0) = 4111.76. s = -1612+vot+So

111.74. s=6t2+3t+ 12,a = 12

111.78. Distance of neutral point from center of the earth = 54.14R, where R is theradius of the earth.I11.80. Impact velocity = 32V14 ft/sec 111.82. 266,700 ft/sec

Page 335: ɷshawns sagan integral and differential calculus

TO EVEN-NUMBERED PROBLEMS 323

111.84.x - 1 + 2x(x2 - a) x + 2x(x2 - ;)

1)2 + (x2 - a)2 x2 + (x2 - ¢)2111.88. 8b2 + 140a2 + 198 - 68b - 330a + 56ab

111.90. y=ax+3-4a 1II.92.x=0,x(0,0)=1III.94. f (xi, yi) = 0, f (x1, y1) = 0 111.96. Y = i o x + '-s111.98. Y = °o x + 2121.0111.102. Rate of increase in 1945 = 1 55 o millions per year; population in 1970 182millions.

IV.2. 720'2

IV.6. f(h) = 47rh, F(S) = S

IV.10.

IV.14.

IV.18.

IV.20.

V = 3v'52 - 167r2

d2:di = 10

IV.22. x=8,y=5

IV.26. 407r

Intersection point of the diagonalsb a-,y=3

2IV.24. x= 0, g = -

IV.28.

IV.32.

IV.36.

CHAPTER IV

hnr2 27rIV..2

sinn

IV.8. 7r, 4 < V < 99 2

IV.12. r = f9V 2, h = 2Y9

IV.16. (b)443

43

3

APPENDIX Al

AI.2. From ax + b = 0 follows x = - b which is rational since a and b are assumedto be rational. aAI.. If x is rational, so is -x; and if x is irrational, so is -x.AI.6. 3(x + 1)(x - 2)(x - 3)

AIII.2. (a)

APPENDIX AIII

-sin (270 - a) _ -sin (2 360 - a) _ -sin (-a) = sin a(b)(c)

-sin (a + 1080) _ -sin (3 - 360 + a) _ -sin a = sin (-a)sin (a + 360) = sin a = -sin (-a) _ -sin (360 - a)

AIII.8. tans>Ofor0<a<90,180<a<270tans<Ofor90<a<180,-90<a<0

sin (a + 180) =-sin a

AIII.10. tan (a + + - 180) =cos (a + 180) -cos a = tan a.

tan (a + 180.3) = tan (a + 360 + 180) = tan (cc + 360)

= sin (a + 360) _ sin a _cos (a + 360) cos a

-tan cc, etc.

n(n+1)(2n+1)1 87ab2IV.30. 27r lim [1 +

6n3 33 n-.oo327r

7r4

87x2 IV.34. 167r2 + -47ra 2 b 3

4IV.38.

r

Page 336: ɷshawns sagan integral and differential calculus

324 ANSWERS

For k = 2n (even),

tan (a + 180 2n) = tan (cc + 360n) =sin (a + 360n) = sin a

= tan acos(a + 360n) cos a

AIII.12. As in the text, we have BC2 = AD2. Hence,

(cos a - cos R)2 + (sin a + sin #)2 = [cos (a + (3) - 112 + sin2 (a + fl),2 - 2cos ados f3 + 2sincc sin 9 = 2 - 2cos(acos (a + ft) = cos a cos fi - sin a sin #

AIII.14. sin (a + ) = cos (90 - a - fl) = cos (90 - a) cos + sin (90 - a) sin= sin a cos # + cos cc sin

Proceed similarly for sin (a - (3).

AIII.16. tan a =sin a _ 1 - cos 2% _ 1 - cost 2a _ sin 2 2a

cos a 1 + cos 2. n/(1 + cos 20t)2 J(1 + cos 20C)2sin 2a

or

1 + cos 2a

1 - cos 2a _ (1 - cos 2a)2 _ (1 - cos 2a)2 1 - cos 2aV 1 + cos 2a ,J 1 - cost 2a sin2 2% sin 2a

2AIII.18. sin 15° = sin (45 - 30) = sin 45 cos 30 - cos 45 sin 30 = 4 (V3 - 1)

AIII.22. 27r, -rr, 47r,2-7r

, 1, 1n

Page 337: ɷshawns sagan integral and differential calculus

INDEX

The letter A preceding a page number refers to the Appendix.Names of scientists are given in italics.

Abscissa, 9Absolute value, 53

symbol, 53Acceleration, 208, 261Adjacent side, A 303Algebraic equation, A 292Altitude of triangle, 79Antiderivative, 181Antidifferentiation, 180Antidifferentiation formulas, 182 f.Archimedean scale, 271 f.Archimedes of Syracuse, 97, 260, 261,

271Area measure, 73 f.

unit, 75, 76Area, of a circle, 97 f.

under a curve, 110 f.under a parabolic arc, 137 f., 140of rectangles, 73 f., 130of regular polygons, 99 f.of a trapezoid, 126of a triangle 78 f., 131 f.

Atmospheric pressure, 1 f.

Bernoulli, John, 228Boiling point of water, 1 f.Bolzano, 102Brachistochrone, 232Bracket symbol, 55

Cavendish, 218Centroid, 264

of a linear mass distribution, 264of a planar region, 267, 268of regions and solids, 265 f.of a right circular cylinder, 269 f.of a semicircle, 285of a triangle, 268, 269

Chain rule, 176 f.

Chord, 161Circle, area of, 97 f.

circumference of, 104 f.equation of, 33

Circular functions, A 299 f.graphs of, A 313 f.table of values of, A 305, A 308

Circumference of a circle, 104 f.Clockwise direction, A 300Coefficients, A 291Complementary angle, A 306Complementary sine, A 306Completing the square, 41 f.Complex numbers, 41

sum and difference of, 45Cone, right circular, 258

lateral surface of, 260volume of, 258 f.

Conical frustum, 258Conjugate complex numbers, 43Constants, 1Continuity, 59

uniform, 60 f.definition of, 65

Coordinate axes, 8 f.Coordinates of a point, 8Coordinate system, 7 f.Cosine, A 306

graph of, A 315Cotangent, A 317Counterclockwise, 107, A 299Cycloid, 227 f.Cylinder, right circular, 255 f.

lateral surface of, 257volume of, 257

Definite integral, 115, 118 f.existence of, 116 f.

Degenerate region, 74325

Page 338: ɷshawns sagan integral and differential calculus

326

Degree of a polynomial, A 291Dependent variable, 2 f.Determinants, second-order, 31

third-order, 70Derivative, 151, 161 f.; see also Differ-

entiationdefinition of, 162, 163notations for, 165higher, 175of a sum, 169 f., 171

Difference quotient, 161geometric interpretation of, 161

Differentiable functions, 167Differential, 187 f.Differential equation, 232Differential quotient, 164, 232Differentiation, 151 f., 172 f.

of cosine, 248of fractional powers of x, 248of function of a function, 176 f.of powers of x, 173of simple functions, 172 f.of sine, 176

Differentiation process, 164Differentiation rules, 167 f.Discontinuity, 59

point of, 59Distance between points, 31 f.Distance formula, 32Double-angle formula, cosine, A 312

sine, A 313

Earth, mass density of, 218mass of, 218mean radius of, 218

Einstein, Albert, 216, 219Elevation, 2 f.Ellipsoid, volume of, 281Equidistant points, 5Equilateral triangle, A 303Even functions, A 292, A 306Existence of definite integral, 116, 117Extreme values, relative, 197

Fermat, Pierre, 222Fermat's principle, 222 f.Finite sums, 135 f.Focus of parabolic mirror, 226, 227Force, 215, 261Freely falling bodies, 213 f.

INDEX

Frustum, conical, 258Fulcrum, 260Function, 2 f., 13 f.Function of a function, 177

Galilei Galileo, 214Galilei's law, 214, 216 f.Gauss, Carl Friedrich, 135Geometric series, 87, 89

sum of, 88, 89Graphs, 4 f.Gravitational constant, 216, 218Gravitational layers, 229 f.Gravitational radius of sun, 219Gravity, 215

Half-angle formula, cosine, A 3 10sine, A 310

Harmonic motion, 250Height of triangle, 79Higher derivatives, 175Homogeneous, 222Huygens, Ch., 233Hyperboloid, volume of, 281

Identity relations for circular functions,A 309 f.

Imaginary numbers, 41Imaginary unit, 40Impact velocity, 220Indefinite integral, 181 f.Independent variable,/2 f.Index of refraction, 225Infinite series, 85, 89Infinity symbol, 54Initial distance, 209Initial velocity, 209Integral, definite, 115, 118 f.

with variable limit, 147 f., 180 f.Integral, indefinite, 181 f.Integral calculus, 97Integrand, 119Integration by antidifferentiation, 180 f.Integration constant, 181Integration formulas, 182 f.Integration limits, 119Integration of powers of x, 183Integration variable, 119

change of, 148Intersection of two lines, 27 f.

Page 339: ɷshawns sagan integral and differential calculus

INDEX

Intersection of parabola and line, 38,43, 44

Interval notation, 51 f.Intervals, 51 f.

closed, 51graphical representation, 52open, 51semiclosed, 51semiopen, 51

Inverse operation, 151Inverted chain rule, 185 f.Irrational numbers, 5Isotropic, 222Iteration, 194

Kepler, Johannes, 215, 287

Law of motion, 204Law of the lever, 260 f.Least square principle, 241Least squares, method of, 233 f., 241 f.Length of regular polygon, 105 f.Lever, 260

law of the, 260 f.Lever arm, 261Leverrier, 219Limit, 86

of a difference, 92of a function, 95, 96of a product, 92of a quotient, 93of a quotient of polynomials, 94of a sum, 91

Limit of a sum process, 130 f.Limits, manipulation with, 90 f.Line, straight, 14 f., 19 f.Linear factors, A 293Line equation, 21 f.

general, 23 f.Line of numbers, 4Lines, intersection of two, 27 f.

perpendicular, 23Locus, 33Lower sum, 118

Manipulations with limits, 90 f.Mass, 215 f., 261Mass center, 264

of linear mass distribution, 264of rectangle, 265

327

Mass center, of regions and solids, 265 f.of right circular cylinder, 269 f.of semicircle, 284, 285of triangle, 268, 269

Mass density, 265of earth, 218

Mass distribution, linear, 263Mass of earth, 218Maxima, 195 f.Mean radius of earth, 218Mercury (planet), 219Method of least squares, 233 f., 241 f.Minima, 195 f.Minimum of functions of two variables,

238 f.Moment, 262Monotonically increasing functions, 129Moon, distance between earth and, 221

mass of, 221, 251radius of, 251

Motion, 204 f.Multiplicative constant, 168

Natural logarithm, 184Negative direction, A 300Nested square roots, A 311, A 312Neutral point between earth and moon,

221Newton, Sir Isaac, 191, 215Newton's gravitational law, 216 f.Newton's method, 191 f.Numbers, graphical representation of,

4 f.irrational, 5line of, 4ordered pairs of, 7rational, 5

Odd functions, A 293, A 301One-to-one correspondence, 6Open square box, 201Opposite side, A 303Ordered pairs of numbers, 7Ordinary continuity, 65Ordinate, 9

Parabola, quadratic, 34stretched, 37

Parabolic approximation, 142 f.Parabolic mirror, 226

Page 340: ɷshawns sagan integral and differential calculus

328

Paraboloid, volume of, 281Parallelepiped, right, 253

volume of, 253Parallel lines, 27, 28Partial sums, 85Period, 57

of cosine function, A 308of sine function, A 301of tangent function, A 308

Periodic function, 57, 58Perpendicular lines, 23,r, approximations to, 103, 129, 146Polygon, 79

area of, 79 f., 99 f.length of, 105 f.

Point-point formula, 21Point-slope formula, 22Polynomials, A 291 f.Positive direction, A 299Pressure, i f.Prism, 254

triangular, 255volume of 253 f.

Quadratic equation, 39 f.Quadratic formula, 43Quadratic parabola, 34, 35 f.

Radian measure, 107 f.conversion into degree measure, 107

Rate of change, 150 f.Rational numbers, 5Reflection, 251Refraction, 223

index of, 225bromine, 227ethyl ether, 227glycerin, 227

Relative extreme values, 197Relative maximum value, 196Relative minimum value, 196Removable discontinuity, 66Right triangle, 78

area of, 78Root of an equation, A 291

Secant, 161Sequence of partial sums, 86, 89Sequences, 80 f.Series, 80 f.Slope, 20 f.

INDEX

Slope, of tangent line, 162, 163to parabola, 49

Simpson's rule, 142 f., 145, 286 f.Sine function, A 299 f.

graph of, A 315table of values of, A 305values of, A 301 f.

Snell's law, 222 f., 230Square roots, approximate evaluations

of, 188, 189, 192, 193Step function, 56

graph of, 57Straight line, 141., 19 f.

equation of, 20, 21, 25Substitution method, 185 f.Summation limits, 83Summation subscript, 83

Tables, 1 f.Tangent function, A 306

graph of, A 316period of, A 308

Tangent line, 162slope of, 162

Tangent to a parabola, 45 f.Tautochrone, 233Torus, volume of, 281 f.Total additivity of area measure, 82Transformation of coordinates, 35 f.Trapezoid, area of, 126Trapezoidal approximations, 124 f.Trapezoidal rule, 124 f., 128Trigonometric functions, A 307

Under sum, 118, 131Uniform continuity, 60 f.Uniformly continuous function, A 294 f.Unit distance, 4

Variables, i f.dependent, 2 f.independent, 2 f.

Velocity, 204 f., 208of light, 222

in air, 225in water, 226

of velocity, 208Vertical line, 24, 26Vertical motion, 220 f.Volume, of a cone, 258 f., 271

Page 341: ɷshawns sagan integral and differential calculus

INDEX

Volume, of a cylinder, 255 f., 271 x-axis, 8of an ellipsoid, 281 positive, 9of a hyperboloid of one sheet, 281 x-coordinate, 9of a paraboloid, 280, 281of a prism, 253 f. y-axis, 8of solids of revolution, 278 f., 281 f. positive, 9of a sphere, 275, 277 f. y-coordinate, 9of a torus, 281 f. y-intercept, 20

329

Wine barrels, volume of, 286, 287 Zeros of polynomials, A 292


Recommended