7.1 The Derivative

The derivative is an extremely important concept in applied mathematics, as it allows us to model real life situations with mathematical functions and their rates of change. For this reason, the derivative is the most interesting part of analysis for applied mathematicians and physicists. The purpose of this chapter is to establish what a derivative is, what its properties are, and what we can do with derivatives.

7.1.1 Differentiability

You probably have an intuition that the derivative of a function is the gradient of the tangent line at every point along the function’s domain, and that in order for a function to have a derivative defined everywhere it must have no sharp corners or cusps. This is the basic idea, but of course in analysis we must be extra careful with our definitions. How do we know if we can even define a tangent line? To make a definition of the derivative that we can work with we will use equipment that we have developed in the last chapter.

Definition 7.1.

Let I be an open interval and let f:I be a function. Let cI. Then f is differentiable at c if

limxcf(x)f(c)xcexists. (7.1)

If this limit exists, then we denote it f(c), the derivative of f at c. We say f is differentiable if f is differentiable at all cI. Then f:I is also a function, called the derivative of f (also denoted dfdx).

Example 7.1.

Let f: be given by f(x)=x2. Claim: f is differentiable at x=3 with f(3)=6.
Proof: Let c=3,x3. Then

f(x)f(3)x3=x232x3=(x+3)(x3)x3=x+3. (7.2)

Hence,

f(3)=limx3f(x)f(3)x3=limx3x+3=6. (7.3)

So f is differentiable at x=3 with derivative 6. In fact, if cx,

limxcf(x)f(c)xc=limxcx+c=2c. (7.4)

So f is differentiable everywhere with derivative f: given by f(x)=2x.

Example 7.2.

Let f:(0,) be given by f(x)=x. Claim: f is differentiable.
Proof: Let cx, then

f(x)f(c)xc=xcxc(x+cx+c)=xc(xc)(x+c)=1x+c. (7.5)

Hence f:(0,) is given by

f(c)=limxcf(x)f(c)xc=limxc1x+c=12c. (7.6)
Example 7.3.

Let f: be given by f(x)=xn for some n. Claim: f is differentiable with derivative f: given by f(x)=nxn1.
Proof: Let cx, then

f(x)f(c)xc=xncnxc=xcxc(xn1+cxn2+c2xn3++cn2x+cn1). (7.7)

Hence f(c) is given by

f(c)=limxcf(x)f(c)xc=limxc(xn1+cxn2+c2xn3++cn2x+cn1)=ncn1. (7.8)

7.1.2 Comparing Differentiability with Continuity

Now that we have defined strictly what a derivative is, we will explore some properties of them. Firstly, we will note that our notion of differentiability is stronger than the definition of continuity.

Theorem 7.1

Let I be an open interval, let f:I be a function and let cI. Then if f is differentiable at c, then f is continuous at c.

Proof.

Since f is differentiable, f(c)=limxcf(x)f(c)xc. Hence

limxc(f(x)f(c)) =limxcf(x)f(c)xc(xc) (7.9)
=(limxcf(x)f(c)xc)(limxc(xc)) (By theorem LABEL:)
=f(c)0=0. (7.10)

Therefore,

limxcf(x)=f(c). (7.11)

So by theorem LABEL:, f is continuous. ∎

Note that the converse of this theorem is not true. Continuity does not imply differentiability (although ).

Example 7.4.

Let f:(1,1) be given by f(x)=|x|. Claims:

  1. (i)

    f is continuous.

  2. (ii)

    f is not differentiable at x=0.

Proof:

  1. (i)

    This is true by theorem 6.2.

  2. (ii)

    Consider x0, then

    f(x)f(0)x0=|x|x={1x>01x<0. (7.12)

    Thus f(0) is not defined since the limit does not exist at x=0, so f is not differentiable at x=0.

7.1.3 Operations on Differentiable Functions

Now we will prove some properties of derivatives under arithmetic operations similarly to how we have done with every concept we have covered so far. These will be familiar are they are the same rules for derivatives that you will have learned in school.

Theorem 7.2

Let I, let f,g:I be two functions, differentiable at cI.

  1. (i)

    (f+g):I is differentiable at c and (f+g)(c)=f(c)+g(c).

  2. (ii)

    Given λ, (λf):I is differentiable at c and (λf)(c)=λf(c).

  3. (iii)

    (fg):I is differentiable at c and (fg)(c)=f(c)g(c)+f(c)g(c).

  4. (iv)

    If g(c)0, then (1g):I is differentiable at c and (1g)(c)=g(c)g(c)2.

Proof.

In all cases below, let xI with xc.

  1. (i)
    (f+g)(c) =limxc(f+g)(x)(f+g)(c)xc (7.13)
    =limxcf(x)+g(x)f(c)g(c)xc (7.14)
    =limxcf(x)f(c)xc+limxcg(x)g(c)xc (By theorem LABEL:)
    =f(c)+g(c). (7.15)
  2. (ii)
    (λf)(c)=limxc(λf)(x)(λf)(c)xc=λlimxcf(x)f(c)xc=λf(c). (7.16)
  3. (iii)

    Note that since f and g are differentiable at c, they are continuous at c by theorem 7.1. So limxcf(x)=f(c).

    (fg)(c) =limxc(fg)(x)(fg)(c)xc=limxcf(x)g(x)f(c)g(c)xc (7.17)
    =limxcf(x)g(x)+f(x)g(c)f(x)g(c)f(c)g(c)xc (7.18)
    =limxcf(x)g(x)g(c)xc+limxcg(c)f(x)f(c)xc (7.19)
    =f(c)g(c)+g(c)f(c). (7.20)
  4. (iv)

    Since g(c)0 and g is continuous at c, there exists δ>0 so that x(cδ,c+δ)g(x)0.

    (1g)(c)=limxc(1g)(x)(1g)(c)xc =limxc(1xc)(1g(x)1g(c)) (7.21)
    =limxc(1xc)(g(c)g(x)g(x)g(c)) (7.22)
    =limxc(1g(x)g(c))g(x)g(c)xc (7.23)
    =g(c)g(c)2. (7.24)

Part (iii) of this theorem is the well-known product rule for derivatives. We can prove the quotient rule as a corollary.

Corollary 7.2.1 (Quotient Rule)

If g(c)0, (fg):I is differentiable at c and (fg)(c)=g(c)f(c)g(c)f(c)g(c)2.

Proof.

Since fg=f1g,

(f1g)(c) =f(c)g(c)+f(c)(1g)(c) (By theorem 7.2 (iii))
=f(c)g(c)f(c)g(c)g(c)2 (By theorem 7.2 (iv))
=g(c)f(c)g(c)f(c)g(c)2. (7.25)

Other corollaries of this theorem are that all polynomials are differentiable, and all rational functions are differentiable except where the denominator is zero.

Theorem 7.3 (Chain Rule)

Let I,J be intervals, f:I a function differentiable at some cI with f(I)J, g:J a function differentiable at f(c)J. Then (gf):I is differentiable at c and (gf)(c)=g(f(c))f(c).