The confusion about the proof of if function is continuous on a closed interval $[a,b]$,then it is integrable

real-analysissolution-verificationuniform-continuity

In many proofs, it uses the fact that if a function is continuous on the interval $[a,b]$, then it is uniformly continuous on that interval

Thus, statement :$\forall \epsilon>0, \exists \delta>0, \forall x,x'\in [a,b];|x-x'|<\delta\implies |f(x)-f(x')|<\epsilon$

Then choose a partition $P$ such that for any partitioned interval $[t_{i-1},t_i],t_i-t_{i-1}<\delta$

Then for any $x,x'\in [t_{i-1},t_i]$,based on the definition of uniformly continuous, it will be $|f(x)-f(x')|<\epsilon \implies f(x')-\epsilon<f(x)<f(x')+\epsilon$

For any given $x'$,and based on the definition of sup and inf:
\begin{equation}
\ f(x')-\epsilon\leq\inf f(x)\leq f(x)\leq\sup f(x)\leq f(x')+\epsilon
\end{equation}

Thus $\sup f(x)-\inf f(x)<2\epsilon$ for any $x\in [t_{i-1},t_i]$ this means ($M_i$ is $\sup f(x)$ on any $[t_{i-1},t_i]$ and $m_i$ is $\inf f(x)$ on any $[t_{i-1},t_i]$)
\begin{equation}
U(f,P)-L(f,P)=\displaystyle \sum^n_{i=1}(M_i-m_i)(t_i-t_{i-1})<2\epsilon(b-a)
\end{equation}

Then does the above is enough to say that $U(f, P)-L(f, P)<\epsilon$? Since $2\epsilon(b-a)$ can also represent arbitrary positive numbers.


However, in some proofs, I saw that they first let $M_i-m_i<\epsilon_1$
Then after getting $\displaystyle \sum^n_{i=1}(M_i-m_i)(t_i-t_{i-1})<\epsilon_1(b-a)$. There will be another step which is letting $\epsilon_1(b-a)<\epsilon$ to get $\epsilon_1<…$ Is this a necessary step?

Best Answer

To directly answer your question of "is this step necessary?", the answer is it depends. It depends on whether or not you actually know how to prove that it is equivalent to the usual statement which only has $<\epsilon$ at the end. If you already know the equivalence, then of course you don't have to keep repeating it. If you don't know the equivalence, then of course, you have to go the extra step; after all, the reason why sometimes we might go the extra step of starting with $\frac{\epsilon}{2(b-a)}$ or starting with a different quantitity $\epsilon_1$ is just to make contact directly with the theorems proved/definitions involved.

Perhaps that last part may have been a little cryptic, so let me instead illustrate with an example. Suppose $f:I\to \Bbb{R}$ is a given function where $I$ is say an open interval containing the point $a$. Also, let $l\in\Bbb{R}$. Consider now the following statements:

  1. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|<\delta$ then $|f(x)-l|<\epsilon$.
  2. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|<\delta$ then $|f(x)-l|\leq \epsilon$.
  3. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|\leq \delta$ then $|f(x)-l|< \epsilon$.
  4. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|\leq\delta$ then $|f(x)-l|\leq \epsilon$.

In other words, I have just replaced some of the strict inequalities with weak inequalities. Now consider the next "type" of statement:

  1. there is a $k>0$ such that for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|<\delta$ then $|f(x)-l|< k\epsilon$.
  2. there is a $k>0$ such that for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|<\delta$ then $|f(x)-l|\leq k\epsilon$.
  3. there is a $k>0$ such that for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|\leq\delta$ then $|f(x)-l|< k\epsilon$.
  4. there is a $k>0$ such that for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|\leq\delta$ then $|f(x)-l|\leq k\epsilon$.

So, I have added the prefix "there exists $k>0$" and finished off with $k\epsilon$, again with the four variants of weak/strict inequalities. Finally, let me introduce the following type of statement. Suppose $\psi:(0,\infty)\to (0,\infty)$ is a function such that $\lim\limits_{\epsilon\to 0^+}\psi(\epsilon)=0$. Now consider the following statements:

  1. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|<\delta$ then $|f(x)-l|<\psi(\epsilon)$.
  2. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|<\delta$ then $|f(x)-l|\leq\psi(\epsilon)$.
  3. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|\leq\delta$ then $|f(x)-l|<\psi(\epsilon)$.
  4. for every $\epsilon>0$, there is a $\delta>0$ such that for all $x\in I$, if $0<|x-a|\leq\delta$ then $|f(x)-l|\leq \psi(\epsilon)$.

So, the twelve statements I made above, with various choices of weak/strict inequalities, and with various choices of the final quantity ($\epsilon$ vs $k\epsilon$ vs $\psi(\epsilon)$), it turns out they are all logically equivalent. To someone who has finished a first course in real analysis, these equivalences tend to become almost obvious (furthermore, here I only stated things for functions defined on subsets of $\Bbb{R}$ and with values in $\Bbb{R}$, but the same can be generalized to metric spaces and stuff). The key to understanding the equivalence is the role of the quantifiers "for every" and "there exist".

Point ($1$) is often taken as the definition of "$\lim\limits_{x\to a}f(x)=l$", so if you start with that as the definition, you have to prove the remaining statements as theorems. Often times textbooks avoid mentioning that these statements are all equivalent (some textbooks may leave some of these equivalences as exercises, such as Spivak). Also, since point (1) is the definition straight out of a textbook, often in an attempt to not to invoke results not already proved, they may resort to making their proofs slightly more opaque by bringing in $\frac{\epsilon}{2}$ or $\frac{\epsilon}{2k}$ or stuff like that; in a sense this method of brining in "auxillary epsilons" is pretty much a way of proving the equivalence of the above statements, without explicitly saying that's what they're doing.

For the case of the upper and lower sums, it's the same story. The statement

  • for every $\epsilon>0$ there is a partition $P$ of the interval $[a,b]$ such that $U(f,P)-L(f,p)<\epsilon$

can also be expressed equivalently in many ways. We can use either $<$ or $\leq$ at the end. We can add the prefix "there is a $k>0$" in the beginning and finish off with $<k\epsilon$; we can also suppose there is a function $\psi$ as described previously and end with $<\psi(\epsilon)$.

So once again, the question of "is this step necessary" depends entirely on what you have/have not already proved. From a strict logical perspective, yes if you start with one of the statements as a definition, you have to prove that the remaining eleven are equivalent. However, to many "experienced" people (where "experienced" means usually by the end of a 1-2 semester course in analysis) these equivalences are almost obvious, so going the extra step may be unnecessary.

In practice, we often end up proving something like statement (8) (where usually we have some explicit value for $k$ like $k=2$ or $k=10$ or $k=2\sup\limits_{x\in [a,b]}|f(x)|$) or (12) instead, because weak inequalities are more forgiving, and that's all we really need.