[Math] Why is the change-to-polar-coordinate method valid in computing limits

analysiscontinuitypolar coordinatesreal-analysis

For example, when we compute $\displaystyle\lim_{(x,y)\to(0,0)}\frac{x^2y}{x^2+y^2}$, we can use the change to polar coordinates method below:

Let $\begin{cases}x=r\cos\theta\\y=r\sin\theta \end{cases}$, then
$\displaystyle\lim_{(x,y)\to(0,0)}\frac{x^2y}{x^2+y^2}=\lim_{r\to0}\frac{r^3\cos^3\theta\sin\theta}{r^2}(=\lim_{r\to0}r\cos^2\theta\sin\theta=0)$.

However, why is this method valid? What is the actual reason behind this substitution? Is this related to the limit law of composition functions of continuous function? But what are those two functions that applying behind here?

I think it makes use of this change-of-coordinate continuous and one-to-one function $\Psi$:

$$\begin{alignat*}{3}\Psi:\ &\mathbb{R}^+\times[0,2\pi)&&\to\mathbb{R}^2\\
&(r,\theta)&&\mapsto(r\cos\theta,r\sin\theta)\end{alignat*}$$

But I can't figure out how to "chain" this function with others and get the eager result.

Best Answer

Note: this answer stems from the (now removed) comments left under my answer to your other question about polar coordinates. As requested, I have edited to add the proofs.

Before proceeding, let us define (possibly non-standard notation) $$\lim_{(r,\phi)\to\{0\}\times\mathbb R}g(r,\phi)=L\tag{$\diamond$}$$ to mean that for each $\epsilon>0$, there is an open neighborhood $U$ of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$ such that $(r,\phi)\in U$ and $r\neq0$ implies $|g(r,\phi)-L|<\epsilon.$

In the case $g(r,\phi)=f(r\cos\phi,r\sin\phi)$, which is $2\pi$-periodic in the second argument, this can be simplified, by requiring instead that for each $\epsilon>0$, there is a $\delta>0$ such that $0<r<\delta$ implies $|g(r,\phi)-L|<\epsilon$. (Intuitively, in this case we do not have to worry about $\delta$ getting smaller and smaller as $\phi\to\pm\infty$.) This can be done e.g. by using the tube lemma from elementary topology.

We may now state:

Proposition. The following are equivalent for a function $f:\mathbb R^2\setminus\{(0,0)\}\to\mathbb R$:

  1. We have $$\lim_{(x,y)\to(0,0)}f(x,y)=L.$$
  2. The function $\tilde f:\mathbb R^2\to\mathbb R$, defined by $$\tilde f(x,y)=\begin{cases}f(x,y);&(x,y)\neq(0,0),\\L;&(x,y)=(0,0),\end{cases}$$ is continuous at $(0,0)$ and $f(0,0)=L$.
  3. For all $\phi_0\in\mathbb R$ we have $$\lim_{(r,\phi)\to(0,\phi_0)}f(r\cos\phi,r\sin\phi)=L.\tag{$\star$}$$
  4. We have $$\lim_{(r,\phi)\to\{0\}\times\mathbb R}f(r\cos\phi,r\sin\phi)=L.$$
  5. The function $g:[0,\infty)\times\mathbb R\to\mathbb R$, defined by $$g(r,\phi)=\begin{cases}f(r\cos\phi,r\sin\phi);&r>0,\\L;&r=0,\end{cases}$$ is continuous at $(0,\phi_0)$ for each $\phi_0\in\mathbb R$.

Proof. The equivalence of 1 and 2 is standard and follows directly from the definitions.

To see that 2 implies 5, let $p:[0,\infty)\times\mathbb R\to\mathbb R^2$ be defined by $p(r,\phi)=(r\cos\phi,r\sin\phi).$ This is obviously continuous, so if $\tilde f$ is continous, $g=\tilde f\circ p$ must be continuous as well.

Next, we show that 5 implies 3. To see this, let $\epsilon>0$ and $\phi_0\in\mathbb R$. By $5$, there is a $\delta>0$ such that $\|(r,\phi)-(0,\phi_0)\|<\delta$ implies $|g(r,\phi)-g(0,\phi_0)|<\epsilon$. In particular, if $r>0$, this means that $\|(r,\phi)-(0,\phi_0)\|<\delta$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$, which is precisely 3.

Next, we show that 3 implies 4. Let $\epsilon>0$. For each $\phi_0\in\mathbb R$ we have a $\delta(\phi_0)>0$ such that $\|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$. Let $$U(\phi_0)=\{(r,\phi)\in[0,\infty)\times\mathbb R\mid \|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)\}.$$ Then $U=\bigcup_{\phi_0\in\mathbb R}U(\phi_0)$ is an open neighborhood of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$ and for each $r\neq 0$ the condition $(r,\phi)\in U$ implies $\|f(r\cos\phi,r\sin\phi)- L\|<\epsilon$, because every such $(r,\phi)$ is an element of some $U(\phi_0)$ and therefore satisfies $\|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)$.

Finally, we show that 4 implies 1, thus concluding the circle of implications and establishing the desired equivalence. To do this, let $\epsilon>0$. By 4 and the fact that $f(r\cos\phi,r\sin\phi)$ is a $2\pi$-periodic function of $\phi$, there is a $\delta>0$ such that $0<r<\delta$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$. Therefore, if $0<\|(x,y)\|<\delta$, we have $|f(x,y)-L|<\epsilon$, because $(x,y)=(r\cos\phi,r\sin\phi)$ for some $r\in(0,\delta)$ and $\phi\in\mathbb R$. This concludes the proof. $\qquad\square$

Remark. Note that the limit $(\star)$ is the limit of a two-variable function. Requiring that we have the single variable limit $$\lim_{r\to0}f(r\cos\phi_0,r\sin\phi_0)=L\tag{$\ast$}$$ for each $\phi_0\in\mathbb R$ is not (!) equivalent. In fact, this latter condition is not sufficient for the limit to exist as shown by the standard counterexample $f(x,y)=\frac{x^2y}{x^4+y^2}.$) The point is that the limit $(\star)$ takes into account points $(r,\phi)$ with different values of $\phi$ whereas in $(\ast)$ the value $\phi=\phi_0$ is fixed.

Added: here are the details regarding the part with the tube lemma, but I will do it without actually using the tube lemma, which would allow us to skip a few steps. (In fact, what follows basically includes a proof of the tube lemma in this special case.)

The definition of $(\diamond)$ mentions a neighborhood $U$ of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$. We wish to show that, if $g(r,\phi)$ is $2\pi$-periodic in $\phi$, we can always take a neighborhood of the form $V=[0,\delta)\times\mathbb R$ for some $\delta>0$, instead of this possibly more complicated neighborhood $U$.

Assume we have a neighborhood $U$ of $\{0\}\times\mathbb R$, as in the definition of $(\diamond)$, not necessarily of the form $[0,\delta)\times\mathbb R$. Using $U$, we are going to construct a neighborhood $V=[0,\delta)\times\mathbb R$ satisfying the same properties. Note that since $U$ is a neighborhood of every point $(0,\phi_0)\in\{0\}\times[0,2\pi]$, it contains a ball $$B((0,\phi_0),\eta(\phi_0))=\{(r,\phi)\in[0,\infty)\times\mathbb R\mid \|(r,\phi)-(0,\phi_0)\|<\eta(\phi_0)\}$$ of radius $\eta(\phi_0)>0$ around every such point. This ball contains a set of the form $$V(\phi_0)=[0,\delta(\phi_0))\times(\phi_0-\delta(\phi_0),\phi_0+\delta(\phi_0))$$ which still contains $(0,\phi_0)$. The sets $V(\phi_0)$ form a cover of the interval $\{0\}\times[0,2\pi]$, which is compact, so there is a finite subcover $V(\phi_1),V(\phi_2),\ldots, V(\phi_n)$, where $\phi_1,\phi_2,\ldots,\phi_n\in[0,2\pi]$. Now take $\delta=\min\{\delta(\phi_1),\delta(\phi_2),\ldots,\delta(\phi_n)\}$. Observe that $$[0,\delta)\times[0,2\pi]\subseteq V(\phi_1)\cup V(\phi_2)\cup\ldots\cup V(\phi_n)\subseteq U,$$ so $(r,\phi)\in[0,\delta)\times [0,2\pi]$ implies $|g(r,\phi)-L|<\epsilon$. But since $g$ is $2\pi$-periodic in $\phi$, this means that $(r,\phi)\in[0,\delta)\times\mathbb R$ also implies $|g(r,\phi)-L|<\epsilon$. Therefore $V=[0,\delta)\times\mathbb R$ has the desired properties.