Is evaluation of formal power series compatible with composition over nonarchimedean complete fields

algebraic-number-theoryformal-power-seriesnonarchimedian-analysis

In algebraic number theory, one may want to consider a $p$-adic local field and consider the $p$-adic logarithm and $p$-adic exponential function on it. These form inverse homomorphism between a sufficiently higher unit group (multiplicative) and a sufficiently high power of the valuation rings' maximal ideal (additive). Usually, care is put into determining where precisely the power series defining the logarithm/exponential converge in order to determine an appropriate domain and codomain for these isomorphisms. However, the fact that $\exp$ and $\log$ are inverse at all is usually claimed to merely be a formal consequence of this identity holding for power series (e.g. in Neukirch's Algebraic Number Theory, Katok's p-adic Analysis Compared with Real and many others). If some further justification is made, it is a callback to the following lemma (e.g. in Gouvea's $p$-adic Numbers or Cassels' Local Fields):

Lemma: If $(b_{nm})_{n,m\in\mathbb{N}}$ is a double sequence in a nonarchimedean complete field $K$, such that $b_{nm}\rightarrow0$ as $\max(n,m)\rightarrow\infty$, then the double series $\sum_{n\ge0}\sum_{m\ge0}b_{nm}$ and $\sum_{m\ge0}\sum_{n\ge0}b_{nm}$ both exist in $K$ and are equal.

So, the principle underlying this should be answered by the

Question: If $K$ is a nonarchimedean complete field, $P,Q\in K[[X]]$ are formal power series, such that $Q(0)=0$, and $x\in K$, such that the series $Q(x)$ converges in $K$ and the series $P(Q(x))$ converges in $K$, then does the series $(P\circ Q)(x)$, where $P\circ Q\in K[[X]]$ is the formal composite of the power series, converge in $K$ with value $(P\circ Q)(x)=P(Q(x))$?

Let's write $P=\sum_{i\ge0}a_iX^i$ and $Q=\sum_{j\ge1}b_jX^j$. The formal composite works out to be
$$P\circ Q=\sum_{k\ge0}\left(\sum_{i\ge0}a_i\sum_{\substack{j_1+\dotsc+j_i=k\\j_1,\dotsc,j_i\ge1}}b_{j_1}\cdot\dotsc\cdot b_{j_n}\right)X^k.$$ Note that the sum in brackets is a finite sum, since the terms for $i>k$ vanish, but the number of summands is not bounded as $k\rightarrow\infty$. Now, first note that $Q(x)^i=\left(\sum_{j\ge1}b_jx^j\right)^i=\sum_{k\ge0}\left(\sum_{\substack{j_1+\dotsc+j_i=k\\j_1,\dotsc,j_i\ge1}}b_{j_1}\cdot\dotsc\cdot b_{j_i}\right)x^k$. This identity is clear if it were an identity of formal power series, but to justify it as an identity of series in $K$, one also needs the above lemma to justify some rearrangements. Nonetheless, this holds true unconditionally for products of series in a nonarchimedean complete field (see Corollary 2.11 here). Then, we obtain
$$P(Q(x))=\sum_{i\ge0}a_iQ(x)^i=\sum_{i\ge0}\sum_{k\ge0}a_i\sum_{\substack{j_1+\dotsc+j_i=k\\j_1,\dotsc,j_i\ge1}}b_{j_1}\cdot\dotsc\cdot b_{j_i}x^k.$$
This is almost the series $(P\circ Q)(x)$, except we would have to interchange the summations $\sum_{i\ge0}$ and $\sum_{k\ge0}$. If the lemma were to apply in this situation, life would be great and the result would follow. However, I cannot establish the hypothesis that $a_i\sum_{\substack{j_1+\dotsc+j_i=k\\j_1,\dotsc,j_i\ge1}}b_{j_1}\cdot\dotsc\cdot b_{j_i}x^k\rightarrow0$ as $\max(i,k)\rightarrow\infty$.

Best Answer

This is not true in full generality. For a silly example, suppose $|a_i|$ grows extremely fast, and take $Q(X)=X^2-X$. Then $Q(1)=0$ so $P(Q(1))$ converges. However, it is easy to see that if you choose $|a_i|$ to grow fast enough then the norms of the coefficients of $P\circ Q$ will still grow fast such that $(P\circ Q)(x)$ cannot converge for any $x\neq 0$ (you just need to choose $|a_k|$ to be large enough so that $\sum_{i\ge0}a_i\sum_{\substack{j_1+\dotsc+j_i=k\\j_1,\dotsc,j_i\ge1}}b_{j_1}\cdot\dotsc\cdot b_{j_n}$ is dominated by the $i=k$ term).

Here, then, is a correct statement. Suppose $r$ is such that the sequence $|b_j|r^j$ is bounded (in particular, this implies $Q(x)$ converges for all $|x|<r$). Let $R=\sup_j(|b_j|r^j)$ (this is what you would "formally" expect $|Q(x)|$ to be for $|x|=r$ if there was not any fortuitous cancellation that made it smaller). Suppose that the sequence $|a_i|R^i$ is also bounded (in particular, this implies $P(Q(x))$ converges for all $|x|<r$). Then for any $x$ such that $|x|<r$, $(P\circ Q)(x)$ converges to $P(Q(x))$.

(I suspect this can be improved to include the case $|x|=r$ if $|b_j|r^j\to 0$ and $|a_i|R^i\to 0$ so $P(x)$ and $Q(P(x))$ converge for $|x|=r$, but do not quite see how to make the argument work in that case.)

To prove this, as you have observed, it suffices to show that $$c_{ik}=a_i\sum_{\substack{j_1+\dotsc+j_i=k\\j_1,\dotsc,j_i\ge1}}b_{j_1}\cdot\dotsc\cdot b_{j_i}x^k$$ goes to $0$ as $\max(i,k)\to\infty$. Since $|b_j|\leq R/r^j$ for all $j$, we have $|c_{ik}|\leq |a_i|R^i|x|^k/r^k$. Since $|x|<r$, $|x|^k/r^k\to 0$ as $k\to\infty$. Since $|a_i|R^i$ is bounded, this means $c_{ik}\to 0$ as $k\to \infty$ uniformly in $i$. Since $c_{ik}=0$ for $i>k$, this implies $c_{ik}\to 0$ as $\max(i,k)\to\infty$, as desired.

Related Question