Formal Power Series Article Index for
Formal
Website Links For
Formal
 

Information About

Formal Power Series





INFORMAL INTRODUCTION


A formal power series can be loosely thought of as a " Polynomial with infinitely many terms". Alternatively, for those familiar with Power Series (or Taylor Series ), one may think of a formal power series as a power series in which we ignore questions of Convergence . For example, consider the series
:A = 1 - 3x + 5x^2 - 7x^3 + 9x^4 - 11x^5 + \cdots.
If we studied this as a power series, its properties would include, for example, that its radius of convergence is 1. However, as a formal power series, we may ignore this completely; all that is relevant is the sequence of Coefficient s −3, 5, −7, 9, −11, ... . In other words, a formal power series is just an object that records a sequence of coefficients.

Arithmetic on formal power series is carried out by simply pretending that the series are polynomials. For example, if
:B = 2x + 4x^3 + 6x^5 + \cdots,
then we add ''A'' and ''B'' term by term:
:A + B = 1 - x + 5x^2 - 3x^3 + 9x^4 - 5x^5 + \cdots.
We can multiply formal power series, again just by treating them as polynomials:
:AB = 2x - 6x^2 + 14x^3 - 26x^4 + 44x^5 + \cdots.
Notice that each coefficient in the product AB only depends on a ''finite'' number of coefficients of A and B. For example, the ''x''5 term is given by
:44x^5 = (1 imes 6x^5) + (5x^2 imes 4x^3) + (9x^4 imes 2x).\,\!
For this reason, one may multiply formal power series without worrying about the usual questions of Absolute , Conditional and Uniform Convergence which arise in dealing with power series in the setting of Analysis .

Similarly, many other operations that are carried out on polynomials can be extended to the formal power series setting, as explained below.


FORMAL DEVELOPMENT



Two definitions of the formal power series ring


We start with a Commutative Ring ''R''. We want to define the ring of formal power series over R in the variable X, denoted by ''R'' ''X'' ; elements of this ring should be thought of as power series whose coefficients are elements of ''R''.

Perhaps the most efficient definition of ''R'' ''X'' is as the Completion of the Polynomial ring ''R'' with respect to the I-adic Topology determined by the ideal ''I'' of ''R''[''X'' generated by ''X''. This results in a complete topological ring containing ''R''[''X''] as a dense subspace. This method determines the ring structure and topological structure simultaneously.

However, it is possible to describe ''R'' ''X'' more explicitly and with less algebraic machinery, giving the ring structure and topological structure separately, as follows.


Ring structure


We begin with the set ''R''N of all infinite sequences in ''R''. We define addition of two such sequences by

:
\left( a_n ight) + \left( b_n ight) = \left( a_n + b_n ight)


and multiplication by

:
\left( a_n ight) imes \left( b_n ight) =
\left( \sum_{k=0}^n a_k b_{n-k} ight).


This type of product is called the Cauchy Product of the two sequences of coefficients, and is a sort of discrete Convolution . With these operations, ''R''N becomes a commutative ring with zero element (0, 0, 0, ...) and multiplicative identity (1, 0, 0,...).

If we identify the element ''a'' of ''R'' with the sequence (''a'', 0, 0, ...) and define ''X'' := (0, 1, 0, 0, ...), then using the above definitions of addition and multiplication, we find that every sequence with only finitely many nonzero terms can be written as the ''finite'' sum

:
(a_0, a_1, a_2, \ldots, a_N, 0, 0, \ldots) = a_0 + a_1 X + \cdots + a_N X^N = \sum_{n=0}^N a_n X^n.



Topological structure


We would like to extend the above formula to a similar one for arbitrary sequences in ''R''N, that is, we would like
:
(a_0, a_1, a_2, a_3, \ldots) = \sum_{n=0}^\infty a_n X^n \qquad (1)

to hold. However, for the infinite sum on the right to make sense, we need a notion of convergence in ''R''N, which involves introducing a Topology on ''R''N. There are several equivalent ways to define the appropriate topology.

  • We may give ''R''N the Product Topology , where each copy of ''R'' is given the Discrete Topology .

  • We may introduce a Metric (or "distance function"). For sequences (''a''''n'') and (''b''''n'') in ''R''N, let us define

  • ::d((a_n), (b_n)) = 2^{-k},\,\!

:where ''k'' is the smallest Natural Number such that ''a''''k'' ≠ ''b''''k''; if there is no such ''k'', then the two sequences are identical, so we set their distance to be zero.
  • We may give ''R''N the I-adic Topology , where ''I'' = (''X'') is the ideal generated by ''X'', which consists of all sequences whose first term ''a''0 is zero.


All of these definitions of the topology amount to declaring that two sequences (''a''''n'') and (''b''''n'') are "close" if their first few terms agree; the more terms agree, the closer they are.

Now we can make sense of equation (1); the Partial Sum s of the infinite sum certainly converge to the sequence on the left hand side. In fact, any rearrangement of the series converges to the same Limit .

One must check that this topological structure, together with the ring operations described above, form a Topological Ring . This is called the ring of formal power series over ''R'' and is denoted by ''R'' ''X'' .


Universal property


The ring ''R'' → ''S'' with the following properties:
  • Φ is an ''R''-algebra homomorphism

  • Φ is continuous

  • Φ(''X'') = ''x''.



Operations on formal power series



Inverting series


The series
:\sum_{n=0}^\infty a_n X^n
in ''R'' ''X'' is invertible in ''R'' ''X'' if and only if its constant coefficient ''a''0 is invertible in ''R''. An important special case is that the Geometric Series formula is valid in ''R'' ''X'' :

:
\left( 1 - X ight)^{-1} = \sum_{n \ge 0} X^n.



Composition of series


Given formal power series
:f(X) = \sum_{n=1}^\infty a_n X^n = a_1 X + a_2 X^2 + \cdots
and
:g(X) = \sum_{n=0}^\infty b_n X^n = b_0 + b_1 X + b_2 X^2 + \cdots,
one may form the ''composition''
:g(f(X)) = \sum_{n=0}^\infty b_n f(X)^n = \sum_{n=0}^\infty c_n X^n,
where the coefficients ''c''''n'' are determined by "expanding out" the powers of ''f''(''X''). A more explicit description of these coefficients is provided by Faà Di Bruno's Formula .

The critical point here is that this operation is only valid when ''f''(''X'') has ''no constant term'', so that the series for ''g''(''f''(''X'')) converges in the topology of ''R'' ''X'' . In other words, each ''c''''n'' depends on only a finite number of coefficients of ''f''(''X'') and ''g''(''X'').


= Example


If we denote by exp(''X'') the formal power series
:\exp(X) = 1 + x + rac{x^2}{2!} + rac{x^3}{3!} + rac{x^4}{4!} + \cdots,
then the expression
:\exp(\exp(X) - 1) = 1 + x + x^2 + rac{5x^3}6 + rac{5x^4}8 + \cdots
makes perfect sense as a formal power series. However, the statement
:\exp(\exp(X)) = e \exp(\exp(X) - 1) = e + ex + ex^2 + rac{5ex^3}6 + \cdots
is not a valid application of the composition operation for formal power series. Rather, it is confusing the notions of convergence in ''R'' ''X'' and convergence in ''R''; indeed, the ring ''R'' may not even contain any number ''e'' with the appropriate properties.


Formal differentiation of series


Given a formal power series
:f = \sum_{n\geq 0} a_n X^n
in ''R'' ''X'' , we define its formal derivative, denoted ''Df'', by

:
Df = \sum_{n \geq 1} a_n n X^{n-1}.


The symbol ''D'' is called the formal differentiation operator. The motivation behind this definition is that it simply mimics term-by-term differentiation of a polynomial.

This operation is ''R''- Linear :

:
D(af + bg) = a Df + b Dg \,\!


for any ''a'', ''b'' in ''R'' and any ''f'', ''g'' in ''R'' ''X'' . Additionally, the formal derivative has many of the properties of the usual Derivative of calculus. For example, the Product Rule is valid:

:
D(fg) = f(Dg) + (Df) g; \,\!


and the Chain Rule works as well:

:
D(f(u)) = (Df)(u) Du, \,\!


whenever the appropriate compositions of series are defined (see above under Composition Of Series ).

In a sense, all formal power series are Taylor Series . Indeed, for the ''f'' defined above, we find that
:
(D^k f)(0) = k! a_k, \,\!

where ''D''''k'' denotes the ''k''th formal derivative (that is, the result of formally differentiating ''k'' times).


Algebraic properties of the formal power series ring


''R'' ''X'' is an Associative Algebra over ''R'' which contains the ring ''R'' {Link without Title} of polynomials over ''R''; the polynomials correspond to the sequences which end in zeros.

The Jacobson Radical of ''R'' ''X'' is the Ideal generated by ''X'' and the Jacobson radical of ''R''; this is implied by the element invertibility criterion discussed above.

The is maximal if and only if ''M'' ∩ ''R'' is a maximal ideal of ''R'' and ''M'' is generated as an ideal by ''X'' and ''M'' ∩ ''R''.

Several algebraic properties of ''R'' are inherited by ''R'' ''X'' :

If ''R'' = ''K'' is a Field , then ''K'' ''X'' has several additional properties.


Topological properties of the formal power series ring


The metric space (''R'' ''X'' , ''d'') is Complete .

The ring ''R'' ''X'' is Compact if and only if ''R'' is Finite . This follows from Tychonoff's Theorem and the characterisation of the topology on ''R'' ''X'' as a product topology.


APPLICATIONS


Formal power series can be used to solve recurrences occurring in number theory and combinatorics. For an example involving finding a closed form expression for the Fibonacci Numbers , see the article on Generating Function s.

One can use formal power series to prove several relations familiar from analysis in a purely algebraic setting. Consider for instance the following elements of Q ''X'' :

:
\sin(X) := \sum_{n \ge 0} rac{(-1)^n} {(2n+1)!} X^{2n+1}


:
\cos(X) := \sum_{n \ge 0} rac{(-1)^n} {(2n)!} X^{2n}


Then one can show that

:
\sin^2 + \cos^2 = 1


and

:
D \sin = \cos


as well as

:
\sin (X+Y) = \sin(X) \cos(Y) + \cos(X) \sin(Y)


(the latter being valid in the ring Q ''X'',''Y'' ).

In algebra, the ring ''K'' ''X''1, ..., ''X''''r'' (where ''K'' is a field) is often used as the "standard, most general" complete local ring over ''K''.


INTERPRETING FORMAL POWER SERIES AS FUNCTIONS


In Mathematical Analysis , every convergent Power Series defines a Function with values in the Real or Complex numbers. Formal power series can also be interpreted as functions, but one has to be careful with the Domain and Codomain . If ''f''=∑''a''''n'' ''X''''n'' is an element
of ''R'' ''X'' , ''S'' is a commutative associative algebra over ''R'', ''I'' is an ideal in ''S'' such that the I-adic Topology on ''S'' is complete, and ''x'' is an element of ''I'', then we can define

:
f(x) = \sum_{n\ge 0} a_n x^n


This latter series is guaranteed to converge in ''S'' given the above assumptions on ''x''. Furthermore, we have

:
(f+g)(x) = f(x) + g(x)


and

:
(fg)(x) = f(x) g(x)


Unlike in the case of bona fide functions, these formulas are not definitions but have to be proved.

Since the topology on ''R''.

With this formalism, we can give an explicit formula for the multiplicative inverse of a power series ''f'' whose constant coefficient ''a'' = ''f''(0) is invertible in ''R'':

:
f^{-1} = \sum_{n \ge 0} a^{-n-1} (a-f)^n


If the formal power series ''g'' with ''g''(0) = 0 is given implicitly by the equation

:
f(g) = X


where ''f'' is a known power series with ''f''(0) = 0, then the coefficients of ''g'' can be explicitly computed using the Lagrange Inversion Theorem .


GENERALIZATIONS



Formal Laurent series


A formal Laurent series over ''R'' is defined in a similar way to a formal power series, except that we also allow finitely many terms of negative degree. That is, we consider series of the form
:
f = \sum_{n \ge -M} a_n X^n

where ''M'' is an integer which depends on ''f''. We may add and multiply such series using the same formal rules as for formal power series; note that multiplication makes sense because we have only allowed finitely many negative index terms. Under these operations, these elements form the ring of formal Laurent series over ''R'', denoted by ''R''((''X'')). It is a topological ring, and its relationship to formal power series is analogous to the relationship between Power Series and Laurent Series .

If ''R'' = ''K'' is a Field , then ''K''((''X'')) may also be obtained as the Field Of Fractions of the Integral Domain ''K'' ''X'' .

One may define formal differentiation for formal Laurent series in a natural way (term-by-term). If ''R'' is a field, then in addition to the rules listed above under Formal Differentiation Of Series , the Quotient Rule will also be valid.


Power series in several variables


It is relatively straightforward to extend the above ideas to define a formal power series ring over ''R'' in ''r'' variables, denoted ''R'' ''X''1,...,''X''''r'' . Elements of this ring may be expressed uniquely in the form
:
\sum_{\mathbf{n}\in\Bbb{N}^r} a_\mathbf{n} \mathbf{X^n}

where now n = (''n''1,...,''n''''r'') ∈ '''N'''''r'', and '''X'''n denotes the monomial ''X''1''n''1...''X''''r''''n''''r''. This sum converges for any choice of the coefficients ''a''n∈''R'', and the order of summation is immaterial.


Definition


One possible definition of ''R'' ''X''1,...,''X''''r'' is to take the completion of the polynomial ring ''R'' in ''r'' variables with respect to the I-adic topology, where ''I'' is the ideal of ''R''[''X''1,...,''X''''r'' generated by ''X''1,...,''X''''r''. That is, ''I'' is the ideal consisting of polynomials with zero constant term.

Alternatively, one may proceed in a similar way to the more explicit discussion given above for the single-variable case, giving the ring structure first in terms of "multi-dimensional" sequences, and then defining the topology.

The topology on ''R'' ''X''1,...,''X''''r'' is the J-adic topology, where ''J'' is the ideal of ''R'' ''X''1,...,''X''''r'' generated by ''X''1,...,''X''''r''. That is, ''J'' is the ideal consisting of series with zero constant term. Therefore, two series are considered "close" if their first few terms agree, where "first few" means terms whose total degree ''n''1 + ... + ''n''''r'' is small.


Warning


Although ''R'' ''X''1, ''X''2 and ''R'' ''X''1 ''X''2 are isomorphic as ''rings'', they do ''not'' carry the same topology. For example, the sequence of elements
:f_n = X_1^n, \quad n \geq 1
converges to zero in ''R'' ''X''1, ''X''2 as ''n'' → ∞; however, in the ring ''R'' ''X''1 ''X''2 , it does ''not'' converge, since the copy of ''R'' ''X''1 embedded in ''R'' ''X''1 ''X''2 has been given the discrete topology.


Operations


All of the operations defined for series in one variable may be extended to the several variables case.
  • Addition is carried out term-by-term.

  • Multiplication is carried out simply by "multiplying out" the series.

  • A series is invertible if and only if its constant term is invertible in ''R''.

  • The composition ''f''(''g''(''X'')) of two series ''f'' and ''g'' is defined only if the constant term of ''g'' is zero.


In the case of the formal derivative, there are now ''r'' different Partial Derivative operators, which differentiate with respect to each of the ''r'' variables. They all commute with each other, as they do for continuously differentiable functions.


Universal property


In the several variables case, the universal property characterizing ''R'' → ''S'' with the following properties:
  • Φ is an ''R''-algebra homomorphism

  • Φ is continuous

  • Φ(''X''''i'') = ''x''''i'' for ''i'' = 1, ..., ''r''.



Replacing the index set by an ordered abelian group


Suppose ''G'' is an ordered abelian group, meaning an abelian group with a total ordering "<" respecting the group's addition, so that ''a'' < ''b'' iff ''a'' + ''c'' < ''b'' + ''c'' for all ''c''. Let I be a well-ordered subset of ''G'', meaning I contains no infinite descending chain. Consider the set consisting of

:\sum_{i \in I} a_i X^i

for all such I, with ''a''''i'' in a commutative ring ''R'', where we assume that for any index set, if all of the ''a''''i'' are zero then the sum is zero. Then ''R''((''G'')) is the ring of formal power series on ''G''; because of the condition that the indexing set be well-ordered the product is well-defined, and we of course assume that two elements which differ by zero are the same.

Various properties of ''R'' transfer to ''R''((''G'')). If ''R'' is a field, then so is ''R''((''G'')). If ''R'' is an ordered field, we can order ''R''((''G'')) by setting any element to have the same sign as its leading coefficient, defined as the least element of the index set I associated to a non-zero coefficient. Finally if ''G'' is a Divisible Group and ''R'' is a Real Closed Field , then ''R''((''G'')) is a real closed field, and if ''R'' is Algebraically Closed , then so is ''R''((''G'')).

This theory is due to Hans Hahn , who also showed that one obtains subfields when the number of (non-zero) terms is bounded by some fixed infinite cardinality.


EXAMPLES AND RELATED TOPICS