Poincaré-Birkhoff-Witt Theorem

PROOF. We first note that $S_k (\mathfrak{g})$ has the set of monomials

$\displaystyle \{x_{\lambda_1} \circ x_{\lambda_2} \circ x_{\lambda_3} \circ \dots \circ x_{\lambda_n} ;\lambda_1\leq \lambda_2\leq \lambda_3 \leq \dots \leq \lambda_n\}$

as a $k$-basis. For a monomial $w=x_{\lambda_1} \circ x_{\lambda_2} \circ x_{\lambda_3} \circ \dots\circ x_{\lambda_n}$ such that $\lambda_1\leq \lambda_2\leq \lambda_3 \leq \dots \leq \lambda_n$ , we put $z=x_{\lambda_2}\circ x_{\lambda_3} \circ \dots \circ x_{\lambda_n}$ . Then

$\displaystyle w=f_{m-1}(x_{\lambda_1},z).$

We define inductively the action of $x_{\lambda_0}$ on it by the following equations.

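$\displaystyle f_m(x_{\lambda_0} , x_{\lambda_1} \circ z) = \begin{cases} x_{\lambda_0} \circ x_{\lambda_1} \circ z &(\text{if } \lambda_0\leq \lambda_1)\\ \left( \begin{aligned} & x_{\lambda_1}\circ x_{\lambda_0}\circ z + f_{m-1}(x_{\lambda_1}, f_{m-1}(x_{\lambda_0},z)-x_{\lambda_0}\circ z) \\ & + f_{m-1}([x_{\lambda_0},x_{\lambda_1}], z) \end{aligned}\right ) &(\text{if } \lambda_0>\lambda_1)\\ \end{cases}$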

We first note that the above definition is forced by our conditions. Indeed, by (2) we must define the action as above for $\lambda_0\leq \lambda_1$ . When $\lambda_0>\lambda_1$ , we compute

		$\displaystyle x_{\lambda_0}.(x_{\lambda_1}\circ z)$
	$\displaystyle \overset{(3)}{=}$	$\displaystyle x_{\lambda_1}.x_{\lambda_0}.z+[x_{\lambda_0},x_{\lambda_1}].z$
	$\displaystyle =$	$\displaystyle x_{\lambda_1}.(x_{\lambda_0}.z-x_{\lambda_0}\circ z) + x_{\lambda_1}.(x_{\lambda_0}\circ z) +[x_{\lambda_0},x_{\lambda_1}].z$
	$\displaystyle \overset{(2)}{=}$	$\displaystyle x_{\lambda_1}.(x_{\lambda_0}.z-x_{\lambda_0}\circ z) + x_{\lambda_1}\circ x_{\lambda_0}\circ z +[x_{\lambda_0},x_{\lambda_1}].z$

and look carefully at the degree of each monomial using (1). From this argument we see in particular that the action is uniquely determined by conditions (1), (2), (3).
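The recursion above can be tried out on a small case. The following is a minimal sketch, not part of the original argument: it assumes $\mathfrak{g}=\mathfrak{sl}_2$ with ordered basis $e<h<f$ and brackets $[h,e]=2e$, $[h,f]=-2f$, $[e,f]=h$. The names `BRACKET`, `act`, `act_on_monomial` and `satisfies_condition_3` are hypothetical helpers introduced only for illustration. Elements of the symmetric algebra are stored as dictionaries from sorted index tuples to integer coefficients.

```python
from collections import defaultdict
from itertools import combinations_with_replacement

# Assumption for illustration: g = sl_2 with ordered basis e=0 < h=1 < f=2.
E, H, F = 0, 1, 2
BRACKET = {
    (H, E): {E: 2}, (E, H): {E: -2},
    (H, F): {F: -2}, (F, H): {F: 2},
    (E, F): {H: 1}, (F, E): {H: -1},
}

def add_scaled(target, poly, coeff):
    """target += coeff * poly, for polynomials stored as dicts."""
    for mono, c in poly.items():
        target[mono] += coeff * c

def act_on_monomial(a, mono):
    """x_a . mono, where mono is a sorted tuple of basis indices."""
    if not mono or a <= mono[0]:
        # Condition (2): the action is plain multiplication here.
        return {(a,) + mono: 1}
    b, z = mono[0], mono[1:]
    # x_a.(x_b o z) = x_b.(x_a.z) + [x_a, x_b].z, as computed in the text.
    out = defaultdict(int)
    add_scaled(out, act(b, act_on_monomial(a, z)), 1)
    for k, c in BRACKET.get((a, b), {}).items():
        add_scaled(out, act_on_monomial(k, z), c)
    return {m: c for m, c in out.items() if c != 0}

def act(a, poly):
    """x_a . poly, extended linearly from monomials."""
    out = defaultdict(int)
    for mono, c in poly.items():
        add_scaled(out, act_on_monomial(a, mono), c)
    return {m: c for m, c in out.items() if c != 0}

def satisfies_condition_3(max_degree):
    """Brute-force check of x_a.x_b.z - x_b.x_a.z = [x_a,x_b].z in low degree."""
    basis = (E, H, F)
    for n in range(max_degree + 1):
        for z in combinations_with_replacement(basis, n):
            for a in basis:
                for b in basis:
                    diff = defaultdict(int)
                    add_scaled(diff, act(a, act_on_monomial(b, z)), 1)
                    add_scaled(diff, act(b, act_on_monomial(a, z)), -1)
                    for k, c in BRACKET.get((a, b), {}).items():
                        add_scaled(diff, act_on_monomial(k, z), -c)
                    if any(diff.values()):
                        return False
    return True
```

For instance, `act_on_monomial(H, (E,))` encodes $h.e = e\circ h + 2e$, and `satisfies_condition_3(3)` tests condition (3) on all monomials of degree at most 3. Such a check is only a sanity test in one example and does not replace the inductive proof.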

It is easy to see that conditions (1) and (2) are satisfied by the action defined as above. Let us verify that the action so defined also satisfies (3). Consider $x_\lambda$, $x_\mu$ and $z=x_{\mu_1}\circ x_{\mu_2} \circ \dots \circ x_{\mu_n}$ with $\mu_1 \leq \mu_2 \leq \dots \leq \mu_n$ , $n\leq m-1$ . We need to prove

( $\flat$ )

$\displaystyle x_\lambda.x_\mu. z-x_\mu.x_\lambda.z=[x_\lambda,x_\mu].z.$

Since the equation above is antisymmetric in $\lambda,\mu$ , we may assume that $\lambda\leq \mu$ .

(i) Case where $\lambda \leq \mu_1$ .

		$\displaystyle x_\lambda. x_\mu.z$
	$\displaystyle =$	$\displaystyle x_\lambda.(x_\mu \circ z)+x_\lambda.(x_\mu. z-x_\mu\circ z)$
	$\displaystyle \overset{(2)}{=}$	$\displaystyle x_\lambda\circ x_\mu \circ z +x_\lambda.(x_\mu.z-x_\mu\circ z)$

In other words,

$\displaystyle f_m(x_\lambda, f_m(x_\mu,z)) =x_\lambda\circ x_\mu \circ z + f_{m-1}(x_\lambda,(f_{m-1}(x_\mu,z)-x_\mu \circ z)).$

On the other hand we have

		$\displaystyle x_\mu.x_\lambda.z$
	$\displaystyle =$	$\displaystyle x_\mu.(x_\lambda\circ z)$
	$\displaystyle \overset{\text{by def}}{=}$	$\displaystyle x_\lambda\circ x_\mu \circ z+ f_{m-1}(x_\lambda,f_{m-1}(x_\mu,z)-x_\mu\circ z)+f_{m-1}([x_\mu,x_\lambda], z)$

So the equation ($\flat$) holds in this case.

(ii) Case where $\lambda ,\mu >\mu_1$ .

In this case we need to ``decompose'' further:

$\displaystyle z=x_\nu .w .$

We first forget about the hypothesis $\lambda\leq \mu$ and prove

		$\displaystyle x_\lambda. (x_\mu. (x_\nu. w)) \qquad (\heartsuit)$
	$\displaystyle =$	$\displaystyle x_\nu.( x_\lambda.( x_\mu.w)) + [x_\lambda,x_\nu].( x_\mu.w) + [x_\mu,x_\nu]. (x_\lambda.w) +[x_\lambda,[x_\mu,x_\nu]].w$

(Since we are doing induction, we need to pay special attention to the degrees of the operands. That means we should use the $f_m$'s rather than the above ``lazy'' notation. But that is fairly cumbersome, so we keep on being lazy here.)

Let us now assume that the above equation ( $\heartsuit$ ) is true and prove the rest of (3). By interchanging $\lambda$ and $\mu$ in the equation ( $\heartsuit$ ), we obtain

		$\displaystyle x_\mu. (x_\lambda. (x_\nu. w)) \qquad (\diamondsuit)$
	$\displaystyle =$	$\displaystyle x_\nu.( x_\mu.( x_\lambda.w)) + [x_\mu,x_\nu].( x_\lambda.w) + [x_\lambda,x_\nu]. (x_\mu.w) +[x_\mu,[x_\lambda,x_\nu]].w$

Then by subtracting $(\diamondsuit)$ from $(\heartsuit)$ , we obtain

		$\displaystyle x_\lambda.(x_\mu. (x_\nu .w))-x_\mu. (x_\lambda. (x_\nu. w))$
	$\displaystyle =$	$\displaystyle x_\nu.(x_\lambda.(x_\mu. w)-x_\mu. (x_\lambda. w))$
		$\displaystyle + ([x_\lambda,[x_\mu,x_\nu]] - [x_\mu,[x_\lambda,x_\nu]]).w.$

Since $\deg(w)$ is smaller than $\deg(z)$ , by the induction hypothesis the first term on the right-hand side may be replaced by $x_\nu. ([x_\lambda,x_\mu].w)$ . By the Jacobi identity, the second term may be replaced by $[[x_\lambda,x_\mu],x_\nu].w$ . Applying the induction hypothesis once more, the sum of the two equals $[x_\lambda,x_\mu].(x_\nu.w)=[x_\lambda,x_\mu].z$ . So the equation ($\flat$) holds in this case too.
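The form of the Jacobi identity used here, $[x_\lambda,[x_\mu,x_\nu]] - [x_\mu,[x_\lambda,x_\nu]] = [[x_\lambda,x_\mu],x_\nu]$, can be checked numerically in a concrete case. The sketch below assumes the standard $2\times 2$ matrix realization of $\mathfrak{sl}_2$ with the commutator as bracket. The helper names `matmul`, `sub`, `comm` and `jacobi_rearranged` are hypothetical and introduced only for this illustration.

```python
from itertools import product

def matmul(a, b):
    """Product of two 2x2 matrices given as tuples of tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def sub(a, b):
    """Entrywise difference of two 2x2 matrices."""
    return tuple(tuple(a[i][j] - b[i][j] for j in range(2)) for i in range(2))

def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return sub(matmul(a, b), matmul(b, a))

# Assumed example: the standard basis of sl_2 as 2x2 integer matrices.
E = ((0, 1), (0, 0))
H = ((1, 0), (0, -1))
F = ((0, 0), (1, 0))

def jacobi_rearranged(x, y, w):
    """Check [x,[y,w]] - [y,[x,w]] == [[x,y],w]."""
    lhs = sub(comm(x, comm(y, w)), comm(y, comm(x, w)))
    rhs = comm(comm(x, y), w)
    return lhs == rhs

def check_all_basis_triples():
    """Run the check on every ordered triple of basis elements."""
    return all(jacobi_rearranged(x, y, w) for x, y, w in product((E, H, F), repeat=3))
```

Because both sides are trilinear, agreement on all triples of basis elements gives the identity on all of $\mathfrak{sl}_2$ in this realization.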

It remains to prove the equation ( $\heartsuit$ ). By the induction hypothesis we have

$\displaystyle x_\mu. (x_\nu.w)=x_\nu.(x_\mu.w)+[x_\mu,x_\nu].w.$

Also by the induction hypothesis we have

$\displaystyle x_\lambda.([x_\mu,x_\nu].w) = [x_\mu,x_\nu].(x_\lambda.w) +[x_\lambda,[x_\mu,x_\nu]].w$

Lastly, we decompose $x_\mu.w$ as

$\displaystyle x_\mu.w=(x_\mu\circ w)+(x_\mu.w-x_\mu\circ w) =(x_\mu\circ w)+y.$

Then the second term $y$ has degree smaller than $\deg(z)=\deg(w)+1$ , so the induction hypothesis applies to it. The case (i) applies to the first term and we obtain:

$\displaystyle x_\lambda.(x_\nu.(x_\mu.w))= x_\nu.(x_\lambda.(x_\mu.w)) +[x_\lambda,x_\nu].(x_\mu.w).$

Together, these complete the proof. $\qedsymbol$

PROOF. Let

$\displaystyle \iota_0: \mathfrak{g}\to \operatorname{Gr}(U(\mathfrak{g}))$

be the obvious $k$-linear map.

By the universal property of the symmetric algebra, there exists a unique $k$-algebra homomorphism

$\displaystyle \Phi: S(\mathfrak{g})\to \operatorname{Gr}(U(\mathfrak{g}))$

which extends $\iota_0$ . On the other hand, the action defined in Lemma 1.3 gives us a linear map

$\displaystyle \Psi_0:U(\mathfrak{g}) \ni x \mapsto x.1 \in S(\mathfrak{g})$

which clearly does not increase degrees. So it defines a $k$-linear map

$\displaystyle \Psi: \operatorname{Gr}(U(\mathfrak{g}))\to\operatorname{Gr}(S(\mathfrak{g}))\cong S(\mathfrak{g}).$

Now the composition we obtain

$\displaystyle \Psi\circ \Phi: S(\mathfrak{g})\overset{\Phi}{\to} \operatorname{Gr}(U(\mathfrak{g}))\overset{\Psi}{\to} S(\mathfrak{g})$

coincides with the identity map. Indeed, it coincides with the identity on monomials of the form

$\displaystyle x_{\lambda_1}\circ x_{\lambda_2}\circ x_{\lambda_3}\circ \dots \circ x_{\lambda_{n-1}}\circ x_{\lambda_n}.$

Since $\Psi\circ\Phi$ is the identity, $\Phi$ is injective. The map $\Phi$ is also easily verified to be surjective. So we conclude that $\Phi$ and $\Psi$ are both bijective and are inverse to each other. $\qedsymbol$
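The map $\Psi_0: x\mapsto x.1$ can be computed explicitly in a small case. The following is a minimal sketch under the assumption $\mathfrak{g}=\mathfrak{sl}_2$ with ordered basis $e<h<f$. The names `BRACKET`, `act`, `psi0` and `ordered_words_are_fixed` are hypothetical helpers introduced only for illustration. A word in $U(\mathfrak{g})$ is given as a tuple of basis indices, and an element of $S(\mathfrak{g})$ as a dictionary from sorted index tuples to integer coefficients.

```python
from collections import defaultdict
from itertools import combinations_with_replacement

# Assumption for illustration: g = sl_2 with ordered basis e=0 < h=1 < f=2.
E, H, F = 0, 1, 2
BRACKET = {
    (H, E): {E: 2}, (E, H): {E: -2},
    (H, F): {F: -2}, (F, H): {F: 2},
    (E, F): {H: 1}, (F, E): {H: -1},
}

def act(a, poly):
    """x_a . poly for the action of Lemma 1.3, extended linearly."""
    out = defaultdict(int)
    for mono, c in poly.items():
        if not mono or a <= mono[0]:
            # Ordered case: the action is plain multiplication.
            out[(a,) + mono] += c
            continue
        b, z = mono[0], mono[1:]
        # x_a.(x_b o z) = x_b.(x_a.z) + [x_a, x_b].z
        for m2, c2 in act(b, act(a, {z: 1})).items():
            out[m2] += c * c2
        for k, s in BRACKET.get((a, b), {}).items():
            for m2, c2 in act(k, {z: 1}).items():
                out[m2] += c * s * c2
    return {m: c for m, c in out.items() if c != 0}

def psi0(word):
    """Image x.1 of the word x = x_{w_1} x_{w_2} ... x_{w_n} in U(g)."""
    poly = {(): 1}
    for a in reversed(word):  # the rightmost letter acts first
        poly = act(a, poly)
    return poly

def ordered_words_are_fixed(max_degree):
    """Check that every ordered word is sent to the same monomial in S(g)."""
    return all(
        psi0(word) == {word: 1}
        for n in range(max_degree + 1)
        for word in combinations_with_replacement((E, H, F), n)
    )
```

In this sketch `psi0((F, E))` encodes $f e \mapsto e\circ f - h$: the top-degree part is the commutative monomial, and the correction term lies in lower degree, which is why passing to $\operatorname{Gr}$ yields the identity on monomials.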