A reduced polynomial for α over F, whose highest coefficient is 1 and satisfies p(α)=0, is represented as irr(α,F)=p(x).
The highest degree of irr(α,F) is called the degree of α over F, and is represented as deg(α,F).
For the algebraic extension fieldE of F, if we say irr(α,F)=irr(β,F), then two elements α,β∈E are called conjugates over F.
Example
As a natural example, consider the conjugate complex numbera+ib=a−ib. For the reduced polynomial
p(x):=(x2−2ax+a2+b2)∈R[x],
defining it as
====p(a+ib)a2+i2ab−b2−2a2−i2ab+a2+b20a2−i2ab−b2−2a2+i2ab+a2+b2p(a−ib),
using the expression of algebra, we can see that (a+ib),(a−ib)∈C is a conjugate over R.
For a more intuitive understanding of conjugates, consider the error equation
x5+x3+x2+1=0.
Factoring it into irreducibles R[x],
x5+x3+x2+1=(x3+1)(x2+1)=(x+1)(x2+x+1)(x2+1),
therefore, solutions x=−1, x=±1, and x=2−1±−3 can be found. Although all of these satisfy the given equation, x=−1 is a solution to a linear equation and, therefore, cannot be a conjugate of the other solutions. Moreover, x=i and x=2−1+−3 are not conjugates since they are zeros of a different reduced polynomial.
As an example of conjugates without complex numbers, 2 and (−2), which are zeros of (x2−2)∈Q[x], are conjugates over Q.
Theorem
Let’s say deg(α,F)=n for algebraic α over the field F. By defining the mapping ψα,β:F(α)→F(β) as
ψα,β(c0+c1α+⋯+cn−1αn−1):=c0+c1β+⋯+cn−1βn−1,
irr(α,F):=a0+a1x+⋯+anxn,
then
a0+a1α+⋯+anαn=0.
Since ψα,β is a homomorphism,
ψα,β(0)===ψα,β(c0+c1α+⋯+cnαn)c0+c1β+⋯+cnβn0.
This means that irr(β,F) divides irr(α,F), and the same applies to (ψα,β)−1=ψβ,α, therefore the following holds:
irr(β,F)=irr(α,F).
(⟸)
Let
p(x):=irr(β,F)=irr(α,F)
and define the substitution functions ϕα:F[x]→F(α) and ϕβ:F[x]→F(β). Then, because of p(α)=p(β)=0, ϕα, and ϕβ have the same kernel ⟨p(x)⟩⊂F[x].
By the Fundamental Theorem of Homomorphism, two homomorphisms ψα:F/⟨p(x)⟩→F(α) and ψβ:F/⟨p(x)⟩→F(β) exist. By setting
ψα,β:=ψα∘(ψα)−1,
ψα,β:F(α)→F(β) is also a homomorphism. Therefore, the following holds for (c0+c1α+⋯+cn−1αn−1)∈F(α):
===ψα,β(c0+c1α+⋯+cn−1αn−1)(ψα∘(ψα)−1)(c0+c1α+⋯+cn−1αn−1)ψβ((c0+c1x+⋯+cn−1xn−1)+⟨p(x)⟩)c0+c1β+⋯+cn−1βn−1.
■
Meanwhile, we can derive the following useful corollary for equations with real coefficients.
Corollary
For f(x)∈R[x], if f(a+ib)=0, then f(a−ib)=0.
Proof of the corollary
Let’s say
f(x):=c0+c1x+⋯+cnxn.
Since f(a+ib)=0,
f(a+ib):=c0+c1(a+ib)+⋯+cn(a+ib)n=0,
i, and −i are conjugates over R, therefore, the following holds:
0=ψi,−i(0)=ψi,−i(f(a+ib))=f(a−ib).
■
Fraleigh. (2003). A first course in abstract algebra(7th Edition): p416. ↩︎