Then, there exists a linear functional x∗:X→R of X that satisfies the following conditions:
(a)x∗(y)=y∗(y),∀y∈Y
(b)x∗(x)≤p(x),∀x∈X
Explanation
To say that R− is a vector space means it is a vector space over the field R. In other words, it means that the conditions for scalar multiplication of a vector space, (M1)~(M5), hold true for real numbers. Similarly, the term R-linear means that the two properties of linearity, specifically scalar multiplication, hold true for real numbers.
Since X,Y is a R-vector space, the terms linear and R-linear mean the same. If this concept is confusing, think of R-, C- as non-existent in this text for ease of understanding the proof. Later, when applying the Hahn-Banach theorem to normed spaces, the function p corresponds to the norm. The proof of the theorem stated above is omitted and will be used as a lemma for proving the Hahn-Banach theorem for complex numbers.
Let X be a C-vector space and assume that Y⊂X. Let us define p:X→R as the following sublinear functional.
p(λx)=∣λ∣p(x),x∈X,λ∈C
And assume that y∗:Y→C satisfies the following condition as a linear functional of Y.
Re(y∗(y))≤p(y),∀y∈Y
Then, there exists a linear functional x∗:X→C of X that satisfies the following conditions:
x∗(y)=y∗(y),∀y∈Y
Re(x∗(x))≤p(x),∀x∈X
Explanation
Compared to the theorem for real numbers, the codomain of p being R remains unchanged because, as mentioned above, when X is a normed space, p corresponds to the norm. X, Y are C-vector spaces and since R⊂C, they also satisfy the conditions for being a R-vector space. This is because if all conditions for a vector space, (M1)~(M5), hold true for all complex numbers, they automatically hold true for all real numbers as well. Similarly, y∗, x∗ being C-linear means they also satisfy the condition of being R− linear.
Proof
Define the function ψ:Y→R as follows:
ψ(y)=Re(y∗(y))
Then, it can be shown that ψ is also a C-linear functional of Y. This is a trivial result since Re and y∗ are linear, and the demonstration is very straightforward, thus omitted. By the definition of ψ and (1), the following equation holds:
ψ(y)=Re(y∗(y))≤∣y∗(y)∣≤p(y)
Therefore, by the Hahn-Banach theorem for real numbers, there exists a R-linear functional Ψ:X→R of X that satisfies:
Ψ(y)=ψ(y),∀y∈Y
Ψ(x)≤p(x),∀x∈X
Next, define a new function Φ:X→C as follows. The final goal is to show that the defined Φ is the x∗ mentioned in the theorem.
Φ(x):=Ψ(x)−iΨ(ix)
Then, Φ can be seen as a linear functional of X. Since Ψ is R-linear, linearity with respect to addition and multiplication by real numbers is trivial, so only Φ(ix)=iΦ(x) needs to be verified.
Therefore, since Φ is a linear functional of X and satisfies (a), (b), x∗=Φ exists.
■
The Hahn-Banach Theorem for Seminorms
Let X be a C-vector space and assume that Y⊂X. Let p:X→R be a seminorm of X. And assume that y∗:Y→C satisfies the following condition as a linear functional of Y.
∣y∗(y)∣≤p(y),∀y∈Y
Then, there exists a linear functional x∗:X→C of X that satisfies the following conditions:
x∗(y)=y∗(y),∀y∈Y
∣x∗(x)∣≤p(x),∀x∈X
Proof
From the definitions of seminorm and sublinear, if p is a seminorm, it automatically satisfies the conditions of being sublinear.
It is trivial that the following equation is satisfied:
Re(y∗(y))≤∣y∗(y)∣≤p(y)
Therefore, by the Hahn-Banach theorem for complex numbers, there exists a linear functional x∗:X→C of X that satisfies the following two conditions:
x∗(y)=y∗(y)∀y∈Y
Re(x∗(x))≤p(x)∀x∈X
Let’s assume S={λ∈C:∣λ∣=1}. Then,
Re(λx∗(x))=≤=Re(λx∗(λx))p(λx)∣λ∣p(x)=p(x)∀x∈X
For a fixed x∈X, a λ∈S that satisfies ∣x∗(x)∣=λx∗(x) can always be found. Thus, for that particular λ, the following equation holds:
∣x∗(x)∣=λx∗(x)=Re(λx∗(x))≤p(x),∀x∈X
Since the linear functional x∗ of X satisfies both conditions, the proof is complete.
■
Appendix
For a fixed x, let’s say x∗(x)=a+ib. If we assume λ=c+id, then because of the condition on λ, c2+d2=1, thus λ=c+i1−c2 holds. Also, ∣x∗(x)∣=a2+b2 holds. Considering λx∗(x)=(ac−b1−c2)+i(a1−c2+bc), and since ∣x∗(x)∣ is a non-negative real number,
For convenience, let’s denote c=a2+b2a. And let’s set d=a2+b2−b. Then, (2) and c2+d2=1 hold true. Also, ∣x∗(x)∣=ac−bd=a2+b2 is true. Therefore, for a fixed x, if x∗(x)=a+ib, then for λ=a2+b2a−ia2+b2b∈S, ∣x∗(x)∣=λx∗(x) holds true.