Optimization Theory: Method of Lagrange Multipliers
Description
In nonlinear optimization problems that have a nonlinear objective function, a method called Lagrangian Multiplier Method involves multiplying constraints by something called Lagrangian Multipliers and integrating them into the objective function.
For instance, if an optimization problem for a differentiable scalar function is given as above, our goal is to find that makes the maximum within the feasible solution set . The Lagrangian Multiplier Method is one of the solutions for this, which turns it into the next optimization problem by multiplying Lagrangian Multipliers to and adding it to , a Lagrangian Function which is taken as the objective function.
This newly obtained optimization problem no longer has constraints, and this is not magic but math. In the process of optimizing , the point where the rate of change of with respect to becomes , in fact, ends up satisfying the original constraints, allowing us to use optimization techniques that are constraint-free or have weaker constraints.
Examples
The distribution that maximizes entropy is the uniform distribution 1
Let’s verify through the Lagrangian Multiplier Method that the Shannon entropy is maximized in the uniform distribution. Suppose the probability distribution of a random variable is as follows regarding .
Then, the optimization problem to find that maximizes entropy can be written as follows.
The constraint is because the sum of all probabilities must be . The original objective function cannot indefinitely increase due to this constraint. The Lagrangian function for this problem is as follows. Then, when is partially differentiated with respect to into it implies thus, it must be Meanwhile, from the partial differentiation with respect to , hence, the probability distribution that maximizes entropy precisely is In other words, the uniform distribution maximizes entropy.
■
Support Vector Machine
In machine learning, Support Vector Machine can emerge as a technique to find hyperplanes through the Lagrangian Multiplier Method. Refer to the following section:
Luenberger. (2021). Linear and Nonlinear Programming (5th Edition): p370~371. ↩︎