Multivariable Second Derivative Test

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the primary purpose of using the quadratic approximation in the context of the Hessian matrix?

  • To simplify complex functions for easier computation.
  • To determine the stability of numerical methods.
  • To formulate a multivariable second derivative test for identifying local extrema. (correct)
  • To solve systems of linear equations.

Why are there 2's in front of the cross-terms in the quadratic form associated with the Hessian matrix?

  • To normalize the quadratic form.
  • To account for both $m_{ij}$ and $m_{ji}$ in a symmetric matrix where $i < j$. (correct)
  • To simplify the calculation of eigenvalues.
  • To ensure the matrix is positive definite.

Suppose a function $f: R^n \rightarrow R$ has a critical point at $a$. What information is needed to classify this critical point using the Hessian matrix?

  • The eigenvalues of the Hessian matrix at $a$. (correct)
  • The determinant of the Hessian matrix at $a$.
  • The trace of the Hessian matrix at $a$.
  • The rank of the Hessian matrix at $a$.

How does the second derivative test in single-variable calculus relate to the Hessian matrix in multivariable calculus?

<p>The single-variable test is a special case of the Hessian matrix when $n = 1$. (C)</p> Signup and view all the answers

In the context of a function $f: R^n \rightarrow R$, what indicates that a critical point (a) is a saddle point?

<p>The Hessian matrix $(Hf)(a)$ is indefinite. (C)</p> Signup and view all the answers

What is the quadratic approximation of a function $f: R^n \rightarrow R$ near a point $a \in R^n$, where $h \in R^n$ is near 0?

<p>$f(a+h) \approx f(a) + \nabla f(a) \cdot h + \frac{1}{2} h^T (Hf)(a) h$. (C)</p> Signup and view all the answers

For a symmetric ( n \times n ) matrix ( A ), how is the scalar function ( q_A(x) ) defined?

<p>( q_A(x) = x^T A x ) (C)</p> Signup and view all the answers

In the context of quadratic forms, what is the significance of a "saddle" appearance of the surface graph ( z = f(x, y) ) near a critical point ( a = (a_1, a_2) )?

<p>It indicates that the part of the surface graph over one line through ( a ) has a local minimum, while over another line it has a local maximum. (A)</p> Signup and view all the answers

Why is it important to analyze the Hessian matrix at a critical point, rather than relying solely on contour plots?

<p>The linear algebra of the Hessian provides precise information about critical points, which is particularly useful when contour plots are unavailable or complex, especially for functions with more than two variables. (C)</p> Signup and view all the answers

For a function $f: R^n \rightarrow R$ with a critical point at $a$, if the Hessian $(Hf)(a)$ is positive-definite, what can be concluded?

<p>$a$ is a local minimum of $f$. (A)</p> Signup and view all the answers

What does it mean for a symmetric matrix to be 'indefinite' in the context of the second derivative test?

<p>It has both positive and negative eigenvalues. (C)</p> Signup and view all the answers

If the Hessian matrix (Hf)(a) is positive-semidefinite, but not positive-definite, what does this imply about the critical point 'a'?

<p>The test is inconclusive, and further investigation is needed. (C)</p> Signup and view all the answers

In the context of economics and thermodynamics, what property of a function ensures a unique global extremum at a critical point?

<p>The function is strictly convex. (A)</p> Signup and view all the answers

For a diagonal matrix, how can you determine its definiteness?

<p>Inspect the signs of its diagonal entries. (A)</p> Signup and view all the answers

Given a symmetric matrix H, what condition indicates that it is indefinite based on its associated quadratic form q(x)?

<p>q(x) has both positive and negative values. (B)</p> Signup and view all the answers

What is a Gram matrix, and what can be generally said about its definiteness?

<p>It is a matrix product MTM that is always positive-semidefinite. (D)</p> Signup and view all the answers

What is the primary use of the Cholesky decomposition?

<p>Inverting covariance matrices in least squares and Hessians in Newton's method for optimization. (D)</p> Signup and view all the answers

Why does the text emphasize examples with 2-variable functions when illustrating the use of the Hessian?

<p>To demonstrate that the Hessian is encoding the geometry of both the contour plot and surface graph at each critical point, which is easily visualized in 2 dimensions. (B)</p> Signup and view all the answers

If the determinant of the Hessian matrix is negative at a critical point, what can be immediately deduced?

<p>The critical point is a saddle point. (C)</p> Signup and view all the answers

Given that the eigenvalues of the Hessian matrix at a critical point are 5 and -3, what can be concluded about the behavior of the function near that point?

<p>The function has a saddle point. (D)</p> Signup and view all the answers

How can the definiteness properties of the Hessian matrix be used to describe the behavior of level curves near a critical point?

<p>When the Hessian produces eigenvectors, the level curves are approximated by ellipses or hyperbolas aligned with the eigenvector directions. (B)</p> Signup and view all the answers

What is the 'eigenline' in the context of contour plots and the Hessian matrix?

<p>A line of symmetry in the contour plot that corresponds to an eigenvector of the Hessian matrix. (D)</p> Signup and view all the answers

What information does Sylvester's Criterion provide?

<p>A criteria for (semi-)definiteness of symmetric ( n \times n ) matrices involving ( k \times k ) determinants for all ( k \leq n ). (D)</p> Signup and view all the answers

What is the result of using the unit eigenvectors w' = wi/||wi|| during level curve approximation?

<p>Encodes the precise geometry, and avoids any distortion factors in calculations. (D)</p> Signup and view all the answers

Flashcards

Hessian Matrix

A symmetric matrix of second partial derivatives of a function.

Critical Point

A point where all first partial derivatives of a function are zero.

Local Minimum

A point where the function has a local minimum value.

Local Maximum

A point where the function has a local maximum value.

Signup and view all the flashcards

Saddle Point

A critical point that is neither a local maximum nor a local minimum.

Signup and view all the flashcards

Positive-Definite Matrix

A matrix where all values are larger than 0.

Signup and view all the flashcards

Negative-Definite Matrix

A matrix for which all values are negative or zero.

Signup and view all the flashcards

Indefinite Matrix

A matrix with both positive and negative eigenvalues.

Signup and view all the flashcards

Quadratic Approximation

Approximating a function using a quadratic polynomial near a point.

Signup and view all the flashcards

Special Pair Lines

Lines through a critical point where the function has opposite local extrema.

Signup and view all the flashcards

Lines of Symmetry

Function decreases quickly along line, has orthogonal eigenvectors w & w'.

Signup and view all the flashcards

Strictly Convex

The condition where a function stays convex.

Signup and view all the flashcards

Unique Global Extremum

Critical point provides the best for overall data.

Signup and view all the flashcards

Anti-Diagonal Matrix

An nxn matrix where the diagnol entries are zero.

Signup and view all the flashcards

Gram Matrix

A symmetric matrix of the form

Signup and view all the flashcards

Cholesky Decomposition

This is a better version of LU-decomposition

Signup and view all the flashcards

Hessian Signs

The determinant and trace.

Signup and view all the flashcards

Sign Dependance

The level sets on hyperbolas centered at (0, 0).

Signup and view all the flashcards

Eigenvalue Techniques

How to approach problems of this nature.

Signup and view all the flashcards

Guassian Elimination

The process performed by the engineer.

Signup and view all the flashcards

Study Notes

  • Chapter 26 applies the quadratic approximation to functions from R" to R near critical points to create a multivariable second derivative test.
  • This test determines if the graph of f inside R^(n+1) near a point is bowl-shaped (local maximum), upward (local minimum), or saddle-like.
  • The Hessian matrix (Hf)(a) and its associated quadratic form are essential tools.

Quadratic Form

  • Defined as q(x) = ∑ bᵢⱼ xᵢxⱼ (where the sum of i is from 1 to n, and the sum of i and j is constrained by 1 <= i < j <= n)
  • bij = fxixj(a), connecting quadratic forms with symmetric matrices based on the link in (20.3.1) and Example 20.3.12
  • Understanding the geometry of level sets q(x1, . . ., xn) = c is a key question.
  • Eigenvectors are used to systematically understand level sets for any n, building on Section 25.4 results for n = 2
  • Analyzing the quadratic form associated with the Hessian at a critical point a of f : R" → R determines whether a is a local maximum, local minimum, or saddle point.
  • This approach recovers the second derivative test from single-variable calculus when n = 1.

Chapter Objectives

  • Identify local maxima, minima, and saddle points from contour plots when n = 2
  • Determine if a critical point a ∈ R" is a local maximum, minimum, or saddle point when (Hf)(a) is a diagonal matrix
  • Use eigenvalues to determine the definiteness of the Hessian and use that information to identify local maxima, minima, and saddle points

Definiteness and Saddle Points

  • The quadratic approximation to f near a point a ∈ R" is given by f(a + h) ≈ f(a) + (∇f)(a) · h + (1/2)hᵀ((Hf)(a))h for h ∈ R" near 0
  • For n = 1, this simplifies to the Taylor approximation f(a + h) ≈ f(a) + f'(a)h + (1/2)f"(a)h²
  • At a critical point a, where (∇f)(a) = 0, the quadratic approximation becomes f(a + h) ≈ f(a) + (1/2)hᵀ((Hf)(a))h

Single-Variable Calculus

  • If f"(a) > 0 then (1/2)f"(a)h² is positive for all h ≠ 0, indicating a local minimum at a
  • Conversely, if f"(a) < 0 then (1/2)f"(a)h² is negative for all h ≠ 0, indicating a local maximum at a
  • To determine if f(a) is a local maximum/minimum, we examine if q(Hf)(a)(h) = hᵀ((Hf)(a))h is consistently negative/positive for small h near 0
  • Definition 24.2.2 asks: is the Hessian matrix (Hf)(a) negative-definite, positive-definite, or something else?
  • Variation in values of an n-variable quadratic form near 0 becomes more complex when n > 1.
  • For n = 2, contour plots near (0,0) illustrate this complexity, with level sets forming hyperbolas, ellipses, or pairs of parallel lines
  • For an n × n symmetric matrix A, qa(x) = xᵀAx ∈ R is a function of x ∈ R". Examples include:
  • n = 2: ax² + by² + 2uxy
  • n = 3: ax² + by² + cz² + 2uxy + 2vxz + 2wyz
  • Off-diagonal matrix entries contribute a factor of 2 to the right side of the equations above.
  • Critical points motivate interest in determining where such functions q(x) are positive or negative.
  • For n > 1, there is the possibility of a saddle point, which has no analogue for n = 1.
  • For a critical point a = (a1, a2) ∈ R² of f, the surface graph z = f(x, y) may have a local minimum along one line L in R² through a, and a local maximum along another line L'
  • This results in a "saddle" appearance of the surface graph z = f(x, y) near (a1, a2, f(a1, a2)) ∈ R³

Saddle Shape

  • In Example 10.2.8 with f(x, y) = x² - y², a "saddle" appears over (0,0) ∈ R²
  • This shape stems from vertical planes P and P' that cut the surface graph S through (0,0, f(0, 0)) along curves C and C' with opposite local extrema
  • The absence of a local extremum can occur along a line through a saddle point
  • planes through (0, 0, f(0, 0)) correspond to choosing a line l in the xy-plane through (0,0), with the curve being the graph of f(x, y) over the line l
  • Many lines l exhibit local extrema at (0,0) for f(x,y) = x² - y² = , for example, local maximum, local minimum

Definition of a Saddle Point

  • A critical point a ∈ Rⁿ is a saddle point of f if there are distinct lines L, L' in Rⁿ through a such that f evaluated on L has a local maximum at a, and f evaluated on L' has a local minimum at a
  • L and L' can be written in parametric forms {a + tv} and {a + tv'}
  • Functions g(t) = f(a + tv) and h(t) = f(a + tv') indicate the behavior of f on points of L and L'
  • The saddle-point condition dictates that g has local maximum at t = 0 and h has a local minimum at t = 0
  • Contour plots near saddle points for n = 2 show that among pairs of lines through a with opposite local extrema, a special pair exists: the "lines of symmetry"
  • The lines of symmetry reflect the contour plot near a and are given by {a + tw} and {a + tw'}, where w and w' are orthogonal eigenvectors of (Hf)(a)

Second Derivative Test

  • For a critical point a ∈ Rⁿ of f, and small h, we have f(a + h) ≈ f(a) + (1/2)hᵀ((Hf)(a))h.
  • If q(Hf)(a)(h) > 0 then f(a + h) > f(a), whereas if q(Hf)(a)(h) < 0 then f(a + h) < f(a).
  • This leads to the Second Derivative Test, Version I for f : R" → R at a critical point a ∈ R":
  • If (Hf)(a) is positive-definite then a is a local minimum of f
  • If (Hf)(a) is negative-definite then a is a local maximum of f
  • If (Hf)(a) is indefinite then a is a saddle point of f
  • If the Hessian (Hf)(a) is positive-semidefinite or negative-semidefinite but not definite, more information is needed.
  • To find definiteness properties of the Hessian at a, we will use determinates and eigenvalues which is challenging
  • For n = 1, we apply above since hᵀ(Hf)(a)h = f"(a)h²

Convexity

  • (Hf)(a) is positive-definite for all a ∈ R" exactly when f is "strictly convex": f((1-t)v + tw) < (1-t)f(v) + tf(w) for all v ≠ w and 0 < t < 1
  • If the symmetric n × n matrix (Hf)(a) is diagonal, definiteness properties can be determined by inspecting signs of diagonal entries.

Diagonal entries

  • Consider H with diagonal entries 11, 3, 4 the matrix.
  • The diagonal entries are the coefficients in the associated quadratic form which is positive when ≠ 0
  • If all aⱼ's are positive, H is positive-definite (aⱼ > 0). H is indefinite if one aᵢ's is positive and another is negative
  • It's easy to check definiteness/indefiniteness of a diagonal n×n matrix D by inspection - evaluate qD(x) on coordinate axes
  • Suppose H is a 2 × 2 "anti-diagonal" matrix. This is indefinite because is indefinite because 10xy has positive and negative values

Gram Matrix

  • If M is any m × n matrix, the n × n matrix B = MᵀM (Gram matrix) is symmetric (Theorem 20.3.8)
  • For any v ∈ Rⁿ, qB(v) = vᵀ(MᵀM)v = ||Mv||² ≥ 0. Hence, B is positive-semidefinite
  • If the null space of M is {0} then B is positive-definite

Gram Matrices

  • Every positive-semidefinite symmetric n × n matrix B is MᵀM for an n x n matrix M
  • QR-decomposition (possibly non-invertible M) gives QR where Q is an n x k matrix orthogonal columns gives: B = MᵀM = RᵀR
  • This special LU-decomposition (L = Uᵀ, with U = R positive diagonal) is the Cholesky decomposition

Second Derivative Test

  • Analyze whether an extremum at a point is local
  • restrictions of h to the coordinate axes have a local minimum at the origin of the axes
  • The Hessian matrix for h is diagonal everywhere
  • By Theorem 26.1.5, the critical point is a local minimum for h with a positive definite point

Hessian Encoding Geometry

  • the real significance is for applications beyond the 2-variable case
  • Find all the critical points for a quadratic function like 3x²y + 2y³ - xy and classify each as a local maximum, local minimum, or a saddle point
  • The Hessian is diagonal and anti-diagonal where the matrix is definite, positive definite, or negative definite

Contour plots

  • At each critical point the definiteness matches the contour plot

  • Can look through a contour plot or use linear algebra to see the critical points to determine optimization and thermodynamics

  • The local maximum is at a point where all partials vanish, making the result definite negative

  • Find if is a local maximum, a local minimum, a saddle point, or perhaps something worse and analyze Hessian

  • constant λ disappears and we obtain

Hessian

  • The Hessian is not diagonal so not easily definitive through the eigenvalues. In other words, the spectrum is not diagonal
  • Not always a saddle point due to symmetry
  • In two variables, this can appear as a diagram like tacnode

Eigenvectors and Hessians

  • Now use Theorem 26.3.4 to understand the behavior of some 2-variable functions near critical points, and also apply the eigenvalue method to a 3-variable function with non-diagonal Hessian at two critical points
  • If a point is stable and positive definite, it is a local minimum and vice versa

Computing Eigenvalues

  • Eigenvalues can be computed by knowing the trace, or if it is already a saddle-point due to symmetry on a 2D or 3D level
  • One seeks a local minimum, and must determine what form the Hessian takes, whether diagonal or zeroed
  • If not zeroed the same is applied
  • A positive definite shows that the approximate critical point we found is a local has determinant

Hessian at a general point

  • The eigenvalues are negative if the points are negative and and negative-definite.
  • The point attains a global maximum in the interior near the critical level

Critical Points

  • Find the critical points and the behavior of each
  • the vanishing of fy forces x = y
  • The determinant computes to either positive or negative as a definite equation. It also depends on the common sign of eigenvalues
  • The method helps prove what must be positive

Relationships in this Chapter

  • The relationship between the behavior of a function f : R" → R near a critical point a and the eigenvalues and eigenvectors of the Hessian (Hf)(a), especially when all eigenvalues are nonzero
  • A Hessian having only one nonzero eigenvalue with n = 2,3 the computer vision technique underlies ridge detection
  • Useful in areas such as: facial recognition with wrinkles, detecting blood vessels in brain imaging, and digital humanities to reconstruct

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser