TL;DR
I walk through a cool and possibly less known result connecting convexity and the triangle inequalities for norms. Using this result, typical proofs of the triangle inequality for a proposed norm function are significantly simplified. This exposition is based on (Chapter 3, Robinson 2020) 1.
Background - Norms
Normed linear spaces are a natural setting for much applied mathematics and statistics. These are vector spaces,
Definition 1 (Norms in vector spaces) For a given vector space
-
Positive definiteness: For a
, if then . -
Absolute homogeneity:
, for a . -
Triangle inequality:
, for a .
Remarks
Remark (Derived properties from Definition 1). We note that a norm, per Definition 1, in fact, implies the following properties:
In Definition 1, we can always replace positive definiteness with the stronger claim, namely that
In short, we want to show that the reverse implication to positive definiteness always holds, i.e., . To prove this observe that using absolute homogeneity in Definition 1, we have: As required.We also have that
, for a . To see this, observe that for a In effect this means the co-domain can always be changed from to .Since these can always be derived directly from Definition 1, as shown, we can keep Definition 1 in its minimal form as noted here.
These ideas work for seminorms as well, see here for more details.
Main theorem
Theorem 1 (Characterization of norm triangle inequality) Let
-
Positive definiteness: For a
, if then . -
Absolute homogeneity:
, for a .
We then have that:
In simple terms, the importance of Theorem 1 (as captured by Equation 1) can be summarized as follows:
Let
be a function satisfying positive definiteness and absolute homogeneity. Then satisfies the triangle inequality if and only if the unit ba induced by , i.e., , is a convex set.
Remarks
Remark. In Theorem 1, we note the following:
- The function
, is a norm-like function, and only becomes a valid norm per Definition 1 once we establish the triangle inequality, i.e., . - To prove the triangle inequality for
, the necessary condition of Theorem 1 to establish is: which will imply the triangle inequality for - huzzah! - The nice thing is, proving the convexity of
can be much easier to show than trying to prove the triangle inequality property of directly, as we will soon see. -
Subtle point: note that here we had to assume that the co-domain of
is non-negative (not ), i.e., . This is because in a typical norm, which satisfies the triangle inequality, is always shown to be non-negative (see remark below Definition 1 for more details). Here we impose non-negativity of as an additional constraint to establish the triangle inequality property for . This is not an issue, since one would always first check the non-negativity of a candidate norm-like function .
Applications: Minkowski inequalities
Before getting into the details of the proof, let’s just see what Theorem Theorem 1 can do! We’ll consider two related applications taken from (Lemma 3.6, Example 3.13 Robinson 2020), respectively.
Application 1: -norm triangle inequality in
Example 1 (Minkowski inequality in finite dimensions) Let us consider
One can show that
It follows that
In fact, since it
Application 2: -norm triangle inequality
We can also similarly prove the triangle inequality norms involving integrals efficiently. This is seen in the next example.
Let us consider
Let us define
It follows that
Again, we can now denote
Punchline: what did Theorem 1 buy us?
We just saw that applying Theorem 1 enabled us to write very short proofs of Minkowski’s inequality in
To appreciate this approach, note that proving Minkowski’s inequality typicay requires one to first prove Young’s inequality and then Hölder’s inequality. Moreover these need to be done separately in
Proof of Theorem 1
Assuming
Proof - easy direction
Assume that
Proof
Proof. (
Assume that
We observe that for
Which implies the convexity of
Proof - interesting direction
Assume
Proof
Proof. (
Assume
Let
Case 1: Let
Case 2: Let
Case 3: Let
Case 4: Let
By the assumed convexity of
As required.
Recap
In this article we learned the following about Theorem 1:
- It gives an alternative way to characterize the triangle inequality for norm-like functions.
- Using this characterization we can prove the triangle inequality for such norm-like functions using the convexity of the unit ba induced by such functions.
- This is usuay easier since we have lots of tools from convex analysis to help us prove the convexity of
. - We saw this in action since Theorem 1 enabled us to write very short proofs of Minkowski’s inequality in
and .
In summary, if you have a norm-like function for which you are trying to establish the triangle inequality, try out Theorem 1 💯!
Acknowledgements
I thank Prof. James Robinson for providing several technical clarifications on Theorem 1. I thank Mikhail Popov for creating the wikipediapreview
R package, which enable an easy interface for Wikipedia Context Cards in Rmd
files. These Context Cards enable the hover over preview for Wikipedia articles. I thank Jewel Johnson for providing this helpful guide to enable fixed TOC for this article. I thank Dr. Joel Nitta for providing these instructions to enable me to switch to the giscus comments system. Much of these distill site improvements were brought to our attention due to the excellent distillery site run by Prof. John Paul Helveston.
References
Footnotes
Note: The presentation in this post is intentionay verbose. The goal is to give lots of intuition of the key result and its usefulness, and ensure that the proofs are rigorous. It is written with an empathetic mindset to newcomers, and to myself for future reference.↩︎
We refer to such an
satisfying these properties, as a norm-like function.↩︎
Reuse
Citation
@online{shrotriya2022,
author = {Shamindra Shrotriya},
editor = {},
title = {Characterizing Norm Triangle Inequalites via Convexity},
date = {2022-02-12},
url = {https://www.shamindras.com/posts/2021-12-31-shrotriya2021normtriconvexity},
langid = {en}
}