Cauchy's functional equation

Functional equation

Cauchy's functional equation is the functional equation:

f ( x + y ) = f ( x ) + f ( y ) .   {\displaystyle f(x+y)=f(x)+f(y).\ }

A function f {\displaystyle f} that solves this equation is called an additive function. Over the rational numbers, it can be shown using elementary algebra that there is a single family of solutions, namely f : x c x {\displaystyle f\colon x\mapsto cx} for any rational constant c . {\displaystyle c.} Over the real numbers, the family of linear maps f : x c x , {\displaystyle f:x\mapsto cx,} now with c {\displaystyle c} an arbitrary real constant, is likewise a family of solutions; however there can exist other solutions not of this form that are extremely complicated. However, any of a number of regularity conditions, some of them quite weak, will preclude the existence of these pathological solutions. For example, an additive function f : R R {\displaystyle f\colon \mathbb {R} \to \mathbb {R} } is linear if:

  • f {\displaystyle f} is continuous (Cauchy, 1821). In fact, it suffices for f {\displaystyle f} to be continuous at one point (Darboux, 1875).
  • f {\displaystyle f} is monotonic on any interval.
  • f {\displaystyle f} is bounded on any interval.
  • f {\displaystyle f} is Lebesgue measurable.

On the other hand, if no further conditions are imposed on f , {\displaystyle f,} then (assuming the axiom of choice) there are infinitely many other functions that satisfy the equation. This was proved in 1905 by Georg Hamel using Hamel bases. Such functions are sometimes called Hamel functions.[1]

The fifth problem on Hilbert's list is a generalisation of this equation. Functions where there exists a real number c {\displaystyle c} such that f ( c x ) c f ( x ) {\displaystyle f(cx)\neq cf(x)} are known as Cauchy-Hamel functions and are used in Dehn-Hadwiger invariants which are used in the extension of Hilbert's third problem from 3D to higher dimensions.[2]

This equation is sometimes referred to as Cauchy's additive functional equation to distinguish it from Cauchy's exponential functional equation f ( x + y ) = f ( x ) f ( y ) , {\displaystyle f(x+y)=f(x)f(y),} Cauchy's logarithmic functional equation f ( x y ) = f ( x ) + f ( y ) , {\displaystyle f(xy)=f(x)+f(y),} and Cauchy's multiplicative functional equation f ( x y ) = f ( x ) f ( y ) . {\displaystyle f(xy)=f(x)f(y).}

Solutions over the rational numbers

A simple argument, involving only elementary algebra, demonstrates that the set of additive maps f : V W {\displaystyle f\colon V\to W} , where V , W {\displaystyle V,W} are vector spaces over an extension field of Q {\displaystyle \mathbb {Q} } , is identical to the set of Q {\displaystyle \mathbb {Q} } -linear maps from V {\displaystyle V} to W {\displaystyle W} .

Theorem: Let f : V W {\displaystyle f\colon V\to W} be an additive function. Then f {\displaystyle f} is Q {\displaystyle \mathbb {Q} } -linear.

Proof: We want to prove that any solution f : V W {\displaystyle f\colon V\to W} to Cauchy’s functional equation, f ( x + y ) = f ( x ) + f ( y ) {\displaystyle f(x+y)=f(x)+f(y)} , satisfies f ( q v ) = q f ( v ) {\displaystyle f(qv)=qf(v)} for any q Q {\displaystyle q\in \mathbb {Q} } and v V {\displaystyle v\in V} . Let v V {\displaystyle v\in V} .

First note f ( 0 ) = f ( 0 + 0 ) = f ( 0 ) + f ( 0 ) {\displaystyle f(0)=f(0+0)=f(0)+f(0)} , hence f ( 0 ) = 0 {\displaystyle f(0)=0} , and therewith 0 = f ( 0 ) = f ( v + ( v ) ) = f ( v ) + f ( v ) {\displaystyle 0=f(0)=f(v+(-v))=f(v)+f(-v)} from which follows f ( v ) = f ( v ) {\displaystyle f(-v)=-f(v)} .

Via induction, f ( m v ) = m f ( v ) {\displaystyle f(mv)=mf(v)} is proved for any m N { 0 } {\displaystyle m\in \mathbb {N} \cup \{0\}} .

For any negative integer m Z {\displaystyle m\in \mathbb {Z} } we know m N {\displaystyle -m\in \mathbb {N} } , therefore f ( m v ) = f ( ( m ) ( v ) ) = ( m ) f ( v ) = ( m ) ( f ( v ) ) = m f ( v ) {\displaystyle f(mv)=f((-m)(-v))=(-m)f(-v)=(-m)(-f(v))=mf(v)} . Thus far we have proved

f ( m v ) = m f ( v ) {\displaystyle f(mv)=mf(v)} for any m Z {\displaystyle m\in \mathbb {Z} } .

Let n N {\displaystyle n\in \mathbb {N} } , then f ( v ) = f ( n n 1 v ) = n f ( n 1 v ) {\displaystyle f(v)=f(nn^{-1}v)=nf(n^{-1}v)} and hence f ( n 1 v ) = n 1 f ( v ) {\displaystyle f(n^{-1}v)=n^{-1}f(v)} .

Finally, any q Q {\displaystyle q\in \mathbb {Q} } has a representation q = m n {\displaystyle q={\frac {m}{n}}} with m Z {\displaystyle m\in \mathbb {Z} } and n N {\displaystyle n\in \mathbb {N} } , so, putting things together,

f ( q v ) = f ( m n v ) = f ( 1 n ( m v ) ) = 1 n f ( m v ) = 1 n m f ( v ) = q f ( v ) {\displaystyle f(qv)=f\left({\frac {m}{n}}\,v\right)=f\left({\frac {1}{n}}\,(mv)\right)={\frac {1}{n}}\,f(mv)={\frac {1}{n}}\,m\,f(v)=qf(v)} , q.e.d.

Properties of nonlinear solutions over the real numbers

We prove below that any other solutions must be highly pathological functions. In particular, it is shown that any other solution must have the property that its graph { ( x , f ( x ) ) | x R } {\displaystyle \{(x,f(x))\vert x\in \mathbb {R} \}} is dense in R 2 , {\displaystyle \mathbb {R} ^{2},} that is, that any disk in the plane (however small) contains a point from the graph. From this it is easy to prove the various conditions given in the introductory paragraph.

Lemma — Let t > 0 {\displaystyle t>0} . If f {\displaystyle f} satisfies the Cauchy functional equation on the interval [ 0 , t ] {\displaystyle [0,t]} , but is not linear, then its graph is dense on the strip [ 0 , t ] × R {\displaystyle [0,t]\times \mathbb {R} } .

Proof

WLOG, scale f {\displaystyle f} on the x-axis and y-axis, so that f {\displaystyle f} satisfies the Cauchy functional equation on [ 0 , 1 ] {\displaystyle [0,1]} , and f ( 1 ) = 1 {\displaystyle f(1)=1} . It suffices to show that the graph of f {\displaystyle f} is dense in ( 0 , 1 ) × R {\displaystyle (0,1)\times \mathbb {R} } , which is dense in [ 0 , 1 ] × R {\displaystyle [0,1]\times \mathbb {R} } .

Since f {\displaystyle f} is not linear, we have f ( a ) a {\displaystyle f(a)\neq a} for some a ( 0 , 1 ) {\displaystyle a\in (0,1)} .

Claim: The lattice defined by L := { ( r 1 + r 2 a , r 1 + r 2 f ( a ) ) : r 1 , r 2 Q } {\displaystyle L:=\{(r_{1}+r_{2}a,r_{1}+r_{2}f(a)):r_{1},r_{2}\in \mathbb {Q} \}} is dense in R 2 {\displaystyle \mathbb {R} ^{2}} .

Consider the linear transformation A : R 2 R 2 {\displaystyle A:\mathbb {R} ^{2}\to \mathbb {R} ^{2}} defined by

A ( x , y ) = [ 1 a 1 f ( a ) ] [ x y ] {\displaystyle A(x,y)={\begin{bmatrix}1&a\\1&f(a)\end{bmatrix}}{\begin{bmatrix}x\\y\end{bmatrix}}}

With this transformation, we have L = A ( Q 2 ) {\displaystyle L=A(\mathbb {Q} ^{2})} .

Since det A = f ( a ) a 0 {\displaystyle \det A=f(a)-a\neq 0} , the transformation is invertible, thus it is bicontinuous. Since Q 2 {\displaystyle \mathbb {Q} ^{2}} is dense in R 2 {\displaystyle \mathbb {R} ^{2}} , so is L {\displaystyle L} . {\displaystyle \square }

Claim: if r 1 , r 2 Q {\displaystyle r_{1},r_{2}\in \mathbb {Q} } , and r 1 + r 2 a ( 0 , 1 ) {\displaystyle r_{1}+r_{2}a\in (0,1)} , then f ( r 1 + r 2 a ) = r 1 + r 2 f ( a ) {\displaystyle f(r_{1}+r_{2}a)=r_{1}+r_{2}f(a)} .

If r 1 , r 2 0 {\displaystyle r_{1},r_{2}\geq 0} , then it is true by additivity. If r 1 , r 2 < 0 {\displaystyle r_{1},r_{2}<0} , then r 1 + r 2 a < 0 {\displaystyle r_{1}+r_{2}a<0} , contradiction.

If r 1 0 , r 2 < 0 {\displaystyle r_{1}\geq 0,r_{2}<0} , then since r 1 + r 2 a > 0 {\displaystyle r_{1}+r_{2}a>0} , we have r 1 > 0 {\displaystyle r_{1}>0} . Let k {\displaystyle k} be a positive integer large enough such that r 1 k , r 2 a k ( 0 , 1 ) {\displaystyle {\frac {r_{1}}{k}},{\frac {-r_{2}a}{k}}\in (0,1)} . Then we have by additivity:

f ( r 1 k + r 2 a k ) + f ( r 2 a k ) = f ( r 1 k ) {\displaystyle f\left({\frac {r_{1}}{k}}+{\frac {r_{2}a}{k}}\right)+f\left({\frac {-r_{2}a}{k}}\right)=f\left({\frac {r_{1}}{k}}\right)}

That is,

1 k f ( r 1 + r 2 a ) + r 2 k f ( a ) = r 1 k {\displaystyle {\frac {1}{k}}f\left(r_{1}+r_{2}a\right)+{\frac {-r_{2}}{k}}f\left(a\right)={\frac {r_{1}}{k}}}
{\displaystyle \square }

Thus, the graph of f {\displaystyle f} contains L ( ( 0 , 1 ) × R ) {\displaystyle L\cap ((0,1)\times \mathbb {R} )} , which is dense in ( 0 , 1 ) × R {\displaystyle (0,1)\times \mathbb {R} } .

Existence of nonlinear solutions over the real numbers

The linearity proof given above also applies to f : α Q R , {\displaystyle f\colon \alpha \mathbb {Q} \to \mathbb {R} ,} where α Q {\displaystyle \alpha \mathbb {Q} } is a scaled copy of the rationals. This shows that only linear solutions are permitted when the domain of f {\displaystyle f} is restricted to such sets. Thus, in general, we have f ( α q ) = f ( α ) q {\displaystyle f(\alpha q)=f(\alpha )q} for all α R {\displaystyle \alpha \in \mathbb {R} } and q Q . {\displaystyle q\in \mathbb {Q} .} However, as we will demonstrate below, highly pathological solutions can be found for functions f : R R {\displaystyle f\colon \mathbb {R} \to \mathbb {R} } based on these linear solutions, by viewing the reals as a vector space over the field of rational numbers. Note, however, that this method is nonconstructive, relying as it does on the existence of a (Hamel) basis for any vector space, a statement proved using Zorn's lemma. (In fact, the existence of a basis for every vector space is logically equivalent to the axiom of choice.) There exists models[3] where all sets of reals are measurable which are consistent with ZF + DC, and therein all solutions are linear.[4]

To show that solutions other than the ones defined by f ( x ) = f ( 1 ) x {\displaystyle f(x)=f(1)x} exist, we first note that because every vector space has a basis, there is a basis for R {\displaystyle \mathbb {R} } over the field Q , {\displaystyle \mathbb {Q} ,} i.e. a set B R {\displaystyle {\mathcal {B}}\subset \mathbb {R} } with the property that any x R {\displaystyle x\in \mathbb {R} } can be expressed uniquely as x = i I λ i x i , {\textstyle x=\sum _{i\in I}{\lambda _{i}x_{i}},} where { x i } i I {\displaystyle \{x_{i}\}_{i\in I}} is a finite subset of B , {\displaystyle {\mathcal {B}},} and each λ i {\displaystyle \lambda _{i}} is in Q . {\displaystyle \mathbb {Q} .} We note that because no explicit basis for R {\displaystyle \mathbb {R} } over Q {\displaystyle \mathbb {Q} } can be written down, the pathological solutions defined below likewise cannot be expressed explicitly.

As argued above, the restriction of f {\displaystyle f} to x i Q {\displaystyle x_{i}\mathbb {Q} } must be a linear map for each x i B . {\displaystyle x_{i}\in {\mathcal {B}}.} Moreover, because x i q f ( x i ) q {\displaystyle x_{i}q\mapsto f(x_{i})q} for q Q , {\displaystyle q\in \mathbb {Q} ,} it is clear that f ( x i ) x i {\displaystyle f(x_{i}) \over x_{i}} is the constant of proportionality. In other words, f : x i Q R {\displaystyle f\colon x_{i}\mathbb {Q} \to \mathbb {R} } is the map ξ [ f ( x i ) / x i ] ξ . {\displaystyle \xi \mapsto [f(x_{i})/x_{i}]\xi .} Since any x R {\displaystyle x\in \mathbb {R} } can be expressed as a unique (finite) linear combination of the x i {\displaystyle x_{i}} s, and f : R R {\displaystyle f\colon \mathbb {R} \to \mathbb {R} } is additive, f ( x ) {\displaystyle f(x)} is well-defined for all x R {\displaystyle x\in \mathbb {R} } and is given by:

f ( x ) = f ( i I λ i x i ) = i I f ( x i λ i ) = i I f ( x i ) λ i . {\displaystyle f(x)=f{\Big (}\sum _{i\in I}\lambda _{i}x_{i}{\Big )}=\sum _{i\in I}f(x_{i}\lambda _{i})=\sum _{i\in I}f(x_{i})\lambda _{i}.}

It is easy to check that f {\displaystyle f} is a solution to Cauchy's functional equation given a definition of f {\displaystyle f} on the basis elements, f : B R . {\displaystyle f\colon {\mathcal {B}}\to \mathbb {R} .} Moreover, it is clear that every solution is of this form. In particular, the solutions of the functional equation are linear if and only if f ( x i ) x i {\displaystyle f(x_{i}) \over x_{i}} is constant over all x i B . {\displaystyle x_{i}\in {\mathcal {B}}.} Thus, in a sense, despite the inability to exhibit a nonlinear solution, "most" (in the sense of cardinality[5]) solutions to the Cauchy functional equation are actually nonlinear and pathological.

See also

  • Antilinear map – Conjugate homogeneous additive map
  • Homogeneous function – Function with a multiplicative scaling behaviour
  • Minkowski functional – Function made from a set
  • Semilinear map – homomorphism between modules, paired with the associated homomorphism between the respective base ringsPages displaying wikidata descriptions as a fallback

References

  1. ^ Kuczma (2009), p.130
  2. ^ V.G. Boltianskii (1978) "Hilbert's third problem", Halsted Press, Washington
  3. ^ Solovay, Robert M. (1970). "A Model of Set-Theory in Which Every Set of Reals is Lebesgue Measurable". Annals of Mathematics. 92 (1): 1–56. doi:10.2307/1970696. ISSN 0003-486X.
  4. ^ E. Caicedo, Andrés (2011-03-06). "Are there any non-linear solutions of Cauchy's equation $f(x+y)=f(x)+f(y)$ without assuming the Axiom of Choice?". MathOverflow. Retrieved 2024-02-21.
  5. ^ It can easily be shown that c a r d ( B ) = c {\displaystyle \mathrm {card} ({\mathcal {B}})={\mathfrak {c}}} ; thus there are c c = 2 c {\displaystyle {\mathfrak {c}}^{\mathfrak {c}}=2^{\mathfrak {c}}} functions f : B R , {\displaystyle f\colon {\mathcal {B}}\to \mathbb {R} ,} each of which could be extended to a unique solution of the functional equation. On the other hand, there are only c {\displaystyle {\mathfrak {c}}} solutions that are linear.
  • Kuczma, Marek (2009). An introduction to the theory of functional equations and inequalities. Cauchy's equation and Jensen's inequality. Basel: Birkhäuser. ISBN 9783764387495.
  • Hamel, Georg (1905). "Eine Basis aller Zahlen und die unstetigen Lösungen der Funktionalgleichung: f(x+y) = f(x) + f(y)". Mathematische Annalen.

External links

  • Solution to the Cauchy Equation Rutgers University
  • The Hunt for Addi(c)tive Monster
  • Martin Sleziak; et al. (2013). "Overview of basic facts about Cauchy functional equation". StackExchange. Retrieved 20 December 2015.