Partial Differential Equations/Sobolev spaces

Partial Differential Equations
← Characteristic equations	Sobolev spaces	Calculus of variations →

There are some partial differential equations which have no solution. However, some of them have something like ‘almost a solution’, which we call a weak solution. Among these there are partial differential equations whose weak solutions model processes in nature, just like solutions of partial differential equations which have a solution.

These weak solutions will be elements of the so-called Sobolev spaces. By proving properties which elements of Sobolev spaces in general have, we will thus obtain properties of weak solutions to partial differential equations, which therefore are properties of some processes in nature.

In this chapter we do show some properties of elements of Sobolev spaces. Furthermore, we will show that Sobolev spaces are Banach spaces (this will help us in the next section, where we investigate existence and uniqueness of weak solutions).

The fundamental lemma of the calculus of variations

But first we shall repeat the definition of the standard mollifier defined in chapter 3.

Example 3.4: The standard mollifier $\eta$ , given by

\eta :\mathbb {R} ^{d}\to \mathbb {R} ,\eta (x)={\frac {1}{c}}{\begin{cases}e^{-{\frac {1}{1-\|x\|^{2}}}}&{\text{ if }}\|x\|_{2}<1\\0&{\text{ if }}\|x\|_{2}\geq 1\end{cases}}

, where $c:=\int _{B_{1}(0)}e^{-{\frac {1}{1-\|x\|^{2}}}}dx$ , is a bump function (see exercise 3.2).

Definition 3.13:

For $R\in \mathbb {R} _{>0}$ , we define

\eta _{R}:\mathbb {R} ^{d}\to \mathbb {R} ,\eta _{R}(x)=\eta \left({\frac {x}{R}}\right){\big /}R^{d}

.

Lemma 12.1: (to be replaced by characteristic function version)

Let $g\in L^{p}$ be a simple function, i. e.

g=\sum _{j=1}^{n}b_{j}\chi _{I_{j}}

,

where $I_{j}$ are intervals and $\chi$ is the indicator function. If

\epsilon <1/2{\text{diam}}(I_{j})

,

then $\|g*\eta _{\epsilon }-g\|_{p}\leq 2\epsilon \max _{k\in \{1,\ldots ,n\}}b_{k}$ .

The following lemma, which is important for some theorems about Sobolev spaces, is known as the fundamental lemma of the calculus of variations:

Lemma 12.2:

Let $S\subseteq \mathbb {R} ^{d}$ and let $f,g:S\to \mathbb {R}$ be functions such that $f,g\in L_{\text{loc}}^{1}(S)$ and ${\mathcal {T}}_{f}={\mathcal {T}}_{g}$ . Then $f=g$ almost everywhere.

Proof:

We define

h:\mathbb {R} ^{d}\to \mathbb {R} ,h(x):={\begin{cases}f(x)-g(x)&x\in S\\0&x\notin S\end{cases}}

Weak derivatives

Definition 12.1:

Let $S\subseteq \mathbb {R} ^{d}$ be a set, $p\in [1,\infty ]$ and $f\in L^{p}(S)$ . If $\alpha \in \mathbb {N} _{0}^{d}$ is a $d$ -dimensional multiindex and $g\in L^{p}(S)$ such that

\partial _{\alpha }{\mathcal {T}}_{f}={\mathcal {T}}_{g}

, we call $g$ an $\alpha$ th-weak derivative of $f$ .

Remarks 12.2: If $f\in L^{p}(S)$ is a function and $\alpha \in \mathbb {N} _{0}^{d}$ is a $d$ -dimensional multiindex, any two $\alpha$ th-weak derivatives of $f$ are equal except on a null set. Furthermore, if $\partial _{\alpha }f$ exists, it also is an $\alpha$ th-weak derivative of $f$ .

Proof:

1. We prove that any two $\alpha$ th-weak derivatives are equal except on a nullset.

Let $g,h\in L^{p}(S)$ be two $\alpha$ th-weak derivatives of $f$ . Then we have

{\mathcal {T}}_{g}=\partial _{\alpha }{\mathcal {T}}_{f}={\mathcal {T}}_{h}

Notation 12.3 If it exists, we denote the $\alpha$ th-weak derivative of $f$ by $\partial _{\alpha }f$ , which is of course the same symbol as for the ordinary derivative.

Theorem 12.4:

Let $O\subseteq \mathbb {R} ^{d}$ be open, $p\in [1,\infty ]$ , $f,g\in L^{p}(O)$ and $\alpha \in \mathbb {N} _{0}^{d}$ . Assume that $f,g$ have $\alpha$ -weak derivatives, which we - consistent with notation 12.3 - denote by $\partial _{\alpha }f$ and $\partial _{\alpha }g$ . Then for all $b,c\in \mathbb {R}$ :

\partial _{\alpha }(bf+cg)=b\partial _{\alpha }f+c\partial _{\alpha }g

Proof:

Definition and first properties of Sobolev spaces

Definition and theorem 12.6:

Let $O\subseteq \mathbb {R} ^{d}$ be open, $p\in [1,\infty ]$ , $f,g\in L^{p}(O)$ and $n\in \mathbb {N} _{0}$ . The Sobolev space ${\mathcal {W}}^{n,p}(O)$ is defined as follows:

{\mathcal {W}}^{n,p}(O):=\{f\in L^{p}(O):\forall \alpha \in \mathbb {N} _{0}^{d}{\text{ such that }}|\alpha |\leq n:\partial _{\alpha }f{\text{ exists}}\}

A norm on ${\mathcal {W}}^{n,p}(O)$ is defined as follows:

\|f\|_{{\mathcal {W}}^{n,p}(O)}:=\sum _{|\alpha |\leq n}\left\|\partial _{\alpha }f\right\|_{L^{p}(O)}

With respect to this norm, ${\mathcal {W}}^{n,p}(O)$ is a Banach space.

In the above definition, $\partial _{\alpha }f$ denotes the $\alpha$ th-weak derivative of $f$ .

Proof:

1.

We show that

\|f\|_{{\mathcal {W}}^{n,p}(O)}=\sum _{|\alpha |\leq n}\left\|\partial _{\alpha }f\right\|_{L^{p}(O)}

is a norm.

We have to check the three defining properties for a norm:

$\|f\|_{{\mathcal {W}}^{n,p}(O)}=0\Leftrightarrow f=0$ (definiteness)
$\|cf\|_{{\mathcal {W}}^{n,p}(O)}=|c|\|f\|_{{\mathcal {W}}^{n,p}(O)}$ for every $c\in \mathbb {R}$ (absolute homogeneity)
$\|f+g\|_{{\mathcal {W}}^{n,p}(O)}\leq \|f\|_{{\mathcal {W}}^{n,p}(O)}+\|g\|_{{\mathcal {W}}^{n,p}(O)}$ (triangle inequality)

We start with definiteness: If $f=0$ , then $\|f\|_{{\mathcal {W}}^{n,p}(O)}=0$ , since all the directional derivatives of the constant zero function are again the zero function. Furthermore, if $\|f\|_{{\mathcal {W}}^{n,p}(O)}=0$ , then it follows that $\|f\|_{L^{p}(O)}=0$ implying that $f=0$ as $\|f\|_{L^{p}(O)}$ is a norm.

We proceed to absolute homogeneity. Let $c\in \mathbb {R}$ .

{\begin{aligned}\|cf\|_{{\mathcal {W}}^{n,p}(O)}&:=\sum _{|\alpha |\leq n}\left\|\partial _{\alpha }cf\right\|_{L^{p}(O)}&\\&=\sum _{|\alpha |\leq n}\left\|c\partial _{\alpha }f\right\|_{L^{p}(O)}&{\text{ theorem 12.4}}\\&=\sum _{|\alpha |\leq n}|c|\left\|\partial _{\alpha }f\right\|_{L^{p}(O)}&{\text{ by absolute homogeneity of }}\|\cdot \|_{L^{p}(O)}\\&=|c|\sum _{|\alpha |\leq n}\left\|\partial _{\alpha }f\right\|_{L^{p}(O)}&\\&=:|c|\|f\|_{{\mathcal {W}}^{n,p}(O)}\end{aligned}}

And the triangle inequality has to be shown:

{\begin{aligned}\|f+g\|_{{\mathcal {W}}^{n,p}(O)}&:=\sum _{|\alpha |\leq n}\left\|\partial _{\alpha }(f+g)\right\|_{L^{p}(O)}&\\&=\sum _{|\alpha |\leq n}\left\|\partial _{\alpha }f+\partial _{\alpha }g\right\|_{L^{p}(O)}&{\text{ theorem 12.4}}\\&\leq \sum _{|\alpha |\leq n}\left(\left\|\partial _{\alpha }f\right\|_{L^{p}(O)}+\left\|\partial _{\alpha }g\right\|_{L^{p}(O)}\right)&{\text{ by triangle inequality of }}\|\cdot \|_{L^{p}(O)}\\&=\|f\|_{{\mathcal {W}}^{n,p}(O)}+\|g\|_{{\mathcal {W}}^{n,p}(O)}\end{aligned}}

2.

We prove that ${\mathcal {W}}^{n,p}(O)$ is a Banach space.

Let $(f_{l})_{l\in \mathbb {N} }$ be a Cauchy sequence in ${\mathcal {W}}^{n,p}(O)$ . Since for all $d$ -dimensional multiindices $\alpha \in \mathbb {N} _{0}^{d}$ with $|\alpha |\leq n$ and $m,l\in \mathbb {N}$

\|\partial _{\alpha }f_{l}-\partial _{\alpha }f_{m})\|_{L^{p}(O)}=\|\partial _{\alpha }(f_{l}-f_{m})\|_{L^{p}(O)}\leq \sum _{|\alpha |\leq n}\left\|\partial _{\alpha }(f_{l}-f_{m})\right\|_{L^{p}(O)}

since we only added non-negative terms, we obtain that for all $d$ -dimensional multiindices $\alpha \in \mathbb {N} _{0}^{d}$ with $|\alpha |\leq n$ , $(\partial _{\alpha }f_{l})_{l\in \mathbb {N} }$ is a Cauchy sequence in $L^{p}(O)$ . Since $L^{p}(O)$ is a Banach space, this sequence converges to a limit in $L^{p}(O)$ , which we shall denote by $f_{\alpha }$ .

We show now that $f:=f_{(0,\ldots ,0)}\in {\mathcal {W}}^{n,p}(O)$ and $f_{l}\to f,l\to \infty$ with respect to the norm $\|\cdot \|_{{\mathcal {W}}^{n,p}(O)}$ , thereby showing that ${\mathcal {W}}^{n,p}(O)$ is a Banach space.

To do so, we show that for all $d$ -dimensional multiindices $\alpha \in \mathbb {N} _{0}^{d}$ with $|\alpha |\leq n$ the $\alpha$ th-weak derivative of $f$ is given by $f_{\alpha }$ . Convergence then automatically follows, as

{\begin{aligned}f_{l}\to f,l\to \infty &\Leftrightarrow \|f_{l}-f\|_{{\mathcal {W}}^{n,p}(O)}\to 0,l\to \infty &\\&\Leftrightarrow \sum _{|\alpha |\leq n}\left\|\partial _{\alpha }(f_{l}-f)\right\|_{L^{p}(O)}\to 0,l\to \infty &\\&\Leftrightarrow \sum _{|\alpha |\leq n}\left\|\partial _{\alpha }f_{l}-\partial _{\alpha }f\right\|_{L^{p}(O)}\to 0,l\to \infty &{\text{by theorem 12.4}}\\\end{aligned}}

where in the last line all the summands converge to zero provided that $\partial _{\alpha }f=f_{\alpha }$ for all $d$ -dimensional multiindices $\alpha \in \mathbb {N} _{0}^{d}$ with $|\alpha |\leq n$ .

Let $\varphi \in {\mathcal {D}}(O)$ . Since $\partial _{\alpha }f_{l}\to f_{\alpha }$ and by the second triangle inequality

\|\partial _{\alpha }f-f_{\alpha }\|\geq |\|\partial _{\alpha }f\|-\|f_{\alpha }\||

, the sequence $(\varphi \partial _{\alpha }f_{l})_{l\in \mathbb {N} }$ is, for large enough $l$ , dominated by the function $2\|\varphi \|_{\infty }f_{\alpha }$ , and the sequence $(\partial _{\alpha }\varphi f_{l})_{l\in \mathbb {N} }$ is dominated by the function $2\|\partial _{\alpha }\varphi \|_{\infty }f$ .

incomplete: Why are the dominating functions L1?

Therefore

{\begin{aligned}\int _{\mathbb {R} ^{d}}\partial _{\alpha }\varphi (x)f(x)dx=&\lim _{l\to \infty }\int _{\mathbb {R} ^{d}}\partial _{\alpha }\varphi (x)f_{l}(x)dx&{\text{ dominated convergence}}\\&=\lim _{l\to \infty }(-1)^{|\alpha |}\int _{\mathbb {R} ^{d}}\varphi (x)\partial _{\alpha }f_{l}(x)dx&\\&=(-1)^{|\alpha |}\int _{\mathbb {R} ^{d}}\varphi (x)f_{\alpha }(x)dx&{\text{ dominated convergence}}\end{aligned}}

, which is why $f_{\alpha }$ is the $\alpha$ th-weak derivative of $f$ for all $d$ -dimensional multiindices $\alpha \in \mathbb {N} _{0}^{d}$ with $|\alpha |\leq n$ . $\Box$

Approximation by smooth functions

We shall now prove that for any $L^{p}$ function, we can find a sequence of bump functions converging to that function in $L^{p}$ norm.

approximation by simple functions and lemma 12.1, ||f_eps-f|| le ||f_eps - g_eps|| + ||g_eps - g|| + ||g - f||

Let $\Omega \subset \mathbb {R} ^{d}$ be a domain, let $r>0$ , and $U\subset \Omega$ , such that $U+B_{r}(0)\subseteq \Omega$ . Let furthermore $u\in {\mathcal {W}}^{m,p}(U)$ . Then $\mu _{\epsilon }*f$ is in $C^{\infty }(U)$ for $\epsilon <r$ and $\lim _{\epsilon \to 0}\|\mu _{\epsilon }*f-f\|_{W^{m,p}(U)}=0$ .

Proof: The first claim, that $\mu _{\epsilon }*f\in C^{\infty }(U)$ , follows from the fact that if we choose

{\tilde {f}}(x)={\begin{cases}f(x)&x\in U\\0&x\notin U\end{cases}}

Then, due to the above section about mollifying $L^{p}$ -functions, we know that the first claim is true.

The second claim follows from the following calculation, using the one-dimensional chain rule:

{\frac {\partial ^{\alpha }}{\partial x^{\alpha }}}(\mu _{\epsilon }*f)(y)=\int _{\mathbb {R} ^{d}}{\frac {\partial ^{\alpha }}{\partial x^{\alpha }}}\mu _{\epsilon }(y-x)f(x)dx=(-1)^{|\alpha |}\int _{\mathbb {R} ^{d}}{\frac {\partial ^{\alpha }}{\partial y^{\alpha }}}\mu _{\epsilon }(y-x)f(x)dx

=\int _{\mathbb {R} ^{d}}\mu _{\epsilon }(y-x){\frac {\partial ^{\alpha }}{\partial y^{\alpha }}}f(x)dx=(\mu _{\epsilon }*{\frac {\partial ^{\alpha }}{\partial y^{\alpha }}}f)(y)

Due to the above secion about mollifying $L^{p}$ -functions, we immediately know that $\lim _{\epsilon \to 0}\|\mu _{\epsilon }*{\frac {\partial ^{\alpha }}{\partial y^{\alpha }}}f-f\|=0$ , and the second statement therefore follows from the definition of the $W^{m,p}(U)$ -norm.

Let $\Omega \subseteq \mathbb {R} ^{d}$ be an open set. Then for all functions $v\in W^{m,p}(\Omega )$ , there exists a sequence of functions in $C^{\infty }(\Omega )\cap W^{m,p}(\Omega )$ approximating it.

Proof:

Let's choose

U_{i}:=\{x\in \Omega :{\text{dist}}(\partial \Omega ,x)>{\frac {1}{i}}\wedge \|x\|<i\}

and

V_{i}={\begin{cases}U_{3}&i=0\\U_{i+3}\setminus {\overline {U_{i+1}}}&i>0\end{cases}}

One sees that the $V_{i}$ are an open cover of $\Omega$ . Therefore, we can choose a sequence of functions $({\tilde {\eta }}_{i})_{i\in \mathbb {N} }$ (partition of the unity) such that

$\forall i\in \mathbb {N} :\forall x\in \Omega :0\leq {\tilde {\eta }}_{i}(x)\leq 1$
$\forall x\in \Omega :\exists {\text{ only finitely many }}i\in \mathbb {N} :{\tilde {\eta }}_{i}(x)\neq 0$
$\forall i\in \mathbb {N} :\exists j\in \mathbb {N} :{\text{supp }}{\tilde {\eta }}_{i}\subseteq V_{j}$
$\forall x\in \Omega :\sum _{i=0}^{\infty }{\tilde {\eta }}_{i}(x)=1$

By defining $\mathrm {H} _{i}:=\{{\tilde {\eta }}_{j}\in \{{\tilde {\eta }}_{m}\}_{m\in \mathbb {N} }:{\text{supp }}{\tilde {\eta }}_{j}\subseteq V_{i}\}$ and

\eta _{i}(x):=\sum _{\eta \in \mathrm {H} _{i}}\eta (x)

, we even obtain the properties

$\forall i\in \mathbb {N} :\forall x\in \Omega :0\leq \eta _{i}(x)\leq 1$
$\forall x\in \Omega :\exists {\text{ only finitely many }}i\in \mathbb {N} :\eta _{i}(x)\neq 0$
$\forall i\in \mathbb {N} :{\text{supp }}\eta _{i}\subseteq V_{i}$
$\forall x\in \Omega :\sum _{i=0}^{\infty }{\tilde {\eta }}_{i}(x)=1$

where the properties are the same as before except the third property, which changed. Let $|\alpha |=1$ , $\varphi$ be a bump function and $(v_{j})_{j\in \mathbb {N} }$ be a sequence which approximates $v$ in the $L^{p}(\Omega )$ -norm. The calculation

\int _{\Omega }\eta _{i}(x)v_{j}(x){\frac {\partial ^{\alpha }}{\partial x^{\alpha }}}\varphi (x)dx=-\int _{\Omega }\left({\frac {\partial ^{\alpha }}{\partial x^{\alpha }}}\eta _{i}(x)v_{j}(x)+\eta _{i}(x){\frac {\partial ^{\alpha }}{\partial x^{\alpha }}}v_{j}(x)\right)\varphi (x)dx

reveals that, by taking the limit $j\to \infty$ on both sides, $v\in W^{m,p}(\Omega )$ implies $\eta _{i}v\in W^{m,p}(\Omega )$ , since the limit of $\eta _{i}(x){\frac {\partial ^{\alpha }}{\partial x^{\alpha }}}v_{j}(x)$ must be in $L^{p}(\Omega )$ since we may choose a sequence of bump functions $\varphi _{k}$ converging to 1.

Let's choose now

W_{i}={\begin{cases}U_{i+4}\setminus {\overline {U_{i}}}&i\geq 1\\U_{4}&i=0\end{cases}}

We may choose now an arbitrary $\delta >0$ and $\epsilon _{i}$ so small, that

$\|\eta _{\epsilon _{i}}*(\eta _{i}v)-\eta _{i}v\|_{W^{m,p}(\Omega )}<\delta \cdot 2^{-(j+1)}$
${\text{supp }}(\eta _{\epsilon _{i}}*(\eta _{i}v))\subset W_{i}$

Let's now define

w(x):=\sum _{i=0}^{\infty }\eta _{\epsilon _{i}}*(\eta _{i}v)(x)

This function is infinitely often differentiable, since by construction there are only finitely many elements of the sum which do not vanish on each $W_{i}$ , and also since the elements of the sum are infinitely differentiable due to the Leibniz rule of differentiation under the integral sign. But we also have:

\|w-v\|_{W^{m,p}(\Omega )}=\left\|\sum _{i=0}^{\infty }\eta _{\epsilon _{i}}*(\eta _{i}v)-\sum _{i=0}^{\infty }(\eta _{i}v)\right\|_{W^{m,p}(\Omega )}\leq \sum _{i=0}^{\infty }\|\eta _{\epsilon _{i}}*(\eta _{i}v)-\eta _{i}v\|_{W^{m,p}(\Omega )}<\delta \sum _{i=0}^{\infty }2^{-(j+1)}=\delta

Since $\delta$ was arbitrary, this finishes the proof.

Let $\Omega$ be a bounded domain, and let $\partial \Omega$ have the property, that for every point $x\in \partial \Omega$ , there is a neighbourhood ${\mathcal {U}}_{x}$ such that

\Omega \cap {\mathcal {U}}_{x}=\{(x_{1},\ldots ,x_{d})\in \mathbb {R} ^{d}:x_{i}<f(x_{1},\ldots ,x_{i-1},x_{i+1},\ldots ,x_{d-1})\}

for a continuous function $f$ . Then every function in $W^{m,p}(\Omega )$ can be approximated by $C^{\infty }({\overline {\Omega }})$ -functions in the $W^{m,p}(\Omega )$ -norm.

Proof:

to follow

Hölder spaces and Morrey's inequality

Continuous representatives

The Gagliardo–Nirenberg–Sobolev inequality

Sobolev embedding theorems

Exercises

Sources

Partial Differential Equations
← Characteristic equations	Sobolev spaces	Calculus of variations →