What is the meaning of the $\ell_p$ norm in this model for sparse channel estimation?

Tags: information-processing, self-study, norm
2022-02-21 19:14:52

The signal $y(n)$ is given by

$$y(n)=\sum_{i=0}^{L-1} h(i)\,x(n-i)+v(n) \tag{1}$$

where $h=[h_1,h_2,\ldots,h_L]^T$ is the $L\times 1$ channel vector and $v(n)$ is additive noise with variance $\sigma_v^2$.

In Eq. (20) of the paper http://www.eurasip.org/Proceedings/Eusipco/Eusipco2008/papers/1569101936.pdf, there is the term $\|h\|_p^p$ with $0<p\le 1$.

Consider an array $h=[1,\,0.2,\,0,\,0.5]$. How is $\|h\|_p^p$ computed for this $h$?

The authors in Eq. (24) present the derivative of the cost function given in Eq. (20). The derivative is taken with respect to $h$ and involves $p$, $\lambda$, and powers of $\tilde h$.

2 Answers

The symbol $\|h\|_p$ denotes the $p$-norm of a vector $h$, which is defined as

$$\|h\|_p=\left(|h_1|^p+|h_2|^p+\cdots+|h_n|^p\right)^{1/p}.$$

When they write $\|h\|_p^p$, it is just that quantity raised to the $p$-th power, so that the root disappears. Mathematically:

$$\|h\|_p^p=\sum_{i=1}^{n}|h_i|^p$$
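To make this concrete on the array from the question, here is a minimal NumPy sketch (the helper names lp_p and lp_norm are my own, not from the paper):

```python
import numpy as np

def lp_p(h, p):
    """||h||_p^p = sum_i |h_i|^p (no p-th root)."""
    return np.sum(np.abs(h) ** p)

def lp_norm(h, p):
    """||h||_p = (sum_i |h_i|^p)^(1/p); a true norm only for p >= 1."""
    return lp_p(h, p) ** (1.0 / p)

h = np.array([1.0, 0.2, 0.0, 0.5])

# p = 0.5: ||h||_p^p = 1^0.5 + 0.2^0.5 + 0^0.5 + 0.5^0.5 ≈ 2.154
print(lp_p(h, 0.5))                   # 2.1543...
# p = 1: the root changes nothing, so both quantities equal 1.7
print(lp_norm(h, 1.0), lp_p(h, 1.0))  # 1.7 1.7
```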

To add to @Tendero's answer, the expression $\sum_k |x_k|^p$ is sometimes called the "power $p$-norm" when $p\ge 1$. Most often, you can see mentions of the "squared $\ell_2$ norm" or "$\ell_2$ norm squared". For $p=1$, the exponent does not modify the computation, so it is just called the $\ell_1$ norm.

The use of the power is often more convenient mathematically and computationally: having a $p$-th root $(\cdot)^{1/p}$ can be cumbersome when computing derivatives to find extrema.
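As a worked illustration (my own, not taken from the paper), compare the partial derivatives of the two forms for $h_i\ne 0$:

$$\frac{\partial}{\partial h_i}\|h\|_p^p = p\,|h_i|^{p-1}\operatorname{sgn}(h_i),
\qquad
\frac{\partial}{\partial h_i}\|h\|_p = \|h\|_p^{\,1-p}\,|h_i|^{p-1}\operatorname{sgn}(h_i).$$

The power form gives a simple per-coordinate term, while the rooted form drags the global factor $\|h\|_p^{\,1-p}$ into every partial derivative; a term of the first kind is presumably what appears in the paper's Eq. (24).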

When $p\ge 1$, this quantity satisfies all the norm axioms. But when $0<p<1$, the triangle inequality is no longer satisfied, so it should not be called a norm. The correct denomination is a quasi-norm, with a modulus of concavity $K$ such that

$$\ell_p(x+y)\le K\left(\ell_p(x)+\ell_p(y)\right).$$
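A quick numerical check of this failure, using the standard counterexample $x=(1,0)$, $y=(0,1)$ with $p=1/2$ (my own illustration; for $0<p<1$ the constant $K=2^{1/p-1}$ is known to suffice):

```python
import numpy as np

def lp(x, p):
    """(sum_i |x_i|^p)^(1/p); a true norm only for p >= 1."""
    return np.sum(np.abs(x) ** p) ** (1.0 / p)

p = 0.5
x = np.array([1.0, 0.0])
y = np.array([0.0, 1.0])

# ell_p(x + y) = (1 + 1)^2 = 4, yet ell_p(x) + ell_p(y) = 1 + 1 = 2:
print(lp(x + y, p))         # 4.0 -> triangle inequality fails
print(lp(x, p) + lp(y, p))  # 2.0 -> K = 2^(1/p - 1) = 2 restores the bound
```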

When $p=0$, this is neither a norm nor a quasi-norm. It can be called a cardinality function, a sparsity measure, or a count index: it counts the number of non-zero entries of the vector.

In signal processing, where sparsity is considered useful, $\ell_0$ is usually the target to minimize: the number of non-zero samples, or the number of taps of a filter.
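On the question's example array, this count reads (a one-line sketch):

```python
import numpy as np

h = np.array([1.0, 0.2, 0.0, 0.5])
# ||h||_0 counts the non-zero entries: 3 of the 4 taps are active.
print(np.count_nonzero(h))  # 3
```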

However, it is quite intractable (combinatorial, and not differentiable). Under some theoretical conditions, the minimization of an $\ell_0$ penalty can be replaced by an $\ell_1$ penalty, the "last" convex $\ell_p^p$ penalty.

Still, more and more works address non-convex penalties ($p<1$), which approximate $\ell_0$ more closely, as the tabulation below suggests.
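One way to see this approximation is to tabulate the scalar penalty $|t|^p$ for decreasing $p$ (my own illustration): as $p\to 0$, $|t|^p$ flattens toward the 0/1 indicator that $\ell_0$ applies to each entry.

```python
import numpy as np

t = np.array([0.0, 0.01, 0.1, 0.5, 1.0])
for p in (1.0, 0.5, 0.1):
    # As p -> 0, |t|^p -> 1 for every t != 0 while staying 0 at t = 0,
    # i.e. the penalty approaches the 0/1 counting behaviour of ell_0.
    print(p, np.abs(t) ** p)
```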

Finally, in a Bayesian context, a Laplacian prior can be encapsulated in an $\ell_1$ penalty, just as a Gaussian prior corresponds to an $\ell_2$ penalty; see for instance Why is Laplace prior producing sparse solutions?.
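As a sketch of that connection (here $X$ is my notation for the convolution matrix built from $x(n)$, an assumption to match the model in Eq. (1)): with Gaussian noise of variance $\sigma_v^2$ and an i.i.d. Laplacian prior $p(h_i)\propto e^{-|h_i|/b}$, maximizing the posterior gives

$$\hat h_{\mathrm{MAP}}=\arg\min_h\;\frac{1}{2\sigma_v^2}\,\|y-Xh\|_2^2+\frac{1}{b}\,\|h\|_1,$$

i.e. an $\ell_1$-penalized least-squares problem, which is why Laplacian priors produce sparse solutions.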