Best approximation by ridge functions in $L_p$-spaces

We study the approximation of the classes of functions by the manifold $R_n$ formed by all possible linear combinations of $n$ ridge functions of the form $r(a · x))$. It is proved that, for any $1 ≤ q ≤ p ≤ ∞$, the deviation of the Sobolev class $W^r_p$ from the set $R_n$ of ridge functions in the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Datum:2010
Hauptverfasser: Maiorov, V. E., Майоров, В. Є.
Format: Artikel
Sprache:Englisch
Veröffentlicht: Institute of Mathematics, NAS of Ukraine 2010
Online Zugang:https://umj.imath.kiev.ua/index.php/umj/article/view/2875
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Назва журналу:Ukrains’kyi Matematychnyi Zhurnal
Завантажити файл: Pdf

Institution

Ukrains’kyi Matematychnyi Zhurnal
_version_ 1860508866720038912
author Maiorov, V. E.
Майоров, В. Є.
author_facet Maiorov, V. E.
Майоров, В. Є.
author_sort Maiorov, V. E.
baseUrl_str https://umj.imath.kiev.ua/index.php/umj/oai
collection OJS
datestamp_date 2020-03-18T19:39:19Z
description We study the approximation of the classes of functions by the manifold $R_n$ formed by all possible linear combinations of $n$ ridge functions of the form $r(a · x))$. It is proved that, for any $1 ≤ q ≤ p ≤ ∞$, the deviation of the Sobolev class $W^r_p$ from the set $R_n$ of ridge functions in the space $L_q (B^d)$ satisfies the sharp order $n^{-r/(d-1)}$.
first_indexed 2026-03-24T02:32:01Z
format Article
fulltext UDC 517.5 V. E. Maiorov (Technion, Haifa, Israel) BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES НАЙКРАЩЕ НАБЛИЖЕННЯ ХРЕБТОВИМИ ФУНКЦIЯМИ В Lp-ПРОСТОРАХ We study the approximation of function classes by the manifold Rn formed by all possible linear combinations of n ridge functions of the form r(a ·x)). We prove that for any 1 ≤ q ≤ p ≤ ∞, the deviation of the Sobolev class W r p from the set Rn of ridge functions in the space Lq(Bd) satisfies the sharp order n−r/(d−1). Дослiджено наближення класiв функцiй многовидом Rn, що утворений усiма можливими лiнiйними комбiнацiями n хребтових функцiй вигляду r(a · x)). Доведено, що для будь-яких 1 ≤ q ≤ p ≤ ∞ вiд- хилення класу Соболєва W r p вiд множини Rn хребтових функцiй у просторi Lq(Bd) характеризується точним порядком n−r/(d−1). 1. Introduction. In this work we continue the study of approximation of multivariate functions by classes consisting of linear combinations of ridge functions started in [9, 10, 11, 5]. Ridge functions are defined as functions on Rd of the form r(a · x), with the parameters a ∈ Rd, r : R → R and a · x is the usual inner product. Let Lq = Lp(B d), 1 ≤ q ≤ ∞, be Banach space of all q-integrable functions on the unite ball Bd = {|x| ≤ 1}, where |x|2 = x2 1 + . . .+ x2 d, with the norm ‖f‖q =  ∫ Bd |f(x)|q dx 1/q . Let A be a set on the unit sphere Sd−1 = {|x| = 1} in Rd. Introduce the set of ridge functions R(A) = { ra := r(a · x) : r ∈ L2,loc(R), a ∈ A } , where r runs over the class L2,loc(R) of square integrable functions on all compact subsets of R, and a runs over A. We denote R = R(Sd−1). Let n be a natural number. Consider the class of functions Rn = R+ . . .+R, consisting of all possible linear combinations of n functions from the set R. Approximation by ridge functions has been studied by several authors. In the works [26] and [6] necessary and sufficient conditions are found on a set A in order that the closure of the set R(A) coincides with the space of continuous functions. In addition, Lin and Pinkus [7] proved that for any fixed n the set Rn is not dense in the spaces of continuous functions on a compact sets. The approximation properties of ridge manifolds were studied by Barron [1], DeVore, Oskolkov and Petrushev [2], Maiorov [9], Maiorov and Meir [12]. Makovoz [14], Mhaskar and Micchelli [16], Mhaskar [15], Oskolkov [17], Petrushev [18], Pinkus [19], Temlyakov [21]. In Gordon, Maiorov, Meyer and Reisner [5] the results about best approximation by ridge functions in the Banach space Lp are considered. c© V. E. MAIOROV, 2010 396 ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES 397 In this work we consider the problems of best approximation of multivariate functions from the Sobolev classes W r p by the class Rn in the space Lq, where the parameters p, q satisfy 1 ≤ q ≤ p ≤ ∞. Earlier, the asymptotical estimates of approximation by Rn were studied in [5, 9], but only for 2 ≤ q ≤ p ≤ ∞. Let ρ = (ρ1, . . . , ρd) be a multiindex vector, that is, ρ is the vector with nonnegative integer coordinates, |ρ| = ρ1 + . . . + ρd. Introduce the differential operator Dρ = = ∂|ρ|/∂ρ1x1 . . . ∂ ρdxd. Let r be any natural number. We consider in the space Lp = Lp(B d) the Sobolev [23] class of functions W r p := f : ‖f‖W r p := ‖f‖p + ∑ |ρ|=r ‖Dρf‖p ≤ 1  . For subsets W,V ⊂ Lq we define the deviation of W from V by e(W,V )q = sup f∈W e(f, V )q, where e(f, V )q = inf v∈V ‖f − v‖q. Theorem 1. Let d ≥ 2, r > 0 and 1 ≤ p ≤ q ≤ ∞ be any numbers. Then for the deviation of the Sobolev class W r p from the class Rn the asymptotic inequality holds e(W r p , Rn)q � n−r/(d−1). We describe briefly the proof of the Theorem 1. In order to obtain the lower bound in Theorem 1, we construct for any n a function f ∈ W r p depending on n such that the distance of f from the class Rn is greater than cn−r/(d−1). The construction of the function f will be done in the following way. In Section 2 we introduce an orthonormal system { Pk(x) }∞ k=1 of polynomials on the ball Bd and study the Fourier coefficients 〈ra, Pk〉 of ridge function r(a ·x) respect to the system { Pk(x) } . In particular, we show that the coefficients allow the separation of variables r and a (see [13, 9, 10]), namely for any k the identity holds 〈ra, Pk〉 = u(a)v(r), where u and v are some function of a and linear functional of r, respectively. In Section 3 we estimate the Vapnik – Chervonenkis dimension of the projection PrsRn of the class of ridge functions Rn to the polynomial space Pds . In Section 4 we prove Theorem 1 using the results of Sections 2, 3. In the Appendix we present well-known results from the theory of orthogonal polynomials on the segment and from the theory of harmonic analysis on the sphere, which we use in the proofs of Theorem 1. In the rest of this paper notations like c, c′, c0, c1, . . . , etc. denote positive constants which do not depend on the parameter n and may depend only on r, d, p, or q. For two positive sequences {an} and {bn}, n = 0, 1, . . . we write an � bn if there exist positive constants c1 and c2 such that c1 ≤ an/bn ≤ c2 for all n = 0, 1, . . . . 2. Projection of Rn to the polynomial space. In this section we construct speci- al orthonormal systems of polynomials on the unit ball Bd. Orthogonal systems of polynomials on the ball play the important part in problems of approximations of multi- variable functions by manifolds of linear combinations of ridge functions (the plane waves) (see [2, 18, 9, 10]). In the works [2, 18] these methods were developed for the construction of orthogonal projections on polynomial subspaces and approximation by ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 398 V. E. MAIOROV ridge functions. The system of Gegenbauer orthogonal polynomials is the main tool used in the construction of orthogonal systems of polynomials on the ball [8]. Note, in parti- cular, that in the case d = 2, these Gegenbauer polynomials coincide with Chebyshev polynomials of second kind. In our work the system of orthogonal polynomials on the unit ball is obtained, in a sense, by the convolution of two orthogonal systems. These are the system of Gegenbauer polynomials on the segment [−1, 1], and the system of spherical harmonics on the unit sphere Sd−1. We describe this construction. Let L2(Sd−1) be the Hilbert space consisting of all the complex-valued square- integrable functions h(ξ) on the sphere Sd−1 with the inner product (h1, h2) = ∫ Sd−1 h1(ξ)h̄2(ξ)dξ, h1, h2 ∈ L2(Sd−1), where by dξ we denote the normalized Lebesgue measure on the sphere Sd−1. Consider (see the Appendix) in the space L2(Sd−1) the subspace H consisting of the restrictions on Sd−1 of the harmonic functions on Rd. Let Hs be the subspace in H generated by all spherical harmonics of degree at most s, i.e., all harmonic polynomials of degree at most s. Let Hhom s be the subspace of Hs formed by all homogeneous spherical harmonics of degree s. The functions {hsk}k∈Ks (see Appendix) generate a basis in the space Hhom s . The space Hs = Hhom 0 ⊕ Hhom 1 ⊕ . . . ⊕ Hhom s is the direct sum of the orthogonal subspaces of the spherical harmonics of degrees 0, 1, . . . , s. Denote by Ns the dimension of the space Hs. We have Ns � sd−1. Indeed, using the relation dim Hhom s � sd−2 (see (A.2)) we obtain Ns = dim Hs = dim Hhom 0 + dim Hhom 1 + . . .+ dim Hhom s � sd−1. We introduce in the space H the family of functions B(Sd−1) := {hi}∞i=0 consisting (see (A.2)) of all ordered spherical harmonics, that is, the functions ∞⋃ s=0 {hs,k}k∈Ks , where Ks is defined in Appendix. The set B(Sd−1) is an orthonormal basis in the space H, i.e., for indices i 6= i′ we have (hi, hi′) = δii′ , where δii′ = 0 for i 6= i′, and δii = 1. Now we consider (see Appendix) the Gegenbauer polynomials Cd/2n (t), t ∈ R, of degree n associated with d/2. We norm the polynomial Cd/2n by a factor, i.e., we set un(t) = v−1/2 n Cd/2n (t), vn = π1/2(d)nΓ ((d+ 1)/2) (n+ d/2)n!Γ(d/2) , where (a)0 = 1, and (a)n = a(a+ 1) . . . (a+ n− 1). Let i and j be two arbitrary indices from Z+. Construct on Rd the function Pij(x) = νj ∫ Sd−1 hi(ξ)uj(x · ξ)dξ, νj = ( (j + 1)d−1 2(2π)d−1 )1/2 . (1) ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES 399 From (1) we see that for any i, j ∈ Z+ the function Pij is a polynomial on Rd of degree j. Note that if the indices i and j are such that the degrees of the polynomials hi and uj satisfy the inequality deg hi > deg uj = j, then Pij(x) ≡ 0 (see (A.8)). Let the set I consists of the couples of nonnegative integers (i, j) such that deg hi ≤ j and a parity of numbers deg hi and j are equal. Let Is be the subset in I consisting of the couples (i, j) with deg hi ≤ j ≤ s. Construct the system of polynomials Π := Π(Bd) := {Pij}(i,j)∈I . (2) We consider in the system Π the finite subsystem Πs = {Pij}(i,j)∈Is consisting of polynomials of degree at most s. Lemma 1 (see [10]). a) The polynomial set Πs is an orthonormal basis in the space of polynomials Pds . b) The set of polynomials Π(Bd) is a complete orthonormal system of functions in the space L2(Bd). Let a ∈ Sd−1 be any unit vector and let A be an orthogonal matrix such that a = Ae, where e = (1, 0, . . . , 0). Let A∗ be the adjoint matrix to A. We denote a∗ = A∗e. Lemma 2. Let ra = r(a · x) be any ridge function from the class R. Then any Fourier coefficient of the function ra with respect to the orthonormal system Πs = {Pij} admits separation of the variables a and r, that is can be represented as 〈ra, Pij〉 = hi(a ∗) r̂j , where r̂j = ∫ I r(t)uj(t)wd/2(t) dt is the j-th coefficient of Fourier – Gegenbauer of the function r. Proof. By the invariance of the measures dx and dξ relative to the rotation in Rd and Sd we have∫ Bd r(a · x)Pij(x) dx = νj ∫ Bd r(a · x) dx ∫ Sd−1 hi(ξ)uj(ξ · x) dξ = = νj ∫ Bd r(e · x) dx ∫ Sd−1 hi(A ∗ξ)uj(ξ · x) dξ = = νj ∫ Sd−1 hi(A ∗ξ) dξ ∫ Bd r(e · x)uj(ξ · x) dx. Decompose the function r to the Fourier – Gegenbauer serious r(t) = ∑∞ k=0 r̂kuk(t) in the space L2(I, w). Using the properties (A.2) and (A.3) from Appendix we obtain∫ Bd r(e · x)uj(ξ · x) dx = ∞∑ k=0 r̂k ∫ Bd uk(e · x)uj(ξ · x) dx = = r̂j ∫ Bd uj(e · x)uj(ξ · x) dx = r̂j uj(e · ξ) uj(1) . ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 400 V. E. MAIOROV It follows from the property (A.4) that νj ∫ Sd−1 hi(A ∗ξ) dξ ∫ Bd r(e · x)uj(ξ · x) dξ = = νj r̂j uj(1) ∫ Sd hi(A ∗ξ)uj(e · ξ) dx = r̂jhi(A ∗e). The lemma is proved. Denote bym = dimPds the dimension of the space Pds . Let r(x) = ∑n k=1 rk(ak ·x) be any function from the ridge functions classRn. Consider the projection of the function r to the space Pds Prsr(x) := ∑ Pij∈Πs 〈r, Pij〉Pij(x). (3) According to Lemma 2 we can write Prsr(x) = n∑ k=1 ∑ Pij∈Πs hi(a ∗ k) r̂kjPij(x), (4) where r̂kj is the j-th coefficient of Fourier – Gegenbauer of the function gk. 3. Vapnik – Chervonenkis dimension of the class PrsRn. We recall the notions of Vapnik – Chervonenkis dimension (in details see [24]). Consider the function sgn a = 1, if a ≥ 0, and sgn a = −1, if a < 0. For a vector h = (h1, . . . , hn) in Rn, we denote by sgnh the vector (sgnh1, . . . , sgnhn). Let H = {h} be a set of real-valued functions defined on Rd. By sgnH we denote the set of all vectors {sgnh}, h ∈ H. Definition. The Vapnik – Chervonenkis dimension dimV C H of a functions set H = {h} is defined as the maximal natural number m such that there exists a collection {ξ1, . . . , ξm} in Rd such that the cardinality of the sgn vectors set S = = {(sgnh(ξ1), . . . , sgnh(ξm)) : h ∈ H} is equal to 2m. That is, the set S coincides with the set of all vertices of the unit cube in the space Rm. Let {ξ1, . . . , ξm} be any collection of points in Rd. Consider the set of vectors in Rd Πm,s,n = { (P (ξ1 + t), . . . , P (ξm + t)) : P ∈ PrsRn, t ∈ Rd } . We will need to estimate the cardinality |sgn Πm,s,n| of the sign vectors set sgn Πm,s,n. To this end we use the following result. Lemma 3 ([9], Lemma 3). Let m, s, l and q be any natural numbers such that l + q ≤ m/2. Let παβ(σ), α = 1, . . . ,m, β = 1, . . . , q be any fixed polynomials with real coefficients in the variables σ ∈ Rl, each of degree 2s. Construct m polynomials in the l + q variables b ∈ Rq and σ ∈ Rl πα(b, σ) = q∑ β=1 bβπαβ(σ), α = 1, . . . ,m. (5) Construct in Rm a polynomial manifold Π∗m,s,l,q = { (π1(b, σ), . . . , πm(b, σ)) : (b, σ) ∈ Rq × Rl } . ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES 401 Then for the cardinality of the set sgn Π∗m,s,p,q the following estimate holds: ∣∣sgn Π∗m,s,l,q ∣∣ ≤ (4s)l(l + q + 1)l+2 ( 2em l + q )l+q . Lemma 4. There exist absolute constants c0, c1 and c2 such that c0n ≤ sd−1 ≤ 2c0n, c1s d ≤ m ≤ c2sd, and the cardinality of the set sgn Πm,s,n satisfies the inequality |sgn Πm,s,n| ≤ 2cm, where c ≤ 1/4 is some absolute constant. Proof. Consider the polynomial space Pds with the orthonormal basis Πs = = {Pi,j}(i,j)∈Is Let P ∈ PrsRn be any polynomial. Then P (x) = ∑ (i,j)∈Is 〈r, Pij〉Pij(x), (6) where the function r(x) = ∑n k=1 rk(ak · x) belongs to the manifold Rn. We will show that for every point ξ ∈ Rd the polynomial P (ξ + t) can be represented as a linear combination of polynomials on the variables a∗1, . . . , a ∗ n and t. It follows from the identity (4) P (x) = n∑ k=1 ∑ (i,j)∈Is hi(a ∗ k)r̂kjPij(x). (7) Recall that the set Is consists of the couples (i, j) from the set I satisfying deg hi ≤ ≤ j ≤ s. For every j introduce the set Ijs consisting of all numbers i such that deg hi = j. Then the polynomial P (x) can be written as P (x) = n∑ k=1 s∑ j=1 r̂kj ∑ i∈Ijs hi(a ∗ k)Pij(x). (8) Since {Pi,j}(i,j)∈Is is an orthonormal basis in the space Pds then for every t there is a nondegenerate matrix Γ(t) = { γi ′j′ ij (t) } (i,j),(i′,j′)∈Is , where (i, j) and (i′, j′) are the column and row indexes, respectively, of the matrix Γ(t), such that Pij(ξ + t) = m∑ (i′,j′)∈Is γi ′j′ ij (t)Pi′j′(ξ). (9) Note that all the functions γi ′j′ ij (t) are polynomials from the space Pds . Hence, from (8) and (9) we obtain P (ξ + t) = n∑ k=1 s∑ j=1 r̂kj ∑ i∈Ijs ∑ (i′,j′)∈Is γi ′j′ ij (t)hi(a ∗ k)Pi′j′(ξ). (10) ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 402 V. E. MAIOROV We enumerate the set { (k, j) : 1 ≤ k ≤ n, 1 ≤ j ≤ s } in {β = 1, . . . , q}, where q = ns and we put bβ = r̂kj . For every point ξα, α = 1, . . . ,m and index β = 1, . . . , ns we define the function on R(d+1)n παβ(a∗1, . . . , a ∗ n, t) = ∑ i∈Ijs ∑ (i′,j′)∈Is γi ′j′ ij (t)hi(a ∗ k)Pi′j′(ξα). Then the identity (10) can be written as P (ξ + t) = q∑ β=1 bβπαβ(a∗1, . . . , a ∗ n, t), α = 1, . . . ,m. Introduce the variables σ = (a∗1, . . . , a ∗ n, t) which belong to the polynomial space P l2s with l = (d+ 1)n. Thus the vector set Πm,s,n belongs to the set Π∗m,2s,l,q with q = sn. From here and Lemma 3 we obtain |sgn Πm,2s,n| ≤ (8s)l(l + q + 1)l+2 ( 2em l + q )l+q . (11) Let c0 and c1 < c2 be a positive numbers with we will choose below. We assume the numbers m, s, l satisfy the conditions c0n ≤ sd−1 ≤ 2c0n, c1s d ≤ m ≤ c2sd and l = (d+ 1)n. (12) Substituting these conditions to the inequality (11) we complete Lemma 4. From Lemma 4 we directly obtain the following consequence. Consequence 1. The Vapnik – Chervonenkis dimension of the polynomial class PrsRn satisfies the estimate dimV C PrsRn ≤ l log2(4s) + (l + 2) log2(l + q + 1) + (l + q) log2 ( 2em l + q ) . 4. Approximation of the class Pd s by ridge functions. Let Ω = [ − 1√ d , 1√ d ]d be the cube which belongs to the unit ball Bd. We define the function on Rd ω(x) = 1, x ∈ 1 2 Ω, 0, x ∈ Rd \ Ω, and continue it on the space Rd such that ω belongs to the classW r ∞(Rd) and 0 ≤ ω(x) ≤ ≤ 1 for all x ∈ Rd. Let λ and m be any natural numbers such that m1/d ≤ λ ≤ 2m1/d. Consider the lattice subset in the cube Ω consisting of m points Ξm = {( i1 + 1/2√ d λ , . . . , id + 1/2√ d λ ) : i1, . . . , id = −λ, . . . , λ− 1 } . Let ξ1, . . . , ξm be the points of the set Ξm. Introduce the set Em = { ε = (ε1, . . . , εm) : εi = ±1, i− 1, . . . ,m } of sign vectors in Rm. Consider the collection of function ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES 403 Fm = { fε(x) := (2λ)−r m∑ i=1 εiω (2λ(x− ξi)) : ε ∈ Em } . Obviously, every function fε from Fm belongs to the Sobolev class W r ∞. Denote by gε the polynomial of best approximation of the function fε by the space Pds in L∞-norm, that is satisfying ‖fε − gε‖∞ = min g∈Pd s ‖fε − g‖∞. We know [22] that the error of the best approximation of any function f ∈ W r ∞ from the polynomial space Pds in the L∞-norm is bounded above as follows, ‖fε − gε‖∞ ≤ cs−r. Hence we have the next result. Proposition 1. Consider the set of polynomials Gms = {gε : ε ∈ Em}. Then the deviation of the set Fm from the space Gms satisfies e(Fm, Gms )∞ ≤ cs−r. Let Q be a set of functions in the space L(Bd). Denote by PrsQ = {Prsq : q ∈ Q} the projection of a set Q to the subspace Pds . Lemma 5. Let 1 ≤ q ≤ ∞ be any number and P ∈ Pds be any polynomial. Then e(P,Rn)q ≥ e(P,PrsRn)q. Proof. We have e(P,Rn)q = inf ri∈L2,loc(R), ai∈Rd ∥∥∥∥∥P (x)− n∑ i=1 ri(ai · x) ∥∥∥∥∥ q . (13) We fix the set of vectors a = {a1, . . . , an} and consider the linear subspace of functions Un(a) := { u = n∑ i=1 ui(ai · x) : ui ∈ L2,loc(R) } . Let Un(a)⊥ = { v ∈ Lq : 〈v, u〉 = 0 for all u ∈ Un(a) } be the annihilator subspace in Lq for the subspace Un(a). Define the number q′ such that 1/q + 1/q′ = 1. Using the duality in the space Lq we have inf u∈Un(a) ‖P − u‖q = sup v∈Un(a)⊥, ‖v‖q′≤1 〈P, v〉 ≥ sup v∈Un(a)⊥∩Pd s , ‖v‖q′≤1 〈P, v〉. Since Un(a)⊥ ∩ Pds = { v ∈ Pds : 〈v, Un(a)〉 = 0 } = { v : 〈v,Prs Un(a))〉 = 0 } , then using once more the duality in the space Pds , we obtain ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 404 V. E. MAIOROV e(P,Un(a))q ≥ sup v∈PrsUn(a)⊥∩Pd s , ‖v‖q′≤1 〈P, v〉 = inf h∈Prs Un(a) ‖P − h‖q. (14) It follows from (13) and (14) that e(P,Rn)q = inf a1,...,an e(P,Un(a))q ≥ ≥ inf a1,...,an inf h∈PrsUn(a) ‖P − h‖q = e(P,PrsRn)q. Lemma 5 is proved. 5. Proof of Theorem 1. Consider the space lm1 consisting of vectors x ∈ Rm equipped with the norm ‖x‖lm1 = |x1| + . . . + |xm|. In the space lm1 we consider the subset Em = { ε : ε1, . . . , εm = ±1 } . The following lemma is proved in [9]. For completeness we cite its proof. Lemma 6. Assume that all conditions of Lemma 4 are satisfied. Then there is a vector ε∗ ∈ Em such that e(ε∗,Πm,s,n)lm1 := inf x∈Πm,s,n ‖ε∗ − x‖lm1 ≥ am, where a is an absolute and strictly positive constant. Proof. Let a < 1 be the absolute constant satisfying the equation 1 − 1 2 (1 − − 2a)2 log2 e = 47 64 (i.e., a = 0.19 . . .). Set Π = sgn Πm,s,n. Let π be any vector from Π. Consider the subset in Em Eπ = { ε ∈ Em : m∑ i=1 |εi − πi| ≤ 2am } . Since πi = ±1 we have the estimate for cardinality of the set Eπ |Eπ| = ∣∣∣∣∣ { ε ∈ Em : m∑ i=1 (εi + 1) ≤ 2am }∣∣∣∣∣ = = ∣∣∣∣∣ { ε : ∑ i : εi=1 1 ≤ am }∣∣∣∣∣ = [am]∑ i=0 ( m i ) . From the well-known estimate (see, for example, [3], Chapter 8) we have [am]∑ i=0 ( m i ) ≤ 2me−2m(1/2−β)2 ≤ 2bm, where β = m−1[am], and b = 1 − 1 2 (1 − 2a)2 log2 e = 47 64 . Hence |Eπ| ≤ 247m/64. Consider in Em the subset E′ = ⋂ π∈Π(Em \ Eπ). We estimate the cardinality of E′ via |E′| = ∣∣∣∣∣Em \ ⋃ π∈H Eπ ∣∣∣∣∣ ≥ 2m − |Π|max π∈Π |Eπ| ≥ 2m − |Π|2(47/64)m. (15) By Lemma 4 we have |Π| ≤ 2m/4. From this and (15) we obtain |E′| ≥ 2m−2(63/64)m > > 0. Therefore there exists a vector ε∗ such that for every vector π ∈ Π the following inequality holds: ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES 405 ‖ε∗ − π‖lm1 ≥ 2am. From here we obtain the inequality e(ε∗,Πm,s,n)lm1 ≥ 1 2 e(ε∗, sgn Πm,s,n)lm1 ≥ am. Lemma 6 is proved. Lemma 7. Assume the natural numbers m, s and n satisfy the conditions of Lemma 4. Then there is a fε∗ ∈ Fm such that e(fε∗ ,PrsRn)1 ≥ c3n−r/(d−1), where c3 is an absolute and strictly positive constant. Proof. Let fε and P be any functions from the sets Fm and PrsRn, respectively. We have ‖fε − P‖1 ≥ ∫ Ω |fε(x)− P (x)|dx = ∫ Ω/(2λ) m∑ i=1 ∣∣fε(ξi + t)− P (ξi + t) ∣∣dt. Define the function ω̄(t) = (2λ)−rω(2λ t). Since fε(ξi + t) = ω̄(t)εi for every vector ε, then for any t from the cube Ω/(2λ) we have m∑ i=1 ∣∣fε(ξi + t)− P (ξi + t) ∣∣ ≥ inf P∈PrsRn, τ∈Rd m∑ i=1 ∣∣ω̄(t)εi − P (ξi + τ) ∣∣ = = inf P∈PrsRn, τ∈Rd ω̄(t) m∑ i=1 ∣∣εi − P (ξi + τ) ∣∣. Thus we obtain ‖fε − P‖1 ≥ 1 m inf t∈Ω/(2λ) ∣∣ω̄(t) ∣∣ inf P∈PrsRn, τ∈Rd m∑ i=1 ∣∣εi − P (ξi + τ) ∣∣ ≥ ≥ c3 (2λ)rm e(ε,Πm,s,n, l m 1 ). Recall (see (12)) that the numbers m, s, n satisfy the conditions c0n ≤ sd−1 ≤ 2c0n, c1s d ≤ m ≤ c2s d and m1/d ≤ λ ≤ 2m1/d. Applying Lemma 6 we obtain that there is a function fε∗ ∈ Fm satisfying ‖fε∗ − P‖1 ≥ c3a (2λ)r ≥ c4n−r/(d−1), c4 = c3a c r/d 2 (2c0)r , for any polynomial P ∈ PrsRn. Lemma 7 is proved. Lemma 8. The deviation of the set Fm from the class Rn satisfies e(Fm, Rn)1 ≥ c4n−r/(d−1). Proof. Let f ∈ Fm be any function. According to Proposition 1 there is a polynomial g ∈ Gms that ‖f − g‖∞ ≤ cs−r. (16) ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 406 V. E. MAIOROV Using Lemma 5 and twice the inequality (16) we obtain e(Fm, Rn)1 ≥ e(Gms , Rn, L1)− cs−r ≥ ≥ e(Gms ,PrsRn)1 − cs−r ≥ e(Fm,PrsRn)1 − 2cs−r. We choose c2 and c0 such that c4 > 2c. Then it follows from Lemma 7 that e(Fm, Rn)1 ≥ c4n−r/(d−1) − 2cs−r � n−r/(d−1). Lemma 8 is proved. Now we prove Theorem 1. We know that the collection of functions Fm belongs to the class W r ∞. Therefore, using Hölder’s inequality for 1 ≤ q ≤ p ≤ ∞ and Lemma 8, we obtain e(W r p , Rn)q ≥ e(W r ∞, Rn)1 ≥ e(Fm, Rn)1 ≥ cn−r/(d−1). The upper bound e(W r p , Rn)q ≤ cn−r/(d−1) was proved in the paper [9]. Theorem 1 is completely proved. 6. Appendix. We discuss some well-known results connected with orthogonal polynomials, which we use in this present work. The Gegenbauer polynomials. The Gegenbauer polynomials are usually defined via the generating function (1− 2tz + z2)−λ = ∞∑ k=0 Cλk (t)zk, where |z| < 1, |t| < 1, and λ > 0. The coefficients Cλk (t) are algebraic polynomials of degree k and are termed the Gegenbauer polynomials associated with λ. The Gegenbauer polynomials possess the following properties: 1. The family of polynomials {Cλk } is a complete orthogonal system for the wei- ghted space L2(I, w), I = [−1, 1], w(t) := wλ(t) := (1− t2)λ−1/2, and∫ I Cλm(t)Cλn(t)w(t)dt = { 0, m 6= n, vn,λ, m = n, with vn,λ := π1/2(2λ)nΓ(λ+ 1/2) (n+ λ)n!Γ(λ) , (A.1) where we use the usual notation (a)0 := 0, (a)N := a(a+ 1) . . . (a+N − 1). 2. Let Pn denote the set of all algebraic polynomials of total degree n in d real vari- ables. Set un(t) = v −1/2 n C d/2 n (t), where vn = π1/2(d)nΓ ((d+ 1)/2) (n+ d/2)n!Γ(d/2) . The polynomials un(ξ · x), ξ ∈ Sd−1, are in Pn and the un(ξ · x) are orthogonal to Pn−1 in L2(Bd) (see [18]): ∫ Bd un(ξ · x)p(x)dx = 0 ∀ξ ∈ Sd−1 and ∀p ∈ Pn−1. (A.2) ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 BEST APPROXIMATION BY RIDGE FUNCTIONS IN Lp-SPACES 407 3. For each ξ, η ∈ Sd−1 we have (see [18])∫ Bd un(ξ · x)un(η · x)dx = un(ξ · η) un(1) . (A.3) 4. For each polynomial h(x) ∈ Pn such that h(x) = (−1)nh(−x) for all x ∈ Rd we have (see [18])∫ Sd−1 h(ξ)un(ξ · η)dξ = un(1) νn h(η), where νn = (n+ 1)d−1 2(2π)d−1 . (A.4) An orthogonal system of polynomials on the sphere. We state some facts (see [4, 25, 20]) from the theory of harmonic analysis on the sphere. Let s be any positive integer. Consider a space Hs consisting of the homogeneous harmonic polynomials of degree s in the d variables x1, . . . , xd. Any polynomial from Hs is decomposable by a linear combination of polynomials of the form hsk(x) = Ask d−2∏ j=o r kj−kj−1+1 d−j C d−j−2 2 +kj+1 kj−kj+1 ( xd−j rd−j ) (x2 ± ix1)kd−2 , (A.5) where r2 d−j = x2 1 + . . .+ x2 d−j . The vector k with integer coordinates belongs to the set Ks = { k = (k0, k1, . . . , kd−3, εkd−2) : 0 ≤ kd−2 ≤ . . . ≤ k1 ≤ k0 = s, ε = ±1 } , and Ask is the normalization factor. It is known that the dimension of the space Hs is given by dimHs = |Ks| = ( s+ d− 1 s ) − ( s+ d− 3 s− 2 ) , (A.6) if s ≥ 2, and dimH0 = 1, dimH1 = d. It is easy to verify that the dimension of Hs is asymptotically given by dimHs = ( 2 + 2 (d− 2)! + c(s, d) ) s(s+ 1) . . . (s+ d− 3) � sd−2, (A.7) where 0 ≤ c(s, d) ≤ 1 is some function depending only on s and d. The family of functions {hsk}k∈Ks is an orthonormal system in the spaceL2(Sd−1), i.e., for any multiindices k, k′ ∈ Ks, the following holds: (hsk, hsk′) = ∫ Sd−1 hsk(ξ)hsk′(ξ)dξ = δkk′ . (A.8) Note that the spaces Hs and Hs′ for s 6= s′ are orthogonal with respect to the inner product (A.8). The family of functions ⋃∞ s=0{hsk}k∈Ks is a complete orthonormal system in the space L2(Sd−1). The set of polynomials on the sphere {p : p ∈ Pn} of degree ≤ n is contained in the space H0 ⊕ H1 ⊕ . . . ⊕ Hn, which is the direct sum of the orthogonal subspaces H0,H1, . . . ,Hn. From the above it follows that for any polynomial p ∈ Pn and for any function h ∈ Hn+1 ⊕Hn+2 ⊕ . . . the equality ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3 408 V. E. MAIOROV∫ Sd−1 p(ξ)h(ξ)dξ = 0 holds. 1. Barron A. R. Universal approximation bounds for superposition of a sigmoidal function // IEEE Trans. Inform. Theory. – 1993. – 39. – P. 930 – 945. 2. DeVore R. A., Oskolkov K., Petrushev P. Approximation by feed-forward neural networks // Ann. Numer. Math. – 1997. – 4. – P. 261 – 287. 3. Devroye L., Györfy L., Lugosi G. A probabilistic theory of pattern recognition. – New York: Springer Verlag, 1996. 4. Erdelyi A., ed. Higher transcendental functions. Vol. 2. Bateman manuscript project. – New York, N. Y.: McGraw Hill, 1953. 5. Gordon Y., Maiorov V., Meyer M., Reisner S. On best approximation by ridge functions in the uniform norm // Constr. Approxim. – 2002. – 18. – P. 61 – 85. 6. Lin V. Ya., Pinkus A. Fundamentality of ridge functions // J. Approxim. Theory. – 1993. – 75. – P. 295 – 311. 7. Lin V. Ya., Pinkus A. Approximation of multivariate functions // Adv. Comput. Math. – World Sci. (Singapore), 1994. – P. 257 – 265. 8. Logan B., Shepp L. Optimal reconstruction of functions from its projections // Duke Math. J. – 1975. – 42. – P. 645 – 659. 9. Maiorov V. On best approximation by ridge functions // J. Approxim. Theory. – 1999. – 99. – P. 68 – 94. 10. Maiorov V. On best approximation of classes by radial functions // Ibid. – 2003. – 120. – P. 36 – 70. 11. Maiorov V., Meir R., Ratsaby J. On the approximation of functional classes equipped with a uniform measure using ridge functions // Ibid. – 1999. – 99. – P. 95 – 111. 12. Maiorov V., Meir R. On the near optimality of the stochastic approximation of smooth functions by neural networks // Adv. Comput. Math. – 2000. – 13. – P. 79 – 103. 13. Maiorov V., Oskolkov K. I., Temlyakov V. N. Gridge approximation and Radon compass // Approxim. Theory (a Volume dedicated to Blagovest Sendov / Ed. B. Bojanov. – Sofia: DARBA, 2002. – P. 284 – 309. 14. Makovoz Y. Random approximation and neural networks // J. Approxim. Theory. – 1996. – 85. – P. 98 – 109. 15. Mhaskar H. N. Neural networks for optimal approximation of smooth and analytic functions // Neural Comput. – 1996. – 8. – P. 164 – 177. 16. Mhaskar H. N., Micchelli C. A. Dimension independent bounds on the degree of approximation by neural networks // IBM J. Research and Development. – 1994. – 38. – P. 277 – 284. 17. Oskolkov K. I. Ridge approximation, Chebyshev – Fourier analysis and optimal quadrature formulas // Proc. Steklov Inst. Math. – 1997. – 219. – P. 265 – 280. 18. Petrushev P. P. Approximation by ridge functions and neural networks // SIAM J. Math. Anal. – 1998. – 30. – P. 291 – 300. 19. Pinkus A. Approximation by ridge functions, some density problems from neutral networks // Surface Fitting and Multiresolution Method. – 1997. – 2. – P. 279 – 292. 20. Stein E. M., Weiss G. Introduction to Fourier analysis on Euclidean spaces. – Princeton, New Jersey, 1971. 21. Temlyakov V. On approximation by ridge functions. – Preprint. 22. Timan A. F. Theory of approximation of function of the real variable. – New York: Macmillan Co., 1963. 23. Tribel H. Interpolation theory function spaces, differential operators. – Berlin: Veb Deutscher Verlag Wissenschaften, 1978. 24. Vapnik V., Chervonkis A. Necessary and sufficient conditions for the uniform convergence of empirical means to their expectations // Theory Probab. Appl. – 1981. – 3. – P. 532 – 553. 25. Vilenkin N. Ya. Special functions and a theory of representation of group. – Moscow: Fizmatgiz, 1965. 26. Vostrecov B. A., Kreines M. A. Approximation of continuous functions by superpositions of plane waves // Dokl. Akad. Nauk SSSR. – 1961. – 140. – P. 1237 – 1240 (Soviet Math. Dokl. – 1961. – 2. – P. 1320 – 1329). Received 26.05.09 ISSN 1027-3190. Укр. мат. журн., 2010, т. 62, № 3
id umjimathkievua-article-2875
institution Ukrains’kyi Matematychnyi Zhurnal
keywords_txt_mv keywords
language English
last_indexed 2026-03-24T02:32:01Z
publishDate 2010
publisher Institute of Mathematics, NAS of Ukraine
record_format ojs
resource_txt_mv umjimathkievua/70/2ad5e7992c07e163e9587e988b092a70.pdf
spelling umjimathkievua-article-28752020-03-18T19:39:19Z Best approximation by ridge functions in $L_p$-spaces Найкраще наближення хребтовими функціями в $L_p$-просторах Maiorov, V. E. Майоров, В. Є. We study the approximation of the classes of functions by the manifold $R_n$ formed by all possible linear combinations of $n$ ridge functions of the form $r(a · x))$. It is proved that, for any $1 ≤ q ≤ p ≤ ∞$, the deviation of the Sobolev class $W^r_p$ from the set $R_n$ of ridge functions in the space $L_q (B^d)$ satisfies the sharp order $n^{-r/(d-1)}$. Досліджено наближення класів функцій многовидом $R_n$, що утворений усіма можливими лінійними комбінаціями $n$ хребтових функцій вигляду $r(a · x))$. Доведено, що для будь-яких $1 ≤ q ≤ p ≤ ∞$ відхилення класу Соболева $W^r_p$ від множини $R_n$ хребтових функцій у просторі $L_q (B^d)$характеризується точним порядком $n^{-r/(d-1)}$. Institute of Mathematics, NAS of Ukraine 2010-03-25 Article Article application/pdf https://umj.imath.kiev.ua/index.php/umj/article/view/2875 Ukrains’kyi Matematychnyi Zhurnal; Vol. 62 No. 3 (2010); 396–408 Український математичний журнал; Том 62 № 3 (2010); 396–408 1027-3190 en https://umj.imath.kiev.ua/index.php/umj/article/view/2875/2501 https://umj.imath.kiev.ua/index.php/umj/article/view/2875/2502 Copyright (c) 2010 Maiorov V. E.
spellingShingle Maiorov, V. E.
Майоров, В. Є.
Best approximation by ridge functions in $L_p$-spaces
title Best approximation by ridge functions in $L_p$-spaces
title_alt Найкраще наближення хребтовими функціями в $L_p$-просторах
title_full Best approximation by ridge functions in $L_p$-spaces
title_fullStr Best approximation by ridge functions in $L_p$-spaces
title_full_unstemmed Best approximation by ridge functions in $L_p$-spaces
title_short Best approximation by ridge functions in $L_p$-spaces
title_sort best approximation by ridge functions in $l_p$-spaces
url https://umj.imath.kiev.ua/index.php/umj/article/view/2875
work_keys_str_mv AT maiorovve bestapproximationbyridgefunctionsinlpspaces
AT majorovvê bestapproximationbyridgefunctionsinlpspaces
AT maiorovve najkraŝenabližennâhrebtovimifunkcíâmivlpprostorah
AT majorovvê najkraŝenabližennâhrebtovimifunkcíâmivlpprostorah