On the representation by bivariate ridge functions
UDC 517.5 We consider the problem of representation of a bivariate function by sums of ridge functions. It is shown that if a function of a certain smoothness class is represented by a sum of finitely many arbitrarily behaved ridge functions, then it can also be represented by a sum of ridge functio...
Saved in:
| Date: | 2021 |
|---|---|
| Main Authors: | , , , , , |
| Format: | Article |
| Language: | English |
| Published: |
Institute of Mathematics, NAS of Ukraine
2021
|
| Online Access: | https://umj.imath.kiev.ua/index.php/umj/article/view/263 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Journal Title: | Ukrains’kyi Matematychnyi Zhurnal |
| Download file: | |
Institution
Ukrains’kyi Matematychnyi Zhurnal| _version_ | 1860506996746223616 |
|---|---|
| author | Aliev, R. A. Asgarova, A. A. Ismailov, V. E. Aliev, Rashid Asgarova, Aysel Ismailov, Vugar Aliev, R. A. Asgarova, A. A. Ismailov, V. E. |
| author_facet | Aliev, R. A. Asgarova, A. A. Ismailov, V. E. Aliev, Rashid Asgarova, Aysel Ismailov, Vugar Aliev, R. A. Asgarova, A. A. Ismailov, V. E. |
| author_sort | Aliev, R. A. |
| baseUrl_str | https://umj.imath.kiev.ua/index.php/umj/oai |
| collection | OJS |
| datestamp_date | 2025-03-31T08:48:07Z |
| description | UDC 517.5
We consider the problem of representation of a bivariate function by sums of ridge functions. It is shown that if a function of a certain smoothness class is represented by a sum of finitely many arbitrarily behaved ridge functions, then it can also be represented by a sum of ridge functions of the same smoothness class. As an example, this result is applied to a homogeneous constant coefficient partial differential equation. |
| doi_str_mv | 10.37863/umzh.v73i5.263 |
| first_indexed | 2026-03-24T02:02:18Z |
| format | Article |
| fulltext |
DOI: 10.37863/umzh.v73i5.263
UDC 517.5
R. A. Aliev (Inst. Math. and Mech. Nat. Acad. Sci. Azerbaijan, Baku and Baku State Univ., Azerbaijan),
A. A. Asgarova (Azerbaijan Univ. Languages, Baku),
V. E. Ismailov (Inst. Math. and Mech. Nat. Acad. Sci. Azerbaijan, Baku)
ON THE REPRESENTATION BY BIVARIATE RIDGE FUNCTIONS*
ПРО ЗОБРАЖЕННЯ ГРЕБЕНЕВИМИ ФУНКЦIЯМИ ДВОХ ЗМIННИХ
We consider the problem of representation of a bivariate function by sums of ridge functions. It is shown that if a function
of a certain smoothness class is represented by a sum of finitely many arbitrarily behaved ridge functions, then it can
also be represented by a sum of ridge functions of the same smoothness class. As an example, this result is applied to a
homogeneous constant coefficient partial differential equation.
Розглядається задача зображення двовимiрної функцiї сумами гребеневих функцiй. Показано, що коли функцiю
певного класу гладкостi зображено скiнченною сумою гребеневих функцiй довiльної поведiнки, її також можна
зобразити сумою гребеневих функцiй того ж класу гладкостi. Як приклад цей результат застосовано до однорiдного
диференцiального рiвняння з частинними похiдними i зi сталими коефiцiєнтами.
1. Introduction. Last 30 years have seen a growing interest in the study of special multivariate
functions called ridge functions. This interest is due to applicability of such functions in various
research areas. A ridge function is a multivariate function of the form
g(\bfa \cdot \bfx ) = g(a1x1 + . . .+ amxm),
where g : \BbbR \rightarrow \BbbR and \bfa = (a1, . . . , am) is a fixed vector (direction) in \BbbR m\setminus \{ \bfzero \} . These functions and
their linear combinations find applications in computerized tomography (see, e.g., [16, 21, 24]), in
statistics (especially, in the theory of projection pursuit and projection regression; see, e.g., [4, 6]) and
in the theory of neural networks (see, e.g., [7, 9, 11, 23, 28]). Ridge functions are also widely used
in modern approximation theory as an effective and convenient tool for approximating complicated
multivariate functions (see, e.g., [8, 13, 20, 22, 25]). For more on ridge functions and application
areas, see the book [26] and survey papers [10, 12, 19].
It should be remarked that ridge functions have been used in the theory of partial differential
equations under the name of plane waves (see, e.g., [15]). In general, linear combinations of ridge
functions with fixed directions occur in the study of hyperbolic constant coefficient partial differential
equations. For example, assume that (\alpha i, \beta i), i = 1, . . . , r, are pairwise linearly independent vectors
in \BbbR 2. Then the general solution to the homogeneous equation
r\prod
i=1
\biggl(
\alpha i
\partial
\partial x
+ \beta i
\partial
\partial y
\biggr)
u(x, y) = 0 (1.1)
are all functions of the form
* This research was supported by the Science Development Foundation under the President of the Republic of Azerbaijan
(grant no. EIF/MQM/Elm-Tehsil-1-2016-1(26)-71/08/01).
c\bigcirc R. A. ALIEV, A. A. ASGAROVA, V. E. ISMAILOV, 2021
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5 579
580 R. A. ALIEV, A. A. ASGAROVA, V. E. ISMAILOV
u(x, y) =
r\sum
i=1
vi(\beta ix - \alpha iy) (1.2)
for arbitrary univariate functions vi, i = 1, . . . , r, from the class Cr(\BbbR ).
Note that the solution of Eq. (1.1) is the sum of bivariate ridge functions. Sums of bivariate ridge
functions also occur in basic mathematical problems of computerized tomography. For example,
Logan and Shepp [15] (the term “ridge function” was coined by them) considered the problem of
reconstructing a given but unknown function f(x, y) from its integrals along certain lines in the plane.
More precisely, let D be the unit disk in the plane and a function f(x, y) be square integrable and
supported on D. We are given projections Pf (t, \theta ) (integrals of f along the lines x \mathrm{c}\mathrm{o}\mathrm{s} \theta +y \mathrm{s}\mathrm{i}\mathrm{n} \theta = t)
and looking for a function g = g(x, y) of minimum L2 norm, which has the same projections as f :
Pg(t, \theta j) = Pf (t, \theta j), j = 0, 1, . . . , n - 1, where angles \theta j generate equally spaced directions,
i.e., \theta j =
j\pi
n
, j = 0, 1, . . . , n - 1. The authors of [15] showed that this problem of tomography
is equivalent to the problem of L2-approximation of the function f by sums of bivariate ridge
functions with equally spaced directions (\mathrm{c}\mathrm{o}\mathrm{s} \theta j , \mathrm{s}\mathrm{i}\mathrm{n} \theta j), j = 0, 1, . . . , n - 1. They gave a closed-
form expression for the unique function g(x, y) and showed that the unique polynomial P (x, y) of
degree n - 1 which best approximates f in L2(D) is determined from the above n projections of f
and can be represented as a sum of n bivariate ridge functions.
In this paper, we are interested in the problem of smoothness in representation by sums of
bivariate ridge functions with finitely many fixed directions. Assume we are given n pairwise
linearly independent directions (ai, bi), i = 1, . . . , n, in \BbbR 2 and a function F : \BbbR 2 \rightarrow \BbbR of the form
F (x, y) =
n\sum
i=1
gi(aix+ biy). (1.3)
Assume in addition that F is of a certain smoothness class, what can we say about the smoothness of
gi? The case n = 1 is obvious. In this case, if F \in Ck(\BbbR 2), then for a vector (c, d) \in \BbbR 2 satisfying
a1c + b1d = 1 we have that g1(t) = F (ct, dt) is in Ck(\BbbR ). The same argument can be carried out
for the case n = 2. In this case, since the vectors (a1, b1) and (a2, b2) are linearly independent, there
exists a vector (c, d) \in \BbbR 2 satisfying a1c+ b1d = 1 and a2c+ b2d = 0. Therefore, we obtain that the
function g1(t) = F (ct, dt) - g2(0) is in the class Ck(\BbbR ). Similarly, one can verify that g2 \in Ck(\BbbR ).
The picture drastically changes if the number of directions n \geq 3. For n = 3, there are ulti-
mately smooth functions which decompose into sums of very badly behaved ridge functions. This
phenomena comes from the classical Cauchy functional equation. This equation,
f(x+ y) = f(x) + f(y), f : \BbbR \rightarrow \BbbR , (1.4)
looks very simple and has a class of simple solutions f(x) = cx, c \in \BbbR . Nevertheless, it easily
follows from the Hamel basis theory that the Cauchy functional equation has also a large class of
wild solutions. These solutions are called “wild” because they are extremely pathological over reals.
They are, for example, not continuous at a point, not monotone at an interval, not bounded at any set
of positive measure (see, e.g., [1]). Let g be any wild solution of the equation (1.4). Then the zero
function can be represented as
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
ON THE REPRESENTATION BY BIVARIATE RIDGE FUNCTIONS 581
0 = g(x) + g(y) - g(x+ y). (1.5)
Note that the functions involved in (1.5) are bivariate ridge functions with the directions (1, 0), (0, 1)
and (1, 1), respectively. This example shows that for smoothness of the representation (1.3) one must
impose additional conditions on the representing functions gi, i = 1, . . . , n.
Such additional conditions are recently found by Pinkus [27]. He proved that for a large class
of representing functions gi, the representation is smooth. That is, if apriori assume that in the
representation (1.3), the functions gi belong to a certain class of “well behaved functions”, then they
have the same degree of smoothness as the function F. As the mentioned class of “well behaved
functions” one may take, e.g., the set of functions that are continuous at a point, bounded on one
side on a set of positive measure, monotonic at an interval, Lebesgue measurable, etc. (see [27]).
Konyagin and Kuleshov [17] proved that in (1.3) the functions gi inherit smoothness properties of F
(without additional assumptions on gi) if and only if the directions \bfa i are linearly independent. Note
that the results of Pinkus and also Konyagin and Kuleshov are valid not only in bivariate but also in
multivariate case. There are also other results on ridge function representation, which involve certain
convex subsets of the m-dimensional space (see [17, 18]).
In this paper, we study a different aspect of the problem of representation by ridge functions.
Assume in the representation (1.3) F \in Ck(\BbbR 2) but the functions gi are arbitrary. That is, we allow
very badly behaved functions (for example, not continuous at any point). Can we write F as a sum\sum n
i=1
fi(aix+ biy) but with the fi \in Ck(\BbbR ), i = 1, . . . , n? We see that the answer to this question
is positive as expected. For the sake of convenience we state the result over \BbbR 2, but in fact it holds
over any open set in \BbbR 2.
Note that the above problem is not elementary as it seems. There are cases when representation
with good functions is not possible. Such situations happen over closed sets with no interior. In [14],
Ismailov and Pinkus presented an example of a function of the form
F (x, y) = g1(a1x+ b1y) + g2(a2x+ b2y),
that is bounded and continuous on the union of two straight lines but such that both g1 and g2 are
necessarily discontinuous, and thus cannot be replaced with continuous functions f1 and f2.
The result of this paper can be applied to a higher order partial differential equation in two
variables if its solution is given by a sum of sufficiently smooth plane waves (see, for example,
Eq. (1.1)). Based on our theorem below, in this case, one can demand only smoothness of the sum
and dispense with smoothness of the plane wave summands.
2. Main results. We start this section with the following theorem.
Theorem 2.1. Assume that (ai, bi), i = 1, . . . , n, are pairwise linearly independent vectors in
\BbbR 2 and a function F \in Ck(\BbbR 2) has the form
F (x, y) =
n\sum
i=1
gi(aix+ biy), (2.1)
where gi are arbitrary univariate functions and k \geq n - 2. Then F can be represented also in the
form
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
582 R. A. ALIEV, A. A. ASGAROVA, V. E. ISMAILOV
F (x, y) =
n\sum
i=1
fi(aix+ biy).
Here, the functions fi \in Ck(\BbbR ), i = 1, . . . , n.
Proof. Since the vectors (an - 1, bn - 1) and (an, bn) are linearly independent, there is a non-
singular linear transformation S : (x, y) \rightarrow (x
\prime
, y
\prime
) such that S : (an - 1, bn - 1) \rightarrow (1, 0) and S :
(an, bn) \rightarrow (0, 1). Thus, without loss of generality we may assume that the vectors (an - 1, bn - 1) and
(an, bn) coincide with the coordinate vectors e1 = (1, 0) and e2 = (0, 1), respectively. Therefore, to
prove the theorem it is enough to show that if a function F \in Ck(\BbbR 2) is expressed in the form
F (x, y) =
n - 2\sum
i=1
gi(aix+ biy) + gn - 1(x) + gn(y)
with arbitrary gi, then there exist functions fi \in Ck(\BbbR ), i = 1, . . . , n, such that F is expressed also
in the form
F (x, y) =
n - 2\sum
i=1
fi(aix+ biy) + fn - 1(x) + fn(y).
By \Delta
(\delta )
l f we denote the increment of a function f in a direction l = (l\prime , l\prime \prime ). That is,
\Delta
(\delta )
l f(x, y) = f(x+ l\prime \delta , y + l\prime \prime \delta ) - f(x, y).
We also use the notation
\partial f
\partial l
which denotes the derivative of f in the direction l.
It is easy to check that the increment of a ridge function g(ax+by) in a direction perpendicular to
(a, b) is zero. Let l1, . . . , ln - 2 be unit vectors perpendicular to the vectors (a1, b1), . . . , (an - 2, bn - 2)
correspondingly. Then for any set of numbers \delta 1, . . . , \delta n - 2 \in \BbbR we have
\Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
F (x, y) = \Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
\bigl[
gn - 1(x) + gn(y)
\bigr]
. (2.2)
Denote the left-hand side of (2.2) by S(x, y). That is, set
S(x, y)
def
= \Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
F (x, y).
Then from (2.2) it follows that, for any real numbers \delta n - 1 and \delta n,
\Delta (\delta n - 1)
e1 \Delta (\delta n)
e2 S(x, y) = 0,
or, in expanded form,
S(x+ \delta n - 1, y + \delta n) - S(x, y + \delta n) - S(x+ \delta n - 1, y) + S(x, y) = 0.
Putting in the last equality \delta n - 1 = - x, \delta n = - y, we obtain
S(x, y) = S(x, 0) + S(0, y) - S(0, 0).
This means that
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
ON THE REPRESENTATION BY BIVARIATE RIDGE FUNCTIONS 583
\Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
F (x, y) = \Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
F (x, 0)+
+\Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
F (0, y) - \Delta
(\delta 1)
l1
. . .\Delta
(\delta n - 2)
ln - 2
F (0, 0).
By the hypothesis of the theorem, the derivative
\partial n - 2
\partial l1 . . . \partial ln - 2
F (x, y) exists at any point (x, y) \in
\in \BbbR 2. Thus, it follows from the above formula that
\partial n - 2F
\partial l1 . . . \partial ln - 2
(x, y) = h1,1(x) + h2,1(y), (2.3)
where h1,1(x) =
\partial n - 2
\partial l1 . . . \partial ln - 2
F (x, 0) and h2,1(y) =
\partial n - 2
\partial l1 . . . \partial ln - 2
F (0, y) - \partial n - 2
\partial l1 . . . \partial ln - 2
F (0, 0).
Note that h1,1 and h2,1 belong to the class Ck - n+2(\BbbR ).
By h1,2 and h2,2 denote the antiderivatives of h1,1 and h2,1 satisfying the condition h1,2(0) =
= h2,2(0) = 0 and multiplied by the numbers 1/(e1 \cdot l1) and 1/(e2 \cdot l1) correspondingly. That
is,
h1,2(x) =
1
e1 \cdot l1
x\int
0
h1,1(z)dz,
h2,2(y) =
1
e2 \cdot l1
y\int
0
h2,1(z)dz.
Here, e \cdot l denotes the scalar product between vectors e and l. Obviously, the function
F1(x, y) = h1,2(x) + h2,2(y)
obeys the equality
\partial F1
\partial l1
(x, y) = h1,1(x) + h2,1(y). (2.4)
From (2.3) and (2.4) we obtain
\partial
\partial l1
\biggl[
\partial n - 3F
\partial l2 . . . \partial ln - 2
- F1
\biggr]
= 0.
Hence, for some ridge function \varphi 1,1(a1x+ b1y),
\partial n - 3F
\partial l2 . . . \partial ln - 2
(x, y) = h1,2(x) + h2,2(y) + \varphi 1,1(a1x+ b1y). (2.5)
Here, all the functions h2,1, h2,2(y), \varphi 1,1 \in Ck - n+3(\BbbR ).
Set the functions
h1,3(x) =
1
e1 \cdot l2
x\int
0
h1,2(z)dz,
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
584 R. A. ALIEV, A. A. ASGAROVA, V. E. ISMAILOV
h2,3(y) =
1
e2 \cdot l2
y\int
0
h2,2(z)dz,
\varphi 1,2(t) =
1
(a1, b1) \cdot l2
t\int
0
\varphi 1,1(z)dz.
Note that the function
F2(x, y) = h1,3(x) + h2,3(y) + \varphi 1,2(a1x+ b1y)
obeys the equality
\partial F2
\partial l2
(x, y) = h1,2(x) + h2,2(y) + \varphi 1,1(a1x+ b1y). (2.6)
From (2.5) and (2.6) it follows that
\partial
\partial l2
\biggl[
\partial n - 4F
\partial l3 . . . \partial ln - 2
- F2
\biggr]
= 0.
The last equality means that, for some ridge function \varphi 2,1(a2x+ b2y),
\partial n - 4F
\partial l3 . . . \partial ln - 2
(x, y) = h1,3(x) + h2,3(y) + \varphi 1,2(a1x+ b1y) + \varphi 2,1(a2x+ b2y). (2.7)
Here, all the functions h1,3, h2,3, \varphi 1,2, \varphi 2,1 \in Ck - n+4(\BbbR ).
Note that in the left-hand sides of (2.3), (2.5) and (2.7) we have the mixed directional derivatives
of F and the order of these derivatives is decreased by one in each consecutive step. Continuing the
above process, until it reaches the function F, we obtain the desired result.
Theorem 2.1 is proved.
Theorem 2.1 can be applied to Eq. (1.1) as follows.
Corollary 2.1. Assume a function u \in Cr(\BbbR 2) is of the form (1.2) with arbitrarily behaved vi.
Then u is a solution to the equation (1.1).
Note that the method exploited in the proof of Theorem 2.1 enables us to construct the functions
fi, i = 1, . . . , n, by induction. First let us accept some notation. By (\widetilde ap,\widetilde bp), p = 1, . . . , n - 2, denote
the images of vectors (ap, bp) under the linear transformation which takes the vectors (an - 1, bn - 1)
and (an, bn) to the unit vectors e1 = (1, 0) and e2 = (0, 1), respectively. Clearly,
\widetilde ap = apbn - anbp
an - 1bn - anbn - 1
, \widetilde bp = an - 1bp - apbn - 1
an - 1bn - anbn - 1
, p = 1, . . . , n - 2.
Consider the vectors
lp =
\left( \widetilde bp\sqrt{} \widetilde a2p +\widetilde b2p ,
- \widetilde ap\sqrt{} \widetilde a2p +\widetilde b2p
\right) , p = 1, . . . , n - 2.
Note that for p = 1, . . . , n - 2, the vectors lp are perpendicular to the vectors (\widetilde ap,\widetilde bp), respectively.
Consider also the function, which is generated by the above liner transformation
F \ast (x, y) = F
\biggl(
bnx - bn - 1y
an - 1bn - anbn - 1
,
anx - an - 1y
anbn - 1 - an - 1bn
\biggr)
.
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
ON THE REPRESENTATION BY BIVARIATE RIDGE FUNCTIONS 585
Corollary 2.2. The functions fi, i = 1, . . . , n, in Theorem 2.1 can be constructed inductively by
the formulas
fp = \varphi p,n - p - 1, p = 1, . . . , n - 2,
fn - 1 = h1,n - 1, fn = h2,n - 1.
Here,
h1,1(t) =
\partial n - 2
\partial l1 . . . \partial ln - 2
F \ast (t, 0),
h2,1(t) =
\partial n - 2
\partial l1 . . . \partial ln - 2
F \ast (0, t) - \partial n - 2
\partial l1 . . . \partial ln - 2
F \ast (0, 0),
h1,k+1(t) =
1
e1 \cdot lk
t\int
0
h1,k(z)dz, k = 1, . . . , n - 2,
h2,k+1(t) =
1
e2 \cdot lk
t\int
0
h2,k(z)dz, k = 1, . . . , n - 2,
and
\varphi p,1(t) =
\partial n - p - 2F \ast
\partial lp+1 . . . \partial ln - 2
\Biggl( \widetilde apt\widetilde a2p +\widetilde b2p ,
\widetilde bpt\widetilde a2p +\widetilde b2p
\Biggr)
- h1,p+1
\Biggl( \widetilde apt\widetilde a2p +\widetilde b2p
\Biggr)
-
- h2,p+1
\Biggl( \widetilde bpt\widetilde a2p +\widetilde b2p
\Biggr)
-
p - 1\sum
s=1
\varphi s,p - s+1
\Biggl( \widetilde as\widetilde ap +\widetilde bs\widetilde bp\widetilde a2p +\widetilde b2p t
\Biggr)
,
p = 1, . . . , n - 2
\biggl(
for p = n - 2,
\partial n - p - 2F \ast
\partial lp+1 . . . \partial ln - 2
:= F \ast
\biggr)
,
\varphi p,k+1(t) =
1
(\widetilde ap,\widetilde bp) \cdot lk+p
t\int
0
\varphi p,k(z)dz, p = 1, . . . , n - 3, k = 1, . . . , n - p - 2.
The validity of above formulas for the functions h1,k and h2,k, k = 1, . . . , n - 1, is obvious.
The formulas for \varphi p,1 and \varphi p,k+1 can be obtained from (2.5), (2.7) and the subsequent (assumed but
not written) equations if we put x = \widetilde apt/(\widetilde a2p +\widetilde b2p) and y = \widetilde bpt/(\widetilde a2p +\widetilde b2p).
Remark 2.1. If in Theorem 2.1 k \geq n - 1, then the functions fi, i = 1, . . . , n, can be constructed
(up to polynomials) by the method discussed in Buhmann and Pinkus [3]. This method is based on
the fact that for a direction \bfc = (c1, . . . , cm) orthogonal to a given direction \bfa \in \BbbR m\setminus \{ \bfzero \} , the
operator
D\bfc =
m\sum
s=1
cs
\partial
\partial xs
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
586 R. A. ALIEV, A. A. ASGAROVA, V. E. ISMAILOV
acts on m-variable ridge functions g(\bfa \cdot \bfx ) as follows:
D\bfc g(\bfa \cdot \bfx ) = (\bfc \cdot \bfa ) g\prime (\bfa \cdot \bfx ).
Thus, if in our case for fixed r \in \{ 1, . . . , n\} , vectors lk, k \in \{ 1, . . . , n\} , k \not = r, are perpendicular
to the vectors (ak, bk), then
n\prod
k=1
k \not =r
DlkF (x, y) =
n\prod
k=1
k \not =r
Dlk
n\sum
i=1
fi(aix+ biy) =
=
n\sum
i=1
\left( n\prod
k=1
k \not =r
((ai, bi) \cdot lk)
\right) f
(n - 1)
i (aix+ biy) =
=
n\prod
k=1
k \not =r
((ar, br) \cdot lk) f (n - 1)
r (arx+ bry).
Now fr can be easily constructed from the above formula (up to a polynomial of degree at most
n - 2). Note that this method is not valid if in Theorem 2.1 the function F is of the class Cn - 2(\BbbR 2).
However, in this case, Corollary 2.2 is applicable.
Remark 2.2. Some polynomial terms appear while attempting to obtain a smoothness result in
multivariate case. In [2], we proved that if a function f(x1, . . . , xn) of a certain smoothness class
is represented by a sum of r arbitrarily behaved ridge functions, then, under suitable conditions, it
can be represented by a sum of ridge functions of the same smoothness class and some n-variable
polynomial of a certain degree. The appearance of a polynomial term is mainly related to the fact
that in \BbbR n, n \geq 3, there are many directions orthogonal to a given direction. Note that a polynomial
term also appears in verifying if a given function of n variables (n \geq 3) is a sum of ridge functions
(see [5]). However, paralleling the above theorem, we conjecture that if a multivariate function of a
certain smoothness class is represented by a sum of arbitrarily behaved ridge functions, then it can
also be represented by a sum of ridge functions of the same smoothness class.
Remark 2.3. For k \geq n - 1, Theorem 2.1 can be obtained from Theorem 3.1 of [2]. Indeed,
according to Theorem 3.1 [2], the function F in Theorem 2.1 can be represented in the form
F (x, y) =
n\sum
i=1
fi(aix+ biy) + P (x, y), (2.8)
where the functions fi \in Ck(\BbbR ) and P (x, y) is a polynomial of total degree at most n - 1. But it is
known that a bivariate polynomial of degree n - 1 is decomposed into a sum of ridge polynomials
with any given n pairwise linearly independent directions (see, e.g., [21]). That is, in (2.8)
P (x, y) =
n\sum
i=1
pi(aix+ biy),
where pi are univariate polynomials of degree at most n - 1. However, in the setting considered here,
Theorem 2.1 is more informative. It covers the extra case k = n - 2 and allows one to construct the
representing functions fi by induction.
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
ON THE REPRESENTATION BY BIVARIATE RIDGE FUNCTIONS 587
Unfortunately, we do not yet know if the lower assumed degree of smoothness n - 2 in Theo-
rem 2.1 can be reduced. We think that the final and complete solution to the smoothness problem
considered here requires essentially different approach. Nevertheless, we can strengthen our result
by considering Hölder continuous functions.
We say that a function F : \BbbR m \rightarrow \BbbR , m \geq 1, is locally Hölder continuous with degree \alpha ,
0 < \alpha \leq 1, if for any compact set K \subset \BbbR m there is a number M = M (K) > 0 such that for any
\bfx = (x1, . . . , xm) \in K and \bfy = (y1, . . . , ym) \in K the inequality
| F (\bfx ) - F (\bfy )| \leq M \cdot
m\sum
i=1
| xi - yi| \alpha
holds. By Ck,\alpha (\BbbR m) we denote the class of functions in Ck(\BbbR m), kth order partial derivatives of
which are locally Hölder continuous with degree \alpha .
The following theorem is valid.
Theorem 2.2. Assume that (ai, bi), i = 1, . . . , n, are pairwise linearly independent vectors in
\BbbR 2 and a function F \in Ck,\alpha (\BbbR 2) has the form
F (x, y) =
n\sum
i=1
gi(aix+ biy),
where gi are arbitrary univariate functions and k \geq n - 2. Then F can be represented also in the
form
F (x, y) =
n\sum
i=1
fi(aix+ biy).
Here, the functions fi \in Ck,\alpha (\BbbR ), i = 1, . . . , n.
The proof of Theorem 2.2 can be easily obtained from Theorem 2.1. Indeed, to prove The-
orem 2.2 it is only needed to repeat the proof of Theorem 2.1, emphasizing that the functions
appearing in the right-hand sides of formulas (2.3), (2.5), (2.7), etc. belong to certain classes
Cs,\alpha (\BbbR ) with step by step increasing indicator of smoothness s. More precisely, the functions h1,1,
h2,1 \in Ck - n+2,\alpha (\BbbR ), the functions h1,2, h2,2, \varphi 1,1 \in Ck - n+3,\alpha (\BbbR ), the functions h1,3, h2,3, \varphi 1,2,
\varphi 2,1 \in Ck - n+4,\alpha (\BbbR ), etc.
Theorem 2.1 implies that if a function F \in Ck(\BbbR 2) has the form (2.1) and k \geq n - 2,then all the
partial derivatives of F up to order k are representable as a sum of ridge functions with the given
directions (ai, bi), i = 1, . . . , n. Note that the validity of Theorem 2.1 for other possible k strongly
depends on answers to the following two questions.
Question 1. Assume a function F \in Ck(\BbbR 2) is of form (2.1). Are the first order partial deriva-
tives \partial F/\partial x and \partial F/\partial y representable as a sum of arbitrarily behaved ridge functions with the
directions (ai, bi)?
Question 2. Assume a function F \in C(\BbbR 2) is of form (2.1). Is it true that F can be represented
also in the form
\sum n
i=1
fi(aix+ biy) with continuous fi?
Indeed, a positive answer to Question 1 would mean, by induction, that all the partial derivatives
up to order k and hence any mixed directional derivative \partial kF/\partial l1 . . . \partial lk are represented by a sum\sum n
i=1
gi(aix+ biy), where gi are arbitrary univariate functions. Once we could answer Question 2
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
588 R. A. ALIEV, A. A. ASGAROVA, V. E. ISMAILOV
affirmatively, we would immediately obtain that any derivative \partial kF/\partial l1 . . . \partial lk is also written in the
form
\sum n
i=1
fi(aix+ biy) with continuous fi. Then by choosing the directions l1, . . . , lk orthogonal
to the first k directions (ai, bi), i = 1, . . . , n, and applying the above method (see the proof of
Theorem 2.1) we could conclude that F has representation
\sum n
i=1
fi(aix+ biy), where fi \in Ck(\BbbR ).
It should be remarked that Question 2 is a part of the more general question posed in [26, p. 14].
References
1. J. Aczél, Functional equations and their applications, Acad. Press, New York (1966).
2. R. A. Aliev, V. E. Ismailov, On a smoothness problem in ridge function representation, Adv. Appl. Math., 73,
154 – 169 (2016).
3. M. D. Buhmann, A. Pinkus, Identifying linear combinations of ridge functions, Adv. Appl. Math., 22, 103 – 118
(1999).
4. E. J. Candès, Ridgelets: estimating with ridge functions, Ann. Statist., 31, 1561 – 1599 (2003).
5. P. Diaconis, M. Shahshahani, On nonlinear functions of linear combinations, SIAM J. Sci. and Stat. Comput., 5,
175 – 191 (1984).
6. J. H. Friedman, W. Stuetzle, Projection pursuit regression, J. Amer. Statist. Assoc., 76, 817 – 823 (1981).
7. N. J. Guliyev, V. E. Ismailov, On the approximation by single hidden layer feedforward neural networks with fixed
weights, Neural Networks, 98, 296 – 304 (2018).
8. V. E. Ismailov, Characterization of an extremal sum of ridge functions, J. Comput. and Appl. Math., 205, 105 – 115
(2007).
9. V. E. Ismailov, Approximation by neural networks with weights varying on a finite set of directions, J. Math. Anal.
and Appl., 389, 72 – 83 (2012).
10. V. E. Ismailov, A review of some results on ridge function approximation, Azerb. J. Math., 3, № 1, 3 – 51 (2013).
11. V. E. Ismailov, Approximation by ridge functions and neural networks with a bounded number of neurons, Appl.
Anal., 94, 2245 – 2260 (2015).
12. V. E. Ismailov, Approximation by sums of ridge functions with fixed directions (in Russian), Algebra i Analiz, 28,
№ 6, 20 – 69 (2016).
13. V. E. Ismailov, A note on the equioscillation theorem for best ridge function approximation, Expo. Math., 35, № 3,
343 – 349 (2017).
14. V. E. Ismailov, A. Pinkus, Interpolation on lines by ridge functions, J. Approx. Theory, 175, 91 – 113 (2013).
15. F. John, Plane waves and spherical means applied to partial differential equations, Intersci., New York (1955).
16. I. Kazantsev, Tomographic reconstruction from arbitrary directions using ridge functions, Inverse Problems, 14,
635 – 645 (1998).
17. S. V. Konyagin, A. A. Kuleshov, On the continuity of finite sums of ridge functions (in Russian), Mat. Zametki, 98,
308 – 309 (2015).
18. S. V. Konyagin, A. A. Kuleshov, On some properties of finite sums of ridge functions defined on convex subsets of
\BbbR n (in Russian), Tr. Mat. Inst. Steklova, 293 (2016).
19. S. V. Konyagin, A. A. Kuleshov, V E. Maiorov, Some problems in the theory of ridge functions (in Russian), Tr. Mat.
Inst. Steklova, 301 (2018).
20. A. Kroó, On approximation by ridge functions, Constr. Approx., 13, 447 – 460 (1997).
21. B. F. Logan, L. A. Shepp, Optimal reconstruction of a function from its projections, Duke Math. J., 42, 645 – 659
(1975).
22. V. E. Maiorov, On best approximation by ridge functions, J. Approx. Theory, 99, 68 – 94 (1999).
23. V. Maiorov, A. Pinkus, Lower bounds for approximation by MLP neural networks, Neurocomputing, 25, 81 – 91
(1999).
24. F. Natterer, The mathematics of computerized tomography, Wiley, New York (1986).
25. P. P. Petrushev, Approximation by ridge functions and neural networks, SIAM J. Math. Anal., 30, 155 – 189 (1998).
26. A. Pinkus, Ridge functions, Cambridge Tracts Math., 205, Cambridge Univ. Press, Cambridge (2015).
27. A. Pinkus, Smoothness and uniqueness in ridge function representation, Indag. Math. (N.S.), 24, № 4, 725 – 738
(2013).
28. A. Pinkus, Approximation theory of the MLP model in neural networks, Acta Numer., 8, 143 – 195 (1999).
Received 11.09.18
ISSN 1027-3190. Укр. мат. журн., 2021, т. 73, № 5
|
| id | umjimathkievua-article-263 |
| institution | Ukrains’kyi Matematychnyi Zhurnal |
| keywords_txt_mv | keywords |
| language | English |
| last_indexed | 2026-03-24T02:02:18Z |
| publishDate | 2021 |
| publisher | Institute of Mathematics, NAS of Ukraine |
| record_format | ojs |
| resource_txt_mv | umjimathkievua/cd/4f348223a1b3a48fe7841739876997cd.pdf |
| spelling | umjimathkievua-article-2632025-03-31T08:48:07Z On the representation by bivariate ridge functions On the representation by bivariate ridge functions On the representation by bivariate ridge functions Aliev, R. A. Asgarova, A. A. Ismailov, V. E. Aliev, Rashid Asgarova, Aysel Ismailov, Vugar Aliev, R. A. Asgarova, A. A. Ismailov, V. E. Cauchy functional equation ridge function plane wave representation smoothness Cauchy functional equation ridge function plane wave representation smoothness UDC 517.5 We consider the problem of representation of a bivariate function by sums of ridge functions. It is shown that if a function of a certain smoothness class is represented by a sum of finitely many arbitrarily behaved ridge functions, then it can also be represented by a sum of ridge functions of the same smoothness class. As an example, this result is applied to a homogeneous constant coefficient partial differential equation. УДК 517.5 Про зображення гребеневими функцiями двох змiнних Розглядається задача зображення двовимiрної функцiї сумами гребеневих функцiй. Показано, що коли функцiю певного класу гладкостi зображено скiнченною сумою гребеневих функцiй довiльної поведiнки, її також можна зобразити сумою гребеневих функцiй того ж класу гладкостi. Як приклад цей результат застосовано до однорiдного диференцiального рiвняння з частинними похiдними i зi сталими коефiцiєнтами. Institute of Mathematics, NAS of Ukraine 2021-05-24 Article Article application/pdf https://umj.imath.kiev.ua/index.php/umj/article/view/263 10.37863/umzh.v73i5.263 Ukrains’kyi Matematychnyi Zhurnal; Vol. 73 No. 5 (2021); 579 - 588 Український математичний журнал; Том 73 № 5 (2021); 579 - 588 1027-3190 en https://umj.imath.kiev.ua/index.php/umj/article/view/263/9011 Copyright (c) 2021 Vugar Ismailov |
| spellingShingle | Aliev, R. A. Asgarova, A. A. Ismailov, V. E. Aliev, Rashid Asgarova, Aysel Ismailov, Vugar Aliev, R. A. Asgarova, A. A. Ismailov, V. E. On the representation by bivariate ridge functions |
| title | On the representation by bivariate ridge functions |
| title_alt | On the representation by bivariate ridge functions On the representation by bivariate ridge functions |
| title_full | On the representation by bivariate ridge functions |
| title_fullStr | On the representation by bivariate ridge functions |
| title_full_unstemmed | On the representation by bivariate ridge functions |
| title_short | On the representation by bivariate ridge functions |
| title_sort | on the representation by bivariate ridge functions |
| topic_facet | Cauchy functional equation ridge function plane wave representation smoothness Cauchy functional equation ridge function plane wave representation smoothness |
| url | https://umj.imath.kiev.ua/index.php/umj/article/view/263 |
| work_keys_str_mv | AT alievra ontherepresentationbybivariateridgefunctions AT asgarovaaa ontherepresentationbybivariateridgefunctions AT ismailovve ontherepresentationbybivariateridgefunctions AT alievrashid ontherepresentationbybivariateridgefunctions AT asgarovaaysel ontherepresentationbybivariateridgefunctions AT ismailovvugar ontherepresentationbybivariateridgefunctions AT alievra ontherepresentationbybivariateridgefunctions AT asgarovaaa ontherepresentationbybivariateridgefunctions AT ismailovve ontherepresentationbybivariateridgefunctions |