Прогнозування часового ряду за моделлю нормалізації

Empirical constructions of time series models based on the reduction of initial data to normally distributed values have been proposed. The goal of a normalization method is to construct an optimal forecast that is linear for the updated data, and the forecasted original data is recovered through th...

Full description

Saved in:
Bibliographic Details
Date:2025
Main Authors: Bondarenko, Viktor, Bondarenko, Valeriia
Format: Article
Language:English
Published: The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" 2025
Subjects:
Online Access:https://journal.iasa.kpi.ua/article/view/336056
Tags: Add Tag
No Tags, Be the first to tag this record!
Journal Title:System research and information technologies
Download file: Pdf

Institution

System research and information technologies
_version_ 1866391928127356928
author Bondarenko, Viktor
Bondarenko, Valeriia
author_facet Bondarenko, Viktor
Bondarenko, Valeriia
author_sort Bondarenko, Viktor
baseUrl_str http://journal.iasa.kpi.ua/oai
collection OJS
datestamp_date 2025-07-25T15:56:08Z
description Empirical constructions of time series models based on the reduction of initial data to normally distributed values have been proposed. The goal of a normalization method is to construct an optimal forecast that is linear for the updated data, and the forecasted original data is recovered through the inverse transformation. The different variants of such transformations have been considered, including the reduction of initial data to Gaussian fractional Brownian motion and a one-dimensional transformation using a strictly monotonic function. The computational experiment based on real data, which allows for a stationary model, confirms the higher quality of the forecast by the normalization method compared to traditional models.
doi_str_mv 10.20535/SRIT.2308-8893.2025.2.07
first_indexed 2025-07-27T04:04:09Z
format Article
fulltext  Publisher IASA at the Igor Sikorsky Kyiv Polytechnic Institute, 2025 106 ISSN 1681–6048 System Research & Information Technologies, 2025, № 2 UDC 519.24 DOI: 10.20535/SRIT.2308-8893.2025.2.07 TIME SERIES FORECASTING USING THE NORMALIZATION MODEL VIKTOR BONDARENKO, VALERIIA BONDARENKO Abstract. Empirical constructions of time series models based on the reduction of initial data to normally distributed values have been proposed. The goal of a normalization method is to construct an optimal forecast that is linear for the updated data, and the forecasted original data is recovered through the inverse transformation. The different variants of such transformations have been considered, including the reduction of initial data to Gaussian fractional Brownian motion and a one-dimensional transformation using a strictly monotonic function. The computational experiment based on real data, which allows for a stationary model, confirms the higher quality of the forecast by the normalization method compared to traditional models. Keywords: optimal forecast, stochastic model, parameter estimation, fractional Brownian motion. PRELIMINARY INFORMATION AND STATEMENT OF THE PROBLEM Time forecasting is defined as estimation the future values of some function of a time variable based on known observations up to the current moment. Other- words, if we observe the trajectory )(tx , Tt 0 , then it is necessary to evaluate the value ,)(sx .  TsT The estimated values are called forecasts and are denoted by ,)(ˆ sx . The stochasticity of the trajectory is an essential circumstance, so )(x does not represent a deterministic function. As a rule, the trajectory values are observed at discrete moments ,0 121 Ttttt nn   that is, the terms of the sequence are known },,,{ 21 nxxx  , where )( kk txx  , and “future” values are to be assessed )(,,)( 1 rnn ttx   ,   TtttT rnnn 21 . If it is known a priori that ) (tx is an implementation of a random process )(t , which corresponds to a finite-dimensional distribution  })(,,)({ 11 mm xtxtP mmmm xxx dyyyyttfdydy m ),..,,,..,( 21121 21    with a density ),,(),,,,,( 111 mmmmm xxfxxttf  , then the optimal forecast for r steps is the conditional average: ,),,,(),,,,,E( ˆ ˆ 211121 1 nnnrnnn rn n xxxxx                 g (1) Time series forecasting using the normalization model Системні дослідження та інформаційні технології, 2025, № 2 107 ).( jj t The coordinates of the conditional mean are calculated by the formula: rnrjjn dydyxxyyypy        1121 ),,,,,(ˆ , where the conditional density  ),,,,,( 121 nr xxyyyp 1 21121 )),,,((),,,,,,(    nnrnnnrn xxxfyyxxxf , rrnrnnn dzdzzzxxxfxxxf        112121 ),,,,,,(),,,( . Optimality means that for random vectors              n  1 ξ ,                rn n  1 η and the fair ratio 22 )ξ(Eηmin)ξ(Eη hg  h , where the function g is defined by the equality (1), ||||  is a Euclidian distance in nR . As a rule, the density of the distribution mf is unknown, and to calculate the forecast, it is necessary to create a stochastic time series model },,,,{ 21 nxxx  that is, to construct a description of the relationship between the values of the series based on previous observations. This model can be described ),,,( 11  kk , where },,,{ 21 n are the sequence of random variables for which the abserva- tion kx is assigned by some value k , and  is a random vector. Note that standard models do not always take into account the specifics of observations and the corresponding forecast turns out to be far from reality (as shown in the example below). The forecast constructions based on traditional time series models are discussed in detail in the reference ]1[ . The preferred option of forecasting is a situation where the time series admits a Gaussian model, i.e.  Q,0~        , where the matrix Q is depicted in the form of        DC BA Q , and block elements DCBA , , , are determined by the ratios: kjjk Ea  ,  ,1 nkj ξ ;);0(~ A knjnkjjk EEd   , );0(~ ,1 Drkj  V. Bondarenko, V. Bondarenko ISSN 1681–6048 System Research & Information Technologies, 2025, № 2 108 and the elements of matrix , , *CBCB  are represent the mutual covariance of ,ξ and η so ;1, 1 . rknjEb knjjk   kjnjk Ec   , .1, 1 nkrj  In this case, the optimal forecast is linear and is determined by the formula: ξξ)(ηEˆ 1 CA│ . (2) In particular, the sample },,{ 1 nxx  can be considered the values of the frac- tional Brownian motion ) (tBH , which is defined as a Gaussian random process with characteristics ,0)(E tBH )( 2 1 )()(E 222 HHH HH ststsBtB  , where 10  H is called a Hurst exponent. The properties of fractional Brownian motion are discussed in [2]. The experience with temporal data shows that identifying a series as fBm values is a rather rare phenomenon. TIME SERIES NORMALIZATION METHOD The idea of the method consists of transforming the original data },,, { 21 nxxx  . into Gaussian },,,{ 21 nuuu  .. In some cases, the original data can be converted to the fractional Brownian motion values. Let there be a continuous one-to-one mapping nn RR  : , ),,,(),,,( 2121 nn uuuxxx  such that the increments of the transformed data ,11 uz  1122 , ,  nnn uuzuuz form a stationary sequence (i.e., they admit a stationary model). The statistics 2 1 2 1 k n k z n    and 1 1 11 1      jj n j zz n are consistent estimates of the variance of the increments and their one-step covariance ([3, 4]). Then consistent estimates of the correlation coefficient and Hurst parameter are calculated by the formulas: , 2   22 )1( 2 1 ln ln H   . The necessary conditions for the hypothesis of “data },,,{ 21 nuuu  form the values of fractional Brownian motion” is the fulfillment of the limiting relations for the statistics , nA , , nn DB nF , which are defined by the following relations: Time series forecasting using the normalization model Системні дослідження та інформаційні технології, 2025, № 2 109 2 31 1 3 1 1 4       kk n k n zu n A ,       2 1 ;0H ; 32 1 1 51 1 1 kk n k Hn zu n B               22 1 ;0~, 3 H ,       2 1 ;0H ; 2 31 1 23 1 1 42       kk n k Hn zu n D , ),1;0(~ ;1; 2 1      H 3 1 3 1 1 k n k Hn z n F    ,3       1; 2 1 H . The proof of the limit relations is contained in  5 . The standard algorithm for testing the specified hypothesis using known H and  is a following: let us assume that the hypothesis is fulfilled and we set the significance level  with comparing the value of the statistic with the tabular value  where .1)( F In particular, for the marginal distribution function of statistics nD (Н )5,0 : 1 3 2 Φ2)(          xxF , where Φ is a Laplace function. Corresponding to the level of significance 05,0 the critical value 6 , and the hypothesis is accepted if .60  nD If the hypothesis )( kHk tBu  is true, then the optimal (linear) forecast for },,,{ 221 nnn uuu  in n steps is calculated by the formula (2):                        nn n u u CA u u  1 1 2 1 ˆ ˆ ,        DC BA Q , jkq )( 222 HHH kjkj  , ,2,1 nkj  and the forecast of the primary variables )ˆ,,ˆ()ˆ, ,ˆ( 21 1 21 nnnn uuxx     . Note that the choice of transformation  is a rather cumbersome procedure. Let us consider another normalization method — the one-dimensional transformation of the real data )( kk xu  . ))((Φ)( 1 xFx   , (3) where dz z x x              2 exp 2 1 )Φ( 2 , F is a distribution function corresponding to the sample },,,{ 21 nxxx  . V. Bondarenko, V. Bondarenko ISSN 1681–6048 System Research & Information Technologies, 2025, № 2 110 Thereby, },,{ 1 nuu  is a sample from the general population  1,0 and the proposed procedure requires stationarity in the narrow sense of the real data and requires an assessment of their distribution. The sample size should not be large to prevent the detection of non-stationarity. Actually, },,,{ 21 nxxx  represents a sample of different random variables )}(,),(),({ 21 nttt  with the same distribution .F This disadvantage is partially compensated by the dependence of the obtained data, which is determined by their correlation matrix. Let us note the following property of the transformation. Proposition. Let  is a strictly increasing function ξ  t is a stationary process in the narrow sense, i.e. ),,,,,(})(,,)({ 112111  nnnnn ttttxxFxtxtP . Then the process ))(()( tt  is also stationary in the narrow sense. Proof. Under the condition of stationarity ),,,,,(})(,,)({ 112111  nnnnn ttttxxFxtxtP . Distribution of the process )(t :  })(,,)({}))((, ,))(({ 1111 nnnn ytytPxtxtP ),,,,,( 1121  nnn ttttyyF , )(1 jj xy  also satisfies the definition of stationarity. Let us formulate a modeling and forecasting algorithm using transfor- mation (3). 1. Checking the data for stationarity, for example, using the Dickey–Fuller criterion and determining the size of the training sample (in the case of checking the adequacy of the model, determine the size of the training plus the predicted sample). 2. Estimation of the distribution function F of random value ξ by the sample },,{ 1 nxx  . 3. Construction of the sample },,{ 1 nuu  . from a normal population )1,0( by the formula ))((Φ 1 kk xFu   . 4. Calculation of the sample correlation coefficients jkk jn k j uu jn      1 1 , with consctruction of correlation matrix Q and defining the forecasting horizon .r 5. Construction of the forecast }ˆ ˆ{ ,,1 rnn uu  of transformed data by the formula (2). 6. Calculating predicted values }ˆ ˆ{ ,,1 rnn xx  of primary data according to the formula ))ˆ(Φ()ˆ(ˆ 11 kkk uFux     . rnn xx  ˆ, ,ˆ 1  . Time series forecasting using the normalization model Системні дослідження та інформаційні технології, 2025, № 2 111 EXAMPLE OF FORECAST CALCULATION The following example illustrates the application of the proposed model: The meteorological data files Precipitation–Florida Climate Center. The sample size is determined by stationarity, which is tested using the Dickey–Fuller criterion. In the given example, stationarity occurs in the interval 401 xx  ; let us put the data into the training sample .301 xx  The data values and their graph are shown in Table 1 and Fig. 1, respectively. T a b l e 1 . The value of the series 401 xx  1x 2x 3x 4x 5x 6x 7x 8x 9x 10x 3.03 3.43 3.54 2.98 2.13 – 2.62 –0.61 –0.15 –2.15 –2.65 11x 12x 13x 14x 15x 16x 17x 18x 19x 20x –3.58 –2.13 –2.32 –2.43 –2.8 –2.42 –3.15 –2.62 –2.81 –2.46 21x 22x 23x 24x 25x 26x 27x 28x 29x 30x –1.2 –0.33 –1.81 –2.18 –0.2 0.6 3.07 5.48 6.34 8.81 31x 32x 33x 34x 35x 36x 37x 38x 39x 40x 6.2 4.04 2.86 1.53 0.702 1.7 3.72 5.33 6.27 3.32 If we consider the given data as values of one random variable, then from the analysis of the sample },,,{ 21 nxxx  the hypothesis follows that this random variable  is distributed by Gumbel’s law:                x xF expexp)( , with the moments: E , 577,0 is a Euler’s constant, 2E 22 2 2 0 )( 6 )(       dzelnz z ,     dzelnzE z3 0 3 )( ,)3(2)( 2 1 )( 3223  ‐6 ‐4 ‐2 0 2 4 6 8 10 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 Fig. 1. Meteorological data )( kk txx  V. Bondarenko, V. Bondarenko ISSN 1681–6048 System Research & Information Technologies, 2025, № 2 112 )(s is a Reamann zeta function, .202,1)3(  The values of sample moments: 0037.0 x , 19.10 30 1 2 30 1 2    k k xx ; 3x 76,33 30 1 3 30 1    k k x , lead to estimates of the distribution parameters 49,2 ; 44,1 , so               49,2 44,1 expexp)( x xF . Density distribution graph 49,2 1 )(  xf )( 49,2 44,1 exp xF x         shown in Fig. 2. Let us construct the transformed data using the formula )(Φ 49,2 44,1 expexpΦ 11 k k k y x u                        , where the values )(Φ 1 ky are calculated using the Laplace function table. The result of the calculations is given in Table 2. T a b l e 2 . Transformed data 401 uu  1u 2u 3u 4u 5u 6u 7u 8u 9u 10u 1.02 1.12 1.14 1.01 0.8 –0.84 –0.03 0.13 –0.63 –0.85 11u 12u 13u 14u 15u 16u 17u 18u 19u 20u –1.32 –0.62 –0.65 –0.75 –0.92 –0.75 –1.09 –0.84 –0.92 –0.76 21u 22u 23u 24u 25u 26u 27u 28u 29u 30u –0.25 0.07 –0.49 –0.64 0.11 0.37 1.03 1.56 1.72 2.13 31u 32u 33u 34u 35u 36u 37u 38u 39u 40u 1.685 –1.25 0.98 0.64 0.4 0.69 1.18 1.52 –1.71 1.09 Fig. 2. Density distribution )(xfξ fζ x Time series forecasting using the normalization model Системні дослідження та інформаційні технології, 2025, № 2 113 The evaluation of correlation coefficients jkk j k j uu j      30 130 1 . Leads to the results and for 7k the coefficients 0k . 1 2 3 4 5 6 7 0.77 0.57 0.43 0.27 0.144 0.036 0.02 The elements of the matrix Q are determined by the ratios:         ,7,0 ,7, kj kj q kj jk 37,1  kj , 10  , , jkjk qa  30,1  kj . The forecast of transformed data in 7 steps is calculated by the formula (2)                       30 1 1 37 31 ˆ ˆ u u CA u u  . The forecasting results are shown in Table 3. T a b l e 3 . The values of forecast in 7 steps 31u 32u 33u 34u 35u 36u 37u 1.685 1.25 0.98 0.64 0.4 0.69 1.18 Forecast 31û 32û 33û 34û 35û 36û 37û 1.682 1.155 0.838 0.443 0.205 0.6 0.32 31x̂ 32x̂ 33x̂ 34x̂ 35x̂ 36x̂ 37x̂ 6.169 3.595 2.267 0.848 0.08 1.396 0.44 where the forecast of the initial data 3731 ˆˆ xx  is calculated by the formula ))ˆ(Φ()ˆ(ˆ 11 kkk uFux     , 44,1)ln(ln49,2)(1   yyF . Let us compare the quality of forecasting using a model, which have constructed using the normalization method and four classical discrete time series models. T a b l e 4 . Comparison of forecast quality Time series values 3731 xx  31x 32x 33x 34x 35x 36x 37x Actual data 6.2 4.04 2.86 1.53 0.702 1.7 3.72 The results of forecast Forecast data 31x̂ 32x̂ 33x̂ 34x̂ 35x̂ 36x̂ 37x̂ Normalization method 6.169 3.595 2.267 0.848 0.08 1.396 0.44 ARMA 7.16 9.53 9.902 7.286 9.672 10.06 9.9 Point Forecast 6.703 5.163 3.89 2,833 1.927 1.142 0.434 GARCH 8.379 8.908 7.437 7.565 7.9 8.02 9.21 ARIMA 8.652 8.063 7.44 6.85 6.328 5.808 4.35 V. Bondarenko, V. Bondarenko ISSN 1681–6048 System Research & Information Technologies, 2025, № 2 114 CONCLUSIONS Given the stationarity of the time series, modeling using the normalization method, which is defined by the relation (3), provides higher forecast quality compared to traditional forecasting methods. REFERENCES 1. P.I. Bidyuk, O.S. Menyaylenko, O.V. Polovtsev, Forecasting methods. Luhansk: Alma Mater, 2008, 604 p 2. Y. Mishura, “Stochastic Calculus for Fractional Brownian Motion and Related Proc- esses,” Lecture Notes in Mathematics 1929, Springer 2008, 393 p. doi: 10.1007/978- 3-540-75873-0 3. K. Kubilius, Y. Mishura, K. Ralchenko, Parameter Estimation in Fractional Diffu- sion Models, vol. 8, 2018, 390 p. doi: https://doi.org/10.1007/978-3-319-71030 4. V.G. Bondarenko, “On some statistics of fractional brownian motion,” System Research & Information Technologies, no. 1, pp. 131–138, 2021. doi: https://doi.org/10.20535/ SRIT.2308-8893.2021.1.11 5. I. Nourdin, Selected Aspects of fractional Brownian motion. Milano: Springer, 2012, 124 p. doi: https://doi.org/10.1007/978-88-470-2823-4 Received 12.04.2024 INFORMATION ON THE ARTICLE Viktor G. Bondarenko, ORCID: 0000-0003-1663-4799, Educational and Research Insti- tute for Applied System Analysis of the National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Ukraine, e mail: bondarenvg@gmail.com Valeriia V. Bondarenko, The University of the Littoral Opal Coast, France, e-mail: vale- ria_bondarenko@yahoo.com ПРОГНОЗУВАННЯ ЧАСОВОГО РЯДУ ЗА МОДЕЛЛЮ НОРМАЛІЗАЦІЇ / В.Г. Бондаренко, В.В. Бондаренко Анотація. Запропоновано емпіричні конструкції моделей часового ряду за схемою зведення первинних даних до нормально розподілених. Метою такого методу нормалізації є побудова оптимального прогнозу, який для оновлених даних є лінійним, а прогнозовані первинні дані відновлюються через обер- нене перетворення. Розглянуто варіанти таких перетворень — зведення пер- винних даних до гаусівського фрактального броунівського руху та одно- вимірне перетворення з використанням строго монотонної функції. Обчислювальний експеримент на базі реальних даних, що допускають стаціо- нарну модель, підтверджує вищу якість прогнозу методом нормалізації порів- няно з традиційними моделями. Ключові слова: оптимальний прогноз, стохастична модель, оцінювання пара- метрів, фрактальний броунівський рух.
id journaliasakpiua-article-336056
institution System research and information technologies
keywords_txt_mv keywords
language English
last_indexed 2025-09-17T09:26:03Z
publishDate 2025
publisher The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
record_format ojs
resource_txt_mv journaliasakpiua/3a/5825d970a62ccc36717f4ec06c67953a.pdf
spelling journaliasakpiua-article-3360562025-07-25T15:56:08Z Time series forecasting using the normalization model Прогнозування часового ряду за моделлю нормалізації Bondarenko, Viktor Bondarenko, Valeriia optimal forecast stochastic model parameter estimation fractional Brownian motion оптимальний прогноз стохастична модель оцінювання параметрів фрактальний броунівський рух Empirical constructions of time series models based on the reduction of initial data to normally distributed values have been proposed. The goal of a normalization method is to construct an optimal forecast that is linear for the updated data, and the forecasted original data is recovered through the inverse transformation. The different variants of such transformations have been considered, including the reduction of initial data to Gaussian fractional Brownian motion and a one-dimensional transformation using a strictly monotonic function. The computational experiment based on real data, which allows for a stationary model, confirms the higher quality of the forecast by the normalization method compared to traditional models. Запропоновано емпіричні конструкції моделей часового ряду за схемою зведення первинних даних до нормально розподілених. Метою такого методу нормалізації є побудова оптимального прогнозу, який для оновлених даних є лінійним, а прогнозовані первинні дані відновлюються через обернене перетворення. Розглянуто варіанти таких перетворень — зведення первинних даних до гаусівського фрактального броунівського руху та одновимірне перетворення з використанням строго монотонної функції. Обчислювальний експеримент на базі реальних даних, що допускають стаціонарну модель, підтверджує вищу якість прогнозу методом нормалізації порівняно з традиційними моделями. The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" 2025-06-28 Article Article application/pdf https://journal.iasa.kpi.ua/article/view/336056 10.20535/SRIT.2308-8893.2025.2.07 System research and information technologies; No. 2 (2025); 106-114 Системные исследования и информационные технологии; № 2 (2025); 106-114 Системні дослідження та інформаційні технології; № 2 (2025); 106-114 2308-8893 1681-6048 en https://journal.iasa.kpi.ua/article/view/336056/324825
spellingShingle оптимальний прогноз
стохастична модель
оцінювання параметрів
фрактальний броунівський рух
Bondarenko, Viktor
Bondarenko, Valeriia
Прогнозування часового ряду за моделлю нормалізації
title Прогнозування часового ряду за моделлю нормалізації
title_alt Time series forecasting using the normalization model
title_full Прогнозування часового ряду за моделлю нормалізації
title_fullStr Прогнозування часового ряду за моделлю нормалізації
title_full_unstemmed Прогнозування часового ряду за моделлю нормалізації
title_short Прогнозування часового ряду за моделлю нормалізації
title_sort прогнозування часового ряду за моделлю нормалізації
topic оптимальний прогноз
стохастична модель
оцінювання параметрів
фрактальний броунівський рух
topic_facet optimal forecast
stochastic model
parameter estimation
fractional Brownian motion
оптимальний прогноз
стохастична модель
оцінювання параметрів
фрактальний броунівський рух
url https://journal.iasa.kpi.ua/article/view/336056
work_keys_str_mv AT bondarenkoviktor timeseriesforecastingusingthenormalizationmodel
AT bondarenkovaleriia timeseriesforecastingusingthenormalizationmodel
AT bondarenkoviktor prognozuvannâčasovogorâduzamodellûnormalízacíí
AT bondarenkovaleriia prognozuvannâčasovogorâduzamodellûnormalízacíí