Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги

Sequential personalized recommendations, such as next best offer prediction or modeling demand evolution for next basket prediction, remain a key challenge for businesses. In recent years, deep learning models have been applied to solve these problems and demonstrated high feasibility. With the intr...

Повний опис

Збережено в:
Бібліографічні деталі
Дата:2025
Автори: Androsov, Dmytro, Nedashkovskaya, Nadezhda
Формат: Стаття
Мова:Англійська
Опубліковано: The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" 2025
Теми:
Онлайн доступ:https://journal.iasa.kpi.ua/article/view/329313
Теги: Додати тег
Немає тегів, Будьте першим, хто поставить тег для цього запису!
Назва журналу:System research and information technologies
Завантажити файл: Pdf

Репозитарії

System research and information technologies
_version_ 1867334449646010368
author Androsov, Dmytro
Nedashkovskaya, Nadezhda
author_facet Androsov, Dmytro
Nedashkovskaya, Nadezhda
author_institution_txt_mv [ { "author": "Dmytro Androsov", "institution": "Educational and Research Institute for Applied System Analysis of the National Technical University of Ukraine \"Igor Sikorsky Kyiv Polytechnic Institute\", Kyiv" }, { "author": "Nadezhda Nedashkovskaya", "institution": "Educational and Research Institute for Applied System Analysis of the National Technical University of Ukraine \"Igor Sikorsky Kyiv Polytechnic Institute\", Kyiv" } ]
author_sort Androsov, Dmytro
baseUrl_str http://journal.iasa.kpi.ua/oai
collection OJS
datestamp_date 2025-05-20T17:56:07Z
description Sequential personalized recommendations, such as next best offer prediction or modeling demand evolution for next basket prediction, remain a key challenge for businesses. In recent years, deep learning models have been applied to solve these problems and demonstrated high feasibility. With the introduction of graph-based deep learning, it has become easier to perform collaborative filtering and link prediction tasks. The current paper proposes a new method of building a recommender system using a graph representation learning framework in combination with deep neural networks for sequence-to-sequence modeling and statistical learning for sequence-to-graph mapping. Benchmarking model performance on an online retail store visits dataset provides evidence of the method’s ranking capabilities.
doi_str_mv 10.20535/SRIT.2308-8893.2025.1.01
first_indexed 2025-07-17T10:28:43Z
format Article
fulltext  Publisher IASA at the Igor Sikorsky Kyiv Polytechnic Institute, 2025 Системні дослідження та інформаційні технології, 2025, № 1 7 TIДC ТЕОРЕТИЧНІ ТА ПРИКЛАДНІ ПРОБЛЕМИ ІНТЕЛЕКТУАЛЬНИХ СИСТЕМ ПІДТРИМАННЯ ПРИЙНЯТТЯ РІШЕНЬ UDC 004.85 DOI: 10.20535/SRIT.2308-8893.2025.1.01 THE HYBRID SEQUENTIAL RECOMMENDER SYSTEM SYNTHESIS METHOD BASED ON ATTENTION MECHANISM WITH AUTOMATIC KNOWLEDGE GRAPH CONSTRUCTION D.V. ANDROSOV, N.I. NEDASHKOVSKAYA Abstract. Sequential personalized recommendations, such as next best offer predic- tion or modeling demand evolution for next basket prediction, remain a key chal- lenge for businesses. In recent years, deep learning models have been applied to solve these problems and demonstrated high feasibility. With the introduction of graph-based deep learning, it has become easier to perform collaborative filtering and link prediction tasks. The current paper proposes a new method of building a re- commender system using a graph representation learning framework in combination with deep neural networks for sequence-to-sequence modeling and statistical learn- ing for sequence-to-graph mapping. Benchmarking model performance on an online retail store visits dataset provides evidence of the method’s ranking capabilities. Keywords: recommender system, graph neural network, graph embeddings. INTRODUCTION First, it is necessary to coin the term “recommender system” to reach a common understanding of this notion throughout the following work. A recommendation should reflect the strong relationship between user activities and item relations [1]. As an example, a user’s preference for a historical documentary is highly cor- related with dedicating more time to watch another documentary or other educa- tional program, rather than an action film [1]. These types of relations can be set explicitly by the group of experts on a level of high granularity or determined in a data-driven way on an item level. A recommender system (RS) is a set of statistical models that considers the certain user’s interaction history, knowledge about this user and each item from the available interactions, and provides relevant content (recommendation) [2]. Relevancy means an ordering relation of the user’s likelihood to interact with the set of available items. Hence there exists a broad range of recommender ap- proaches, such as non-personalized, semi-personalized, and personalized [2]. In the scope of current work, we focus on the development of the personalized RS, thus terms “recommender system” and “personalized recommender system” along with their abbreviations are interchangeable. D.V. Androsov, N.I. Nedashkovskaya ISSN 1681–6048 System Research & Information Technologies, 2025, № 1 8 In recent years, personalized RS have achieved significant success across various real-world applications, including e-commerce platforms, streaming me- dia, and online retail industries. A particularly notable area of RS application is the next best offer (NBO) recommendation task, which involves predicting which item a user will view or purchase after interacting with the platform. NBO, also known as next best action (NBA) [3], or generalized as next- basket recommendation (NBR) [4], is a prevalent use case for any enterprise en- gaged in business-to-consumer (B2C) operations. Marketing teams in such enter- prises have been implementing NBO/NBA projects for many years. However, many of these projects fail to deliver the expected returns [3]. This lack of per- formance can be caused by several factors: the use of traditional methods, failure to retrain NBO models with new sets of features (resulting in underutilization of both breadth and depth of available data), lack of effective campaign validation methods, technological deficiencies, etc. The advent of machine learning has provided a renewed perspective on NBO/NBR. There is now an opportunity to utilize these technologies, along with comprehensive data, to enhance and optimize basket recommendations more ef- fectively than before. As an example, by leveraging deep learning approaches, the delivery of per- sonalized offers and recommendations was significantly enhanced, overall dem- onstrating improvements in customer engagement. These enhancements can lead to increased customer satisfaction and loyalty, which ultimately drive higher sales and revenue for the business [4; 5]. RELATED WORK The sequential recommendation task and the next best action task aim to produce such a recommended item set that satisfies the condition of relevance given the most recent user interactions. More strictly, given user Uu , user session vector Ss , where S is the collection of subsets of item set S , candidate item set dII  , and the content-dependent relevancy function ]1,0[:  dISUR , Nd , one can formulate the sequential recommendation task as: ),,(maxarg* dd IsuRI  , where R could be any real-valued function, but usually the likelihood function ))(|,(),,(  dd IsupIsuR is applied [6]. Since user sessions could be of arbitrary length, the objective of capturing long-term dependencies between session items and candidate ones is a key one. To accomplish this task, models based on high-order Markov chains were proposed, such as context tree models (CT) [6; 7] and Markov chain similarity models [8]. Context trees construct a partition tree for each user session and then define a high-order Markov chain by traversing this tree [7]. As an alternative, the combination of high-order Markov chains with similar- ity-based methods, such as sparse linear methods (SLIM) and factored item simi- larity models (FISM) allows capturing short-term and long-term user-item and item-item relations simultaneously [8]. The hybrid sequential recommender system synthesis method based on attention… Системні дослідження та інформаційні технології, 2025, № 1 9 Besides Markov chain models, deep learning models are widely used for solving sequential recommendation tasks. Among all deep learning architectures, recurrent neural networks (RNN) are most widely chosen for accomplishing the sequential recommendation problem, especially long short-term memory net- works (LSTM), and their modifications, such as bi-directional LSTM [9; 10]. Similarly to Markov chain-based hybrid approaches, LSTM networks are often used to capture session-level patterns. Thus, to enrich the proposed recommenda- tions with learned long-term behavioural patterns, rule-based recommender de- sign is applied [9]. Alternatively, bidirectional LSTM model was leveraged to infer recommendation rules based on current and previous user sessions [10]. Recently, a relatively new family of neural networks was applied for this task, called attention networks [11]. Attention networks are ubiquitously applied in recommendation retrieval tasks, e.g. hierarchical attention networks, that con- siders inputs of user-item and item-item interactions to predict further user ac- tions [12] or stochastic self-attention networks to produce next recommendation candidates [13]. MATERIALS AND METHODOLOGY For solving the sequential recommendation retrieval task, it is proposed to rely on graph neural network (GNN) framework. GNNs are a family of deep neural networks capable to perform inference on data structured as graphs [14; 15; 17]. During the training process each vertex Vv of graph ),( EVG updates its representation by aggregating features from its neighbors. This process can be formalized as:             )( )()()()1( vNu kk u kk v bhWh , where )(k vh is the feature vector of vertex v at layer k, N(v) represents the neigh- bors of v, )(kW and )(kb are the layer-specific weights and biases, and σ is a non- linear activation function. After choosing the framework, it is necessary to somehow interpret the input sequences for the network in form of a graph. It is intuitively understood, that user-item, user-user and item-item relations could be naturally represented as a graph ),,,( fwEVGG  , where IUV , is a set of users and items (i.e. vertices of a graph), ),|),(( IiUuiuE  is an edge set, and :f E w is a mapping from edge set to edge weights Rw . Thus, geometrical deep learning frameworks, such as graph neural networks (GNN) are applied to solve recommendation retrieval tasks [16]. Embedding of such graph, i.e. dense vector collection of node relations could be obtained by processing G through graph convolution network (GCN) [15] and attention network [14]. As an example, [14] combines attention mecha- nism with graph convolutional networks [15] to capture embeddings of user-item graph and produce next item recommendation for a given user. GCN updates node features by aggregating features from neighboring nodes, using the following formula [15]: D.V. Androsov, N.I. Nedashkovskaya ISSN 1681–6048 System Research & Information Technologies, 2025, № 1 10        )()(2 1 2 1)1( ~~~ kkk WhDADh , where IAA  ~ is the adjacency matrix A with added self-loops (identity matrix I ); D ~ is the degree matrix corresponding to A ~ ; )(kh represents the node features at layer k; )(kW is the weight matrix at layer k;  is an activation function. By further considering the assumption that user preferences are non-static in long-term perspective, it is proposed to add to graph G (continuous) time dimen- sion t and obtain in such way dynamic graph ),,,( )()()()( fwEVG tttt . In such case, every user session at time t is generated by traversing the graph )(tG . However, synthesis of such graph at a step t = 0 is required based on previous interactions. For solving the task above, it is proposed to use rule mining algorithms, such as computing pointwise mutual information (PMI) [18] between each pair of bought/seen items, FP-Growth trees [19] and TF-IDF metric [20] for selecting the most relevant item combinations in the context of each session. Pointwise mutual information for knowledge graph synthesis Consider a set of user sessions/transactions },...,,{ 21 msssD  , where each trans- action js is a set of items },...,,{ 21 kiii from a larger set of items i . Let )(ip denote the probability of item i occurring in a transaction, and ),( jip denote the joint probability of items i and j co-occurring in the same trans- action. These probabilities can be estimated as follows: D D ip i)( ; D D jip ji, ),(  , where || iD is the number of transactions containing item i, and || , jiD is the number of transactions containing both i and j. The PMI between two items i and j is calculated using the formula: )()( ),( log),( jpip jip jiPMI  . Then by introducing the threshold ε and indicator function ),( jiPMI1 , one can construct a knowledge graph ),,,( ),( )0( fEVGG jiPMI  1 . FP-Growth algorithm for rule mining and knowledge graph synthesis The FP-Growth (Frequent Pattern Growth) algorithm is a widely used method for mining frequent itemsets in large datasets, particularly in the context of associa- tion rule learning. The FP-Growth algorithm is more efficient and scalable com- pared to other rule mining algorithms like Apriori, since it does not explicitly generate itemsets, but builds FP-Growth tree for this purpose [19]. Consider a set of user sessions },...,,{ 21 msssD  , the FP-tree G is defined as: ))}(),(),((|{ gChildgSuppgNameggG  , The hybrid sequential recommender system synthesis method based on attention… Системні дослідження та інформаційні технології, 2025, № 1 11 where each node g is a structure storing the node’s name (Name(g)), its support value (Supp(g)), and a set of references to its child nodes (Child(g)). Items Ii are vertices of the FP-tree. The path from the root g0 to a vertex g represents a set of items IF  . Let }|{)( igGgiG  be the set of vertices corresponding to item Ii . The support of item i is calculated as follows:    )( )()( iGg gSuppiSupp . To construct the FP-tree, items Ii are ordered in descending order of their support Supp(i). Items with support below the minimum threshold Suppmin are excluded from further consideration. For each element in each sorted transaction in the initial session/transaction database (TDB), the FP-tree nodes are con- structed as follows:  If a descendant of the current node exists that contains the current ele- ment, no new node is created, and the support of this descendant is incremented by one.  Otherwise, a new descendant node is created with support initialized to one.  The newly found or created node becomes the current node. Thus, the levels of the FP-tree correspond to items ordered by descending support values Supp(i), resulting in a specific order for the set of items. For FP-Growth trees, it is proposed to clip the maximum height of tree at 2, thus limiting the maximum frequent pattern length to be 2. Then, the graph )0(G is directly obtained from the frequent patterns of lengths 1 and 2, and the weight mapping is defined as follows: )(: eSuppew  . Leveraging TF-IDF for knowledge graph construction Term Frequency-Inverse Document Frequency (TF-IDF) is a metric that is com- monly used to evaluate the importance of a word in a document from a collection of documents, called corpus. It is widely used in text mining and information re- trieval to identify significant words in documents [20]. The term frequency TF(t,d) is the number of times a term t appears in a document d. It is often nor- malized to prevent bias towards longer documents: }ˆ:ˆ{ }:{ ),( dtt dtt dtTF    , where }:{ dtt  is the number of times term t appears in document d, and }ˆ:ˆ{ dtt  is the total number of terms in document d. The inverse document fre- quency IDF(t,D) measures how important a term is across the corpus: }:{ log),( dtDd D DtIDF   , where }:{ dtDd  is the quantity of documents that contain the term t. Note that for the IDF ),( Dt it is encouraged to use logarithm of inverse-document fre- quency, hence rare items will not receive too big values of TF-IDF score. D.V. Androsov, N.I. Nedashkovskaya ISSN 1681–6048 System Research & Information Technologies, 2025, № 1 12 After obtaining the latter measures, the TF-IDF score for a term t in a docu- ment d is retrieved by multiplying the latter quantities: ),(),(),,( DtIDFdtTFDdtIDFTF  . It is proposed to apply the TF-IDF metric to each pair of user sessions items sji ),( and by interpreting the tuple ),( ji as a single term in document u. After the following operation, by introducing the threshold ε and indicator function  ),( jiIDFTF1 , one can construct a knowledge graph ),,,( ),( )0( fEVGG jiPMI  1 . Graph Neural Network training algorithm After obtaining such graph )0(G , it is proposed to create a deep neural network, which algorithm of fitting is shown on Fig. 1. The loss function in the context of the proposed algorithm is the cross- entropy function: )(),( trueetruepredicted eEeeH predicted  , where )(qE p is the expected value of random variable q with respect to the ran- dom variable p . For the top-k retrieval phase for the given user u and its session s , it is pro- posed to calculate affinity scores between embedding each item Ii and re- trieved embedding representation of the pair su, . In the scope of the current approach, it is proposed to use the cosine similarity function: ji ji ji   T ),(cos . Warm embedding initialization Since the proposed GNN operates with the adjacency matrix of a knowledge graph )(tG and user and item properties’ embeddings UE an IE , respectively, it Fig. 1. GNN training algorithm The hybrid sequential recommender system synthesis method based on attention… Системні дослідження та інформаційні технології, 2025, № 1 13 is crucial to initialize such embedding matrices in a way to preserve the relation- ship between embeddings, depicted in the original graph. Thus, it is recom- mended to leverage warm embedding initialization technique before starting model training. Warm embedding initialization is the practice of initializing the embeddings with pre-trained values rather than random values. This technique leverages prior knowledge to potentially enhance the model’s performance, particularly during the initial stages of training. One can pre-compute embeddings by using such natural language processing (NLP) models as Word2Vec [21], GloVe [22], and FastText [23]. Since it is necessary to depict the relationships formulated in knowledge graph G(0), it is proposed to pre-compute embeddings for G(0) using Node2Vec algorithm [24]. The main idea is to generate random walks on the graph and treat these walks as sentences, where each node corresponds to a lexeme, e.g. a word or item identifier. Node2Vec generates biased random walks starting from each node to ex- plore the graph. The algorithm introduces two parameters, p and q:  p is a return parameter, which controls the likelihood of revisiting a node in the walk. A higher value makes it less likely to revisit the previous node, pro- moting exploration.  q is an in-out parameter, which controls whether the likelihood of visiting nodes is close or far from the starting node. A higher value biases the walk towards breadth-first search, while a lower value biases it towards depth-first search. Once the random walks are generated, Node2Vec utilizes the same Skip- Gram approach of learning lexeme embedding, as in the original Word2Vec ar- chitecture [24]. The objective is to maximize the following probability of observ- ing a node’s neighborhood given its embedding:    Vu uNv uvp )( )|(logmax , where V is the set of nodes, )(uN is the neighborhood of node u, and )|( uvp is the probability of node v being a neighbor of node u, modeled using the embeddings. EXPERIMENTS AND RESULTS Dataset description For the experiment purposes, the dataset was selected from an e-commerce con- sumer electronics retail platform. The schema of the data is presented in the Table 1. T a b l e 1 . Dataset fields description Column name Column type Column description user_id unsigned bigint user identifier product_id unsigned bigint product identifier category_id unsigned bigint item category identifier event_type string interaction type (view, purchase) user_session base64 user session identifier D.V. Androsov, N.I. Nedashkovskaya ISSN 1681–6048 System Research & Information Technologies, 2025, № 1 14 The dataset for evaluation of proposed method is a two-month snapshot from an anonymous e-commerce store database and has approximately 69 million re- cords and 6.5 million sessions (9 GB of raw data). Models’ configurations For the sake of qualitative assessment of experiment results, i.e. understanding the potential benefits of leveraging the proposed approach, it was decided to use a LSTM-based deep neural network as a baseline model for sequential recom- mendation. For the experiment purposes, the following models’ configuration was se- lected after performing hyperparameters tuning:  embedding dimension size – 64;  number of multi-layer perceptron layers – 2;  prediction per session – 1 item, i.e. next best offer prediction;  neighbor decay parameter α – 0.5;  training step – 0.01;  number of epochs – 20. For Node2Vec algorithm, the following configuration was set:  embedding dimension size – 64;  number of random walk stages – 3;  window size for skip-grams generation – 10;  number of negative samples per skip-gram sequence – 5;  training step – 0.01;  number of epochs – 5. For automated knowledge graph generation for different approaches, the given thresholds for minimum values were selected:  for FP-Growth algorithm, the minimum support rate was set at 0.05% (which roughly correspond to be 1 in 3450 sessions);  for TF-IDF filtering, the threshold for TF-IDF score for each item was set at 0.3;  for PMI-based algorithm, the minimum PMI was set at the value of 12. In order to evaluate the quality of recommendation retrieval, the most popular ranking metrics Mean Average Precision@k and Negative Discounted Cumulative Gain@k were chosen along with average training time per epoch in seconds. Results and discussions The results of model evaluation over the dataset are shown in Table 2. T a b l e 2 . Model evaluation Model Avg. time per epoch, s MAP@1 MAP@10 NDCG@1 NDCG@10 baseline LSTM 0.2670 0.0389 0.0937 0.0388 0.1217 proposed GNN w. PMI 610.02 0.1442 0.2551 0.1444 0.2988 proposed GNN w. TF-IDF 115.08 0.0844 0.1622 0.0845 0.1960 proposed GNN w. FP-Growth 53.920 0.0399 0.0887 0.0378 0.1150 The hybrid sequential recommender system synthesis method based on attention… Системні дослідження та інформаційні технології, 2025, № 1 15 Overall, GNNs performs significantly better than baseline LSTM, despite their training time being slower more than 200 times. Such slowdown is caused by traversing each time the synthesized knowledge graph G(t). From the compari- son above one can decide to stick either with TF-IDF option which provides the compromise in terms of both accuracy and learning time, or experimenting with PMI clipping for achieving the 4x increase in recommendation retrieval accuracy in the scope of next best action prediction. Using FP-Growth algorithm for auto- matic knowledge graph construction wasn’t beneficial nor in accuracy of predic- tions, offering the same results as the baseline LSTM model, but yet being much slower. The evolution of the MAP@1 metric for all models is depicted on Fig. 2. The evolution of the MAP@10 metric for all models is depicted on Fig. 3. On the other hand, the evolution of the NDCG@1 and NDCG@10 metric for all models are depicted on Fig. 4 and Fig. 5, respectively. Fig. 2. MAP@1 per epoch Fig. 3. MAP@10 per epoch D.V. Androsov, N.I. Nedashkovskaya ISSN 1681–6048 System Research & Information Technologies, 2025, № 1 16 By examining the results one can conclude that the proposed approach is feasible to use in the context of the next-best-offer recommendation. CONCLUSIONS In the current research the problem of personalized sequential recommendations was addressed. By conducting a literature review on a given topic, it was discovered that by leveraging hybrid approaches, including Context Trees and high-order Markov chains, the problem of sequential recommendation was solved, but with a severe limitation — order of items to consider depends on order of Markov chain. By introduction of deep learning and leveraging the ability of (artificial) neu- ral networks to be a universal approximators, to solve the given problem, deep recurrent neural networks, namely LSTMs were used, since of their ability of cap- turing temporal dependencies of arbitrary lengths. However, RNNs are requiring the data to be only in the form of a sequence, thus discarding the ability to naturally represent user-user, item-item and user- item relations as a network. Such a way of representing these relations and captur- ing them as a knowledge graph, using data mining algorithms, is a key idea of the proposed research. Fig. 4. NDCG@1 per epoch Fig. 5. NDCG@10 per epoch The hybrid sequential recommender system synthesis method based on attention… Системні дослідження та інформаційні технології, 2025, № 1 17 It was also proposed by the current paper to slightly change the mechanics of an Attention mechanism, to allow capture not only the studied sequence, but examine associations, presented in a knowledge graph, for each item in the sequence. By examining different techniques to capture associative similarities be- tween sets of items, namely TF-IDF, FP-Growth trees and PMI metric, construct- ing a deep graph neural network with proposed modification of Attention mecha- nism, and leveraging Node2Vec algorithm to retrieve the embedding matrix of a knowledge graph, new method for hybrid sequential recommender system design was introduced. By benchmarking the models on an anonymous retailer dataset, the evidence of high precision of GNN models in terms of MAP@k and NDCG@k for next- best offer recommendation was obtained. REFERENCES 1. C. Aggarwal, Recommender Systems: The Textbook. Springer, 2016. 2. K. Falk, Practical Recommender Systems. Manning, 2019. 3. A. Rasool, Next Best Offer (NBO) / Next Best Action (NBA)- why it requires a fresh perspective? 2019. [Online]. Available: https://www.linkedin.com/pulse/next-best- offer-nbo-action-nba-why-requires-fresh-azaz-rasool/ 4. S. Wang et al., “Modeling User Demand Evolution for Next-Basket Prediction,” IEEE Transactions on Knowledge and Data Engineering, vol. 35(11), pp. 11585– 11598, 2023. doi: https://doi.org/10.1109/TKDE.2022.3231018 5. K.A. Eliyahu, Achieving Commercial Excellence through Next Best Offer models. [Online]. Available: https://www.linkedin.com/pulse/achieving-commercial- excellence-through-next-best-offer-kisliuk/ 6. S. Wang et al., “Sequential Recommender Systems: Challenges, Progress and Pros- pects,” in IJCAI 2019, 2019. doi: https://doi.org/10.24963/ijcai.2019/883 7. F. Garcin, C. Dimitrakakis, and B. Faltings, “Personalized News Recommendation with Context Trees,” in Proceedings of the 7th ACM conference on Recommender systems, ACM, 2013. doi: https://doi.org/10.1145/2507157.2507166 8. R. He, J. McAuley, “Fusing Similarity Models with Markov Chains for Sparse Se- quential Recommendation,” in 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, 2016, pp. 191–200. doi: 10.1109/ICDM.2016.0030 9. Q. Xia et al., “Modeling Consumer Buying Decision for Recommendation Based on Multi-Task Deep Learning,” in CIKM ’18, 2018. doi: https://doi.org/10.1145/ 3269206.3269285 10. C. Zhao, J. You, X. Wen, and X. Li, “Deep Bi-LSTM Networks for Sequential Recom- mendation,” Entropy (Basel), 22(8), 870, 2020. doi: https://doi.org/10.3390/e22080870 11. A. Vaswani et al., “Attention Is All You Need,” in Advances in Neural Information Processing Systems (NIPS 2017), pp. 5998–6008, 2017. doi: https://doi.org/10.48550/ arXiv.1706.03762 12. H. Ying et al., “Sequential Recommender System based on Hierarchical Attention Network,” in Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, Stockholm, 13–19 July 2018, pp. 3926–3932. doi: https://doi.org/10.24963/ijcai.2018/546 13. Z. Fan et al., “Sequential Recommendation via Stochastic Self-Attention,” in Proceedings of the ACM Web Conference 2022 (WWW '22), 2022. doi: https://doi.org/10.1145/3485447.3512077 14. T. Hekmatfar, S. Haratizadeh, P. Razban, and S. Goliaei, “Attention-Based Recommen- dation On Graphs,” arXiv. 2022. [Online]. Available: https://arxiv.org/abs/2201.05499 15. T.N. Kipf, M. Welling, “Semi-Supervised Classification with Graph Convolutional Networks,” arXiv, 2016. [Online]. Available: https://arxiv.org/abs/1609.02907 16. N.I. Nedashkovskaya, D.V Androsov, “Multicriteria evaluation of recommender sys- tems using fuzzy analytic hierarchy process,” PREPRINT (Version 1), available at Research Square. doi: https://doi.org/10.21203/rs.3.rs-3228086/v1 D.V. Androsov, N.I. Nedashkovskaya ISSN 1681–6048 System Research & Information Technologies, 2025, № 1 18 17. Z. Wu et al., “A Comprehensive Survey on Graph Neural Networks,” IEEE Transac- tions on Neural Networks and Learning Systems, 32(1), pp. 4–24, 2020. doi: https://doi.org/10.48550/arXiv.1901.00596 18. K.W. Church, P. Hanks, “Word association norms, mutual information, and lexicog- raphy,” Comput. Linguist, vol. 16(1), pp. 22–29, 1990. 19. J. Han, J. Pei, Y. Yiven, “Mining Frequent Patterns without Candidate Generation,” ACM SIGMOD Record, vol. 29(2), pp. 1–12, 2000. doi: 10.1145/342009.335372 20. S. Robertson, “Understanding inverse document frequency: on theoretical arguments for IDF,” Journal of Documentation, vol. 60(5), pp. 503–520, 2004. doi: 10.1108/00220410410560582 21. T. Mikolov, K. Chen, G. Corrado, and J. Dean, “Efficient Estimation of Word Representations in Vector Space,” International Conference on Learning Repre- sentations (ICLR), May 2-4, 2013, Scottsdale, Arizona, USA, 2013. doi: https://doi.org/10.48550/arXiv.1301.3781 22. J. Pennington, R. Socher, and C.D. Manning, “GloVe: Global Vectors for Word Representation,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 25-29 October 2014, Doha, Qatar, pp. 1532–1543, 2014. doi: 10.3115/v1/D14-1162 23. P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, “Enriching Word Vectors with Subword Information,” Transactions of the Association for Computational Linguis- tics, vol. 5, pp. 135–146, 2017. doi: 10.1162/tacl_a_00051 24. A. Grover, J. Lescovec, “node2vec: Scalable Feature Learning for Networks,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 13-17, 2016, San Francisco, CA, USA, pp. 855–864, 2016. doi: https://doi.org/10.1145/2939672.2939754 Received 06.06.2024 INFORMATION ON THE ARTICLE Nadezhda I. Nedashkovskaya, ORCID: 0000-0002-8277-3095, Educational and Research Institute for Applied System Analysis of the National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Ukraine, e-mail: nedashkovskaya.nadezhda@lll.kpi.ua Dmytro V. Androsov, ORCID: 0009-0001-1330-1473, Educational and Research Insti- tute for Applied System Analysis of the National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Ukraine, e-mail: androsovdmitry80@gmail.com ГIБРИДНИЙ МЕТОД ПОБУДОВИ СЕКВЕНЦIАЛЬНИХ РЕКОМЕНДАЦIЙНИХ СИСТЕМ, ЗАСНОВАНИЙ НА АВТОМАТИЧНОМУ СИНТЕЗI ГРАФIВ ЗНАНЬ ТА МЕХАНIЗМI УВАГИ / Д.В. Андросов, Н.I. Недашківська Анотація. Послiдовнi персоналiзованi рекомендацiї, такi як прогнозування на- ступної найкращої пропозицiї або моделювання еволюцiї попиту для прогно- зування наповнення кошика покупок, залишаються ключовим завданням для бiзнесу. Останнiм часом, задля вирiшення цих проблем застосовувалися лан- цюги Маркова та моделi глибокого навчання, що прогнозували послiдовностi взаємодiї користувачiв із товарами, демонстрували високу ефективнiсть. Проте ключовим недолiком таких моделей було неунiфiковане подання наборiв да- них для довгострокового та короткострокового прогнозуванння вподобань. З появою архiтектур глибокого навчання на графах та можливостi їх застосуван- ня одночасно в задачах колаборативної фiльтрацiї та прогнозування зв’язкiв мiж об’єктами, розвиток рекомендацiних систем отримав новий поштовх. Про- поновано новий метод розроблення гiбридних рекомендацiйних систем, який поєднує навчання подань графiв з глибокими нейронними мережами для мо- делювання та прогнозування послiдовностей, з метою розв’язання задачi ви- дачi послiдовних персоналiзованих рекомендацiй. Отриманi результати оцiнювання продуктивностi моделi на основi набору даних вiдвiдувань та ку- півель в iнтернет-магазинi доводять можливість ранжування та потенцiал для впровадження бiзнесами у сферi роздрiбної торгiвлi. Ключовi слова: рекомендацiйна система, графова нейронна мережа, подання графiв.
id journaliasakpiua-article-329313
institution System research and information technologies
keywords_txt_mv keywords
language English
last_indexed 2025-09-17T09:26:02Z
publishDate 2025
publisher The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
record_format ojs
resource_txt_mv journaliasakpiua/a1/70ec106d00c6937592b9802e4f73bca1.pdf
spelling journaliasakpiua-article-3293132025-05-20T17:56:07Z The hybrid sequential recommender system synthesis method based on attention mechanism with automatic knowledge graph construction Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги Androsov, Dmytro Nedashkovskaya, Nadezhda рекомендацiйна система графова нейронна мережа подання графiв recommender system graph neural network graph embeddings Sequential personalized recommendations, such as next best offer prediction or modeling demand evolution for next basket prediction, remain a key challenge for businesses. In recent years, deep learning models have been applied to solve these problems and demonstrated high feasibility. With the introduction of graph-based deep learning, it has become easier to perform collaborative filtering and link prediction tasks. The current paper proposes a new method of building a recommender system using a graph representation learning framework in combination with deep neural networks for sequence-to-sequence modeling and statistical learning for sequence-to-graph mapping. Benchmarking model performance on an online retail store visits dataset provides evidence of the method’s ranking capabilities. Послiдовнi персоналiзованi рекомендацiї, такi як прогнозування наступної найкращої пропозицiї або моделювання еволюцiї попиту для прогнозування наповнення кошика покупок, залишаються ключовим завданням для бiзнесу. Останнiм часом, задля вирiшення цих проблем застосовувалися ланцюги Маркова та моделi глибокого навчання, що прогнозували послiдовностi взаємодiї користувачiв із товарами, демонстрували високу ефективнiсть. Проте ключовим недолiком таких моделей було неунiфiковане подання наборiв даних для довгострокового та короткострокового прогнозуванння вподобань. З появою архiтектур глибокого навчання на графах та можливостi їх застосування одночасно в задачах колаборативної фiльтрацiї та прогнозування зв’язкiв мiж об’єктами, розвиток рекомендацiних систем отримав новий поштовх. Пропоновано новий метод розроблення гiбридних рекомендацiйних систем, який поєднує навчання подань графiв з глибокими нейронними мережами для моделювання та прогнозування послiдовностей, з метою розв’язання задачi видачi послiдовних персоналiзованих рекомендацiй. Отриманi результати оцiнювання продуктивностi моделi на основi набору даних вiдвiдувань та купівель в iнтернет-магазинi доводять можливість ранжування та потенцiал для впровадження бiзнесами у сферi роздрiбної торгiвлi. The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" 2025-03-28 Article Article application/pdf https://journal.iasa.kpi.ua/article/view/329313 10.20535/SRIT.2308-8893.2025.1.01 System research and information technologies; No. 1 (2025); 7-18 Системные исследования и информационные технологии; № 1 (2025); 7-18 Системні дослідження та інформаційні технології; № 1 (2025); 7-18 2308-8893 1681-6048 en https://journal.iasa.kpi.ua/article/view/329313/318892
spellingShingle рекомендацiйна система
графова нейронна мережа
подання графiв
Androsov, Dmytro
Nedashkovskaya, Nadezhda
Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
title Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
title_alt The hybrid sequential recommender system synthesis method based on attention mechanism with automatic knowledge graph construction
title_full Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
title_fullStr Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
title_full_unstemmed Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
title_short Гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
title_sort гiбридний метод побудови секвенцiальних рекомендацiйних систем, заснований на автоматичному синтезi графiв знань та механiзмi уваги
topic рекомендацiйна система
графова нейронна мережа
подання графiв
topic_facet рекомендацiйна система
графова нейронна мережа
подання графiв
recommender system
graph neural network
graph embeddings
url https://journal.iasa.kpi.ua/article/view/329313
work_keys_str_mv AT androsovdmytro thehybridsequentialrecommendersystemsynthesismethodbasedonattentionmechanismwithautomaticknowledgegraphconstruction
AT nedashkovskayanadezhda thehybridsequentialrecommendersystemsynthesismethodbasedonattentionmechanismwithautomaticknowledgegraphconstruction
AT androsovdmytro gibridnijmetodpobudovisekvencialʹnihrekomendacijnihsistemzasnovanijnaavtomatičnomusintezigrafivznanʹtamehanizmiuvagi
AT nedashkovskayanadezhda gibridnijmetodpobudovisekvencialʹnihrekomendacijnihsistemzasnovanijnaavtomatičnomusintezigrafivznanʹtamehanizmiuvagi
AT androsovdmytro hybridsequentialrecommendersystemsynthesismethodbasedonattentionmechanismwithautomaticknowledgegraphconstruction
AT nedashkovskayanadezhda hybridsequentialrecommendersystemsynthesismethodbasedonattentionmechanismwithautomaticknowledgegraphconstruction