1 documents found
Information × Registration Number 0223U005290, 0121U100534 , R & D reports Title Methods and tools of reinforcement learning for the tasks of analysis and forecasting for complex systems popup.stage_title Head Kasianov Pavlo O., д.ф.-м.н. Registration Date 19-12-2023 Organization Educational and Scientific Complex "Institute for Applied System Analysis" of National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" popup.description2  The investigation is devoted to the study of problems of controlled partially observable stochastic systems with discrete time, optimization of expected total discounted costs. New sufficient conditions regarding transition probability and observation for weak continuity of transitions for MDP with trust states are obtained, several well-known such conditions are generalized, and existence of an optimal strategy, validity of optimality equations determining optimal strategies, convergence of iterations to optimal values ​​are established for POMDP. In addition, a software product was created in which, using deep reinforcement learning and long-term planning, a neural network was obtained. This network approximates a strategy that achieves better results and metrics on a selected data set. Product Description popup.authors Babych Halyna Kondratova Liudmyla Kupenko Olha P. Levenchuk Liudmyla Marchenko Halyna Paliichuk Liliia Shubenkova Iryna A. popup.nrat_date 2023-12-19 Close
R & D report
1
Head: Kasianov Pavlo O.. Methods and tools of reinforcement learning for the tasks of analysis and forecasting for complex systems. (popup.stage: ). Educational and Scientific Complex "Institute for Applied System Analysis" of National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute". № 0223U005290
1 documents found

Updated: 2026-03-21