Assessing the probability of fraud in the process of lending to bank customers


Registration number: 2121U002460
Category: Article
Publication date: 2021-01-01
Source: https://essuir.sumdu.edu.ua/handle/123456789/86244
Source institution: Sumy State University
Publisher: Sumy State University
NRAT date: 2025-03-24

Description

The article assesses the probability of credit fraud in banks. The problem is tied to the growing digitalization of economic processes and the shift of payment transactions into the digital space. Research on it proceeds in eight scientific directions, as confirmed by building and analyzing a scientometric bibliography map of studies on fraud in lending to bank customers. The article identifies clusters of papers on: protection of online transactions; machine, ensemble, and incremental learning for credit fraud problems; probabilistic approaches; detection of anomalies in operations related to money laundering in banks; detection of fraud in the financial sector; risk assessment; and Data Mining.

The study uses a data set of 122 variables and 307,511 records on bank customers. A conceptual model outlined the stages of the modelling, which was carried out in Python. The data were cleaned of missing values and checked for conformity with the normal distribution. Three models were built on the resulting data set: logistic regression, a decision tree, and a neural network.

The share of correct predictions on the training sample was 93.09% for logistic regression and 100.00% for both the decision tree and the neural network; on the test sample it was 93.60%, 99.15%, and 86.67%, respectively. The article takes this as evidence of the adequacy of the data in both samples and of high prediction accuracy. The models were also checked for accuracy and quality: all proved fairly accurate and of good quality, with the decision tree the most accurate, highest-quality, and most adequate model. The constructed models are universal tools for detecting fraudulent transactions, but they require constant monitoring and updating as new signs of criminal activity emerge in the process of lending to customers.
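The "share of correct predictions" quoted in the description is the accuracy metric, and comparing it between the training and test samples is a common first check on how well a model generalizes. The sketch below is illustrative only and is not the authors' code: the function names `accuracy` and `train_test_gap` and the toy labels are assumptions, while the percentages in `REPORTED` are the figures quoted in the description.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Share of correct predictions, as a fraction between 0 and 1."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Accuracies (%) quoted in the description: (training sample, test sample).
REPORTED = {
    "logistic regression": (93.09, 93.60),
    "decision tree": (100.00, 99.15),
    "neural network": (100.00, 86.67),
}


def train_test_gap(train_pct: float, test_pct: float) -> float:
    """Training accuracy minus test accuracy, in percentage points."""
    return round(train_pct - test_pct, 2)


# Gap per model, computed only from the reported figures.
GAPS = {name: train_test_gap(*pair) for name, pair in REPORTED.items()}

if __name__ == "__main__":
    # Hypothetical toy labels (1 = fraud, 0 = legitimate), for illustration only.
    print(accuracy([0, 0, 1, 1, 0], [0, 0, 1, 0, 0]))
    for name, gap in GAPS.items():
        print(f"{name}: gap = {gap} percentage points")
```

On the reported figures, the gap is largest for the neural network (13.33 percentage points) and smallest in absolute terms for logistic regression, whose test accuracy is slightly above its training accuracy.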


Published: 2021-01-01; Sumy State University, 2121U002460


Updated: 2026-03-25
