Documents found: 1
Information
Registration number: 2120U007890, Materials of publications and local repositories
Category: Preprint
Title: Context-Based Question-Answering System for the Ukrainian Language
Author: Tiutiunnyk Serhii
Publication date: 01-01-2020
Information provider: Ukrainian Catholic University
Primary source: https://hdl.handle.net/20.500.14570/1898
Publication:
Description: This work presents a context-based question answering model for the Ukrainian language, built on Wikipedia articles and the Bidirectional Encoder Representations from Transformers (BERT) model (Devlin et al., 2018). The model takes a context (a Wikipedia article) and a question about that context, and outputs an answer to the question. The model consists of two parts. The first is a pre-trained multilingual BERT model, trained on Wikipedia articles in the 100 most popular languages on Wikipedia. The second is the fine-tuned model, trained on a dataset of questions and answers about Wikipedia articles. The training and validation data come from the Stanford Question Answering Dataset (SQuAD) (Rajpurkar et al., 2016). There are no question answering datasets for the Ukrainian language. The plan is to build an appropriate dataset with machine translation, use it for the fine-tuning stage, and compare the results with models that were fine-tuned on other languages. The next experiment is to train a model on Slavic language datasets before fine-tuning on the Ukrainian language and compare the results.
Added to NRAT: 2025-11-05
Materials
Preprint
Tiutiunnyk Serhii. Context-Based Question-Answering System for the Ukrainian Language: publication 2020-01-01; Ukrainian Catholic University, 2120U007890

Updated: 2026-03-27