Information × Registration Number 2120U007885, Article popup.category Препринт Title Image Recommendation for Wikipedia Articles (AI translated) popup.author Onyshchak OlehOnyshchak Oleh popup.publication 01-01-2020 popup.source_user Український католицький університет popup.source https://hdl.handle.net/20.500.14570/1920 popup.publisher Description Multimodal learning, which is simultaneous learning from different data sources such as audio, text, images; is a rapidly emerging field of Machine Learning. It is also considered to be learning on the next level of abstraction, which will allow us to tackle more complicated problems such as creating cartoons from a plot or speech recognition based on lips movement. In this paper, we will introduce a basic model to recommend the most relevant images for a Wikipedia article based on state-of-the-art multimodal techniques. We will also introduce the Wikipedia multimodal dataset, containing more than 36,000 high-quality articles. popup.nrat_date 2025-11-05 Close