Visual Question Answering for Peruvian Cuisine in Regional Spanish
答疑
地理
历史
情报检索
计算机科学
作者
Mariana Risco Cosavalente
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence [Association for the Advancement of Artificial Intelligence (AAAI)] 日期:2025-04-11卷期号:39 (28): 29602-29604
标识
DOI:10.1609/aaai.v39i28.35339
摘要
This project leverages Visual Question Answering (VQA) to promote Peruvian gastronomy by utilizing a culturally rich dataset and advanced models such as LLaVA-1.5 and GPT-2 Large. The evaluation will comprise both automated metrics and culinary expert assessments. This system addresses regional variations in dish names, promotes inclusivity by involving Peruvians from diverse regions in dataset construction, and enhances cultural representation.