Please consult this GitLab repo for the evaluation scripts (as of VQA 2025 guidelines) used in both tasks: - Answer Generation - Multiple Choice https://gitlab.nist.gov/gitlab/retrieval/vqa-trec-evaluation