MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities…

by jsendak | Mar 29, 2024 | Cosmology & Computing | 0 comments

Large language models such as ChatGPT and GPT-4 have recently achieved astonishing performance on a variety of natural language processing tasks. In this paper, we propose MANGO, a benchmark to…