MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities...

Large language models such as ChatGPT and GPT-4 have recently achieved strong performance on a variety of natural language processing tasks. In this paper, we propose MANGO, a benchmark to…