Analysis

This research introduces DrawingBench, a benchmark for evaluating the spatial reasoning and UI interaction abilities of large language models. Its mouse-based drawing tasks offer a challenging way to assess both capabilities.

Reference

DrawingBench evaluates spatial reasoning and UI interaction capabilities through mouse-based drawing tasks.