SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors

Cheng T-Y, Lu K, Ma C, Markham A, Trigoni N
No abstract available
Keywords:

46 Information and Computing Sciences

,

4602 Artificial Intelligence

,

1.1 Normal biological development and functioning