Upload an equirectangular (360°) panoramic image. The model segments it into 13 indoor semantic classes using SAM features and instance-guided refinement.
Dataset: Stanford2D3DS · Backbone: SAM ViT-H · Input: RGB · [Paper] [Code] [Model]