Question about the method implementation.

Hi, thank you for sharing your work and providing the implementation!

While reviewing the code, I noticed a few differences between the implementation and the description in the paper. In the paper, during the 2D-to-3D construction process, it mentions that a 3D model is utilized to extract features for each point. However, in the code, it seems that CLIP features are being used instead.

Additionally, the paper describes processing features for top views (as outlined in OpenMask3D), but in the code, it appears that the CLIP features are computed for the entire frame instead.

Could you clarify if I might be misunderstanding something here? Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question about the method implementation. #41

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about the method implementation. #41

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions