Annotation format for Referring Expression Segmentation (RES)

#120
by hongsik91 - opened

Hi everyone,
I am currently working on the task using the COCO dataset, and I have a few questions regarding the correct input/output format.
I suspect my model is currently suffering from "grammar collapse," and I would appreciate some clarification on the data preprocessing and formatting.

RDP Algorithm: Should the Douglas–Peucker (RDP) algorithm be applied before tokenization (i.e., before quantization)? If so, what is the epsilon value typically used in your implementation?

Segmentation Ordering: Is there a specific rule for the sequence of points in the mask?
For example, should they be ordered from the top-left, proceeding clockwise?

Separator Tokens: When using tokens, is there a specific order for the segments (e.g., largest part first), or is the order arbitrary?

Poly-formatting: Is the use of tokens strictly necessary, or is it optional?

Any guidance or pointers to the relevant parts of the codebase or paper would be greatly appreciated.

Sign up or log in to comment