I noticed that in dataset/dataset.py, the validation process uses a hard-coded index idx=0, so only the first sample from each scene's validation data is selected. Could you clarify the rationale for this design choice, as opposed to using the full validation set? And does this partial validation affect the reliability of the reported metrics or the fairness of comparisons between models?
This implementation may limit the completeness of the evaluation, since each model is validated on only a single sample per scene. I'd appreciate any insight into this decision, as well as suggestions for modifying the code to use the full validation set if needed.
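To make the question concrete, here is a minimal sketch of the kind of change I have in mind. It does not reflect the actual structure of dataset/dataset.py: the class name FullValidationIndex, the samples_per_scene argument, and the locate method are all hypothetical names I made up for illustration. The idea is to map a flat validation index onto a (scene, sample) pair so that every sample in every scene is visited, rather than only sample 0.

```python
from bisect import bisect_right
from itertools import accumulate


class FullValidationIndex:
    """Hypothetical helper: maps a flat index to a (scene, sample) pair."""

    def __init__(self, samples_per_scene):
        # samples_per_scene is an assumed input: the number of validation
        # samples available in each scene, in scene order.
        self.samples_per_scene = list(samples_per_scene)
        # Cumulative counts, e.g. [3, 1, 2] becomes [3, 4, 6].
        self.offsets = list(accumulate(self.samples_per_scene))

    def __len__(self):
        # Total number of validation samples across all scenes.
        return self.offsets[-1] if self.offsets else 0

    def locate(self, flat_idx):
        if not 0 <= flat_idx < len(self):
            raise IndexError(flat_idx)
        # The first scene whose cumulative count exceeds flat_idx owns it.
        scene = bisect_right(self.offsets, flat_idx)
        start = self.offsets[scene - 1] if scene > 0 else 0
        return scene, flat_idx - start


if __name__ == "__main__":
    index = FullValidationIndex([3, 1, 2])
    for i in range(len(index)):
        print(i, index.locate(i))
```

If something like this fits the codebase, I imagine the validation branch could report the total sample count as the dataset length and call locate(idx) in place of the fixed idx=0, but I may be missing a reason why that would not work here.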
Thank you for your work and clarification!