Is there somewhere I can find detailed documentation on NaViT? #305
Unanswered
dempsey-ryan asked this question in Q&A
Replies: 0 comments
I'm having trouble understanding what the various parameters do, even after reading the source code.
Specifically, what does group_max_seq_len do, and why does it give non-deterministic results? For example:
For larger maximum sequence lengths, all the images have identical outputs. My problem is that I want deterministic results, so I want a constant max sequence length regardless of how the images are batched (that is more or less the whole reason I want to use NaViT). However, if I pick the maximum over the whole dataset, I get the above (1315) result, where every single image has identical logits.
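To make the question concrete, here is a minimal sketch of what I assume "grouping by a maximum sequence length" means: greedily packing images, by patch-token count, into groups whose total stays under the limit. This is my own illustration in plain Python, not the library's code; `num_patches`, `pack_by_max_seq_len`, the patch size, and the image sizes are all hypothetical.

```python
from typing import List, Tuple


def num_patches(height: int, width: int, patch_size: int) -> int:
    """Token count for an image whose sides are multiples of patch_size."""
    if height % patch_size or width % patch_size:
        raise ValueError("image sides must be divisible by patch_size")
    return (height // patch_size) * (width // patch_size)


def pack_by_max_seq_len(
    sizes: List[Tuple[int, int]], patch_size: int, max_seq_len: int
) -> List[List[int]]:
    """Greedily pack image indices into groups with total tokens <= max_seq_len.

    Hypothetical helper: an assumption about what the grouping step does,
    not the actual NaViT implementation.
    """
    groups: List[List[int]] = []
    current: List[int] = []
    current_len = 0
    for idx, (h, w) in enumerate(sizes):
        n = num_patches(h, w, patch_size)
        if n > max_seq_len:
            raise ValueError(f"image {idx} needs {n} tokens, more than max_seq_len")
        # Start a new group when adding this image would exceed the limit.
        if current and current_len + n > max_seq_len:
            groups.append(current)
            current, current_len = [], 0
        current.append(idx)
        current_len += n
    if current:
        groups.append(current)
    return groups


# Made-up (height, width) pairs: 16, 12, 36 and 4 tokens at patch_size=16.
sizes = [(64, 64), (32, 96), (96, 96), (32, 32)]
small = pack_by_max_seq_len(sizes, patch_size=16, max_seq_len=40)
large = pack_by_max_seq_len(sizes, patch_size=16, max_seq_len=1315)
print(small)  # [[0, 1], [2, 3]]
print(large)  # [[0, 1, 2, 3]]
```

Under this assumption, the limit only changes which images share a packed sequence, so I would expect per-image outputs to stay the same as long as attention is masked per image.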
If you can clarify how I should choose this parameter, I would really appreciate it.