VisionLanguageFusionModule #52

lzl2040 · 2024-03-06T07:19:42Z

I find in this module you use multiplication instead of addition or concatenation.

tgt2 = self.multihead_attn(query=self.with_pos_embed(tgt, query_pos),
                                   key=self.with_pos_embed(memory, pos),
                                   value=memory, attn_mask=None,
                                   key_padding_mask=memory_key_padding_mask)[0]
        tgt = tgt * tgt2
        return tgt

why not use addition or concatenation? What are the benefits of using multiplication？

The text was updated successfully, but these errors were encountered:

wjn922 · 2024-03-12T16:18:05Z

In our early experiments, we find that multiplication would have higher performance than addition for RVOS. So we consistently use multiplication all along the work.

But actually, the performance advantage is less than 1 point. So using multiplication, addition or concatenation may not significantly impact the results.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VisionLanguageFusionModule #52

VisionLanguageFusionModule #52

lzl2040 commented Mar 6, 2024

wjn922 commented Mar 12, 2024

VisionLanguageFusionModule #52

VisionLanguageFusionModule #52

Comments

lzl2040 commented Mar 6, 2024

wjn922 commented Mar 12, 2024