RichTextBox Multi-Line VB.NET

M2HF: Multi-Branch Multi-Modal Hybrid Fusion for Text-Video Retrieval

Abstract: Videos contain multimodal content, and exploring multi-branch cross-modal interactions with natural language queries can be of benefit to the text-video retrieval task (TVR). However, recent ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

M2HF: Multi-Branch Multi-Modal Hybrid Fusion for Text-Video Retrieval

Trending now