TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval
MLA
Zou, Xiaohan, et al. TokenFlow: Rethinking Fine-Grained Cross-Modal Alignment in Vision-Language Retrieval. 2022. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2209.13822&authtype=sso&custid=ns315887.
APA
Zou, X., Wu, C., Cheng, L., & Wang, Z. (2022). TokenFlow: Rethinking fine-grained cross-modal alignment in vision-language retrieval.
Chicago
Zou, Xiaohan, Changqiao Wu, Lele Cheng, and Zhongyuan Wang. 2022. “TokenFlow: Rethinking Fine-Grained Cross-Modal Alignment in Vision-Language Retrieval.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2209.13822&authtype=sso&custid=ns315887.