Cite
LIRE: listwise reward enhancement for preference alignment
MLA
Zhu, Mingye, et al. LIRE: Listwise Reward Enhancement for Preference Alignment. 2024. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2405.13516&authtype=sso&custid=ns315887.
APA
Zhu, M., Liu, Y., Zhang, L., Guo, J., & Mao, Z. (2024). LIRE: listwise reward enhancement for preference alignment.
Chicago
Zhu, Mingye, Yi Liu, Lei Zhang, Junbo Guo, and Zhendong Mao. 2024. “LIRE: Listwise Reward Enhancement for Preference Alignment.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2405.13516&authtype=sso&custid=ns315887.