Cite
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
MLA
Fan, Xiang, et al. Nano: Nested Human-in-the-Loop Reward Learning for Few-Shot Language Model Control. 2022. EBSCOhost, https://doi.org/10.18653/v1/2023.findings-acl.758.
APA
Fan, X., Lyu, Y., Liang, P. P., Salakhutdinov, R., & Morency, L.-P. (2022). Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control. https://doi.org/10.18653/v1/2023.findings-acl.758
Chicago
Fan, Xiang, Yiwei Lyu, Paul Pu Liang, Ruslan Salakhutdinov, and Louis-Philippe Morency. 2022. “Nano: Nested Human-in-the-Loop Reward Learning for Few-Shot Language Model Control.” https://doi.org/10.18653/v1/2023.findings-acl.758.