Cite
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
MLA
Frantar, Elias, et al. MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models. 2024. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2408.11743&authtype=sso&custid=ns315887.
APA
Frantar, E., Castro, R. L., Chen, J., Hoefler, T., & Alistarh, D. (2024). MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.
Chicago
Frantar, Elias, Roberto L. Castro, Jiale Chen, Torsten Hoefler, and Dan Alistarh. 2024. “MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2408.11743&authtype=sso&custid=ns315887.