ГОСТ:
Kwon W. et al. Efficient Memory Management for Large Language Model Serving with PagedAttention //No information found about journal. — 2023. —C. 611-626
MLA:
Kwon, Woosuk, et al. "Efficient Memory Management for Large Language Model Serving with PagedAttention" No information found about journal (2023): 611-626