Digital Library[ Search Result ]
Search : "[ keyword: Batch size ]" (1)
An Analysis on Inference Time, Accuracy, Communication, and GPU Memory Usage for Inference Batch of Large Language Models
Changyong Shin, Younghun Go, Yeonho Yoo, Gyeongsik Yang, Chuck Yoo
Vol. 49, No. 10, pp. 1377-1385, Oct. 2024
10.7840/kics.2024.49.10.1377