Digital Library[ Search Result ]
Search : "[ keyword: Model Parallelism ]" (3)
An Analysis on Inference Time, Accuracy, Communication, and GPU Memory Usage for Inference Batch of Large Language Models
Changyong Shin Younghun Go Yeonho Yoo Gyeongsik Yang Chuck Yoo
Vol. 49, No. 10, pp. 1377-1385, Oct. 2024
10.7840/kics.2024.49.10.1377
Vol. 49, No. 10, pp. 1377-1385, Oct. 2024
10.7840/kics.2024.49.10.1377
Comparison and Analysis for the Performance of Deep Learning-Based Time Series Prediction Algorithms According to Increasing Model Size
A Survey on Parallel Deep Learning
JinYi Yoon JiHo Lee Nayoung Han HyungJune Lee
Vol. 46, No. 10, pp. 1604-1617, Oct. 2021
10.7840/kics.2021.46.10.1604
Vol. 46, No. 10, pp. 1604-1617, Oct. 2021
10.7840/kics.2021.46.10.1604