Training and Serving System of Foundation Models: A Comprehensive Survey

Article Properties

DOI: 10.1109/ojcs.2024.3380828
Publication Date: 2024/01/01
Journal: IEEE Open Journal of the Computer Society
Indian UGC (journal)
References: 97
Jiahang Zhou, School of Systems Science and Engineering, Sun Yat-sen University, Guangzhou, China
Yanyu Chen, School of Informatics, Xiamen University, Xiamen, China
Zicong Hong, Department of Computing, The Hong Kong Polytechnic University, Hong Kong SAR, China
Wuhui Chen, School of Software Engineering, Sun Yat-sen University, Guangzhou, China
Yue Yu, Peng Cheng Laboratory, Shenzhen, China
Tao Zhang, School of Systems Science and Engineering, Sun Yat-sen University, Guangzhou, China
Hui Wang, Peng Cheng Laboratory, Shenzhen, China
Chuanfu Zhang, School of Systems Science and Engineering, Sun Yat-sen University, Guangzhou, China
Zibin Zheng, School of Software Engineering, Sun Yat-sen University, Guangzhou, China

Cite

Zhou, Jiahang, et al. “Training and Serving System of Foundation Models: A Comprehensive Survey”. IEEE Open Journal of the Computer Society, vol. 5, 2024, pp. 107-19, https://doi.org/10.1109/ojcs.2024.3380828.

Zhou, J., Chen, Y., Hong, Z., Chen, W., Yu, Y., Zhang, T., Wang, H., Zhang, C., & Zheng, Z. (2024). Training and Serving System of Foundation Models: A Comprehensive Survey. IEEE Open Journal of the Computer Society, 5, 107-119. https://doi.org/10.1109/ojcs.2024.3380828

Zhou J, Chen Y, Hong Z, Chen W, Yu Y, Zhang T, et al. Training and Serving System of Foundation Models: A Comprehensive Survey. IEEE Open Journal of the Computer Society. 2024;5:107-19.

Journal Categories

Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Science: Science (General): Cybernetics: Information theory
Technology: Electrical engineering. Electronics. Nuclear engineering: Electric apparatus and materials. Electric circuits. Electric networks
Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware
Technology: Technology (General): Industrial engineering. Management engineering: Information technology

References

Title	Publication Date
G10: Enabling an efficient unified GPU memory and storage architecture with smart tensor migrations	2023
Checkmate: Breaking the memory wall with optimal tensor rematerialization	2022
Accelerating distributed MoE training and inference with Lina	2023
SmartMoE: Efficiently training sparsely-activated models through combining offline and online parallelization	2023
Fast inference from transformers via speculative decoding	2023

Database	Last update
UGC	December 2024
DOAJ	December 2024
Crossref	May 2024