Abstract:
[Objective/Significance] With the rapid development of generative artificial intelligence (AIGC), various large models have gradually evolved from initially being capable of processing only single text modality in large language models to being able to handle multi-modal data such as text, images, voice, and video. However, domestic large language models targeting the ancient Chinese language field still mainly focus on improving the performance of ancient Chinese language processing tasks, and are mainly centered on single text modality information processing. There is still considerable room for development in terms of the knowledge understanding and question-answering interaction capabilities of large language models, as well as in multi-modal information processing. Based on this, Huazhong University of Science and Technology has newly launched the ancient Chinese multi-modal large language model “AI Jiusi 2.0”, which not only masters ancient Chinese professional knowledge but also possesses ancient Chinese application capabilities and supports multi-modal data processing, aiming to set an example for the development of multi-modal ancient Chinese large language models. [Method/Process] This paper details the dataset construction, computing power upgrade, model training, and interface optimization of “AI Jiusi 2.0”, and showcases the performance of the new version “AI Jiusi” in ancient Chinese language knowledge and language ability. [Result/ Conclusion] The newly upgraded “AI Jiusi 2.0” demonstrates significant advantages in the understanding of ancient Chinese texts and ancient Chinese knowledge question-answering, and has already acquired a certain ability to understand ancient Chinese characters (oracle bone
-
From:
刘根辉
-
Subject:
Library Science,Information Science
>>
Information Science
Linguistics and Applied Linguistics
>>
Linguistics and Applied Linguistics
-
Contribution:
No Submitted
-
Cite as:
ChinaXiv:202501.00233
(or this version
ChinaXiv:202501.00233V1)
DOI:10.12074/202501.00233
CSTR:32003.36.ChinaXiv.202501.00233
-
TXID:
509719c0-a1bb-4e80-b839-6ac0f7f097b3
- Recommended references:
刘根辉,刘金柱,王锦绣,罗捷春,李志芳,袁方,余静静,龚丹,谢雨霏,罗婉滢,郑苏楠,陈旷心,贺心雨,张润哲,夏婉婷,谢佳延,吕佳源,吕萍,余乐妍,郑诗铭,王金柳,刘艺溶,杨纯,张曼丽,吴翊嘉,余锁湘,汪靓.多模态古代汉语大语言模型AI九思2.0的设计与开发.语音乐律预印本平台.[DOI:10.12074/202501.00233]
(Click&Copy)