Ministry of Digital Affairs Collaborates with Academia Sinica to Release Research Corpus, Boosting Diverse AI Applications
NQ Score
86/100
AI Summary (NQ-processed)
The Ministry of Digital Affairs (MODA) and Academia Sinica are collaborating to release research corpora to the Taiwan Sovereign AI Training Corpus, supporting diverse AI applications. Academia Sinica has uploaded over 6.2 million tokens of high-quality traditional Chinese corpora, including academic research, policy analysis, historical culture, and popular science texts. These specialized corpora enhance AI models' understanding and inference accuracy in specific fields, contributing to the development of RAG knowledge bases and professional Q&A systems. Since its launch late last year, the corpus has accumulated over 3,000 datasets and 1.2 billion tokens, with plans for continued expansion.
AI analysis data is not yet available.
Frequently Asked Questions
- Q: What kind of corpora are provided in the Taiwan Sovereign AI Training Corpus?
- A: Academia Sinica has uploaded over 6.2 million tokens of high-quality traditional Chinese corpora, including academic research, policy analysis, historical culture, and popular science texts.
- Q: What benefits do specialized knowledge corpora bring to AI models?
- A: They enhance the model's understanding and inference accuracy in specific fields, contributing to the development of RAG knowledge bases, professional Q&A systems, model fine-tuning, summarization, classification, and knowledge extraction tasks.