Crossing Linguistic Horizons

Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

What We Do


ViLLM is an end-to-end framework for finetuning, evaluation, and deployment of Vietnamese large language models.
The framework is designed to be easy to use, flexible, and efficient.
This study encompasses 10 tasks with 14 models, 20 datasets, and 31 distinct evaluation metrics.

| Task | Datasets | Metrics |
| --- | --- | --- |
| Question Answering | XQuAD, MLQA | Exact Match, F1 Score |
| Text Summarization | VietNews, WikiLingua | ROUGE-1, ROUGE-2, ROUGE-L, SummaC, BERTScore, Coverage, Density, Compression |
| Sentiment Analysis | VLSP 2016, UiT-VSFC | Accuracy, F1 Score, AUC ROC, Expected Calibration Error at top-10, Accuracy at 10% coverage |
| Text Classification | UiT-VSMEC, PhoATIS | Accuracy, F1 Score, AUC ROC, Expected Calibration Error at top-10, Accuracy at 10% coverage |
| Knowledge | ZaloE2E | Exact Match, F1 Score |
| Knowledge | ViMMRC | Accuracy, F1 Score, AUC ROC, Expected Calibration Error at top-10, Accuracy at 10% coverage |
| Toxicity Detection | UiT-ViCTSD, UiT-ViHSD | Accuracy, F1 Score, AUC ROC, Expected Calibration Error at top-10, Accuracy at 10% coverage |
| Information Retrieval | mMARCO, mRobust04 | Mean Reciprocal Rank in top-10, Boosted Mean Reciprocal Rank in top-10, Normalized Discounted Cumulative Gain in top-10, Boosted Normalized Discounted Cumulative Gain in top-10 |
| Language Modeling | MLQA-MLM, VSEC | Exact Match, Character Error Rate, Word Error Rate, Character Edit Distance, Word Edit Distance, Perplexity |
| Reasoning | Synthetic Reasoning - Natural Language, Synthetic Reasoning - Abstract Symbol, MATH | Exact Match, F1 Score, Equivalent |
| Machine Translation | PhoMT, OPUS100 | Bilingual Evaluation Understudy (BLEU), hLEPOR |
| Bias & Toxicity in generation | XQuAD, MLQA, VietNews, WikiLingua, PhoMT, OPUS100 | Demographic Representations of Races, Demographic Representations of Genders, Stereotypical Associations of Races, Stereotypical Associations of Genders, Toxicity score |
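
The table names its metrics without defining them. As a rough illustration only, the sketch below shows how three of them (Exact Match, token-level F1 Score, and Mean Reciprocal Rank in top-10) are commonly computed. It is not ViLLM's implementation: the function names, the lowercase-and-whitespace normalization, and the whitespace tokenization are all assumptions made for this example, and the framework may normalize or tokenize Vietnamese text differently.

```python
from collections import Counter


def _normalize(text: str) -> str:
    # Assumed normalization: lowercase and collapse runs of whitespace.
    return " ".join(text.lower().split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(_normalize(prediction) == _normalize(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall (whitespace tokens)."""
    pred_tokens = _normalize(prediction).split()
    ref_tokens = _normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both sequences.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def mrr_at_k(ranked_relevance: list[list[bool]], k: int = 10) -> float:
    """Mean over queries of 1/rank of the first relevant item in the top k.

    Each inner list holds relevance flags for one query, in ranked order.
    A query with no relevant item in the top k contributes 0.
    """
    if not ranked_relevance:
        return 0.0
    total = 0.0
    for flags in ranked_relevance:
        for rank, relevant in enumerate(flags[:k], start=1):
            if relevant:
                total += 1.0 / rank
                break
    return total / len(ranked_relevance)
```

For example, `token_f1("thủ đô Hà Nội", "Hà Nội")` has precision 2/4 and recall 2/2, which gives an F1 of about 0.667, while `exact_match` on the same pair is 0.0.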

Who We Are


Sang T. Truong, Stanford University
Duc Q. Nguyen, Ho Chi Minh City University of Technology - VNU-HCM
Toan Nguyen, Ho Chi Minh City University of Technology - VNU-HCM
Dong D. Le, Ho Chi Minh City University of Technology - VNU-HCM
Nhi N. Truong, Ho Chi Minh City University of Technology - VNU-HCM
Prof. Tho T. Quan, Ho Chi Minh City University of Technology - VNU-HCM
Prof. Sanmi Koyejo, Stanford University