Crossing Linguistic Horizons

Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

Zero-Shot Summarization Leaderboard

Models VietNews WikiLingua
R1 R2 RL SC BS Cv De Cp R1 R2 RL SC BS Cv De Cp
URA-LLaMa 70B 0.42 ± 0.17 0.21 ± 0.12 0.28 ± 0.00 -0.11 ± 0.00 0.03 ± 0.19 0.85 ± 0.00 14.59 ± 0.05 17.21 ± 0.33 0.37 ± 0.00 0.16 ± 0.00 0.24 ± 0.00 -0.22 ± 0.00 0.26 ± 0.16 0.17 ± 0.00 0.22 ± 0.00 22.24 ± 0.97
URA-LLaMa 13B 0.38 ± 0.00 0.18 ± 0.00 0.25 ± 0.00 -0.09 ± 0.00 0.01 ± 0.18 0.71 ± 0.00 6.01 ± 0.07 24.27 ± 0.61 0.22 ± 0.00 0.08 ± 0.00 0.14 ± 0.00 -0.16 ± 0.00 -0.13 ± 0.12 0.42 ± 0.01 3.06 ± 0.10 49.58 ± 1.16
URA-LLaMa 7B 0.38 ± 0.00 0.14 ± 0.00 0.25 ± 0.00 -0.09 ± 0.00 0.04 ± 0.12 0.65 ± 0.00 4.88 ± 0.03 7.77 ± 0.05 0.40 ± 0.00 0.15 ± 0.00 0.26 ± 0.00 -0.16 ± 0.00 0.19 ± 0.07 0.73 ± 0.00 4.79 ± 0.07 6.22 ± 0.07
LLaMa-2 13B 0.06 ± 0.00 0.02 ± 0.00 0.04 ± 0.00 -0.09 ± 0.00 -0.18 ± 0.04 0.07 ± 0.00 0.43 ± 0.01 28.25 ± 0.24 0.04 ± 0.00 0.00 ± 0.00 0.03 ± 0.00 -0.16 ± 0.00 -0.11 ± 0.08 0.03 ± 0.00 0.07 ± 0.01 19.55 ± 0.51
LLaMa-2 7B 0.06 ± 0.00 0.01 ± 0.00 0.05 ± 0.00 -0.09 ± 0.00 -0.23 ± 0.04 0.06 ± 0.00 0.21 ± 0.00 15.75 ± 0.20 0.04 ± 0.00 0.00 ± 0.00 0.03 ± 0.00 -0.16 ± 0.00 -0.14 ± 0.07 0.03 ± 0.00 0.06 ± 0.00 17.84 ± 0.50
Vietcuna 7B 0.28 ± 0.00 0.06 ± 0.00 0.18 ± 0.00 -0.09 ± 0.00 -0.09 ± 0.09 0.31 ± 0.00 0.80 ± 0.01 171.63 ± 1.71 0.24 ± 0.00 0.06 ± 0.00 0.15 ± 0.00 -0.16 ± 0.00 -0.18 ± 0.07 0.51 ± 0.01 1.16 ± 0.01 238.67 ± 3.37
GPT-3.5 0.36 ± 0.00 0.20 ± 0.00 0.24 ± 0.00 -0.09 ± 0.00 0.04 ± 0.13 0.86 ± 0.00 3.97 ± 0.02 13.32 ± 0.65 0.43 ± 0.00 0.21 ± 0.00 0.27 ± 0.00 -0.16 ± 0.00 0.22 ± 0.03 0.87 ± 0.00 3.29 ± 0.03 35.50 ± 0.82
GPT-4 0.41 ± 0.00 0.21 ± 0.00 0.26 ± 0.00 -0.08 ± 0.00 -0.04 ± 0.11 0.84 ± 0.00 3.45 ± 0.00 15.43 ± 0.49 0.44 ± 0.00 0.21 ± 0.00 0.27 ± 0.00 -0.16 ± 0.00 0.24 ± 0.04 0.82 ± 0.00 2.37 ± 0.01 6.61 ± 0.16