Update README.md
Browse files
README.md
CHANGED
@@ -41,6 +41,8 @@ See more details in the "Training Details of Vicuna Models" section in the appen
|
|
41 |
|
42 |
## Evaluation
|
43 |
|
|
|
|
|
44 |
Vicuna is evaluated with standard benchmarks, human preference, and LLM-as-a-judge. See more details in this [paper](https://arxiv.org/pdf/2306.05685.pdf) and [leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard).
|
45 |
|
46 |
## Difference between different versions of Vicuna
|
|
|
41 |
|
42 |
## Evaluation
|
43 |
|
44 |
+
![Evaluation Results](https://github.com/lm-sys/lm-sys.github.io/blob/main/public/images/webdata/vicuna_v1.5_eval.png?raw=true)
|
45 |
+
|
46 |
Vicuna is evaluated with standard benchmarks, human preference, and LLM-as-a-judge. See more details in this [paper](https://arxiv.org/pdf/2306.05685.pdf) and [leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard).
|
47 |
|
48 |
## Difference between different versions of Vicuna
|