OpenCompass
community
AI & ML interests
None defined yet.
Organization Card
👋 join us on Discord and WeChat
follow us on Github
OpenCompass is a platform focused on evaluation of AGI, include Large Language Model and Multi-modality Model. We aim to:
- develop high-quality libraries to reduce the difficulties in evaluation
- provide convincing leaderboards for improving the understanding of the large models
- create powerful toolchains targeting a variety of abilities and tasks
- build solid benchmarks to support the large model research
Collections
1
spaces
8
pinned
Running
on
CPU Upgrade
481
🌎
Open VLM Leaderboard
VLMEvalKit Evaluation Results Collection
pinned
Runtime error
4
🌎
CompassJudger Subjective Evaluation Learderboard
CompassJudger Subjective Evaluation Learderboard
pinned
Running
4
🌎
JudgerBench Leaderboard
JudgerBench Leaderboard
pinned
Running
83
🌎
Open VLM Video Leaderboard
VLMEvalKit Eval Results in video understanding benchmark
pinned
Runtime error
18
🚀
MMBench Leaderboard
pinned
Running
84
🚀
OpenCompass LLM Leaderboard
models
8
opencompass/CompassJudger-1-14B-Instruct
Text Generation
•
Updated
•
88
•
1
opencompass/CompassJudger-1-32B-Instruct
Text Generation
•
Updated
•
394
•
8
opencompass/anah-v2
Text Generation
•
Updated
•
21
•
2
opencompass/CompassJudger-1-1.5B-Instruct
Updated
•
93
•
1
opencompass/CompassJudger-1-7B-Instruct
Updated
•
186
•
2
opencompass/anah-7b
Text Generation
•
Updated
•
26
opencompass/anah-20b
Text Generation
•
Updated
•
13
opencompass/mixtral-8x7b-32k
Updated
•
1
datasets
7
opencompass/mmmlu_lite
Viewer
•
Updated
•
20k
•
35
•
2
opencompass/MMBench-Video
Preview
•
Updated
•
289
•
6
opencompass/NeedleBench
Viewer
•
Updated
•
524
•
4.07k
•
2
opencompass/anah
Viewer
•
Updated
•
783
•
95
•
2
opencompass/flames
Viewer
•
Updated
•
537
•
54
opencompass/CriticBench
Updated
•
196
•
4
opencompass/MMBench
Updated
•
38
•
1