Commit 5e1fdc4
Author: MirakramAghalarov
Parent(s): 60b5bf6

Updated README

src/display/about.py CHANGED (+13 -8)
@@ -50,29 +50,34 @@ If you have a fine-tuned Azerbaijani LLM, submit it for evaluation!
 LLM_BENCHMARKS_TEXT = f"""
 ## Azerbaijani Open LLM sponsored by Kapital Bank
 
-
+Azerbaijani Open LLM Leaderboard is a pioneering initiative dedicated to advancing and showcasing Azerbaijani language large language models (LLMs). Sponsored by Kapital Bank, this leaderboard provides a transparent and comprehensive ranking platform for open-source Azerbaijani LLMs, fostering innovation in natural language processing (NLP) within the Azerbaijani language. By creating a space for collaboration and healthy competition, we aim to support researchers, developers, and the broader AI community in improving the quality, accessibility, and practical applications of Azerbaijani-focused LLMs. Through this platform, we hope to bridge language gaps in AI technology and drive forward advancements in multilingual AI, all while encouraging the development of AI resources that are locally relevant and globally competitive.
+
+## Partners
+
+Along with Kapital Bank, some of the companies and groups collaborated in this approach. LocalDocs, PRODATA LLC and R&D Center of Baku Higher Oil School.
+
 """
 
 LLM_DATASET_TEXT = f"""
-##Banking_Call_Classification_MC
+## Banking_Call_Classification_MC
 This dataset consists of 192 rows and 4 columns. It is a multiple-choice dataset used to determine which of the presented categories the subject of a request sent to the bank by a client belongs to.
 
-##Banking_Exam_MCQ
+## Banking_Exam_MCQ
 A benchmark dataset of 200-300 multiple-choice questions sourced from universityexam materials across multiple departments,focused specifically on the banking sector in Azerbaijan.
 
-##Banking_QA
+## Banking_QA
 This dataset consists of 97 raws and is a question-answer dataset in the Azerbaijani language about banking.
 
-##Wiki_CQA
+## Wiki_CQA
 This database consists of 97 rows in Azerbaijani language. It consists of a test consisting of a context from Wikipedia, questions related to that context, and a created answer.
 
-##GSM8K
+## GSM8K
 A benchmark dataset contain 44 rows,diverse grade school math word problems to measure a model’s abilitiy to solve milti-step mathematical reasoning problems.
 
-##ARC
+## ARC
 This Benchmark dataset consists of multiple-choice science questions aimed at testing a model's ability to understand and apply elementary scientific knowledge, similar to questions that might appear in standard science exams for students. This version of the dataset is in Azerbaijani, providing an opportunity for models to engage in reasoning and inference in the Azerbaijani language. The dataset is divided into an easy set and a challenge set, with questions requiring reasoning beyond simple fact recall
 
-##Informatics_MC, Azerbaijani_Lang_MC, History_MC, Physics_MC, Geography_MC, LLM-Literature_MC, Logic_MC, Azerbaijani_Hist_MC, Chemistry_MC, Biology_MC
+## Informatics_MC, Azerbaijani_Lang_MC, History_MC, Physics_MC, Geography_MC, LLM-Literature_MC, Logic_MC, Azerbaijani_Hist_MC, Chemistry_MC, Biology_MC
 A comprehensive collection of educational datasets in the Azerbaijani language, covering ten distinct academic disciplines: informatics, Azerbaijani language, world history, physics, geography, literature, logic, Azerbaijani history, chemistry, and biology. Each dataset contains 100 carefully curated multiple-choice questions, designed to assess knowledge and understanding in their respective fields.
 """
 
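
Most of the dataset-heading changes in this commit turn `##Name` into `## Name`. CommonMark only recognizes an ATX heading when the `#` run is followed by a space (or the end of the line), so the old headings would likely have rendered as plain text. The following is a minimal sketch of how such headings could be normalized automatically. The helper name `normalize_atx_headings` and the sample string are hypothetical and not part of the repository.

```python
import re

# Matches a run of 1-6 '#' at the start of a line that is immediately
# followed by a character that is neither whitespace nor another '#',
# i.e. a heading marker that is missing its required space.
_MISSING_SPACE = re.compile(r"^(#{1,6})(?=[^\s#])", flags=re.MULTILINE)


def normalize_atx_headings(markdown_text: str) -> str:
    """Insert the space CommonMark requires after an ATX heading marker.

    Hypothetical helper for illustration only; headings that already
    have the space are left unchanged.
    """
    return _MISSING_SPACE.sub(r"\1 ", markdown_text)


# Illustrative input mixing a broken heading and an already-valid one.
sample = "##Banking_QA\nSome description.\n\n## Wiki_CQA\n"
fixed = normalize_atx_headings(sample)
print(fixed)
```

Note that this simple pattern would also rewrite any line that merely starts with `#` followed by text (for example a hashtag), so it is only suitable for strings known to contain Markdown headings.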