MirakramAghalarov commited on
Commit
5e1fdc4
1 Parent(s): 60b5bf6

Updated README

Browse files
Files changed (1) hide show
  1. src/display/about.py +13 -8
src/display/about.py CHANGED
@@ -50,29 +50,34 @@ If you have a fine-tuned Azerbaijani LLM, submit it for evaluation!
50
  LLM_BENCHMARKS_TEXT = f"""
51
  ## Azerbaijani Open LLM sponsored by Kapital Bank
52
 
53
- This Open leaderboard
 
 
 
 
 
54
  """
55
 
56
  LLM_DATASET_TEXT = f"""
57
- ##Banking_Call_Classification_MC
58
  This dataset consists of 192 rows and 4 columns. It is a multiple-choice dataset used to determine which of the presented categories the subject of a request sent to the bank by a client belongs to.
59
 
60
- ##Banking_Exam_MCQ
61
  A benchmark dataset of 200-300 multiple-choice questions sourced from universityexam materials across multiple departments,focused specifically on the banking sector in Azerbaijan.
62
 
63
- ##Banking_QA
64
  This dataset consists of 97 raws and is a question-answer dataset in the Azerbaijani language about banking.
65
 
66
- ##Wiki_CQA
67
  This database consists of 97 rows in Azerbaijani language. It consists of a test consisting of a context from Wikipedia, questions related to that context, and a created answer.
68
 
69
- ##GSM8K
70
  A benchmark dataset contain 44 rows,diverse grade school math word problems to measure a model’s abilitiy to solve milti-step mathematical reasoning problems.
71
 
72
- ##ARC
73
  This Benchmark dataset consists of multiple-choice science questions aimed at testing a model's ability to understand and apply elementary scientific knowledge, similar to questions that might appear in standard science exams for students. This version of the dataset is in Azerbaijani, providing an opportunity for models to engage in reasoning and inference in the Azerbaijani language. The dataset is divided into an easy set and a challenge set, with questions requiring reasoning beyond simple fact recall
74
 
75
- ##Informatics_MC, Azerbaijani_Lang_MC, History_MC, Physics_MC, Geography_MC, LLM-Literature_MC, Logic_MC, Azerbaijani_Hist_MC, Chemistry_MC, Biology_MC
76
  A comprehensive collection of educational datasets in the Azerbaijani language, covering ten distinct academic disciplines: informatics, Azerbaijani language, world history, physics, geography, literature, logic, Azerbaijani history, chemistry, and biology. Each dataset contains 100 carefully curated multiple-choice questions, designed to assess knowledge and understanding in their respective fields.
77
  """
78
 
 
50
  LLM_BENCHMARKS_TEXT = f"""
51
  ## Azerbaijani Open LLM sponsored by Kapital Bank
52
 
53
+ Azerbaijani Open LLM Leaderboard is a pioneering initiative dedicated to advancing and showcasing Azerbaijani language large language models (LLMs). Sponsored by Kapital Bank, this leaderboard provides a transparent and comprehensive ranking platform for open-source Azerbaijani LLMs, fostering innovation in natural language processing (NLP) within the Azerbaijani language. By creating a space for collaboration and healthy competition, we aim to support researchers, developers, and the broader AI community in improving the quality, accessibility, and practical applications of Azerbaijani-focused LLMs. Through this platform, we hope to bridge language gaps in AI technology and drive forward advancements in multilingual AI, all while encouraging the development of AI resources that are locally relevant and globally competitive.
54
+
55
+ ## Partners
56
+
57
+ Along with Kapital Bank, some of the companies and groups collaborated in this approach. LocalDocs, PRODATA LLC and R&D Center of Baku Higher Oil School.
58
+
59
  """
60
 
61
  LLM_DATASET_TEXT = f"""
62
+ ## Banking_Call_Classification_MC
63
  This dataset consists of 192 rows and 4 columns. It is a multiple-choice dataset used to determine which of the presented categories the subject of a request sent to the bank by a client belongs to.
64
 
65
+ ## Banking_Exam_MCQ
66
  A benchmark dataset of 200-300 multiple-choice questions sourced from universityexam materials across multiple departments,focused specifically on the banking sector in Azerbaijan.
67
 
68
+ ## Banking_QA
69
  This dataset consists of 97 raws and is a question-answer dataset in the Azerbaijani language about banking.
70
 
71
+ ## Wiki_CQA
72
  This database consists of 97 rows in Azerbaijani language. It consists of a test consisting of a context from Wikipedia, questions related to that context, and a created answer.
73
 
74
+ ## GSM8K
75
  A benchmark dataset contain 44 rows,diverse grade school math word problems to measure a model’s abilitiy to solve milti-step mathematical reasoning problems.
76
 
77
+ ## ARC
78
  This Benchmark dataset consists of multiple-choice science questions aimed at testing a model's ability to understand and apply elementary scientific knowledge, similar to questions that might appear in standard science exams for students. This version of the dataset is in Azerbaijani, providing an opportunity for models to engage in reasoning and inference in the Azerbaijani language. The dataset is divided into an easy set and a challenge set, with questions requiring reasoning beyond simple fact recall
79
 
80
+ ## Informatics_MC, Azerbaijani_Lang_MC, History_MC, Physics_MC, Geography_MC, LLM-Literature_MC, Logic_MC, Azerbaijani_Hist_MC, Chemistry_MC, Biology_MC
81
  A comprehensive collection of educational datasets in the Azerbaijani language, covering ten distinct academic disciplines: informatics, Azerbaijani language, world history, physics, geography, literature, logic, Azerbaijani history, chemistry, and biology. Each dataset contains 100 carefully curated multiple-choice questions, designed to assess knowledge and understanding in their respective fields.
82
  """
83