Spaces:

hallucinations-leaderboard
/

leaderboard

Running on CPU Upgrade

pminervini commited on Feb 7

Commit

7148b21

•

1 Parent(s): d176dfa

update

Files changed (2) hide show

app.py CHANGED Viewed

@@ -242,7 +242,7 @@ with demo:
                     leaderboard_table,
                     queue=True)
-        with gr.TabItem("📝 About", elem_id="llm-benchmark-tab-table", id=2):
             gr.Markdown(LLM_BENCHMARKS_TEXT, elem_classes="markdown-text")
             print(f'dataset df columns: {list(dataset_df.columns)}')
             dataset_table = gr.components.Dataframe(

                     leaderboard_table,
                     queue=True)
+        with gr.TabItem("About", elem_id="llm-benchmark-tab-table", id=2):
             gr.Markdown(LLM_BENCHMARKS_TEXT, elem_classes="markdown-text")
             print(f'dataset df columns: {list(dataset_df.columns)}')
             dataset_table = gr.components.Dataframe(

src/display/about.py CHANGED Viewed

@@ -6,13 +6,11 @@ INTRODUCTION_TEXT = """
 📐 The Hallucinations Leaderboard aims to track, rank and evaluate hallucinations in LLMs.
 Submit a model for automated evaluation on the [Edinburgh International Data Facility](https://www.epcc.ed.ac.uk/hpc-services/edinburgh-international-data-facility) (EIDF) GPU cluster on the "Submit" page.
 The backend of the Hallucinations leaderboard is based on the [Eleuther AI Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) --- more details in the "About" page.
 Metrics and datasets used by the Hallucinations Leaderboard were identified while writing our [awesome-hallucinations-detection](https://github.com/EdinburghNLP/awesome-hallucination-detection) page (you are encouraged to contribute to this list via pull requests).
 If you have comments or suggestions on datasets and metrics, please [reach out to us in our discussion forum](https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard/discussions).
-For more information, check the About page and our [blog post](https://huggingface.co/blog/leaderboards-on-the-hub-hallucinations).
 """
 LLM_BENCHMARKS_TEXT = f"""

 📐 The Hallucinations Leaderboard aims to track, rank and evaluate hallucinations in LLMs.
 Submit a model for automated evaluation on the [Edinburgh International Data Facility](https://www.epcc.ed.ac.uk/hpc-services/edinburgh-international-data-facility) (EIDF) GPU cluster on the "Submit" page.
 The backend of the Hallucinations leaderboard is based on the [Eleuther AI Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) --- more details in the "About" page.
 Metrics and datasets used by the Hallucinations Leaderboard were identified while writing our [awesome-hallucinations-detection](https://github.com/EdinburghNLP/awesome-hallucination-detection) page (you are encouraged to contribute to this list via pull requests).
 If you have comments or suggestions on datasets and metrics, please [reach out to us in our discussion forum](https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard/discussions).
+For more information about the leaderboard, check our [HuggingFace Blog article](https://huggingface.co/blog/leaderboards-on-the-hub-hallucinations).
 """
 LLM_BENCHMARKS_TEXT = f"""