Zorik's picture

1 10 5

Zorik

zorik

·

AI & ML interests

NLP

Organizations

zorik's activity

upvoted a paper 9 days ago

Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance

Paper • 2410.18889 • Published 13 days ago • 15

upvoted a paper 27 days ago

GLEE: A Unified Framework and Benchmark for Language-based Economic Environments

Paper • 2410.05254 • Published about 1 month ago • 80

upvoted a paper 30 days ago

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Paper • 2410.02707 • Published Oct 3 • 47

upvoted a paper about 1 month ago

NL-Eye: Abductive NLI for Images

Paper • 2410.02613 • Published Oct 3 • 22

upvoted a paper 5 months ago

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

Paper • 2405.05904 • Published May 9 • 6

upvoted a collection 12 months ago

SEAHORSE release

The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194). • 12 items • Updated Jul 31 • 18

upvoted 4 papers about 1 year ago

On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method

Paper • 2206.14796 • Published Jun 29, 2022 • 1

KoBE: Knowledge-Based Machine Translation Evaluation

Paper • 2009.11027 • Published Sep 23, 2020 • 1

RED-ACE: Robust Error Detection for ASR using Confidence Embeddings

Paper • 2203.07172 • Published Mar 14, 2022 • 1

TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models

Paper • 2305.11171 • Published May 18, 2023 • 2