-
Self-Rewarding Language Models
Paper ā¢ 2401.10020 ā¢ Published ā¢ 144 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper ā¢ 2401.08967 ā¢ Published ā¢ 28 -
Tuning Language Models by Proxy
Paper ā¢ 2401.08565 ā¢ Published ā¢ 21 -
TrustLLM: Trustworthiness in Large Language Models
Paper ā¢ 2401.05561 ā¢ Published ā¢ 65
Collections
Discover the best community collections!
Collections including paper arxiv:2311.00176
-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 44 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper ā¢ 1810.04805 ā¢ Published ā¢ 14 -
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Paper ā¢ 1907.11692 ā¢ Published ā¢ 7 -
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper ā¢ 1910.01108 ā¢ Published ā¢ 14
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 61 -
Learning To Teach Large Language Models Logical Reasoning
Paper ā¢ 2310.09158 ā¢ Published ā¢ 1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 8 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ā¢ 2308.09583 ā¢ Published ā¢ 7
-
NExT-GPT: Any-to-Any Multimodal LLM
Paper ā¢ 2309.05519 ā¢ Published ā¢ 78 -
Large Language Model for Science: A Study on P vs. NP
Paper ā¢ 2309.05689 ā¢ Published ā¢ 20 -
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Paper ā¢ 2309.06126 ā¢ Published ā¢ 16 -
Large Language Models for Compiler Optimization
Paper ā¢ 2309.07062 ā¢ Published ā¢ 23
-
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
Paper ā¢ 2310.03731 ā¢ Published ā¢ 29 -
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
Paper ā¢ 2310.13227 ā¢ Published ā¢ 12 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 8 -
Language Models can be Logical Solvers
Paper ā¢ 2311.06158 ā¢ Published ā¢ 18
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ā¢ 2402.17764 ā¢ Published ā¢ 602 -
Mixtral of Experts
Paper ā¢ 2401.04088 ā¢ Published ā¢ 159 -
Mistral 7B
Paper ā¢ 2310.06825 ā¢ Published ā¢ 47 -
Don't Make Your LLM an Evaluation Benchmark Cheater
Paper ā¢ 2311.01964 ā¢ Published ā¢ 1
-
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
Paper ā¢ 2309.04662 ā¢ Published ā¢ 22 -
Neurons in Large Language Models: Dead, N-gram, Positional
Paper ā¢ 2309.04827 ā¢ Published ā¢ 16 -
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
Paper ā¢ 2309.05516 ā¢ Published ā¢ 9 -
DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs
Paper ā¢ 2309.03907 ā¢ Published ā¢ 8