TroL: Traversal of Layers for Large Language and Vision Models Paper β’ 2406.12246 β’ Published Jun 18 β’ 34
Biomedical Vision-Language Models (VLMs) Collection Some of my favorite biomedical vision-language models β’ 15 items β’ Updated May 7 β’ 7
OpenAI Vision API Collection Demos of projects using the OpenAI Vision API. β’ 3 items β’ Updated Nov 22, 2023 β’ 3
The Perception Collection Collection Dataset and Model for "Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation" β’ 5 items β’ Updated Jan 15 β’ 4
Computer Vision Backbones 𧩠Collection Collection of useful computer vision backbones to fine-tune. It also includes large image classification models, that can be used as backbone. ⒠22 items ⒠Updated Sep 19, 2023 ⒠17
Vision Models (GGUF) Collection How to use: Download a "mmproj" model file + one or more of the primary model files. β’ 5 items β’ Updated Dec 22, 2023 β’ 37
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" β’ 19 items β’ Updated 22 days ago β’ 43
DRAGON Models Collection Production-grade RAG-optimized 6-7B parameter models - "Delivering RAG on ..." the leading foundation base models β’ 20 items β’ Updated 25 days ago β’ 44
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. β’ 19 items β’ Updated Apr 12 β’ 64
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. β’ 11 items β’ Updated Apr 3 β’ 103
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. β’ 43 items β’ Updated Apr 12 β’ 111
read papers Collection This is a collection of some papers I've read in the past few months β’ 10 items β’ Updated Nov 21, 2023 β’ 47
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. β’ 16 items β’ Updated Jan 16 β’ 144
Latent Consistency Models LoRAs Collection Latent Consistency Models for Stable Diffusion - LoRAs and full fine-tuned weights β’ 4 items β’ Updated Nov 10, 2023 β’ 98
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook β’ 9 items β’ Updated Apr 12 β’ 144
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 67 items β’ Updated Aug 6 β’ 83
PhotoMaker Collection Let us create photos/paintings/avatars for anyone in any style within seconds. β’ 5 items β’ Updated Jul 22 β’ 25
From screenshots to HTML Collection WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. β’ 4 items β’ Updated Apr 15 β’ 17
SLIM GGUF Collection Quantized GGUF 'tool' implementations of SLIM Models β’ 30 items β’ Updated 20 days ago β’ 9
Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! β’ 30 items β’ Updated Jul 11 β’ 78
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) β’ 12 items β’ Updated May 28 β’ 131
LLM as a Judge Collection Curated resources that support the use of LLMs to serve as automatic evaluators of other LLM outputs. β’ 16 items β’ Updated 1 day ago β’ 20
Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo π€ β’ 1 item β’ Updated Jul 17 β’ 16
π Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized β’ 70 items β’ Updated 9 days ago β’ 84
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! β’ 30 items β’ Updated Jun 12 β’ 211
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. β’ 121 items β’ Updated Jan 31 β’ 489
LLM Hallucination Detection Papers Collection Collection of LLM hallucination and evaluation papers that I've been exploring and implementing. Some of them have my comments and annotated doodles. β’ 12 items β’ Updated Feb 20 β’ 12
datasets-SPIN Collection Generated synthetic data used to finetune SPIN. β’ 8 items β’ Updated Feb 9 β’ 11
SLIM Models Collection Structured Language Instruction Models (SLIMs) β’ 31 items β’ Updated 20 days ago β’ 28
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. β’ 55 items β’ Updated 1 day ago β’ 205
βοΈπ¦ Provenance, Watermarking & Deepfake Detection Collection Technical tools for more control over non-consensual synthetic content β’ 14 items β’ Updated Apr 1 β’ 38
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 264 items β’ Updated Jun 22 β’ 392