lmms-lab
/

llava-next-interleave-qwen-7b-dpo

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Link model to paper

#3

by nielsr HF staff - opened Jul 12

base: refs/heads/main

←

from: refs/pr/3

Discussion Files changed

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -46,4 +46,18 @@ Use the code below to evaluate the model.
 Please first edit /path/to/ckpt to the path of checkpoint, /path/to/images to the path of "interleave_data" in scripts/interleave/eval_all.sh and then run
 ```bash
 bash scripts/interleave/eval_all.sh
 ```

 Please first edit /path/to/ckpt to the path of checkpoint, /path/to/images to the path of "interleave_data" in scripts/interleave/eval_all.sh and then run
 ```bash
 bash scripts/interleave/eval_all.sh
+```
+## Bibtex citation
+```bibtex
+@misc{li2024llavanextinterleavetacklingmultiimagevideo,
+      title={LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models},
+      author={Feng Li and Renrui Zhang and Hao Zhang and Yuanhan Zhang and Bo Li and Wei Li and Zejun Ma and Chunyuan Li},
+      year={2024},
+      eprint={2407.07895},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2407.07895},
+}
 ```