Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
YanweiLi
/
llama-vid-13b-pretrain-224-video-fps-1
like
0
Text Generation
Transformers
llava
vision-language model
llama
video understanding
Inference Endpoints
arxiv:
2311.17043
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
llama-vid-13b-pretrain-224-video-fps-1
1 contributor
History:
3 commits
YanweiLi
Create README.md
a2d4c49
11 months ago
.gitattributes
Safe
1.52 kB
initial commit
12 months ago
README.md
Safe
1.65 kB
Create README.md
11 months ago
config.json
Safe
1.24 kB
Upload 3 files
12 months ago
mm_projector.bin
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
459 MB
LFS
Upload 3 files
12 months ago
trainer_state.json
Safe
373 kB
Upload 3 files
12 months ago