Custom 4-bit finetuning with 5-7 times faster inference than QLoRA
pinned
1
#9 opened over 1 year ago
by
rmihaylov
AI World
#89 opened 8 months ago
by
MohammadMuzamil
Adding `safetensors` variant of this model
#88 opened 9 months ago
by
Dennison33
combining falcon 40b instruct with langchain
#87 opened 10 months ago
by
rra21
Update generation_config.json
1
#85 opened about 1 year ago
by
nkasmanoff
Update generation_config.json
1
#84 opened about 1 year ago
by
nkasmanoff
Getting gibberish output with Falcon-40b instruct
2
#83 opened about 1 year ago
by
harsh244
Falcon 40B Inference on GKE Autopilot A100 40GB
3
#82 opened about 1 year ago
by
bshongwe
Adding `safetensors` variant of this model
#81 opened about 1 year ago
by
Flolight
Adding `safetensors` variant of this model
#80 opened about 1 year ago
by
Flolight
CPU or GPU
1
#76 opened about 1 year ago
by
lalit34
Optimizing Inference Time for Chat Conversations on Falcon
#73 opened about 1 year ago
by
humza-sami
Use input attention mask instead of causal mask in attention
#72 opened about 1 year ago
by
CyberZHG
Is there a way to not use trust_remote_code=True?
#71 opened over 1 year ago
by
momentumhd
Unable to load and run finetuned falcon model
#70 opened over 1 year ago
by
DioulaD
Parameters contains nan numbers when loading model locally
#69 opened over 1 year ago
by
yunsxie
ValueError: sharded is not supported for AutoModel ERROR
8
#68 opened over 1 year ago
by
peyers
ValueError in KoboldAI when loading the model
1
#66 opened over 1 year ago
by
JermemyHaschal
Cannot set "instructions" when invoking inference endpoint
1
#65 opened over 1 year ago
by
aruana
Changes in modelling_RW.py to be able to handle past_key_values for faster model generations
#64 opened over 1 year ago
by
puru22
Model sometimes generates '</s>'
1
#63 opened over 1 year ago
by
jlzhou
Correct blogpost link
#62 opened over 1 year ago
by
isydmr
Error: ShardCannotStart
#61 opened over 1 year ago
by
Bhupesh2003
Finetuning Falcon-40B-Instruct For ChatBot Use Case
1
#59 opened over 1 year ago
by
sdkramer10
Adding `safetensors` variant of this model
2
#58 opened over 1 year ago
by
nth-attempt
Add `tokenizer_class` to get `pipeline` to load tokenizer
#57 opened over 1 year ago
by
chiragjn
Adding `safetensors` variant of this model
#56 opened over 1 year ago
by
shayan
ValueError: Error raised by inference API: Model tiiuae/falcon-40b-instruct time out using HuggingFaceHub
1
#55 opened over 1 year ago
by
nicoleds
Question about Apache 2.0 license
2
#54 opened over 1 year ago
by
psinger
Running the Falcon-40B-Instruct model on Azure Kubernetes Service
#53 opened over 1 year ago
by
zioproto
Experimental ggml demos
2
#52 opened over 1 year ago
by
matthoffner
Truncated output from API call through langchain
4
#51 opened over 1 year ago
by
TMTechnology
Experiences with complex instructions
1
#50 opened over 1 year ago
by
Tuana
Update README.md
#49 opened over 1 year ago
by
saattrupdan
Why Rotary Positional Embeddings Over Alibi?
#48 opened over 1 year ago
by
mallorbc
About Input validation error: `inputs` tokens + `max_new_tokens` must be <= 1512.
3
#47 opened over 1 year ago
by
Holynull
Is an ALiBi version available for fine-tuning to a large context window?
3
#46 opened over 1 year ago
by
run
Finetune Falcon-4b with large token size.
2
#44 opened over 1 year ago
by
amnasher
Model returns entire input prompt together with output
11
#43 opened over 1 year ago
by
andee96
Instruction prompt
3
#42 opened over 1 year ago
by
mazzaqq
Update README.md
#41 opened over 1 year ago
by
zagg8705
Arabic Language support
2
#40 opened over 1 year ago
by
Hgdawy
Request: DOI
#39 opened over 1 year ago
by
ongkn
what is the input token length of Falcon-40B and -7B models?
3
#38 opened over 1 year ago
by
sermolin
AttributeError: 'RWConfig' object has no attribute 'n_hea'
2
#36 opened over 1 year ago
by
ibrim
CUDA error on more than 400 words
#35 opened over 1 year ago
by
a749734
test case one
1
#33 opened over 1 year ago
by
FALCONBoy
a100-80g memory but still call error
6
#32 opened over 1 year ago
by
leocheung
How many of you are planning on using this as a programming assistant?
4
#31 opened over 1 year ago
by
BliepBlop
Why not add system requirements on the model card?
9
#28 opened over 1 year ago
by
johnjohndoedoe