Commit History

Tidying: reordering code a bit
124003a

lukestanley commited on

Add Mistral API support due to my RunPod serverless system reliability issues
8093276

lukestanley commited on

Add TODO for Runpod timeout handling
3c6c618

lukestanley commited on

Docs: HuggingFace README.md metadata changes
1dba83c

lukestanley commited on

Make Gradio app text more concise
5abc6b6

lukestanley commited on

Add assert in improvement_loop function to make more robust
4901d0f

lukestanley commited on

Assert RunPod env vars are setup before trying to use them
00af17e

lukestanley commited on

Change return type of improvement_loop to dict in app.py
859cc57

lukestanley commited on

Update allow_flagging option in Gradio interface
db708d2

lukestanley commited on

Fix Gradio textbox with placeholder
171111d

lukestanley commited on

Docs: Add local usage instructions for running the Gradio web server GUI
38a55db

lukestanley commited on

Clarify setup comments, remove unused global, increase max iterations
c995e6d

lukestanley commited on

Add JSON parsing and format output in HTML
5fb50e6

lukestanley commited on

Clarify ChillTranslator description
ca6258e

lukestanley commited on

Docs: Move sections around
47a3557

lukestanley commited on

Doc: Idea for speed improvements and intermediate results display, grouping future directions
968cab3

lukestanley commited on

Documentation changes
21ce4d4

lukestanley commited on

Add description to app
9f20b49

Luke Stanley commited on

Add cached examples
acc8b42

Luke Stanley commited on

Documentation: Add image
2dac454

Luke Stanley commited on

Documentation: Update future directions
d72193c

Luke Stanley commited on

Add HuggingFace Space demo link
03b6491

Luke Stanley commited on

Reduce max_iterations value in chill.py
6bfaa63

Luke Stanley commited on

Comment out llama-cpp-python installation command in Docker for HuggingFace Space
56e7667

Luke Stanley commited on

Switch to serverless worker by default (PR #2 from lukestanley/serverless_json_llm)
a054519
unverified

Luke Stanley commited on

Revert expected serverless output metadata stripper
c013599

Luke Stanley commited on

Documents serverless motivation and testing instructions
5da2aef

Luke Stanley commited on

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments
469f650

Luke Stanley commited on

Fix RUNPOD_ENDPOINT_ID environment variable
ce5ad5f

Luke Stanley commited on

Add more serverless GPU endpoint setup instruction detail
b51ce5c

Luke Stanley commited on

Document serverless setup
f2e80c9

Luke Stanley commited on

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git
83e4d57

Luke Stanley commited on

Introduces worker mode env var
56e785c

Luke Stanley commited on

Make GPU detection and llama-cpp-python re-installation conditional
434144a

Luke Stanley commited on

Initialise global variables in improvement_loop function
e30b729

Luke Stanley commited on

Ensure N_GPU_LAYERS is int
9475016

Luke Stanley commited on

Expose json typed LLM interface for RunPod
976ea17

Luke Stanley commited on

RunPod Mixtral JSON output test
233efeb

Luke Stanley commited on

Add hello world RunPod setup
feeb679

Luke Stanley commited on

Update default GPU layer, temperature values
e327a9e

lukestanley commited on

Add env vars to set GPU layer count and context size, make verbose
e01e28e

lukestanley commited on

Fix gif link since LFS related gif binary purge due to HF requirments
0945e5b

lukestanley commited on

Add n_gpu_layers parameter to Llama initialization
88e6118

lukestanley commited on

Fix: Move n_ctx parameter to model setup!
358cd20

lukestanley commited on

Fix check for LLM_MODEL_PATH to avoid load error
ff938c3

lukestanley commited on

Correct Space metadata
f5a3b9d

lukestanley commited on

Add HuggingFace space metadata
994c606

lukestanley commited on