Commits · lukestanley/ChillTranslator

Tidying: reordering code a bit

124003a

lukestanley commited on Feb 29

Add Mistral API support due to my RunPod serverless system reliability issues

8093276

lukestanley commited on Feb 29

Add TODO for Runpod timeout handling

3c6c618

lukestanley commited on Feb 29

Docs: HuggingFace README.md metadata changes

1dba83c

lukestanley commited on Feb 29

Make Gradio app text more concise

5abc6b6

lukestanley commited on Feb 29

Cache examples

2857585

lukestanley commited on Feb 29

Add assert in improvement_loop function to make more robust

4901d0f

lukestanley commited on Feb 29

Assert RunPod env vars are setup before trying to use them

00af17e

lukestanley commited on Feb 29

Change return type of improvement_loop to dict in app.py

859cc57

lukestanley commited on Feb 29

Update allow_flagging option in Gradio interface

db708d2

lukestanley commited on Feb 28

Fix Gradio textbox with placeholder

171111d

lukestanley commited on Feb 28

Docs: Add local usage instructions for running the Gradio web server GUI

38a55db

lukestanley commited on Feb 28

Clarify setup comments, remove unused global, increase max iterations

c995e6d

lukestanley commited on Feb 28

Docs: Readme update

58dcbdf

lukestanley commited on Feb 28

Add JSON parsing and format output in HTML

5fb50e6

lukestanley commited on Feb 28

Move constants

12c7670

lukestanley commited on Feb 28

Clarify ChillTranslator description

ca6258e

lukestanley commited on Feb 28

Docs: Move sections around

47a3557

lukestanley commited on Feb 28

Doc: Idea for speed improvements and intermediate results display, grouping future directions

968cab3

lukestanley commited on Feb 28

Documentation changes

21ce4d4

lukestanley commited on Feb 28

Add description to app

9f20b49

Luke Stanley commited on Feb 28

Add cached examples

acc8b42

Luke Stanley commited on Feb 28

Documentation: Add image

2dac454

Luke Stanley commited on Feb 28

Documentation: Update future directions

d72193c

Luke Stanley commited on Feb 28

Add HuggingFace Space demo link

03b6491

Luke Stanley commited on Feb 28

Reduce max_iterations value in chill.py

6bfaa63

Luke Stanley commited on Feb 28

Comment out llama-cpp-python installation command in Docker for HuggingFace Space

56e7667

Luke Stanley commited on Feb 28

Switch to serverless worker by default (PR #2 from lukestanley/serverless_json_llm)

a054519
unverified

Luke Stanley commited on Feb 28

Revert expected serverless output metadata stripper

c013599

Luke Stanley commited on Feb 28

Documents serverless motivation and testing instructions

5da2aef

Luke Stanley commited on Feb 28

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments

469f650

Luke Stanley commited on Feb 28

Fix RUNPOD_ENDPOINT_ID environment variable

ce5ad5f

Luke Stanley commited on Feb 28

Add more serverless GPU endpoint setup instruction detail

b51ce5c

Luke Stanley commited on Feb 28

Document serverless setup

f2e80c9

Luke Stanley commited on Feb 28

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git

83e4d57

Luke Stanley commited on Feb 28

Introduces worker mode env var

56e785c

Luke Stanley commited on Feb 28

Make GPU detection and llama-cpp-python re-installation conditional

434144a

Luke Stanley commited on Feb 28

Initialise global variables in improvement_loop function

e30b729

Luke Stanley commited on Feb 28

Ensure N_GPU_LAYERS is int

9475016

Luke Stanley commited on Feb 27

Expose json typed LLM interface for RunPod

976ea17

Luke Stanley commited on Feb 27

RunPod Mixtral JSON output test

233efeb

Luke Stanley commited on Feb 27

Add hello world RunPod setup

feeb679

Luke Stanley commited on Feb 26

Update default GPU layer, temperature values

e327a9e

lukestanley commited on Feb 26

Add env vars to set GPU layer count and context size, make verbose

e01e28e

lukestanley commited on Feb 26

Fix gif link since LFS related gif binary purge due to HF requirments

0945e5b

lukestanley commited on Feb 25

Add n_gpu_layers parameter to Llama initialization

88e6118

lukestanley commited on Feb 25

Fix: Move n_ctx parameter to model setup!

358cd20

lukestanley commited on Feb 25

Fix check for LLM_MODEL_PATH to avoid load error

ff938c3

lukestanley commited on Feb 25

Correct Space metadata

f5a3b9d

lukestanley commited on Feb 25

Add HuggingFace space metadata

994c606

lukestanley commited on Feb 25

Commit History

Tidying: reordering code a bit 124003a

Add Mistral API support due to my RunPod serverless system reliability issues 8093276

Add TODO for Runpod timeout handling 3c6c618

Docs: HuggingFace README.md metadata changes 1dba83c

Make Gradio app text more concise 5abc6b6

Cache examples 2857585

Add assert in improvement_loop function to make more robust 4901d0f

Assert RunPod env vars are setup before trying to use them 00af17e

Change return type of improvement_loop to dict in app.py 859cc57

Update allow_flagging option in Gradio interface db708d2

Fix Gradio textbox with placeholder 171111d

Docs: Add local usage instructions for running the Gradio web server GUI 38a55db

Clarify setup comments, remove unused global, increase max iterations c995e6d

Docs: Readme update 58dcbdf

Add JSON parsing and format output in HTML 5fb50e6

Move constants 12c7670

Clarify ChillTranslator description ca6258e

Docs: Move sections around 47a3557

Doc: Idea for speed improvements and intermediate results display, grouping future directions 968cab3

Documentation changes 21ce4d4

Add description to app 9f20b49

Add cached examples acc8b42

Documentation: Add image 2dac454

Documentation: Update future directions d72193c

Add HuggingFace Space demo link 03b6491

Reduce max_iterations value in chill.py 6bfaa63

Comment out llama-cpp-python installation command in Docker for HuggingFace Space 56e7667

Switch to serverless worker by default (PR #2 from lukestanley/serverless_json_llm) a054519 unverified

Revert expected serverless output metadata stripper c013599

Documents serverless motivation and testing instructions 5da2aef

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments 469f650

Fix RUNPOD_ENDPOINT_ID environment variable ce5ad5f

Add more serverless GPU endpoint setup instruction detail b51ce5c

Document serverless setup f2e80c9

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git 83e4d57

Introduces worker mode env var 56e785c

Make GPU detection and llama-cpp-python re-installation conditional 434144a

Initialise global variables in improvement_loop function e30b729

Ensure N_GPU_LAYERS is int 9475016

Expose json typed LLM interface for RunPod 976ea17

RunPod Mixtral JSON output test 233efeb

Add hello world RunPod setup feeb679

Update default GPU layer, temperature values e327a9e

Add env vars to set GPU layer count and context size, make verbose e01e28e

Fix gif link since LFS related gif binary purge due to HF requirments 0945e5b

Add n_gpu_layers parameter to Llama initialization 88e6118

Fix: Move n_ctx parameter to model setup! 358cd20

Fix check for LLM_MODEL_PATH to avoid load error ff938c3

Correct Space metadata f5a3b9d

Add HuggingFace space metadata 994c606

Tidying: reordering code a bit

124003a

Add Mistral API support due to my RunPod serverless system reliability issues

8093276

Add TODO for Runpod timeout handling

3c6c618

Docs: HuggingFace README.md metadata changes

1dba83c

Make Gradio app text more concise

5abc6b6

Cache examples

2857585

Add assert in improvement_loop function to make more robust

4901d0f

Assert RunPod env vars are setup before trying to use them

00af17e

Change return type of improvement_loop to dict in app.py

859cc57

Update allow_flagging option in Gradio interface

db708d2

Fix Gradio textbox with placeholder

171111d

Docs: Add local usage instructions for running the Gradio web server GUI

38a55db

Clarify setup comments, remove unused global, increase max iterations

c995e6d

Docs: Readme update

58dcbdf

Add JSON parsing and format output in HTML

5fb50e6

Move constants

12c7670

Clarify ChillTranslator description

ca6258e

Docs: Move sections around

47a3557

Doc: Idea for speed improvements and intermediate results display, grouping future directions

968cab3

Documentation changes

21ce4d4

Add description to app

9f20b49

Add cached examples

acc8b42

Documentation: Add image

2dac454

Documentation: Update future directions

d72193c

Add HuggingFace Space demo link

03b6491

Reduce max_iterations value in chill.py

6bfaa63

Comment out llama-cpp-python installation command in Docker for HuggingFace Space

56e7667

Switch to serverless worker by default (PR #2 from lukestanley/serverless_json_llm)

a054519
unverified

Revert expected serverless output metadata stripper

c013599

Documents serverless motivation and testing instructions

5da2aef

Avoid unneeded imports, make serverless output more sensible, removing some debugging and comments

469f650

Fix RUNPOD_ENDPOINT_ID environment variable

ce5ad5f

Add more serverless GPU endpoint setup instruction detail

b51ce5c

Document serverless setup

f2e80c9

Rename serverless test file, set default model to Phi 2 for test, removed jq install, and env vars that are set the same in utils.py already, ignore .cache in git

83e4d57

Introduces worker mode env var

56e785c

Make GPU detection and llama-cpp-python re-installation conditional

434144a

Initialise global variables in improvement_loop function

e30b729

Ensure N_GPU_LAYERS is int

9475016

Expose json typed LLM interface for RunPod

976ea17

RunPod Mixtral JSON output test

233efeb

Add hello world RunPod setup

feeb679

Update default GPU layer, temperature values

e327a9e

Add env vars to set GPU layer count and context size, make verbose

e01e28e

Fix gif link since LFS related gif binary purge due to HF requirments

0945e5b

Add n_gpu_layers parameter to Llama initialization

88e6118

Fix: Move n_ctx parameter to model setup!

358cd20

Fix check for LLM_MODEL_PATH to avoid load error

ff938c3

Correct Space metadata

f5a3b9d

Add HuggingFace space metadata

994c606