LangChain
#5 opened by laek
Does anyone have an example of this being loaded in LangChain?
That link is for llama.cpp, which can't be used with this GPTQ model for GPU inference. It can be used with the Vicuna GGML models that you can download.
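For the GGML route, here is a rough sketch of what loading through LangChain's llama.cpp wrapper might look like. It assumes `llama-cpp-python` is installed and that your LangChain version exposes `LlamaCpp` under `langchain.llms` (newer releases moved it to `langchain_community.llms`). The helper names, the file name in the usage comment, and the parameter values are placeholders, not anything taken from this repo.

```python
def llama_cpp_kwargs(model_path, n_ctx=2048, temperature=0.7, max_tokens=256):
    """Collect constructor arguments for LangChain's LlamaCpp wrapper.

    The defaults here are illustrative guesses, not tuned values.
    """
    return {
        "model_path": str(model_path),  # path to a local GGML file
        "n_ctx": n_ctx,                 # context window size in tokens
        "temperature": temperature,     # sampling temperature
        "max_tokens": max_tokens,       # cap on generated tokens per call
    }


def load_vicuna_ggml(model_path):
    """Build a LangChain LLM backed by llama.cpp (hypothetical helper)."""
    # Imported lazily so the module can be imported without LangChain installed.
    # Assumption: older import path; adjust for your LangChain version.
    from langchain.llms import LlamaCpp

    return LlamaCpp(**llama_cpp_kwargs(model_path))


# Usage (file name is a placeholder for whichever GGML file you downloaded):
#   llm = load_vicuna_ggml("./models/vicuna-13b-ggml-q4_0.bin")
#   print(llm("Q: What is LangChain? A:"))
```

This only covers the GGML files via llama.cpp; it does not load the GPTQ model.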
For GPTQ GPU inference with LangChain, check out https://github.com/cxfcxf/embeddings