Open-Orca
/

OpenOrcaxOpenChat-Preview2-13B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

OpenOrcaxOpenChat-Preview2-13B / README.md

imone's picture

Update README

276f3c7 over 1 year ago

|

400 Bytes

	---
	license: llama2
	---

	Usage:

	1. [Install OpenChat](https://github.com/imoneoi/openchat/#installation)

	2. `python -m ochat.serving.openai_api_server --model-type openchat_llama2 --model Open-Orca/Llama2_GPT4_1M --engine-use-ray --worker-use-ray --max-num-batched-tokens 5120`

	To use features such as tensor parallelism on consumer GPUs, API keys and logging, follow the OpenChat documentation.