DRAGON Models
Collection
Production-grade RAG-optimized 6-7B parameter models - "Delivering RAG on ..." the leading foundation base models
•
23 items
•
Updated
•
44
dragon-llama2-ov is a high-quality, fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in OpenVino int4 for AI PCs using Intel GPU, CPU and NPU.
This model provides a good combination of accuracy and inference performance.
Base model
llmware/dragon-llama-7b-v0