llama-memory

Easy deployment of quantized llama models on cpu


Keywords
vector, database, db, rag, long, term, ai, memory, llama, artificial, intelligence, natural, language, processing, nlp, quantization, cpu, deployment, inference, model, models, repo, repository, library, libraries, gguf, llm
Licenses
AFL-3.0/NCGL-UK-2.0
Install
pip install llama-memory==0.0.1a1

Documentation

MEMORY FOR GGUF_LLAMA AND OTHER LLAMAs