Post #1934

@huggingface

Hugging Face

Visualizzazioni18Numero di visualizzazioni

Pubblicato12 dic12/12/2025, 16:10

Contenuto del post

Contenuto

‌Hugging Face (Twitter) RT @victormustar: 🎉 llama.cpp now has Ollama-style model management. • Auto-discover GGUFs from cache • Load on first request • Each model runs in its own process • Route by `model` (OpenAI-compatible API) • LRU unload at `--models-max` https://huggingface.co/blog/ggml-org/model-management-in-llamacpp