Post #1413

@huggingface

Hugging Face

Visualizzazioni26Numero di visualizzazioni

Pubblicato29 set29/09/2025, 14:21

Contenuto del post

Contenuto

Hugging Face (Twitter) RT @Saboo_Shubham_: oLLM is a lightweight Python library for local large-context LLM inference. Run gpt-oss-20B, Qwen3-next-80B, Llama-3.1-8B on ~$200 consumer GPU with just 8GB VRAM. And this is without any quantization - only fp16/bf16 precision. 100% Opensource.