Find similar content

Source channel @githubtrending · Post #14907 · Jul 3

#python#agents#generative_ai_tools#llamacpp#llm#onnx#openvino#parsing#retrieval_augmented_generation#small_specialized_models llmware is a powerful, easy-to-use platform that helps you build AI applications using small, specialized language models designed for business tasks like question-answering, summarization, and data extraction. It supports private, secure deployment on your own machines without needing expensive GPUs, making it cost-effective and safe for enterprise use. You can organize and search your documents, run smart queries, and combine knowledge with AI to get accurate answers quickly. It also offers many ready-to-use models and examples, plus tools for building chatbots and agents that automate complex workflows. This helps you save time, improve accuracy, and securely leverage AI for your business needs[1][3][5]. https://github.com/llmware-ai/llmware

Hashtags

#python #agents #generative_ai_tools #llamacpp #llm #onnx #openvino #parsing #retrieval_augmented_generation #small_specialized_models

Results

2 similar posts found

Search: #granite4

当前筛选 #granite4清除筛选

Machinelearning

@ai_machinelearning_big_data · Post #8680 · 10/02/2025, 05:01 PM

Find similar View

✔️IBM представила Granite 4.0 — новое семейство open-weights языковых моделей от 3B до 32B параметров. Четыре новые модели: - Granite 4.0 H Small - 32B/9B активных параметров - Granite 4.0 H Tiny - 7B/1B - Granite 4.0 H Micro - 3B/3B - Granite 4.0 Micro - 3B/3B Benchmarking (Artificial Analysis Index): - Granite 4.0 H Small: 23 балла (на 8 выше Granite 3.3 8B), обходит Gemma 3 27B (22), но уступает Mistral Small 3.2 (29) и Qwen3 30B A3B (37). - Granite 4.0 Micro: 16 баллов, выше Gemma 3 4B (15) и LFM 2 2.6B (12). ⚡ Token efficiency: - Granite 4.0 Small — 5.2M токенов - Granite 4.0 Micro — 6.7M токенов Обе модели заметно эффективнее Granite 3.3 8B и большинства non-reasoning моделей <40B. Детали: - Контекст: до 128K токенов - Лицензия: Apache 2.0 - Granite 4.0 H Small доступна на Replicate по $0.06 / $0.25 за 1M input/output токенов - Все модели доступны на Hugging Face - Модель Micro (3.4B) можно запускать полностью локально. 🔗 Hugging Face: https://huggingface.co/collections/unsloth/granite-40-68ddf64b4a8717dc22a9322d 🔗Unsloth: https://docs.unsloth.ai/new/ibm-granite-4.0 @ai_machinelearning_big_data #AI#IBM#Granite4#LLM#OpenWeights

Hashtags

#ai #ibm #granite4 #llm #openweights

GitHub Trends

@githubtrending · Post #15348 · 12/20/2025, 12:00 PM

Find similar View

#go#gemma3#go#gpt_oss#granite4#llama#llama3#llm#on_device_ai#phi3#qwen3#qwen3vl#sdk#stable_diffusion#vlm NexaSDK runs AI models locally on CPUs, GPUs, and NPUs with a single command, supports GGUF/MLX/.nexa formats, and offers NPU-first Android and macOS support for fast, multimodal (text, image, audio) inference, plus an OpenAI‑compatible API for easy integration. This gives you low-latency, private on-device AI across laptops, phones, and embedded systems, reduces cloud costs and data exposure, and lets you deploy and test new models immediately on target hardware for faster development and better user experience. https://github.com/NexaAI/nexa-sdk

Hashtags

#go #gemma3 #gpt_oss #granite4 #llama #llama3 #llm #on_device_ai #phi3 #qwen3 #qwen3vl #sdk #stable_diffusion #vlm