TGTGInsighttelegram intelligenceLIVE / telegram public index
← GitHub Trends

TGINSIGHT SIMILAR POSTS

Find similar content

Source channel @githubtrending · Post #14974 · Jul 19

#cplusplus ik_llama.cpp is an improved version of llama.cpp that runs faster on CPUs and hybrid GPU/CPU setups. It supports many new advanced quantization methods, which help models use less memory and run more efficiently. It also offers better performance for special models like DeepSeek and MoE, with faster prompt processing and token generation. You can run it on various hardware, including Android, and it has features to control where model data is stored (CPU or GPU). This means you get quicker AI responses and can handle bigger or more complex models smoothly on your computer or device[2][1][4]. https://github.com/ikawrakow/ik_llama.cpp

Hashtags

Results

1 similar post found

Search: #convolution

当前筛选 #convolution清除筛选
GitHub Trends

@githubtrending · Post #14659 · 05/01/2025, 03:30 PM

#cplusplus#arm#convolution#deep_learning#embedded_devices#llm#machine_learning#ml#mnn#transformer#vulkan#winograd_algorithm MNN is a lightweight and efficient deep learning framework that helps run AI models on mobile devices and other small devices. It supports many types of AI models and can handle tasks like image recognition and language processing quickly and locally on your device. This means you can use AI features without needing to send data to the cloud, which improves privacy and speed. MNN is used in many apps, including those from Alibaba, and supports various platforms like Android and iOS. It also helps reduce the size of AI models, making them faster and more efficient. https://github.com/alibaba/MNN