Le projet THINK

Projet de R&T transverse IN2P3

Aller au contenu

Accueil
Les techniques neuronales
IA embarquée
Résultats

Archives mensuelles : janvier 2026

Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study

Publié le 21 janvier 2026 par

In this post, we’ll dicuss how to run responsive, CPU-only applications using a quantized SLM in the GPT-Generated Unified Format (GGUF).

Publié dans Non classé | Commentaires fermés sur Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study

Rechercher
Articles récents
Neural networks news
Intel NN News

Archives
Catégories
- Non classé

Le projet THINK

Fièrement propulsé par WordPress

Generated by Feedzy