The THINK Project

IN2P3 cross-disciplinary R&T project

AutoRound Meets SGLang: Enabling Quantized Model Inference with AutoRound

Published on 21 November 2025

We are thrilled to announce an official collaboration between SGLang and AutoRound, enabling low-bit quantization for efficient LLM inference.
