Intel is democratizing AI inference by delivering a better price and performance for
real-world use cases on the 4th gen Intel® Xeon® Scalable Processors, formerly codenamed Sapphire Rapids. In this article, Intel® CPU refers to 4th gen Intel® Xeon® Scalable Processors. For protein folding of a set of proteins of lengths less than a thousand, using DeepMind’s AlphaFold2 inference based end-to-end pipeline, a dual socket Intel® CPU node delivers 30% better performance compared to our measured performance of an Intel® CPU with an A100 offload.
-
-
Articles récents
- Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study
- Intel® Xeon® 6 Processors: The Smart Total Cost of Ownership Choice
- Next-Gen AI Inference: Intel® Xeon® Processors Power Vision, NLP, and Recommender Workloads
- Document Summarization: Transforming Enterprise Content with Intel® AI for Enterprise RAG
- AutoRound Meets SGLang: Enabling Quantized Model Inference with AutoRound
-
Neural networks news
Intel NN News
- Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study
In this post, we'll dicuss how to run responsive, CPU-only applications using a quantized SLM in […]
- Intel® AI for Enterprise Inference as a Deployable Architecture on IBM Cloud
Intel® AI for Enterprise Inference as a Deployable Architecture on IBM CloudAuthored by: Pai […]
- Intel® Xeon® 6 Processors: The Smart Total Cost of Ownership Choice
The latest Intel® Xeon® 6 processors deliver performance advantages across key enterprise […]
- Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study
-