-
-
Articles récents
- Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study
- Intel® Xeon® 6 Processors: The Smart Total Cost of Ownership Choice
- Next-Gen AI Inference: Intel® Xeon® Processors Power Vision, NLP, and Recommender Workloads
- Document Summarization: Transforming Enterprise Content with Intel® AI for Enterprise RAG
- AutoRound Meets SGLang: Enabling Quantized Model Inference with AutoRound
-
Neural networks news
Intel NN News
- Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study
In this post, we'll dicuss how to run responsive, CPU-only applications using a quantized SLM in […]
- Intel® AI for Enterprise Inference as a Deployable Architecture on IBM Cloud
Intel® AI for Enterprise Inference as a Deployable Architecture on IBM CloudAuthored by: Pai […]
- Intel® Xeon® 6 Processors: The Smart Total Cost of Ownership Choice
The latest Intel® Xeon® 6 processors deliver performance advantages across key enterprise […]
- Optimizing SLMs on Intel® Xeon® Processors: A llama.cpp Performance Study
-
Archives de catégorie : Non classé
KVCrush: Rethinking KV Cache Alternative Representation for Faster LLM Inference
Developed by Intel, KVCrush can improve LLM inference throughput up to 4x with less than 1% accuracy drop.
Publié dans Non classé
Commentaires fermés sur KVCrush: Rethinking KV Cache Alternative Representation for Faster LLM Inference
Scaling AI with Confidence: Lenovo’s Approach to Responsible and Practical Adoption
In the race to operationalize AI, success depends not on flashy pilots, but on turning experimentation into measurable business value. According to David Ellison, Chief Data Scientist and Director of AI Engineering at Lenovo, the most successful AI projects start … Continuer la lecture
Publié dans Non classé
Commentaires fermés sur Scaling AI with Confidence: Lenovo’s Approach to Responsible and Practical Adoption
Unlocking AI-Driven Media Monetization with Intel® Xeon® CPUs and Broadpeak BannersIn2
In this article, we will cover how to deploy high-performance AI inferencing for media data curation and retrieval-augmented generation (RAG) without requiring discrete GPUs.
Publié dans Non classé
Commentaires fermés sur Unlocking AI-Driven Media Monetization with Intel® Xeon® CPUs and Broadpeak BannersIn2
AI at the Edge: Intel’s Vision for Real-World Impact
When it comes to scaling AI, the conversation isn’t only about the cloud—it’s about the edge. According to Matthew Formica, Senior Director and Head of Edge Product Marketing & AI PC/Edge AI Software Developer Relations at Intel, the edge represents … Continuer la lecture
Publié dans Non classé
Commentaires fermés sur AI at the Edge: Intel’s Vision for Real-World Impact
Intel® Xeon® Processors: The Most Preferred CPU for AI Host Nodes
Today’s AI workloads are not purely offloaded to GPU accelerators. Host CPUs such as the Intel® Xeon® 6 processors play a significant role in maximizing the performance of AI-accelerated systems.
Publié dans Non classé
Commentaires fermés sur Intel® Xeon® Processors: The Most Preferred CPU for AI Host Nodes
Building AI With Empathy: Sorenson’s Mission for Accessibility
For Sorenson Senior Director of AI Mariam Rahmani, the future of AI isn’t about building the flashiest models—it’s about creating solutions that close communication gaps and empower people, especially the Deaf and Hard of Hearing community. With a third of … Continuer la lecture
Publié dans Non classé
Commentaires fermés sur Building AI With Empathy: Sorenson’s Mission for Accessibility
Multi-node deployments using Intel® AI for Enterprise RAG
As enterprises scale generative AI across diverse infrastructures, Intel® AI for Enterprise RAG solution delivers a modular, hardware-aware framework for Retrieval-Augmented Generation (RAG) optimized for Kubernetes and Intel platforms. With intelligent scheduling, NUMA-aware resource isolation, and dynamic scaling, it ensures predictable, … Continuer la lecture
Publié dans Non classé
Commentaires fermés sur Multi-node deployments using Intel® AI for Enterprise RAG
Connected Data is the Future: How Neo4j Is Enabling the Next Generation of AI
In the evolving landscape of artificial intelligence, connected data is becoming a core competitive advantage.
Publié dans Non classé
Commentaires fermés sur Connected Data is the Future: How Neo4j Is Enabling the Next Generation of AI
Orchestrating AI for Real Business Value: Google Cloud’s Approach to Scalable Intelligence
In the race to operationalize AI, success hinges not on hype, but on clarity, customization, and speed to value. According to Riyaz Habibbhai, Director of Product Marketing at Google Cloud, truly effective AI product marketing focuses on one simple but … Continuer la lecture
Publié dans Non classé
Commentaires fermés sur Orchestrating AI for Real Business Value: Google Cloud’s Approach to Scalable Intelligence
Curious Case of Chain of Thought: Improving CoT Efficiency via Training-Free Steerable Reasoning
Researchers from the University of Texas at Austin and Intel Labs investigated chain-of-thought reasoning structures in large language models to identify and calibrate flawed reasoning pathways
Publié dans Non classé
Commentaires fermés sur Curious Case of Chain of Thought: Improving CoT Efficiency via Training-Free Steerable Reasoning