Archives de catégorie : Non classé

AutoRound Meets SGLang: Enabling Quantized Model Inference with AutoRound

Publié le 21 novembre 2025 par

We are thrilled to announce an official collaboration between SGLang and AutoRound, enabling low-bit quantization for efficient LLM inference.

Publié dans Non classé | Commentaires fermés

In-production AI Optimization Guide for Xeon: Search and Recommendation Use Case

Publié le 17 novembre 2025 par

In this guide, you’ll learn multiple aspects of optimizing the Search and Recommendation model deployed in Production using Intel Xeon CPU servers.

Publié dans Non classé | Commentaires fermés

Argonne’s Aurora Supercomputer Drives Simulations to Explore How Light Shapes Quantum Materials

Publié le 14 novembre 2025 par

Researchers using the Aurora supercomputer at the U.S. Department of Energy’s Argonne National Laboratory have achieved the largest-ever simulations of light interacting with quantum materials.

Publié dans Non classé | Commentaires fermés

Argonne’s Aurora Supercomputer Helps Power Breakthrough Simulations of Quantum Materials

Publié le 14 novembre 2025 par

Using three U.S. Department of Energy (DOE) supercomputers, researchers from the University of Southern California (USC) and DOE’s Lawrence Berkeley National Laboratory developed new ways to model these complex systems with greater precision than ever before.

Publié dans Non classé | Commentaires fermés

AERIS Earth Systems Model Pushes AI for Science to New Heights

Publié le 14 novembre 2025 par

Researchers at the U.S. Department of Energy’s (DOE) Argonne National Laboratory introduce AERIS, a breakthrough AI system learning from decades of Earth systems data to deliver fast, high-resolution forecasts.

Publié dans Non classé | Commentaires fermés

Leveraging Edge AI for Business Innovation

Publié le 6 novembre 2025 par

Discover how Intel Edge AI merges computing and intelligence to drive automation, real-time decisions, and business transformation.

Publié dans Non classé | Commentaires fermés

Intel® AI for Enterprise Inference as a Deployable Architecture on IBM Cloud

Publié le 26 octobre 2025 par

Intel® AI for Enterprise Inference as a Deployable Architecture on IBM Cloud The enterprise AI landscape demands solutions that can scale efficiently while maintaining operational simplicity and cost-effectiveness. Intel® AI for Enterprise Inference (Enterprise Inference), powered by the Open Platform … Continuer la lecture →

Publié dans Non classé | Commentaires fermés

Scaling Intel® AI for Enterprise RAG Performance: 64-Core vs 96-Core Intel® Xeon®

Publié le 23 octobre 2025 par

This evaluation shows materially higher concurrency and improved latency scaling when moving from a 64-core to a 96-core Intel® Xeon® configuration for Intel® AI for Enterprise RAG inference. The 96-core SKU doubles SLA-compliant concurrency for Llama-AWQ and Mistral-AWQ (32 → … Continuer la lecture →

Publié dans Non classé | Commentaires fermés

Comprehensive Analysis: Intel® AI for Enterprise RAG Performance

Publié le 23 octobre 2025 par

This comprehensive analysis demonstrates that systems with two 64-core Intel® Xeon® processors can effectively support enterprise-scale RAG deployments, handling up to 32 concurrent users with optimized configurations that comply with targeted SLAs. These results validate Intel® Xeon® as a viable … Continuer la lecture →

Publié dans Non classé | Commentaires fermés

Agentic AI: The Dawn of Specialized Small Language Models

Publié le 20 octobre 2025 par

Small Language Models (SLMs) are emerging as the nimble, quick-thinking counterparts to LLMs providing specialized knowledge in a way that is lean, fast and cost-effective.

Publié dans Non classé | Commentaires fermés

Archives de catégorie : Non classé

AutoRound Meets SGLang: Enabling Quantized Model Inference with AutoRound

In-production AI Optimization Guide for Xeon: Search and Recommendation Use Case

Argonne’s Aurora Supercomputer Drives Simulations to Explore How Light Shapes Quantum Materials

Argonne’s Aurora Supercomputer Helps Power Breakthrough Simulations of Quantum Materials

AERIS Earth Systems Model Pushes AI for Science to New Heights

Leveraging Edge AI for Business Innovation

Intel® AI for Enterprise Inference as a Deployable Architecture on IBM Cloud

Scaling Intel® AI for Enterprise RAG Performance: 64-Core vs 96-Core Intel® Xeon®

Comprehensive Analysis: Intel® AI for Enterprise RAG Performance

Agentic AI: The Dawn of Specialized Small Language Models

Articles récents

Neural networks news

Intel NN News

Archives

Catégories