Mamba-Shedder: Intel Labs Explores Efficient Compression of Selective Structured State Space Models

Utilizing block pruning techniques, Intel Labs researchers developed the Mamba-Shedder solution to remove redundancies in Mamba-based models, improving their computational and memory efficiency

Publié dans Non classé | Commentaires fermés sur Mamba-Shedder: Intel Labs Explores Efficient Compression of Selective Structured State Space Models

Driving Industrial Innovation with AI at the Edge: Open Platforms Leading the Way

Industrial enterprises need more than just AI to scale technology across diverse environments. They need open, flexible, and interoperable platforms.

Publié dans Non classé | Commentaires fermés sur Driving Industrial Innovation with AI at the Edge: Open Platforms Leading the Way

Deploying Scalable Enterprise RAG on Kubernetes with Ansible Automation

Generative AI is changing how businesses work, and Retrieval-Augmented Generation (RAG) is one of the most exciting tools out there. It helps AI give smarter, more accurate answers by connecting it to your company’s data. But setting it up can be tricky—until now.

If you’re curious about how to bring AI into your business without the headaches, this post is for you.

Publié dans Non classé | Commentaires fermés sur Deploying Scalable Enterprise RAG on Kubernetes with Ansible Automation

Scalable Vector Search: Deep Dive Series

Vector search is at the core of the AI revolution, and this blog series is here to teach you all about it. Our blog series introduces scalable vector search and dives deeper into sector compression, dimensionality reduction, and Retrieval-Augmented Generation systems.

Publié dans Non classé | Commentaires fermés sur Scalable Vector Search: Deep Dive Series

Leveling Up Your AI Skills in 30 Minutes

 

 

 

Publié dans Non classé | Commentaires fermés sur Leveling Up Your AI Skills in 30 Minutes

Building Agentic AI Foundations: How Intel® Liftoff Startups Are Preparing for the Next GPT Moment

Agentic AI is here: See how Intel® Liftoff startups are building smarter, more autonomous systems that plan, reason, and execute across real-world workflows.

Publié dans Non classé | Commentaires fermés sur Building Agentic AI Foundations: How Intel® Liftoff Startups Are Preparing for the Next GPT Moment

Designing Empathetic AI: The Future of Human-Centered Technology

Ted Shelton, Chief Operating Officer at Inflection AI, discusses how emotionally intelligent AI is transforming business interactions, customer experience, and organizational workflows.

Publié dans Non classé | Commentaires fermés sur Designing Empathetic AI: The Future of Human-Centered Technology

Deploying Llama 4 Scout and Maverick Models on Intel® Gaudi® 3 with vLLM

Learn how to deploy Llama 4 Scout and Maverick models on Intel® Gaudi® 3 using vLLM for efficient, high-performance inference across complex AI tasks.

Publié dans Non classé | Commentaires fermés sur Deploying Llama 4 Scout and Maverick Models on Intel® Gaudi® 3 with vLLM

Intel Labs’ Innovative Low-Rank Model Adaptation Increases Model Accuracy and Compression

Intel Labs’ Neural Low-Rank Adapter Search (NLS) produces accurate models with INT4 weights and is available in OpenVINO’s Neural Network Compression Framework

Publié dans Non classé | Commentaires fermés sur Intel Labs’ Innovative Low-Rank Model Adaptation Increases Model Accuracy and Compression

Running Llama3.3-70B on Intel® Gaudi® 2 with vLLM: A Step-by-Step Inference Guide

Run Llama 3.3-70B efficiently on Intel® Gaudi® 2 using vLLM. Learn setup, configuration, and performance tips for scalable, production-ready inference.

Publié dans Non classé | Commentaires fermés sur Running Llama3.3-70B on Intel® Gaudi® 2 with vLLM: A Step-by-Step Inference Guide