Cisco Secure AI Factory: Integrating Mistral.ai and NVIDIA for Enterprise AI

L'écosystème Cisco Secure AI Factory avec Mistral.ai et NVIDIA

Patrice Nivaggioli

AI Engineer | Cisco EMEA

Track 4/BK 2

Agenda

Introduction
Cisco + NVIDIA: Bringing AI to the Enterprise
Cisco + NVIDIA: Bringing Secure AI to the Enterprise
Cisco + Mistral.ai: Bringing GenAI to the Enterprise
Wrap-Up

Cisco AI Strategy

AI in Cisco

Networking, Security, Collaboration, Observability solutions

Examples:

WIFI RRM in Catalyst Center
SDWAN Path prediction
Failure prediction with Thousand Eyes
Malicious Activities with HyperShield
Threat Detection in Splunk/XDR

AI with Cisco

API/SDKs, Agents, Agentic Frameworks, Open weights models

Examples:

Object Detection with Meraki API
Logs and Metrics AI Analysis with Splunk MLTK and DSDL
AI Agents with Webex Contact Center fdrn.ai and agntcy.org

AI on Cisco

Cisco Secure AI Factory

Accelerated Computing and Networking
AI Datacenter Fabric
AI Edge
AI Observability and Security

Partnerships: Cisco, NVIDIA

AI use cases across industries

Employee Experience

Chatbots | AI assistants | Copilots

Customer Experience

Virtual agents | Specialized knowledge base

Business Process

AI Ops

Threats Detection/Prevention | Incident Response/Remediation | Automation | Root Cause Analysis | Network Digital Twins

Challenges with AI projects

delays time to value realization

Security vulnerabilities

AI models, frameworks, apps, and supporting infrastructure represent a new cyberattack surface

Network performance bottlenecks

Model training and inferencing generates a lot of traffic, slowing networks and delaying time-to-value

Complex AI infrastructure deployment

Lack of high-performance infrastructure with integrated compute, network, storage, and AI software can stall AI projects

Bringing AI to the Enterprise

Partnership: Cisco and NVIDIA

YouTube Video Link

Cisco Newsroom Article

Cisco and NVIDIA expand partnership to accelerate AI adoption in the enterprise

Partnership focus

Private data centers
Backend AI networks
Full-stack AI infrastructures
Cisco's switching/control plane/security and infrastructure stack

How are we partnering?

Cisco will be part of NVIDIA's Enterprise Reference Architecture
Cisco will be part of NVIDIA's Cloud Partner Reference Architecture
Deliver AI Factory with customer choice: Cisco Silicon One® or NVIDIA Spectrum-X Silicon

AI Datacenter Networks

Front End (N/S) (OOB) (Storage)

Scale Up (intra-node)

Back End (Scale-Out) (E/W)

LLM Training Parallelism and Algorithms

Pipeline parallelism: ncclSend, ncclReceive

Data parallelism: ncclAllReduce

Tensor Parallelism: ncclAllGather

And also: MoE (mixture of experts), ncclSend/nccIRecv (alltoall), FSDP (fully sharded data parallelism), ncclAllGather, and other variations.

Communication in a ring (or tree) is limited by the speed of the lowest link.

Collective Communication and RDMA

RDMA CPU Offload

RDMA Stack:

RDMA Applications: Pytorch, TensorFlow, JAX
RDMA Software Stack: NCCL, OpenMPI, GPFS
IB Transport Protocol
IB Network Layer
Infiniband Link Layer
Infiniband Management
InfiniBand
UDP
IP
Ethernet Link Layer
Ethernet / IP Management
RoCEv2

Backend Fabric for AI Training

GPU Memory to GPU Memory RDMA
Very High Bandwidth
Very Low Latency
Ultra Bursty Traffic (on/off)
Low entropy flow and elephant flows

Technologies: Non blocking, Lossless, PFC, ECN, Packet Load Balancing

AI Ready Fabric: Design 1024 GPUs (8 x N9364E, 16 x N9364E)

Rail Optimized design

Bringing Secure AI to the Enterprise

Partnership: Cisco and NVIDIA

YouTube Video Link

Cisco Newsroom Article

The new AI risk landscape

Consequences of unmanaged AI risk:

Financial damage
Litigation risk
Reputational damage
Compliance risk
Security risk
IP leakage

Example Scenario: A customer interacts with an AI chatbot for a car purchase, leading to a legally binding offer for $1.

Cisco Secure AI Factory with NVIDIA

Security-first architecture
High-performance, enterprise-proven networking, compute, storage, and AI software stack
Pre-validated, with flexible deployment options

Components: AI/GenAI models, Enterprise data, NVIDIA AI Enterprise, Cisco UCS, Cisco Nexus Switches, NetApp, Pure Storage, VAST Data.

Outputs: Knowledgebase Copilots, Content & Code Generation, Virtual Agent & Chatbots, Detection & Predictions.

Security-first architecture enables safe Enterprise AI

Securing the Applications

Cisco AI Defense: Robust testing and runtime security of LLMs and generative AI applications.

Securing the Workloads

Cisco Hypershield: Protection against adversary lateral movement and proactive vulnerability mitigation without the need for patching, all from a single management interface.

Securing the Infrastructure

Integration with NVIDIA Bluefield-3's DOCA AppShield for intrusion detection in AI-focused virtual machines and containers. (Future)

Cisco Hybrid Mesh Firewall: Unified security management and consistent and pervasive policy across multiple enforcement points. (Future)

To include management of NVIDIA BlueField®-3 DPUs for enabling AI Cluster perimeter firewalls. (Future)

Cisco Secure AI Factory stack details

AI/GenAI Pipeline:

NVIDIA: AI Enterprise, NeMo, NIMS, Blueprints
Compute: Cisco UCS® w/ NVIDIA Accelerated Computing, NVIDIA BlueField®-3 DPUs, Managed w/ Cisco Intersight®
Networking: Cisco Nexus 9300 w/ Silicon One® Switches w/ NVIDIA Spectrum-X*, Managed w/ Cisco Nexus Dashboard
Storage: NetApp | Pure Storage | VAST Data
Security: Cisco AI Defense, Cisco Hybrid Mesh Firewall, Isovalent, Secure Firewall, Hypershield

Outputs: Knowledgebase Copilots, Content & Code Generation, Virtual Agent & Chatbots, Detection & Predictions.

Modular system deployment and scaling with AI PODS

AI POD: An atomic unit of Secure AI Factory with NVIDIA, including NVIDIA AI Enterprise, Cisco UCS, and NetApp/Pure Storage/VAST Data.

Cisco Secure AI Factory with NVIDIA: Scalable architecture using AI PODs, incorporating Compute, Networking, and Security components.

What we build at Mistral AI

Frontier models

Open and enterprise models released every month
SOTA across language, code, vision
Deeply customizable and deployable anywhere

Enterprise platforms

Portable inference engine across datacenter, cloud, edge
Enterprise interfaces for builders and users
All the tooling required to start seeing ROI

Enterprise solutions

Use case discovery and execution
Custom model training
Deployment, optimization, and scaling

Partners: Snowflake, CMA CGM, BNP PARIBAS, orange™, CISCO, STELLANTIS, CURSOR, Harve

Cisco Newsroom Article

Cisco and Mistral to accelerate GenAI adoption in the enterprise

Main focus areas

Private enterprise architecture
End-to-end customizability
Enterprise Context
Enterprise AI strategy

How are we collaborating?

Enterprise Grade GenAI Platform
Custom model training and enterprise interfaces development
Deployment, Optimization and Scaling

La Plateforme

Applied AI and deployment services:

AI tooling: Agents, Fine-tuning, Embedding, Function/tool calling, Distillation, Safety, Monitoring
Library of frontier LLMs: SOTA language models, Small and edge models, Code models, Multimodal models, Custom models
AI infrastructure management: Inference container, Routing/caching, Load balancing, API gateway, Security and resilience

Cisco Secure AI Factory stack with Mistral.ai

AI/GenAI Pipeline:

Mistral.ai: Frontier Models & Enterprise Platform, Tools, Agents, OCR
NVIDIA: AI Enterprise, NeMo, NIMS, Blueprints
Compute: Cisco UCS® w/ NVIDIA Accelerated Computing, NVIDIA BlueField®-3 DPUs, Managed w/ Cisco Intersight®
Networking: Cisco Nexus 9300 w/ Silicon One® Switches w/ NVIDIA Spectrum-X*, Managed w/ Cisco Nexus Dashboard
Storage: NetApp | Pure Storage | VAST Data
Security: Cisco AI Defense, Cisco Hybrid Mesh Firewall, Isovalent, Secure Firewall, Hypershield

Outputs: Knowledgebase Copilots, Content & Code Generation, Virtual Agent & Chatbots, Detection & Predictions.

Wrap-Up

Cisco + NVIDIA: AI Networking Fabric
Cisco + NVIDIA: Cisco Secure AI Factory
Cisco + Mistral.ai: Enterprise Grade GenAI Platform

Toute l'équipe Datacenter France est à votre disposition sur notre stand.

	Cisco Subcontractor Generative Artificial Intelligence Exhibit Exhibit detailing Cisco's requirements and subcontractor obligations for the use of Generative AI tools, focusing on data security, privacy, and responsible AI practices.
	Cisco AI Assistant Components - Product Overview Explore the user-friendly components of the Cisco AI Assistant, including its text input, chat history, feedback mechanisms, and notification system for enhanced security environment management.
	Cisco UCS C845A M8 Rack Server: Scalable AI and HPC Platform Data sheet detailing the Cisco UCS C845A M8 Rack Server, a highly scalable and customizable AI system built on the NVIDIA MGX reference design. Features support for multiple NVIDIA and AMD GPUs, advanced AI workloads, and integration with Cisco Intersight.
	Cisco AI Network Analytics: Overview, Benefits, and Deployment An overview of Cisco AI Network Analytics, detailing its features, benefits for network insights, and licensing/deployment considerations for Cisco DNA Center.
	Cisco UCS C885A M8 Rack Server: High-Performance AI Compute Discover the Cisco UCS C885A M8 Rack Server, a dense GPU server engineered for demanding AI workloads like LLM training, fine-tuning, and inference. Featuring NVIDIA HGX or AMD MI GPUs, it offers scalable accelerated compute capabilities.
	Cisco AI Network Analytics: Advanced Feature Overview Discover Cisco AI Network Analytics, an advanced solution for proactive network monitoring, troubleshooting, and performance optimization using AI and machine learning. Learn about its features, architecture, and benefits for IT operations.
	Cisco Contact Center Enterprise Solutions Release 12.6(1) Release Notes This document details new features, updates, important notes, deprecated items, and caveats for Cisco Contact Center Enterprise Solutions Release 12.6(1), covering products like Cisco Unified Contact Center Enterprise, Cisco Finesse, and more.
	Cisco Webex Cinematic Meetings: Enhance Hybrid Collaboration with AI-Powered Video Explore Cisco Webex Cinematic Meetings, featuring AI-powered video devices that transform hybrid collaboration. Learn about features like People Focus, Frames, Crossview, and Campfire Set-Up designed for inclusive and engaging virtual meetings.

Agenda

Cisco AI Strategy

AI in Cisco

AI with Cisco

AI on Cisco

AI use cases across industries

Employee Experience

Customer Experience

Business Process

AI Ops

Challenges with AI projects

Security vulnerabilities

Network performance bottlenecks

Complex AI infrastructure deployment

Bringing AI to the Enterprise

Cisco and NVIDIA expand partnership to accelerate AI adoption in the enterprise

Partnership focus

How are we partnering?

AI Datacenter Networks

LLM Training Parallelism and Algorithms

Collective Communication and RDMA

Backend Fabric for AI Training

Bringing Secure AI to the Enterprise

The new AI risk landscape

Cisco Secure AI Factory with NVIDIA

Security-first architecture enables safe Enterprise AI

Securing the Applications

Securing the Workloads

Securing the Infrastructure

Cisco Secure AI Factory stack details

Modular system deployment and scaling with AI PODS

What we build at Mistral AI

Frontier models

Enterprise platforms

Enterprise solutions

Cisco and Mistral to accelerate GenAI adoption in the enterprise

Main focus areas

How are we collaborating?

La Plateforme

Cisco Secure AI Factory stack with Mistral.ai

Wrap-Up

Related Documents