GPU Card Installation

This chapter contains the following topics:

Server Firmware Requirements

The following table lists the minimum server firmware versions for the supported GPU cards.

>
GPU Card Cisco IMC/BIOS Minimum Version Required
NVIDIA L4 PCIe, 72W, Gen 4 x8 (UCSC-GPU-L4) 4.1(3)
Intel GPU Flex 140 PCIe, 75W, Gen 4 x8 (UCSC-GPU-FLEX140) 4.1(3)

GPU Card Configuration Rules

Note the following rules when populating a server with GPU cards.

Requirement For All GPUs: Memory-Mapped I/O Greater Than 4 GB

All supported GPU cards require enablement of the BIOS setting that allows greater than 4 GB of memory-mapped I/O (MMIO).

Procedure

  1. Refer to the Cisco UCS Manager configuration guide (GUI or CLI) for your release for instructions on configuring service profiles: Cisco UCS Manager Configuration Guides
  2. Refer to the chapter on Configuring Server-Related Policies > Configuring BIOS Settings.
  3. In the section of your profile for PCI Configuration BIOS Settings, set Memory Mapped IO Above 4GB Config to one of the following:
    • Disabled-Does not map 64-bit PCI devices to 64 GB or greater address space.
    • Enabled-Maps I/O of 64-bit PCI devices to 64 GB or greater address space.
    • Platform Default-The policy uses the value for this attribute contained in the BIOS defaults for the server. Use this only if you know that the server BIOS is set to use the default enabled setting for this item.
  4. Reboot the server.

Replacing a Single-Wide GPU Card

A GPU kit (UCSC-GPURKIT-C220) is available from Cisco. The kit contains a GPU mounting bracket and the following risers (risers 1 and 2):

Procedure

  1. Remove an existing GPU card from the PCIe riser:
    1. Shut down and remove power from the server as described in Shutting Down and Removing Power From the Server.
    2. Slide the server out the front of the rack far enough so that you can remove the top cover. You might have to detach cables from the rear panel to provide clearance.
    3. Caution: If you cannot safely view and access the component, remove the server from the rack.
    4. Remove the top cover from the server as described in Removing Top Cover.
    5. Using a #2 Phillips screwdriver, loosen the captive screws.
    6. Pull evenly on both ends of the GPU card to disconnect the card from the socket.
    7. If the riser has no card, remove the blanking panel from the rear opening of the riser.
  2. Holding the GPU level, slide it out of the socket on the PCIe riser.
  3. Install a new GPU card:

    Note: The Intel Flex 140 and Nvidia L4 are half-height, half-length cards. If one is installed in full-height PCIe slot 1, it requires a full-height rear-panel tab installed to the card.

    1. Align the new GPU card with the empty socket on the PCIe riser and slide each end into the retaining clip.
    2. Push evenly on both ends of the card until it is fully seated in the socket.
    3. Ensure that the card's rear panel tab sits flat against the riser rear-panel opening.

PCIe Riser Assembly

Figure 1: PCIe Riser Assembly, 3 HHHL

Note: For easy identification, riser numbers are stamped into the sheet metal on the top of each riser cage.

1 Captive screw for PCIe slot 1 (alignment feature) 6 Handle for PCIe slot 3 riser PCIe slot 1 rear-panel opening
2 Captive screw for PCIe slot 2 (alignment feature) 7 Rear-panel opening for PCIe slot 1
3 Captive screw for PCIe slot 2 (alignment feature) 8 Rear-panel opening for PCIe slot 2
4 Handle for PCIe slot 1 riser 9 Rear-panel opening for PCIe slot 3
5 Handle for PCIe slot 2 riser -

Figure 2: PCIe Riser Assembly, 2 FHFL

1 Captive screw for PCIe slot 1 4 Handle for PCIe slot 2 riser Rear-panel opening for PCIe slot 1
2 Captive screw for PCIe slot 2 5 Rear-panel opening for PCIe slot 1
3 Handle for PCIe slot 1 riser - Rear-panel opening for PCIe slot 2

d) Position the PCIe riser over its sockets on the motherboard and over the chassis alignment channels.

Figure 3: PCIe Riser Alignment Features

For a server with 3 HHHL risers, 3 sockets and 3 alignment features are available, as shown below.

Figure 4: PCIe Riser Alignment Features

For a server with 2 FHFL risers, 2 sockets and 2 alignment features are available, as shown below.

e) Carefully push down on both ends of the PCIe riser to fully engage its two connectors with the two sockets on the motherboard.

f) When the riser is level and fully seated, use a #2 Phillips screwdriver to secure the riser to the server chassis.

g) Replace the top cover to the server.

h) Replace the server in the rack, replace cables, and then fully power on the server by pressing the Power button.

Optional: Continue with Installing Drivers to Support the GPU Cards, on page 8.

Installing Drivers to Support the GPU Cards

After you install the hardware, you must update to the correct level of server BIOS and then install GPU drivers and other software in this order:

  1. Update the server BIOS.
  2. Update the GPU drivers.

1. Updating the Server BIOS

Install the latest Cisco UCS C240 M4 server BIOS by using the Host Upgrade Utility for the Cisco UCS C240 M4 server.

Note: You must do this procedure before you update the NVIDIA drivers.

Procedure

  1. Navigate to the following URL: http://www.cisco.com/cisco/software/navigator.html
  2. Click Servers-Unified Computing in the middle column.
  3. Click Cisco UCS C-Series Rack-Mount Standalone Server Software in the right-hand column.
  4. Click the name of your model of server in the right-hand column.
  5. Click Unified Computing System (UCS) Server Firmware.
  6. Click the release number.
  7. Click Download Now to download the ucs-server platform-huu-version_number.iso file.
  8. Verify the information on the next page, and then click Proceed With Download.
  9. Continue through the subsequent screens to accept the license agreement and browse to a location where you want to save the file.
  10. Use the Host Upgrade Utility to update the server BIOS.

The user guides for the Host Upgrade Utility are at Utility User Guides.

2. Updating the GPU Card Drivers

After you update the server BIOS, you can install GPU drivers to your hypervisor virtual machine.

Procedure

  1. Install your hypervisor software on a computer. Refer to your hypervisor documentation for the installation instructions.
  2. Create a virtual machine in your hypervisor. Refer to your hypervisor documentation for instructions.
  3. Install the GPU drivers to the virtual machine. Download the drivers from either:
  4. Restart the server.
  5. Check that the virtual machine is able to recognize the GPU card. In Windows, use the Device Manager and look under Display Adapters.
Models: UCSC-GPU-L4, UCSC-GPU-FLEX140, FLEX140 GPU Card, GPU Card, Card

File Info : application/pdf, 10 Pages, 2.22MB

PDF preview unavailable. Download the PDF instead.

m-gpu-install DITA Open Toolkit XEP 4.30.961; modified using iText 2.1.7 by 1T3XT

Related Documents

Preview Cisco Select UCS Accessories End-of-Life Announcement
This document announces the end-of-sale and end-of-life for Cisco Select UCS accessories. It provides key dates, product part numbers, and migration options.
Preview Cisco UCS Firmware Upgrade Guidelines and Prerequisites
Comprehensive guide detailing best practices, prerequisites, and procedures for upgrading firmware on Cisco UCS systems, including fabric interconnects, servers, and adapters.
Preview Cisco UCS C240 M5 Server GPU Card Installation Guide
This guide provides detailed instructions for installing and configuring GPU cards in Cisco UCS C240 M5 servers, including firmware requirements, configuration rules, and driver installation.
Preview Cisco UCS C3160 Rack Server Installation and Service Guide
This official Cisco UCS C3160 Rack Server Installation and Service Guide provides essential information for IT professionals. It details the installation, configuration, maintenance, and troubleshooting of the Cisco UCS C3160 high-density storage server, covering hardware components, system setup, and best practices.
Preview Cisco UCS C220 M4 Server Installation and Service Guide
Comprehensive guide for installing and servicing the Cisco UCS C220 M4 Server, covering hardware setup, component replacement, and system configuration.
Preview Cisco UCS C220 M5 Rack Server Spec Sheet
Detailed specifications and configuration guide for the Cisco UCS C220 M5 Rack Server (Small Form Factor Disk Drive Model), covering its features, components, and setup.
Preview Cisco Storage Controller Considerations for UCS C-Series M7 Servers
This document outlines considerations for storage controllers, including supported models, firmware compatibility, RAID backup options, drive mixing guidelines, and cabling for Cisco UCS C-Series M7 servers.
Preview Cisco UCS C845A M8 Rack Server: Scalable AI and HPC Platform
Data sheet detailing the Cisco UCS C845A M8 Rack Server, a highly scalable and customizable AI system built on the NVIDIA MGX reference design. Features support for multiple NVIDIA and AMD GPUs, advanced AI workloads, and integration with Cisco Intersight.