NVIDIA Base Command Manager 11

Developer Manual

Introduction

This manual is for developers who want to program NVIDIA Base Command Manager (BCM) to extend or modify its functionality. It focuses on the Python API, which enables automation of cluster operations, metric collection, and interaction with the CMDaemon process. It assumes familiarity with the Administrator Manual, particularly its coverage of CMDaemon, and gives detailed instructions for using the Python API in a range of cluster management tasks.

About the Manuals

NVIDIA Base Command Manager 11 offers a suite of manuals covering different aspects of the system. These include the Administrator Manual for general cluster management, the Installation Manual for setup, the User Manual for end-user job submission, and specialized manuals for Cloudbursting, Edge, Containerization, and Mission Control integration. All manuals are regularly updated and available online at https://docs.nvidia.com/base-command-manager.

Getting Support

Support for BCM subscriptions from version 10 onwards is available through the NVIDIA Enterprise Support page. For developer-specific inquiries or more extensive support needs, developers can contact the BCM support team to arrange a support contract. Professional services are also available via the NVIDIA Enterprise Services page.

Python API Overview

The NVIDIA Base Command Manager Python API, overhauled in version 8.2, provides a pure Python connection to the cluster manager. Cluster operations can therefore be automated with Python on any operating system that supports Python 3.5 or higher. The API depends on several extra modules, including pyOpenSSL, ply, and lxml. The manual explains how to get started with the Python API: setting up the environment, connecting to a cluster, inspecting and modifying settings, performing operations on entities, and monitoring cluster data.
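Before setting up the environment, it can help to confirm that the interpreter and the extra modules named above are in place. The following is a minimal sketch rather than anything taken from the manual: the helper name `check_environment` is hypothetical, and the module list covers only the three modules mentioned here (the manual says there are others). It uses only the Python standard library and does not call the BCM Python API itself.

```python
import importlib.util
import sys

# Distribution name -> import name for the extra modules named in the text.
# pyOpenSSL is imported as "OpenSSL"; ply and lxml import under their own names.
# This list is an assumption and is not exhaustive.
REQUIRED_MODULES = {"pyOpenSSL": "OpenSSL", "ply": "ply", "lxml": "lxml"}
MIN_PYTHON = (3, 5)


def check_environment(modules=REQUIRED_MODULES, min_python=MIN_PYTHON):
    """Return (python_ok, missing), where missing lists absent distributions."""
    python_ok = sys.version_info[:2] >= tuple(min_python)
    missing = [
        dist
        for dist, import_name in modules.items()
        if importlib.util.find_spec(import_name) is None
    ]
    return python_ok, sorted(missing)


if __name__ == "__main__":
    python_ok, missing = check_environment()
    print("Python version OK: {}".format(python_ok))
    print("Missing modules: {}".format(", ".join(missing) or "none"))
```

An empty list of missing modules means only that these three imports resolve; it does not verify their versions or that a connection to a cluster will succeed.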

Examples and Resources

The manual lists the examples located at `/cm/local/examples/cmd/pythoncm` on the head node. These examples demonstrate tasks such as managing nodes, collecting monitoring data, and executing commands. Working through them is the recommended way to gain a practical understanding of the Python API.
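To see which examples are present on a given head node, the directory can be listed with a few lines of standard-library Python. This is a sketch under stated assumptions: the helper name `list_examples` is hypothetical, and it assumes the examples are stored as `.py` files directly in that directory, which the text above does not confirm.

```python
from pathlib import Path

# Path given in the text; it exists only on a BCM head node.
EXAMPLES_DIR = Path("/cm/local/examples/cmd/pythoncm")


def list_examples(directory=EXAMPLES_DIR):
    """Return sorted names of the .py files in directory, or [] if it is absent."""
    directory = Path(directory)
    if not directory.is_dir():
        return []
    # Assumption: example scripts carry a .py extension and are not nested.
    return sorted(p.name for p in directory.glob("*.py"))


if __name__ == "__main__":
    names = list_examples()
    if not names:
        print("No examples found under {}".format(EXAMPLES_DIR))
    for name in names:
        print(name)
```

On a machine that is not a head node the function returns an empty list instead of raising an error, so the script can be run anywhere.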

File Info: application/pdf, 69 pages, 506.54 KB