Chapter 6. Configuring PCI passthrough
You can use PCI passthrough to attach a physical PCI device, such as a graphics card or a network device, to an instance. If you use PCI passthrough for a device, the instance reserves exclusive access to the device for performing tasks, and the device is not available to the host.
Using PCI passthrough with routed provider networks
				The Compute service does not support single networks that span multiple provider networks. When a network contains multiple physical networks, the Compute service only uses the first physical network. Therefore, if you are using routed provider networks you must use the same physical_network name across all the Compute nodes.
			
				If you use routed provider networks with VLAN or flat networks, you must use the same physical_network name for all segments. You then create multiple segments for the network and map the segments to the appropriate subnets.
			
To enable your cloud users to create instances with PCI devices attached, you must complete the following:
- Designate Compute nodes for PCI passthrough.
- Configure the Compute nodes for PCI passthrough that have the required PCI devices.
- Deploy the overcloud.
- Create a flavor for launching instances with PCI devices attached.
Prerequisites
- The Compute nodes have the required PCI devices.
6.1. Designating Compute nodes for PCI passthrough
To designate Compute nodes for instances with physical PCI devices attached, you must:
- create a new role file to configure the PCI passthrough role
- configure a new overcloud flavor for PCI passthrough to use to tag the Compute nodes for PCI passthrough
Procedure
- Generate a new roles data file named - roles_data_pci_passthrough.yamlthat includes the- Controller,- Compute, and- ComputePCIroles:- openstack overcloud roles \ generate -o /home/stack/templates/roles_data_pci_passthrough.yaml \ Compute:ComputePCI Compute Controller - (undercloud)$ openstack overcloud roles \ generate -o /home/stack/templates/roles_data_pci_passthrough.yaml \ Compute:ComputePCI Compute Controller- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Open - roles_data_pci_passthrough.yamland edit or add the following parameters and sections:- Expand - Section/Parameter - Current value - New value - Role comment - Role: Compute- Role: ComputePCI- Role name - name: Compute- name: ComputePCI- description- Basic Compute Node role- PCI Passthrough Compute Node role- HostnameFormatDefault- %stackname%-novacompute-%index%- %stackname%-novacomputepci-%index%- deprecated_nic_config_name- compute.yaml- compute-pci-passthrough.yaml
- 
						Register the PCI passthrough Compute nodes for the overcloud by adding them to your node definition template, node.jsonornode.yaml. For more information, see Registering nodes for the overcloud in the Director Installation and Usage guide.
- Inspect the node hardware: - openstack overcloud node introspect \ --all-manageable --provide - (undercloud)$ openstack overcloud node introspect \ --all-manageable --provide- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For more information, see Inspecting the hardware of nodes in the Director Installation and Usage guide. 
- Create the - compute-pci-passthroughbare metal flavor to use to tag nodes that you want to designate for PCI passthrough:- openstack flavor create --id auto \ --ram <ram_size_mb> --disk <disk_size_gb> \ --vcpus <no_vcpus> compute-pci-passthrough - (undercloud)$ openstack flavor create --id auto \ --ram <ram_size_mb> --disk <disk_size_gb> \ --vcpus <no_vcpus> compute-pci-passthrough- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 
								Replace <ram_size_mb>with the RAM of the bare metal node, in MB.
- 
								Replace <disk_size_gb>with the size of the disk on the bare metal node, in GB.
- Replace - <no_vcpus>with the number of CPUs on the bare metal node.Note- These properties are not used for scheduling instances. However, the Compute scheduler does use the disk size to determine the root partition size. 
 
- 
								Replace 
- Tag each bare metal node that you want to designate for PCI passthrough with a custom PCI passthrough resource class: - openstack baremetal node set \ --resource-class baremetal.PCI-PASSTHROUGH <node> - (undercloud)$ openstack baremetal node set \ --resource-class baremetal.PCI-PASSTHROUGH <node>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Replace - <node>with the ID of the bare metal node.
- Associate the - compute-pci-passthroughflavor with the custom PCI passthrough resource class:- openstack flavor set \ --property resources:CUSTOM_BAREMETAL_PCI_PASSTHROUGH=1 \ compute-pci-passthrough - (undercloud)$ openstack flavor set \ --property resources:CUSTOM_BAREMETAL_PCI_PASSTHROUGH=1 \ compute-pci-passthrough- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - To determine the name of a custom resource class that corresponds to a resource class of a Bare Metal service node, convert the resource class to uppercase, replace all punctuation with an underscore, and prefix with - CUSTOM_.Note- A flavor can request only one instance of a bare metal resource class. 
- Set the following flavor properties to prevent the Compute scheduler from using the bare metal flavor properties to schedule instances: - openstack flavor set \ --property resources:VCPU=0 --property resources:MEMORY_MB=0 \ --property resources:DISK_GB=0 compute-pci-passthrough - (undercloud)$ openstack flavor set \ --property resources:VCPU=0 --property resources:MEMORY_MB=0 \ --property resources:DISK_GB=0 compute-pci-passthrough- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Add the following parameters to the - node-info.yamlfile to specify the number of PCI passthrough Compute nodes, and the flavor to use for the PCI passthrough designated Compute nodes:- parameter_defaults: OvercloudComputePCIFlavor: compute-pci-passthrough ComputePCICount: 3 - parameter_defaults: OvercloudComputePCIFlavor: compute-pci-passthrough ComputePCICount: 3- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- To verify that the role was created, enter the following command: - openstack overcloud profiles list - (undercloud)$ openstack overcloud profiles list- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
6.2. Configuring a PCI passthrough Compute node
To enable your cloud users to create instances with PCI devices attached, you must configure both the Compute nodes that have the PCI devices and the Controller nodes.
Procedure
- 
						Create an environment file to configure the Controller node on the overcloud for PCI passthrough, for example, pci_passthrough_controller.yaml.
- Add - PciPassthroughFilterto the- NovaSchedulerDefaultFiltersparameter in- pci_passthrough_controller.yaml:- parameter_defaults: NovaSchedulerDefaultFilters: ['AvailabilityZoneFilter','ComputeFilter','ComputeCapabilitiesFilter','ImagePropertiesFilter','ServerGroupAntiAffinityFilter','ServerGroupAffinityFilter','PciPassthroughFilter','NUMATopologyFilter'] - parameter_defaults: NovaSchedulerDefaultFilters: ['AvailabilityZoneFilter','ComputeFilter','ComputeCapabilitiesFilter','ImagePropertiesFilter','ServerGroupAntiAffinityFilter','ServerGroupAffinityFilter','PciPassthroughFilter','NUMATopologyFilter']- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- To specify the PCI alias for the devices on the Controller node, add the following configuration to - pci_passthrough_controller.yaml:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For more information about configuring the - device_typefield, see PCI passthrough device type field.Note- If the - nova-apiservice is running in a role different from the- Controllerrole, replace- ControllerExtraConfigwith the user role in the format- <Role>ExtraConfig.
- Optional: To set a default NUMA affinity policy for PCI passthrough devices, add - numa_policyto the- nova::pci::aliases:configuration from step 3:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
						To configure the Compute node on the overcloud for PCI passthrough, create an environment file, for example, pci_passthrough_compute.yaml.
- To specify the available PCIs for the devices on the Compute node, use the - vendor_idand- product_idoptions to add all matching PCI devices to the pool of PCI devices available for passthrough to instances. For example, to add all Intel® Ethernet Controller X710 devices to the pool of PCI devices available for passthrough to instances, add the following configuration to- pci_passthrough_compute.yaml:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For more information about how to configure - NovaPCIPassthrough, see Guidelines for configuring- NovaPCIPassthrough.
- You must create a copy of the PCI alias on the Compute node for instance migration and resize operations. To specify the PCI alias for the devices on the PCI passthrough Compute node, add the following to - pci_passthrough_compute.yaml:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- The Compute node aliases must be identical to the aliases on the Controller node. Therefore, if you added - numa_affinityto- nova::pci::aliasesin- pci_passthrough_controller.yaml, then you must also add it to- nova::pci::aliasesin- pci_passthrough_compute.yaml.
- To enable IOMMU in the server BIOS of the Compute nodes to support PCI passthrough, add the - KernelArgsparameter to- pci_passthrough_compute.yaml. For example, use the following- KernalArgssettings to enable an Intel IOMMU:- parameter_defaults: ... ComputePCIParameters: KernelArgs: "intel_iommu=on iommu=pt"- parameter_defaults: ... ComputePCIParameters: KernelArgs: "intel_iommu=on iommu=pt"- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - To enable an AMD IOMMU, set - KernelArgsto- "amd_iommu=on iommu=pt".
- Add your custom environment files to the stack with your other environment files and deploy the overcloud: - openstack overcloud deploy --templates \ -e [your environment files] \ -e /home/stack/templates/pci_passthrough_controller.yaml \ -e /home/stack/templates/pci_passthrough_compute.yaml \ - (undercloud)$ openstack overcloud deploy --templates \ -e [your environment files] \ -e /home/stack/templates/pci_passthrough_controller.yaml \ -e /home/stack/templates/pci_passthrough_compute.yaml \- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create and configure the flavors that your cloud users can use to request the PCI devices. The following example requests two devices, each with a vendor ID of - 8086and a product ID of- 1572, using the alias defined in step 7:- openstack flavor set \ --property "pci_passthrough:alias"="a1:2" device_passthrough - (overcloud)# openstack flavor set \ --property "pci_passthrough:alias"="a1:2" device_passthrough- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
Verification
- Create an instance with a PCI passthrough device: - openstack server create --flavor device_passthrough \ --image <image> --wait test-pci - # openstack server create --flavor device_passthrough \ --image <image> --wait test-pci- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Log in to the instance as a cloud user. For more information, see Log in to an Instance.
- To verify that the PCI device is accessible from the instance, enter the following command from the instance: - lspci -nn | grep <device_name> - $ lspci -nn | grep <device_name>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
6.3. PCI passthrough device type field
				The Compute service categorizes PCI devices into one of three types, depending on the capabilities the devices report. The following lists the valid values that you can set the device_type field to:
			
- type-PF
- The device supports SR-IOV and is the parent or root device. Specify this device type to passthrough a device that supports SR-IOV in its entirety.
- type-VF
- The device is a child device of a device that supports SR-IOV.
- type-PCI
- 
							The device does not support SR-IOV. This is the default device type if the device_typefield is not set.
					You must configure the Compute and Controller nodes with the same device_type.
				
6.4. Guidelines for configuring NovaPCIPassthrough
- 
						Do not use the devnameparameter when configuring PCI passthrough, as the device name of a NIC can change. Instead, usevendor_idandproduct_idbecause they are more stable, or use theaddressof the NIC.
- 
						To use the product_idparameter to pass through a Physical Function (PF), you must also specify theaddressof the PF. However, you can use just theaddressparameter to specify PFs, because the address is unique on each host.
- 
						To pass through all the Virtual Functions (VFs) you must specify only the product_idandvendor_id. You must also specify theaddressif you are using SRIOV for NIC partitioning and you are running OVS on a VF.
- 
						To pass through only the VFs for a PF but not the PF itself, you can use the addressparameter to specify the PCI address of the PF andproduct_idto specify the product ID of the VF.