Chapter 32. Using Advanced Error Reporting
Advanced Error Reporting (AER) notifies you of error events for PCIe devices. Enabled by default in RHEL, AER collects errors in kernel logs. Use the rasdaemon program to parse and store these errors in a database for easier analysis.
32.1. Overview of AER
Advanced Error Reporting (AER) is a kernel feature that provides enhanced error reporting for Peripheral Component Interconnect Express (PCIe) devices. The AER kernel driver attaches to root ports that support the PCIe AER capability in order to:
- Gather comprehensive error information
- Report errors to users
- Perform error recovery actions
When AER captures an error, it sends an error message to the console. For a correctable error, the console output is a warning.
Example 32.1. Example AER output
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: AER: Corrected error received: id=ae00
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: AER: Multiple Corrected error received: id=ae00
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=0000(Receiver ID)
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: device [8086:2030] error status/mask=000000c0/00002000
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: [ 6] Bad TLP
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: [ 7] Bad DLLP
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: AER: Multiple Corrected error received: id=ae00
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=0000(Receiver ID)
Feb 5 15:41:33 hostname kernel: pcieport 10003:00:00.0: device [8086:2030] error status/mask=00000040/00002000
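The output above can be read more easily with a little scripting. The following is a minimal sketch, not a RHEL-provided tool: the function names aer_status_bits and aer_count_by_device are hypothetical, and the sketch assumes log lines in the same format as Example 32.1. The first helper lists which bits are set in the hexadecimal value from an "error status/mask=" line; the second counts "PCIe Bus Error" lines per device address and severity.

```shell
# Sketch only: both helper names are hypothetical, not part of any RHEL tool.

# Print the bit positions that are set in a hexadecimal AER status value,
# for example the "000000c0" in "error status/mask=000000c0/00002000".
# The body runs in a subshell so its variables do not leak to the caller.
aer_status_bits() (
    case "$1" in
        ''|*[!0-9a-fA-F]*) echo "usage: aer_status_bits HEX" >&2; exit 1 ;;
    esac
    status=$((0x$1))
    bit=0
    while [ "$bit" -lt 32 ]; do
        if [ $(( (status >> bit) & 1 )) -eq 1 ]; then
            echo "$bit"
        fi
        bit=$((bit + 1))
    done
)

# Count "PCIe Bus Error" lines per device address and severity.
# Reads kernel log lines on stdin and prints "COUNT DEVICE SEVERITY".
aer_count_by_device() {
    awk '
        /PCIe Bus Error: severity=/ {
            # The device address is the field just before "PCIe".
            dev = ""
            for (i = 2; i <= NF; i++) if ($i == "PCIe") { dev = $(i - 1); break }
            sub(/:$/, "", dev)
            # Keep only the text between "severity=" and the next comma.
            sev = $0
            sub(/.*severity=/, "", sev)
            sub(/,.*/, "", sev)
            count[dev " " sev]++
        }
        END { for (k in count) print count[k], k }
    ' | sort
}
```

For the status value 000000c0 from the example, aer_status_bits prints 6 and 7, which correspond to the "[ 6] Bad TLP" and "[ 7] Bad DLLP" lines that follow it in the log.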
32.2. Collecting and displaying AER messages
To collect and display AER messages, use the rasdaemon program.
Procedure
1. Install the rasdaemon package.
   # yum install rasdaemon
2. Enable and start the rasdaemon service.
   # systemctl enable --now rasdaemon
   Created symlink /etc/systemd/system/multi-user.target.wants/rasdaemon.service → /usr/lib/systemd/system/rasdaemon.service.
3. Issue the ras-mc-ctl command.
   # ras-mc-ctl --summary
   # ras-mc-ctl --errors
   The command displays a summary of the logged errors (the --summary option) or displays the errors stored in the error database (the --errors option).
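Because AER also writes its messages to the kernel log, the raw lines can serve as a cross-check on what rasdaemon recorded. The following is a minimal sketch under that assumption; aer_filter is a hypothetical helper name, and the pattern only matches the headline lines in the format of Example 32.1, not the per-bit detail lines.

```shell
# Sketch only: aer_filter is a hypothetical helper name.
# Keep the AER headline lines from kernel log text read on stdin.
aer_filter() {
    grep -E 'AER: |PCIe Bus Error: '
}
```

One possible use is to pipe the kernel messages from the journal through the filter, for example: journalctl -k --no-pager | aer_filter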