Comment
Author: Admin | 2025-04-27
Settings to prefer Maximum Performance to test out some hints I have seen during my research into the issue. Interestingly, the benchmark ran through (only one time) and reported the correct card, 100% CPU usage and plausible Frame Rates/Scores. However, when I went to save the results after the Benchmark, the system froze again.This leads me to believe that the Power Supply should be ok, since the benchmark was able to run through. Otherwise, I would have expected an earlier failure.nvidia-bug-report/GPU falls of busUnder Ubuntu 22.10, when I log into a tty, I get the following messages in the syslog:Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195769] pcieport 0000:00:01.0: AER: Uncorrected (Non-Fatal) error received: 0000:00:01.0Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195774] pcieport 0000:00:01.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195783] pcieport 0000:00:01.0: device [8086:a70d] error status/mask=00100000/00010000Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195789] pcieport 0000:00:01.0: [20] UnsupReq (First)Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195794] pcieport 0000:00:01.0: AER: TLP Header: 34000000 01000010 00000000 00000000Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195801] nvidia 0000:01:00.0: AER: can't recover (no error_detected callback)Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195801] snd_hda_intel 0000:01:00.1: AER: can't recover (no error_detected callback)Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.195812] pcieport 0000:00:01.0: AER: device recovery failedJan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.565295] NVRM: GPU at PCI:0000:01:00: GPU-695bdbb4-8c56-b809-4f9c-c9e864a3ad2eJan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.565318] NVRM: Xid (PCI:0000:01:00): 79, pid='', name=, GPU has fallen off the bus.Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.565329] NVRM: GPU 0000:01:00.0: GPU has fallen off the bus.Jan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.565351] NVRM: A GPU crash dump has been created. If possible, please runJan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.565351] NVRM: nvidia-bug-report.sh as root to collect this data beforeJan 22 00:10:23 johannes-Z790-AERO-G kernel: [ 17.565351] NVRM: the NVIDIA kernel module is unloaded.This is reproducible. The card fans stop, it seems dead. Attached is the bug report:nvidia-bug-report.log.gz (116.9 KB)I do not know how to generate something like this under windows to see if it is similar. Is it possible to read from this information if the card is bad or if there is a
Add Comment