Hello,
I upgraded my Debian (11 Bullseye) on 29th of May and I got a problem. Since then I get error messages on boot and the lowest resolution on lightDM and desktop. It must be Nvidia it seems.
The errors from boot are:
The output of # systemctl status systemd-modules-load.service:
The output of # systemctl status nvidia-persistenced.service:
Also, there was an error shown right during system upgrade:
Some excerpts from /var/lib/dkms/nvidia-current/470.223.02/build/make.log that are somewhat meaningful I think:
The last time I upgraded before the problem, it was on 26th of April.
Can someone please help me with this? I can provide full output of these snippets if needed and other info.
Thank you very much!
I upgraded my Debian (11 Bullseye) on 29th of May and I got a problem. Since then I get error messages on boot and the lowest resolution on lightDM and desktop. It must be Nvidia it seems.
The errors from boot are:
[FAILED] Failed to start Load Kernel Modules.
See 'systemctl status systemd-modules-load.service' for details.
...
[FAILED] Failed to start NVIDIA Persistence Daemon.
See 'systemctl status nvidia-persistenced.service' for details.
The output of # systemctl status systemd-modules-load.service:
● systemd-modules-load.service - Load Kernel Modules
Loaded: loaded (/lib/systemd/system/systemd-modules-load.service; static)
Active: failed (Result: exit-code) since Tue 2024-06-11 11:10:25 CEST; 2h 3min ago
Docs: man:systemd-modules-load.service(8)
man:modules-load.d(5)
Process: 431 ExecStart=/lib/systemd/systemd-modules-load (code=exited, status=1/FAILURE)
Main PID: 431 (code=exited, status=1/FAILURE)
CPU: 36ms
Jun 11 11:10:25 local-host systemd-modules-load[435]: modprobe: ERROR: could not insert 'nvidia': Invalid argument
Jun 11 11:10:25 local-host systemd-modules-load[438]: modprobe: FATAL: Module nvidia-current-modeset not found in directory /lib/modules/5.10.0-29-amd64
Jun 11 11:10:25 local-host systemd-modules-load[433]: modprobe: ERROR: ../libkmod/libkmod-module.c:990 command_do() Error running install command 'modprobe nvidia ; modprobe -i nvidia-current-modeset ' for module nvidia_modeset: retcode 1
Jun 11 11:10:25 local-host systemd-modules-load[433]: modprobe: ERROR: could not insert 'nvidia_modeset': Invalid argument
Jun 11 11:10:25 local-host systemd-modules-load[439]: modprobe: FATAL: Module nvidia-current-drm not found in directory /lib/modules/5.10.0-29-amd64
Jun 11 11:10:25 local-host systemd-modules-load[431]: Error running install command 'modprobe nvidia-modeset ; modprobe -i nvidia-current-drm ' for module nvidia_drm: retcode 1
Jun 11 11:10:25 local-host systemd-modules-load[431]: Failed to insert module 'nvidia_drm': Invalid argument
Jun 11 11:10:25 local-host systemd[1]: systemd-modules-load.service: Main process exited, code=exited, status=1/FAILURE
Jun 11 11:10:25 local-host systemd[1]: systemd-modules-load.service: Failed with result 'exit-code'.
Jun 11 11:10:25 local-host systemd[1]: Failed to start Load Kernel Modules.
The output of # systemctl status nvidia-persistenced.service:
● nvidia-persistenced.service - NVIDIA Persistence Daemon
Loaded: loaded (/lib/systemd/system/nvidia-persistenced.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Tue 2024-06-11 13:17:12 CEST; 2min 28s ago
Process: 449 ExecStart=/usr/bin/nvidia-persistenced --user nvpd (code=exited, status=1/FAILURE)
Process: 472 ExecStopPost=/bin/rm -rf /var/run/nvidia-persistenced (code=exited, status=0/SUCCESS)
CPU: 34ms
Jun 11 13:17:11 local-host systemd[1]: Starting NVIDIA Persistence Daemon...
Jun 11 13:17:12 local-host nvidia-persistenced[457]: Started (457)
Jun 11 13:17:12 local-host nvidia-persistenced[457]: Failed to query NVIDIA devices. Please ensure that the NVIDIA device files (/dev/nvidia*) exist, and that user 114 has read and write permissions for those files.
Jun 11 13:17:12 local-host nvidia-persistenced[457]: Shutdown (457)
Jun 11 13:17:12 local-host nvidia-persistenced[449]: nvidia-persistenced failed to initialize. Check syslog for more details.
Jun 11 13:17:12 local-host systemd[1]: nvidia-persistenced.service: Control process exited, code=exited, status=1/FAILURE
Jun 11 13:17:12 local-host systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Jun 11 13:17:12 local-host systemd[1]: Failed to start NVIDIA Persistence Daemon.
Also, there was an error shown right during system upgrade:
...
Building module:
cleaning build area...
env NV_VERBOSE=1 make -j6 modules KERNEL_UNAME=5.10.0-29-amd64.....................(bad exit status: 2)
Error! Bad return status for module build on kernel: 5.10.0-29-amd64 (x86_64)
Consult /var/lib/dkms/nvidia-current/470.223.02/build/make.log for more information.
...
Some excerpts from /var/lib/dkms/nvidia-current/470.223.02/build/make.log that are somewhat meaningful I think:
...
test -e include/generated/autoconf.h -a -e include/config/auto.conf || ( \
echo >&2; \
echo >&2 " ERROR: Kernel configuration is invalid."; \
echo >&2 " include/generated/autoconf.h or include/config/auto.conf are missing.";\
echo >&2 " Run 'make oldconfig && make prepare' on kernel src to fix it."; \
echo >&2 ; \
/bin/false)
...
FATAL: modpost: GPL-incompatible module nvidia.ko uses GPL-only symbol 'rcu_read_unlock_strict'
make[3]: *** [/usr/src/linux-headers-5.10.0-30-common/scripts/Makefile.modpost:123: /var/lib/dkms/nvidia-current/470.223.02/build/Module.symvers] Error 1
make[2]: *** [/usr/src/linux-headers-5.10.0-30-common/Makefile:1783: modules] Error 2
make[2]: Leaving directory '/usr/src/linux-headers-5.10.0-30-amd64'
make[1]: *** [Makefile:192: __sub-make] Error 2
make[1]: Leaving directory '/usr/src/linux-headers-5.10.0-30-common'
make: *** [Makefile:80: modules] Error 2
The last time I upgraded before the problem, it was on 26th of April.
Can someone please help me with this? I can provide full output of these snippets if needed and other info.
Thank you very much!
Last edited: