Update To CentOS 7.7 / Arch Ppc64le / Problem With Nvidia Driver

Home » CentOS » Update To CentOS 7.7 / Arch Ppc64le / Problem With Nvidia Driver
CentOS 3 Comments

Hello,

today I updated a CentOS 7.6 ppc64le machine to CentOS 7.7. After reboot to the new kernel (4.18.0-80.7.2.el7.ppc64le) dkms could not build the nvidia-module.

Error-message from dkms:

Compiler version check failed:

The major and minor number of the compiler used to compile the kernel:

gcc version 8.3.1 20190311 (Red Hat 8.3.1-3) (GCC)

does not match the compiler used here:

cc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-39)

Output of /proc/version with new kernel running is:
Linux version 4.18.0-80.7.2.el7.ppc64le (mockbuild@ppc64le-01.bsys.CentOS.org)
(gcc version 8.3.1 20190311 (Red Hat 8.3.1-3) (GCC)) #1 SMP Thu Sep 12 15:45:05
UTC 2019

Problem seams to be:

The kernel was compiled with gcc-version 8.3.1 and installed is gcc 4.8.5. All previous kernels were compiled with gcc 4.8.5. See:

#cat /usr/src/kernels/*/include/generated/compile.h |grep LINUX_COMPILER
define LINUX_COMPILER “gcc version 4.8.5 20150623 (Red Hat 4.8.5-36) (GCC)”
define LINUX_COMPILER “gcc version 4.8.5 20150623 (Red Hat 4.8.5-36) (GCC)”
define LINUX_COMPILER “gcc version 4.8.5 20150623 (Red Hat 4.8.5-36) (GCC)”
define LINUX_COMPILER “gcc version 4.8.5 20150623 (Red Hat 4.8.5-36) (GCC)”
define LINUX_COMPILER “gcc version 4.8.5 20150623 (Red Hat 4.8.5-36) (GCC)”
define LINUX_COMPILER “gcc version 8.3.1 20190311 (Red Hat 8.3.1-3) (GCC)”
define LINUX_COMPILER “gcc version 8.3.1 20190311 (Red Hat 8.3.1-3) (GCC)”

Any comments?

Best regards, Ralf

3 thoughts on - Update To CentOS 7.7 / Arch Ppc64le / Problem With Nvidia Driver

  • Hi

    I had similar erors when installing the new kernel and then the new kernel did not boot to the graphics. Even the ctrl-alt-F2 to get a terminal did not work.

    I had to reboot with adding a 3 to the end of the grub linux line, to get a terminal (without graphics). Then I ran the latest nvidia driver installer downloaded from the nvidia site.

    Then a normal reboot and it worked.

    cheers mathy

  • Well, if you use that kernel, that means you’re on Power9 variant, and that architecture doesn’t exist anymore upstream (so no RHEL 7.7 for Power9). As almost all packages are just ppc64le (which still exist upstream), the decision was to still provide 7.7.1908 for Power9 users, but using the kernel from CentOS 8, rebuilt for CentOS 7. (same is also true for aarch64)

    For that kernel to be built, we had to use newer gcc, that you can find/use through devtoolset-8 :
    http://mirror.CentOS.org/altarch/7/sclo/ppc64le/rh/devtoolset-8/

    Curious : which kind of machine do you have that has both a Power9 and nvidia ? that seems to *not* be an IBM node, but a kind of openpower workstation ?

    PS : worth knowing that ppc64le arch itself (supporting up to Power8) is still using the standard .el7 rebuilt kernel.

    PS2 : worth creating a bug report on https://bugs.CentOS.org for easier tracking and also indexing, so that other people in your situation would follow the bug report (index by crawlers) and eventually discussion can happen there.

  • Hello Fabian,


    …>> Any comments?

    thanks for Your quick response.

    Ok. So I try to install devtoolset-8 and build the nvidia-driver with that gcc.

    It’a a IBM Power System AC922 (8335-GTH) with Nvidia Tesla V100 graphic cards. Supercomputer “Summit” uses this nodes (https://www.olcf.ornl.gov/summit/).

    I will do that.

    Best regards, Ralf