SCSI Not Settling???

Home » CentOS » SCSI Not Settling???
CentOS 5 Comments

I’m getting a lot of this at boottime:

udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan (11707)

udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan (11709)

udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan (11711)

udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan

It takes a few minutes to boot.

The machine is a Dell PRECISION T5600 with a PERC H310 RAID controller.


5 thoughts on - SCSI Not Settling???

  • Gé Weijers wrote:

    Hmmm, a) what’s *in* this workstation? b) was this a fresh install?

    mark

  • Thanks for answering.

    a) The workstation has:
    – CPU E5-2687W
    – 32 GB RAM (ECC)
    – AMD FirePro V7900, original graphics card. (This happens with both the AMD driver installed and not installed.)
    – MegaRAID SAS 2008 controller configured for RAID1

    I had a quad PCIe serial card in it as well, but I have removed it. No change.

    b) It’s was not a fresh install, but it’s been doing this only recently. It started out with CentOS 6.3 or so in november 2012.

    I saw some kernel oopses as well, I’m wondering whether the RAID controller is going bad.

    May 16 20:01:25 localhost kernel: ————[ cut here ]————
    May 16 20:01:25 localhost kernel: WARNING: at kernel/rcutree.c:332
    rcu_irq_enter+0x55/0x70() (Tainted: P W ————— )
    May 16 20:01:25 localhost kernel: Hardware name: Precision T5600
    May 16 20:01:25 localhost kernel: Modules linked in: fuse ebtable_nat ebtables ipt_MASQUERADE iptable_nat nf_nat xt_CHECKSUM iptable_mangle bridge tpm_infineon nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4
    8021q garp stp llc cpufreq_ondemand acpi_cpufreq freq_table mperf ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables nf_conntrack_ftp nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_REJECT
    ip6t_ipv6header nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 vhost_net macvtap macvlan tun kvm_intel kvm uinput iTCO_wdt iTCO_vendor_support microcode dcdbas fglrx(P)(U) sg ftdi_sio usbserial i2c_i801 i2c_core lpc_ich mfd_core snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc shpchp e1000e ptp pps_core ext4 jbd2 mbcache sr_mod cdrom sd_mod crc_t10dif xhci_hcd ahci megaraid_sas dm_mirror dm_region_hash dm_log dm_mod [last unloaded:
    scsi_wait_scan]
    May 16 20:01:25 localhost kernel: Pid: 0, comm: swapper Tainted: P W
    ————— 2.6.32-431.11.2.el6.x86_64 #1
    May 16 20:01:25 localhost kernel: Call Trace:
    May 16 20:01:25 localhost kernel: [] ?
    warn_slowpath_common+0x87/0xc0
    May 16 20:01:25 localhost kernel: [] ?
    enqueue_entity+0x125/0x450
    May 16 20:01:25 localhost kernel: [] ?
    warn_slowpath_null+0x1a/0x20
    May 16 20:01:25 localhost kernel: [] ?
    rcu_irq_enter+0x55/0x70
    May 16 20:01:25 localhost kernel: [] ? irq_enter+0x1b/0x80
    May 16 20:01:25 localhost kernel: [] ? do_IRQ+0x43/0xf0
    May 16 20:01:25 localhost kernel: [] ?
    ret_from_intr+0x0/0x11
    May 16 20:01:25 localhost kernel: [] ?
    kmem_cache_free+0xbf/0x2b0
    May 16 20:01:25 localhost kernel: [] ?
    scsi_sg_free+0x0/0x60
    May 16 20:01:25 localhost kernel: [] ?
    mempool_free_slab+0x17/0x20
    May 16 20:01:25 localhost kernel: [] ?
    mempool_free+0x95/0xa0
    May 16 20:01:25 localhost kernel: [] ?
    scsi_sg_free+0x4c/0x60
    May 16 20:01:25 localhost kernel: [] ?
    __sg_free_table+0x60/0x80
    May 16 20:01:25 localhost kernel: [] ?
    __scsi_release_buffers+0x104/0x110
    May 16 20:01:25 localhost kernel: [] ?
    scsi_io_completion+0x2a6/0x6c0
    May 16 20:01:25 localhost kernel: [] ?
    scsi_finish_command+0xc2/0x130
    May 16 20:01:25 localhost kernel: [] ?
    scsi_softirq_done+0x145/0x170
    May 16 20:01:25 localhost kernel: [] ?
    blk_done_softirq+0x85/0xa0
    May 16 20:01:25 localhost kernel: [] ?
    __do_softirq+0xc1/0x1e0
    May 16 20:01:25 localhost kernel: [] ?
    call_softirq+0x1c/0x30
    May 16 20:01:25 localhost kernel: [
    ] ?
    call_softirq+0x1c/0x30
    May 16 20:01:25 localhost kernel: [] ?
    do_softirq+0x65/0xa0
    May 16 20:01:25 localhost kernel: [] ? irq_exit+0x85/0x90
    May 16 20:01:25 localhost kernel: [] ? do_IRQ+0x75/0xf0
    May 16 20:01:25 localhost kernel: [] ?
    ret_from_intr+0x0/0x11
    May 16 20:01:25 localhost kernel: [] ?
    intel_idle+0xde/0x170
    May 16 20:01:25 localhost kernel: [] ?
    intel_idle+0xc1/0x170
    May 16 20:01:25 localhost kernel: [] ?
    cpuidle_idle_call+0xa7/0x140
    May 16 20:01:25 localhost kernel: [] ? cpu_idle+0xb6/0x110
    May 16 20:01:25 localhost kernel: [] ? rest_init+0x7a/0x80
    May 16 20:01:25 localhost kernel: [] ?
    start_kernel+0x424/0x430
    May 16 20:01:25 localhost kernel: [] ?
    x86_64_start_reservations+0x125/0x129
    May 16 20:01:25 localhost kernel: [] ?
    x86_64_start_kernel+0x115/0x124
    May 16 20:01:25 localhost kernel: —[ end trace a0e1ee14d68f0684 ]—


  • BTW: I just saw those ‘oopses’, or I would have mentioned them right away. I’m seriously suspecting the RAID controller.


  • Gé Weijers wrote:

    Hard to tell from this. Anything significant in dmesg | tail? I’ve seen that damn swapper/tainted, but there’s some tuning that can be done, that might be hugepage issues.

    mark