SCSI Not Settling???
I’m getting a lot of this at boottime:
udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan (11707)
udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan (11709)
udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan (11711)
udev still not settled. Waiting. udevadm settle – timeout of 0 seconds reached, the event queue contains:
/sys/module/scsi_wait_scan
It takes a few minutes to boot.
The machine is a Dell PRECISION T5600 with a PERC H310 RAID controller.
—
Gé
5 thoughts on - SCSI Not Settling???
Gé Weijers wrote:
Hmmm, a) what’s *in* this workstation? b) was this a fresh install?
mark
Thanks for answering.
a) The workstation has:
– CPU E5-2687W
– 32 GB RAM (ECC)
– AMD FirePro V7900, original graphics card. (This happens with both the AMD driver installed and not installed.)
– MegaRAID SAS 2008 controller configured for RAID1
I had a quad PCIe serial card in it as well, but I have removed it. No change.
b) It’s was not a fresh install, but it’s been doing this only recently. It started out with CentOS 6.3 or so in november 2012.
I saw some kernel oopses as well, I’m wondering whether the RAID controller is going bad.
Gé
May 16 20:01:25 localhost kernel: ————[ cut here ]———— [] ?] ?] ?] ?] ? irq_enter+0x1b/0x80] ? do_IRQ+0x43/0xf0] ?] ?] ?] ?] ?] ?] ?] ?] ?] ?] ?] ?] ?] ? ] ? [] ?] ? irq_exit+0x85/0x90] ? do_IRQ+0x75/0xf0] ?] ?] ?] ?] ? cpu_idle+0xb6/0x110] ? rest_init+0x7a/0x80] ?] ?] ?
May 16 20:01:25 localhost kernel: WARNING: at kernel/rcutree.c:332
rcu_irq_enter+0x55/0x70() (Tainted: P W ————— )
May 16 20:01:25 localhost kernel: Hardware name: Precision T5600
May 16 20:01:25 localhost kernel: Modules linked in: fuse ebtable_nat ebtables ipt_MASQUERADE iptable_nat nf_nat xt_CHECKSUM iptable_mangle bridge tpm_infineon nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4
8021q garp stp llc cpufreq_ondemand acpi_cpufreq freq_table mperf ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables nf_conntrack_ftp nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_REJECT
ip6t_ipv6header nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 vhost_net macvtap macvlan tun kvm_intel kvm uinput iTCO_wdt iTCO_vendor_support microcode dcdbas fglrx(P)(U) sg ftdi_sio usbserial i2c_i801 i2c_core lpc_ich mfd_core snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc shpchp e1000e ptp pps_core ext4 jbd2 mbcache sr_mod cdrom sd_mod crc_t10dif xhci_hcd ahci megaraid_sas dm_mirror dm_region_hash dm_log dm_mod [last unloaded:
scsi_wait_scan]
May 16 20:01:25 localhost kernel: Pid: 0, comm: swapper Tainted: P W
————— 2.6.32-431.11.2.el6.x86_64 #1
May 16 20:01:25 localhost kernel: Call Trace:
May 16 20:01:25 localhost kernel:
warn_slowpath_common+0x87/0xc0
May 16 20:01:25 localhost kernel: [
enqueue_entity+0x125/0x450
May 16 20:01:25 localhost kernel: [
warn_slowpath_null+0x1a/0x20
May 16 20:01:25 localhost kernel: [
rcu_irq_enter+0x55/0x70
May 16 20:01:25 localhost kernel: [
May 16 20:01:25 localhost kernel: [
May 16 20:01:25 localhost kernel: [
ret_from_intr+0x0/0x11
May 16 20:01:25 localhost kernel: [
kmem_cache_free+0xbf/0x2b0
May 16 20:01:25 localhost kernel: [
scsi_sg_free+0x0/0x60
May 16 20:01:25 localhost kernel: [
mempool_free_slab+0x17/0x20
May 16 20:01:25 localhost kernel: [
mempool_free+0x95/0xa0
May 16 20:01:25 localhost kernel: [
scsi_sg_free+0x4c/0x60
May 16 20:01:25 localhost kernel: [
__sg_free_table+0x60/0x80
May 16 20:01:25 localhost kernel: [
__scsi_release_buffers+0x104/0x110
May 16 20:01:25 localhost kernel: [
scsi_io_completion+0x2a6/0x6c0
May 16 20:01:25 localhost kernel: [
scsi_finish_command+0xc2/0x130
May 16 20:01:25 localhost kernel: [
scsi_softirq_done+0x145/0x170
May 16 20:01:25 localhost kernel: [
blk_done_softirq+0x85/0xa0
May 16 20:01:25 localhost kernel: [
__do_softirq+0xc1/0x1e0
May 16 20:01:25 localhost kernel: [
call_softirq+0x1c/0x30
May 16 20:01:25 localhost kernel: [
call_softirq+0x1c/0x30
May 16 20:01:25 localhost kernel:
do_softirq+0x65/0xa0
May 16 20:01:25 localhost kernel: [
May 16 20:01:25 localhost kernel: [
May 16 20:01:25 localhost kernel: [
ret_from_intr+0x0/0x11
May 16 20:01:25 localhost kernel: [
intel_idle+0xde/0x170
May 16 20:01:25 localhost kernel: [
intel_idle+0xc1/0x170
May 16 20:01:25 localhost kernel: [
cpuidle_idle_call+0xa7/0x140
May 16 20:01:25 localhost kernel: [
May 16 20:01:25 localhost kernel: [
May 16 20:01:25 localhost kernel: [
start_kernel+0x424/0x430
May 16 20:01:25 localhost kernel: [
x86_64_start_reservations+0x125/0x129
May 16 20:01:25 localhost kernel: [
x86_64_start_kernel+0x115/0x124
May 16 20:01:25 localhost kernel: —[ end trace a0e1ee14d68f0684 ]—
—
Gé
BTW: I just saw those ‘oopses’, or I would have mentioned them right away. I’m seriously suspecting the RAID controller.
Gé
—
Gé
Gé Weijers wrote:
Hard to tell from this. Anything significant in dmesg | tail? I’ve seen that damn swapper/tainted, but there’s some tuning that can be done, that might be hugepage issues.
mark
I’m running a memory check right now, just to rule things out.
Gé
—
Gé