Unix Technical Forum

mdadm -A (1.7.0) segfaults; md raid device not started on boot

This is a discussion on mdadm -A (1.7.0) segfaults; md raid device not started on boot within the Linux Operating System forums, part of the Unix Operating Systems category; --> (Mandrake 10.1, kernel 2.6.8.1-12mdk i686, mdadm 1.7.0) I have the same problem as this guy (sorry about the horrendous ...


Go Back   Unix Technical Forum > Unix Operating Systems > Linux Operating System

FAQ Members List Calendar Search Today's Posts Mark Forums Read
  #1 (permalink)  
Old 01-18-2008, 08:24 AM
Alistair Bayley
 
Posts: n/a
Default mdadm -A (1.7.0) segfaults; md raid device not started on boot

(Mandrake 10.1, kernel 2.6.8.1-12mdk i686, mdadm 1.7.0)

I have the same problem as this guy (sorry about the horrendous URL):
http://groups-beta.google.com/group/...7b21dc3a706bc2

i.e. I created a raid1 device on /dev/md0 which worked fine until
reboot. On reboot the device /dev/md0 isn't started, and then the
reiserfs filesystem on /dev/md0 fails to start, and the whole boot
process halts. I boot by commenting /dev/md0 in /etc/fstab, reboot, run
"mdadm -A /dev/md0 /dev/hdb1 /dev/hdd1" to start the md device,
uncomment the line in fstab, and finally mount /dev/md0.

On boot dmesg gives (trimmed to just the section with the md messages):

[..]
BIOS EDD facility v0.16 2004-Jun-25, 3 devices found
init init/main.c:726
md: Autodetecting RAID arrays.
md: autorun ...
md: considering hdd1 ...
md: adding hdd1 ...
md: adding hdb1 ...
md: created md0
md: bind<hdb1>
md: bind<hdd1>
md: running: <hdd1><hdb1>
md: personality 3 is not loaded!
md :do_md_run() returned -22
md: md0 stopped.
md: unbind<hdd1>
md: export_rdev(hdd1)
md: unbind<hdb1>
md: export_rdev(hdb1)
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
VFS: Mounted root (ext2 filesystem).
[..]


I found this message, which has the same boot symptoms:
http://groups-beta.google.com/group/...9a59e6130aec73

"modprobe md-personality-3" is silent - is that right? I want to figure
out what needs to be fixed/configured so that the /dev/md0 device gets
started in the boot sequence *before* the filesystem on it is mounted.
Is it just a case of adding something to /etc/modprobe.preload (what
though?) Can someone please point me to the right docs, as I'm not sure
what to look at next.


Thanks,
Alistair.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #2 (permalink)  
Old 01-18-2008, 08:25 AM
Walter Mautner
 
Posts: n/a
Default Re: mdadm -A (1.7.0) segfaults; md raid device not started on boot

Alistair Bayley wrote:

> (Mandrake 10.1, kernel 2.6.8.1-12mdk i686, mdadm 1.7.0)


Here 2.6.11mm4 on Mdk 10.2
> [..]
> BIOS EDD facility v0.16 2004-Jun-25, 3 devices found
> init init/main.c:726
> md: Autodetecting RAID arrays.
> md: autorun ...
> md: considering hdd1 ...
> md: adding hdd1 ...
> md: adding hdb1 ...
> md: created md0
> md: bind<hdb1>
> md: bind<hdd1>
> md: running: <hdd1><hdb1>
> md: personality 3 is not loaded!
> md :do_md_run() returned -22
> md: md0 stopped.
> md: unbind<hdd1>
> md: export_rdev(hdd1)
> md: unbind<hdb1>
> md: export_rdev(hdb1)
> md: ... autorun DONE.
> RAMDISK: Compressed image found at block 0
> VFS: Mounted root (ext2 filesystem).
> [..]
>
>
> I found this message, which has the same boot symptoms:
>

http://groups-beta.google.com/group/...9a59e6130aec73
>
> "modprobe md-personality-3" is silent - is that right? I want to figure
> out what needs to be fixed/configured so that the /dev/md0 device gets
> started in the boot sequence *before* the filesystem on it is mounted.
> Is it just a case of adding something to /etc/modprobe.preload (what
> though?) Can someone please point me to the right docs, as I'm not sure
> what to look at next.
>

You didn't post the lines just after the "Mounted root ...".

Here mine:

md: hdb1 has different UUID to hdd6
md: created md0
md: bind<hdb6>
md: bind<hdd6>
md: running: <hdd6><hdb6>
md: personality 3 is not loaded!
md: do_md_run() returned -22
md: md: hdb1 has different UUID to hdd6
md: created md0
md: bind<hdb6>
md: bind<hdd6>
md: running: <hdd6><hdb6>
md: personality 3 is not loaded!
md: do_md_run() returned -22
......
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
VFS: Mounted root (ext2 filesystem).
md: raid1 personality registered as nr 3
md: md0 stopped.
md: bind<hdb6>
md: bind<hdd6>
raid1: raid set md0 active with 2 out of 2 mirrors
EXT3-fs: mounted filesystem with ordered data mode.

I think you need the raid1 personality in your initrd (RAMDISK), or compiled
into your kernel (though mkinitrd will then barf about a missing raid
module and refuse to work, unless you tell it to do nevertheless).

--
Longhorn error#4711: TCPA / NGSCP VIOLATION: Microsoft optical mouse
detected penguin patterns on mousepad. Partition scan in progress
*to*remove*offending*incompatible*products.**React ivate*MS*software.
Linux woodpecker.homnet.at 2.6.11-mm4[LinuxCounter#295241,ICQ#4918962]
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 01-18-2008, 08:25 AM
Alistair Bayley
 
Posts: n/a
Default Re: mdadm -A (1.7.0) segfaults; md raid device not started on boot

Walter Mautner wrote:
>
> You didn't post the lines just after the "Mounted root ...".
>
> Here mine:
>
> [..]
>
> I think you need the raid1 personality in your initrd (RAMDISK), or compiled
> into your kernel (though mkinitrd will then barf about a missing raid
> module and refuse to work, unless you tell it to do nevertheless).



I didn't post after "Mounted root" because there's no further mention of
md. The entire dmesg output is below, for completeness.

How does the kernel know to load the raid module when I invoke mdadm,
but isn't able to do this in the boot process? I'd like to be able to
get this working *without* having to recompile the kernel, if it's
possible. The root filesystem is ext3 on a normal hd device, and it's
already mounted by the time we want to start up the raid device, so in
my mind we should be able to load the raid1 module at this stage.

Would it be easier to add raid1 to initrd, or to just recompile the kernel?

Thanks,
Alistair.


[dmesg output, after boot has stopped when filesystem on md device can't
be mounted. This is *all* of the output.]

Linux version 2.6.8.1-12mdk (quintela@n5.mandrakesoft.com) (gcc version
3.4.1 (Mandrakelinux (Alpha 3.4.1-3mdk)) #1 Fri Oct 1 12:53:41 CEST 2004
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 000000000009f800 (usable)
BIOS-e820: 000000000009f800 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000000c000000 (usable)
BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved)
0MB HIGHMEM available.
192MB LOWMEM available.
On node 0 totalpages: 49152
DMA zone: 4096 pages, LIFO batch:1
Normal zone: 45056 pages, LIFO batch:11
HighMem zone: 0 pages, LIFO batch:1
DMI 2.0 present.
ACPI disabled because your bios is from 98 and too old
You can enable it with acpi=force
Built 1 zonelists
Local APIC disabled by BIOS -- reenabling.
Found and enabled local APIC!
Initializing CPU#0
Kernel command line: BOOT_IMAGE=linux ro root=301 acpi=ht resume=/dev/hda5
PID hash table entries: 1024 (order 10: 8192 bytes)
Detected 233.365 MHz processor.
Using tsc for high-res timesource
Console: colour VGA+ 80x25
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Memory: 190952k/196608k available (1859k kernel code, 5000k reserved,
578k data, 200k init, 0k highmem, 0k BadRAM)
Checking if this processor honours the WP bit even in supervisor mode... Ok.
Calibrating delay loop... 458.75 BogoMIPS
Security Scaffold v1.0.0 initialized
SELinux: Disabled at boot.
Capability LSM initialized
Mount-cache hash table entries: 512 (order: 0, 4096 bytes)
CPU: After generic identify, caps: 0080fbff 00000000 00000000 00000000
CPU: After vendor identify, caps: 0080fbff 00000000 00000000 00000000
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 512K
CPU: After all inits, caps: 0080fbff 00000000 00000000 00000040
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: Intel Pentium II (Klamath) stepping 04
Checking 'hlt' instruction... OK.
init init/main.c:689
init init/main.c:702
init init/main.c:707
do_pre_smp_initcalls init/main.c:653
do_pre_smp_initcalls init/main.c:659
init init/main.c:711
init init/main.c:714
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Using local APIC timer interrupts.
calibrating APIC timer ...
...... CPU clock speed is 233.0314 MHz.
...... host bus clock speed is 66.0660 MHz.
init init/main.c:716
init init/main.c:718
checking if image is initramfs...it isn't (no cpio magic); looks like an
initrd
ACPI: Looking for DSDT in initrd ... not found!
Freeing initrd memory: 184k freed
init init/main.c:724
do_basic_setup init/main.c:634
do_basic_setup init/main.c:636
NET: Registered protocol family 16
PCI: PCI BIOS revision 2.10 entry at 0xfd9cc, last bus=1
PCI: Using configuration type 1
mtrr: v2.0 (20020519)
ACPI: Subsystem revision 20040326
ACPI: Interpreter disabled.
Linux Plug and Play Support v0.97 (c) Adam Belay
PnPBIOS: Disabled
PCI: Probing PCI hardware
PCI: Probing PCI hardware (bus 00)
PCI: Using IRQ router PIIX/ICH [8086/7110] at 0000:00:07.0
vesafb: probe of vesafb0 failed with error -6
apm: BIOS version 1.2 Flags 0x03 (Driver version 1.16ac)
audit: initializing netlink socket (disabled)
audit(1113597246.4294966468:0): initialized
VFS: Disk quotas dquot_6.5.1
Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
devfs: 2004-01-31 Richard Gooch (rgooch@atnf.csiro.au)
devfs: boot_options: 0x0
Initializing Cryptographic API
Limiting direct PCI/PCI transfers.
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Serial: 8250/16550 driver $Revision: 1.90 $ 8 ports, IRQ sharing enabled
ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
RAMDISK driver initialized: 16 RAM disks of 32000K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller at PCI slot 0000:00:07.1
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xfc90-0xfc97, BIOS settings: hdaMA, hdbMA
ide1: BM-DMA at 0xfc98-0xfc9f, BIOS settings: hdcMA, hddMA
Probing IDE interface ide0...
hda: FUJITSU MPD3173AT, ATA DISK drive
hdb: ST3160023A, ATA DISK drive
Using anticipatory io scheduler
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Probing IDE interface ide1...
hdc: TOSHIBA CD-ROM XM-6202B, ATAPI CD/DVD-ROM drive
hdd: ST3160023A, ATA DISK drive
ide1 at 0x170-0x177,0x376 on irq 15
hda: max request size: 128KiB
hda: 33793232 sectors (17302 MB) w/512KiB Cache, CHS=33525/16/63, UDMA(33)
/dev/ide/host0/bus0/target0/lun0: p1 p2 < p5 p6 >
hdb: max request size: 1024KiB
hdb: 312581808 sectors (160041 MB) w/8192KiB Cache, CHS=19457/255/63,
UDMA(33)
/dev/ide/host0/bus0/target1/lun0: p1
hdd: max request size: 1024KiB
hdd: 312581808 sectors (160041 MB) w/8192KiB Cache, CHS=19457/255/63,
UDMA(33)
/dev/ide/host0/bus1/target1/lun0: p1
mice: PS/2 mouse device common for all mice
serio: i8042 AUX port at 0x60,0x64 irq 12
input: PS/2 Generic Mouse on isa0060/serio1
serio: i8042 KBD port at 0x60,0x64 irq 1
input: AT Translated Set 2 keyboard on isa0060/serio0
md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
NET: Registered protocol family 2
IP: routing cache hash table of 2048 buckets, 16Kbytes
TCP: Hash tables configured (established 16384 bind 32768)
NET: Registered protocol family 1
BIOS EDD facility v0.16 2004-Jun-25, 3 devices found
init init/main.c:726
md: Autodetecting RAID arrays.
md: autorun ...
md: considering hdd1 ...
md: adding hdd1 ...
md: adding hdb1 ...
md: created md0
md: bind<hdb1>
md: bind<hdd1>
md: running: <hdd1><hdb1>
md: personality 3 is not loaded!
md :do_md_run() returned -22
md: md0 stopped.
md: unbind<hdd1>
md: export_rdev(hdd1)
md: unbind<hdb1>
md: export_rdev(hdb1)
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
VFS: Mounted root (ext2 filesystem).
kjournald starting. Commit interval 5 seconds
EXT3-fs: mounted filesystem with ordered data mode.
Freeing unused kernel memory: 200k freed
usbcore: registered new driver usbfs
usbcore: registered new driver hub
USB Universal Host Controller Interface driver v2.2
PCI: Found IRQ 5 for device 0000:00:07.2
uhci_hcd 0000:00:07.2: UHCI Host Controller
uhci_hcd 0000:00:07.2: irq 5, io base 0000fca0
uhci_hcd 0000:00:07.2: new USB bus registered, assigned bus number 1
hub 1-0:1.0: USB hub found
hub 1-0:1.0: 2 ports detected
EXT3 FS on hda1, internal journal
Adding 1124508k swap on /dev/hda5. Priority:-1 extents:1
Linux agpgart interface v0.100 (c) Dave Jones
agpgart: Detected an Intel 440LX Chipset.
agpgart: Maximum main memory to use for agp memory: 150M
agpgart: AGP aperture is 64M @ 0xf8000000
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply


Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT. The time now is 09:56 AM.


Powered by vBulletin® Version 3.6.5
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
SEO by vBSEO 3.2.0
www.UnixAdminTalk.com