LinuxQuestions.org (/questions/)
-   Linux - Server (https://www.linuxquestions.org/questions/linux-server-73/)
-   -   Logical volumes not available on reboot (https://www.linuxquestions.org/questions/linux-server-73/logical-volumes-not-available-on-reboot-724093/)

Vanyel 05-06-2009 04:58 AM

Logical volumes not available on reboot
 
I'm working with an external LaCie 4BIG RAID array, connected via eSATA to a RHEL 4 server, that holds one large VG with five 100 GB LogVols inside it (so far).

They're all formatted and mounted perfectly, and I've copied my data onto them. The problem is that after a reboot, none of my logical volumes remains active: 'lvdisplay' shows their status as "NOT available". I can manually issue an "lvchange -a y /dev/<volgroup>" and they're back, but I need them to come up automatically with the server.
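Concretely, the recovery I have to do after every reboot is roughly this (with <volgroup> standing in for my real VG name):

Code:

# reactivate all the logical volumes in the group
lvchange -a y /dev/<volgroup>
# then mount whatever is listed for them in /etc/fstab
mount -a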

Shouldn't Logical Vols be persistently active by default?

How can I make them so?

Is it possible to place commands (I'm thinking of the "lvchange -a y /dev/<volgroup>") into /etc/fstab so they'd be executed first and the volumes mounted afterwards? That would still be a workaround, but I'd take it.

jschiwal 05-06-2009 05:55 AM

My guess is that the drives power down and take too long to spin up at boot. By the time you tried manually, they were spun up, so you didn't have a problem. Look at the kernel boot messages for clues.

Vanyel 05-06-2009 11:47 AM

Not sure that's right ...
 
Hmmm ... I don't think that's quite right. If that were the problem, wouldn't the symptom be the reverse: drives not available for mounting from fstab during boot, yet by the time I reached a login prompt and checked by hand, I'd "mysteriously" find solid, happy, "LV Status: available" logical volumes waiting for me? What I'm actually finding is that the drive/volume group is up, but the logical volumes inside the VG are inactive.

To more directly address your comment, I did look in /var/log/dmesg and I think these are the only relevant parts -
Code:

sata_sil 0000:00:08.0: version 2.0
ACPI: PCI Interrupt 0000:00:08.0[A] -> GSI 29 (level, low) -> IRQ 185
ata1: SATA max UDMA/100 cmd 0xF8830080 ctl 0xF883008A bmdma 0xF8830000 irq 185
ata2: SATA max UDMA/100 cmd 0xF88300C0 ctl 0xF88300CA bmdma 0xF8830008 irq 185
scsi2 : sata_sil
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
ata1.00: ATA-6, max UDMA/133, 2930019504 sectors: LBA48
ata1.00: ata1: dev 0 multi count 1
ata1.00: configured for UDMA/100
scsi3 : sata_sil
ata2: SATA link down (SStatus 0 SControl 310)
  Vendor: ATA      Model: LaCie  4big Qua  Rev: 0 
  Type:  Direct-Access                      ANSI SCSI revision: 05
SCSI device sdg: 2930019504 512-byte hdwr sectors (1500170 MB)
SCSI device sdg: drive cache: write back
SCSI device sdg: 2930019504 512-byte hdwr sectors (1500170 MB)
SCSI device sdg: drive cache: write back
 sdg: sdg1
Attached scsi disk sdg at scsi2, channel 0, id 0, lun 0
Attached scsi generic sg13 at scsi2, channel 0, id 0, lun 0,  type 0

The SATA link on ata1 comes up fine, which matches what I see on screen during POST, while the second port (ata2) reports its link down. Right after that, though, the array (sdg) and its one partition (sdg1) are seen as attached, so I'd think the array must be spun up by that point.

LASTLY, an idea: if I activate the logical volumes by putting the "lvchange -a y <volgroup>" command into the boot process in the /etc/rc.d/rc.sysinit script, right before the part where it mounts the other local (i.e. non-/) filesystems, would that work? I'm not accustomed to playing around in this area, but I can see that logical volume management is already set up earlier in that script, so this *should* work. This is a production server, so I can't take it down on a whim to experiment.
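To be clear, the change I'm picturing has roughly this shape (this is only a sketch of where the line would go, not the literal rc.sysinit text, and <volgroup> is a placeholder):

Code:

# ... earlier in /etc/rc.d/rc.sysinit, LVM has already been set up
# and the root volume group activated ...

# proposed addition: explicitly activate the external array's volume group
/sbin/vgchange -a y <volgroup>

# ... immediately below would be the existing step that mounts the
# remaining local (non-/) filesystems from /etc/fstab ...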

Still a workaround though, if so. The volumes should be persistently active!!! Grrr!

Vanyel 05-06-2009 10:54 PM

Ugh ugh ugh! Spin-up/spin-down does not seem to be the problem. The array has a setting to power up and down along with the server, and I did have that enabled. I turned it off and rebooted, so there was no array power cycling. Same problem, no change.

Later I tried my idea of putting the "lvchange -a y <volgroup>" command into the boot process in the /etc/rc.d/rc.sysinit script, right before the part where it mounts the other local (i.e. non-/) filesystems. Logical volume management does seem to be established by the OS before that section.

Well, this not only did NOT work on reboot (logical volume not found), but now both the VG and the LVs on the array seem to be GONE!! The partition is still there, and I think only the LVM metadata is somehow hosed. I've taken the array off the production server and will experiment with it hooked up to my desktop. Since the data on it is not in production yet, I'm not asking for help with recovery; I'll give it a quick whirl for practice, but the data isn't important yet and I can reformat the whole thing cavalierly.

But I expect to be back where I started from once I get this back up on my other machine.

Anybody have an answer for why active Logical Volumes won't maintain their active status across reboots?

Vanyel 05-07-2009 01:58 PM

Hooked up to my desktop now. Whew. I only needed to do a vgscan and everything was there. Same problem, though: after a reboot, the previously active LogVols are inactive.

Might the problem be with the Volume Group and not the Logical Volumes?

Quote:

[van@mournblade ~]$ sudo vgdisplay
Password:
--- Volume group ---
VG Name mailserver_lacie
System ID
Format lvm2
Metadata Areas 1
Metadata Sequence No 6
VG Access read/write
VG Status resizable
MAX LV 0
Cur LV 5
Open LV 0
Max PV 0
Cur PV 1
Act PV 1
VG Size 1.36 TB
PE Size 4.00 MB
Total PE 357667
Alloc PE / Size 128000 / 500.00 GB
Free PE / Size 229667 / 897.14 GB
VG UUID BOBUfI-fJos-QBzY-GuA2-PsKj-TPJY-grN3AU

==========================================================================================
[van@mournblade ~]$ sudo lvdisplay
--- Logical volume ---
LV Name /dev/mailserver_lacie/usr1
VG Name mailserver_lacie
LV UUID LOwVH9-3h6v-NGFo-pdPZ-C3K9-tGgQ-k2Q4Tw
LV Write Access read/write
LV Status NOT available
LV Size 100.00 GB
Current LE 25600
Segments 1
Allocation inherit
Read ahead sectors 0

--- Logical volume ---
LV Name /dev/mailserver_lacie/usr2
VG Name mailserver_lacie
LV UUID eMqOYy-fgtk-ZTNe-4e3I-3GKU-zKql-v0Q33u
LV Write Access read/write
LV Status NOT available
LV Size 100.00 GB
Current LE 25600
Segments 1
Allocation inherit
Read ahead sectors 0

--- Logical volume ---
LV Name /dev/mailserver_lacie/usr3
VG Name mailserver_lacie
LV UUID qxMDLz-AJQV-dGJP-YNKy-JXto-0CTH-qc9L2m
LV Write Access read/write
LV Status NOT available
LV Size 100.00 GB
Current LE 25600
Segments 1
Allocation inherit
Read ahead sectors 0

--- Logical volume ---
LV Name /dev/mailserver_lacie/usr4
VG Name mailserver_lacie
LV UUID GN33bv-avzz-JU3z-QIST-7Wa2-nOlp-xVPtyf
LV Write Access read/write
LV Status NOT available
LV Size 100.00 GB
Current LE 25600
Segments 1
Allocation inherit
Read ahead sectors 0

--- Logical volume ---
LV Name /dev/mailserver_lacie/usr5
VG Name mailserver_lacie
LV UUID CsSizM-ZLQB-pReC-M0yz-54Xd-qwXv-0h2gBE
LV Write Access read/write
LV Status NOT available
LV Size 100.00 GB
Current LE 25600
Segments 1
Allocation inherit
Read ahead sectors 0

Vanyel 05-07-2009 07:11 PM

Problem solved!!!
 
After a LOT of reading and consulting a sysadmin friend ... I still have NO IDEA what the underlying problem is! But I've got a successful workaround, so I'm happy and can move on. For anyone else who ever has the same problem ...

As near as I can tell, even though the eSATA card and the connected array are physically recognized during POST, the VG/LVs on the array aren't recognized by the OS until surprisingly late in the boot process. Why so late? No clue.

So I went back to my idea of manually inserting the lvchange command into the boot process (I decided to use /sbin/vgchange -a y instead) and went about tracking down where to put it.

What worked was inserting this into /etc/inittab as the next-to-last entry, just before X11 starts in runlevel 5 (the fields are id:runlevels:action:process):

Code:

md:35:once:/sbin/vgchange -a y

SUCCESS!

Of course, on my production machine I'll have to chkconfig off sendmail and httpd and replace the vgchange command with a custom shell script that runs the activation and then starts both services by hand, since the array hosts the volumes holding their data and this inittab entry executes after those services would normally start. But after everything else I've been through to get this far, that's trivial.
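Roughly what I have in mind for that script, untested and with a made-up name/path, is:

Code:

#!/bin/bash
# /usr/local/sbin/lacie_up.sh -- hypothetical location
# Activate the array's volume group, mount its filesystems,
# then start the services whose data lives on them.
/sbin/vgchange -a y mailserver_lacie
mount -a
/sbin/service sendmail start
/sbin/service httpd start

The inittab entry would then point at that script instead of calling vgchange directly.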

- Van

jschiwal 05-08-2009 02:28 AM

I wonder if the init script inside the initrd ramdisk may be missing some of the LV-related commands.

This is from Fedora 10 on an hp laptop:
Code:

#!/bin/nash

mount -t proc /proc /proc
setquiet
echo Mounting proc filesystem
echo Mounting sysfs filesystem
mount -t sysfs /sys /sys
echo Creating /dev
mount -o mode=0755 -t tmpfs /dev /dev
mkdir /dev/pts
mount -t devpts -o gid=5,mode=620 /dev/pts /dev/pts
mkdir /dev/shm
mkdir /dev/mapper
echo Creating initial device nodes
mknod /dev/null c 1 3
mknod /dev/zero c 1 5
mknod /dev/systty c 4 0
mknod /dev/tty c 5 0
mknod /dev/console c 5 1
mknod /dev/ptmx c 5 2
mknod /dev/fb c 29 0
mknod /dev/tty0 c 4 0
mknod /dev/tty1 c 4 1
mknod /dev/tty2 c 4 2
mknod /dev/tty3 c 4 3
mknod /dev/tty4 c 4 4
mknod /dev/tty5 c 4 5
mknod /dev/tty6 c 4 6
mknod /dev/tty7 c 4 7
mknod /dev/tty8 c 4 8
mknod /dev/tty9 c 4 9
mknod /dev/tty10 c 4 10
mknod /dev/tty11 c 4 11
mknod /dev/tty12 c 4 12
mknod /dev/ttyS0 c 4 64
mknod /dev/ttyS1 c 4 65
mknod /dev/ttyS2 c 4 66
mknod /dev/ttyS3 c 4 67
/lib/udev/console_init tty0
daemonize --ignore-missing /bin/plymouthd
plymouth --show-splash
echo Setting up hotplug.
hotplug
echo Creating block device nodes.
mkblkdevs
echo Creating character device nodes.
mkchardevs
echo "Loading pata_amd module"
modprobe -q pata_amd
echo "Loading pata_acpi module"
modprobe -q pata_acpi
echo "Loading ata_generic module"
modprobe -q ata_generic
echo Waiting for driver initialization.
stabilized --hash --interval 250 /proc/scsi/scsi
echo Making device-mapper control node
mkdmnod
mkblkdevs
echo Scanning logical volumes
lvm vgscan --ignorelockingfailure
echo Activating logical volumes
lvm vgchange -ay --ignorelockingfailure  VolGroup00
resume /dev/VolGroup00/LogVol01
echo Creating root device.
mkrootdev -t ext3 -o defaults,ro /dev/VolGroup00/LogVol00
echo Mounting root filesystem.
mount /sysroot
cond -ne 0 plymouth --hide-splash
echo Setting up other filesystems.
setuproot
loadpolicy
plymouth --newroot=/sysroot
echo Switching to new root and running init.
switchroot
echo Booting has failed.
sleep -1
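
If you want to compare against what your RHEL 4 initrd contains, something along these lines should unpack a cpio-style image (the image name below is only an example, so use whatever is actually in /boot; some older mkinitrd versions produce a compressed filesystem image instead, which you'd loop-mount rather than unpack with cpio):

Code:

# unpack the initrd into a scratch directory and inspect its init script
mkdir /tmp/initrd-check && cd /tmp/initrd-check
zcat /boot/initrd-2.6.9-78.EL.img | cpio -idmv   # image name is an example
grep -n lvm init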


Vanyel 05-08-2009 07:34 AM

Seems possible. Thanks, jschiwal. I have to get the project the array is needed for underway, so I probably won't test that, but I'll keep it in mind.

Oh, and as a postscript to all this: I remade the array one more time and discovered that ONLY LVs on it fail to come up normally. A plain old fdisk'd ext3 partition on it can be put in fstab and mounts fine. So anyone having the same problem, consider whether you can do without logical volumes as a "quick and dirty" solution (I don't *need* them for my purpose, but I much prefer them).
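For the record, that fallback is just an ordinary fstab entry along these lines (the mount point is made up; sdg1 is the partition from the dmesg output above):

Code:

# /etc/fstab entry for the plain ext3 partition on the array
/dev/sdg1   /mnt/lacie   ext3   defaults   1 2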

- Van

cwindomsr 04-12-2011 02:05 PM

Achieving persistence for LVs across reboots
 
I don't know if anybody ever answered this question, but I have found that on SUSE 11 there is an init script in the /etc/init.d directory called boot.lvm. I added it to the service database and set it to start at runlevels 2, 3 and 5. Upon reboot, the Logical Volume Manager starts, runs the appropriate commands, and my 3.1 TB logical volume is immediately available.
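On my box that amounted to something like the following (the exact chkconfig flags may differ between SUSE releases):

Code:

# register the boot.lvm init script and enable it for runlevels 2, 3 and 5
chkconfig --add boot.lvm
chkconfig --level 235 boot.lvm on
# verify
chkconfig --list boot.lvm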

I have not tried this on Red Hat or other Linux variants.

Hope This Helps,
cwindomsr

hoover67 08-15-2011 03:54 AM

Thanks, guys. I had the exact same problem on an SLES 11 host this morning; thanks to this thread I managed to fix it quickly.

I used the command

chkconfig --level 235 boot.lvm on

to ensure the service is started on the next reboot.

Cheers & thanks, Uwe

karlochacon 08-30-2011 02:27 PM

Same issue here, guys.

I've worked with Red Hat, CentOS and SUSE 10 and never had this problem, but today, working with four SUSE 11 SP1 servers, I ran into it.

Adding boot.lvm via chkconfig worked flawlessly :)

Vanyel 09-19-2011 10:23 AM

FYI, I don't see a boot.lvm script in RHEL 4.

