[SOLVED] Failed to import any pool on this device .. did I lose my data?

Hey all,
I tried importing a disk and got the following error.
I am worried that I lost all my data.

Brief description of the problem

Error while importing the pool on a disk. The disk had been managed by Rockstor on an old installation.

Detailed step by step instructions to reproduce the problem

  1. New VM installation on Proxmox
  2. Added the disks and set their serial numbers (see the example after this list)
  3. Tried importing the disks: one imported without any problem, the other one throws this error. Both had been managed by Rockstor before.
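
For reference, the serials in step 2 can be set on the Proxmox side with something along these lines; the VM ID, bus slot, by-id path and serial below are placeholders, not my actual values:

# hypothetical example: pass a whole disk through to VM 100 with a fixed serial
qm set 100 --virtio1 /dev/disk/by-id/ata-EXAMPLE_DISK,serial=EXAMPLESERIAL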

Web-UI screenshot

Error Traceback provided on the Web-UI

Traceback (most recent call last):
  File "/opt/rockstor/src/rockstor/storageadmin/views/disk.py", line 699, in _btrfs_disk_import
    mount_root(po)
  File "/opt/rockstor/src/rockstor/fs/btrfs.py", line 246, in mount_root
    run_command(mnt_cmd)
  File "/opt/rockstor/src/rockstor/system/osi.py", line 115, in run_command
    raise CommandException(cmd, out, err, rc)
CommandException: Error running a command. cmd = /bin/mount /dev/disk/by-label/1tb /mnt2/1tb. rc = 32. stdout = ['']. stderr = ['mount: wrong fs type, bad option, bad superblock on /dev/vdb,', ' missing codepage or helper program, or other error', '', ' In some cases useful info is found in syslog - try', ' dmesg | tail or so.', '']

Thanks!

EDIT: Tried running the following command:

[root@rockstor ~]# btrfs check /dev/vdb
checksum verify failed on 883916800 found EC6DD33F wanted 000002A8
checksum verify failed on 883916800 found EC6DD33F wanted 000002A8
bytenr mismatch, want=883916800, have=68719477456
Couldn’t read tree root
ERROR: cannot open file system
[root@rockstor ~]#
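
For anyone else hitting this: before trying anything that writes to the disk, it may be worth first checking whether the kernel can actually see both pool members, for example with:

# check recent kernel messages for btrfs / device errors
dmesg | tail
# list btrfs filesystems and whether any member devices are reported missing
btrfs filesystem show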

Is it safe to run a btrfs restore?

EDIT2: I managed to mount the drive manually using:

mount -o rw,degraded,recovery /dev/vdb /mnt2/1tb

Then, I was able to import the Pool in the GUI.

This thread can be closed. I’ll leave this here, for anybody having the same problem.

Could somebody explain why and how this could happen?

@JannikJung0 Hello again and thanks for the report.

Were the drives part of the same pool? I'm assuming so. If so, then your first error:

CommandException: Error running a command. cmd = /bin/mount /dev/disk/by-label/1tb /mnt2/1tb. rc = 32. stdout = ['']. stderr = ['mount: wrong fs type, bad option, bad superblock on /dev/vdb,', ' missing codepage or helper program, or other error', '', ' In some cases useful info is found in syslog - try', ' dmesg | tail or so.', '']

Can result from a known issue with mount by label (which we initially try) on multi-disk pools:

That issue references the btrfs wiki that describes this known behaviour / bug.

As it happens, this is what I'm currently working on. Essentially, if one first executes a:

btrfs device scan

then the issue is averted. As a system-wide scan is overkill for every pool mount ‘on the fly’ (which is where we are heading capabilities-wise), I'm implementing an ‘all pool disks only’ scan first, rather than a system-wide one. So, given you may have effectively ‘live plugged’ some disks via your use of a VM, that command may have helped to avert this problem.
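
As a rough command-line sketch of that workaround (label and mount point as per this thread):

# make the kernel (re)discover all btrfs member devices
btrfs device scan
# confirm the pool now lists all of its devices, with none missing
btrfs filesystem show /dev/disk/by-label/1tb
# the by-label mount that Rockstor attempts should then stand a chance
mount /dev/disk/by-label/1tb /mnt2/1tb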

From https://btrfs.wiki.kernel.org/index.php/Manpage/btrfs-check
the btrfs check command you ran (read-only, since --repair was not given) should make no changes to an unmounted filesystem!

Often, if time permits (it can take ages) and the only copy of the data resides on the otherwise un-mountable pool, then the restore feature is the first port of call, as any repair actions can make things worse. See:
https://btrfs.wiki.kernel.org/index.php/Restore

The btrfs restore utility is a non-destructive method for attempting to recover data from an unmountable filesystem. It makes no attempt to repair the filesystem, which means that you cannot cause further damage by running it. It is available as part of btrfs-progs.
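
As a sketch only (the destination must be a separate, healthy filesystem with enough free space; paths here are illustrative):

# dry run: list what would be recovered, without writing anything
btrfs restore -D -v /dev/vdb /mnt/recovery_target
# real run: copy the recoverable files out to the separate destination
btrfs restore -v /dev/vdb /mnt/recovery_target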

You state having first tried:

mount -o rw,degraded,recovery /dev/vdb /mnt2/1tb

We have to be very careful recommending such action, as in some circumstances this is a one-shot deal, ie you get one attempt to make repairs in rw mode on the mounted pool. If those repairs are not enacted in that first rw,degraded mount then you will thereafter not be able to remount the pool in any mode other than ro. Also, recovery is a deprecated mount option:
https://btrfs.wiki.kernel.org/index.php/Mount_options

recovery (since 3.2)
(Deprecated in 4.6-rc1) Enable autorecovery upon mount; currently it scans list of several previous tree roots and tries to use the first readable. The information about the tree root backups is stored by kernels starting with 3.2, older kernels do not and thus no recovery can be done. Replaced by usebackuproot in 4.6-rc1, but still works.
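
So if a backup-tree-root attempt is wanted on a current kernel, the less risky form would be read-only and via the replacement option, something along the lines of (device and mount point as per this thread):

# read-only attempt using the non-deprecated replacement for 'recovery'
mount -o ro,usebackuproot /dev/vdb /mnt2/1tb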

If you had in fact mounted one drive rw,degraded while the other was not yet visible to the system (see the btrfs device scan note above), then you could have inadvertently put the two drives out of sync, as suggested by the btrfs check output. But this is not consistent with your reported order.
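
As a hedged aside, one way to gauge whether two pool members have drifted apart is to compare their superblock generation numbers (this assumes a reasonably recent btrfs-progs; older versions shipped this as btrfs-show-super, and /dev/vdc here just stands in for whatever your second device is):

# print each member's superblock and compare the 'generation' lines
btrfs inspect-internal dump-super /dev/vdb | grep -w generation
btrfs inspect-internal dump-super /dev/vdc | grep -w generation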

Btrfs the filesystem and Rockstor, each for their own reasons (some interrelated), have issues with dynamic drive arrangements. I suspect your Proxmox use, and the lack of a reboot / btrfs device scan after the disk changes, may have led to this issue, ie only one drive's btrfs nature being visible (there is no indication of such a step between steps 2 and 3). There are also known issues with udev in this circumstance, but I'm pretty sure we have those sorted now; please see:


for where we demonstrated the effect of btrfs formats not showing up. That, however, should have been addressed by the subsequent changes in:

which should have already been in your Rockstor version, unless you used a really old one of course.
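
Incidentally, for disks hot-added to a VM, a rough sketch of nudging things along without a full reboot might be:

# ask udev to re-process block devices and wait for it to finish
udevadm trigger --subsystem-match=block
udevadm settle
# then make btrfs aware of any newly visible members
btrfs device scan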

Dynamic drive management, and filesystems on top of such, is not an easy problem. It is always best to have the final hardware arrangement established on a fresh boot prior to an import, especially with multi-drive pools. But, as indicated, we are working towards greater robustness in such situations, and this is also a focus for the btrfs team at Facebook this year, ie multi-disk pools.

Hope that helps; I realise this is not a full explanation, but I am not a btrfs expert. I just wanted to chip in to advise against the order you promote here.

I'm chuffed you have your data access up and running again; however I would caution, given the above, that you make sure your backups are in place and up to date prior to a reboot / re-mount. Just in case, and especially given the rw,degraded mount execution.
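
If it helps, a minimal sketch of getting a copy off the pool before any reboot (share name and backup target are placeholders, and it assumes a second btrfs filesystem mounted at /mnt/backup):

# take a read-only snapshot of the share to be preserved
btrfs subvolume snapshot -r /mnt2/1tb/myshare /mnt2/1tb/myshare_backup_ro
# stream it to the separate btrfs filesystem
btrfs send /mnt2/1tb/myshare_backup_ro | btrfs receive /mnt/backup

A plain rsync or copy to any other filesystem would of course do just as well.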

Well done on getting it sorted; it can be quite a worry, but we have many moving parts here. However, btrfs raid1 is normally pretty forgiving, especially while the current drive count meets or exceeds the minimum of 2. Hopefully I'll have the additional use of ‘btrfs device scan’ in Rockstor ready for review shortly.

Obviously the longer-term aim is to have Rockstor's Web-UI help more, ‘guidance wise’, in these failure scenarios. It is not an easy problem to solve elegantly across all scenarios, but the work towards this is under way.

Thanks again for sharing your experience.