Problems replacing/upgrading a disk

I currently have 12 disks in my array, set up in a BTRFS raid5 config: 4x8TB disks and 8x2TB disks. I wanted to upgrade 2 of my 2TB disks and I am having a nightmare of a time doing it.

I ran into pretty much exactly the difficulty described in this forum post: Can't remove failed drive from pool - Rebalance in progress. I tried the remove disk option in the GUI and got an error message.

I then tried mounting the array as degraded, pulled the drive that I wanted to replace, and attempted to remove the missing disk, but that didn't go over well: immediately I was given warnings of a degraded pool, a re-balance seemed to happen, and when it looked like it was done I attempted a btrfs delete missing /mnt2/Pool1 command and got an Input/Output error.
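
For reference, the kind of sequence involved was roughly the following (device name is just an example, not necessarily the exact one I used):

    # mount the pool writable with a member absent
    mount -o degraded,rw /dev/sdb /mnt2/Pool1

    # ask btrfs to drop the absent device from the pool
    btrfs device delete missing /mnt2/Pool1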

So, long story short, I put the original disk back in and rebooted. It was added back to the pool, some drive errors showed on the disk (not sure why), I performed a btrfs dev stats reset and the errors went away, but I am back to square one.
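
The stats reset was done with something along these lines (exact invocation from memory):

    # show per-device error counters for the pool
    btrfs device stats /mnt2/Pool1

    # print and then zero the counters
    btrfs device stats -z /mnt2/Pool1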

It would seem to me that I did something wrong here. What is the best method to upgrade the drives without risking the data?

Getting back to this, I am also noticing that when I reboot Rockstor, things seem to mount rw, but after a few moments it flips to ro and there is nothing I can do to stop this behavior.

After a few panic attacks I began to fear the worst and started to copy the data off to a new drive, but when in ro, the read speeds I am getting when copying data off this volume are very slow, bearing in mind that everything was working pretty normally before I tried to replace the drive.

I'm at a loss here; hopefully something can be done short of copying all the data to another drive and rebuilding everything.

@cferra Hello again and sorry I couldn’t chip in earlier.

This very much indicates a file system integrity problem. Btrfs will often go ro as a precautionary measure if it encounters issues.

I'm afraid I can't spend much time on my answer, as I'm actually working on the exact issue that threw you in the first place, i.e.:

which incidentally is only a cosmetic issue: if you had simply waited, the disk removal would most likely have completed. Apologies for not getting to this sooner, but we have a lot going on in Rockstor currently.

Removing a missing disk was added and tested recently in:

which had my current issue as a caveat. We have to break things up into manageable parts, but yes, it was a shame you were caught by this one. The above was released in stable channel 3.9.2-41.

Another problem here is your use of a parity raid, which is recommended against within the UI in the tooltip and in other places. It's definitely getting better, but we don't have the most recent improvements in our kernel / btrfs-progs. Hence the openSUSE move you commented on recently; they very much do keep their btrfs kernel parts and userland programs updated. And one of the shortcomings of the btrfs parity raid levels (5/6) is their repair capability. There are known issues there.

Anyway, given that you removed a disk, then mounted degraded,rw, then initiated a remove missing, then in turn interrupted that delete missing (first error, but salvageable), and then re-attached the prior member: it was this last step that was the worst move, and if you had not done that your pool may have been fine. You have, through both shortcomings in Rockstor's UI (I'm working on that now) and impatience on your part (understandable given the lack of UI feedback: hence the "…and no UI counterpart." in my current issue), been caught between things.

Essentially you should not have re-attached that disk and should have let the missing disk removal complete; and prior to that, the initial disk removal via the pool resize had that UI error message as a known issue and would also most likely have completed, as @Noggin's did in the forum thread you cited. So all in all this was rather poking btrfs's soft spots, and I would say your pool is not to be relied upon.

Sorry, and I feel for you, and I am working to improve this situation, but it is always far better to ask before making major changes, especially if you then compound them by interruption (first attached removal attempt) / additional intervention (physical disk removal while it was mid logical removal) / disk re-insertion (mid prior 2 events) etc.

Note also that clearing the drive errors report only clears the record of what has happened. To repair a pool one uses the scrub feature. But given your pool is 1) raid 5/6 and 2) has endured enough already :slight_smile: and is as a consequence going read only, I think it's backup-restore time, or get what you can off.
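
For future reference, on a healthy rw-mounted pool a scrub can be driven from the command line roughly as follows (using your /mnt2/Pool1 mount point):

    # read everything and repair from good copies where the redundancy allows
    btrfs scrub start /mnt2/Pool1

    # check progress and results
    btrfs scrub status /mnt2/Pool1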

Yes, a degraded pool is one where a device is missing; you had just removed a device. The "seemed to happen" balance is part of the problem we face here, as when one removes a disk there is an internal balance that does not show up in the usual 'btrfs balance status' command as one would expect. So I am having to develop an algorithm of sorts that looks for the negative unallocated figures within 'btrfs dev usage' output. Crazy really, but that is what is required and I'm in the process of developing it. So in short, removing a disk, prior attached or missing, causes an almost 'invisible' balance which can take hours.
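
If you want to watch one of these internal balances by hand, the sort of thing I have in mind is (Pool1 mount point assumed):

    # the usual balance status typically reports nothing during a device removal
    btrfs balance status /mnt2/Pool1

    # per-device allocation; a member being removed shows a negative Unallocated figure
    btrfs device usage /mnt2/Pool1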

I’m guessing that was due to an already active internal re-arrangement.

I would also recommend that you use raid1 or raid10 (raid1 preferred), as these are far more robust and much better at self repair, especially given our older kernel and btrfs-progs.
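
On a future healthy pool (not this one in its current state), a conversion away from the parity levels is a balance with convert filters, for example:

    # convert both data and metadata to raid1; this can take many hours
    btrfs balance start -dconvert=raid1 -mconvert=raid1 /mnt2/Pool1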

So in short, if you had simply left the machine alone after:

or asked on the forum, then you would probably still be OK. I know this is a real pain and seems like a silly error to be tripped up by, but the problem is actually quite complex, as the associated issues expand upon. But I have a far better behaved version of Rockstor working in house in this regard, and I have marked this forum thread as one to update when this very rough edge of Rockstor is fixed by way of released code.

Hope that helps re what may have gone wrong on all sides, and I will update this thread when I've sorted at least a little of our end of things. Apologies again for not getting there sooner, but these things are always more complicated than they at first appear. And do please ask if you encounter any further issues, as it is quite likely that others will have encountered either the same or something very similar, e.g. @Noggin's experience.

It was actually exactly what you did at the very beginning: Pool details page, Resize, remove the smaller disk. Wait (ignore the UI timeout related error). Once that has settled: details page, Resize, add the new disk. Or add then remove, if ports / bays allow. It's just a shame that I didn't have my current issue ready in time. But as we put more capabilities into Rockstor we also have more to maintain / keep workable, and we have had mention of late of slowdowns to the point of other timeouts, so I also have to do some performance work at the same time as fixing this last major UI fumble which caught you out so unfavourably.
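
Under the hood that UI flow amounts to something like the following (device names are examples only, and this assumes a healthy pool):

    # remove the old member; btrfs migrates its data off first, which can take hours
    btrfs device remove /dev/sdl /mnt2/Pool1

    # once that returns, add the replacement
    btrfs device add /dev/sdm /mnt2/Pool1

    # a follow-up balance spreads existing data onto the new member
    btrfs balance start /mnt2/Pool1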

So sorry again for not having this ready in time for your 'event', and thanks for helping to support Rockstor's development via a stable subscription. Good luck with the data; you may have use for the btrfs restore command.
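
btrfs restore works against an unmounted device and copies what it can find onto another filesystem, roughly like this (source device and target path are examples only):

    # pull files from the (unmounted) pool member onto a separate healthy disk
    btrfs restore -v /dev/sdb /mnt/recovery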

Well, from the sounds of that, it doesn't look good… I am currently able to get the volume mounted ro, I can see the data, and I can copy it to some spare drives; however, the volume seems to go down after about 30 minutes or so and read speeds are very slow. Using some btrfs wizardry, is there a way to make this more stable so that I can get the data off, or am I resigned to rebooting the server every 30 minutes?

Forgive my noobishness. I'm unfortunately in uncharted waters here.

dmesg, for what it's worth, is throwing me:
parent transid verify failed
and an alarming:
btrfs_run_delayed_refs:2971: error=-5 IO failure

Currently showing on /dev/sda, but I've seen transid warnings on /dev/sdb, /dev/sdc etc…

@cferra

I missed this topic when it came up, but found it in your post history after @bar1’s post.

Just thought I'd weigh in here that, as per the other topic, mounting in recovery mode may indeed help you.
In particular, mounting in recovery mode skips the log replay, so your transid mismatches should not error out and drop your mountpoint.
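
As a sketch, on Rockstor's current kernel that would be something like the following (device name is an example; newer kernels rename 'recovery' to 'usebackuproot', and 'nologreplay' is the option that specifically skips log replay):

    # mount read-only, falling back to backup tree roots if the current ones fail
    mount -o ro,recovery /dev/sda /mnt2/Pool1

    # newer kernel equivalent, with log replay explicitly skipped
    mount -o ro,usebackuproot,nologreplay /dev/sda /mnt2/Pool1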

I tried to get it to go in recovery mode but was never able to. Ultimately, after many, many reboots, I was able to get 99.9 percent of the data off, but it was an exercise in frustration.