High Guys,
This is probably more BTRFS related, but perhaps somebody with more experience than I can shed some light…
I had a functioning BTRFS RAID10 config, 4x6Tb disks.
I had just decided the system was stable enough to reclaim aome disks and add them to the existing RAID, 4x4Tb.
I physically added the disks, killed the existing mdraid on them and zero’d the superblocks.
I the n added all 4 via the web GUI. The balance process started automatically. By 10PM, the balance was at 13%.
The balance stopped shortly after, remaining at 13% overnight.
The system was extremely non-responsive at this point, and many docker containers were not responding.
I attempted a graceful reboot, and waited 3 hours for the system to restart. At this point the console had stopped as well as SSH - I was forced to powercycle the system.
Since then, the system is started but doesnt respond to just about anything. Many processes stuck in uninterruptable sleep awaiting I/O (stat D).
Load is continually climbing, currently >35, CPU usage and I/O wait at 0.0, morenthan 50% mem free, and the disks are all idle.
I’ve attempted restarting the balance, but command is non responsive, as is checking balance status.
Many property retrieval ops in wait (please forgive the formatting, on tablet):
root 5 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:0]
root 142 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:1]
root 838 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:4]
root 844 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:6]
root 1287 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:11]
root 1288 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:12]
root 1289 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:13]
root 1291 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:14]
root 1295 0.0 0.0 0 0 ? D 16:18 0:00 [kworker/u32:16]
root 3655 0.0 0.0 0 0 ? D 16:19 0:00 [btrfs-transacti]
root 5750 0.0 0.0 200 4 ? D 16:20 0:00 elglob -0 – envdirs /var/run/s6/env-* forx -p – envdir ${envdirs} importas -u envdir envdir s6-rmrf $
{envdir}
root 6389 0.0 0.0 15920 1060 ? D 16:20 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 6537 0.0 0.0 15944 1104 tty1 D+ 16:22 0:00 btrfs balance start -mconvert raid10 -dconvert raid10 /mnt2/tempraid/
root 6573 0.0 0.0 15944 1112 tty2 D+ 16:23 0:00 btrfs balance status /mnt2/tempraid/
root 6729 0.0 0.0 15920 1020 ? D 16:23 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 7004 0.0 0.0 15920 1096 ? D 16:26 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 7228 0.0 0.0 15920 1012 ? D 16:28 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 7454 0.0 0.0 15920 1092 ? D 16:30 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 7669 0.0 0.0 15920 1020 ? D 16:32 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 7889 0.0 0.0 15920 1056 ? D 16:34 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 8441 0.0 0.0 15920 1012 ? D 16:37 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 8876 0.0 0.0 15920 1052 ? D 16:40 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 9263 0.0 0.0 15920 1020 ? D 16:42 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 9644 0.0 0.0 15920 1108 ? D 16:44 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 10441 0.0 0.0 15920 1096 ? D 16:47 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 10865 0.0 0.0 15920 1024 ? D 16:49 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 11260 0.0 0.0 15920 1016 ? D 16:51 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 11595 0.0 0.0 15920 1056 ? D 16:52 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 11843 0.0 0.0 15920 1004 ? D 16:55 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 12101 0.0 0.0 15920 1104 ? D 16:58 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 12412 0.0 0.0 15920 1004 ? D 17:00 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 12659 0.0 0.0 15920 1088 ? D 17:03 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 12890 0.0 0.0 15920 1124 ? D 17:06 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 13191 0.0 0.0 15920 1124 ? D 17:13 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
root 13440 0.0 0.0 15920 1120 ? D 17:16 0:00 /sbin/btrfs property get /mnt2/tempraid/rockon/btrfs/subvolumes/d0bc50ff71e2d0582618d1edfcbdcbdfc7eccdbb0
804e1e928dc3da4bd03a2c3
And ideas how I can get this back up? Happy to provide more info if needed.