Here we go again
Today comming home from work, I found that my Rockstor machine had crashed / frozen. Webpage wouldnt open, and telnetting into it didn’t work either.
So I did the a hard reset, as it was completely unresponsive.
The system rebooted, and came up with the prompt that you could go to the webpage.
But that still didn’t work, and dmesg gave some info. My pool “/mnt2/RSPool” hadn’t mounted:
[ 22.282894] BTRFS: device label RSPool devid 10 transid 481084 /dev/sdb
[ 22.283536] BTRFS info (device sdb): enabling auto defrag
[ 22.283536] BTRFS info (device sdb): use no compression
[ 22.283536] BTRFS info (device sdb): disk space caching is enabled
[ 22.283536] BTRFS info (device sdb): has skinny extents
[ 22.288284] BTRFS error (device sdb): failed to read the system array: -5
[ 22.305078] BTRFS error (device sdb): open_ctree failed
btrfs fi sh gives me this:
Label: ‘rockstor_rockstor’ uuid: af5bd5b1-c1d9-4773-b288-f3aaf2046dc2
Total devices 1 FS bytes used 3.03GiB
devid 1 size 12.93GiB used 4.53GiB path /dev/sdg3
Label: ‘RSPool’ uuid: 12bf3137-8df1-4d6b-bb42-f412e69e94a8
Total devices 6 FS bytes used 5.24TiB
devid 9 size 2.73TiB used 2.57TiB path /dev/sdf
devid 10 size 1.82TiB used 1.67TiB path /dev/sdb
devid 11 size 1.82TiB used 1.67TiB path /dev/sda
devid 12 size 1.82TiB used 1.67TiB path /dev/sdd
devid 13 size 2.73TiB used 2.57TiB path /dev/sde
devid 14 size 931.51GiB used 774.00GiB path /dev/sdc
Everything looks fine.
Btrfs fi us //mnt2/RSPool gives me this:
Overall:
Device size: 12.93GiB
Device allocated: 4.53GiB
Device unallocated: 8.40GiB
Device missing: 0.00B
Used: 3.03GiB
Free (estimated): 9.48GiB (min: 9.48GiB)
Data ratio: 1.00
Metadata ratio: 1.00
Global reserve: 16.00MiB (used: 0.00B)
Data,single: Size:4.00GiB, Used:2.92GiB
/dev/sdg3 4.00GiB
Metadata,single: Size:512.00MiB, Used:110.06MiB
/dev/sdg3 512.00MiB
System,single: Size:32.00MiB, Used:16.00KiB
/dev/sdg3 32.00MiB
Unallocated:
/dev/sdg3 8.40GiB
Which is definately not my RSPool, but rather my root drive.
Searching the net is not very helpfull in these situations, much of the info is old and outdated.
Found this page somewhat usefull:
http://marc.merlins.org/perso/btrfs/post_2014-03-19_Btrfs-Tips_-Btrfs-Scrub-and-Btrfs-Filesystem-Repair.html
But none of the commands allowed me to mount (I have not tried to mount in degraded mode for now), or fix anything.
Going to this page:
https://btrfs.wiki.kernel.org/index.php/Btrfsck
I decided to run a btrfs check /dev/sd(x) (no repair option).
It gives the following result (this is from /dev/sdb, but the results are the same for different drives):
Checking filesystem on /dev/sdb
UUID: 12bf3137-8df1-4d6b-bb42-f412e69e94a8
checking extents
checking free space cache
cache and super generation don’t match, space cache will be invalidated
checking fs roots
checking csums
checking root refs
checking quota groups
Counts for qgroup id: 0/5 are different
our: referenced 16384 referenced compressed 16384
disk: referenced 32768 referenced compressed 32768
diff: referenced -16384 referenced compressed -16384
our: exclusive 16384 exclusive compressed 16384
disk: exclusive 32768 exclusive compressed 32768
diff: exclusive -16384 exclusive compressed -16384
Counts for qgroup id: 0/8682 are different
our: referenced 16384 referenced compressed 16384
disk: referenced 32768 referenced compressed 32768
diff: referenced -16384 referenced compressed -16384
our: exclusive 16384 exclusive compressed 16384
disk: exclusive 32768 exclusive compressed 32768
diff: exclusive -16384 exclusive compressed -16384
found 5763908636672 bytes used, no error found
total csum bytes: 5621155700
total tree bytes: 6391808000
total fs tree bytes: 221315072
total extent tree bytes: 170819584
btree space waste bytes: 398949541
file data blocks allocated: 5775958544384
referenced 5756768088064
There does seem to be some counts for qgoups that are different…
But nothing else is reported.
Im tempted to just run a repair, but have decided to wait and ask around (starting here), what to do next.
I don’t find btrfs particularly easy to work with in these regards
I do still wonder why the system comes up, without me being able to open the webinterface, as the boot drive seems to be fine.