Ran out of space, now rockstor won't mount drives

Hi,

I have a RAID10 array that ran out of space. It took me a while to work that out, and I've cleared a few gigs now, but Rockstor is still not mounting the pool or starting the Rock-on services. The Web UI is sluggish and keeps failing on API calls, and so does running ./bootstrap.

I can mount the drives myself using mount -U <uuid> /mnt2, which works fine, but that still doesn't fix the Web UI or whatever else is failing.

The only errors I’ve found are the following (other than errors about qgroups not existing)

ERROR [storageadmin.views.command:192] Exception while setting service statuses during bootstrap: (Error running a command. cmd = /usr/bin/systemctl start atd. rc = 4. stdout = ['']. stderr = ['Failed to start atd.service: Transaction is destructive.', "See system logs and 'systemctl status atd.service' for details.", '']).
[07/May/2019 14:51:15] ERROR [storageadmin.util:44] Exception: Exception while setting service statuses during bootstrap: (Error running a command. cmd = /usr/bin/systemctl start atd. rc = 4. stdout = ['']. stderr = ['Failed to start atd.service: Transaction is destructive.', "See system logs and 'systemctl status atd.service' for details.", '']).
Traceback (most recent call last):
  File "/opt/rockstor/src/rockstor/storageadmin/views/command.py", line 188, in post
    systemctl('atd', 'start')
  File "/opt/rockstor/src/rockstor/system/services.py", line 63, in systemctl
    return run_command([SYSTEMCTL_BIN, switch, service_name])
  File "/opt/rockstor/src/rockstor/system/osi.py", line 121, in run_command
    raise CommandException(cmd, out, err, rc)
CommandException: Error running a command. cmd = /usr/bin/systemctl start atd. rc = 4. stdout = ['']. stderr = ['Failed to start atd.service: Transaction is destructive.', "See system logs and 'systemctl status atd.service' for details.", '']

I've checked for failed drives, checked dmesg, and run a btrfs check; everything comes back clean.

Any help appreciated.

Never seen that before, but it seems to be systemd related.

First port of call is the systemd journal. Try to start the service, and then run:

journalctl -n 50

Check to see if any clues are provided.
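If the journal is noisy, narrowing the output to the unit in question can also help (unit name taken from the error in your logs), e.g.:

# show the last 50 journal entries for the atd unit only
journalctl -u atd.service -n 50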

Next, I'd try reloading the systemd daemon:

systemctl daemon-reload

Then try to start again.

Alternatively, did you recently try to shut down, suspend or reboot the machine, but have it not complete?
If so, you may have a stuck suspend process. Check with:

ps aux | grep systemd-sleep

If found, kill it by process ID, reboot, then try again.
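For example (the PID here is hypothetical; use the one shown in the ps output above):

kill 12345
# if it refuses to die:
kill -9 12345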

I've checked atd and rockstor using systemctl status; both are running at the moment.

My issue seems to be with getting Rockstor to mount the RAID array, or getting the rockstor service to do it, as I guess that is what the API calls are doing.

Have you tried reloading the systemd daemon though?
Did you attempt to reboot but not succeed?

I ask because the messages in your logs indicate a systemd problem.

What is the output of the journalctl command I provided earlier after attempting to restart atd?

I’ve rebooted my machine a couple of times without success already.

I’ve reloaded the systemd daemon, no change.
I've re-run ./bootstrap, which is spitting out the following error:
Exception occured while bootstrapping. This could be because rockstor.service is still starting up. will wait 2 seconds and try again. Exception: HTTPConnectionPool(host='127.0.0.1', port=8000): Max retries exceeded with url: /api/commands/bootstrap (Caused by <class 'httplib.BadStatusLine'>: '')
Error connecting to Rockstor. Is it running?

rockstor.service is now running, and checking its status shows that all parts have started successfully.

I grepped the journal just now for ‘rockstor’

 bootstrap[6430]: Exception occured while bootstrapping. This could be because rockstor.service is still starting up. will wait 2 seconds and try again. Exception: HTTPConnectionPool(host='127.0.0.1', port=8000): Max retries exceeded with url: /api/commands/bootstrap (Caused by <class 'httplib.BadStatusLine'>: '')
May 08 09:37:06 lazarus systemd[1]: rockstor-bootstrap.service: main process exited, code=exited, status=1/FAILURE
May 08 09:37:06 lazarus systemd[1]: Unit rockstor-bootstrap.service entered failed state.
May 08 09:37:06 lazarus systemd[1]: rockstor-bootstrap.service failed.

So it looks like the issue lies here: rockstor-bootstrap.service cannot start due to the error above.

@Haioken Just chipping in on this one: could the original 'out of space' issue have been with the system disk? That would explain the rather strange atd error, and since we mount the data drives within the rockstor-bootstrap service, an overly full system disk could have such effects.

Maybe the output of:

btrfs fi show

and a

btrfs fi usage /mnt2/<each-pool-name-in-turn>

might shed some light on things.

@Michael_Arthur Did you, for instance, use the system drive as a Rock-ons root? Maybe it got too full, and so we have these unusual systemd / atd failures.

Just a quick thought.

Hope that helps.

The system disk is practically empty:

Label: 'rockstor_rockstor00'  uuid: de89a23b-cfcd-49e2-a705-f4e463105cf5
	Total devices 1 FS bytes used 7.89GiB
	devid    1 size 224.52GiB used 12.06GiB path /dev/sda3

Label: 'redundant_storage'  uuid: 3794bd2d-fa72-46f6-b074-782862f69618
	Total devices 4 FS bytes used 1.50TiB
	devid    1 size 1.82TiB used 792.03GiB path /dev/sdb
	devid    2 size 1.82TiB used 792.03GiB path /dev/sde
	devid    3 size 1.82TiB used 792.03GiB path /dev/sdc
	devid    4 size 1.82TiB used 792.03GiB path /dev/sdd


   Overall:
Device size:		   7.28TiB
Device allocated:		   3.05TiB
Device unallocated:		   4.23TiB
Device missing:		     0.00B
Used:			   3.00TiB
Free (estimated):		   2.13TiB	(min: 2.13TiB)
Data ratio:			      2.00
Metadata ratio:		      2.00
Global reserve:		 512.00MiB	(used: 0.00B)

Data,RAID10: Size:1.52TiB, Used:1.50TiB
   /dev/sdb	 389.00GiB
   /dev/sdc	 389.00GiB
   /dev/sdd	 389.00GiB
   /dev/sde	 389.00GiB

Metadata,RAID10: Size:4.00GiB, Used:3.44GiB
   /dev/sdb	   1.00GiB
   /dev/sdc	   1.00GiB
   /dev/sdd	   1.00GiB
   /dev/sde	   1.00GiB

System,RAID10: Size:64.00MiB, Used:176.00KiB
   /dev/sdb	  16.00MiB
   /dev/sdc	  16.00MiB
   /dev/sdd	  16.00MiB
   /dev/sde	  16.00MiB

Unallocated:
   /dev/sdb	   1.44TiB
   /dev/sdc	   1.44TiB
   /dev/sdd	   1.44TiB
   /dev/sde	   1.44TiB

@Michael_Arthur OK, that rules that one out anyway.

Looks like I missed the mark on that one.

Thanks for the outputs. Hopefully others can chip in here also as this is a strange one.

Strangely, I've managed to get rockstor-bootstrap.service running:
systemctl start rockstor-bootstrap

It failed at first but seems to be running now.
I had to manually mount the raid10 array first though.
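For reference, something along these lines (pool UUID taken from the btrfs fi show output above; Rockstor normally mounts the pool by label under /mnt2):

mount -U 3794bd2d-fa72-46f6-b074-782862f69618 /mnt2/redundant_storage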

Let's see how much further I can get; the Web UI is behaving very badly and I'm getting timeouts at the moment.

Is this normal?

[09/May/2019 09:44:16] ERROR [system.osi:119] non-zero code(1) returned by command: ['/usr/sbin/btrfs', 'qgroup', 'show', '/mnt2/redundant_storage/rockons/btrfs/subvolumes/518e95a85a98e9ababc16767b81ca4185ba7ff28137d00f1cee5df14426723e3-init']. output: [''] error: ["ERROR: can't list qgroups: quotas not enabled", '']

I'm getting lots of API call failures.

Yay, more errors. Looks like I'm going to have to rebuild - I've ordered a 4TB backup… :frowning: sad days.

May 10 18:06:13 lazarus docker-wrapper[28140]: run_command(cmd)
May 10 18:06:13 lazarus docker-wrapper[28140]: File "/opt/rockstor/src/rockstor/system/osi.py", line 121, in run_command
May 10 18:06:13 lazarus docker-wrapper[28140]: raise CommandException(cmd, out, err, rc)
May 10 18:06:13 lazarus docker-wrapper[28140]: system.exceptions.CommandException: Error running a command. cmd = /usr/bin/dockerd --log-driver=journald --storage-driver btrfs --storage-opt btrfs.min_space=1G --data-root /mnt2/rockons. rc = 1. stdout = ['']. stderr = ['chmod /mnt2/rockons: operation not permitted', '']

@Michael_Arthur, Sorry for the slow response. I’ve had a thought re your more recent report:

What version of Rockstor are you using? If it is not a more recent stable updates channel version, then quotas being disabled on a pool could well explain a lot of these errors. Earlier versions of Rockstor absolutely depended on quotas being enabled; if they were disabled, it would fall flat on its face, be unable to mount that or any other pool, and likely end up in a complete mess.

Just a thought. In that case, enabling quotas on all pools via the command line and rebooting might get you sorted, but a quotas-disabled state on a pool under the testing channel (which is currently older than stable) is likely to lead to a variety of unknown states (read: Web-UI breakages that may not recover).
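Something like the following (a sketch only; pool mount points taken from your earlier btrfs fi show output) should re-enable quotas:

# re-enable quotas on each pool
btrfs quota enable /mnt2/rockstor_rockstor00
btrfs quota enable /mnt2/redundant_storage
# the quota rescan can take a while; check its status with
btrfs quota rescan -s /mnt2/redundant_storage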

Also note that if you have retroactively installed docker-ce on a testing channel release, then you will face the following upstream docker-ce related issue:

which in turn led to us having to 'cope' with a quotas-disabled state, a capability now present in the later stable release channel versions only.

I may be 'off the mark' again, but just in case.

Hope that helps and let us know how you get on.

I've paid for stable updates and have been running the same version for a long time now; however, I will investigate that.

@Michael_Arthur

Thanks for helping to support Rockstor development. It's well worth double-checking your exact version, as if you went from testing to stable we had the following bug:

https://github.com/rockstor/rockstor-core/issues/1870

and its associated pull request:

https://github.com/rockstor/rockstor-core/pull/1871

Also note that very early 3.9.2.# versions shared this quotas-disabled failure, as per the GitHub references in my last post, so it's best to check and report your version here so we can rule that one out also.

So do make sure via:

yum info rockstor

Which should show the installed and available versions.

It would be good to get to the source of your issue here, as you have reported non-Rockstor systemd tasks failing to start atd (the at daemon), which makes me think you have a low-level problem with your system disk (ssd) or its attachment (cable) / driver / controller. And of course Rockstor sits on top of a bunch of other such services. Maybe take a look at the btrfs error report for the system disk, for example via (in your case):

btrfs dev stats /mnt2/rockstor_rockstor00

Re:

You may have a stuck immutable bit, or that subvol may have gone read-only, or it is usually on the data pool but, if that pool was not mounted at the time, writes have fallen through to a directory on the system pool. For an instance of immutable-flag bugs we had a while ago, see @Haioken's and @Rene_Castberg's contributions in the following thread:

Again, this was associated with early stable channel releases.
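For reference, a minimal way to check for and clear a stuck immutable bit (path assumed from your dockerd error) would be:

# list the attributes of the directory itself; an 'i' flag means immutable
lsattr -d /mnt2/rockons
# clear the immutable flag if it is set
chattr -i /mnt2/rockons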

You have quite dispersed errors, all from a single system and seemingly spanning both your pools (assuming, in that case, that /mnt2/rockons was on your data drives).

You could also check your system's memory, i.e. take a look at our Pre-Install Best Practice (PBP) doc section, specifically the Memory Test (memtest86+) subsection.

I'm just puzzled by so many different and unrelated (except via shared OS parts) errors here.

Let us know how you get on, and think about what might have changed between your prior, presumably more stable, time and what is happening now. For instance, have you increased the load on the PSU - again a common cause of intermittent and seemingly unrelated issues? Does the system have adequate ventilation: dust bunnies etc.?

Hope that helps.

Rockstor is version 3.9.2

Shock horror: rockstor.service has recovered now, after a power cut here of all things - drives mounted and the Web UI working. :man_shrugging: I was in the process of copying everything off to potentially do a fresh install. I'm still left with the Docker not starting issue though.

[root@lazarus ~]# btrfs dev stats /mnt2/rockstor_rockstor00
[/dev/sda3].write_io_errs    0
[/dev/sda3].read_io_errs     0
[/dev/sda3].flush_io_errs    0
[/dev/sda3].corruption_errs  0
[/dev/sda3].generation_errs  0 

chattr -i /mnt2/rockons
solved the Docker issue; now I'm left with this one:

May 12 18:21:30 lazarus dockerd[10720]: User uid:    911
May 12 18:21:30 lazarus dockerd[10720]: User gid:    1000
May 12 18:21:30 lazarus dockerd[10720]: -------------------------------------
May 12 18:21:30 lazarus dockerd[10720]:
May 12 18:21:30 lazarus dockerd[10720]: chown: changing ownership of '/config': Operation not permitted
May 12 18:21:30 lazarus dockerd[10720]: [cont-init.d] 10-adduser: exited 0.
May 12 18:21:30 lazarus dockerd[10720]: [cont-init.d] 30-dbus: executing...
May 12 18:21:30 lazarus dockerd[10720]: chown: changing ownership of '/config': Operation not permitted
May 12 18:21:30 lazarus dockerd[10720]: [cont-init.d] 10-adduser: exited 0.
May 12 18:21:30 lazarus dockerd[10720]: [cont-init.d] 30-config: executing...

However, Docker is now running, and so are all the Docker images.

It's puzzling that it somehow resolved itself, though I did play around a little with re-enabling quotas, so perhaps that helped…

Since I've run out of space, I'm thinking of moving to mirrored or RAID 5; how safe is it to change that?

@Michael_Arthur

Could you post the output of the:

yum info rockstor

command, as there is more info there that is pertinent to some of your reports: the 3.9.2 release as a whole has had over 45 fixes since its release, some fairly significant, like the ability to work with quotas disabled and, a little later, Web-UI elements to disable / enable quotas, plus some fixes for the immutable attribute (chattr -i) stuff.

Glad to hear some positive change is afoot. It would be nice, and potentially useful, to have some feedback on my prior suggestions re ensuring root causes for some of these issues; I fully appreciate that everything takes time, however. But if your hardware is 'flaky', as the range of issues suggests (though not all of them, of course), it's in your interest to do the suggested checks.

Great, and thanks for the update.

Yes, quite possibly; however, to say so for sure we need the output of the above yum command. Quotas were a real pain for Rockstor for a period back there and are still a haunting element of btrfs, mainly on the performance and usage reporting front, i.e. they kill performance but are absolutely required to report usage. But the downside of significantly lower performance is being somewhat alleviated by upstream (the kernel and btrfs folks), so upon our moving to the more progressive base of openSUSE we should have that one sorted, courtesy of upstream, which will be nice.

Also note that there is a delay between enabling quotas and its size reporting coming into effect, and that you will require a later stable version to cope with a quota state change.

Any changes on a system of unknown reliability, as yours appears to be, are ill advised. Try the suggested memtest from my prior post before making any changes, as no operating system or file system is robust to bad memory; hence the common additional precaution of ECC memory in server-grade hardware.

But given the assumption of good hardware, yes, you can change to btrfs raid1 for a close equivalent to traditional mirrored raid. Btrfs raid1 effectively attempts to keep 2 copies of data and metadata on 2 independent devices, irrespective of the number of devices in the given pool. This also affords btrfs the capability, under some circumstances, of self healing: given that all data and metadata are checksummed by default, it can check data/metadata integrity and, given a raid level with multiple copies (2 in raid1/10 currently), it can substitute the known good copy when it encounters a bad one. Scheduled scrubs form an integral, proactive part of this element of btrfs. So in summary, both btrfs raid1 and raid10 keep only 2 copies of data and can each only suffer a single drive failure; I would favour raid1 unless you require the potential performance benefit of btrfs raid10. There are also slightly fewer 'moving parts' in raid1 than in raid10, as there is no striping. Just a thought. Also, I would avoid compression for the time being, simply as it's another 'moving part'. I'm a little cautious that way :slight_smile:

Note however that the btrfs parity raid levels of 5/6 are not recommended, as they are significantly less mature and have incomplete facilities within the filesystem, such as size reporting and repair capability. Btrfs raid1 / raid10 are far more robust / mature, and perform better across the board as a result. So don't use raid5/6 unless you know the risks you are taking.
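If, after the hardware checks, you do decide to move to btrfs raid1, the command-line conversion would look something like the sketch below (pool mount point assumed from your earlier outputs; expect it to take a long time on ~1.5TiB of data, and prefer the Web-UI's pool resize / raid change where available):

# convert both data and metadata to raid1, in place
btrfs balance start -dconvert=raid1 -mconvert=raid1 /mnt2/redundant_storage
# check progress from another terminal
btrfs balance status /mnt2/redundant_storage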

So chuffed your system seems to be on the 'up', but please do report the exact Rockstor version via that command, run as root on a terminal / console, as it may well help to explain a lot. There is as yet no explanation for your failed systemd atd service, which is why I suggested the memory check / PSU consideration in my last post, as those are common components for causing seemingly unrelated issues.

Well done for persevering, but do keep in mind that for forum members to help you they need complete feedback on suggested commands, and a full history if possible, i.e. why were quotas disabled? You also mention a power failure. Power failures are unfriendly but not necessarily disastrous, thanks to btrfs's copy-on-write mechanism and your use of a redundant btrfs raid level. Our current limitation to a single-device system disk is a weakness, but one that hopefully upstream will tackle in time. And if said power failure, or another that preceded it, was the cause of your atd failure, then yes, a re-install and re-import on known good hardware would be nice. Just remember, if you are going the system re-install route, to disconnect all data drives (with the system powered down) so you can be certain human/installer error doesn't result in an install over the top of a current data pool member.

We do have the following 'Reinstalling Rockstor' howto doc entry, which might be useful.

As for disk usage of the different btrfs raid levels, there is the online btrfs calculator:
btrfs disk usage calculator

Hope that helps.

Hi Phil,

Happy to do some digging around to try to work out the root cause.

Loaded plugins: changelog, fastestmirror
Loading mirror speeds from cached hostfile
 * base: mirror.xnet.co.nz
 * epel: mirror.xnet.co.nz
 * extras: mirror.xnet.co.nz
 * rpmfusion-free-updates: ucmirror.canterbury.ac.nz
 * updates: mirror.xnet.co.nz
Installed Packages
Name        : rockstor
Arch        : x86_64
Version     : 3.9.2
Release     : 48
Size        : 85 M
Repo        : installed
From repo   : Rockstor-Stable
Summary     : RockStor -- Store Smartly
License     : GPL
Description : RockStor -- Store Smartly

Here's what I believe happened. First, I ran out of disk space; after a reboot, services started failing to start, and the RAID array in turn failed to be mounted.
After investigation I tried running various btrfs commands to check for drive failure (this was my original assumption as to the cause).
Then I looked at a mixture of disabling/enabling quotas (this likely caused further issues).

I removed enough files to free up space and, having re-enabled quotas, after a reboot the services started correctly again.
Then I ran the chattr command, which allowed writing to the drives again (the stuck flag likely happened due to the power failure).

About a month ago I did have a PSU failure; I replaced the PSU with a brand new one, so I'm going to rule out the power supply. I will try to run a memtest ASAP. I think I did run one after the PSU failure, as the PSU died slowly as opposed to blowing a cap.

Let me know if I’ve missed anything. I’m now also investing in a UPS :smiley:


@Michael_Arthur Nice rundown, thanks.

Super. If you are going to have Rockstor interface directly with this UPS, i.e. the data cable of the UPS plugs into the Rockstor machine (the ideal arrangement for Rockstor), then make sure to get a model that is well supported by NUT: https://networkupstools.org/.
They have a nice Hardware compatibility list page:
https://networkupstools.org/stable-hcl.html

Many models support usbhid-ups, which is one of the most developed, if not the most feature-rich, drivers. It's less feature-rich because it's a bit of a cover-all, but really you mainly just need the basics.

Also take a look at the docs section UPS / NUT Setup, as Rockstor can act to inform other machines on the network of the power state - assuming the switch/network and other machines are also powered by the same UPS, of course. NUT is surprisingly capable that way.
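For orientation, the driver side of a NUT setup boils down to an /etc/ups/ups.conf entry along these lines (a sketch only: the UPS name and description here are made up, and Rockstor's UPS / NUT service configuration normally manages this for you):

# /etc/ups/ups.conf
[nas-ups]
    driver = usbhid-ups
    port = auto
    desc = "NAS UPS"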

Let us know how you get on.