Rockstor suddenly won't start (possible issue with docker?)

Brief description of the problem

Yesterday around noon I was cleaning up some files on one of the shares, and everything was working fine. Then yesterday evening I noticed I couldn’t access the NAS anymore. I tried to log in through the UI, but that was offline as well. I have traced it back to docker not starting, which I assume is a requirement for the rockstor-pre service to start (due to Rock-ons).

Some system info:

  • Rockstor built on openSUSE Leap 15.4
  • Docker version 24.0.7-ce, build 311b9ff0aa93
  • Rockstor-5.0.15-0

A bit of a tangent:

I also get a warning in Zypper:

Repository ‘Update repository of openSUSE Backports’ metadata expired since 2024-09-23 11:47:52 CEST

I read somewhere else on the forum that this version of openSUSE might be end-of-life, so I’ll need to check on how to update that.

Anyways, back to the topic at hand:

The docker service won’t start. When I look at the logs I see the following error:

Feb 07 00:49:07 nas.xxx systemd[1]: Starting Docker Application Container Engine…
Feb 07 00:49:07 nas.xxx dockerd[13479]: time="2025-02-07T00:49:07+01:00" level=info msg="SUSE:secrets :: enabled"
Feb 07 00:49:07 nas.xxx dockerd[13479]: chmod /mnt2/home: operation not permitted
Feb 07 00:49:07 nas.xxx systemd[1]: docker.service: Main process exited, code=exited, status=1/FAILURE
Feb 07 00:49:07 nas.xxx systemd[1]: docker.service: Failed with result 'exit-code'.
Feb 07 00:49:07 nas.xxx systemd[1]: Failed to start Docker Application Container Engine.
Feb 07 00:49:07 nas.xxx systemd[1]: docker.service: Scheduled restart job, restart counter is at 1.
Feb 07 00:49:07 nas.xxx systemd[1]: Stopped Docker Application Container Engine.
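
The line that matters in that output is the dockerd one: a chmod on /mnt2/home is refused. A minimal sketch for pulling that path out of the journal and looking at what might block the chmod is below. The function names are hypothetical (they are not part of Rockstor or Docker), and the immutable-attribute check is only one possible cause, which nothing in this thread confirms.

```shell
# Hypothetical helpers for illustration; not part of Rockstor or Docker.

# Read journal lines on stdin and print the path dockerd failed to chmod.
extract_chmod_path() {
    sed -n 's/.*dockerd\[[0-9]*\]: chmod \(.*\): operation not permitted.*/\1/p'
}

# Show owner, group and mode of a path, plus its file attributes if lsattr
# is installed. An "i" (immutable) flag in the lsattr output makes chmod
# fail even for root; that this is the cause here is an assumption.
inspect_path() {
    stat -c 'owner=%U group=%G mode=%a %n' "$1"
    if command -v lsattr >/dev/null 2>&1; then
        lsattr -d "$1"
    fi
}
```

Possible usage, run as root: `journalctl -u docker.service --no-pager | extract_chmod_path | sort -u`, followed by `inspect_path /mnt2/home`.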

Other bits and pieces:

  • The NetworkManager service is running
  • OS Drive is only at 30% usage
  • Btrfs check on OS drive and pool drives all returned no errors

How to proceed?

I’m not quite sure how to proceed from here. It just seems odd that it worked in the afternoon, but by the evening everything had stopped working and it won’t come back up.

Any ideas or suggestions?

I dug a bit deeper. When looking at the docker service status I see the following:

ExecStart=/usr/bin/dockerd --add-runtime oci=/usr/sbin/docker-runc $DOCKER_NETWORK_OPTIONS $DOCKER_OPTS --config-file /opt/rockstor/conf/docker-daemon.json

When I run this command with sudo I get the same error: operation not permitted

This line can be found in

/etc/systemd/system/docker.service

I have no idea what it does, though.
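
For context, that ExecStart line is the command systemd runs to launch the Docker daemon, and the --config-file argument points dockerd at a JSON file with further daemon settings. A small sketch for extracting that file path from the unit so it can be inspected is below. The function name is made up for illustration, and what the file actually contains on a Rockstor install is not shown in this thread.

```shell
# Hypothetical helper for illustration.
# Read a unit file on stdin and print the value passed to --config-file
# on its ExecStart line (accepts "--config-file PATH" or "--config-file=PATH").
config_file_from_execstart() {
    sed -n 's/^ExecStart=.*--config-file[ =]\([^ ]*\).*/\1/p'
}
```

Possible usage: `config_file_from_execstart < /etc/systemd/system/docker.service` prints the path, and `cat` on that path shows the settings dockerd is started with.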

@stitch10925 hello again.

You are correct: Leap 15.4 has been EOL for some time now (07 Dec 2023), and 15.5 just reached its EOL as well (31 Dec 2024), so you will want to consider moving to 15.6 now. You can either follow the two upgrade how-tos and do it step-wise:

https://rockstor.com/docs/howtos/15-4_to_15-5.html
and then
https://rockstor.com/docs/howtos/15-5_to_15-6.html

or back up your configuration (and any items you changed manually rather than through the WebUI) and re-install using one of the official 15.6 downloads.
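
To see which of the two how-tos applies to a given box, something like the sketch below could work. It assumes the standard VERSION_ID field in /etc/os-release; the function names are hypothetical, and the URLs are simply the two links above.

```shell
# Hypothetical helpers for illustration.

# Print VERSION_ID from an os-release style file (default: /etc/os-release).
os_version_id() {
    sed -n 's/^VERSION_ID="\{0,1\}\([^"]*\)"\{0,1\}$/\1/p' "${1:-/etc/os-release}"
}

# Map a Leap version to the next step-wise upgrade how-to from this thread.
next_howto() {
    case $1 in
        15.4) echo "https://rockstor.com/docs/howtos/15-4_to_15-5.html" ;;
        15.5) echo "https://rockstor.com/docs/howtos/15-5_to_15-6.html" ;;
        15.6) echo "already on 15.6, no step-wise upgrade needed" ;;
        *)    echo "no how-to listed in this thread for version '$1'" ;;
    esac
}
```

Possible usage: `next_howto "$(os_version_id)"`. On a 15.4 system this prints the first link; after that upgrade it prints the second.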

As for the docker service, I believe it depends on the rockstor services being started first, so you could do another

systemctl restart network

and then

systemctl start rockstor-bootstrap

(or you start rockstor-build, rockstor-pre and rockstor individually), and see where that lands you. If it doesn’t work, you could check either the rockstor logs or journalctl -xe to see whether you get a better idea of what else might be going wrong.
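
The "start them individually" variant could be sketched roughly as below: start each unit in order and stop at the first one that fails, so it is clear where to look in the journal. The function name and the SYSTEMCTL override are assumptions made for illustration; only the unit names come from this post.

```shell
# SYSTEMCTL can be overridden (for example SYSTEMCTL=echo) for a dry run.
# This knob is an illustrative assumption, not a Rockstor feature.
SYSTEMCTL=${SYSTEMCTL:-systemctl}

# Start each unit in the order given; stop at the first one that fails.
start_in_order() {
    for unit in "$@"; do
        if $SYSTEMCTL start "$unit"; then
            echo "started: $unit"
        else
            echo "failed: $unit"
            return 1
        fi
    done
}
```

Possible usage, run as root: `start_in_order rockstor-build rockstor-pre rockstor || journalctl -xe --no-pager | tail -n 50`, which prints the tail of the journal only if one of the units refuses to start.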

This might be an issue with your Rockon Root. You mentioned you did some cleanup, did that include snapshots? If so, you might have inadvertently deleted some of the Rockon Root snapshots, which would lead to docker failing. If that is the case, then your best bet is to recreate the rockon root and associate it with the Rockon service again.

But that’s just one thought.

@Hooverdan hi again as well!

Sorry that I always pop in when there’s trouble.

Your suggestion to update did the trick. I followed the instructions, reinstalled rockstor (because for some reason it could not find the rockstor-bootstrap service) and everything came alive again!

Now checking the disks for errors, but I think they’ll be ok.

No idea what happened or why rockstor went offline. I have some other services that went offline as well, which is odd, since they are on separate machines, but that’s beyond the scope of this post :)

Thanks again for your help! Still running and loving Rockstor! I suggest it whenever I can :D
