I’ve got a couple of strange issues presenting on my system recently. I haven’t been able to find anyone else bringing it up anywhere else, so it’s very much a possibility that its something that I’ve unintentionally caused, since I’m not super familiar with Linux, although I’ve been trying to use the cli as much as possible for a couple years now anyway to get familiar with it.
Relevant setup info:
System: Rockstor 3.9.2-57
Kernel: 5.9.10-1.el7.elrepo.x86_64
Intel Celeron 1037U Dual-core 4-BAY NAS with Intel HM77 chipset
Hardware RAID 0 / 1 / 5 / 10
8GB DDR3L 1600 RAM
SATA III
128GB System SSD, 4 x 4TB WD Reds
Dual Bonded/Teamed Gigabit NICs
btrfs: v5.9
Note: I would update to openSUSE, but I haven’t been able to find any easy to follow instructions on how to do it yet… and I don’t know enough about how building from source works to be able to come up with the commands on my own. also - I needed a newer version of btrfs because my data was using RAID1C3 and the version that came with rockstor refused to read it, so that’s why I ended up needing to build that. lol like I said, I can follow instructions…
Anyway, so the first issue is that there were update for dnsmasq, net-snmp and a couple associated net-snmp library packages that were released recently. I’m subscribed to the stable update channel and as a result my system was automatically installing these packages.
Somehow, after it did my system would become unresponsive every time (I primarily access via samba linked to my windows desktop). My very limited troubleshooting skills discovered that the new packages were causing several services to fail right from boot, including network.service, smbd.service, and probably about 4-6 more .services that I don’t recall offhand. The web interface still worked however, and samba would be disabled every time and couldn’t be turned back on unless I used the cli.
This would happen every time I tried to use the server, and the only way I found to fix it was by rolling back the updated packages via the yum history feature. After that it would work fine again - until it decided to install those new packages again. Rinse, repeat.
I eventually got irritated with having to dick around with my server every time I wanted to use it, so I added an exclusion for those packages in the /etc/yum.conf file, which seemed to fix the issue. A bit of a band-aid solution, but it works. Just wanted to bring that to your attention, anyway.
The second issue I haven’t found a solution for, and that’s that my server is endlessly spawning the sendmail and postdrop executables. They are remaining in memory until the server essentially grinds to a halt. The web interface is SO slow (but works, sometimes).
When I ssh into the box and run ps -A I find a list 3/4 of a mile long, mostly “sendmail” and “postdrop” processes, although there is a doubling up of multiple other executables as well (nfsd shows up 8 or 9 times, postgres about 15 times, grep here and there, a bunch of kworker threads and various others). If i issue a pkill sendmail command - and then a similar one for postdrop - all of a sudden my web interface works great and all is well. I tried counting how many there were one time after I had just cleared it about 10 mins before, and it already had 26 new instances of both sendmail and postdrop.
I have tried to de-register or remove every email reporting feature I could find but they still keep showing up. I suspect it has something to do with the netdata rockon, because I’ve been receiving nearly 500 emails/day from netdata saying “cant connect” “connection restored” “entropy low” “disk use 10 mins alarm” etc etc etc.
I don’t care about the so-called issues it’s reporting, but I haven’t been able to figure out how to turn all those “alarms” off anywhere on the netdata UI, and I don’t have enough experience with docker yet to know how to track down the config file within the rockon container… What I do care about is the fact that my processor sits there running at full tick with all these processes that don’t ever seem to die, and the higher than normal system temperatures that result from it. So, I’ve been having endless fun with that the past few weeks anyway…
Like I said, its probably something I did to it out of sheer ignorance, but any ideas on how to fix it would be appreciated because I’m not finding any success with it.