SMB Shares 'hanging'?

Sorry for the piecemealing here. Finally, I know you mentioned you’re using NFS to connect linux machines to Rockstor. After looking for obvious log entries to see whether you can identify what’s going on, may be you can also use one of your linux instances to connect to Rockstor using samba and see whether you’re also running into that timeout/hanging issue. This might help to identify whether it’s a server vs. client issue …
I assume, for your windows clients you’ve also been doing the normal update/upgrade maintenance over time, so it will be reasonable to assume that those are fairly updated with the most recent patches.

gives 3.1.1 under dialect at my windows machine.

gives the following error:

zypper info samba

Retrieving repository ‘Rockstor-Stable’ metadata …[error]
Repository ‘Rockstor-Stable’ is invalid.
[Rockstor-Stable|http://b7679531-0bbc-d8f9-536b-a85e45604eb0@updates.rockstor.co m:8999/rockstor-stable/leap/15.4?credentials=/etc/zypp/credentials.d/Rockstor-St able&auth=basic] Valid metadata not found at specified URL
History:

Please check if the URIs defined for this repository are pointing to a valid rep ository.
Warning: Skipping repository ‘Rockstor-Stable’ because of the above error.
Some of the repositories have not been refreshed because of an error.

And after that:

Information for package samba:

Repository : Update repository with updates from SUSE Linux Enterprise 15
Name : samba
Version : 4.15.13+git.636.53d93c5b9d6-150400.3.23.1
Arch : x86_64
Vendor : SUSE LLC https://www.suse.com/
Installed Size : 1.7 MiB
Installed : Yes (automatically)
Status : up-to-date
Source package : samba-4.15.13+git.636.53d93c5b9d6-150400.3.23.1.src
Upstream URL : https://www.samba.org/
Summary : A SMB/CIFS File, Print, and Authentication Server

gives the same error as above
and right after:

Information for package cifs-utils:

Repository : Update repository with updates from SUSE Linux Enterprise 15
Name : cifs-utils
Version : 6.15-150400.3.9.1
Arch : x86_64
Vendor : SUSE LLC https://www.suse.com/
Installed Size : 214.3 KiB
Installed : Yes
Status : up-to-date
Source package : cifs-utils-6.15-150400.3.9.1.src
Upstream URL : http://www.samba.org/linux-cifs/cifs-utils/
Summary : Utilities for doing and managing mounts of the Linux CIFS filesystem

is the same for me at 60 seconds.

Ok so now for the logs. What should I be looking for exactly here?

has many files!

1 Like

The same smb shares seem to be accessible from linux.
No timeouts!

2 Likes

It seems like the “stable channel” repository is active in your system. However, the current stable 4.1.1-0" only" runs under Leap 15.3, and the above error message indicates it’s looking for the rockstor-stable repository under Leap 15.4 (which it won’t find for now, since we’re still working through the Release Candidates on the testing channel before the next stable version is release in the stable channel, hopefully shortly).

You mentioned above that you have 4.5.8-0 installed. After the installation of 4.5.8 did you set the stable channel or the testing channel?

1 Like

ah, ok. That’s the discrepancy then. 4.5.8-0 is the prior Release Candidate from the testing channel. To avoid this error and get the next Release Candidate 4.5.9-1 with the latest fixes you should probably switch to the testing channel.
Once, the next stable release (soon) is out, you could then switch back to the stable channel. That way the error message during zypper updates (for all other Leap packages) will not occur anymore. I’ll let @phillxnet comment further on that.

On the files in the samba directory, I would look at the newest smbd and nmbd log files, as well (if you have them) new files that contain the windows computer name that you’re connecting to Rockstor from.

You can probably ignore looking at files with the extension xz and that contain date stamps, as far as I know these are part of the log rotation, and since the symptom you’re seeing occurs all the time, looking at the most recent ones (e.g. today’s date) is probably the easiest (using e.g. ls -ll to see the last modified dates)

2 Likes

@Sky12016 Re:

Nothing more to say really: bar the (soon) should be in the next few days where I’m hoping we are basically there on the next release which is likely to be 4.6.0-0. If all goes well that will also be ‘promoted’ the being our first stable release and thus we will, finally, be kicking of the next stable channel. We didn’t want to populate the 15.4 repo before we had a ‘proper’ stable release to pop in there. Hence it’s just not htere yet. I should likely have created an actual empty repo but but given we sign our repos and it’s all automated during the publishing of our also signed rpms it actually a little tricky: plus we are so nearly there now: fingers crossed. When will publish the next Release Candidate (RC7) into testing there will be an accompanying post here:

Hope that helps,and my apologies for the non-existent stable repo situation, things have just taken longer than intended. But we are a hairs width away now :slight_smile: .

2 Likes

Well; from the log named after the windows machine I am getting the timeouts, I have hundreds of these errors:

[2023/05/29 22:41:56.491468, 3] …/…/source3/smbd/smb2_server.c:3956(smbd_smb2_request_error_ex)
smbd_smb2_request_error_ex: smbd_smb2_request_error_ex: idx[1] status[NT_STATUS_OBJECT_NAME_NOT_FOUND] || at …/…/source3/smbd/smb2_create.c:337

and at the end:

[2023/05/29 22:41:58.885344, 3] …/…/source3/smbd/smb2_server.c:3956(smbd_smb2_request_error_ex)
smbd_smb2_request_error_ex: smbd_smb2_request_error_ex: idx[1] status[STATUS_NO_MORE_FILES] || at …/…/source3/smbd/smb2_query_directory.c:160
[2023/05/29 22:42:09.049905, 3] …/…/source3/smbd/service.c:1131(close_cnum)
windows-pc (ipv4:ipaddress:1969) closed connection to service IPC$

In another log from a different windows machine I find different errors:

[2023/05/29 17:45:13.636038, 3] …/…/lib/util/util_net.c:257(interpret_string_addr_internal)
interpret_string_addr_internal: getaddrinfo failed for name RockstorNAS (flags 34) [Temporary failure in name resolution]
[2023/05/29 17:45:13.636082, 3] …/…/source3/lib/util_sock.c:1036(get_mydnsfullname)
get_mydnsfullname: getaddrinfo failed for name RockstorNAS [Unknown error]
[2023/05/29 17:45:13.565541, 3] …/…/source3/smbd/smb2_server.c:3956(smbd_smb2_request_error_ex)
smbd_smb2_request_error_ex: smbd_smb2_request_error_ex: idx[1] status[NT_STATUS_OBJECT_NAME_NOT_FOUND] || at …/…/source3/smbd/smb2_create.c:337
[2023/05/29 17:45:13.568645, 3] …/…/source3/smbd/smb2_server.c:3956(smbd_smb2_request_error_ex)
smbd_smb2_request_error_ex: smbd_smb2_request_error_ex: idx[1] status[NT_STATUS_ACCESS_DENIED] || at …/…/source3/smbd/smb2_tcon.c:151
[2023/05/29 21:47:40.720251, 3] …/…/source3/smbd/smb2_server.c:3956(smbd_smb2_request_error_ex)
smbd_smb2_request_error_ex: smbd_smb2_request_error_ex: idx[1] status[NT_STATUS_CANCELLED] || at …/…/source3/smbd/smb2_notify.c:126

I also see that there are two log files that are named after the name computer but the one has the local ip address and the other the hostname. The one with the ip address contains multiple appearances of the following:

[2023/05/29 18:19:55.245479, 3] …/…/lib/util/util_net.c:257(interpret_string_addr_internal)
interpret_string_addr_internal: getaddrinfo failed for name RockstorNAS (flags 34) [Name or service not known]
[2023/05/29 18:19:55.245571, 3] …/…/source3/lib/util_sock.c:1036(get_mydnsfullname)
get_mydnsfullname: getaddrinfo failed for name RockstorNAS [Unknown error]

and there are also some login errors:

[2023/05/29 22:05:24.299035, 2] …/…/source3/auth/auth.c:348(auth_check_ntlm_password)
check_ntlm_password: Authentication for user [nistsio] → [nistsio] FAILED with error NT_STATUS_WRONG_PASSWORD, authoritative=1

which are incomprehensible since the user/password combination is correct and moments later the login goes through!

2 Likes

A quick thought that may deserve to be ruled out:
How is the windows machine showing this issue connected to your LAN?

The reason I’m asking is that these errors and the symptoms you describe seem to indicate a connection failure between the client and server, and I know Windows can sometimes be aggressive in disabling some devices for the sale of power savings.
I’m especially wondering about Ethernet/wifi/USB adapter, etc…

It’s a long shot but worth ruling out now.

EDIT: similarly, any relevant detail on your network could prove useful: local DNS server or anything of the sort, by any chance?

2 Likes

is actually not an error, it means success … I believe it essentially just states that no more files were found when looking in a given directory, after listing all the existing ones (like an eof within a file).

the NT_STATUS_OBJECT_NAME_NOT_FOUND had some bug associated with it last year, that was apparently fixed in the 4.14 Samba version (not sure this is the same thing), but in some other instances they apparently pointed to some permission issues not being visible correctly in samba or messed up in the actual share… I assume, the files on your samba shares have the “appropriate” user/group setups, so a logged in user has the correct access and is not timing out. Though this is a long shot, since you’ve mentioned that this is an intermittent state and then you have access like usual.

So, back to @Flox question whether you have anything specific set up in your network that might interfere …

1 Like

Sorry but I am a little confused as to what I should do in order to get rid of the error regarding the rockstor-stable repository.

you can switch to the testing channel for now. Once the stable release is announced and out, you can switch back to the stable channel.

1 Like

Hi @Flox

My network topology is quite simple and nothing has changed in that front.

VDSL Gateway modem → OPNSense router (dhcp server) → pihole (local dns server) → LAN

Rockstor and everything else I am troubleshooting, is connected via Ethernet cat5e cable.

1 Like

Are you connecting to the SMB shares with Rockstor’s IP address or the Rockstor servername?

If with servername, I found using the wsdd (web service discovery service) Rockon helpful in keeping the Rockstor appliance visible across the network for windows-based machines. Again, would not explain why you had no issues with the CentOS version of Rockstor …

I have tried both (hostname & IP) and it does not make a difference.
Currently I am testing the “Local DNS records” found in pi-hole where I added some of my local machines with static ips.

I 'll let you know how it goes.

How will I be informed that the stable release is out in order to switch back to stable channel updates?

Isn’t this a bit risky in a way that if I don’t catch up on the news in time my server may get updates from the testing channel?

@Sky12016 yes, I’m curious whether pi-hole will help in this regard.

There will be an announcement on the forums when the next stable release comes out, so you should see this very quickly. Once that’s out, it will take at least a couple of week for the test channel to be updated again…

But since that is probably fairly imminent you can continue to live with the one error message there if you don’t want to keep track of the announcement/switching, etc.

2 Likes

Hello people!

In case anyone stumbles upon this post; my problem is finally solved.
I did add the rockstor machine hostname into pihole’s local dns records, but apart from that I found out that one of my windows machines had a rockstor server path mapped as a network drive using the old hostname! When I removed that everything is working as it should!
I am not sure whether other windows machines (without using the old hostname) should have been impacted by that but judging from the result I am leaning towards yes.

So that is that! The hostname change from the old to the new Rockstor installation was the culprit.

Thank you all for bearing with me.
Cheers!

3 Likes

Ahh, That’s happen with Pihole issues… I never use those… In fact I have Windows Servers (2 of them) and WIndows PC and Mac, I have no trouble since 6 month… Little thing need tweaking day by day…

ATG2004

Hi @atg2004.

Can’t say I agree. My pihole has been rock solid for 3 years running!
Only updating it whenever I remember…

if the host wasnt in pi-hole the windows machines were probably using multicast dns, which is not that reliable or predictable, makes perfect sense.