Unable to add Disk

Hey guys - first off, search on the forum isn't working… or I'm dumb :wink:

Now on to the reason I am here… I have installed a new SATA disk, and after getting it to appear (needed a reboot…) I am not able to add it to a pool. When I try to add it, I see

“mdraid member (UI pending)”

Not sure how to fix this…

@Ben_Abecassis Welcome to the Rockstor community.

I can chip in on this one:

We have a CSS / theme issue of sorts, so the normally visible magnifying glass icon for search is black and so becomes invisible. If you mouse over the top right you should see it. I’ll try and have a look at this soon.

The “Rescan” button at the bottom of the Disk page should have helped here, but yes, sometimes hardware can be tricky when live plugged.

This is currently ‘by design’ as we don’t cater for mdraid except in a very limited way, and so we currently just show the:

message whenever we encounter one. If you are certain that this disk’s prior life is over, then you can wipe its mdraid personality, along with all its prior data, via the instructions in the following issue we have open to improve the docs on this element of using previously used mdraid disks:

But do note that this will wipe the disk of all its prior data. This disk was likely in a NAS beforehand which used the mdraid software raid system. Rockstor only caters for the btrfs raid/file system, hence the requirement to wipe and start afresh.

Of particular importance when you do this, as root in a local terminal or via an ssh session to your Rockstor machine, is that you select the correct disk.
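
If in doubt, commands along these lines can help confirm which disk is which before touching anything (standard util-linux tools; the exact columns available may vary with your lsblk version):

lsblk -o NAME,SIZE,MODEL,SERIAL
ls -la /dev/disk/by-id/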

In your case the disk name is displayed in the picture you posted, and so its full by-id name (the naming Rockstor uses) is as follows:

/dev/disk/by-id/ata-WDC_WD20EARS-00MVWB0...

So if you use that name (the TAB key should auto-complete it) and make absolutely certain it is the correct disk, then you can use the “mdadm --zero-superblock your-dev-here” command, after first stopping what is likely to be /dev/md0, and you should be good. The disk will then no longer have this ‘special’ superblock and so should be usable by Rockstor.
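
As a rough sketch only (the md device name and the disk path below are placeholders, and the member you zero may be the whole disk or a partition on it, so substitute your own names and triple check them first):

# stop the assembled array so its member can be written to
mdadm --stop /dev/md0
# destructive: wipe the mdraid superblock from the member device
mdadm --zero-superblock /dev/disk/by-id/ata-YOUR-DISK-HERE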

Hope that helps and take a look at that issue for a little background. And if you are at all unsure of what you are doing here then do ask for more explicit instructions as you will be destroying prior data on whatever disk you specify.


Thanks! Small issue -

[root@rockstor ben]# mdadm --stop /dev/md0                                                                                     
mdadm: error opening /dev/md0: No such file or directory  

The "md"s I have are

[root@rockstor ben]# ls /dev/md*                                                                                               
/dev/md124  /dev/md125  /dev/md126  /dev/md127      

How can I know which one to use?

@Ben_Abecassis

It may well be all of them, but to be sure, first post the output of:

ls -la /dev/disk/by-id/

so we can see how the by-id names map to the canonical temporary names.

And then from the output of the following command:

cat /proc/mdstat 

we should be able to see which drives are associated with which /dev/md* (Multi Device) software raid devices.

I suspect they are all associated only with the drive previously indicated in the Rockstor Web-UI; there may also be a missing drive listed there if the disk came from a set.

Hopefully that should help. If they are all associated with the same device, that device translates to the by-id name given in Rockstor’s Web-UI, and you are happy to lose all data on it, then you can proceed to wipe it by stopping all associated md devices and then executing that --zero-superblock command with either the by-id name or the short temporary name of the problem drive.

Hope that helps. And if in doubt then just post the output of those commands here to help others see what the situation is.
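
For example, once /proc/mdstat has shown you the member names, something along these lines lists which by-id names point at that same disk (sdX here is a placeholder for whatever member device shows up):

cat /proc/mdstat
ls -la /dev/disk/by-id/ | grep sdX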

[ben@rockstor ~]$ ls -la /dev/disk/by-id                                                                                       
total 0                                                                                                                        
drwxr-xr-x 2 root root 560 Oct 24 23:59 .
drwxr-xr-x 8 root root 160 Oct 24 23:59 ..
lrwxrwxrwx 1 root root   9 Oct 24 23:59 ata-SAMSUNG_HD103UJ_S13PJDWS111499 -> ../../sde
lrwxrwxrwx 1 root root   9 Oct 24 23:59 ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513 -> ../../sdf
lrwxrwxrwx 1 root root  10 Oct 24 23:59 ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part1 -> ../../sdf1
lrwxrwxrwx 1 root root  10 Oct 24 23:59 ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part2 -> ../../sdf2
lrwxrwxrwx 1 root root  10 Oct 24 23:59 ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part3 -> ../../sdf3
lrwxrwxrwx 1 root root  10 Oct 24 23:59 ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part4 -> ../../sdf4
lrwxrwxrwx 1 root root   9 Oct 24 23:59 scsi-35000cca51fd5b7e8 -> ../../sda
lrwxrwxrwx 1 root root  10 Oct 24 23:59 scsi-35000cca51fd5b7e8-part1 -> ../../sda1
lrwxrwxrwx 1 root root  10 Oct 24 23:59 scsi-35000cca51fd5b7e8-part2 -> ../../sda2
lrwxrwxrwx 1 root root  10 Oct 24 23:59 scsi-35000cca51fd5b7e8-part3 -> ../../sda3
lrwxrwxrwx 1 root root   9 Oct 24 23:59 scsi-35000cca715cf2506 -> ../../sdb
lrwxrwxrwx 1 root root   9 Oct 24 23:59 scsi-350024e90022b7aaa -> ../../sdc
lrwxrwxrwx 1 root root   9 Oct 24 23:59 scsi-35f8db4c181603f4a -> ../../sdd
lrwxrwxrwx 1 root root   9 Oct 24 23:59 wwn-0x50000f0000db6db9 -> ../../sde
lrwxrwxrwx 1 root root   9 Oct 24 23:59 wwn-0x5000cca51fd5b7e8 -> ../../sda
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x5000cca51fd5b7e8-part1 -> ../../sda1
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x5000cca51fd5b7e8-part2 -> ../../sda2
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x5000cca51fd5b7e8-part3 -> ../../sda3
lrwxrwxrwx 1 root root   9 Oct 24 23:59 wwn-0x5000cca715cf2506 -> ../../sdb
lrwxrwxrwx 1 root root   9 Oct 24 23:59 wwn-0x50014ee2b088f39e -> ../../sdf
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x50014ee2b088f39e-part1 -> ../../sdf1
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x50014ee2b088f39e-part2 -> ../../sdf2
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x50014ee2b088f39e-part3 -> ../../sdf3
lrwxrwxrwx 1 root root  10 Oct 24 23:59 wwn-0x50014ee2b088f39e-part4 -> ../../sdf4
lrwxrwxrwx 1 root root   9 Oct 24 23:59 wwn-0x50024e90022b7aaa -> ../../sdc
lrwxrwxrwx 1 root root   9 Oct 24 23:59 wwn-0x5f8db4c181603f4a -> ../../sdd

[ben@rockstor ~]$ cat /proc/mdstat                                                                                             
Personalities :                                                                                                                
md125 : inactive sdf1[0](S)                                                                                                    
      1959872 blocks                                                                                                           


md126 : inactive sdf3[0](S)                                                                                                    
      987904 blocks                                                                                                            


md127 : inactive sdf2[0](S)                                                                                                    
      256896 blocks                                                                                                            


unused devices: <none>                                                                                                         

So if I read this correctly, sdf is the actual drive… but when I try to zero them, it doesn't work

[root@rockstor ben]# mdadm --stop /dev/md125                                                                                   
mdadm: stopped /dev/md125                                                                                                      
[root@rockstor ben]# mdadm --stop /dev/md126                                                                                   
mdadm: stopped /dev/md126                                                                                                      
[root@rockstor ben]# mdadm --stop /dev/md127                                                                                   
mdadm: stopped /dev/md127                                                                                                      
[root@rockstor ben]# mdadm --zero-superblock /dev/md125                                                                        
mdadm: Couldn't open /dev/md125 for write - not zeroing                                                                        
[root@rockstor ben]# mdadm --zero-superblock /dev/md126                                                                        
mdadm: Couldn't open /dev/md126 for write - not zeroing                                                                        
[root@rockstor ben]# mdadm --zero-superblock /dev/md127                                                                        
mdadm: Couldn't open /dev/md127 for write - not zeroing

@Ben_Abecassis It looks like you are almost there.

From a quick look you have successfully stopped the /dev/md devices, but you have not specified the correct device for the --zero-superblock.

If you have a look at the original issue:

One stops the md devices (hence their no longer being available) and then zeros the superblock on each of their members.
And as you say, it looks like you have only the one member drive, sdf.

But do take care, as the link between by-id names and the canonical /dev/sd* names can change from one boot to the next, so do double check that sdf is the correct drive.
If you haven’t rebooted then it should still be the same one.

So in short, you ran the zero-superblock commands on the assembled multi devices that you had already stopped, but you need to run that command on the md members themselves (here the sdf partitions), once the md devices that the disk backs have been stopped.

Hope that helps.

Also note that you should be able to use the by-id name, which doesn’t change from boot to boot, with the zero superblock command. At least I strongly suspect so :slight_smile:
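
Going by your earlier output, something along these lines should do it (double check those by-id names still point at the intended disk before running, as this is destructive):

# stop the arrays again if they have re-assembled since (names from your /proc/mdstat)
mdadm --stop /dev/md125
mdadm --stop /dev/md126
mdadm --stop /dev/md127
# zero the mdraid superblock on each member partition, not on the md device itself
mdadm --zero-superblock /dev/disk/by-id/ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part1
mdadm --zero-superblock /dev/disk/by-id/ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part2
mdadm --zero-superblock /dev/disk/by-id/ata-WDC_WD20EARS-00MVWB0_WD-WCAZA7486513-part3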

Couldn't get that to work… then I realized that since I didn't care about the data, there was no reason to keep the partition table that was giving me this headache. I destroyed the partitions and Rockstor was happy, and I was able to add the disk to a pool :slight_smile: Thanks!


@Ben_Abecassis Glad you finally got this sorted, and thanks for the update.

If it was only the partition table that was upsetting things then the Web-UI would have been able to help you there. Anyway, you are sorted now, which is great. We have to be extra careful with deletes and so tend to tread lightly. Ideally all arriving drives are wiped of all prior ‘lives’; ex-mdraid and ex-ZFS members are the most problematic, as it goes. All others should be wipeable from within Rockstor’s Web-UI via the cog that appears next to them.
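
For future reference, if doing that sort of whole-disk wipe by hand, a command along these lines removes the partition table and all known signatures from the named device in one go (destructive, and the device name here is only an example, so triple check it first):

wipefs -a /dev/disk/by-id/ata-YOUR-DISK-HERE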

Thanks again for the update.