DiskSuite Fail in SunOS 5.9 - Systems Maintenance(Archived)

Hello,
I get this error when I run check_disksuite.sh.
"Mirror d3 is in a wrong state / Submirror d13 is in a wrong state."
SunOS 5.9
What does this message mean?
How can I fix it?
Thanks 

Whats metastat show 

the metastat command show this:
root#... # metastat
d3: Mirror
Submirror 0: d13
State: Needs maintenance
Submirror 1: d23
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 71689920 blocks (34 GB)
d13: Submirror of d3
State: Needs maintenance
Invoke: metareplace d3 c1t0d0s3 <new device>
Size: 71689920 blocks (34 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s3 0 No Maintenance Yes
d23: Submirror of d3
State: Okay
Size: 71689920 blocks (34 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s3 0 No Okay Yes
d1: Mirror
Submirror 0: d11
State: Okay
Submirror 1: d21
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 20484288 blocks (9.8 GB)
d11: Submirror of d1
State: Okay
Size: 20484288 blocks (9.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s1 0 No Okay Yes
d21: Submirror of d1
State: Okay
Size: 20484288 blocks (9.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s1 0 No Okay Yes
d0: Mirror
Submirror 0: d10
State: Okay
Submirror 1: d20
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 50798592 blocks (24 GB)
d10: Submirror of d0
State: Okay
Size: 50798592 blocks (24 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s0 0 No Okay Yes
d20: Submirror of d0
State: Okay
Size: 50798592 blocks (24 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s0 0 No Okay Yes
Device Relocation Information:
Device Reloc Device ID
c1t1d0 Yes id1,sd#SFUJITSU_MAT3073N_SUN72G_000531B097BN____AAN0P58097BN
c1t0d0 Yes id1,sd#SFUJITSU_MAT3073N_SUN72G_000531B099BS____AAN0P58099BS
root#... # 

check disk c1t0d0 for errors, replace if bad, if it looks good then run this command:
metareplace -e d3 c1t0d0s3 

Sorry but my Solaris level is very bad. What command can I use for check the disk? 

iostat -en | grep c1t0d0
will show soft, hard, transport, total errors since the last reboot
you could also try
format
enter the appropriate number, then
analyze
read (doesn't harm SunOS) 

You can also check the /var/adm/messages* file for errors related to the disk.
You can also run 'metastat' and see if there are any other failed metadevices belonging to the same disk, if multiple metadevices have failed you probably have a faulty harddrive.
Another option is to simply run the metareplace command stated above; if the device fails again within a short period of time its probably time to change the harddrive..
.7/M.

Related

Done mirroring, aliasing in OPB; can't boot from second drive

Hi,
Configured mirroring and while testing whether we can boot off from the mirror drive, it says �Can't locate boot device� I've done required aliasing in Open Boot PROM.
-----------
# ls -al /dev/dsk/c0t1d0s0
lrwxrwxrwx 1 root root 43 Sep 28 14:08 /dev/dsk/c0t1d0s0 -> ../../devices/pci#1c,600000/scsi#2/sd#1,0:a
Rebooting with command: boot mirrordisk
Boot device: /pci#1c,600000/scsi#2/sd#1,0:a File and args:
Evaluating:
Can't locate boot device
{0} ok boot
------------
Now, what might be the problem from booting off from second hard drive? Here follows the metastatus output:
------------
# metastat
d5: Mirror
Submirror 0: d15
State: Okay
Submirror 1: d25
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 18434709 blocks (8.8 GB)
d15: Submirror of d5
State: Okay
Size: 18434709 blocks (8.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t0d0s5 0 No Okay Yes
d25: Submirror of d5
State: Okay
Size: 18434709 blocks (8.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t1d0s5 0 No Okay Yes
d3: Mirror
Submirror 0: d13
State: Okay
Submirror 1: d23
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 4194828 blocks (2.0 GB)
d13: Submirror of d3
State: Okay
Size: 4194828 blocks (2.0 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t0d0s3 0 No Okay Yes
d23: Submirror of d3
State: Okay
Size: 4194828 blocks (2.0 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t1d0s3 0 No Okay Yes
d1: Mirror
Submirror 0: d11
State: Okay
Submirror 1: d21
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 18434709 blocks (8.8 GB)
d11: Submirror of d1
State: Okay
Size: 18434709 blocks (8.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t0d0s1 0 No Okay Yes
d21: Submirror of d1
State: Okay
Size: 18434709 blocks (8.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t1d0s1 0 No Okay Yes
d0: Mirror
Submirror 0: d10
State: Okay
Submirror 1: d20
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 17409114 blocks (8.3 GB)
d10: Submirror of d0
State: Okay
Size: 17409114 blocks (8.3 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t0d0s0 0 No Okay Yes
d20: Submirror of d0
State: Okay
Size: 17409114 blocks (8.3 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t1d0s0 0 No Okay Yes
d7: Mirror
Submirror 0: d17
State: Okay
Submirror 1: d27
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 12289806 blocks (5.9 GB)
d17: Submirror of d7
State: Okay
Size: 12289806 blocks (5.9 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t0d0s7 0 No Okay Yes
d27: Submirror of d7
State: Okay
Size: 12289806 blocks (5.9 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c0t1d0s7 0 No Okay Yes
Device Relocation Information:
Device Reloc Device ID
c0t1d0 Yes id1,sd#SSEAGATE_ST336607LSUN36G_3JA2B1AF000074033N6L
c0t0d0 Yes id1,sd#SSEAGATE_ST336607LSUN36G_3JA2B5BH000074033C3Z
--------------------
Please shed some light what I'm missing, thanks.
TIA
--
Raqueeb Hassan
Bangladesh 
Hello,
You must replace the majorname sd with disk.
/pci#1c,600000/scsi#2/sd#1,0:a
/pci#1c,600000/scsi#2/disk#1,0:a
Regards. 
Thanks very much for the pointer. It works.
I followed "Configuring Boot Disks With Solaris� Volume Manager Software" from blueprint, they haven't talked about "sd" when it shows "disk" instead.
My show-disks says;
{1} ok show-disks
a) /pci#1c,600000/scsi#2,1/disk
b) /pci#1c,600000/scsi#2/disk
c) /pci#1e,600000/ide#d/cdrom
d) /pci#1e,600000/ide#d/disk
q) NO SELECTION
Thanks once again.
--
Raqueeb Hassan
Bangladesh

Disk Errors

Hi,
I have the following output from metastat:
$ metastat
d14: Mirror
Submirror 0: d24
State: Okay
Submirror 1: d34
State: Needs maintenance
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 119751168 blocks (57 GB)
d24: Submirror of d14
State: Okay
Size: 119751168 blocks (57 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s4 0 No Okay Yes
d34: Submirror of d14
State: Needs maintenance
Invoke: metareplace d14 c1t1d0s4 <new device>
Size: 119751168 blocks (57 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s4 0 No Maintenance Yes
d11: Mirror
Submirror 0: d21
State: Okay
Submirror 1: d31
State: Needs maintenance
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 8201856 blocks (3.9 GB)
d21: Submirror of d11
State: Okay
Size: 8201856 blocks (3.9 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s1 0 No Okay Yes
d31: Submirror of d11
State: Needs maintenance
Invoke: metareplace d11 c1t1d0s1 <new device>
Size: 8201856 blocks (3.9 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s1 0 No Maintenance Yes
d10: Mirror
Submirror 0: d20
State: Okay
Submirror 1: d30
State: Needs maintenance
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 14337984 blocks (6.8 GB)
d20: Submirror of d10
State: Okay
Size: 14337984 blocks (6.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s0 0 No Okay Yes
d30: Submirror of d10
State: Needs maintenance
Invoke: metareplace d10 c1t1d0s0 <new device>
Size: 14337984 blocks (6.8 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s0 0 No Maintenance Yes
Device Relocation Information:
Device Reloc Device ID
c1t1d0 Yes id1,sd#SFUJITSU_MAT3073N_SUN72G_000512B02CEA____AAN0P5302CEA
c1t0d0 Yes id1,sd#SFUJITSU_MAT3073N_SUN72G_000514B02TC6____AAN0P5402TC6
Can anyone help me with were i start to correct this? 
Well it looks like 1t1d0 has failed.
What does this disk look like in format?
Are there any errors in /var/adm/messages?

fsck / mirrored

Hi,
I'm getting the following error in /var/adm/messages:
Mar 23 12:17:06 aicprod0 ufs: [ID 879645 kern.notice] NOTICE: /: unexpected free inode 135961, run fsck(1M) -o f
I need to run fsck but I want to be sure that I'm going
to do the right procedure.
The system is mirrored with solstice disksuite.
d10: Mirror
Submirror 0: d11
State: Okay
Submirror 1: d12
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 54122526 blocks (25 GB)
d11: Submirror of d10
State: Okay
Size: 54122526 blocks (25 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t0d0s0 0 No Okay Yes
d12: Submirror of d10
State: Okay
Size: 54122526 blocks (25 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
c1t1d0s0 0 No Okay Yes
I will do:
- boot cdrom -s
- metadetach d10 d11 --> is this right? do i have to detach primary or secondary disk?
- fsck /dev/rdsk/c1t0d0s0
- metattach
reboot
Thanks
Tarek 
Why dont you do a fsck on /dev/md/dsk/d10.
For my part I would not do a metadetach

Recovering /var/ and /usr using disk suite

Hello all:
How do I recover /var/ and /usr using disk suite. A snippet of metastat is below. Do I just issue a metareplace or metarecover?
Thanks for your help.
Ezra
d30: Mirror
Submirror 0: d31
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 20480121 blocks (9.8 GB)
d31: Submirror of d30
State: Okay
Size: 20480121 blocks (9.8 GB)
Stripe 0:
     Device Start Block Dbase State Reloc Hot Spare
     c1t0d0s3 0 No Okay Yes
d40: Mirror
Submirror 0: d41
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 20480121 blocks (9.8 GB)
d41: Submirror of d40
State: Okay
Size: 20480121 blocks (9.8 GB)
Stripe 0:
     Device Start Block Dbase State Reloc Hot Spare
     c1t0d0s4 0 No Okay Yes
d42: Concat/Stripe
Size: 20480121 blocks (9.8 GB)
Stripe 0:
     Device Start Block Dbase     Reloc
     c1t1d0s4 0 No      Yes
d32: Concat/Stripe
Size: 20480121 blocks (9.8 GB)
Stripe 0:
     Device Start Block Dbase     Reloc
     c1t1d0s3 0 No      Yes 
Note:
c1t1d0 is the failed device.
Ezra 
Your snippet doesn't show anything wrong, so I can't understand why you're asking about recovery. Are you talking about fixing something or are you thinking of some failure scenario that hasn't happened yet? You also don't show any output as to what we're looking at. I'm assuming from your subject that /var and /usr are mounted on d30 and d40, but you might want to confirm that.
Also the output suggests that you've started setting up mirrors, but haven't done so. All the data is on c1t0 and nothing has been copied to c1t1 yet.
There's no redundant data shown here. So any failure in c1t0 will be fatal and unrecoverable from a SVM standpoint. You'd need to add the second half of the mirrors.
--
Darren

metareplace not working on 2-node cluster

Hi all, I have a 2-node Solaris cluster running Solaris 10 11/06 that has a submirror in "Needs maintenance" state. Here's the output I'm getting:
# /usr/sbin/metastat -s u02
u02/d100: Mirror
Submirror 0: u02/d101
State: Okay
Submirror 1: u02/d102
State: Needs maintenance
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 209694720 blocks (99 GB)
u02/d101: Submirror of u02/d100
State: Okay
Size: 209694720 blocks (99 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
d11s0 0 No Okay No
u02/d102: Submirror of u02/d100
State: Needs maintenance
Invoke: metareplace u02/d100 d16s0 <new device>
Size: 209694720 blocks (99 GB)
Stripe 0:
Device Start Block Dbase State Reloc Hot Spare
d16s0 0 No Maintenance No
Device Relocation Information:
Device Reloc Device ID
d16 No -
d11 No -
When I try the metareplace cmd, I get an error as follows:
# /usr/sbin/metareplace -s u02 -e u02/d100 d16s0
metareplace: server1: /dev/md/u02/rdsk/d16s0: No such file or directory
How do I use the metareplace cmd to fix the submirror that's in needs maintenance state?
Thanks in advance,
Stewart 
I've just had a quick scan of the bugs and you may need:
/usr/sbin/metareplace -s u02 -e u02/d100 /dev/did/rdsk/d16s0
instead of what it suggests, if I'm reading the bug correctly.
Tim
---

Categories

Resources