Unix Technical Forum

RAID recover on Sun StorEdge A1000

This is a discussion on RAID recover on Sun StorEdge A1000 within the Sun Solaris Hardware forums, part of the Solaris Operating System category; --> Hi! I've got raid0 on my Sun StorEdge A1000 under Sparc Solaris 8. I know, raid0 is not best ...


Go Back   Unix Technical Forum > Unix Operating Systems > Solaris Operating System > Sun Solaris Hardware

Register FAQ Members List Calendar Search Today's Posts Mark Forums Read
  #1 (permalink)  
Old 01-16-2008, 11:35 AM
Klim Samgin
 
Posts: n/a
Default RAID recover on Sun StorEdge A1000

Hi!

I've got raid0 on my Sun StorEdge A1000 under Sparc Solaris 8.
I know, raid0 is not best solution :-(
Recently I was not able to mount my StorEdge.
I saw red error light on one of the disks.

Raid Manager software 6.22.01.14
shows error for that disk:
"A drive failed because it experienced a write failure"

And it shows that my raid0 is dead

I dismounted this failed driver from array
and try to read its data on PC.

AND I CAN READ IT! I saw hex data!

I mounted drive back into array and turn it on.
And I've got green light on failed disk!

But I still can't mount array.

Raid Manager shows that failed
disk is "replcaced", needs format and raid0 is still dead.

Can somebody help me with questions:

1. It's the same disk, why is my array dead?
2. Is it possible to recover my data?
(for example, using dd command to dump disk,
diskedit to edit it and so on)
3. May be I need to update service information on disks header or
I need to edit information in array controller memory?

Klim.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #2 (permalink)  
Old 01-16-2008, 11:35 AM
Kurt Duncan
 
Posts: n/a
Default Re: RAID recover on Sun StorEdge A1000

"Klim Samgin" <klimsamgin@yahoo.com> wrote in message
news:f5cc7e25.0312030634.3edaf7b9@posting.google.c om...
> Raid Manager software 6.22.01.14
> shows error for that disk:
> "A drive failed because it experienced a write failure"
>


If the controller cannot write to a destination drive (as the message
indicates), the drive is spun down. With RAID-0, this means at least one
block of data did not get written to disk. If the system crashed at this
point, probably many blocks did not get written to disk. If there was an
easy way to trick the volume back into existence, you would have no
confidence that whatever data remained, was intact, nor where any missing
blocks might be.

Your best bet is to delete the LUN, recreate it, reformat, reload from
tape...
Backups are always a requirement, but doubly so on RAID-0.

And I'd think real hard about re-using the drive that failed. I wouldn't
ever use it in another RAID-0 LUN, but I'm not sure I'd be too happy with it
in any other configuration, either.


Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 01-16-2008, 11:36 AM
Klim Samgin
 
Posts: n/a
Default Re: RAID recover on Sun StorEdge A1000

"Kurt Duncan" <kurt.duncan@lsil.com> wrote in message news:<bqnluq$c0m$1@news.lsil.com>...
> "Klim Samgin" <klimsamgin@yahoo.com> wrote in message
> news:f5cc7e25.0312030634.3edaf7b9@posting.google.c om...
> > Raid Manager software 6.22.01.14
> > shows error for that disk:
> > "A drive failed because it experienced a write failure"
> >

>
> If the controller cannot write to a destination drive (as the message
> indicates), the drive is spun down. With RAID-0, this means at least one
> block of data did not get written to disk. If the system crashed at this
> point, probably many blocks did not get written to disk. If there was an
> easy way to trick the volume back into existence, you would have no
> confidence that whatever data remained, was intact, nor where any missing
> blocks might be.
>
> Your best bet is to delete the LUN, recreate it, reformat, reload from
> tape...
> Backups are always a requirement, but doubly so on RAID-0.
>
> And I'd think real hard about re-using the drive that failed. I wouldn't
> ever use it in another RAID-0 LUN, but I'm not sure I'd be too happy with it
> in any other configuration, either.


Hi!

Thank you for your answer.

I have halted Sun server with storedge in the evening
and I have turned them on in the morning.
The server can't mount storege than
and I saw this problem.
So, there were no writings when storedge crashes.

This storedge is using to store distributives, cdrom images and so on.
There were many readings, but few writings.
So, the storedge data was consistent at crash moment.

If I have a byte-to-byte copy of the failed drive,
from the same manufacturer, the same model and size, and so on,
why is array manager software marked it "replaced"?
I think, array manager software get this answer from storedge controller.
Why is storedge controller think so?
May be it checks raid disks headers, for example,
reads first sector of every disk, and looking for a label...

If so, I can mark every disk with same label
and I get recovered LUN.

What do you think about this possibility?

Klim.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #4 (permalink)  
Old 01-16-2008, 11:36 AM
Kenneth Sibley
 
Posts: n/a
Default Re: RAID recover on Sun StorEdge A1000

Klim,

If I remember correctly this contains an old LSI
controller. These units store a DAC Store header
on disks in the array. This identifies what drives
are in what LUNs and what volumes are in those
LUNs.

If you replaced the drive then the system is recognizing
that this drive identifier is missing (similar to the
private region that Veritas Volume Manager uses).

I think you are better off reloading from tape.

Ken S.

"Klim Samgin" <klimsamgin@yahoo.com> wrote in message
news:f5cc7e25.0312050010.51b5b456@posting.google.c om
"Kurt Duncan" <kurt.duncan@lsil.com> wrote in message
news:<bqnluq$c0m$1@news.lsil.com>...
> "Klim Samgin" <klimsamgin@yahoo.com> wrote in message
> news:f5cc7e25.0312030634.3edaf7b9@posting.google.c om...
> > Raid Manager software 6.22.01.14
> > shows error for that disk:
> > "A drive failed because it experienced a write failure"
> >

>
> If the controller cannot write to a destination drive (as the message
> indicates), the drive is spun down. With RAID-0, this means at least one
> block of data did not get written to disk. If the system crashed at this
> point, probably many blocks did not get written to disk. If there was an
> easy way to trick the volume back into existence, you would have no
> confidence that whatever data remained, was intact, nor where any missing
> blocks might be.
>
> Your best bet is to delete the LUN, recreate it, reformat, reload from
> tape...
> Backups are always a requirement, but doubly so on RAID-0.
>
> And I'd think real hard about re-using the drive that failed. I wouldn't
> ever use it in another RAID-0 LUN, but I'm not sure I'd be too happy with

it
> in any other configuration, either.


Hi!

Thank you for your answer.

I have halted Sun server with storedge in the evening
and I have turned them on in the morning.
The server can't mount storege than
and I saw this problem.
So, there were no writings when storedge crashes.

This storedge is using to store distributives, cdrom images and so on.
There were many readings, but few writings.
So, the storedge data was consistent at crash moment.

If I have a byte-to-byte copy of the failed drive,
from the same manufacturer, the same model and size, and so on,
why is array manager software marked it "replaced"?
I think, array manager software get this answer from storedge controller.
Why is storedge controller think so?
May be it checks raid disks headers, for example,
reads first sector of every disk, and looking for a label...

If so, I can mark every disk with same label
and I get recovered LUN.

What do you think about this possibility?

Klim.


Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply


Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT. The time now is 10:42 PM.


Powered by vBulletin® Version 3.6.5
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
SEO by vBSEO 3.2.0
www.UnixAdminTalk.com