• February 23, 2025, 10:28:21 PM
  • Welcome, Guest
Please login or register.

Login with username, password and session length
Advanced search  

News:

This Forum Beta is ONLY for registered owners of D-Link products in the USA for which we have created boards at this time.

Pages: [1] 2

Author Topic: SMART report question  (Read 14087 times)

irha

  • Level 1 Member
  • *
  • Posts: 18
SMART report question
« on: October 30, 2010, 10:06:25 PM »

My DNS-321 is showing "Abnormal" status for one of the drives in a RAID-1 configuration and it is a bit confusing which drive is failing. Under hard sdrive info, I see something like this:

Right WDC: ... Normal
Left WDC: ... Abnormal

When I click on "Abnormal", I get a report that says:
Item    Now     Worst   Thresh  Updated
1       Raw_Read_Error_Rate     1       1       51      302541
3       Spin_Up_Time    110     107     21      7466
4       Start_Stop_Count        100     100     0       247
5       Reallocated_Sector_Ct   200     200     140     0
7       Seek_Error_Rate 200     200     0       0
9       Power_On_Hours  99      99      0       825
10      Spin_Retry_Count        100     100     0       0
11      Calibration_Retry_Count 100     253     0       0
12      Power_Cycle_Count       100     100     0       26
192     Power-Off_Retract_Count 200     200     0       19
193     Load_Cycle_Count        200     200     0       278
194     Temperature_Celsius     108     102     0       39
196     Reallocated_Event_Count 200     200     0       0
197     Current_Pending_Sector  192     192     0       1458
198     Offline_Uncorrectable   192     192     0       1402
199     UDMA_CRC_Error_Count    200     200     0       0
200     Multi_Zone_Error_Rate   185     184     0       3178


What exactly is abnormal here? For comparison, I clicked on the "Normal" button and this is what I see:
Item    Now     Worst   Thresh  Updated
1       Raw_Read_Error_Rate     200     200     51      0
3       Spin_Up_Time    168     164     21      6600
4       Start_Stop_Count        100     100     0       257
5       Reallocated_Sector_Ct   200     200     140     0
7       Seek_Error_Rate 200     200     0       0
9       Power_On_Hours  90      90      0       7589
10      Spin_Retry_Count        100     100     0       0
11      Calibration_Retry_Count 100     253     0       0
12      Power_Cycle_Count       100     100     0       26
192     Power-Off_Retract_Count 200     200     0       9
193     Load_Cycle_Count        200     200     0       257
194     Temperature_Celsius     115     106     0       35
196     Reallocated_Event_Count 200     200     0       0
197     Current_Pending_Sector  200     200     0       0
198     Offline_Uncorrectable   200     200     0       0
199     UDMA_CRC_Error_Count    200     200     0       0
200     Multi_Zone_Error_Rate   200     200     0       0

Looks like the one which is supposed to be normal has a very high Raw_Read_Error_Rate, where as the one that is supposed to be abnormal, has only a value of 1. So which one is actually failing? Also, would the  "Left WDC" drive be the top or bottom drive? I guess I can pull one of the drives out and match the s/no, but I am not even sure if DNS-321 is giving accurate information.
Logged

jamieburchell

  • Level 6 Member
  • *
  • Posts: 947
Re: SMART report question
« Reply #1 on: October 31, 2010, 03:19:33 AM »

The drive you say is "abnormal" in your post has

1       Raw_Read_Error_Rate     1       1       51      302541
198     Offline_Uncorrectable   192     192     0       1402
200     Multi_Zone_Error_Rate   185     184     0       3178

What do you mean top and bottom drives? They are installed left to right?

« Last Edit: October 31, 2010, 03:29:34 AM by jamieburchell »
Logged
If your little 323 is not working right,
You've racked your brains and been up all night
Take a deep breath and wipe away the sweat,
Login as web admin and try a factory reset!

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #2 on: November 01, 2010, 12:53:28 PM »

The drive you say is "abnormal" in your post has

1       Raw_Read_Error_Rate     1       1       51      302541
198     Offline_Uncorrectable   192     192     0       1402
200     Multi_Zone_Error_Rate   185     184     0       3178

I looked at a reference http://www.z-a-recovery.com/man-smart.htm, but I am still a bit confused on how to interpret these numbers. So in general, a lower number means it is bad? Is 200 considered a good reference number, and anything lower than that might indicate a failure?

Edit: I jumped to relevant information without reading at the beginning, and I see that the link already answers my above question. It basically says: "on the scale from 0 (bad) to some maximum (good) value. Maximum values are typically 100, 200 or 253. Rule of thumb is: high values are good, low values are bad.". I now understand what these numbers indicate.

What do you mean top and bottom drives? They are installed left to right?
You are right, I think I was confused because I had Corza for a few days before replacing it with the D-Link, and it had drives one over the other.

Thanks for your help.
« Last Edit: November 01, 2010, 12:56:50 PM by irha »
Logged

jamieburchell

  • Level 6 Member
  • *
  • Posts: 947
Re: SMART report question
« Reply #3 on: November 01, 2010, 01:04:23 PM »

Actually it's my understanding that the last column of values are that of interest and that the higher the error values, the "worse" it is. I.e. 302541

In any case, the abnormal drive has defective sectors so I would consider replacing it.
« Last Edit: November 02, 2010, 03:13:31 AM by jamieburchell »
Logged
If your little 323 is not working right,
You've racked your brains and been up all night
Take a deep breath and wipe away the sweat,
Login as web admin and try a factory reset!

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #4 on: November 01, 2010, 01:31:10 PM »

Actually it's my understanding that the last row of values are that of interest and that the higher the error values, the "worse" it is. I.e. 302541

In any case, the abnormal drive has defective sectors so I would consider replacing it.
Thanks for the clarification.

PS: I guess you mean last column of values.
Logged

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #5 on: November 01, 2010, 03:15:13 PM »

I already requested for an RMA advance replacement from WD, but wondering if I should something immediately right now. Is it better to emove the one that is reported abnormal in advance and use it with the one drive or should I leave it in there? Also, is it better to keep the NAS shutdown until I get the replacement?
Logged

jamieburchell

  • Level 6 Member
  • *
  • Posts: 947
Re: SMART report question
« Reply #6 on: November 01, 2010, 03:22:09 PM »

Yes I meant column :)
I would backup your data somewhere if possible. Not sure about your other questions. I'd probably leave it in, it hasn't totally failed yet and two up to date copies is better than one.
Logged
If your little 323 is not working right,
You've racked your brains and been up all night
Take a deep breath and wipe away the sweat,
Login as web admin and try a factory reset!

jamieburchell

  • Level 6 Member
  • *
  • Posts: 947
Re: SMART report question
« Reply #7 on: November 01, 2010, 03:36:55 PM »

You're right about higher normalised values being good and that they shouldn't be lower than the threshold. If those column headings are correct, the first row (error rate) means the drive is failing as the normalised value is 1 and is less than the threshold. There's also an indication of bad sectors.
« Last Edit: November 01, 2010, 04:34:28 PM by jamieburchell »
Logged
If your little 323 is not working right,
You've racked your brains and been up all night
Take a deep breath and wipe away the sweat,
Login as web admin and try a factory reset!

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #8 on: November 01, 2010, 05:29:38 PM »

Thanks jamieburchell!

I am also wondering if one of the drives failing could cause the read speeds to go down drastically. Currently, the read speeds vary anywhere from just a few kb to sometimes up to 10mb/s and is very unpredictable. I actually never got good speeds with this NAS (mostly an average of 2.5 to 3mb/s) but didn't have time to diagnose until now. So far I verified that the connections from my PC and DNS321 to the gigabit switch are in deed at gigabit speed (my switch lights two leds if the speed is gigabit), so looks like the problem is not with the networking and could be due to the bad disk all along.
Logged

jamieburchell

  • Level 6 Member
  • *
  • Posts: 947
Re: SMART report question
« Reply #9 on: November 02, 2010, 03:12:36 AM »

Yes it could be, what with the read error rate being so high. If you did pull the bad disk out you might see the speed improve but I don't know that for sure.
Logged
If your little 323 is not working right,
You've racked your brains and been up all night
Take a deep breath and wipe away the sweat,
Login as web admin and try a factory reset!

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #10 on: November 05, 2010, 12:40:02 PM »

I got my replacement drive and put the drive in after shutting down the NAS. During the startup, the NAS took me through a couple of pages to initialize the drive and the process was stuck with the formatting progress bar at 0% and the NAS is unresponsive to web UI or ssh. I waited over night and it is the same, so I restarted the NAS using the power switch and but the same behavior repeats (ie., the same wizard at the startup and the same unresponsiveness later). What is going wrong here? I would appreciate any help. Thanks
Logged

jamieburchell

  • Level 6 Member
  • *
  • Posts: 947
Re: SMART report question
« Reply #11 on: November 05, 2010, 03:11:24 PM »

Try a factory reset first me thinks.
Are you also certain that you replaced the correct drive? Which drive did you go for? Have you tried checking the drive with the manufacturers diagnostics to ensure it's healthy?
« Last Edit: November 05, 2010, 03:14:26 PM by jamieburchell »
Logged
If your little 323 is not working right,
You've racked your brains and been up all night
Take a deep breath and wipe away the sweat,
Login as web admin and try a factory reset!

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #12 on: November 05, 2010, 04:31:50 PM »

I replaced the left side one, and confirmed the s/no to be the same as the one I RMA'ed. I didn't test the drive, but I am sure the one inside is the one reported as "Normal". Could doing a factory reset make the NAS forget how to rebuild the array? If the only option after a reset is to build a fresh array together, then that would cause a loss of my data, so I am a little worried.
Logged

gunrunnerjohn

  • Level 11 Member
  • *
  • Posts: 2717
Re: SMART report question
« Reply #13 on: November 06, 2010, 07:13:44 AM »

A factory reset should not lose the data, I've done that exact procedure with a RAID-1 array.
Logged
Microsoft MVP - Windows Desktop Experience
Remember: Data you don't have two copies of is data you don't care about!
PS: RAID of any level is NOT a second copy.

irha

  • Level 1 Member
  • *
  • Posts: 18
Re: SMART report question
« Reply #14 on: November 06, 2010, 06:19:35 PM »

OK, I went ahead and did a factory reset and logged in with no password. I got the same screen, but I skipped and went into the status and verified the status. I then went into Raid page and opted to rebuild the raid-1 array and it is now stuck at the same screen as before. The device is now unresponsive. One thing I failed to mention is that in this state, I don't see any drive activity (as per the leds) taking place. Here is a screenshot of this screen:

Logged
Pages: [1] 2