#1
  1. 'fie' on me, allege-dly
    Devshed Supreme Being (6500+ posts)

    Join Date
    Mar 2003
    Location
    in da kitchen ...
    Posts
    12,889
    Rep Power
    6444

    HDD dying more than the average amount


    Hi Folks, been a while ...

    On a client site, and an SSD has just died. This was a replacement for a HDD which was itself replaced, twice.

    The drives were bought by me, from different suppliers, and also by the end user.

    Just wondering are there some environmental factors I'm overlooking which could account for a greater than average number of drive failures on a mixture of machines and a mixture of drives in one physical location

    All Dell laptops, apart from one toshiba

    Any ideas welcomed, before I join the ranks of the follicly challenged prematurely ...
    --Ax
    without exception, there is no rule ...
    Handmade Irish Jewellery
    Targeted Advertising Cookie Optout (TACO) extension for Firefox
    The great thing about Object Oriented code is that it can make small, simple problems look like large, complex ones


    09 F9 11 02
    9D 74 E3 5B
    D8 41 56 C5
    63 56 88 C0
    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
    -- Jamie Zawinski
    Detavil - the devil is in the detail, allegedly, and I use the term advisedly, allegedly ... oh, no, wait I did ...
    BIT COINS ANYONE
  2. #2
  3. No Profile Picture
    Contributing User
    Devshed Regular (2000 - 2499 posts)

    Join Date
    Mar 2006
    Posts
    2,400
    Rep Power
    1688
    My first guesses would have been heat and utilisation.
    Do SSDs have potential issues with heat? If the disk(s) are 'hammered' constantly - high-rate, small or medium I/O requests then the physical mechs would tend to suffer, but that would not impact SSDs - unless the high I/O was write-based.
    It is strange as what I would assume would be bad for a physical disk mech would not be too bad for an SSD and vice-versa due to their different internals. Hence my assumption above about the I/O; pretty much the only thing (apart from heat) that they would share.

    Or it could all be a horrible conicidence!
    The moon on the one hand, the dawn on the other:
    The moon is my sister, the dawn is my brother.
    The moon on my left and the dawn on my right.
    My brother, good morning: my sister, good night.
    -- Hilaire Belloc
  4. #3
  5. No Profile Picture
    Super Moderator
    Devshed Specialist (4000 - 4499 posts)

    Join Date
    Jul 2003
    Posts
    4,008
    Rep Power
    2790
    Welcome back, Axwieldr.

    Yes SSD drives do suffer in the heat.

    The only other thing I can think of is the lack of manufacturing in Thailand due to last years tsunami, may have lead to lower quality sourcing of components in cheaper drives.
    [PHP] | [Perl] | [Python] | [Java] != [JavaScript] | [XML] | [ANSI C] | [C++] | [LUA] | [MySQL] | [FirebirdSQL] | [PostgreSQL] | [HTML] | [XHTML] | [CSS]

    W3Fools - A W3Schools Intervention.
  6. #4
  7. 'fie' on me, allege-dly
    Devshed Supreme Being (6500+ posts)

    Join Date
    Mar 2003
    Location
    in da kitchen ...
    Posts
    12,889
    Rep Power
    6444
    Originally Posted by SimonJM
    My first guesses would have been heat and utilisation.
    Do SSDs have potential issues with heat? If the disk(s) are 'hammered' constantly - high-rate, small or medium I/O requests then the physical mechs would tend to suffer, but that would not impact SSDs - unless the high I/O was write-based.
    It is strange as what I would assume would be bad for a physical disk mech would not be too bad for an SSD and vice-versa due to their different internals. Hence my assumption above about the I/O; pretty much the only thing (apart from heat) that they would share.

    Or it could all be a horrible conicidence!
    The offices are habitable, there was an issue with aircon some time ago.

    I've been thinking EM radiation, but that'd be inverse square and would be visible to the naked eye, surrounding businesses report no issues - ley lines perchance ...
    --Ax
    without exception, there is no rule ...
    Handmade Irish Jewellery
    Targeted Advertising Cookie Optout (TACO) extension for Firefox
    The great thing about Object Oriented code is that it can make small, simple problems look like large, complex ones


    09 F9 11 02
    9D 74 E3 5B
    D8 41 56 C5
    63 56 88 C0
    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
    -- Jamie Zawinski
    Detavil - the devil is in the detail, allegedly, and I use the term advisedly, allegedly ... oh, no, wait I did ...
    BIT COINS ANYONE
  8. #5
  9. No Profile Picture
    Super Moderator
    Devshed Specialist (4000 - 4499 posts)

    Join Date
    Jul 2003
    Posts
    4,008
    Rep Power
    2790
    Originally Posted by Axweildr
    ley lines perchance ...
    Yes, my last office had the same issue. We suffered from a minor demon incursion once due to it; messy business all round.
    [PHP] | [Perl] | [Python] | [Java] != [JavaScript] | [XML] | [ANSI C] | [C++] | [LUA] | [MySQL] | [FirebirdSQL] | [PostgreSQL] | [HTML] | [XHTML] | [CSS]

    W3Fools - A W3Schools Intervention.
  10. #6
  11. 'fie' on me, allege-dly
    Devshed Supreme Being (6500+ posts)

    Join Date
    Mar 2003
    Location
    in da kitchen ...
    Posts
    12,889
    Rep Power
    6444
    Originally Posted by Winters
    Welcome back, Axwieldr.

    Yes SSD drives do suffer in the heat.

    The only other thing I can think of is the lack of manufacturing in Thailand due to last years tsunami, may have lead to lower quality sourcing of components in cheaper drives.
    The SSD was quite warm when pulled from the chassis, so maybe that's it. It's a SAMSUNG drive, not their cheapest either, but I thought flash was flash ...
    --Ax
    without exception, there is no rule ...
    Handmade Irish Jewellery
    Targeted Advertising Cookie Optout (TACO) extension for Firefox
    The great thing about Object Oriented code is that it can make small, simple problems look like large, complex ones


    09 F9 11 02
    9D 74 E3 5B
    D8 41 56 C5
    63 56 88 C0
    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
    -- Jamie Zawinski
    Detavil - the devil is in the detail, allegedly, and I use the term advisedly, allegedly ... oh, no, wait I did ...
    BIT COINS ANYONE
  12. #7
  13. No Profile Picture
    Super Moderator
    Devshed Specialist (4000 - 4499 posts)

    Join Date
    Jul 2003
    Posts
    4,008
    Rep Power
    2790
    Hot or warm? Warmer than 70C?
    [PHP] | [Perl] | [Python] | [Java] != [JavaScript] | [XML] | [ANSI C] | [C++] | [LUA] | [MySQL] | [FirebirdSQL] | [PostgreSQL] | [HTML] | [XHTML] | [CSS]

    W3Fools - A W3Schools Intervention.
  14. #8
  15. 'fie' on me, allege-dly
    Devshed Supreme Being (6500+ posts)

    Join Date
    Mar 2003
    Location
    in da kitchen ...
    Posts
    12,889
    Rep Power
    6444
    Originally Posted by Winters
    Hot or warm? Warmer than 70C?
    "Quite warm" based on anecdotal evidence, they guy who freed the drive isn't currently available for comment ...

    Currently scanned over 50% of the drive and no files or folders evidenced, not looking good, but I'm really looking to understand why so many drives crap out in this place
    --Ax
    without exception, there is no rule ...
    Handmade Irish Jewellery
    Targeted Advertising Cookie Optout (TACO) extension for Firefox
    The great thing about Object Oriented code is that it can make small, simple problems look like large, complex ones


    09 F9 11 02
    9D 74 E3 5B
    D8 41 56 C5
    63 56 88 C0
    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
    -- Jamie Zawinski
    Detavil - the devil is in the detail, allegedly, and I use the term advisedly, allegedly ... oh, no, wait I did ...
    BIT COINS ANYONE
  16. #9
  17. No Profile Picture
    Super Moderator
    Devshed Specialist (4000 - 4499 posts)

    Join Date
    Jul 2003
    Posts
    4,008
    Rep Power
    2790
    SSD drives are very robust but they can fail for various reasons. Heavy amounts of writing can cause failures and, as mentioned, a continuous excess of heat can destroy them.

    I would just send them back to the manufacturers for replacement and hopefully some answers.
    [PHP] | [Perl] | [Python] | [Java] != [JavaScript] | [XML] | [ANSI C] | [C++] | [LUA] | [MySQL] | [FirebirdSQL] | [PostgreSQL] | [HTML] | [XHTML] | [CSS]

    W3Fools - A W3Schools Intervention.
  18. #10
  19. Providing fuel for space ships
    Devshed Supreme Being (6500+ posts)

    Join Date
    Mar 2004
    Location
    nr Edinburgh, Scotland
    Posts
    14,382
    Rep Power
    3848
    Hey Ax, been a while

    How many laptops are we talking about, one, two, ten !?

    Just wondering if they are all in the same room and potentially running off the same electrical ring circuit. If so, could be a power surge of some sort that is frying the hard drives? Are they experiencing issues with any other electrical equipment ? If so, might be worth having an electrician have a look at the circuit.
    The No Ma'am commandments:

    1.) It is O.K. to call hooters 'knockers' and sometimes snack trays
    2.) It is wrong to be French
    3.) It is O.K. to put all bad people in a giant meat grinder
    4.) Lawyers, see rule 3
    5.) It is O.K. to drive a gas guzzler if it helps you get babes
    6.) Everyone should car pool but me
    7.) Bring back the word 'stewardesses'
    8.) Synchronized swimming is not a sport
    9.) Mud wrestling is a sport
  20. #11
  21. No Profile Picture
    Stumpier old Moderator
    Devshed Supreme Being (6500+ posts)

    Join Date
    Jun 2003
    Posts
    14,409
    Rep Power
    4538
    Also, were all the computers purchased at the same time? Maybe you got a bunch with some inherent defect.

    How are the drive(s) failing?
    ======
    Doug G
    ======
    It is a truism of American politics that no man who can win an election deserves to. --Trevanian, from the novel Shibumi
  22. #12
  23. 'fie' on me, allege-dly
    Devshed Supreme Being (6500+ posts)

    Join Date
    Mar 2003
    Location
    in da kitchen ...
    Posts
    12,889
    Rep Power
    6444
    Thanks for the tips lads, Howdy aitken

    Drives failing with Reallocated Sectors counts climbing, offline uncorrectables, the SSD just plain crapped out - It was bad news because there was a full days work lost in a time critical project.

    Reinstalled XP on a new 320GB Drive, and tracked the temp of the cores and the GPU and HD0, cheers to Speedfan over and hour or so, running now at 69/70o for the Core and GPU.

    I'm guessing we can call this well hot? I wouldn't have expected much more than about 50, am I being realistic? After 2 hours the heat is starting to become apparent on the outer casing.
    Attached Images
    --Ax
    without exception, there is no rule ...
    Handmade Irish Jewellery
    Targeted Advertising Cookie Optout (TACO) extension for Firefox
    The great thing about Object Oriented code is that it can make small, simple problems look like large, complex ones


    09 F9 11 02
    9D 74 E3 5B
    D8 41 56 C5
    63 56 88 C0
    Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
    -- Jamie Zawinski
    Detavil - the devil is in the detail, allegedly, and I use the term advisedly, allegedly ... oh, no, wait I did ...
    BIT COINS ANYONE
  24. #13
  25. No Profile Picture
    Contributing User
    Devshed Novice (500 - 999 posts)

    Join Date
    Oct 2009
    Location
    Nebraska, USA
    Posts
    830
    Rep Power
    275
    that is weird.....the cpu cores' temps are showing about normal, but, the CPU itself si way too high. and, yes, that gpu temp is too high, too. 50-55C is about normal for avg GPU temps. and CPU temps should be between 34-50C. anything over those temps, especially @ idle is WAY TOO HOT.

    I would check the laptop cpu fan..make sure its functioning properly and that no "dust bunnies" have "crawled" in to nest.

    Only other alternative is buying a laptop cooler pad to blow across 'em to keep 'em cooler. [other than returning them and telling manufacturer to shove 'em where the sun don't shine ]
  26. #14
  27. Contributed User
    Devshed Specialist (4000 - 4499 posts)

    Join Date
    Jun 2005
    Posts
    4,366
    Rep Power
    1870
    > On a client site, and an SSD has just died. This was a replacement for a HDD which was itself replaced, twice.
    Does "replace" mean it's the same physical machine in all cases?
    In addition to what aitken325i said, I would also suspect the charging unit as well in this case. Perhaps it's not as good at filtering out mains noise.

    Do the laptops spend most of their time "mains powered" rather than running directly from the battery?
    Does this also mean that they're set up for the "performance" profile rather than say "balanced" or "power saver".

    > Drives failing with Reallocated Sectors counts climbing, offline uncorrectables
    Is this using the SMART data from the drive itself? Some drives also provide temperature data as well. Perhaps stick some temperature recorders over your drives to see how hot they're really getting (or got to when failed). One on each side would tell you how much of a temperature gradient there is.

    Picking up on what DonR said regarding idle temperatures. Are the machines actually idle when not being used.
    For example, is there any corporate "monitoring" s/w running on the machines which is making them just a little bit less idle than just a normal retail installation.
    If you dance barefoot on the broken glass of undefined behaviour, you've got to expect the occasional cut.
    If at first you don't succeed, try writing your phone number on the exam paper

IMN logo majestic logo threadwatch logo seochat tools logo