Jump to content

  • Log in with Twitter Log in with Windows Live Log In with Google      Sign In   
  • Create Account

Photo

High Interface CRC Error Count


  • Please log in to reply
9 replies to this topic

#1 IMuijtjens

IMuijtjens

    Newbie

  • Members
  • Pip
  • 6 posts

Posted 13 August 2017 - 03:16 PM

Hello,

 

Today I noticed the S.M.A.R.T. values of my hard drives in my storage server are reporting a huge count of Interface CRC Errors. All disks are showing these errors. I'm using ESXI on my Home Server. 2 VM's (1 web server and 1 storage server). 7 hard disks (8TB) are connected on my LSI SAS 9211-8i controller, which is flashed in IT Mode (JBOD). I configured pass-through and linked the controller to my storage VM. The error count continues to increase, but the performance of the disks stays ok. I did a copy from my data pool to my backup pool. 100GB of data copied in 13 minutes. After the copy completed, the error count was increased on all the disks.

 

I'm wondering what is causing these errors. All the disks in my Storage VM are showing these errors. The hard drive which is connected to my web server (RDM) is not showing these errors. That disk is connected on a SATA-port on my motherboard. Could it be the pass-through configuration which causing issues?

HD Tune Pro: ATA     WDC WD80EFZX-68U Health

ID                              Current  Worst    ThresholdData          Status     
(01) Raw Read Error Rate        100      100      16       0             ok         
(02) Throughput Performance     132      132      54       112           ok         
(03) Spin Up Time               145      145      24       38684393924   ok         
(04) Start/Stop Count           100      100      0        225           ok         
(05) Reallocated Sector Count   100      100      5        0             ok         
(07) Seek Error Rate            100      100      67       0             ok         
(08) Seek Time Performance      128      128      20       18            ok         
(09) Power On Hours Count       100      100      0        1270          ok         
(0A) Spin Retry Count           100      100      60       0             ok         
(0C) Power Cycle Count          100      100      0        220           ok         
(16) Unknown Attribute          100      100      25       100           ok         
(C0) Unsafe Shutdown Count      100      100      0        275           ok         
(C1) Load Cycle Count           100      100      0        275           ok         
(C2) Temperature                176      176      0        214749478946  ok         
(C4) Reallocated Event Count    100      100      0        0             ok         
(C5) Current Pending Sector     100      100      0        0             ok         
(C6) Offline Uncorrectable      100      100      0        0             ok         
(C7) Interface CRC Error Count  200      200      0        9163          attention  

Health Status         : ok

Setup:

 

Setup.png



#2 Spider99

Spider99

    Advanced Member

  • Members
  • PipPipPip
  • 442 posts
  • LocationWarrington, UK

Posted 13 August 2017 - 04:36 PM

in a non vm environment those errors mean something is up with the cable/controller or the hard disk controller/sata connector

 

as its all your disks - it could be the controller

 

as your other drive is not reporting errors because its on a different controller

 

not seen it with VM's before

 

if you connect one of the affected disks to the m/b sata controller do the errors stop?


Tim

---------

DrivePool & Scanner on 2012R2 Essentials Server Gigabyte Z87x OC Force, i7-4790S, 32 GB, LSI 9211, HP SAS Expander, 27 Storage disks, 6 SSD Cache, ~115 TB Storage


#3 IMuijtjens

IMuijtjens

    Newbie

  • Members
  • Pip
  • 6 posts

Posted 13 August 2017 - 06:13 PM

After a lot of reasearch and sweating I finnaly figured out the issue. This issue was driving me nuts.

 

Looks like it was the firmwire version on the LSI SAS 9211-8i controller. The controller was flashed with version P20. This firmwire version can cause a lot of troubles like drives falling out of the raid, CRC Errors etc. I flashed the controller back to firmwire version P19, reconnected my drives, and tried some file copies on the drive pool. None of the drives increased the error count. It's a shame the errors will always stay visible in the S.M.A.R.T. details, but I'm glad this fixed the issue.


  • Christopher (Drashna) likes this

#4 Christopher (Drashna)

Christopher (Drashna)

    Customer and Technical Support

  • Administrators
  • 8,196 posts
  • LocationSan Diego, CA, USA

Posted 13 August 2017 - 06:31 PM

well, I'm glad to hear that you were able to isolate the cause, and fix it.


Christopher Courtney

aka "Drashna"

Microsoft MVP for Windows Home Server 2009-2012

Lead Moderator for We Got Served

Moderator for Home Server Show

 

This is my server

 

Lots of "Other" data on your pool? Read about what it is here.


#5 Spider99

Spider99

    Advanced Member

  • Members
  • PipPipPip
  • 442 posts
  • LocationWarrington, UK

Posted 13 August 2017 - 07:28 PM

interesting i will have to check which firmware my 9211 is on - thought it was p20 but might be wrong - although i do not have any of the crc errors fortunately


Tim

---------

DrivePool & Scanner on 2012R2 Essentials Server Gigabyte Z87x OC Force, i7-4790S, 32 GB, LSI 9211, HP SAS Expander, 27 Storage disks, 6 SSD Cache, ~115 TB Storage


#6 IMuijtjens

IMuijtjens

    Newbie

  • Members
  • Pip
  • 6 posts

Posted 13 August 2017 - 08:35 PM

interesting i will have to check which firmware my 9211 is on - thought it was p20 but might be wrong - although i do not have any of the crc errors fortunately

 

Are you using 4TB disks? I saw someone who was using 4TB Dekstar disks with no issues running the P20 firmware, he had issues with a 6TB Dekstar NAS disk. Perhaps it depends on the brand/size of the disk. I'm using 5x WD Gold 8TB and 2x WD Red 8TB which are both failing under firmware P20.



#7 Spider99

Spider99

    Advanced Member

  • Members
  • PipPipPip
  • 442 posts
  • LocationWarrington, UK

Posted 13 August 2017 - 09:00 PM

yes i have 4tb HGST drives and WD blue 4TB and 6TB  - all fine with my 9211


Tim

---------

DrivePool & Scanner on 2012R2 Essentials Server Gigabyte Z87x OC Force, i7-4790S, 32 GB, LSI 9211, HP SAS Expander, 27 Storage disks, 6 SSD Cache, ~115 TB Storage


#8 Christopher (Drashna)

Christopher (Drashna)

    Customer and Technical Support

  • Administrators
  • 8,196 posts
  • LocationSan Diego, CA, USA

Posted 13 August 2017 - 11:34 PM

And here I am, sitting on P15....


Christopher Courtney

aka "Drashna"

Microsoft MVP for Windows Home Server 2009-2012

Lead Moderator for We Got Served

Moderator for Home Server Show

 

This is my server

 

Lots of "Other" data on your pool? Read about what it is here.


#9 Spider99

Spider99

    Advanced Member

  • Members
  • PipPipPip
  • 442 posts
  • LocationWarrington, UK

Posted 14 August 2017 - 01:42 PM

if it ain't broke... :)


Tim

---------

DrivePool & Scanner on 2012R2 Essentials Server Gigabyte Z87x OC Force, i7-4790S, 32 GB, LSI 9211, HP SAS Expander, 27 Storage disks, 6 SSD Cache, ~115 TB Storage


#10 Christopher (Drashna)

Christopher (Drashna)

    Customer and Technical Support

  • Administrators
  • 8,196 posts
  • LocationSan Diego, CA, USA

Posted 14 August 2017 - 06:20 PM

Yup.  

 

I definitely support the "don't upgrade the firmware if everything works fine".  Especially for live systems.  Nothing like scrambling for replacements when something goes wrong.


Christopher Courtney

aka "Drashna"

Microsoft MVP for Windows Home Server 2009-2012

Lead Moderator for We Got Served

Moderator for Home Server Show

 

This is my server

 

Lots of "Other" data on your pool? Read about what it is here.





0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users