Jump to content
  • 0

Hot spare option?


Asturia

Question

3 answers to this question

Recommended Posts

  • 0

Hi,

 

Just a quick question.

Does DrivePool have the option to include a hotspare disk so that when it detects a disk that is failing it will automatically switch to that disk?

I'm fairly certain that feature does not exist, but if your also using Scanner with Drive pool, if scanner detects imminent failure of a drive, drivepool will try to evacuate data from the problem disk, but you must have enough available space on the existing drives.  You can read more here, http://blog.covecube.com/2012/09/stablebit-drivepool-balancing-plug-ins-and-scanner-integration/.

Link to comment
Share on other sites

  • 0

"Hot Spare". No, DrivePool tries to use every disk in the pool.  But that also depends on the Balancer settings.  

 

And as dbailey has pointed out, installing Scanner creates a Balancer for it in DrivePool, so that if Scanner determines that the drive is failing, it will migrate all data off of that drive. You can optionally set up the balancer to migrate data off a drive with SMART errors as well.   *And* you can set it to avoid using any disks that have exceeded their manufacturer specify temperature threshold (as these disks are more likely to fail).

Link to comment
Share on other sites

  • 0

As someone who has worked in the storage industry for many years, I thought I would take a moment and clarify what a hot spare is.

In a hardware raid system with mirroring or parity drives are required to be the same size. I'll use mirroring for the example, but parity just takes more drives and uses space more efficiently.

 

If drive 1 has "1234567" stored in sector 42, then drive 2 has "1234567" stored in sector 42. That is simple mirroring with 2 copies of the data.

If drive 1 fails then sector 42 can just be read from drive 2 and operation continues, however redundance/protection has been lost.

 

This is where the hot spare idea comes in. The only way to restore protection/redundance is add another drive. Keeping one online as a spare allows automatic replacement and the data can be immediately duplicated for protection.

 

Of course having 3 drives for evey two needed would be pretty expensive. Even mirroring gets pretty expensive in large storage facilities, hence the need for parity raid. So what really happens is there may be 1 hot spare online for 10 or 20 drives. It is ready to go when needed because the requirement is that duplication be restored asap.

 

With Drivepool a hot spare is totally unnecessary. It would just be wasting a disk that could be part of the pool and helping performance. (Remember reads and writes can be spread across disks.)

 

Why is it unnecessary? The need for a hot spare is to provide a place to duplicate the information. With Drivepool as long as you have more drives than the number of copies of your data (after a drive fails) then there is a place to recreate the copy. Of course this does require enough overall space.

 

Let's use a concrete example.

 

I have 3 drives in my system. All are 1TB drives. I have my entire pool duplicated 2x. If I have 2TB stored then each drive could be 66% full. (I'll touch on capacity and usage in a minute.)

If one drive fails then I still need at least 2TB across two drives to meet my duplication goal. I have two 1TB drives so I just recreate the duplicates and everything is safe again. 

So, any unused storage space can be used to give me the same ability to restore protection that I get with a hot spare for hardware raid.

 

The formula is duplication count < number of drives

and data size * duplication count < sum(drive sizes) - size of largest drive

 

This means you are covered.

 

Drive utilization should normally be maintained less than 80%. (I alluded to this above.) The reason is that allocation for writes slows dramatically in most operating systems when the drive becomes more used. It takes more work to find free space and it tends to be in more small blocks spread about the disk. So you will likely alway have enough space available to handle a drive failure if you normally run <80% utilization and you will have faster disk access.

 

Using an extra disk in your pool instead of having a spare also provides a performance benefit.

With 2 drives and duplication of 2 then reading data can happen from both drives at the same time. The result is that read speed is 2x drive speed.

If I add a third drive, then I can get 3x drive speed for reads, if the read chunk is larger than the chunks used for duplication. (This is how storage spaces works.)

With Drivepool, I believe that entire files are duplicated so this benefit isn't as clear. However, in a server situation where there is a high likelyhood of reading multiple files at the same time we can achive a read speed of Nx where N is the number of drives.

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Answer this question...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...