On 09/14/2013 09:59 AM, Grant wrote:
>> It's time to switch hosts. I'm looking at the following:
>>
>> Dual Xeon E5-2690
>> 32GB RAM
>> 4x SSD RAID10
> If I make this 6x SSD RAID10 with redundant power supplies, what is my
> weakest link as far as hardware? If a CPU craps out, will the system
> keep running?
>
> - Grant
>
Consider making the main memory ECC too, and enable the right options
in the kernel (the EDAC drivers) so that ECC errors are actually monitored.
There's no point in making the storage resilient if the data written
to it is already garbled.
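
As a rough way to check that ECC monitoring is live, here is a minimal
sketch that reads the error counters the Linux EDAC subsystem exposes in
sysfs. It assumes the EDAC driver for your memory controller is loaded and
that the counters live under /sys/devices/system/edac/mc; the function name
read_ecc_counts is made up for this example.

```python
from pathlib import Path

# Assumed location of the Linux EDAC counters; present only when the
# EDAC driver for the memory controller is loaded.
EDAC_ROOT = Path("/sys/devices/system/edac/mc")


def read_ecc_counts(root=EDAC_ROOT):
    """Return {controller: (corrected, uncorrected)} for each mc* directory."""
    counts = {}
    for mc in sorted(Path(root).glob("mc[0-9]*")):
        try:
            ce = int((mc / "ce_count").read_text().strip())
            ue = int((mc / "ue_count").read_text().strip())
        except (OSError, ValueError):
            continue  # skip controllers whose counters cannot be read
        counts[mc.name] = (ce, ue)
    return counts


if __name__ == "__main__":
    counts = read_ecc_counts()
    if not counts:
        print("no EDAC memory controllers found; is the EDAC driver loaded?")
    for name, (ce, ue) in counts.items():
        print(f"{name}: corrected={ce} uncorrected={ue}")
```

A corrected count that keeps climbing usually points at a DIMM on its way
out; any uncorrected error is worth investigating straight away.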
|
Also consider what happens if the RAID controller fails due to a
popped capacitor five years from now.
Will you still be able to get a like-for-like replacement?
Bear in mind that you may have to keep the RAID card firmware up to date
so that the array stays compatible with newer cards.
Of course, this all depends on how long you stay with your host, but
you have to decide how much resilience you want to build in.
|
It's the mechanical parts of spinning rust, or the pseudo-mechanical
wear of the NAND cells in SSDs, that will tend to fail first.
Secondary to that, in most places the PSU acts like a static-cling
dust collector with a blower attached; any slight knock shakes the dust
loose and can cause a short circuit, especially if the air is humid.
Also consider the fans blowing that air around inside the machine.
|
You can start thinking about earthquakes or flooding in the area too,
in which case you'd surely want two geographically diverse locations.
CPU and motherboard failures on server-spec hardware tend to be unlikely,
especially if the environment is controlled (air filters, temperature, power).
|
A great many things happen that are beyond anyone's sphere of control.
Just look at the New York datacentres during Hurricane Sandy: would it
have been better to have more diesel on site, or just to have everything
replicated at another site?
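
To put a rough number on the second-site option, here is a back-of-envelope
sketch. It assumes the sites fail independently, which is only plausible if
they really are geographically diverse, and the 99% per-site figure is made
up for illustration, not measured from any real datacentre.

```python
def combined_availability(site_availabilities):
    """Chance that at least one site is up, assuming independent failures."""
    all_down = 1.0
    for a in site_availabilities:
        all_down *= (1.0 - a)  # probability that every site so far is down
    return 1.0 - all_down


# Hypothetical figure: each site is up 99% of the time.
one_site = combined_availability([0.99])
two_sites = combined_availability([0.99, 0.99])
print(f"one site:  {one_site:.4%}")
print(f"two sites: {two_sites:.4%}")
```

If one hurricane can take out both sites, the independence assumption
breaks and the real figure is much closer to the single-site one.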
|
The real question is what uptime you expect and whether your
budget can match it.
Don't forget that uptime is affected by software as well as hardware.
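
To make the uptime expectation concrete, a small sketch that turns an
availability target into hours of allowed downtime per year, and shows how
hardware and software availability multiply when both have to be up. All
the percentages are hypothetical, chosen only to illustrate the arithmetic.

```python
HOURS_PER_YEAR = 365 * 24  # 8760, ignoring leap years


def downtime_hours_per_year(availability):
    """Hours of downtime per year allowed by an availability target (0..1)."""
    return (1.0 - availability) * HOURS_PER_YEAR


def serial_availability(*components):
    """Availability of a stack where every component must be up."""
    result = 1.0
    for a in components:
        result *= a
    return result


# Hypothetical targets, for illustration only.
for target in (0.99, 0.999, 0.9999):
    hours = downtime_hours_per_year(target)
    print(f"{target:.2%} uptime allows {hours:.1f} h/year down")

# Hypothetical split: hardware at 99.9%, software at 99.5%.
stack = serial_availability(0.999, 0.995)
print(f"hardware + software: {stack:.3%}")
```

The combined figure always comes out lower than the weakest component, so
spending on hardware beyond what the software can deliver buys very little.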