1 |
On Wednesday, 6 March 2019 16:31:27 GMT Laurence Perkins wrote: |
2 |
> On Fri, 2019-03-01 at 10:12 +0000, Peter Humphrey wrote: |
3 |
|
4 |
> > [OT] |
5 |
> > Evidence is mounting that the Atom box is in terminal decline. I get |
6 |
> > things like batches of files in the portage tree changing owner, and then |
7 |
> > when I correct that, long lists of supposedly locally changed ebuilds |
8 |
> > preventing syncing. And when I boot weekly into its little rescue system |
9 |
> > to backup the main system, the root filesystem remounts itself read-only |
10 |
> > while tar is running. Smartd recognises the SSD and runs daily tests, but |
11 |
> > reports no errors. No amount of wiping and reinstalling has helped so far. |
12 |
> |
13 |
> What filesystem are you running and how old is the SSD? That sounds |
14 |
> like some of the symptoms EXT4 had on early generation flash media |
15 |
> where its assumptions about what order writes would physically make it |
16 |
> to the disk in were wrong, leading to corruption. |
17 |
|
18 |
The disk is a 64GB SanDisk SDSSDP device, which I bought five years ago to |
19 |
replace a failed spinning disk. All partitions are ext4 except /boot, which is |
20 |
ext2. |
21 |
|
22 |
> So unless it was working correctly at some point in the past, try a |
23 |
> different filesystem. EXT3 or BTRFS didn't have the same problems. |
24 |
|
25 |
It was working just fine until recently. |
26 |
|
27 |
> If it's just that the SSD is failing, then get a new one before |
28 |
> something important gets damaged and you have to redo the whole thing. |
29 |
|
30 |
Everything on it is disposable. |
31 |
|
32 |
The box is getting a bit long in the tooth: I bought it in November 2010. It's |
33 |
a single-core, 32-bit Atom N270 (not N2700). It doesn't owe me anything now, |
34 |
in spite of having cost £450 at the time. I don't know whether it's worth |
35 |
throwing any more money at it. On the other hand, I see Amazon are only asking |
36 |
for £20 for a small SSD. |
37 |
|
38 |
The repeatability of some of the errors it throws makes me question whether |
39 |
the disk or something else is at fault. (What would cause a file system to be |
40 |
remounted read-only in the middle of its work?) |
41 |
|
42 |
I have a spare four-core, 64-bit Celeron box (I bought it for a purpose that's |
43 |
gone away). I've been wondering what to do with it, so maybe it can replace |
44 |
the Atom box. It's powerful enough to compile its own software, whereas the |
45 |
Atom needs help. Whichever I use, its job will be as a server of DNS, LAN |
46 |
mail, time and git. Maybe print too. Also it will fetch my ISP's POP mail and |
47 |
serve it over IMAP to this box. |
48 |
|
49 |
> The self-test capability of storage media is almost universally |
50 |
> horrible and you generally don't get a failure report until your data |
51 |
> has already been lost. If your SMART output gives you the raw |
52 |
> statistics on the device instead of just pass/fail then analyzing that |
53 |
> usually gives a better indication of whether something is about to go |
54 |
> wrong. |
55 |
|
56 |
It seems to report only pass/fail, so that's not much help. |
57 |
|
58 |
Decisions, decisions... |
59 |
|
60 |
-- |
61 |
Regards, |
62 |
Peter. |