BUT, every once in a while (let's say, a couple of weeks, then a couple of
months), all of sudden the array will report as being degraded.
However, other than the notice that the array is degraded and that a mirror is
offline, I can find nothing in any log, or any changes in the dmesg to suggest
what may have happened.
I have changed the hard drive cables. I have changed out the SSD drives.
But, it still happens every so often.
When the array is degraded, I can still fdisk/disklabel the "offline" disk
without a problem. I can rebuild the degraded array with the "offline" disk (#
bioctl -R /dev/sd1m sd2), and the rebuild completes without a problem, and the
array is stable for weeks/months until, randomly, it happens again.
I am wondering if there is anything I should be looking at/for to help figure
out what the issue is?
As I said, I have already swapped out hardware (at least) once. If it is a
hardware issue, I can keep swapping out hardware, but (at this point) it seems
that the probability is really low that I would have multiple drives that have
the same intermittent problem (but, obviously, not zero).
I would appreciate any advice on how to track down what the problem may be the
next time it happens.