Linux - GeneralThis Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Something about how MDADM's scrubs really don't fix errors and stuff about MDADM not verifying parity on reads (only writes). Any truth to all this? Maybe it's a bug that's been addressed since I can't find any recent discussions, only stuff from 2008 to about 2013.
Something about how MDADM's scrubs really don't fix errors and stuff about MDADM not verifying parity on reads (only writes). Any truth to all this? Maybe it's a bug that's been addressed since I can't find any recent discussions, only stuff from 2008 to about 2013.
For raid5 there is no way to know which block is bad if you do not get an error reported from the disk, for raid6 there is a possibility to recover, but I cannot say if the current MDADM uses it or not. I normally do a md5 or sha checksum of all files on an array so I later can see if there have been any corruption (and on what file).
When I started using RAID5 you could quite often get silent corruption of the data due to bandwidth problems on the motherboard or because the system is gradually becoming unreliable etc. So to have an external checksum is recommended so you can at least know that the system is good condition.
For raid5 there is no way to know which block is bad if you do not get an error reported from the disk, for raid6 there is a possibility to recover, but I cannot say if the current MDADM uses it or not. I normally do a md5 or sha checksum of all files on an array so I later can see if there have been any corruption (and on what file).
When I started using RAID5 you could quite often get silent corruption of the data due to bandwidth problems on the motherboard or because the system is gradually becoming unreliable etc. So to have an external checksum is recommended so you can at least know that the system is good condition.
I currently use RAID 6 with my MDADM setup. What is this checksum voodoo you speak of?
cd <root of structure i want to check>
find . -type f -exec md5sum {} \; >md5sum.sum
Which may take many hours depending on size to check, as for me the md5sum.sum file can be some 500 MB
Then to check you do:
md5sum -c md5sum.sum >md5check.txt
grep FAIL md5check.txt | more
If you then got a fail on a file you know is not modfied or you get different fails a second run you know your setup has problems.
Thanks for the info! Doesn't seem too complicated but if that link I originally posted is true, I sure do wish mdadm could be updated to do some repairing during a scrub.
Unless you know how, I think I'll look into automating that and only sending me an email if there are any failures.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.