Linux - HardwareThis forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
I have had an Areca 1210 for about 6 mos now running RAID 1E and have been tossing the option back and forth about rebuilding the RAID 10 array into a RAID 5 array. All 4 drives are populated on the controller. My question is.....
1) What kind of performance can I expect running RAID 5 vs. RAID 10 using the same number of drives? I would think that since RAID 5 stripes across 4 drives instead of 2 that it would perform much faster than RAID 10 on reads and writes, just a smaller gain for writes since it is using distributed parity.
2) What kind of read/write speed could I actually expect?
3) Does anyone have a similar setup that has real numbers (hdparm -tT would be sufficient)?
My typical transfers for reads under hdparm -tT is
/dev/sda:
Timing cached reads: 4144 MB in 2.00 seconds = 2073.49 MB/sec
Timing buffered disk reads: 330 MB in 3.00 seconds = 109.91 MB/sec
Any input is appreciated.
Sincerely,
Greg
Last edited by greg.dieselpowe; 10-29-2006 at 09:42 AM..
There would be no performance benefit going from raid 10 to raid 5. Maybe even a slight performance hit. A least on writing side of things. raid 5 introduces distributed parity, checking, and some other stuff. Whereas raid 1+0 combines two simple but very high performance methods of raid. With the raid 10 you get the best of both worlds but at the highest cost.
I have no idea what kind of harddrives you have but it looks like your raid performance is on the low average side. It looks consistent with what you might expect from 4 fairly cheap 2mb 7200 rpm harddrives.
I am already using the largest stripe the card supports. So I guess I should start wondering if a different filesystem would make a difference here also. This is off topic for this post though so I will find a more appropriate forum. Thanks for your input slantoflight.
I have had an Areca 1210 for about 6 mos now running RAID 1E and have been tossing the option back and forth about rebuilding the RAID 10 array into ...
Not so much for Greg, but others that stumble upon this like I did...
RAID 1E and RAID 10 are different. RAID 10 is a strip across mirrors. So you need 4 disks minimum, and even numbers there after. 2 disks each make up a mirror and the two mirrors are striped. This means with 4 100GB drives you get a 200GB set with both increased write and significantly increased read performance. You can loose up to 1 disk from each underlying mirror. With 6 disks in the array, you could loose up to 3 disks.
RAID 1E is an IBM mirroring/striping technique working with an arbitrary number of disks. This image from wikipedia should explain it, best. With 4 100GB disks you still end up with 200GB of storage. You can loose as many as 2 non-adjacent disks. This doesn't mean much for 4 disks, but for 5 disks it would mean you can still loose a max of 2 disks. 10 won't work with 5 disks.
Personally, I would use RAID 5 just because it allows a 4 100 GB disk array to reach 300GB with similar performance. The biggest downside I see is you can only loose 1 drive, regardless of how strategically you break them.
The cause for your speed capping out so low is most likely the drives controllers all being on the pci bus which has a capacity of around 160MB/s for all traffic. It's not too expensive to get a motherboard now that links sata controllers with PCIe bus lines instead of the shared pci bus. Even pcie x1 drive controller cards will do the trick.
For the record my 3 drive sata2 raid5 gets around 200MB/s reads. This is using linux software raid on RHEL5.
Remember, the utility hdparm just benchmarks raw throughput. The actual throughput depends on other variables such as the filesystem. Take for example, a 2.5 inch hard drive has a throughput of 5 MiB per second. When a filesystem like XFS is used, the throughput can be much higher. The reason is XFS writes in memory first and they writes the data to the hard drive. If memory is 2 GiB or larger, write throughput will be through the roof. Use either XFS or JFS to get the highest throughput.
When a filesystem like XFS is used, the throughput can be much higher. The reason is XFS writes in memory first and they writes the data to the hard drive. If memory is 2 GiB or larger, write throughput will be through the roof.
This of course would only be for file transfers of less than 2 GiB. A continuous stream of files from 1 disk to another (such as during a backup) will eventually slow down to the speed limitation of the controller, but it is a very good point to consider.
I experimented with RAID 0 across 4 SAS disks (15k RPM) on a Dell PowerEdge server with PERC 5i controller and learned the hard way the RAID gets lost with just one disk failure. OS is CentOS 5 64-bit.
When the replacement disk arrives, I'm likely going with a software RAID (so I don't have to worry about a failed controller), unless someone argues for a hardware install.
For the eventual rebuild, I'd want to maximize disk space (maybe 3 disks + spare, or 4 disks and no backup drive). The server has 6 disks, but the first two will be software RAID 1 of just the OS, and the remaining 4 for data/project stuff.
From what I've learned, it seems like RAID 10 would be best? I'd considered RAIDs 5 and 6, but I'm consistently finding talk of lost space and performance hits.
Since the controller can't natively handle RAID 10 anyway, what is the recommended software implementation for it? I don't plan to use LVM, unless it is the only way.
Remember, the utility hdparm just benchmarks raw throughput. The actual throughput depends on other variables such as the filesystem. Take for example, a 2.5 inch hard drive has a throughput of 5 MiB per second. When a filesystem like XFS is used, the throughput can be much higher. The reason is XFS writes in memory first and they writes the data to the hard drive. If memory is 2 GiB or larger, write throughput will be through the roof. Use either XFS or JFS to get the highest throughput.
I know this is old, but I can't leave this kind of misinformation floating around for other people to be misled by.
Write caching is not exclusive to XFS, virtually any file system can cache write in memory and then write them to the hard drive. This is a risk vs reward scenario, unless you are using a battery backed cache (and even if you are) because data that is cached but not written to disk can be lost if power to the system is cut. Even a battery backed cache will lose data if the power cannot be restored before the battery runs out of juice. A UPS can mitigate this risk also, but again it has a limited amount of power stored and when that runs out, poof. Hopefully you have set up an automatic shutdown to prevent data loss.
Also write throughput to a write cache is a completely meaningless statistic, as the write cache is still bottlenecked by the need to write out the cached data to the hard drive. Only the real hardware performance is relevant, as your cache will only provide this artificially inflated write speed for a brief period of time, and then it has to wait for the hard disk to free up some of the cache. XFS actually slows down disk writes because it is a journaling file system, and although the creators of it have sought to minimize the impact of this journaling, it is still overhead.
From what I've learned, it seems like RAID 10 would be best? I'd considered RAIDs 5 and 6, but I'm consistently finding talk of lost space and performance hits.
RAID 5 and 6 have a performance hit because of the parity calculatons, but waste less space than RAID 1E/RAID 10 for the same reason. The latter in a 4-drive array will waste half of the disk space on a copy of everything stored on the array, whereas the former only waste one quarter of the array (at least for RAID5), so if you have 4x100GB, RAID5 gives you 300GB while RAID 1E/10 gives you 200GB of usable space. Not sure what RAID 6 gives you, haven't done the math.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.