2013 LinuxQuestions.org Members Choice AwardsThis forum is for the 2013 LinuxQuestions.org Members Choice Awards.
You can now vote for your favorite products of 2013. This is your chance to be heard! Voting ends on February 4th.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
My backup solution is a custom bash script based around piping the results from find to cpio, so cpio it is.
Filesize limitations with the bin, odc or newc file formats would make GNU cpio a poor choice for many. If you like cpio I would use bsdcpio (or heirloom cpio) with pax format or use afio.
You could also get GNU tar to use the pipe to read its file list, e.g. "tar --no-recursion -T- -cvf archive.tar", again with pax file format or even just gnutar file format. Neither have any serious limitations.
Pax is another option but make sure it is heirloom Pax as the pax util provided by most distros cannot actually make pax formatted archives, only ustar, which is really no better than newc.
Last edited by ruario; 01-02-2014 at 04:03 PM.
Reason: clarified I was talking about file formats and mentioned heirloom cpio; mentioned GNU tar; mentioned pax
MAS (To use Tim Minchen's suggestion of "mildly amused smirk" rather than LOL ).
Thanks for the technical advice, but for the situation I have, backups of recently acquired files on a suite of Windows machines, I like cpio for the ability to manipulate path names. It has been working fine for years. I do not want to use a packed archive format, as what is required is to be able to read files directly from the Windows machines. It is also easier to demonstrate the recovery process to the occasional external auditor when our quality system is being assessed. The disaster recovery plan for these Windows machines (basically dedicated instrument controllers) is to restore a known good disk image, then restore any needed recently acquired files from backup. The file size issue will not occur due to the way these Windows machines are used.
I think what I am really trying to say is just the truism that the best choice of backup system heavily depends on what taking backups is trying to achieve.
Thanks for the technical advice, but for the situation I have, backups of recently acquired files on a suite of Windows machines, I like cpio for the ability to manipulate path names.
You can do the same with numerous other utils, including those I mentioned (e.g. GNU tar has the --xform switch).
Quote:
Originally Posted by allend
It has been working fine for years.
Fair enough but as computing changes and common file sizes increase you may find a time where you hit the individual file size limitations of the old cpio formats (bin = 2GB, odc = 8GB, newc = 4GB). I know that I personally have multimedia files and Linux distro ISO images in these size ranges lying around on my disks. afio extends the odc format (only for entries that need it) past these limits and pax has no real, practical limits right from the get go (9 EB file sizes are possible).
Quote:
Originally Posted by allend
I do not want to use a packed archive format, as what is required is to be able to read files directly from the Windows machines.
I'm not sure what you mean by this but the pax file format is the POSIX.1-2001 standard file format and an extension of tar. There are numerous utilities available on Windows that will open its contents just fine. Additionally as afio works with odc by default and only extends the header on entries that exceed its limitations its archives should also be readable using common Windows archiving tools, particularly as you state no files currently do exceed the limits.
Quote:
Originally Posted by allend
It is also easier to demonstrate the recovery process to the occasional external auditor when our quality system is being assessed. The disaster recovery plan for these Windows machines (basically dedicated instrument controllers) is to restore a known good disk image, then restore any needed recently acquired files from backup.
I fail to see how this would be different if you used pax or afio.
Quote:
Originally Posted by allend
The file size issue will not occur due to the way these Windows machines are used.
Fair enough, I did not know your specific use case until you just stated it, so it could well have been an issue. In any case you can just take the information (assuming you were not already aware) as something to keep in mind for the future.
Quote:
Originally Posted by allend
I think what I am really trying to say is just the truism that the best choice of backup system heavily depends on what taking backups is trying to achieve.
Sure, I agree with that. It was just a warning that (IMHO at least) cpio (the GNU implementation in particular) has run out of steam as it is no longer being actively developed to handle the types of files that are ever more common in the modern world. So it was just a heads up, so that you don't get bitten in the future. Like all advice on the internet you are free to just ignore it!
Having a single backup isn't exactly the best idea for protecting your data. If one of your backup mediums gets damaged or destroyed, it's very difficult to restore from your backup.
I follow the 3-2-1 backup philosophy quite strongly: 3 copies, 2 mediums, 1 off-site.
I'm using CrashPlan for my off-site backup solution and Deja-Dup for my local backup. Having a local backup is immensely useful, since there are many times where you might need to restore your data from a backup but not have the time, patience, or bandwidth to restore from an online backup service; the online backup service comes in handy when something catastrophic occurs, such as a fire, theft, or natural disaster.
I also store most of my media files on a dedicated file server at my friend's house in another state (since he has a fast FiOS connection).
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.