LinuxQuestions.org
Help answer threads with 0 replies.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 11-01-2012, 09:21 PM   #1
Skaperen
Senior Member
 
Registered: May 2009
Location: center of singularity
Distribution: Xubuntu, Ubuntu, Slackware, Amazon Linux, OpenBSD, LFS (on Sparc_32 and i386)
Posts: 2,684
Blog Entries: 31

Rep: Reputation: 176Reputation: 176
An alternative to rsync?


I would like to know if there is an alternative to rsync for doing backups.

I currently use rsync to make incremental backups based on what is changed. I use the backup feature of rsync by configuring it to move replaced or deleted files into an "arch/YYYY-mm-dd" directory. The current copy of the backed up file tree is in "sync". Both "arch" and "sync" are in a directory designated for each backup configuration.

The primary problem is these backups get very large, with many millions of files. What rsync does is collect all the names of all the files into memory before doing any real transfers of data. This is putting a lot of memory stress on the source (to be backed up) and target (where backups are saved) systems.

I believe such memory hogging is really not needed. At any one time, the most that should be needed is to keep the names of all files in each of the directory levels down to the one where the backup activity is currently working ... not the entire tree. So I am looking for something that can do these things rsync does, but without doing this reading of all files in the entire tree.

There is also a secondary problem. I want to make a backup of the backup. This is resulting in double transfers. Because files in "sync" (on the target, which is now the source for this secondary backup) get moved to "arch" when replaced, the new target gets these files transferred to it as new files in "arch", even though it has a copy in "sync". I tried the --fuzzy option on rsync to see if it would find the duplicate "somewhere else". This has not worked.

If a new program is made to specifically deal with this, and synchronize primary backups to secondary backups with a minimum of transferred data (replicate the previous moves), that would be great. But it will still need to do smart data-incremental transfers where files being replaced still have most of the old data just like rsync was originally designed for.

If there is no such existing program, is there any interest in one being developed that focuses on incremental primary and secondary backups?
 
Old 11-01-2012, 11:25 PM   #2
sag47
Senior Member
 
Registered: Sep 2009
Location: Raleigh, NC
Distribution: Ubuntu, PopOS, Raspbian
Posts: 1,899
Blog Entries: 36

Rep: Reputation: 477Reputation: 477Reputation: 477Reputation: 477Reputation: 477
Which version of rsync are you using? From my rsync 3.0.8 man page it says:
Quote:
Beginning with rsync 3.0.0, the recursive algorithm used is now an incremental scan that uses much less memory than before and begins the transfer after the scanning of the first few directories have been completed. This incremental scan only affects our recursion algorithm, and does not change a non-recursive transfer. It is also only possible when both ends of the transfer are at least version 3.0.0.
Perhaps you need to update your rsync package or remove it and compile the latest rsync from sources.

SAM
 
Old 11-03-2012, 03:32 PM   #3
Skaperen
Senior Member
 
Registered: May 2009
Location: center of singularity
Distribution: Xubuntu, Ubuntu, Slackware, Amazon Linux, OpenBSD, LFS (on Sparc_32 and i386)
Posts: 2,684

Original Poster
Blog Entries: 31

Rep: Reputation: 176Reputation: 176
The versions varies from 3.0.6 to 3.0.8 depending on which system. I'm not seeing any incremental effect. It still scans the ENTIRE tree before doing any file transfers.

I don't know why they ever did it that way. Everything I see rsync doing can be done with only having the names in memory for the directories it is currently working at. I guess this was all designed back in the days of not so many files on small disks.

Last edited by Skaperen; 11-03-2012 at 03:34 PM.
 
Old 11-03-2012, 07:58 PM   #4
rknichols
Senior Member
 
Registered: Aug 2009
Distribution: Rocky Linux
Posts: 4,779

Rep: Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212
What options are you using? In the rsync manpage, the description for the "--recursive" option lists several other options that disable the incremental recursion mode and thus require much more memory.
 
Old 11-04-2012, 10:20 AM   #5
jefro
Moderator
 
Registered: Mar 2008
Posts: 21,980

Rep: Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624Reputation: 3624
If you want to go wild.
Some of the new file systems could be set up to provide similar data protection. Btrfs and zfs could be used to keep backups of data.

Go go old school a tar.gz on cron might do fine.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Linux remote backups, alternative to rsync? Alize Linux - Server 10 01-02-2018 06:02 PM
rsync solaris + ld.so.1: rsync: fatal: libiconv.so.2: open failed: xxx_anuj_xxx Solaris / OpenSolaris 25 02-23-2012 03:23 AM
[SOLVED] rsync fails in cron - ssh key prob for rsync? jonathansfl Linux - Server 6 12-09-2010 09:48 AM
Could I run rsync to download files from a server without rsync daemon? Richard.Yang Linux - Software 1 09-18-2009 04:08 AM
rsync using alternative ssh port va1damar Linux - Networking 5 01-29-2007 02:22 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 03:07 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration