LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Software (http://www.linuxquestions.org/questions/linux-software-2/)
-   -   Package to find and delete duplicates (http://www.linuxquestions.org/questions/linux-software-2/package-to-find-and-delete-duplicates-823628/)

maury0324 08-02-2010 03:12 PM

Package to find and delete duplicates
 
I have about 1TB of data to sort out and need a good GUI package to find and safely delete the duplicate files. I am running Ubuntu 10.04 amd_64.
Would appriciate any thoughts on how to get it done. TIA maury

GrapefruiTgirl 08-02-2010 03:15 PM

What sort of 'data' or 'files' are they? And what defines "duplicates"? Duplicate filenames, or duplicate content?

This doesn't sound like a task that needs a GUI tool. Please expand upon the requirements. :)

PTrenholme 08-02-2010 03:37 PM

Try the krusader KDE application.

But Celine does have a point. Most file systems will not allow you to create multiple files with the same name in the same directory, so your use of the word "duplicate" is a little ambiguous.

bryanl 08-02-2010 05:16 PM

FSlint does a good job at this and it is in the repositories (and software center).

Quote:

FSlint is a toolkit to clean filesystem lint. It includes a GTK+ GUI as well as a command line interface and can be used to reclaim disk space. It has an interface for uninstalling packages, and it can find things like:

Duplicate files
Problematic filenames
Temporary files
Bad symlinks
Empty directories
Nonstripped binaries

maury0324 08-02-2010 06:41 PM

Well, I guess the best way to describe it is I have a mess of accumulated
MP3 files (Radio shows and such) News stories of all sorts. Sometimes I would get duplicates and was saved wherever I had room. I finally have it all in one sock and want to get it sorted out and though the best way to start is get rid of duplicates. I may have to sort it into categories first?

evo2 08-02-2010 09:41 PM

If you are willing to forgo the gui you could use fdupes + a little shell scripting foo.

Evo2.

bryanl 08-02-2010 11:06 PM

This does get to be a puzzle. FSlint is easy to install, readily available, has a GUI, will find files with the same contents under a given directory (fdupes is one of the tools it uses, I think) that you can delete as you choose, and will do other useful things to help clean up a file system.

Yet it seems to be on the blacklist.

Why?

evo2 08-02-2010 11:31 PM

What "blacklist"?

Evo2.

maury0324 08-03-2010 05:47 AM

Thanks everyone
I will give FSlint a try. My plan at the moment is to manually sort out the stuff into sections and then use FSlint to find duplication. Fortunatily I can do it at my leasure. Maury


All times are GMT -5. The time now is 09:33 AM.