LinuxQuestions.org
Help answer threads with 0 replies.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices

Reply
 
Search this Thread
Old 08-02-2010, 03:12 PM   #1
maury0324
Member
 
Registered: Jul 2003
Location: Florida
Distribution: Debian Squeeze amd_64
Posts: 66

Rep: Reputation: 16
Package to find and delete duplicates


I have about 1TB of data to sort out and need a good GUI package to find and safely delete the duplicate files. I am running Ubuntu 10.04 amd_64.
Would appriciate any thoughts on how to get it done. TIA maury
 
Old 08-02-2010, 03:15 PM   #2
GrapefruiTgirl
Guru
 
Registered: Dec 2006
Location: underground
Distribution: Slackware64
Posts: 7,594

Rep: Reputation: 543Reputation: 543Reputation: 543Reputation: 543Reputation: 543Reputation: 543
What sort of 'data' or 'files' are they? And what defines "duplicates"? Duplicate filenames, or duplicate content?

This doesn't sound like a task that needs a GUI tool. Please expand upon the requirements.
 
Old 08-02-2010, 03:37 PM   #3
PTrenholme
Senior Member
 
Registered: Dec 2004
Location: Olympia, WA, USA
Distribution: Fedora, (K)Ubuntu
Posts: 4,147

Rep: Reputation: 330Reputation: 330Reputation: 330Reputation: 330
Try the krusader KDE application.

But Celine does have a point. Most file systems will not allow you to create multiple files with the same name in the same directory, so your use of the word "duplicate" is a little ambiguous.
 
Old 08-02-2010, 05:16 PM   #4
bryanl
Member
 
Registered: Dec 2003
Posts: 85

Rep: Reputation: 33
FSlint does a good job at this and it is in the repositories (and software center).

Quote:
FSlint is a toolkit to clean filesystem lint. It includes a GTK+ GUI as well as a command line interface and can be used to reclaim disk space. It has an interface for uninstalling packages, and it can find things like:

Duplicate files
Problematic filenames
Temporary files
Bad symlinks
Empty directories
Nonstripped binaries
 
Old 08-02-2010, 06:41 PM   #5
maury0324
Member
 
Registered: Jul 2003
Location: Florida
Distribution: Debian Squeeze amd_64
Posts: 66

Original Poster
Rep: Reputation: 16
Well, I guess the best way to describe it is I have a mess of accumulated
MP3 files (Radio shows and such) News stories of all sorts. Sometimes I would get duplicates and was saved wherever I had room. I finally have it all in one sock and want to get it sorted out and though the best way to start is get rid of duplicates. I may have to sort it into categories first?
 
Old 08-02-2010, 09:41 PM   #6
evo2
Guru
 
Registered: Jan 2009
Location: Japan
Distribution: Debian, SL
Posts: 5,098

Rep: Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100
If you are willing to forgo the gui you could use fdupes + a little shell scripting foo.

Evo2.
 
Old 08-02-2010, 11:06 PM   #7
bryanl
Member
 
Registered: Dec 2003
Posts: 85

Rep: Reputation: 33
This does get to be a puzzle. FSlint is easy to install, readily available, has a GUI, will find files with the same contents under a given directory (fdupes is one of the tools it uses, I think) that you can delete as you choose, and will do other useful things to help clean up a file system.

Yet it seems to be on the blacklist.

Why?
 
Old 08-02-2010, 11:31 PM   #8
evo2
Guru
 
Registered: Jan 2009
Location: Japan
Distribution: Debian, SL
Posts: 5,098

Rep: Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100Reputation: 1100
What "blacklist"?

Evo2.
 
Old 08-03-2010, 05:47 AM   #9
maury0324
Member
 
Registered: Jul 2003
Location: Florida
Distribution: Debian Squeeze amd_64
Posts: 66

Original Poster
Rep: Reputation: 16
Thanks everyone
I will give FSlint a try. My plan at the moment is to manually sort out the stuff into sections and then use FSlint to find duplication. Fortunatily I can do it at my leasure. Maury
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] Find URL in Debian package index via awk/sed (=find a line, then search from there) maddes.b Linux - Software 11 06-28-2013 07:37 AM
linux-headers package on a VPS? (Couldn't find package) FEL Linux - Newbie 8 04-06-2009 11:45 AM
synaptic package manager forcing me to delete a broken package mbman Linux - Software 6 04-25-2006 09:30 AM
how could I delete duplicates entries in xml using php catzilla Programming 2 10-30-2005 07:08 PM
how to delete duplicates entries in xml file using sed/awk/sort ? catzilla Linux - Software 1 10-28-2005 02:57 PM


All times are GMT -5. The time now is 03:07 AM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration