LinuxQuestions.org
Old 04-14-2019, 02:52 PM   #1
rjo98
Senior Member
 
Registered: Jun 2009
Location: US
Distribution: RHEL, CentOS
Posts: 1,760

Rep: Reputation: 49
How to find almost duplicate MP3s


Say I have 3 MP3s that are really the same song, but each one cuts off 0.1 seconds earlier than the previous one. A duplicate file finder like the one in FSlint doesn't catch these, because the files aren't exact matches for each other. Music-wise, though, they're about 99% the same.

Is there some current software out there more geared to music duplicate finding, that also identifies near-duplicate matches?
 
Old 04-14-2019, 04:37 PM   #2
RandomTroll
Senior Member
 
Registered: Mar 2010
Distribution: Slackware
Posts: 1,191

Rep: Reputation: 196
Don't they have similar names? id3tool returns version 1 tags; id3info returns version 2 tags; mediainfo returns MPEG tags. If the files don't have similar names but have similar tags... If they're really the same file apart from the missing last second, you can excerpt the first second, say, with truncate (on copies, since truncate shortens a file in place), then run fslint on those excerpts. You can also run them all through audiotag.info.
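
The excerpt-and-compare idea can be sketched without modifying any files. This is only an illustration, not the poster's actual procedure: it hashes the first `nbytes` of audio data in each file and groups files whose hashes match. The function names and the 64 KiB default are assumptions made for the example, and skipping a leading ID3v2 tag is an added step so that files with different tags can still match.

```python
import hashlib
import os
from collections import defaultdict


def audio_offset(f):
    """Return the offset just past a leading ID3v2 tag, or 0 if there is none."""
    header = f.read(10)
    if len(header) == 10 and header[:3] == b"ID3":
        # The tag size is a "synchsafe" integer: 7 usable bits per byte.
        size = 0
        for byte in header[6:10]:
            size = (size << 7) | (byte & 0x7F)
        # Flag bit 0x10 signals an extra 10-byte footer after the tag.
        footer = 10 if header[5] & 0x10 else 0
        return 10 + size + footer
    return 0


def prefix_digest(path, nbytes=65536):
    """Hash the first nbytes of audio data (hypothetical default: 64 KiB)."""
    with open(path, "rb") as f:
        f.seek(audio_offset(f))
        return hashlib.sha256(f.read(nbytes)).hexdigest()


def group_by_prefix(paths, nbytes=65536):
    """Return lists of paths whose leading audio bytes are identical."""
    groups = defaultdict(list)
    for path in paths:
        groups[prefix_digest(path, nbytes)].append(path)
    return [sorted(g) for g in groups.values() if len(g) > 1]


def find_mp3s(root):
    """Yield every .mp3 path below root."""
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            if name.lower().endswith(".mp3"):
                yield os.path.join(dirpath, name)
```

Calling `group_by_prefix(find_mp3s("/path/to/music"))` returns the candidate groups. This only catches copies that share the same encoded bytes at the start; re-encoded copies, or copies trimmed at the beginning, will not match.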
 
2 members found this post helpful.
Old 04-15-2019, 02:08 AM   #3
ondoho
LQ Addict
 
Registered: Dec 2013
Posts: 11,308
Blog Entries: 8

Rep: Reputation: 2935
Quote:
Originally Posted by rjo98 View Post
they're like 99% the same though.
unfortunately, for an operating system, that's the same as 0%.

that said, i heard that software exists that actually listens to the music and makes intelligent deductions.
i'm sure a few well-formulated web searches will reveal more info about this, and if it's available for linux.
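
One way such "listening" software can work is acoustic fingerprinting. The sketch below is an assumption, not something named in this post: it presumes the Chromaprint command-line tool `fpcalc` is installed, reads its raw fingerprint (a list of 32-bit integers), and scores two files by the fraction of matching bits over the part both fingerprints cover. The function names are hypothetical, and a useful cut-off score would have to be found by experiment.

```python
import subprocess


def parse_fpcalc(output):
    """Extract the integer list from the FINGERPRINT= line of fpcalc -raw output."""
    for line in output.splitlines():
        if line.startswith("FINGERPRINT="):
            return [int(x) for x in line[len("FINGERPRINT="):].split(",") if x]
    raise ValueError("no FINGERPRINT line in fpcalc output")


def raw_fingerprint(path, fpcalc="fpcalc"):
    """Run fpcalc (assumed to be on PATH) and return the raw fingerprint."""
    result = subprocess.run([fpcalc, "-raw", path], check=True,
                            capture_output=True, text=True)
    return parse_fpcalc(result.stdout)


def similarity(fp_a, fp_b):
    """Fraction of identical bits over the common prefix of two fingerprints."""
    n = min(len(fp_a), len(fp_b))
    if n == 0:
        return 0.0
    # XOR marks differing bits; the mask keeps negative values within 32 bits.
    differing = sum(bin((a ^ b) & 0xFFFFFFFF).count("1")
                    for a, b in zip(fp_a, fp_b))
    return 1.0 - differing / (32 * n)
```

Because the comparison is position by position from the start, it suits copies that differ only in where they end; a copy trimmed at the beginning would need the fingerprints aligned first.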
 
Old 04-15-2019, 06:43 AM   #4
Tonus
Member
 
Registered: Jan 2007
Location: Paris, France
Distribution: Slackware-current
Posts: 451
Blog Entries: 3

Rep: Reputation: 119
How to find almost duplicate MP3s

Beets could do the job.

See http://beets.io
 
1 members found this post helpful.
Old 04-15-2019, 03:20 PM   #5
rjo98
Senior Member
 
Registered: Jun 2009
Location: US
Distribution: RHEL, CentOS
Posts: 1,760

Original Poster
Rep: Reputation: 49
Sounds like beets may be what I'm looking for, if it does indeed help identify and clean out near-dupes via acoustic info. I'll have to check it out.

Most of the other software my very well formulated Google searches turned up is no longer actively developed, but I see beets' changelog was updated yesterday, so that's promising.

That's a clever idea with the truncate bit. I guess I'd have to do it at both the beginning and the end, as I'm sure there could be variances at both. Ah, the good old days of having to trim dead air from MP3s because disk space was at a premium; I think that's where most of the inconsistencies have come from.
 
Old 04-16-2019, 08:19 AM   #6
RandomTroll
Senior Member
 
Registered: Mar 2010
Distribution: Slackware
Posts: 1,191

Rep: Reputation: 196
Quote:
Originally Posted by rjo98 View Post
That's a clever idea with the truncate bit, guess i'd have to do it both at the beginning and the end
No.
 
Old 04-16-2019, 08:56 AM   #7
BW-userx
LQ Guru
 
Registered: Sep 2013
Location: MID-SOUTH USA
Distribution: Slackware 14.2 current / Linux Mint / Debian / Void Linux
Posts: 8,353

Rep: Reputation: 1744
Here's what I do to try to cut down on duplicate MP3s.

I have a script that re-samples the files, then uses the metatag info to move them somewhere else. It gives every file with the same artist and song the same filename and places it in the corresponding folder according to its meta tags, so duplicates within the same artist/album overwrite each other. If a required field is missing from the meta tags, the file gets split off and sent to a different location for manual processing, where I check it by eye and fix its metadata before storing it with the others.

You can use the metadata to do the same without re-sampling.
I use exiftool for the metadata info.
There is probably a tool out there to convert all metadata to one version, which would eliminate having to use more than one tool to be sure you get the metadata no matter what version it is.

I split my files into 4 separate parent directories, then run 4 scripts simultaneously. (I have a lot of MP3 files.)

Last edited by BW-userx; 04-16-2019 at 09:05 AM.
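
The metadata-only variant described above might look roughly like this. It is a sketch under assumptions, not the poster's script: it presumes exiftool is on the PATH, asks it for the Artist and Title tags as JSON, and only reports groups instead of moving or overwriting anything. The function names and the normalisation rule (lower-case, collapsed whitespace) are choices made for the example.

```python
import json
import subprocess
from collections import defaultdict


def read_tags(root, exiftool="exiftool"):
    """Ask exiftool for Artist/Title of every .mp3 below root, as a list of dicts."""
    result = subprocess.run(
        [exiftool, "-json", "-r", "-ext", "mp3", "-Artist", "-Title", root],
        capture_output=True, text=True)
    return json.loads(result.stdout) if result.stdout.strip() else []


def normalise(value):
    """Lower-case and collapse whitespace so trivial tag differences still match."""
    return " ".join(str(value).lower().split())


def group_by_tags(records):
    """Split records into duplicate groups and files needing manual review."""
    groups = defaultdict(list)
    untagged = []
    for rec in records:
        artist, title = rec.get("Artist"), rec.get("Title")
        if not artist or not title:
            # A missing field means the file is set aside for manual fixing.
            untagged.append(rec["SourceFile"])
        else:
            key = (normalise(artist), normalise(title))
            groups[key].append(rec["SourceFile"])
    dupes = {k: v for k, v in groups.items() if len(v) > 1}
    return dupes, untagged
```

`group_by_tags(read_tags("/path/to/music"))` returns a dict of suspected duplicates keyed by (artist, title), plus the list of files with incomplete tags. Matching on tags alone will also group different recordings of the same song, such as live and studio versions.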
 
  

