LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - General
User Name
Password
Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.

Notices


Reply
  Search this Thread
Old 01-25-2014, 11:06 AM   #1
Weapon S
Member
 
Registered: May 2011
Location: Netherlands
Distribution: Debian, Archlinux
Posts: 262
Blog Entries: 2

Rep: Reputation: 49
Space saving filesystem suggestions, please?


There's a spare hard disk I have at home. I'd like to use it for storing mainly anime episodes and streaming them to a Raspberry Pi.
Would be nice, if I could compress it, as I don't care for computational overhead, or random access.
I looked into btrfs, because of compression. But I then learned about deduplication. I think deduplication would be ideal for storing anime series.
I haven't found any FUSE solutions. But I can imagine something that basically writes a giant compressed archive to the disk might be effective. (And effectively "deduplicate".)
Also, it would only be necessary to present the files as normally uncompressed files to other computers.
A problem is, that I'm currently running Debian Squeeze (old-stable). I've wanted to try Arch, or Debian Sid, so this compression idea might be a good reason as any to do so. I enjoy making weird shizzle work, so I'm willing to do some experimenting and learn.
Could anyone nudge me into the right direction, please?
 
Old 01-25-2014, 01:16 PM   #2
jailbait
LQ Guru
 
Registered: Feb 2003
Location: Virginia, USA
Distribution: Debian 12
Posts: 8,337

Rep: Reputation: 548Reputation: 548Reputation: 548Reputation: 548Reputation: 548Reputation: 548
Here is a paper in Linux Journal which give the results from several compression programs used on different types of data.

http://www.linuxjournal.com/article/8051

For myself, disks are so cheap that I just buy more disks rather than using compression.

---------------------
Steve Stites

Last edited by jailbait; 01-25-2014 at 01:18 PM.
 
Old 01-27-2014, 04:51 PM   #3
jefro
Moderator
 
Registered: Mar 2008
Posts: 21,982

Rep: Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626Reputation: 3626
Many modern video formats will not compress well. I kind of doubt you will save much space. I forget what the current best compression method is.



Btrfs and zfs have the ability to compress data on the fly as well as maybe one or two others.

Another way might be to use some on the fly compression. Fuse, or squashfs or aufs maybe.
 
Old 01-27-2014, 09:08 PM   #4
TobiSGD
Moderator
 
Registered: Dec 2009
Location: Germany
Distribution: Whatever fits the task best
Posts: 17,148
Blog Entries: 2

Rep: Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886Reputation: 4886
As jefro already pointed out, usually video files are already compressed and will not further compress. They will even get larger if you try that, due to the added overhead. You will possibly save space if you convert them to a different compression format, for example if they are MPEg-2 encoded you might convert them to H.264, but this will also result in a decreased quality (re-encoding will always cause qulaity loss)
 
Old 01-28-2014, 12:43 AM   #5
Weapon S
Member
 
Registered: May 2011
Location: Netherlands
Distribution: Debian, Archlinux
Posts: 262

Original Poster
Blog Entries: 2

Rep: Reputation: 49
Yes, that is true. But the main problem with compressed file-systems seems to be they perform per file compression. But if you have a collection of files with identical parts, disk block deduplication, or solid block compression might work.
The problem with compressed archives seems to be the dictionary is based on a "local window"; it might not look back far enough to identify repetition across large files. But this is just from very shallow research into the matter.
Per block deduplication, as opposed to per file, could save space, and I believe significantly. Mainly because the opening sequences of anime.
I'll look into zfs and the exact deduplication methods of btrfs.

Last edited by Weapon S; 01-28-2014 at 12:58 AM.
 
Old 03-18-2014, 02:35 AM   #6
Weapon S
Member
 
Registered: May 2011
Location: Netherlands
Distribution: Debian, Archlinux
Posts: 262

Original Poster
Blog Entries: 2

Rep: Reputation: 49
I was getting ready to dig into the source, but btrfs has updated their documentation to clearly state bedup provides per file deduplication and not per block.
Also I remembered that a block compressed archive has horrendous random access.
This means that as of yet there are no space saving solutions for my case.
This probably makes me look more naive, but I'm convinced a per-block deduplicating open-source file system would be a great idea.
 
Old 03-18-2014, 03:48 AM   #7
whizje
Member
 
Registered: Sep 2008
Location: The Netherlands
Distribution: Slackware64 current
Posts: 594

Rep: Reputation: 141Reputation: 141
ZFS deduplication.
 
1 members found this post helpful.
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Saving root filesystem with tar Mr. Alex Linux - Newbie 4 01-04-2014 12:02 PM
Creating a filesystem to use free space on an existing filesystem idsouza28 Linux - Newbie 7 06-05-2012 12:29 AM
[SOLVED] Filesystem suggestions govert Linux - Server 18 07-19-2011 06:28 AM
space saving techniques? c_olin3404 Linux - General 2 03-22-2005 04:31 PM
Suggestions for power saving tips aside from ACPI and clock scaling... adamis Linux - Laptop and Netbook 1 11-13-2003 11:08 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - General

All times are GMT -5. The time now is 12:48 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration