I read 2TB as space consumption, not file count, so it's not necessarily indicative of anything in a visual perusal.
True, although both have grown. This also means I am not going to copy it to the spare 2TB drive I do have (I have only one spare USB3 port, and this would take all day on USB2). I will limit any solutions being considered to those that either don't change what's there (in existing trees) or add new trees that I can "rm -r" later.
A Python script will keep a dictionary indexed by each file's i-node, storing the set of top-level directory names the file is found in. Another dictionary, also indexed by i-node, will store the file's sub-path(s) under the top-level tree. After a scan finishes building these two dictionaries, a loop over all the i-nodes will handle each one whose set of top-level names differs in size from the number of trees, reporting which trees it is in and which trees it is not in, along with its list of sub-paths. Then the script can figure out whether any tree has every file and list those as critical trees to keep. It can also rerun the scan over all trees looking for the problem sub-paths and report all the relations (which sub-path in which top-level).
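A rough sketch of that scan, for illustration only. The function names `scan_trees` and `find_partial` are made up here, and it assumes the trees are the immediate subdirectories of one parent directory. It also keys on `(st_dev, st_ino)` rather than the bare i-node number, which is an added assumption so that files on different filesystems can't collide. It only reads metadata, so nothing in the existing trees is changed.

```python
import os
from collections import defaultdict


def scan_trees(parent):
    """Index every file under each top-level tree in `parent` by i-node.

    Hypothetical helper: assumes the trees are the immediate
    subdirectories of `parent`.
    """
    trees = sorted(
        e.name for e in os.scandir(parent) if e.is_dir(follow_symlinks=False)
    )
    trees_by_inode = defaultdict(set)  # (dev, ino) -> top-level names
    paths_by_inode = defaultdict(set)  # (dev, ino) -> sub-paths under a tree
    for tree in trees:
        root = os.path.join(parent, tree)
        # os.walk does not follow symlinked directories by default.
        for dirpath, _dirnames, filenames in os.walk(root):
            for name in filenames:
                full = os.path.join(dirpath, name)
                st = os.lstat(full)  # lstat: do not follow symlinks
                key = (st.st_dev, st.st_ino)
                trees_by_inode[key].add(tree)
                paths_by_inode[key].add(os.path.relpath(full, root))
    return trees, trees_by_inode, paths_by_inode


def find_partial(trees, trees_by_inode, paths_by_inode):
    """Return the i-nodes missing from some tree, and the trees holding every file."""
    all_trees = set(trees)
    partial = {}
    for key, found in trees_by_inode.items():
        if len(found) != len(all_trees):
            partial[key] = (
                sorted(found),                # trees the file is in
                sorted(all_trees - found),    # trees the file is not in
                sorted(paths_by_inode[key]),  # its sub-paths
            )
    # A tree is "critical" if every i-node seen anywhere is present in it.
    complete = [
        t for t in trees if all(t in found for found in trees_by_inode.values())
    ]
    return partial, complete
```

Calling `find_partial(*scan_trees("/path/to/parent"))` would give the per-i-node report and the list of trees that contain everything. The second pass over the problem sub-paths is left out of this sketch.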