Bash script to change a filename associated with an inode index number.
I am trying to write a BASH script that will search through a directory and find filenames with illegal characters in them then find the inode associated with the file and then change the filename associated to that inode number to a legal name by stripping out illegal chars and replacing them with underscores.
I have a piece of code that is in the right direction. It finds illegal names and removes them. With code like this: //Remove by filenames with illegal chars. find . -name '*[+{;"\\=?~()<>&*|$ ]*' -exec rm -f '{}' \; or //Remove by passing an inode number. find . -inum $inum -exec rm {} \; Instead of removing the file I want to keep the file but change the filename associated with the inode number. Missing pieces that I think i still need: Method of stripping the bad filename of illegal chars and replacing them with underscores. I assume there will need to be a string variable that holds the old name and then is processed and then stored in a new name variable. These two variables could be used with a move (mv) command to change the bad name. Is there a direct way to update the name associated with an inode number, without using mv? I am still working on finding the solution to this and I will post what I find, but any help out there would be great. Thank you, Ziv |
Quote:
|
Replacing with underscores:
filename='test=example+file?' echo $filename echo $filename | sed 's/[+{;"\\=?~()<>&*|$ ]/_/g' I don't understand why you don't want to use mv either! |
I didn't know if it would work...
When I try it at the command line, it gives me trouble with the filenames, depending on the illegal chars it encounters in the name.
mv is fine, but it seems that I will need to do it by referring to it's inode number. Am I right about this? Ziv |
When I try it at the command line, it gives me trouble with the filenames, depending on the illegal chars it encounters in the name.
I was gonna warn about that:.. filename="A$(echo -en \\x01\\x10\\x0101\\x002\\x83)Z" now try: echo $filename | sed 's/[+{;"\\=?~()<>&*|$ ]/_/g' vs: echo ${filename//[^a-zA-Z0-9]/} replacement: Code:
retScrubbedString() { # Slow but no sed needed: A__1__Z |
Tried your script, actually it outputs on my computer:
A___012_Z Gonna take me a while to figure out exactly how that script works. How about this, which I think has the same effect as your script, why not use sed with a more accurate regular expression?: echo $filename | sed 's/[^0-9a-zA-Z_./-]/_/g' |
Tried your script, actually it outputs on my computer:
A___012_Z Depends on the charset you're using I guess? How about this, which I think has the same effect as your script Yeah, that works fine. That's the *easy* way tho :-] |
How can I use the find command to execute a mv?
I like the sed command that is above, but I am having problems tying that into a mv command that will change the old filename into a new one.
I was trying something like this... find . -name '*[;"\\=?()<>&*|$]*' -exec $badname= '{}' \; | $filname= sed 's/[;"\\=?()<>&*|$]/_/g' | mv $badname $filename I want the script to search the directory for bad filenames then replace the bad characters with underscores and mv the old badname to the new corrected filename. Any help is appreciated... ZIv |
How about something like this....
Code:
#!/bin/bash |
Slight Problem...
There is a slight problem with that solution...
I get the following output after running the script. ********** #sh rename_filename_test.sh mv: when moving multiple files, last argument must be a directory Try `mv --help' for more information. mv: `rename_filename_test.sh' and `rename_filename_test.sh' are the same file ********** Should the script file be included in the mv? None of the illegal characters are in the filename. Also, should the file be in the same folder the script is running on? Thank you for the reply, Ziv |
Hmm, it works ok for me. Try copy the entire script and paste it into a text file called test. I have that file located in the directory and run it with the command: sh test
|
Quote:
# sh test mv: `rename_filename_test.sh' and `rename_filename_test.sh' are the same file mv: `test' and `test' are the same file What is the output like when you run it? Ziv |
Oh, I see what you are saying.
Quote:
|
Looks like it is working good...
Two last things that need ironed out...
1) It doesn't seem to be changing the "[" or "]" chars in the filenames. 2) It is not recursing subdirectories it only tries to change the name of it. See below... # sh test mv: `23-Mar-2004_120109[1].tar.gz' and `23-Mar-2004_120109[1].tar.gz' are the same file mv: `backup_utils.txt' and `backup_utils.txt' are the same file mv: cannot move `cups-pdf' to a subdirectory of itself, `cups-pdf/cups-pdf' mv: `encryptpdf.exe' and `encryptpdf.exe' are the same file mv: cannot move `printerdrivers' to a subdirectory of itself, `printerdrivers/printerdrivers' mv: `QCS_A2LA_Request.doc' and `QCS_A2LA_Request.doc' are the same file mv: `remove_illegals.txt' and `remove_illegals.txt' are the same file mv: `test' and `test' are the same file mv: `western_web.zip' and `western_web.zip' are the same file Any ideas on these problems? Ziv |
Well, I hit a wall on this one. :(
The part about a name with brackets ( [ ] ) in it can be solved but it creates yet another problem as the period ( . ) before the file extension is also in the same class as the brackets. That is they both are [[:punct:]] I also had very little luck with the recursive problem. Code:
#!/bin/bash |
I'll have another BASH at this :)
Hello again,
I have had another go at this one.... Using find might not work because if you change the names of some directories half way through the search, I guess it can confuse find, possibly missing some files, therefore if you want a recursive algorithm, you should use a script with a for loop. DISCLAIMER: I am not a bash expert... I am still learning every day. I have done a small amount of testing on this script and it seems to work on the few test cases I have tried, but recursive scripts that modify the filesystem are powerful tools to be used with care, so don't run this as root, and if the data is important, back it up first. If there are problems though, I will be happy to try to fix them, but I can't recover any lost data! Code:
bad='[^A-Za-z0-9_.-]' I used && instead of ;'s and if fi blocks. You may find this hard to read. Sorry! I'm sure this code could be cleaned up a bit and it can definitely be made more readable! Improvements are very welcome, I would be happy to see alternative ways to do this. I hope this is at least be something you can work from. Let me know how you get on. Happy Bashing! Mark. |
Thank you I will give it a try...
I will let you know how it goes...
Thank you for your help putting this together. Ziv |
I think I got it!!!
Well this seems to work for me, with some modifications...
Here is the code I created based off of your help. Code:
#!/bin/bash Do you see any problems with this method? Ziv |
It is not working perfectly...
Dang!
It doesn't handle spaces in directory names. Well... any other ideas? Ziv |
What error did you get?
As I tried to point out in my earlier post, I don't think you should use find if you are going to be renaming the directories, since this will cause lots of file not found errors once you have renamed the parent directories (NOTE: I reconsidered this again, and now i have decided that if you make sure that you rename the deepest file first, maybe you can get this to work! I will experiment with this).
Your problem with spaces is because the way 'for' works on a variable, splitting at the spaces. Although I'm sure there is a way to fix this, you will still end up with the problem with find I mention above. To avoid both these problems, I would advise using the 'for i in *' method, as I did in my example, and iterating through the directories manually. I have no idea what problem you encountered. Perhaps I didn't explain well enough how to use the script? Or perhaps you have something in your directory structure that I hadn't thought of (sockets / hard links / some feature I've never heard of / a strange character that causes my script to fail). If you could explain to me exactly what didn't work wiht my script, I am sure I can fix the problem. Ideally, post a script which creates a directory structure for which my proposed renaming script fails such as Code:
mkdir 'foo bar'; touch 'foo bar/baz'; Mark. |
I found an error in my script! The last minute change I did to ignore softlinks was incorrect in the example I posted! (I missed a &&)
Code:
bad='[^A-Za-z0-9_.-]' |
OK there are problems with my script too! First the test for a symbolic link is -h, and not -s. Second, it looks like the values of local variables are not restored to their original values on returning from a function call.... perhaps it is possible to specify variables as local somehow. I'll look at this now and get back to you later when I have an answer.
|
Script version 3 :
Code:
bad='[^A-Za-z0-9_.-]' Code:
$ mkdir 'a b'; touch 'a b/c d'; sh renamebad Code:
$ ls -R |
All times are GMT -5. The time now is 07:02 PM. |