ProgrammingThis forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
I'm currently writing a bash shell script that needs to take a tab delimited text file and convert it into a MySQL importable file. I have no experience with gawk (which is what I'm assuming I'd use - if not, please don't hesitate to correct me) - but this is what I'm looking to do:
Original File (the <TAB> is just representative of an actual tab):
That just puts quotes around the whole thing.
"1123432 114 Oceanside drive 3324|4432|4432|2234 11234.jpg"
Sliptwixt that puts quotes around the first 4 fields.
"1123432" "114" "Oceanside" "Drive"
Use [[:cntrl:]] with sed or grep to look for control characters. If <Tab> is the only control character then something like this would work:
That would replace the three <Tab's> with a " but unfortunately I'm sure there are other return characters, like linefeeds and or carrage returns.
I think Chr$(9) is the tab. So you need to make that the field delimiter.
I assumed the format was consistant and this was a quick-and-dirty one time thing. If it were something I'd have to revisit more than once, I'd opt to script it in Perl or something so I have a little more control over inconsistancies in the datafile and/or some kind of error reporting.
I hope you find the solution that works best for you.