ProgrammingThis forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
yeah, the collision thing did cross my mind, I don't know what the statistical probability is I guessed it was very low.
using md5sum was just a hack
The thing is.
If all this data is in a database then it is probably better to query the database directly instead of exporting data and messing about with grep, awk, cut, uniq, diff.
I'm sure a solution can be achieved with either bash, awk, perl, python.
( not a script using all of those, but a script written for either one )
Ultimately executing a report run on the database is going to be far more powerful/flexible and easier once the time has been invested.
yeah, the collision thing did cross my mind, I don't know what the statistical probability is I guessed it was very low.
using md5sum was just a hack
The thing is.
If all this data is in a database then it is probably better to query the database directly instead of exporting data and messing about with grep, awk, cut, uniq, diff.
I'm sure a solution can be achieved with either bash, awk, perl, python.
( not a script using all of those, but a script written for either one )
Ultimately executing a report run on the database is going to be far more powerful/flexible and easier once the time has been invested.
i agree, the mainframe dataset is pulled from db2, vsam, i.d.m.s., websphere mqueues and other datasets.
to get an additional column added, business solutions delivery would have to create a service request; then, app-dev would need to design it; and, enterprise release management would need to approve the budget.
i agree, the mainframe dataset is pulled from db2, vsam, i.d.m.s., websphere mqueues and other datasets.
to get an additional column added, business solutions delivery would have to create a service request; then, app-dev would need to design it; and, enterprise release management would need to approve the budget.
I have to say that, to me, this is screaming out to be done in Perl (eg Firerat post #16).
This sort of data munging is something it's excellent at (I used to do a lot of this sort of thing) and it may help to know that both arrays and hashes are both first class data types in Perl.
Regexes are also built-in.
it is a shame you can not install sqlite3, or get the CSV data to a machine that does have it installed.
I assume some policy is in place limiting the installation of programs or the export of data files.
if you provide data samples which resemble the real data, and fully explain the "rules of the game" we can come up something.
until then, you are on your own.
i actually have a sqlite instance on a redhat virtual server that isnt locked down as far as hipaa data is concerned. this a.i.x. server doesnt have it and i aint got r00t.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.