Quote:
Then processing is performed on the data structure. For modularity/extensibility data structures are exported and imported by next consumers in the data processing chain. I've dealt with huge amounts of data - be it VLSI design, static timing analysis, VLSI verification, ASIC standard library cells characterization, acoustic modeling, whatever - the approach with data structures always works and is the book approach. |
Quote:
For SW to be good one needs competition - as everywhere else. I do not think quantum chemistry SW is widely used, so I do not expect competition in the field. There are well known and highly regarded data formats/approaches used in scientific calculations, for example, HDF: http://www.hdfgroup.org/ . |
Ok, I will rephrase that "easiest to learn" comment
Which language has commands/functions that are most naturally implemented to perform these tasks. For example: If awk has a find_the_first_word_after_this_string("Insert string here") command, or If perl has a grab_text_between_these_two_strings("string1", "string2") command, then it is quite easy to decent which language is best suited for which task. I am ignoring performance because it seems that no consensus is coming any time soon regarding that. In any case, the fact that two senior members cannot reach a consensus about it means to me that awk and perl have only marginal differences in performance. Hence I place my main priority on implementation. |
I am having trouble keeping up. Give me a moment to review all the posts. I missed one directly referring to GAMESS with a link to some kind of cookbook.
|
Quote:
Quote:
|
Quote:
Though Perl can do anything 'awk' can do. The correct questions emanate from the understanding of the whole data parsing and processing mission. My whole experience tells me that 'awk' is insufficient for this. Or, in other words, relying on tools of limited capability (like 'awk') perpetuates data mess. Another issue to consider - there are more than 15000 (8368 authors 18244 modules) Perl modules available at http://www.cpan.org/ -> http://search.cpan.org/ . I.e. pretty much every standard programming task is already implemented in some kind of Perl module. |
Ok, those perl scripts are indeed the type of thing I am looking for. That is not to say that the previously mentioned awk scripts would not work either. I was going to put these scripts in separate files anyway so the user would be able to invoke them at his/her convenience. Therefore, there is nothing from stopping me from writing one command perl_getafterstring, and another awk_getafterstring. I can test both--although I suspect the performance will vary and average performance of each will be very close. I am guessing this will come down to personal preference and case by case problems.
|
Thank you very much for both of your input. I will try to put both in my software.
|
Quote:
please don't cloud the newbie mind with blatant lies. Awk is perfectly sufficient for what he is doing. |
Quote:
|
Quote:
|
Hi Feynman - try to ignore any bickering. May I ask if you are happy to progress on your own now or do you still require help?
I had a look at the file you attached. I am assuming this is only the input data? (I didn't read all of it just skimmed) If you are still working on a solution that requires help, maybe using the data from this file you could give an example output that satisfies what you are looking for? If not required anymore, good luck :) |
Quote:
Because for me the considerations are obvious. I do not care that 'awk' can in some case be faster than Perl because in the grand scheme of things (WRT data parsing and consequent data processing) it's not an issue. |
Thank you grail. Well, with the given awk commands, I have 4/5 task covered. My description of the unsolved task was admittedly vague so I rewrote it in an earlier post. I will work on/copy-past from that cookbook site/get some help with perl programs that do the equivalent. I will try to have both available as separate commands for my program. I assume if I am using more than one cpu and if I have the right software installed that these text searches will be automatically redistributed across my other cpus. I really do not have any experience in parallelization, but I do have access to more than one cpu.
|
Quote:
|
All times are GMT -5. The time now is 10:38 PM. |