LinuxQuestions.org
Go Job Hunting at the LQ Job Marketplace
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices

Reply
 
Search this Thread
Old 08-13-2007, 09:42 AM   #1
linker3000
LQ Newbie
 
Registered: May 2007
Posts: 9

Rep: Reputation: 0
Processing data from a 'foreign' database with mysql, or tools to pre-process data.


Hi,

I am working on a scheme to bring some data across from an old, but still in use, system so that we can process it for use with our (Linux-based) mailing list software (relax, it's part of our customer base - not yet more spam!).

The data is hosted on a server running an operating system called THEOS and I have no problems pulling over the files by FTP. The file structure is described as 'THEOS ISAM' and as far as I can see from a quick glance, comprises fixed length records with various field delimiters.

I need to either get the data into tab-delimited format or perhaps directly into a mysql database so it can be manipulated. In the past when I have done this in the MS world, I'd use a combination of BASIC and Excel, but since the target app is Linux-based, I'd like to go straight from THEOS -> Linux.

I can hack around in perl, PHP etc. to varying degrees, but wondered if there were any import pre-processing front ends for use with mysql that make it easy to define and test record structure definitions that can then be used as import filters? Any tools worth having a look at before I do some manual data structure dissection?

Hope that make sense!?

Thanks
 
Old 08-14-2007, 09:36 PM   #2
jlinkels
Senior Member
 
Registered: Oct 2003
Location: Bonaire
Distribution: Debian Wheezy/Jessie/Sid, Linux Mint DE
Posts: 4,188

Rep: Reputation: 523Reputation: 523Reputation: 523Reputation: 523Reputation: 523Reputation: 523
This kind of work I do with AWK. For current project I need to convert the output of a huge Paradox export to something which I can import in MySQL.

What the script does:

- Filter records matching certain criteria
- Check values for null values, either generate an error message or replace it by \N if null is allowed
- Concatenate strings
- Replace strings with numbers if they must become enumerated values for my database
- Print each record in a tab delimited format.

The Paradox files are sent to me in a regular fashion, I call awk from a bask script, and loads the result in the database.

It takes you about a day or so to understand awk and crank out the basic code.

If your export is a CSV file with quoted strings, do not try to process this with AWK. Simple cases are fine, but if you find a string like:
Code:
onevalue, "Sherlock asked: "What do you think, Watson?"", nextvalue
AWK chokes on that and many unsuccesful attempts have been made to implement this. Find a CSV preprocessor for that. If you can't find it, I have a good public domain application. It used to be available on the internet but it seems to have disappeared. If you need it, let me know.

jlinkels
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
adding data to mysql database ? rharris72 Programming 6 01-30-2006 04:07 PM
need to get data from xml file to MySQL database, and then use php to access Armon Linux - General 1 01-18-2006 03:54 PM
Data Processing joelhop Linux - General 8 01-01-2006 09:08 PM
pulling data from mysql database zuessh Linux - Software 5 05-29-2003 10:13 AM
Manage data and maintenance of MySQL database online rhuser Linux - Software 2 02-17-2003 09:31 PM


All times are GMT -5. The time now is 03:34 AM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration