LinuxQuestions.org
Did you know LQ has a Linux Hardware Compatibility List?
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie
User Name
Password
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Notices

Reply
 
Search this Thread
Old 03-04-2013, 04:23 AM   #1
clemensclemens
LQ Newbie
 
Registered: Mar 2013
Posts: 1

Rep: Reputation: Disabled
what's a good piece of software to extract data from xml files?


Hello,

I am trying to make one word-list (unicode txt) extracting entries from a series of xml files.
What would be the best software that can accomplish the task?
And what should I do?
I have never tried doing something similar, so I am a bit lost, but I am very willing to learn
All the best to all of you,
Cheers,

Clemens
P.s. just in case, here is a zip file withthe xml files
http://dfiles.eu/files/wd92zyffr
I am trying to extract only what's in between the <hdwd> </hdwd> tags.
Cheers
 
Old 03-04-2013, 05:56 AM   #2
fortran
Member
 
Registered: Nov 2011
Location: Cairo, Egypt
Distribution: CentOS, RHEL, Fedora
Posts: 300
Blog Entries: 2

Rep: Reputation: 50
If you want to extract data in csv format, you can use this site.
http://www.luxonsoftware.com/converter/xmltocsv
but the file limit is up to 4 MB.
 
Old 03-04-2013, 07:08 AM   #3
chrism01
Guru
 
Registered: Aug 2004
Location: Sydney
Distribution: Centos 6.5, Centos 5.10
Posts: 16,289

Rep: Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034Reputation: 2034
I'd recommend a good XML tool;
http://search.cpan.org/~mirod/XML-Twig-3.42/Twig.pm
http://search.cpan.org/~grantm/XML-S.../XML/Simple.pm

Do not try to hand code a soln; its a lot more complex than it looks.
 
Old 03-05-2013, 10:53 AM   #4
David the H.
Bash Guru
 
Registered: Jun 2004
Location: Osaka, Japan
Distribution: Debian sid + kde 3.5 & 4.4
Posts: 6,823

Rep: Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949Reputation: 1949
Just as a bit of advice, I don't think anybody's going to want to download a 17mb (!) archive of files just to test out possible solutions for you.

You're more likely to get real help if you post one or two short (but well-formed) examples of the xml here instead, and the output you want from it, in the format that you want it in.


And if you do, please use ***[code][/code]*** tags around your code and data, to preserve the original formatting and to improve readability. Do not use quote tags, bolding, colors, "start/end" lines, or other creative techniques.

Last edited by David the H.; 03-05-2013 at 10:54 AM.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] extract directory and xml data to create a comma delimited file j-me Programming 33 05-10-2012 08:57 AM
Extract Data between XML tags aharrison Linux - Newbie 13 11-17-2010 07:28 PM
After a piece of software/a bash script to mass-move files [Fedora Core 6] Alux Linux - Desktop 3 11-23-2006 02:33 PM
CD extract and playback software. What's a good program. smaudlin Linux - Desktop 2 11-01-2006 10:31 PM


All times are GMT -5. The time now is 01:39 AM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration