Old 03-04-2013, 04:23 AM   #1
LQ Newbie
Registered: Mar 2013
Posts: 1

Rep: Reputation: Disabled
what's a good piece of software to extract data from xml files?


I am trying to build a single word list (Unicode text file) by extracting entries from a series of XML files.
What would be the best software for this task?
And how should I go about it?
I have never tried anything similar, so I am a bit lost, but I am very willing to learn.
All the best to all of you,

P.S. Just in case, here is a zip file with the XML files.
I am trying to extract only what's between the <hdwd> </hdwd> tags.
Old 03-04-2013, 05:56 AM   #2
Registered: Nov 2011
Location: Cairo, Egypt
Distribution: CentOS, RHEL, Fedora
Posts: 300
Blog Entries: 2

Rep: Reputation: 51
If you want to extract the data in CSV format, you can use this site, but the file size limit is 4 MB.
Old 03-04-2013, 07:08 AM   #3
LQ Guru
Registered: Aug 2004
Location: Sydney
Distribution: Centos 6.9, Centos 7.3
Posts: 17,395

Rep: Reputation: 2395
I'd recommend a good XML tool.

Do not try to hand-code a solution; it's a lot more complex than it looks.
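
As a rough illustration of the "use a real XML parser" advice, here is a minimal Python sketch built on the standard library's xml.etree.ElementTree. It is only a sketch under assumptions not confirmed in this thread: the files are well-formed XML with a .xml extension, the <hdwd> elements carry no namespace prefix, and the files do not rely on custom DTD entities (ElementTree rejects undefined entities). The function names and the directory/file names in the usage note are hypothetical.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def extract_headwords(xml_path, tag="hdwd"):
    """Return the text of every <tag> element found in one XML file."""
    root = ET.parse(xml_path).getroot()
    words = []
    for elem in root.iter(tag):
        # itertext() also picks up text nested inside child elements.
        text = "".join(elem.itertext()).strip()
        if text:
            words.append(text)
    return words


def build_word_list(xml_dir, out_file, tag="hdwd"):
    """Write one headword per line (UTF-8) from all *.xml files in xml_dir.

    Returns the number of headwords written.
    """
    count = 0
    with open(out_file, "w", encoding="utf-8") as out:
        # sorted() keeps the output order stable between runs.
        for xml_path in sorted(Path(xml_dir).glob("*.xml")):
            for word in extract_headwords(xml_path, tag):
                out.write(word + "\n")
                count += 1
    return count
```

Calling build_word_list("xml_files", "wordlist.txt") would then produce the word list, assuming the archive was unpacked into a directory named xml_files. If the real files do use a namespace, the tag would have to be given in the "{namespace-uri}hdwd" form instead.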
Old 03-05-2013, 10:53 AM   #4
David the H.
Bash Guru
Registered: Jun 2004
Location: Osaka, Japan
Distribution: Debian sid + kde 3.5 & 4.4
Posts: 6,823

Rep: Reputation: 1960
Just as a bit of advice, I don't think anybody's going to want to download a 17 MB (!) archive of files just to test out possible solutions for you.

You're more likely to get real help if you post one or two short (but well-formed) examples of the xml here instead, and the output you want from it, in the format that you want it in.

And if you do, please use [code][/code] tags around your code and data, to preserve the original formatting and to improve readability. Do not use quote tags, bolding, colors, "start/end" lines, or other creative techniques.

Last edited by David the H.; 03-05-2013 at 10:54 AM.


All times are GMT -5.
