LinuxQuestions.org
12-17-2010, 10:51 PM   #1
Galib
Member
 
Registered: Mar 2009
Location: $HOME
Distribution: Slackware64
Posts: 69

Rep: Reputation: 17
website data extract


What would be the best way to extract data by sending queries to a website?

The data comes in table format.
 
12-18-2010, 02:35 AM   #2
devnull10
Member
 
Registered: Jan 2010
Location: Lancashire
Distribution: Slackware Stable
Posts: 572

Rep: Reputation: 120
How do you want to extract it, and what do you want to do with it?
 
12-18-2010, 04:55 AM   #3
j-ray
Senior Member
 
Registered: Jan 2002
Location: germany
Distribution: ubuntu, mint, suse
Posts: 1,591

Rep: Reputation: 145
My personal favourite (assuming you want to analyze a specific table somewhere on the web):
1. Write a Perl script.
2. Use LWP: create a UserAgent and fetch the page.
3. Extract the table content from the result with regular expressions.
4. Split the content into an array of rows with regular expressions.
5. Split each row into its cells with regular expressions.
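
For anyone who wants to see those steps spelled out, here is a rough sketch. It is written in Python rather than Perl, with urllib standing in for LWP, so treat it as an illustration of the same idea and not as the script described above. The names fetch, extract_table and SAMPLE, the User-Agent string and the optional params argument are all made up for this example. It only looks at the first table on the page, and the regular expressions assume a simple table with no nested tables inside it.

```python
import html
import re
import urllib.parse
import urllib.request

# Step 2: fetch the page with a custom User-Agent.
# urllib is used here in place of Perl's LWP; the agent string is a placeholder.
def fetch(url, params=None, user_agent="table-extract-sketch/0.1"):
    if params:
        # Send the query as URL parameters (assumes the site accepts GET queries).
        url = url + "?" + urllib.parse.urlencode(params)
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")

# Steps 3-5: one regular expression per level (table, row, cell).
TABLE_RE = re.compile(r"<table\b[^>]*>(.*?)</table>", re.IGNORECASE | re.DOTALL)
ROW_RE = re.compile(r"<tr\b[^>]*>(.*?)</tr>", re.IGNORECASE | re.DOTALL)
CELL_RE = re.compile(r"<t[dh]\b[^>]*>(.*?)</t[dh]>", re.IGNORECASE | re.DOTALL)
TAG_RE = re.compile(r"<[^>]+>")

def extract_table(page):
    """Return the first table in the page as a list of rows (lists of cell text)."""
    match = TABLE_RE.search(page)            # step 3: cut out the table body
    if match is None:
        return []
    rows = []
    for row_html in ROW_RE.findall(match.group(1)):   # step 4: split into rows
        cells = [
            html.unescape(TAG_RE.sub("", cell)).strip()  # drop inner tags, decode entities
            for cell in CELL_RE.findall(row_html)        # step 5: split into cells
        ]
        if cells:
            rows.append(cells)
    return rows

# Inline sample so the parsing part can be tried without touching the network.
SAMPLE = """
<html><body>
<table border="1">
  <tr><th>Name</th><th>Qty</th></tr>
  <tr><td><b>Apple</b></td><td> 3 </td></tr>
  <tr><td>Fish &amp; chips</td><td>12</td></tr>
</table>
</body></html>
"""

if __name__ == "__main__":
    for row in extract_table(SAMPLE):
        print(row)
```

Against a real site you would call extract_table(fetch(url, params)) with the site's own URL and query parameters. If the markup turns out to be messier than this (nested tables, unclosed tags), a real HTML parser is likely to hold up better than regular expressions.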
 
  

