LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Go Back   LinuxQuestions.org > Forums > Non-*NIX Forums > Programming
User Name
Password
Programming This forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.

Notices


Reply
  Search this Thread
Old 01-05-2024, 09:19 AM   #16
boughtonp
Senior Member
 
Registered: Feb 2007
Location: UK
Distribution: Debian
Posts: 3,708

Rep: Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627


An XML parser is not a HTML parser.

With controlled HTML, pre-processing before parsing can be safe (depending on specifics), but with uncontrolled HTML it is asking for bugs.

 
Old 07-19-2024, 03:59 AM   #17
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,639

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
Quote:
Originally Posted by boughtonp View Post
An XML parser is not a HTML parser.

With controlled HTML, pre-processing before parsing can be safe (depending on specifics), but with uncontrolled HTML it is asking for bugs.

You did not read my posts, that is asking for misunderstanding.
 
Old 07-19-2024, 04:34 AM   #18
NevemTeve
Senior Member
 
Registered: Oct 2011
Location: Budapest
Distribution: Debian/GNU/Linux, AIX
Posts: 4,924
Blog Entries: 1

Rep: Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886Reputation: 1886
By now, one could have checked the relation between XML, SGML, HTML and XHTML. Neither of those is meant to be parsed via regular expressions.
 
Old 07-19-2024, 06:51 AM   #19
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,639

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
Quote:
Originally Posted by NevemTeve View Post
By now, one could have checked the relation between XML, SGML, HTML and XHTML. Neither of those is meant to be parsed via regular expressions.
Why do you keep talking about XML when the problem had nothing to do with that? I *am* parsing HTML and XML with an XML/HTML-parser. This has not been the question.
 
Old 07-19-2024, 07:40 AM   #20
boughtonp
Senior Member
 
Registered: Feb 2007
Location: UK
Distribution: Debian
Posts: 3,708

Rep: Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627Reputation: 2627
Quote:
Originally Posted by Michael Uplawski View Post
You did not read my posts, that is asking for misunderstanding.
I cannot guarantee what I did back in January, but I stand by both of the posts I made in this thread, and have nothing more to add.

I have no idea why you have responded half a year later on a thread marked solved just to say that.

 
1 members found this post helpful.
Old 07-20-2024, 12:04 AM   #21
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,639

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
I abstain from using the Web as much as possible and was not aware of your post.
 
  


Reply

Tags
regexp syntax


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
perl: how to insert numerical digit immediately after regexp backreference variable? chadwick Programming 8 05-19-2008 12:49 PM
SED, regexp or such - remove text after space aolong Linux - General 5 03-07-2008 02:36 PM
regexp question rytrom Linux - Newbie 3 09-01-2003 12:50 PM
validating a surname - regexp fu chr15t0 Programming 2 06-20-2003 05:55 AM
Regexp stumper lackluster Programming 2 11-02-2002 12:31 AM

LinuxQuestions.org > Forums > Non-*NIX Forums > Programming

All times are GMT -5. The time now is 05:35 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration