LinuxQuestions.org
Old 10-30-2009, 11:37 PM   #1
satimis
Senior Member
 
Registered: Apr 2003
Posts: 3,695

Rep: Reputation: 56
simple html editor


Hi folks,

Debian 5.0

I need a simple HTML editor to edit .html files converted from .pdf files, because some icons are converted to strange codes. I don't expect to need Emacs for such a simple job. Please advise.

Can I use a text editor to do the job? If yes, how?

TIA

B.R.
satimis

Last edited by satimis; 10-30-2009 at 11:41 PM.
 
Old 10-30-2009, 11:41 PM   #2
murankar
Member
 
Registered: Jan 2008
Location: Cleveland Ohio
Distribution: Current CentOS 5.6
Posts: 118

Rep: Reputation: 20
Nvu, Bluefish, Quanta Plus. There's three to get you started.
 
Old 11-01-2009, 09:26 PM   #3
jlinkels
LQ Guru
 
Registered: Oct 2003
Location: Bonaire, Leeuwarden
Distribution: Debian /Jessie/Stretch/Sid, Linux Mint DE
Posts: 5,195

Rep: Reputation: 1043
If you are overwhelmed by Quanta, try Kate. Kate is basically the editor window of Quanta. Quanta offers more features, like project management and upload management.

jlinkels
 
Old 11-01-2009, 10:01 PM   #4
satimis
Senior Member
 
Registered: Apr 2003
Posts: 3,695

Original Poster
Rep: Reputation: 56
Hi murankar and jlinkels,


Thanks for your advice.

I have Bluefish and KompoZer running here. Actually, what I need is to convert .pdf files (e-documents) to .html files for easy reading, so I can move forward and backward with a few clicks. That is my purpose, nothing else.

The aforementioned software can convert .pdf to .html, but the icons, such as bullets, arrows, etc., are converted to codes. I need to change the codes back to icons, which I can do, but only manually, one by one.

"Find & Replace" doesn't work, because I can't insert the icons into the "Find & Replace" window. So the corrections will take me a long time. Furthermore, I have more than 10 .pdf files to convert to .html files, and each document has more than 30 pages. That is my background story.


B.R.

Last edited by satimis; 11-01-2009 at 10:03 PM.
 
Old 11-01-2009, 10:07 PM   #5
Elv13
Member
 
Registered: Apr 2006
Location: Montreal,Quebec
Distribution: Gentoo
Posts: 825

Rep: Reputation: 129
Do a bash script with sed doing the job for you. There must be some kind of pattern usable for scripting.
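
A minimal sketch of what such a script could look like, not a tested solution for the files in this thread. CODE_BULLET and CODE_ARROW are hypothetical stand-ins for whatever strings the converter actually left behind, the demo file is generated on the spot, and the HTML entities &bull; and &rarr; are just one possible choice of replacement. It loops over every .html file in a directory and writes corrected copies into a separate folder, so the originals stay untouched.

```shell
#!/bin/sh
# Sketch only: CODE_BULLET and CODE_ARROW are hypothetical placeholders for
# the strings the converter really produced; demo.html is made-up sample data.
workdir=$(mktemp -d)
printf 'CODE_BULLET First item<br>\nCODE_ARROW Next page<br>\n' > "$workdir/demo.html"

mkdir "$workdir/fixed"
for f in "$workdir"/*.html; do
    # One -e per substitution; the trailing g replaces every match on a line.
    # A literal & in the replacement is written \& because a bare & means
    # "the text that was matched".
    sed -e 's/CODE_BULLET/\&bull;/g' \
        -e 's/CODE_ARROW/\&rarr;/g' \
        "$f" > "$workdir/fixed/$(basename "$f")"
done

# Show the corrected copy.
cat "$workdir/fixed/demo.html"
```

With more than 10 documents, pointing the loop at the directory that holds the converted files would process them all in one run.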
 
Old 11-01-2009, 10:17 PM   #6
satimis
Senior Member
 
Registered: Apr 2003
Posts: 3,695

Original Poster
Rep: Reputation: 56
Quote:
Originally Posted by Elv13 View Post
Do a bash script with sed doing the job for you. There must be some kind of pattern usable for scripting.
Hi Elv13,


Thanks for your advice.

Could you please give me more detail on running the sed command?

E.G.
After converting .pdf to .html

bullet converted to •

arrow converted to !

" (inverted comma) converted to �

etc.


TIA


B.R.
satimis
 
Old 11-03-2009, 06:53 PM   #7
murankar
Member
 
Registered: Jan 2008
Location: Cleveland Ohio
Distribution: Current CentOS 5.6
Posts: 118

Rep: Reputation: 20
There are special codes you can use for special characters, such as:

Code:
&#44 = ,
Code:
&#33 = !
End each of those codes with a ";". I left it off so it does not get converted here.
And so on.

If that is what you need, then find and replace may work. Let us know if that works for you.
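
For anyone who wants to look up such a code without a table, here is a small sketch. char_ref is a hypothetical helper name, and it is only meant for plain ASCII characters like the two above; other characters depend on the locale and are not covered here. It relies on the printf rule that an argument starting with a quote is read as the numeric value of the character that follows.

```shell
#!/bin/sh
# Print the decimal numeric character reference for one ASCII character.
# char_ref is a hypothetical helper name, not something from this thread.
char_ref() {
    # A leading quote in a printf numeric argument yields the numeric
    # value of the character that follows it.
    printf '&#%d;\n' "'$1"
}

char_ref ','
char_ref '!'
```

Run as written, this prints &#44; for the comma and &#33; for the exclamation mark, this time with the closing ";" included.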

Last edited by murankar; 11-03-2009 at 06:55 PM.
 
Old 11-04-2009, 06:23 AM   #8
Shadow_7
Senior Member
 
Registered: Feb 2003
Distribution: debian
Posts: 4,137
Blog Entries: 1

Rep: Reputation: 874
Bluefish, Anjuta, and many others if you need a GUI. I generally use nano on the CLI.

It should be scriptable.

$ egrep -r -i "stuff" ./*.html

to find what has "stuff".

$ cat file.html | sed 's/stuff/newstuff/' | tee newfile.html

Always best to keep the original UNTIL you're sure that it did what you "expected" it to do. You could just use | more instead of tee, although a file lets you use an editor with easy / fast scrolling and/or a better font. Is that an O or an 0? An I, l, |, 1 or just my cursor? U or V or W or ||? Bear in mind that sed patterns are regular expressions (regex), so any non-alphabetic, non-numeric characters probably need a backslash escape (\). But the syntax is 's/old/new/' to swap one thing for another with sed.
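
The find, replace and compare steps can be sketched roughly as follows. The sample text and the search string fig.1 are made up for illustration. Two details differ from the one-liners above: the pattern escapes the dot so it only matches a literal dot, and a trailing g is added so that every match on a line is replaced rather than just the first.

```shell
#!/bin/sh
# Sketch with made-up sample text; the strings being replaced are hypothetical.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf 'see fig.1 and fig.1 again\nsee figX1 here\n' > file.html

# 1. Find the lines that contain the literal text "fig.1".
#    The dot is escaped; unescaped, it would match any character.
grep -n 'fig\.1' file.html

# 2. Write the result to a new file and leave the original untouched.
#    The trailing g replaces every match on a line, not just the first.
sed 's/fig\.1/figure 1/g' file.html > newfile.html

# 3. Compare before and after. diff exits non-zero when the files differ,
#    so "|| true" keeps that from looking like a failure.
diff file.html newfile.html || true
```

Only when the diff shows exactly the expected changes is it worth replacing the original with the new file.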
 
Old 11-04-2009, 08:11 AM   #9
pixellany
LQ Veteran
 
Registered: Nov 2005
Location: Annapolis, MD
Distribution: Mint
Posts: 17,809

Rep: Reputation: 743
Quote:
Originally Posted by Elv13 View Post
Do a bash script with sed doing the job for you. There must be some kind of pattern usable for scripting.
That's the last answer I would have predicted in response to someone asking about html editors......

For the OP: I like Bluefish, but I have not tried most of the other options.

Note that OpenOffice can also be used.
 
Old 11-04-2009, 10:28 AM   #10
satimis
Senior Member
 
Registered: Apr 2003
Posts: 3,695

Original Poster
Rep: Reputation: 56
Quote:
Originally Posted by Shadow_7 View Post
Bluefish, Anjuta, and many others if you need a GUI. I generally use nano on the CLI.

It should be scriptable.

$ egrep -r -i "stuff" ./*.html

to find what has "stuff".

$ cat file.html | sed 's/stuff/newstuff/' | tee newfile.html

Always best to keep the original UNTIL you're sure that it did what you "expected" it to do. You could just use | more instead of tee, although a file lets you use an editor with easy / fast scrolling and/or a better font. Is that an O or an 0? An I, l, |, 1 or just my cursor? U or V or W or ||? Bear in mind that sed patterns are regular expressions (regex), so any non-alphabetic, non-numeric characters probably need a backslash escape (\). But the syntax is 's/old/new/' to swap one thing for another with sed.
Hi Shadow_7,

Thanks for your advice.

Before proceeding further, I'd like to clarify the following first:

I ran
$ pdftohtml spss.pdf

It generates 3 files:
Code:
spss.html
spss_ind.html
spsss.html

Reading spsss.html with a text editor, it displays:
Code:
....
...
•&nbsp;&nbsp;You need to use options or procedures that are not available using&nbsp;interactive mode.&nbsp;<br>
....
...

Browsing spsss.html, it displays:
Code:
....
...
•  You need to use options or procedures that are not available using interactive mode. 
....
...

On running:
$ egrep -r -i "•" spsss.html

there is no output.


What should I search for? TIA
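
One way to see what is really stored in such a file is to list its character references and any non-ASCII bytes directly. This is only a sketch under an assumption the thread does not confirm: that the file stores the bullet as a numeric reference such as &#8226; rather than as the character itself, which would explain a search for the typed character finding nothing. sample.html is a made-up stand-in for the converter's output.

```shell
#!/bin/sh
# Sketch: sample.html is invented test data, not the real spsss.html.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '&#8226;&nbsp;First point<br>\n&#8226;&nbsp;Second point<br>\nplain line<br>\n' > sample.html

# Count each distinct numeric character reference (such as &#8226;).
# The -o option prints only the matching part of each line.
grep -o '&#[0-9]*;' sample.html | sort | uniq -c

# List lines holding bytes outside printable ASCII (tabs also match).
# LC_ALL=C makes grep compare raw bytes instead of locale characters.
LC_ALL=C grep -n '[^ -~]' sample.html || echo "no non-ASCII bytes found"
```

Whatever strings the first command reports are plain ASCII, so they can be typed into a sed pattern or a "Find & Replace" box as they are.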

B.R.
satimis

Last edited by satimis; 11-04-2009 at 10:29 AM.
 
Old 11-04-2009, 10:39 AM   #11
satimis
Senior Member
 
Registered: Apr 2003
Posts: 3,695

Original Poster
Rep: Reputation: 56
Quote:
Originally Posted by pixellany View Post
...
....
Note that OpenOffice can also be used.
Hi pixellany,


Thanks for your advice. I'm running OOo 2.4 here. The PDF extension for OOo is only available for version 3.0+:
http://extensions.services.openoffice.org/node/2587

Therefore I can't make it work here.

B.R.
satimis
 
  

