LinuxQuestions.org
Help answer threads with 0 replies.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 12-23-2006, 05:12 PM   #16
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48

Latex is an excellent idea. I can't believe I didn't think of it earlier. Thanks man!
 
Old 12-25-2006, 12:35 PM   #17
Sepero
Member
 
Registered: Jul 2004
Location: Tampa, Florida, USA
Distribution: Ubuntu
Posts: 734
Blog Entries: 1

Rep: Reputation: 33
Ok, so the pdf that you output with OO was 42k. And that proves what exactly?

Perhaps it proves that the pdf module in OO is inefficient?
Did you try creating a pdf with any other program?

Perhaps it proves that pdf produces larger files when the document is ONE PAGE LONG?
Did you try creating a pdf that was 400 pages long?


In front of me right now I have a PDF book of 107 pages. Guess how big it's file size is... 1 Meg... 5 Megs... 10 Megs?


No, 265K. And here I have another pdf book, 454 pages/Size 773K. Instead of being the claimed 42K per page, they end up being a lot closer to 2k. (Which is right on par with your prototype format.)


So what's My point?
I don't think there's anything wrong with using latex, but if your idea is smaller size, I think you've completely misjudged pdf.
 
Old 12-25-2006, 10:25 PM   #18
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48
Question: Is there a program that renders a .tex without making a pdf or ps out of it. I can't seem to find one, although I could have sworn there was.

Comment: I realize that I due bear some undue prejudice against the pdf format because my primary experience of it while learning computers was using adobe acrobat viewer with IE over a dialup connection. However, the idea of my format is not pdf-hating based. I think pdf is a wonderfull format that balances flexibility with size. I'm trying to fill a different niche. Instead of balancing flexibility and size, I want to sacrifice flexibility for size. I want to try to create slightly different versions optimized for each alphabet, etc. I'm very sorry if this came off as a pdf slam. All I mean is that if PDF can be such a wonderfully flexible format, then it should be possible to make it a bit more compact by sacrificing some of that flexibility.

Note: I am not sure, but I believe that pdf employs compression, while my prototype does not. However, I do intend to add that functionality if I can find an easy way to render it.
 
Old 12-26-2006, 09:44 AM   #19
jlinkels
LQ Guru
 
Registered: Oct 2003
Location: Bonaire, Leeuwarden
Distribution: Debian /Jessie/Stretch/Sid, Linux Mint DE
Posts: 5,195

Rep: Reputation: 1043Reputation: 1043Reputation: 1043Reputation: 1043Reputation: 1043Reputation: 1043Reputation: 1043Reputation: 1043
What about a Latex to HTML converter? Those exist.

jlinkels
 
Old 12-26-2006, 01:52 PM   #20
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48
Not a bad idea, but then I'd just use html directly.
 
Old 12-26-2006, 06:42 PM   #21
Sepero
Member
 
Registered: Jul 2004
Location: Tampa, Florida, USA
Distribution: Ubuntu
Posts: 734
Blog Entries: 1

Rep: Reputation: 33
Quote:
Originally Posted by PatrickNew
Not a bad idea, but then I'd just use html directly.
You might not care for my opinion, but I think that's the best idea you've had yet. Use html directly. It is small, and when zipped, it's incredibly tiny. Plus, _everyone_ has a html browser already installed. No extra software.

Have a look here:
http://en.wikipedia.org/wiki/LaTeX

From what I can tell, Tex is basically a meta-document format. It is not meant to be used for creating documents directly. Instead, it was made for translating into other kinds of documents: txt, html, pdf, ps, odt, etc.
 
Old 12-27-2006, 12:31 AM   #22
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48
Quote:
Originally Posted by Sepero
You might not care for my opinion, but I think that's the best idea you've had yet.
I'm very sorry if the way I've been writing makes me seem angry with you. I'm not, I respect your opinion. In many ways you are right - pdf is a compact format. In a lot of ways it is questionable what gains will be made by essentially optimizing pdf for a specific subset of its features. I do care what you think, you're an active member of this board and if memory serves me right you've helped me out before. I don't mean to be insulting.

Regarding html, my only concern are certain features not supported. I can place images behind text only with a table/css/image hack and there is little way to specify a document's page height/width (although width is more important than height). However, I will research these more. Ultimately - I don't want to create a file format that will be tiny but have to expand to a much larger format in memory in order to be readable. That's exactly why myformat -> .tex -> ps is not a good idea.
 
Old 12-27-2006, 02:57 AM   #23
Sepero
Member
 
Registered: Jul 2004
Location: Tampa, Florida, USA
Distribution: Ubuntu
Posts: 734
Blog Entries: 1

Rep: Reputation: 33
I understand your concern with the html and images. It can be a bit tricky depending on your goal. Especially things like "over lapping images".
Can you explain to us what the purpose of this is?
You wish to write a book or some documentation for the OLPC project?
 
Old 12-27-2006, 03:47 AM   #24
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48
The goal is primarily a learning experience for myself - with the results to be submitted to OLPC if this learning experience yeilds a usable result. The idea is to write a file format similar to pdf or ps, but with a smaller feature set and smaller final binary file size. Basically, to create a more compact pdf, so that OLPC users can fit more e-books onto their OLPC CF cards or spend less space on the same books and have more room for other things.

My current problem is that I don't have the programming skills to write a rendering engine for an entirely new binary format. I'm hoping to find a good intermediate format that I can test and develop my new format with. IE create a document in my intermediate format and convert it. To view the file, temporarily convert back to the intermediate format. Ideally the intermediate format is only temporary, until I can write the rendering engine - otherwise one would just store the document in the intermediate format.
 
Old 12-27-2006, 06:43 PM   #25
Sepero
Member
 
Registered: Jul 2004
Location: Tampa, Florida, USA
Distribution: Ubuntu
Posts: 734
Blog Entries: 1

Rep: Reputation: 33
No offense, but I don't think you quite get "it". People write books in pdf and html. That is all there is to it. Your plan can only succeed if after you finish it, you do one of two things:
1. Convert every pdf and html book that already exists to your format.
2. Run a huge campaign to convince authors that your format is somehow technically better than the other options.

To be perfectly honest, I don't see your plan working. If I wanted a "really small" document, I'd use a program like 'pdftotext'. Then I'd have the smallest format possible, plain text. Lastly, I could gzip it, and view it with 'zless'. (I've done this before)
 
Old 12-27-2006, 09:29 PM   #26
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48
1. Ideally, I'd be writing a pdf to newformat converter. OLPC notebooks are headed to areas that for the most part don't have a internet infrastructure. These laptops are being designed so that they can be human powered because they're intended to be sent to many areas that don't have available electricity. I don't think we're going to have too much trouble with kids surfing around online and being unable to open the pdf's they stumble across. Most e-books will have to be physically sent on physical media (likely an SD card) and it wouldn't be that hard for the person with a full-featured first-world computer to toss it through the converter.

2. I don't need to convert every document, because I'm not expecting OLPC to remove pdf and html viewing software from their computers. I want to provide one additional option, not completely replace a well-established standard.

3. I don't need this to "succeed". I've said before I'm doing this as a learning experience. *Then* if it yeilds a usable result I'll submit it to OLPC. Yes - I am reinventing the wheel, but primarily as a tool to help me understand how the wheel was built. If I can make a wheel that's a little better for driving toy cars on instead of using wheels designed for bicycles - then so be it. If not - then OK. I'm not writing this with any delusion that it will ever become a standard.

4. If I got this format to work exactly as I want it, it would be smaller than plain text. By pushing a pdf through pdf2text, any images will obviously be stripped out, and only plain text portions will remain. One of the ways I seek to reclaim size is by optimizing the character set for the language. For English, instead of 8-bit ASCII, I'm using a 7-bit character set and I just stripped out non-printing characters. After allowing for all letters (capital and lowercase), numbers and symbols present on a standard US QWERTY keyboard (plus a couple) I still have some unused characters. These are used as tags in an XML-esque style way. One-character tags. If we have two comparable documents (plain text only) the newformat one will be smaller since each character is 7 bits not 8. The ASCII will have the advantage for very small files because it doesn't waste the 86 bits on a header, but if the file runs more than 86 characters the newformat has an advantage. Admittedly, I don't know how this affects gzipping.
 
Old 12-27-2006, 10:01 PM   #27
PatrickNew
Senior Member
 
Registered: Jan 2006
Location: Charleston, SC, USA
Distribution: Debian, Gentoo, Ubuntu, RHEL
Posts: 1,148

Original Poster
Blog Entries: 1

Rep: Reputation: 48
To anyone following the post, I think I've settled on html as the intermediate format. I'll have to be happy with the hack for placing an image behind text and I've decided on using a table with width fixed by a pixel value to separate pages.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Best HTML editor web250 Linux - Software 7 07-11-2007 01:08 AM
HTML editor rblampain Programming 9 07-26-2005 09:30 AM
HTML Editor MrTheGeek Linux - General 9 12-14-2004 05:26 AM
html editor c0c0deuz Linux - Software 2 12-06-2003 03:45 PM
html editor i_is_cat Linux - Software 8 05-15-2003 10:40 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 11:59 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration