LinuxQuestions.org

Web-Scraping and src-attribute : change absolute to relative

Posted 10-26-2024 at 04:48 AM by Michael Uplawski
Updated 10-26-2024 at 04:52 AM by Michael Uplawski

I consider this a recurring task:

After downloading an HTML page together with all the images and other files it needs, the absolute links in the HTML file no longer work and must be shortened to point at the location of the downloaded files.

In the simplest cases you can use your preferred line editor to quickly strip any path that precedes the file names in the src attributes.

Whatever ... I prefer Ruby (and I do not care to bother with sed). Nokogiri...
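
For the simple case described above, the idea can be sketched in a few lines of plain Ruby. This is not the Nokogiri-based approach the entry goes on to describe; it swaps in a regular expression so that it runs with the standard library alone. The helper name `relativize_src` and the sample URL are made up for illustration, and the sketch assumes that all downloaded files sit in the same directory as the HTML file.

```ruby
# Hypothetical helper: reduce every src attribute value to its bare file name.
# Assumption: the downloaded files live next to the HTML file.
def relativize_src(html)
  # Match src="..." or src='...'; this also catches attributes that merely
  # end in "src" (e.g. data-src), which may or may not be what you want.
  html.gsub(/(\bsrc\s*=\s*)(["'])(.*?)\2/i) do
    prefix, quote, value = Regexp.last_match.values_at(1, 2, 3)
    # Assumption: query strings and fragments are not part of the saved
    # file name, so drop them before taking the last path component.
    path = value.sub(/[?#].*\z/, '')
    "#{prefix}#{quote}#{File.basename(path)}#{quote}"
  end
end

sample = '<img src="https://example.org/images/2024/logo.png" alt="logo">'
puts relativize_src(sample)
```

A regular expression is only adequate for tidy markup; for anything irregular, a real HTML parser such as Nokogiri is the safer choice.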

Generate a glossary from HTML

Posted 09-09-2017 at 07:34 AM by Michael Uplawski
Updated 08-11-2023 at 03:28 AM by Michael Uplawski (list format, formatting and outlook)

HTML2INDEX

Install as a Ruby gem:
Code:
:~$ gem install html2index
Read the RDoc: http://www.rubydoc.info/gems/html2index/1.1

This program creates an index or glossary of marked expressions in an HTML file.

The current man-page is here:

--------------------------

HTML2Index


Creates an index or glossary of marked expressions in an HTML-file
...

SIMPLE! Write man pages / Docutils and tweaks

Posted 08-01-2017 at 03:59 AM by Michael Uplawski
Updated 08-05-2017 at 01:31 AM by Michael Uplawski (... bunch of bad wording. Categories missing)

Write a bunch of man pages with Docutils and a few simple tweaks
Subtitle: Do it now!
You can write man pages in many ways, notably by coding them directly in “Troff” or “Groff”. Either way, the resulting man page can always be opened in an ordinary text editor or pager, and may look a lot like my own man page for Timequiz:

Code:
user@machine:/tmp$ more ./test.man 
.
.TH TIMEQUIZ  "" "" ""
.SH NAME
TIMEQUIZ \- play a history
...
Attached Files
File Type: txt timequiz_rst.txt (4.7 KB, 8 views)

soundless screen cast helper-tool

Posted 10-23-2016 at 02:21 PM by Michael Uplawski
Updated 10-23-2016 at 03:48 PM by Michael Uplawski

The real reason for publishing this entry is the distribution of my very first screencast in a local distribution network for locally produced organic food.

I have no use for sound in my screencast. While playing around with “recordMyDesktop”, I wondered how I could type instructions in a terminal or text editor, show manipulations in software and switch between the two without getting lost, forgetting details or making dumb typos all the time....

Transforminator

Posted 09-08-2016 at 03:03 PM by Michael Uplawski
Updated 12-31-2023 at 06:32 AM by Michael Uplawski (version 1.1.7)

Edit 31/12/2023: A new version 1.1.7 of Cremefraiche is available on rubygems.org. It includes some bug fixes, the license is now wtfpl-2, and more HTML garbage is handled so that the PDF stays readable.
Years ago, I wrote a Ruby program which converts email to PDF, then ignored it.
Discussions of GTK3 and the pros and cons of the decisions taken by the GNOME and GTK developers reawakened my interest in the program, as it comes with an optional GTK3 user interface.
...
Attached Images
File Type: pdf CremeFraicheGui.pdf (408.7 KB, 12 views)

  



All times are GMT -5. The time now is 04:19 PM.
