LinuxQuestions.org
06-15-2021, 01:25 PM   #16
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334

That's what I meant when I said it's sometimes hard to find the error. The one I had came from my clicking-finger sneaking in a click while I was changing my Validator link code for a blue button to match my new color scheme, and I accidentally pasted the link into the meta tag section.

It kept returning an error for a character in a different link, and continued to do so after I deleted the link. I had to look at the whole page to see it.
 
06-15-2021, 02:19 PM   #17
business_kid
LQ Guru

Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,292

Rep: Reputation: 2322
You guys are making me very glad I never got into software, although I was often pushed in that direction.
 
06-15-2021, 05:18 PM   #18
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334
You've never opened Leafpad?

Typing out XHTML or CSS in a text editor and running it through the validator is about as deep into software as it gets for me. Unless you consider using Gimp to manipulate images getting into software, though I will admit it took me longer to figure out Gimp than it did Linux.

I learned to write XHTML and CSS at w3schools.com for free. You can learn XML, JS, Python, PHP, etc., you just have to apply yourself.
 
06-16-2021, 08:41 AM   #19
business_kid
LQ Guru

Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,292

Rep: Reputation: 2322
I have a son at programming, and listening to him, I got very glad I never started.
He ended up writing stuff in about 10 languages, then got into serious work and had to learn many more.
I did Assembler, VHDL, & PLC languages, but no real programming that wasn't hardware related.

Python I have respect for. But (there's always a but) how do you find the already written module that already does what you want, or the C/C++ library that implements your function? There must be an awful lot to learn besides syntax. I intend only to read Web stuff, not actually write it.
 
06-16-2021, 05:01 PM   #20
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,622

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
Folks.
This whole thread is not about programming.

I could have started it by pointing at somebody else's work without having any clue about how she/he did it.

And even the latest remarks concerned HTML which is a Markup-language. I still am against calling “writing HTML” programming. It is “writing HTML”, like when you underline text in a text-processor. There are just more things you can do in HTML. If you must use the HTML–maybe–5 thing, call it as you please but do not confound programming with “writing HTML”. Sorry to those who call HTML their only “programming language”. It isn't.

“When you feel there is an unfair burden on your shoulders – Well that's just the way it is sometimes” (Forrest Gump/Winston Groom)
 
06-16-2021, 09:00 PM   #21
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334
Quote:
Originally Posted by Michael Uplawski
Folks.

And even the latest remarks concerned HTML which is a Markup-language. I still am against calling “writing HTML” programming.
No, it's not and XHTML is not code. It's Markup.
 
06-17-2021, 05:27 AM   #22
business_kid
LQ Guru

Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,292

Rep: Reputation: 2322
Quote:
Originally Posted by Michael Uplawski
Folks.
This whole thread is not about programming.
Ok. Back to web scraping.

I decided to try my hand at scraping one afternoon. One WhatsApp reply later, I had downloaded Scrapy, I had BeautifulSoup, and I had found and grabbed a basic spider script that uses Scrapy and bs4. It used rottentomatoes.com as its example and, sure enough, it extracted all the HTML links from rottentomatoes.com. Very good.
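
For anyone following along, the kind of link extraction described above can be sketched with nothing but the Python standard library. This is not the spider script from the post (that one used Scrapy and bs4 and is not shown in the thread); it swaps in `html.parser` and `urllib.parse`, and the names `LinkCollector`, `classify_links` and `format_report` are made up for illustration. It collects `href` and `src` attributes from a page's HTML and labels each link by whether it stays on the same host, on the assumption that this is roughly how a basic spider tells a site's own links from external ones.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect href/src attribute values from every start tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def classify_links(page_url, html):
    """Return (label, absolute_url) pairs for each link found in the HTML text."""
    collector = LinkCollector()
    collector.feed(html)
    page_host = urlparse(page_url).netloc
    results = []
    for link in collector.links:
        # Resolve relative links against the page they were found on.
        absolute = urljoin(page_url, link)
        same_host = urlparse(absolute).netloc == page_host
        results.append(("URL" if same_host else "EXTERNAL-URL", absolute))
    return results


def format_report(page_url, html):
    """Format one report line per link: [LABEL] page => link."""
    return [
        f"[{label}] {page_url} => {url}"
        for label, url in classify_links(page_url, html)
    ]
```

Feed it the HTML of a page, fetched however you like, and it returns one line per link. It only sees what is in the static HTML; anything a page builds later with JavaScript will not show up.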

Then I turned it on one of the TV stations within my geo-IP compass, RTE Player, and got this:
Code:
bash-5.1$ ./spider.py http://www.rte.ie/player
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/assets/icons/player/favicon-triangle.png
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/runtime-es5.f9424a36757fb88cc6fb.js
[EXTERNAL-URL] https://www.rte.ie/player/ => https://imasdk.googleapis.com/js/sdkloader/ima3.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/runtime-es2015.f9424a36757fb88cc6fb.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/main-es5.06bcd2d45f7b5b2533d8.js
[EXTERNAL-URL] https://www.rte.ie/player/ => https://static.cloudflareinsights.com/beacon.min.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/styles.d8d3aec393223ccbcc97.css
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/main-es2015.06bcd2d45f7b5b2533d8.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/polyfills-es5.8d6a97a6cfc8a61ca06c.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/scripts-index-page.c6a09ebb4332bce45d60.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/polyfills-es2015.b3944856ea1b33a3dba9.js
[EXTERNAL-URL] https://www.rte.ie/player/ =>  https://www.googletagservices.com/tag/js/gpt.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/scripts.9ede1f08bc2c3b291e6a.js
[URL] https://www.rte.ie/player/ => https://www.rte.ie/player/manifest.json
bash-5.1$
Now I can watch stuff there, so I reckon I can scrape it. If they get a feature film, it doesn't go up on the player. They sometimes get premieres because we're such a small spot: they pay little, since cost is based on audience size. The .js scripts fill the (HDMI) screen with type in a tiny font, and the links seem to go nowhere. So how does one scrape that?

Last edited by business_kid; 06-17-2021 at 05:29 AM.
 
06-17-2021, 08:39 AM   #23
business_kid
LQ Guru

Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,292

Rep: Reputation: 2322
Been reading a bit more …

It seems there's scraping and scraping - nothing is as simple as it first seems. And it seems like a cat-and-mouse game between python devs and web/js/css/json/etc. devs. Interesting thread to follow though.
 
06-25-2021, 03:10 AM   #24
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334
Well, somebody else has part of my Tutorial on their site now but he didn't send a bot to scrape it. He copied the parts he wanted.

I should be honored that someone who has attended Islamic Azad University (QIAU-Qazvin Branch) Graphics, has a Bachelor of Science (BSc) in Computer Hardware Engineering, and is a SAN Storage Engineer (Dell EMC / HPE / HDS / NetApp) with extensive Cisco MDS / Brocade experience, a Consultant & Infrastructure Engineer | SAN Solution Architect | DevOps / SysOps / Linux System Engineer, would steal a Beginners Tutorial written by a 10th Grade High School Dropout.

He looks very respectable, unlike the other fellow. He has not, however, chosen to respond to my inquiries to take my material down from his site either.

The fact he lives in Iran gives me an option I hope he never pushes me to cross.


I could just send out my new bot army, steal everything they own and put it up on a site somewhere in Russia as each other's work.
Attached Thumbnails: my_post_2017.jpg, copied_by_you.jpg
 
06-25-2021, 07:03 AM   #25
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,622

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
Quote:
Originally Posted by Trihexagonal
Well, somebody else has part of my Tutorial on their site now but he didn't send a bot to scrape it. He copied the parts he wanted.
Sorry, but I do not get it.

AFAIS, you are not making money as a teacher or else selling your documents. What harm do people do, when they re-publish your work?

I have begun writing IT-tutorials, before I was working in IT, before I knew code and only shortly after I had understood something. In fact, knowing that my struggles with some technology were partly induced by bad documentation, it gave me the motivation to try it a different way and – maybe – find people who prefer my explanations to those that were already available.

Today I still find my documents and translations from English to German and I do not give a damn about who publishes any one of these docs. Some are too old to be taken seriously today, but much is still valid: Anonymous Remailers were functional before we talked about Onion-Routing, and still – that is how they worked.

What is so bad about having your texts published by others?
 
06-26-2021, 01:06 AM   #26
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334
Quote:
Originally Posted by Michael Uplawski
Sorry, but I do not get it.

AFAIS, you are not making money as a teacher or else selling your documents. What harm do people do, when they re-publish your work?
No, it's not even under my real name. Nobody I associate with connects me with Trihexagonal, ruebot, izezi or even starts to understand anything I do online under those names.

I don't know anybody who uses a computer for pleasure. My Sister works at a computer all day but freely admits she couldn't follow my tutorial. I haven't made a dime off anything I've done and am in the hole in hosting cost alone.

Not to mention the time I put into writing, graphics and promoting it like a carny on crank.

Quote:
Originally Posted by Michael Uplawski
I have begun writing IT-tutorials, before I was working in IT, before I knew code and only shortly after I had understood something. In fact, knowing that my struggles with some technology were partly induced by bad documentation, it gave me the motivation to try it a different way and – maybe – find people who prefer my explanations to those that were already available.
Links to all your original material, please.

Quote:
Originally Posted by Michael Uplawski
Today I still find my documents and translations from English to German and I do not give a damn about who publishes any one of these docs. Some are too old to be taken seriously today, but much is still valid: Anonymous Remailers were functional before we talked about Onion-Routing, and still – that is how they worked.
I work to keep mine updated and am constantly trying to come up with ways to improve it:

Feb 4, 2021 - Updated to reflect FreeBSD 12.2-RELEASE.
Feb 9, 2021 - Updated to show what needs done for a machine with an older nvidia chip before you can boot to the desktop.
Feb 23, 2021 - Updated to include 3 Fluxbox styles of different colors as .txt attachments and instructions how to put them to use.
Feb 24, 2021 - Updated to reflect steps needed done for a box with Switchable Graphics during the installation of x11/xorg...
Mar 11, 2021 - Wrote a different tutorial on Spoofing your Ether MAC on FreeBSD without dropping or bringing back up the Interface.
Mar 12, 2021 - Updated to include installation of sysutils/devcpu-data early during the build process for download of firmware microcode updates
Mar 16, 2021 - Updated my pf firewall ruleset with rules for a user who wanted to use it with CUPS.
May 8, 2021 - Added my pf CUPS ruleset that allows the machine running cupsd to be accessed by other machines on the LAN.

Did I mention I was offline for over a year and just got back online in January 2021? Or that I just did a complete graphic design and layout change for both my sites?

I updated Demonica's with a new theme as a marketing strategy to combat a disinformation campaign by the competition that got my site tagged with a short https://t.co/ Twitter URL for a possibly dangerous site?

I just put up some new wallpapers on Demonica's site today. You'll like them:

https://demonica.trihexagonal.org/

Or not.

That I now have to deal with this crap, in Iran and Bulgaria, on top of everything else is my whole point.

Quote:
Originally Posted by Michael Uplawski
What is so bad about having your texts published by others?
Give me links to everything you've ever written, I'll sign my name to it as Author and go around promoting it as my own original material.

Right here, in this forum, in front of you, and let's see if you have an epiphany or are quiet as a mouse.

Or maybe even give me a cookie in the form of reps gained, p'haps?
 
06-26-2021, 04:25 AM   #27
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,622

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
Quote:
Originally Posted by Trihexagonal
Links to all your original material, please.
Impossible. I was about to look up some of these and stopped right there. What you ask is not really clear. But I give you the link to a collection of docs, held on a server in Germany for ages, by a person I once held in high esteem and probably should still, if I knew what he is currently doing:

http://altlasten.lutz.donnerhacke.de.../anon/privacy/
Only the text-links are functional (labelled “txt”).
Here are some search results that I have not further scrutinized:
https://searx.tuxcloud.net/search?q=...language=en-US
And – of course I forgot – my blog here on LQ and my web-site:
www.uplawski.eu (do whatever you want with that – only the laws apply and I do not have any means to go beyond that).
I cannot give you links to the docs I produced at work: software-architecture, a course on design-patterns in C++ (I should read that one, myself), user-manual for the graphical user-interface of a C++/Java code-generator (for the German constructor of military submarines – German submarines, that is).

Quote:
Give me links to everything you've ever written, I'll sign my name to it as Author and go around promoting it as my own original material.
You do not need my assistance to do that. And you are not responding to my message. That is not important, I only clarify.
This is not my thread, I know – I am only the OP and others have responded. Everything else would be luxury.

As far as I am concerned, this discussion may now slip into oblivion – if that is English.

Last edited by Michael Uplawski; 06-26-2021 at 04:33 AM.
 
06-26-2021, 06:19 AM   #28
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334
Quote:
Originally Posted by Michael Uplawski
As far as I am concerned, this discussion may now slip into oblivion – if that is English.
Don't get uppity, Uplawski.

Quote:
Originally Posted by Michael Uplawski
I may have expressed myself poorly.
Because you must not understand the English language very well if you can't understand why I don't want someone else to claim to have authored my original material.

Here, let me give you an English lesson before you slip into oblivion:

Quote:
plagiarism
[ pley-juh-riz-uhm, -jee-uh-riz- ]

noun
1. an act or instance of using or closely imitating the language and thoughts of another author without authorization and the representation of that author's work as one's own, as by not crediting the original author: It is said that he plagiarized Thoreau's plagiarism of a line written by Montaigne.

2. a piece of writing or other work reflecting such unauthorized use or imitation: “These two manuscripts are clearly plagiarisms,” the editor said, tossing them angrily on the floor.

https://www.dictionary.com/browse/plagiarism
 
06-26-2021, 07:01 AM   #29
Michael Uplawski
Senior Member
 
Registered: Dec 2015
Posts: 1,622

Original Poster
Blog Entries: 40

Rep: Reputation: Disabled
Quote:
Originally Posted by Trihexagonal
Don't get uppity, Uplawski.
Wow.
 
06-26-2021, 11:06 AM   #30
Trihexagonal
Member

Registered: Jul 2017
Posts: 362
Blog Entries: 1

Rep: Reputation: 334
Was my use of English Alliterative skills that stunning to you? Because yours are sorely lacking for you to be in a position to critique mine.

Quote:
Originally Posted by Michael Uplawski
Sorry, but I do not get it.

AFAIS, you are not making money as a teacher or else selling your documents. What harm do people do, when they re-publish your work?
Just because your English comprehension is such that you couldn't understand the point of my going into detail about the work I put into my tutorial after you summarized the quality and worth you put into yours, don't blame it on my lack of Mastery of the Language:

Quote:
Originally Posted by Michael Uplawski
Today I still find my documents and translations from English to German and I do not give a damn about who publishes any one of these docs. Some are too old to be taken seriously today, but much is still valid: Anonymous Remailers were functional before we talked about Onion-Routing, and still – that is how they worked.
Or mistake the work I laid out was put into mine after getting back online after being off over a year to bring it back up to date and make it better than before to your sloth in keeping yours up to date for your concern for what you wrote.


Quote:
Originally Posted by Michael Uplawski
Impossible. I was about to look up some of these and stopped right there. What you ask is not really clear
I believe I made myself crystal clear. Once you gave me your treasured govt links, all of them, I was going to plagiarize your work and parade it in front of you here since you couldn't seem to grasp the concept of why I would care if someone re-published my work.

How about if I re-publish Arthur C. Clarke's "Rendezvous With Rama" and claim I wrote it? Think that would fly? By three times?


Quote:
Originally Posted by Michael Uplawski
But I give you the link to a collection of docs...
File it under T.

Quote:
Originally Posted by Michael Uplawski
You do not need my assistance to do that.
If I plagiarized it, it wouldn't make any difference if I had your permission or not. You obviously did not know the meaning of the word.

Quote:
Originally Posted by Michael Uplawski
And you are not responding to my message. That is not important, I only clarify.
Stir up the mud and it doesn't clarify it. You didn't comprehend the context of anything I said. I couldn't really believe you were even asking a serious question, to be perfectly honest.


Quote:
Originally Posted by Michael Uplawski
As far as I am concerned, this discussion may now slip into oblivion – if that is English.
If that is English... Well, you've got the talking part done. Now back it up.

Show me your English Elegance and Worth as a Writer before you think to elevate yourself to the level of critiquing mine.

https://able2know.org/topic/554096-1

I'm izezi and I wrote everything on that page, because nobody can beat me and they don't even try.

I say as a writer you're washed up and worthless. Prove me wrong.
 
  



Tags
civility, custom, html, legality, webdriver


