Two different domains, on two different hosts, blocked my attempts to connect to harryshearer.com (Harry Shearer is Spinal Tap's bassist and the voice of many characters on The Simpsons), returning:
Quote:
'Generally a 406 error is caused because a request has been blocked by Mod Security. If you believe that your request has been blocked by mistake please contact the web site owner.'
I figured it out: harryshearer.com rejects connections from lynx. At home I change the User-Agent header to get around that. If I do the same on those two hosts, I can get through to harryshearer.com from there too.
I should have suspected. I'm a long-time member of the lynx mailing list, and this is a common problem there. I changed my User-Agent header at home long ago. I don't know why I didn't think of it first thing elsewhere.
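For anyone who wants to see the idea outside lynx, here is a rough Python sketch of sending a request with a swapped User-Agent header. The browser-like string and the helper names `build_request` and `fetch_status` are made up for illustration, and I can't promise any particular site will accept it; it only shows where the header goes.

```python
import urllib.error
import urllib.request

# Hypothetical browser-like string (an assumption, not what any site requires).
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64; rv:109.0) Gecko/20100101 Firefox/115.0"


def build_request(url, user_agent=BROWSER_UA):
    """Build a Request carrying a custom User-Agent header; nothing is sent yet."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_status(url, user_agent=BROWSER_UA, timeout=10):
    """Send the request and return the HTTP status code (e.g. 200 or 406)."""
    try:
        with urllib.request.urlopen(build_request(url, user_agent), timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urlopen raises on 4xx/5xx responses; report the code instead of failing.
        return err.code
```

Calling `fetch_status("https://harryshearer.com/")` once with the default string and once with a lynx-style string would show whether the server treats the two differently.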
Why would they ban lynx and not other text browsers?
Lynxsters tell me lynx is the most popular browser for 'scraping' websites. It may be smaller and/or faster; it may merely be older. washingtonpost.com blocks lynx. I notice that many websites don't count lynx accesses: I can read nytimes.com all I like without a subscription. newyorker.com does count them, but it's the exception.