LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Newbie (https://www.linuxquestions.org/questions/linux-newbie-8/)
-   -   Linux Server Character Encoding - what do I need to install?!? (https://www.linuxquestions.org/questions/linux-newbie-8/linux-server-character-encoding-what-do-i-need-to-install-795540/)

impelmedia 03-15-2010 11:02 AM

Linux Server Character Encoding - what do I need to install?!?
 
Hello!

I hope somebody can help...we have a dedicated linux server for our web hosting services which we purchased a few months ago...however the support is limited and every time we ask for assistance we are told to find the answer ourselves and pay the techies to install our solution! Anyway... we seem to have issues with character encoding on our websites - any text that isn't fully ASCII coded is outputted as funny symbols - for example:

Acting as a “4 PL” – we will manage your operation, achieving agreed service and cost.

Should read as:

Acting as a 4 PL we will manage your operation, achieving agreed service and cost.

This is text that is updated via a CMS, and our clients do not code any HTML at all so obviously this is a problem! Does anyone know which library I need to install on our linux box so that these characters are processed properly in the first place?

Thank you in advance!!!

smoker 03-15-2010 11:10 AM

The issue is with the CMS in my opinion.

impelmedia 03-15-2010 11:16 AM

We have the same problem on other sites - sites that had worked on our old windows server suddenly started outputting the funny characters until we changed the coding on the pages....

jamescondron 03-15-2010 11:39 AM

Well, do that again then

i92guboj 03-15-2010 11:52 AM

The problem can be in one of many levels, and not necessarily in Linux.

Your CMS (or whatever) needs to produce sane pages that comply with one of the W3C standards. So, the first question I ask is: what HTML version are you using? (as in HTML 4.01, XHTML 1.0 Strict/Transitional, whatever else). This is critical.

Second question, have you validated the resulting web pages against the W3C validator? In the case it applies, do your web pages have a valid header with a correct doctype and a correct chatset setup. The way to do this will of course depend on the doctype you choose. Each one has its own rules. If everything is correct, then you shouldn't need to change anything. The fact that your web server is running in one or another OS shouldn't be a thing to worry about these days if your setup is correct. But it will be most times because of incorrect setups that doesn't adhere to any standard and rather rely on peculiarities of one or another concrete web browser.

ps. If the problem does only appears in the dynamic contents and not the static text it could also be an issue with your databases.

impelmedia 03-16-2010 03:54 AM

The site has the following document type and character set:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

The W3 validator confirmed this was OK...

Also - the problem occurs in all elements on a page - the static website content and that dynamically generated...I am just most concerned about the info pulled from the database because it is that which our clients update who will not code...

Also - this site has been working as is (no changes to the source code) on a different server, so I can only deduce that the problem is on our server...

Any ideas?

Thanks for all your help so far!


All times are GMT -5. The time now is 05:59 AM.