LinuxQuestions.org
Slackware This Forum is for the discussion of Slackware Linux.

03-09-2008, 04:00 AM   #1
Yalla-One
Member
 
Registered: Oct 2004
Location: Norway
Distribution: Slackware, CentOS
Posts: 641

Rep: Reputation: 36
UTF-8 in Slackware - what's missing?


Hello,

With kernel 2.6.24 in the latest -current, UTF-8 seems to be one step closer to Slackware.

Does anyone know which items are still outstanding, or which problems to watch out for, when trying to make Slackware 100% UTF-8?

I know the man pages used to be a problem, but I have found no recent information on that subject. Does anyone have input on the progress of UTF-8 man pages?

Are there other items one should be aware of before starting the conversion?

(As Slackware keeps gaining momentum among non-American Linux users, this gets more relevant every day.)

-y1
 
03-09-2008, 09:23 AM   #2
Eternal_Newbie
Member
 
Registered: Jun 2005
Location: The Pudding Isles
Distribution: Slackware
Posts: 573

Rep: Reputation: 59
Well, there was/is a problem with the hplip GUI tools because they expect UTF8 input (see this thread). rworkman is aware of it and of the workaround, though. That is the only i18n issue I am aware of.

EDIT: The hp-tools workaround is now mentioned in the slackware-current CHANGES_AND_HINTS.txt

Last edited by Eternal_Newbie; 03-09-2008 at 10:32 AM. Reason: spelling, clarification, update
 
03-11-2008, 02:49 AM   #3
ppr:kut
Slackware Contributor
 
Registered: Aug 2006
Location: Netherlands
Distribution: Slackware
Posts: 631

Rep: Reputation: 463
Midnight Commander needs a patch to display utf8 characters, and it also needs a utf8-patched slang2.
I also had some problems with kmail authenticating to an iso-8859-15 server with utf8 enabled locally,
but I don't know whether that problem was caused by kmail or by the server.
 
03-13-2008, 09:36 AM   #4
Su-Shee
Member
 
Registered: Sep 2007
Location: Berlin
Distribution: Slackware
Posts: 510

Rep: Reputation: 53
I've been using UTF-8 on Slackware for over 2 years now - no problems at all.

The problem with manpages is the support for UTF-8 in groff (AFAIK) and the last time I checked, it couldn't display Thai script, for example.

Most applications I use that deal with IDN (which is not UTF-8, but since we're talking i18n here...) also work perfectly fine (mutt, Thunderbird, Firefox, qmail, to name a few working apps).

Input methods with scim do fine, printing works, Perl's got some nice features and support for Unicode stuff..

I'm missing nothing, actually.

(And as a non-American Slackware user: I've been using Slackware since '94.)

Last edited by Su-Shee; 03-13-2008 at 09:37 AM.
 
03-14-2008, 02:37 AM   #5
ppr:kut
Slackware Contributor
 
Registered: Aug 2006
Location: Netherlands
Distribution: Slackware
Posts: 631

Rep: Reputation: 463
I've switched to UTF8 myself now and take back my comment on Midnight Commander. I don't know what all these complaints about missing UTF8 support in mc are about; it works well enough to work with, at least in an X terminal. mc seems to have some issues with column-width calculation when displaying filenames with Unicode characters, but that's more a design issue than a functionality issue.
I've been on a UTF8 system for 2 days now. So far, no problems at all.
 
03-14-2008, 04:27 PM   #6
Yalla-One
Member
 
Registered: Oct 2004
Location: Norway
Distribution: Slackware, CentOS
Posts: 641

Original Poster
Rep: Reputation: 36
Thanks for the insight guys!

Did you make the change only in KDE, or did you also edit /etc/profile.d/lang.sh?

I've never used Midnight Commander, so as long as 'man' works, I'm all ready to go.

-y1
 
03-15-2008, 02:33 AM   #7
ppr:kut
Slackware Contributor
 
Registered: Aug 2006
Location: Netherlands
Distribution: Slackware
Posts: 631

Rep: Reputation: 463
I did edit the /etc/profile.d/lang.sh.
I wanted to switch a while ago, but had to switch back because too many files had ISO-encoded content. Some files also had ISO-encoded characters in their filenames.
I had to correct these things for the final switch, which happened just days ago.

Good luck on your journey to UTF8!
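
For readers following along, the lang.sh change discussed here amounts to something like the sketch below. The locale name en_US.UTF-8 is only an assumed example; use whichever UTF-8 locale `locale -a` lists on your system. The LC_COLLATE line is an optional extra that keeps plain byte-order sorting.

```shell
# Sketch of the relevant lines in /etc/profile.d/lang.sh.
# en_US.UTF-8 is an assumed example; use any UTF-8 locale from `locale -a`.
export LANG=en_US.UTF-8

# Optional: keep traditional byte-order sorting (affects ls, sort, glob ranges).
export LC_COLLATE=C
```

After logging in again, `locale charmap` should report UTF-8, provided the chosen locale is actually installed.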
 
03-15-2008, 03:57 AM   #8
Yalla-One
Member
 
Registered: Oct 2004
Location: Norway
Distribution: Slackware, CentOS
Posts: 641

Original Poster
Rep: Reputation: 36
Thanks again,

You raise an interesting point - what to do with all the files that have troublesome filenames... I've noticed a plethora of utf8-conversion scripts out there - do you have any one in particular to recommend?
(While I do have backups, I'd rather not find out in retrospect that the particular script I chose garbled all filenames beyond recognition.)

-y1
 
03-15-2008, 10:14 AM   #9
ppr:kut
Slackware Contributor
 
Registered: Aug 2006
Location: Netherlands
Distribution: Slackware
Posts: 631

Rep: Reputation: 463
Nope. I've written one myself, but I wouldn't recommend that anyone use it...
And I still have not converted everything. OOo files, notes etc. still have some weird chars.
I basically relied on iconv and uconv.
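
As a rough illustration of the iconv part (not the script mentioned above), converting one file's content could look like the sketch below. The source charset ISO-8859-15 and the file names are assumptions for the example; check what your files really contain before converting anything, and write to a new file rather than over the original.

```shell
# Create a small sample file in ISO-8859-15 (\351 is the single byte for "é").
printf 'caf\351\n' > sample-latin.txt

# Convert the content to UTF-8, writing the result to a new file.
iconv -f ISO-8859-15 -t UTF-8 sample-latin.txt > sample-utf8.txt

# Sanity check: iconv exits non-zero if the input is not valid UTF-8.
iconv -f UTF-8 -t UTF-8 sample-utf8.txt > /dev/null && echo "sample-utf8.txt is valid UTF-8"
```

Running the same conversion twice on one file would double-encode it, which is one way "weird chars" end up in notes and documents.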
 
03-15-2008, 12:55 PM   #10
Yalla-One
Member
 
Registered: Oct 2004
Location: Norway
Distribution: Slackware, CentOS
Posts: 641

Original Poster
Rep: Reputation: 36
I just used convmv and it did the trick perfectly, converting all my filenames to UTF-8. It has a "dryrun" mode and a live-ammo mode, and was even recommended by linux.com.

http://www.j3e.de/linux/convmv/
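
A minimal sketch of the two modes, assuming the old filenames are in ISO-8859-15 and using a throwaway directory (convmv-demo is a made-up name) instead of real data:

```shell
# Scratch directory with one file whose name is encoded in ISO-8859-15
# (\351 is the single byte for "é" in that charset).
mkdir -p convmv-demo
touch "convmv-demo/$(printf 'caf\351').txt"

if command -v convmv >/dev/null 2>&1; then
    # Default mode is the dry run: convmv only reports what it would rename.
    convmv -f iso-8859-15 -t utf8 -r convmv-demo
    # Adding --notest performs the renames for real.
    convmv -f iso-8859-15 -t utf8 -r --notest convmv-demo
else
    echo "convmv is not installed; nothing was renamed" >&2
fi
```

Note that convmv only touches filenames; file contents still need iconv or a similar tool.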

-y1
 
04-11-2008, 01:46 AM   #11
iiv
Member
 
Registered: Jun 2007
Location: Russia, Moscow Region
Distribution: Slackware
Posts: 167

Rep: Reputation: 30
Basic utils still seem to lack UTF-8 support. For example
Code:
find dir -ls
will give just escape sequences for UTF-8 symbols.

Also,
Code:
watch ls dir
will just ignore UTF-8 characters, as if there were not any.

We are far away from good UTF support; it seems that many widespread GNU utils need a rewrite. If only I were a good programmer..
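
To check whether a given system still shows the find behaviour, a small test along these lines may help. The directory name utf8-demo is made up for the example, and the output will depend on the findutils version and the active locale, so none is shown here.

```shell
# Scratch directory with one file whose name contains UTF-8 "é" (bytes C3 A9).
mkdir -p utf8-demo
touch "utf8-demo/$(printf 'caf\303\251').txt"

# Show which locale settings the tools will see.
echo "LANG=${LANG:-unset} LC_ALL=${LC_ALL:-unset}"

# Compare the long listing with the plain one.
find utf8-demo -ls
find utf8-demo -print
```

If the name is escaped only in the -ls output, the problem lies in how that listing quotes non-ASCII bytes rather than in the filesystem or the terminal.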
 
04-11-2008, 05:35 AM   #12
pdw_hu
Member
 
Registered: Nov 2005
Location: Budapest, Hungary
Distribution: Slackware, Gentoo
Posts: 346

Rep: Reputation: Disabled
Apart from the already mentioned mc vs utf8 issue, I can't say anything to keep you from switching either. Configure your apps properly and you'll be happy ever after.
 
04-11-2008, 02:51 PM   #13
iiv
Member
 
Registered: Jun 2007
Location: Russia, Moscow Region
Distribution: Slackware
Posts: 167

Rep: Reputation: 30
Quote:
Originally Posted by pdw_hu
Apart from the already mentioned mc vs utf8 issue, I can't say anything to keep you from switching either. Configure your apps properly and you'll be happy ever after.
Unfortunately, that is only partly true. Sorry. If you use US or British English, or you don't use the console at all, then it will be so. But there are still many issues around: for example, those I mentioned above, plus some of the packages that were updated in -current. wget and oggenc, for instance, have the same problem: they spawn new lines while refreshing their output.
 
12-14-2008, 04:29 PM   #14
Yalla-One
Member
 
Registered: Oct 2004
Location: Norway
Distribution: Slackware, CentOS
Posts: 641

Original Poster
Rep: Reputation: 36
Quote:
Originally Posted by iiv
Basic utils still seem to lack UTF-8 support. For example
Code:
find dir -ls
will give just escape sequences for UTF-8 symbols.
You're right. Having searched Savannah for info on findutils, though, I cannot find any open bugs about find not being utf8-compliant. Any ideas or pointers on where to look for the root cause of this?

I mean - for groff/man, it's easy: lots of useful patches sitting in CVS, but no release for ages (groff: 4 years, man: 1 year) - but for findutils everything appears to be OK as per the bug database and changelogs...

Then again, I'm a novice at tracking these things down - hence the question :-)

-y1
(slowly returning online...)
 
03-08-2010, 04:29 PM   #15
guanx
Senior Member
 
Registered: Dec 2008
Posts: 1,172

Rep: Reputation: 227
Quote:
Originally Posted by Su-Shee
I've been using UTF-8 on Slackware for over 2 years now - no problems at all.

The problem with manpages is the support for UTF-8 in groff (AFAIK) and the last time I checked, it couldn't display Thai script, for example.

Most applications I use that deal with IDN (which is not UTF-8, but since we're talking i18n here...) also work perfectly fine (mutt, Thunderbird, Firefox, qmail, to name a few working apps).

Input methods with scim do fine, printing works, Perl's got some nice features and support for Unicode stuff..

I'm missing nothing, actually.

(And as a non-American Slackware user: I've been using Slackware since '94.)
The groff utf8 support is still broken. I cannot display any Chinese man pages. Please, Slackware developers, either remove those Chinese man pages or add the groff-utf8 package:
http://www.haible.de/bruno/packages-groff-utf8.html
 
  


All times are GMT -5.
