Linux - General
So far, I've grabbed all the GB2312 codes from the SQL file with a grep statement like this:
grep -o "&#[[:digit:]]\{5\};" >> filename
I then put that list into a MySQL database, made an HTML file from it, and ran it through the browser. From that I made another file and imported both files into another MySQL database. The first database has auto_increment primary keys named num and numb.
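The grep step above can be tried on a single sample line before running it against the whole SQL file. This is only a rough sketch: the function name `extract_refs` and the sample INSERT line are made up for illustration, and the pattern is the same one used in the command above.

```shell
# Hypothetical helper (not from the original post): print every
# five-digit numeric character reference found on stdin, one per line.
extract_refs() {
    grep -o '&#[[:digit:]]\{5\};'
}

# Made-up sample line; &#123; has only three digits, so it is skipped.
printf '%s\n' "INSERT INTO t VALUES ('&#20013;&#25991; abc &#123;');" | extract_refs
```

Because the pattern requires exactly five digits, any shorter or longer references in the file would not be collected by this command.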
Now, I just wrote this PHP script:
<?php
// This converts from GB2312 to UTF-8. It goes through a file and replaces every string like &#?????; with its UTF-8 equivalent.
That's a test script. When executed and viewed in UTF-8 encoding, it prints the perl command over and over with two Chinese characters next to each other. The two characters should look the same, which verifies that you have put your database together correctly.
When you want to actually run the script, replace test.txt with the name of your file and uncomment the exec command.