I've got a few MySQL databases of Chinese-English dictionaries.
For one of them, I can't get "load data infile" to work properly.
The file is here:
http://www.worldtradetown.com/15000-3.csv
It contains 15,000 traditional and simplified Chinese characters and their pinyin (romanized pronunciation).
This command:
mysql> load data infile '/mnt/data/study/chinese/15000-3.csv' into table pinyin fields terminated by '`';
produces a table whose contents look like this:
mysql> select * from pinyin limit 10;
+---------+---------------+
| chinese | pinyin        |
+---------+---------------+
| ?       | de5;;di2;;di4 |
| ?       | shi4          |
| ?       | bu4;;bu2      |
| ?       | wo3           |
while other tables in the database produce correct results like this:
mysql> select * from chinese_frequency limit 10;
+--------+---------+--------+------+
| no     | chinese | pinyin | rank |
+--------+---------+--------+------+
| 214048 | 的      | di4    |    1 |
| 214048 | 的      | de5    |    2 |
|  70872 | 了      | liao3  |    3 |
|  70872 | 了      | le5    |    4 |
|  61364 | 我      | wo3    |    5 |
The database is UTF-8, and if you open the file in a web browser (Firefox) and set the encoding to UTF-8, both traditional and simplified characters show up. The same is true if you open it with OpenOffice and select UTF-8 as the character encoding.
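To back up the claim that the database side is UTF-8, a couple of diagnostic queries may help. This is only a sketch: it assumes the table and column names shown above (pinyin, chinese) and makes no claim about what the output will be on this server.

```sql
-- Character sets the server, the current database, and this connection are using.
SHOW VARIABLES LIKE 'character_set%';

-- Character set declared on the suspect table and its columns.
SHOW CREATE TABLE pinyin;

-- Raw bytes actually stored in the table.
-- 3F is a literal '?', which would mean the character was lost during the load.
-- A three-byte value (for example, E79A84 is 的 in UTF-8) would mean the data
-- is intact and only the display is wrong.
SELECT chinese, HEX(chinese) FROM pinyin LIMIT 4;
```

If HEX() shows 3F, the conversion went wrong while loading; if it shows valid UTF-8 bytes, the connection or terminal encoding is the more likely suspect.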
So why can't I load this file into MySQL with "load data infile"?
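One variant that may be worth trying is naming the file's encoding explicitly in the statement. As far as I understand, when no character set is given, the server interprets the file according to its character_set_database setting, which is not necessarily UTF-8. The sketch below assumes the file really is UTF-8 and that the server version supports the CHARACTER SET clause of LOAD DATA; the path, table name, and delimiter are the ones from the command above.

```sql
-- Assumption: the table has been emptied first (e.g. with TRUNCATE TABLE pinyin),
-- otherwise the rows are appended to the existing '?' rows.
LOAD DATA INFILE '/mnt/data/study/chinese/15000-3.csv'
INTO TABLE pinyin
CHARACTER SET utf8          -- tell the server how to decode the file
FIELDS TERMINATED BY '`';
```

Whether this changes the result depends on the server configuration, so it is a thing to test, not a confirmed fix.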