Things:
- Loading an ISO-8859-1 encoded dump into a UTF-8 database breaks: fields are truncated at the first invalid character. Not terribly surprising, but MySQL is silent about the breakage.
- Content which looks like, and is declared as, ISO-8859-1 might actually get rendered by browsers as CP1252 (aka MS-ANSI, Windows-1252).
- The difference between the two is that ISO-8859-1 has no printable characters in the range 0x80 to 0x9f, but Windows-1252 does: long dashes, ellipsis etc.
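To make the difference concrete, here is a small Python sketch (not from the original setup) that decodes the same bytes both ways. The sample string is made up for illustration; `latin-1` and `cp1252` are Python's codec names for the two encodings.

```python
# Bytes 0x85, 0x96 and 0x97 are printable in Windows-1252 but are
# C1 control characters when read as ISO-8859-1 (Python's "latin-1").
raw = b"wait\x85 10\x9620 \x97 done"  # made-up sample bytes

as_latin1 = raw.decode("latin-1")  # never fails: every byte maps to a code point
as_cp1252 = raw.decode("cp1252")   # 0x85 -> ellipsis, 0x96 -> en dash, 0x97 -> em dash

print(repr(as_latin1))  # the three bytes show up as invisible control characters
print(as_cp1252)        # the three bytes show up as punctuation
```

Decoding as `latin-1` never raises an error, which is part of why a wrong declaration tends to go unnoticed.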
iconv -f cp1252 -t utf-8 is your friend.
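Since MySQL won't tell you about the breakage, it may be worth checking a dump before loading it. The following is a rough Python sketch of that check plus the same conversion iconv does. The function names and the sample `INSERT` line are hypothetical, invented for this example.

```python
from typing import Optional


def first_invalid_utf8_offset(data: bytes) -> Optional[int]:
    """Return the byte offset of the first invalid UTF-8 sequence, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as exc:
        return exc.start
    return None


def cp1252_to_utf8(data: bytes) -> bytes:
    """Re-encode CP1252 bytes as UTF-8, roughly what the iconv call does."""
    return data.decode("cp1252").encode("utf-8")


# Made-up dump line: 0xE9 is e-acute and 0x96 is an en dash in CP1252.
dump = b"INSERT INTO t VALUES ('caf\xe9 \x96 bar');"

print(first_invalid_utf8_offset(dump))       # offset of the 0xE9 byte
converted = cp1252_to_utf8(dump)
print(first_invalid_utf8_offset(converted))  # None once converted
```

One caveat: Python's `cp1252` codec raises `UnicodeDecodeError` on the five byte values Windows-1252 leaves unassigned (0x81, 0x8d, 0x8f, 0x90, 0x9d), so a dump containing those needs a closer look rather than a blind conversion.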