Things:
- Loading an ISO-8859-1 encoded dump into a UTF-8 database breaks: fields are truncated at the first invalid character. Not terribly surprising, but MySQL is silent about the breakage.
- Content which looks like, and is declared as, ISO-8859-1 might actually get rendered by browsers as CP1252 (aka MS-ANSI, Windows-1252).
- The difference between the two is that 8859-1 assigns no printable characters to 0x7f to 0x9f, whereas Windows-1252 uses 0x80 to 0x9f for long hyphens, ellipses etc.
iconv -f cp1252 -t utf-8 is your friend.
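
To make the difference concrete, here is a minimal Python sketch of the same conversion. The sample bytes and the helper name `cp1252_to_utf8` are made up for illustration and are not from any real dump. It decodes the same bytes both ways, then re-encodes the CP1252 reading as UTF-8, which is roughly what the iconv call does.

```python
# Hypothetical helper for illustration: treat the input as Windows-1252
# and re-encode it as UTF-8.
def cp1252_to_utf8(raw: bytes) -> bytes:
    return raw.decode("cp1252").encode("utf-8")


# Made-up sample: 0xe9 is e-acute in both encodings, while 0x96 and 0x85
# are an en dash and an ellipsis in CP1252 only.
sample = b"caf\xe9 \x96 done\x85"

# Read as ISO-8859-1, the 0x80-0x9f bytes become invisible control codes.
as_latin1 = sample.decode("latin-1")

# Read as CP1252, the same bytes become the punctuation the author meant.
as_cp1252 = sample.decode("cp1252")

print(repr(as_latin1))
print(repr(as_cp1252))
print(cp1252_to_utf8(sample))
```

Note that Python's cp1252 codec rejects the handful of bytes Windows-1252 leaves unassigned (0x81, for example), so a real dump may need `errors="replace"` or similar.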