Normalize UTF-8 #2
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
I was reading some pages in Español via Cosmarmot with TurboGopher on MacOS 8. It handles ISO8859-1 Latin-1 text, but accented characters were coming across as "Ãx" where x is some ascii character. This shouldn't be, because accented Spanish characters in UTF-8 are in the same position they are in ISO Latin-1. My guess is that these are two UTF-8 codepoint, a combining accent and a letter. I think normalizing the UTF-8 before sending to Gopher will prevent such mojibake.
So, I normalized the UTF-8 in
b50e099cd5, but it doesn't resolve the issue. I maybe TurboGopher is just not great? Going to re-test with a simple gopher browser on a Latin-1 terminal.