I need to call out a myth about UTF-8. Tools built to assume UTF-8 are not backw...

int_19h · 2025-09-13T21:30:13 1757799013

The usual statement isn't that UTF-8 is backwards compatible with ASCII (it's obvious that any 8-bit encoding wouldn't be; that's why we have UTF-7!). It's that UTF-8 is backwards compatible with tools that are 8-bit clean.

wrp · 2025-09-14T00:49:10 1757810950

Yes, the myth I was pointing out is based on loose terminology. It needs to be made clear that "backwards compatible" means that UTF-8 based tools can receive but are not constrained to emit valid ASCII. I see a lot of comments implying that UTF-8 can interact with an ASCII ecosystem without causing problems. Even worse, it seems most Linux developers believe there is no longer a need to provide a default ASCII setting if they have UTF-8.

account42 · 2025-09-15T15:45:41 1757951141

Do you have an actual example where this causes an issue? "ASCII" tools mostly just passed along non-ASCII bytes unchanged even before UTF-8.

kccqzy · 2025-09-13T15:06:11 1757775971

That's not a myth about UTF-8. That's a decision by tools not to support pure ASCII.