Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That will almost certainly preserve the invisible characters. Most invisible characters are used for some kind of in-line formatting in Unicode, so it's not desirable to remove them.


What inline formatting in notepad.exe? It doesn't even support bolding/italics/underling.

But I guess there are tabs and line return/carriage returns, so there's that.


Right-to-left/left-to-right markers. Language tags. Various invisible spaces. Homoglyphs. (all trivially filterable though)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: