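This is a genuine flaw in UTF-8 in the long run. The designers should have biased the values of multibyte sequences, so that each sequence length starts counting where the shorter one leaves off. That would have made redundant (overlong) encodings impossible by construction.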
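To make the idea concrete, here is a minimal sketch of what such a biased scheme could look like. It is a hypothetical variant, not any real standard: the byte layout is the same as UTF-8, but the payload of an n-byte sequence is offset by the number of values all shorter lengths can already represent. The names `BIAS`, `raw_encode`, `raw_decode`, `biased_encode`, and `biased_decode` are made up for illustration, and the sketch ignores surrogates and the U+10FFFF ceiling.

```python
# Hypothetical "biased UTF-8" sketch; NOT a real encoding standard.
# Same bit layout as UTF-8, but each length's payload is offset (biased)
# so that no code point has more than one representation.

PAYLOAD_BITS = {1: 7, 2: 11, 3: 16, 4: 21}    # payload bits per sequence length
LEAD = {2: 0xC0, 3: 0xE0, 4: 0xF0}            # lead-byte prefixes, as in UTF-8

# BIAS[n] = how many values all shorter sequence lengths can represent.
BIAS = {}
_total = 0
for _n in (1, 2, 3, 4):
    BIAS[_n] = _total
    _total += 1 << PAYLOAD_BITS[_n]


def raw_encode(payload, length):
    """Pack a payload into a UTF-8-shaped sequence of the given length."""
    if length == 1:
        return bytes([payload])
    out = []
    for _ in range(length - 1):
        out.append(0x80 | (payload & 0x3F))   # continuation byte 10xxxxxx
        payload >>= 6
    out.append(LEAD[length] | payload)        # lead byte carries the top bits
    return bytes(reversed(out))


def raw_decode(data):
    """Return (payload, length) for one complete UTF-8-shaped sequence."""
    first = data[0]
    if first < 0x80:
        length, payload = 1, first
    elif first >> 5 == 0b110:
        length, payload = 2, first & 0x1F
    elif first >> 4 == 0b1110:
        length, payload = 3, first & 0x0F
    elif first >> 3 == 0b11110:
        length, payload = 4, first & 0x07
    else:
        raise ValueError("invalid lead byte")
    if len(data) != length:
        raise ValueError("wrong sequence length")
    for b in data[1:]:
        if b >> 6 != 0b10:
            raise ValueError("invalid continuation byte")
        payload = (payload << 6) | (b & 0x3F)
    return payload, length


def biased_encode(cp):
    """Encode a code point using the shortest (and only) biased form."""
    for n in (1, 2, 3, 4):
        if cp < BIAS[n] + (1 << PAYLOAD_BITS[n]):
            return raw_encode(cp - BIAS[n], n)
    raise ValueError("code point out of range for this sketch")


def biased_decode(data):
    """Decode one biased sequence back to its code point."""
    payload, length = raw_decode(data)
    return payload + BIAS[length]


# In real UTF-8, C0 80 is an overlong form of U+0000 and must be rejected.
# In the biased sketch the same bytes are the unique encoding of U+0080.
print(hex(biased_decode(b"\xc0\x80")))
```

Under this scheme every well-formed byte sequence maps to a distinct code point, so a decoder needs no separate overlong check. Real UTF-8 instead keeps the unbiased layout and requires decoders to reject overlong forms as invalid.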