[Cryptography] canonicalizing unicode strings.

Nico Williams nico at cryptonector.com
Tue Feb 6 15:01:25 EST 2018


On Tue, Feb 06, 2018 at 05:46:27PM +0800, jamesd at echeque.com wrote:
> On 15/01/2018 13:04, Howard Chu wrote:
> >jamesd at echeque.com wrote:
> >>Is there somewhere a list of near duplicate unicode symbols, or existing
> >>canonicalization code?
> >
> >Have you already read https://www.unicode.org/reports/tr15/tr15-45.html ?
> 
> This link is extremely useful, but does not address the homoglyph problem.

UTS#39 has a confusables.txt file, which is the closes to what you're
asking for, and the UC's UTS#39 *is* the proper vehicle for updates to
this.

Nico
-- 


More information about the cryptography mailing list