HOME

Saturday, October 24, 2015

Canonicalizing and Hashing Emails for Customer Match

This is regarding Google's new "Customer Match" feature, full details here: https://support.google.com/adwords/answer/6276125?hl=en

The article mentions that you can upload hashed emails for increased privacy protection, as stated in the article, this effectively converts the email address:

example@gmail.com

to the code:

264e53d93759bde067fd01ef2698f98d1253c730d12f021116f02eebcfa9ace6

Gmail, and many other large email providers, will ignore dots inserted in email addresses (so e.x.a.m.p.l.e@gmail.com is the same as example@gmail.com) AND they will ignore anything inserted after a PLUS sign in email addresses (so example+ignoreme@gmail.com is the same as example@gmail.com)

MY QUESTION:

When you hash example+ignoreme@gmail.com or e.x.a.m.p.l.e@gmail.com you get different codes. Should we make it a policy to "canonicalize" email addresses by removing dots and everything after a plus sign before hashing?

Thanks

No comments:

Post a Comment