What are good hashing kernels? Would it be possible to use the collision rate to select a set of hashing kernels? Or do the hashing kernels need to be locality sensitive?

asked Nov 17 '11 at 11:15

ZXZ


One Answer:

Usually people use MurmurHash or a look-alike as the hash function, because it is fast and makes it easy to build many independent hash functions. Optimizing the hash functions themselves is hard and largely pointless: what you really want is for them to be independent, so that two features colliding under one function are unlikely to collide under another. What you hash (words, bigrams, character n-grams, or more complex features), on the other hand, can and should be optimized. Use whatever method you normally rely on (ablation, feature selection, etc.) to make sure you are not hashing too many useless features.
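To make this concrete, here is a minimal sketch of the hashing trick with a family of seeded hash functions. It is only an illustration under some assumptions: it uses the standard library's BLAKE2b with a per-seed salt as a stand-in for MurmurHash (which is not in the Python standard library), and the names `seeded_hash`, `hash_features`, and `char_ngrams`, as well as the bucket count, are made up for this example.

```python
import hashlib
from collections import Counter


def seeded_hash(token: str, seed: int) -> int:
    """64-bit hash of a token; each seed acts as a different hash function.

    Assumption: BLAKE2b with a salt stands in for a seeded MurmurHash.
    """
    salt = seed.to_bytes(8, "little")  # BLAKE2b accepts salts up to 16 bytes
    digest = hashlib.blake2b(
        token.encode("utf-8"), digest_size=8, salt=salt
    ).digest()
    return int.from_bytes(digest, "little")


def char_ngrams(text: str, n: int = 3) -> list:
    """One possible choice of 'what to hash': character n-grams."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def hash_features(tokens, n_buckets: int = 1024, seed: int = 0) -> dict:
    """Map tokens to a sparse vector of signed counts in n_buckets dimensions."""
    vec = Counter()
    for tok in tokens:
        h = seeded_hash(tok, seed)
        index = (h >> 1) % n_buckets   # bucket chosen from the upper bits
        sign = 1 if h & 1 else -1      # lowest bit gives a +/-1 sign
        vec[index] += sign
    # Drop buckets where signed collisions cancelled out exactly.
    return {i: v for i, v in vec.items() if v != 0}


if __name__ == "__main__":
    tokens = char_ngrams("hashing kernels", n=3)
    # Two seeds give two (practically) independent views of the same features.
    print(hash_features(tokens, n_buckets=16, seed=0))
    print(hash_features(tokens, n_buckets=16, seed=1))
```

The part worth tuning is the input to `hash_features` (which n-grams or other features you generate), not `seeded_hash` itself; swapping in a real MurmurHash binding should only change speed, not the overall behaviour.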

answered Nov 17 '11 at 14:07

Alexandre Passos ♦


User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.