My recent project had some issues with hashing some 10 million numbers. To analyze the matter I wrote a small test program, see numberhash.c.
I wanted to know which influence the following factors play:
- hashing just numbers (no alphabetic characters)
- ASCII vs. EBCIDC
- choice of hash function
- load factor
- distribution of collisions