Closed
Description
Hi,
In HashedPartitioner you use python hash function
idx = hash(key) % size #line 12
The python hash function is not consistent and is based on the current running python environment.
For example hash('123') would produce a different partition each time a python process is restarted.
Is there a chance to use another python hash function instead (e.g. I'd recommend murmur hash, mmh3).
Thanks
Metadata
Metadata
Assignees
Labels
No labels