parrotcode: Hash table | |
Contents | C |
src/hash.c - Hash table
A hashtable contains an array of bucket indexes.
Buckets are nodes in a linked list,
each containing a void *
key and value.
During hash creation the types of key and value as well as appropriate compare and hashing functions can be set.
This hash implementation uses just one piece of malloced memory.
The hash->bu
union points into this regions.
At positive indices are bucket pointers,
at negative indices is the bucket store itself.
This hash doesn't move during GC, therefore a lot of the old caveats don't apply.
static size_t key_hash_STRING(Interp *interpreter, void *value, size_t seed)
value
.static int STRING_compare(Parrot_Interp interp, void *a, void *b)
a
is the search key,
b
is the bucket key.static size_t key_hash_cstring(Interp *interpreter, void *value, size_t seed)
static int cstring_compare(Parrot_Interp interp, void *a, void *b)
key_hash
and compare
functions.static size_t key_hash_int(Interp *interp, void *value, size_t seed)
static int int_compare(Parrot_Interp interp, void *a, void *b)
key_hash
and compare
functions.void dump_hash(Interp *interpreter, Hash *hash)
void mark_hash(Interp *interpreter, Hash *hash)
void hash_visit(Interp *interpreter, Hash *hash, void *pinfo)
pinfo
is the visit info,
(see include/parrot/pmc_freeze.h>).static void expand_hash(Interp *interpreter, Hash *hash)
MAXFULL_PERCENT
% of N as the number of buckets.
This way,
as soon as we run out of buckets on the free list,
we know that it's time to resize the hashtable. bucket_index = hash(key) % hash_size
hash_size
(the old hash_size
). In fact, because the hashtable is always a power of two in size, it depends only on the next bit in the hash value, after the ones previously used.->next
pointers, and they'll be all over memory.)void new_hash(Interp *interpreter, Hash **hptr)
hptr
.void new_cstring_hash(Interp *interpreter, Hash **hptr)
hptr
.void new_hash_x(Interp *interpreter, Hash **hptr, PARROT_DATA_TYPES val_type, Hash_key_type hkey_type, hash_comp_fn compare, hash_hash_key_fn keyhash)
hptr
.Hash*
is seen, it doesn't get properly marked (only the Hash*
buffer is marked, not its contents). By passing the **hptr
up to the Hash's init function, the newly constructed PMC is on the stack including this newly constructed Hash, so that it gets marked properly.void new_pmc_hash_x(Interp *interpreter, PMC *container, PARROT_DATA_TYPES val_type, Hash_key_type hkey_type, hash_comp_fn compare, hash_hash_key_fn keyhash)
container
PMC gets stored in the Hash end the newly created Hash is in PMC_struct_val(container).PMC *Parrot_new_INTVAL_hash(Interp *interpreter, UINTVAL flags)
flags
can be PObj_constant_FLAG
or 0.INTVAL hash_size(Interp *interpreter, Hash *hash)
void *hash_get_idx(Interp *interpreter, Hash *hash, PMC *key)
HashBucket *hash_get_bucket(Interp *interpreter, Hash *hash, void *key)
key
.void *hash_get(Interp *interpreter, Hash *hash, void *key)
key
or NULL
if no bucket is found.INTVAL hash_exists(Interp *interpreter, Hash *hash, void *key)
HashBucket *hash_put(Interp *interpreter, Hash *hash, void *key, void *value)
key
is not copied.void hash_delete(Interp *interpreter, Hash *hash, void *key)
void hash_clone(Interp *interp, Hash *hash, Hash **dest)
hash
to dest
.docs/pdds/pdd08_keys.pod.
leo add function pointer for compare, hash, mark
hash keys are now (void *)
add new_cstring_hash()
function
bucket->value
is now a plain pointer, no more an HASH_ENTRY
With little changes, we can again store arbitrary items if needed, see TODO in code.
boemmels renamed HASH
and HASHBUCKET
to Hash
and HashBucket
leo randomize key_hash
seed
extend new_hash_x()
init call by value_type
and _size
.
leo USE_STRING_EQUAL
define, see comment above
leo heavy rewrite: use just one piece of malloced memory
Future optimizations:
realloc
.)
|