Class CharWormSet
- All Implemented Interfaces:
Accountable, CharCollection, CharContainer, CharLookupContainer, CharSet, Preallocable, Cloneable, Iterable<CharCursor>
A hash set of chars, implemented using the Worm Hashing strategy.
This strategy is appropriate for a medium-sized set (fewer than 2M keys). Adding keys takes more
time because the set maintains chains of keys that have the same hash. In return, lookups stay
fast even when the set is heavily loaded or hashes are clustered. On average it takes
slightly less memory than CharHashSet: each entry is heavier, but the load factor is higher
(it varies around 80%), so the set enlarges later.
Field Summary
Fields
| Modifier and Type | Field | Description |
| --- | --- | --- |
| protected int | iterationSeed | Seed used to ensure the hash iteration order is different from an iteration to another. |
| char[] | keys | The array holding keys. |
| byte[] | next | abs(next[i]) = offset to next chained entry index. |
| protected int | size | Set size (number of entries). |
-
Constructor Summary
Constructors
| Constructor | Description |
| --- | --- |
| CharWormSet() | New instance with sane defaults. |
| CharWormSet(int expectedElements) | New instance with the provided defaults. |
| CharWormSet(CharContainer container) | Creates a new instance from all elements of another container. |
-
Method Summary
| Modifier and Type | Method | Description |
| --- | --- | --- |
| boolean | add(char key) | Adds key to the set. |
| final int | addAll(char... elements) | Adds all elements from the given list (vararg) to this set. |
| int | addAll(CharContainer container) | Adds all elements from the given CharContainer to this set. |
| int | addAll(Iterable<? extends CharCursor> iterable) | Adds all elements from the given iterable to this set. |
| protected void | allocateBuffers(int capacity) | |
| void | clear() | Removes all elements from this collection. |
| CharWormSet | clone() | Clones this set. |
| boolean | contains(char key) | Looks up a given element in the container. |
| void | ensureCapacity(int expectedElements) | Ensures this container can hold at least the given number of elements without resizing its buffers. |
| boolean | equals(Object) | |
| <T extends CharProcedure> T | forEach(T procedure) | Applies a procedure to all container elements. |
| static CharWormSet | from(char... elements) | Creates a set from a variable number of arguments or an array of char. |
| int | hashCode() | |
| protected int | hashKey(char key) | |
| boolean | indexExists(int index) | |
| char | indexGet(int index) | Returns the exact value of the existing key. |
| void | indexInsert(int index, char key) | Inserts a key for an index that is not present in the set. |
| int | indexOf(char key) | Returns a logical "index" of a given key that can be used to speed up follow-up logic in certain scenarios (conditional logic). |
| void | indexRemove(int index) | Removes a key at an index previously acquired from indexOf(char). |
| char | indexReplace(int index, char equivalentKey) | Replaces the existing equivalent key with the given one and returns any previous value stored for that key. |
| boolean | isEmpty() | Shortcut for size() == 0. |
| Iterator<CharCursor> | iterator() | Returns an iterator to a cursor traversing the collection. |
| protected int | nextIterationSeed() | Provides the next iteration seed used to build the iteration starting slot and offset increment. |
| long | ramBytesAllocated() | Allocated memory estimation. |
| long | ramBytesUsed() | Bytes actually in use. |
| void | release() | Removes all elements from the collection and additionally releases any internal buffers. |
| boolean | remove(char key) | An alias for the (preferred) removeAll(char). |
| int | removeAll(char key) | Removes all occurrences of key from this collection. |
| int | removeAll(CharContainer other) | Removes all keys present in a given container. |
| int | removeAll | Default implementation uses a predicate for removal. |
| int | removeAll(CharPredicate predicate) | Removes all elements in this collection for which the given predicate returns true. |
| int | retainAll | Default implementation uses a predicate for retaining. |
| int | retainAll(CharPredicate predicate) | Default implementation redirects to CharCollection.removeAll(CharPredicate) and negates the predicate. |
| int | size() | Returns the current number of elements in this container. |
| char[] | toArray() | Default implementation of copying to an array. |
| String | toString() | Converts the contents of this container to a human-friendly string. |
| String | visualizeKeyDistribution(int characters) | Visually depicts the distribution of keys. |

Methods inherited from class java.lang.Object
finalize, getClass, notify, notifyAll, wait, wait, wait
Methods inherited from interface com.carrotsearch.hppc.CharCollection
removeAll, retainAll, retainAll
Methods inherited from interface com.carrotsearch.hppc.CharContainer
toArray
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
Field Details
-
keys
public char[] keys
The array holding keys.
-
next
public byte[] next
abs(next[i]) = offset to the next chained entry index. next[i] = 0 for a free bucket.
The offset is always forward, and the array is considered circular, meaning that an entry at the end of the array may point to an entry at the beginning with a positive offset.
The sign of the offset encodes the head or tail of a chain: next[i] > 0 for the first (head-of-chain) entry (within [1, WormUtil.maxOffset(int)]), next[i] < 0 for the subsequent tail-of-chain entries (within [-WormUtil.maxOffset(int), -1]). For the last entry in the chain, abs(next[i]) = WormUtil.END_OF_CHAIN.
-
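The encoding of the next array described above can be illustrated with a small, self-contained sketch. It is a toy model, not HPPC's code: the class NextArraySketch and its methods are hypothetical, and the value 127 used for the end-of-chain marker is an assumption standing in for WormUtil.END_OF_CHAIN.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of the chain encoding described for the next array; not HPPC's implementation. */
final class NextArraySketch {
    /** Assumed end-of-chain marker, standing in for WormUtil.END_OF_CHAIN. */
    static final int END_OF_CHAIN = 127;

    /** A bucket is free when its next value is 0. */
    static boolean isFree(byte[] next, int i) {
        return next[i] == 0;
    }

    /** A positive value marks the head of a chain; negative values mark tail entries. */
    static boolean isHead(byte[] next, int i) {
        return next[i] > 0;
    }

    /** Returns the index of the next entry in the chain, or -1 if there is none. */
    static int nextInChain(byte[] next, int i) {
        int offset = Math.abs(next[i]);
        if (offset == 0 || offset == END_OF_CHAIN) {
            return -1; // free bucket or last entry of the chain
        }
        // The offset is always forward and the array is treated as circular.
        return (i + offset) % next.length;
    }

    /** Collects the bucket indices of the chain that starts at an occupied head bucket. */
    static List<Integer> chain(byte[] next, int head) {
        List<Integer> indices = new ArrayList<>();
        for (int i = head; i != -1; i = nextInChain(next, i)) {
            indices.add(i);
        }
        return indices;
    }

    public static void main(String[] args) {
        // Head at bucket 6 (offset +3 wraps to bucket 1), tail at 1 (offset 2), last entry at 3.
        byte[] next = {0, -2, 0, -127, 0, 0, 3, 0};
        System.out.println(chain(next, 6));
    }
}
```

In this example the head sits near the end of the array, so its positive offset wraps around to a bucket at the beginning, which is the circular behavior the field description mentions.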
size
protected int size
Set size (number of entries).
-
iterationSeed
protected int iterationSeed
Seed used to ensure the hash iteration order differs from one iteration to the next.
-
-
Constructor Details
-
CharWormSet
public CharWormSet()
New instance with sane defaults.
-
CharWormSet
public CharWormSet(int expectedElements)
New instance with the provided defaults.
There is no load factor parameter as this set enlarges automatically. In practice the load factor varies around 80% (between 75% and 90%). The load factor is 100% for tiny sets.
- Parameters:
expectedElements - The expected number of elements. The capacity of the set is calculated based on it.
-
CharWormSet
Creates a new instance from all elements of another container.
-
-
Method Details
-
from
Creates a set from a variable number of arguments or an array of char. The elements are copied from the argument to the internal buffer.
-
clone
Clones this set. The cloning operation is efficient because it directly copies the internal arrays, without having to put elements in the cloned set. The cloned set has the same elements and the same capacity as this set.
-
size
public int size()
Returns the current number of elements in this container. Calculating the container's size may take O(n) time, although implementing classes should try to maintain the current size and return it in constant time.
- Specified by:
size in interface CharContainer
-
isEmpty
public boolean isEmpty()
Shortcut for size() == 0.
- Specified by:
isEmpty in interface CharContainer
-
contains
public boolean contains(char key)
Looks up a given element in the container. This operation has no speed guarantees (may be linear with respect to the size of this container).
- Specified by:
contains in interface CharContainer
- Specified by:
contains in interface CharLookupContainer
- Returns:
- Returns true if this container has an element equal to key.
-
add
public boolean add(char key)
Adds key to the set.
-
addAll
public final int addAll(char... elements)
Adds all elements from the given list (vararg) to this set.
- Returns:
- Returns the number of elements actually added as a result of this call (not previously present in the set).
-
addAll
Adds all elements from the given CharContainer to this set.
-
addAll
Adds all elements from the given iterable to this set.
- Returns:
- Returns the number of elements actually added as a result of this call (not previously present in the set).
-
remove
public boolean remove(char key)
An alias for the (preferred) removeAll(char).
-
removeAll
public int removeAll(char key)
Removes all occurrences of key from this collection.
- Specified by:
removeAll in interface CharCollection
- Parameters:
key - Element to be removed from this collection, if present.
- Returns:
- The number of removed elements as a result of this call.
-
removeAll
Removes all keys present in a given container.
- Returns:
- Returns the number of elements actually removed as a result of this call.
-
removeAll
Removes all elements in this collection for which the given predicate returns true.
- Specified by:
removeAll in interface CharCollection
- Returns:
- Returns the number of removed elements.
-
forEach
Applies a procedure to all container elements. Returns the argument (any subclass of CharProcedure). This lets the caller call methods of the argument by chaining the call (even if the argument is an anonymous type) to retrieve computed values, for example (IntContainer):
    int count = container.forEach(new IntProcedure() {
      int count; // this is a field declaration in an anonymous class.
      public void apply(int value) { count++; }
    }).count;
- Specified by:
forEach in interface CharContainer
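The chaining idiom described for forEach can be tried without the library. The sketch below is only an illustration: the nested CharProcedure interface and the static forEach helper are hypothetical stand-ins that mirror the documented shape (a generic method returning its own argument), applied to a plain char array.

```java
/** Self-contained sketch of the "return the procedure" idiom; the types here are stand-ins. */
final class ForEachSketch {
    /** Hypothetical stand-in for a char procedure with a single apply method. */
    interface CharProcedure {
        void apply(char value);
    }

    /** Applies the procedure to every element and returns the same procedure instance. */
    static <T extends CharProcedure> T forEach(char[] elements, T procedure) {
        for (char c : elements) {
            procedure.apply(c);
        }
        return procedure;
    }

    /** Counts digits by reading a field of the anonymous procedure after the traversal. */
    static int countDigits(char[] elements) {
        return forEach(elements, new CharProcedure() {
            int count; // field declared in the anonymous class

            @Override
            public void apply(char value) {
                if (Character.isDigit(value)) {
                    count++;
                }
            }
        }).count;
    }

    public static void main(String[] args) {
        System.out.println(countDigits("a1b22".toCharArray()));
    }
}
```

Because the return type is the type parameter T rather than the interface, the compiler keeps the anonymous class type, which is what makes the trailing .count access legal.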
-
forEach
Applies a predicate to container elements as long as the predicate returns true. The iteration is interrupted otherwise.
- Specified by:
forEach in interface CharContainer
-
iterator
Returns an iterator to a cursor traversing the collection. The order of traversal is not defined. More than one cursor may be active at a time. The behavior of iterators is undefined if structural changes are made to the underlying collection.
The iterator is implemented as a cursor and it returns the same cursor instance on every call to Iterator.next() (to avoid boxing of primitive types). To read the current list's value (or index in the list) use the cursor's public fields. An example is shown below.
    for (CharCursor c : container) {
      System.out.println("index=" + c.index + " value=" + c.value);
    }
- Specified by:
iterator in interface CharContainer
- Specified by:
iterator in interface Iterable<CharCursor>
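Because the same cursor instance is returned on every call to Iterator.next(), values should be copied out of the cursor rather than keeping references to it. The following sketch shows the effect with a hypothetical Cursor class and a hypothetical cursorsOver helper over a plain char array; it does not use or reproduce the library's classes.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

/** Toy illustration of a cursor-reusing iterator; the types here are stand-ins. */
final class CursorReuseSketch {
    /** Hypothetical cursor with public fields, similar in spirit to a char cursor. */
    static final class Cursor {
        int index;
        char value;
    }

    /** Iterates over the array while reusing one Cursor instance for all elements. */
    static Iterable<Cursor> cursorsOver(char[] data) {
        return () -> new Iterator<Cursor>() {
            private final Cursor cursor = new Cursor();
            private int pos = 0;

            @Override
            public boolean hasNext() {
                return pos < data.length;
            }

            @Override
            public Cursor next() {
                if (!hasNext()) {
                    throw new NoSuchElementException();
                }
                cursor.index = pos;
                cursor.value = data[pos];
                pos++;
                return cursor; // same instance every time
            }
        };
    }

    /** Correct usage: copy the primitive value out of the cursor during iteration. */
    static String copiedValues(char[] data) {
        StringBuilder sb = new StringBuilder();
        for (Cursor c : cursorsOver(data)) {
            sb.append(c.value);
        }
        return sb.toString();
    }

    /** Pitfall: retained cursor references all show the last element visited. */
    static String retainedCursorValues(char[] data) {
        List<Cursor> retained = new ArrayList<>();
        for (Cursor c : cursorsOver(data)) {
            retained.add(c);
        }
        StringBuilder sb = new StringBuilder();
        for (Cursor c : retained) {
            sb.append(c.value);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        char[] data = {'a', 'b', 'c'};
        System.out.println(copiedValues(data));
        System.out.println(retainedCursorValues(data));
    }
}
```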
-
clear
public void clear()
Removes all elements from this collection.
- Specified by:
clear in interface CharCollection
-
release
public void release()
Removes all elements from the collection and additionally releases any internal buffers. Typically, if the object is to be reused, a simple CharCollection.clear() should be a better alternative since it'll avoid reallocation.
- Specified by:
release in interface CharCollection
-
equals
-
hashCode
public int hashCode()
-
hashKey
protected int hashKey(char key)
indexOf
public int indexOf(char key)
Returns a logical "index" of a given key that can be used to speed up follow-up logic in certain scenarios (conditional logic). The semantics of "indexes" are not strictly defined. Indexes may not be (and typically won't be) contiguous. The index is valid only between modifications (it will not be affected by read-only operations).
- Parameters:
key - The key to locate in the set.
- Returns:
- A non-negative value of the logical "index" of the key in the set or a negative value if the key did not exist.
-
indexExists
public boolean indexExists(int index)
- Parameters:
index - The index of a given key, as returned from indexOf(char).
- Returns:
- Returns true if the index corresponds to an existing key or false otherwise. This is equivalent to checking whether the index is a non-negative value (existing keys) or a negative value (non-existing keys).
-
indexGet
public char indexGet(int index)
Returns the exact value of the existing key. This method makes sense for sets of objects which define custom key-equality relationship.
- Parameters:
index - The index of an existing key.
- Returns:
- Returns the equivalent key currently stored in the set.
- Throws:
AssertionError - If assertions are enabled and the index does not correspond to an existing key.
-
indexReplace
public char indexReplace(int index, char equivalentKey)
Replaces the existing equivalent key with the given one and returns any previous value stored for that key.
- Parameters:
index - The index of an existing key.
equivalentKey - The key to put in the set as a replacement. Must be equivalent to the key currently stored at the provided index.
- Returns:
- Returns the previous key stored in the set.
- Throws:
AssertionError - If assertions are enabled and the index does not correspond to an existing key.
-
indexInsert
public void indexInsert(int index, char key)
Inserts a key for an index that is not present in the set. This method may help in avoiding double recalculation of the key's hash.
- Parameters:
index - The index of a previously non-existing key, as returned from indexOf(char).
- Throws:
AssertionError - If assertions are enabled and the index corresponds to an existing key.
-
indexRemove
public void indexRemove(int index)
Removes a key at an index previously acquired from indexOf(char).
- Parameters:
index - The index of the key to remove, as returned from indexOf(char).
- Throws:
AssertionError - If assertions are enabled and the index does not correspond to an existing key.
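The index methods are meant to be combined so that a key is located once and the result is reused for the follow-up operation. The sketch below shows that calling pattern on a deliberately tiny, hypothetical set (IndexApiSketch, fixed capacity, linear probing, no removal). Its internals are an assumption made for illustration and are unrelated to how CharWormSet stores keys; only the contract (non-negative index for an existing key, negative otherwise) follows the descriptions above.

```java
/** Toy fixed-capacity set that mimics the indexOf / indexInsert contract; not HPPC code. */
final class IndexApiSketch {
    private static final int CAPACITY = 16; // power of two, so (CAPACITY - 1) works as a mask
    private final char[] keys = new char[CAPACITY];
    private final boolean[] used = new boolean[CAPACITY];
    private int size;

    /** Returns the slot of the key if present, otherwise the complement of its insertion slot. */
    int indexOf(char key) {
        int slot = key & (CAPACITY - 1);
        while (used[slot]) {
            if (keys[slot] == key) {
                return slot;
            }
            slot = (slot + 1) & (CAPACITY - 1);
        }
        return ~slot; // always negative
    }

    boolean indexExists(int index) {
        return index >= 0;
    }

    char indexGet(int index) {
        assert index >= 0 : "The index must point at an existing key.";
        return keys[index];
    }

    void indexInsert(int index, char key) {
        assert index < 0 : "The index must not point at an existing key.";
        if (size >= CAPACITY - 1) {
            throw new IllegalStateException("toy set is full"); // keeps one slot free for probing
        }
        int slot = ~index;
        keys[slot] = key;
        used[slot] = true;
        size++;
    }

    int size() {
        return size;
    }

    /** Adds the key only if absent, locating it a single time. */
    boolean addIfAbsent(char key) {
        int index = indexOf(key);
        if (indexExists(index)) {
            return false;
        }
        indexInsert(index, key);
        return true;
    }

    public static void main(String[] args) {
        IndexApiSketch set = new IndexApiSketch();
        System.out.println(set.addIfAbsent('a'));
        System.out.println(set.addIfAbsent('a'));
    }
}
```

The point of the pattern is that addIfAbsent performs one lookup instead of a contains call followed by an add call; the index must be used before any other modification of the set.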
-
toString
Convert the contents of this container to a human-friendly string. -
ensureCapacity
public void ensureCapacity(int expectedElements)
Ensures this container can hold at least the given number of elements without resizing its buffers.
- Specified by:
ensureCapacity in interface Preallocable
- Parameters:
expectedElements - The total number of elements, inclusive.
-
visualizeKeyDistribution
Visually depicts the distribution of keys.
- Specified by:
visualizeKeyDistribution in interface CharSet
- Parameters:
characters - The number of characters to "squeeze" the entire buffer into.
- Returns:
- Returns a sequence of characters where '.' depicts an empty fragment of the internal buffer, 'X' depicts full or nearly full capacity within the buffer's range, and the digits 1 through 9 depict fill levels in between.
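One way such a visualization could be computed is sketched below. This is a guess at the general approach, not the library's algorithm: the class KeyDistributionSketch, the boolean occupancy array, and the exact thresholds (here 'X' only for a completely full fragment) are all assumptions.

```java
/** Hypothetical sketch of squeezing buffer occupancy into a short string. */
final class KeyDistributionSketch {
    /**
     * Splits the occupancy array into the requested number of fragments and emits one
     * character per fragment: '.' if empty, 'X' if full, otherwise a digit from 1 to 9.
     */
    static String visualize(boolean[] occupied, int characters) {
        StringBuilder sb = new StringBuilder(characters);
        for (int i = 0; i < characters; i++) {
            int start = (int) ((long) i * occupied.length / characters);
            int end = (int) ((long) (i + 1) * occupied.length / characters);
            int taken = 0;
            for (int slot = start; slot < end; slot++) {
                if (occupied[slot]) {
                    taken++;
                }
            }
            if (taken == 0) {
                sb.append('.'); // also covers zero-width fragments
            } else {
                int level = taken * 10 / (end - start); // 0..10
                sb.append(level >= 10 ? 'X' : (char) ('0' + Math.max(level, 1)));
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        boolean[] occupied = {true, false, false, false, true, true, false, false};
        System.out.println(visualize(occupied, 2));
    }
}
```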
-
ramBytesAllocated
public long ramBytesAllocated()
Allocated memory estimation.
- Specified by:
ramBytesAllocated in interface Accountable
- Returns:
- RAM allocated, in bytes
-
ramBytesUsed
public long ramBytesUsed()
Bytes actually in use.
- Specified by:
ramBytesUsed in interface Accountable
- Returns:
- RAM used, in bytes
-
allocateBuffers
protected void allocateBuffers(int capacity)
-
nextIterationSeed
protected int nextIterationSeed()
Provides the next iteration seed used to build the iteration starting slot and offset increment. This method does not need to be synchronized; what matters is that each thread gets a sequence of varying seeds.
removeAll
Default implementation uses a predicate for removal.
- Specified by:
removeAll in interface CharCollection
- Returns:
- Returns the number of removed elements.
-
retainAll
Default implementation uses a predicate for retaining.
- Specified by:
retainAll in interface CharCollection
- Returns:
- Returns the number of removed elements.
-
retainAll
Default implementation redirects to CharCollection.removeAll(CharPredicate) and negates the predicate.
- Specified by:
retainAll in interface CharCollection
- Returns:
- Returns the number of removed elements.
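The redirection from retainAll to removeAll with a negated predicate can be sketched in a few lines. The class RetainAllSketch and its nested CharPredicate interface are hypothetical stand-ins backed by a plain array; the sketch only illustrates the delegation and the returned count, which in both methods is the number of removed elements.

```java
import java.util.Arrays;

/** Toy collection showing retainAll expressed through removeAll; not HPPC code. */
final class RetainAllSketch {
    /** Hypothetical stand-in for a char predicate with a single apply method. */
    interface CharPredicate {
        boolean apply(char value);
    }

    private char[] elements;

    RetainAllSketch(char... elements) {
        this.elements = elements.clone();
    }

    /** Removes every element for which the predicate returns true; returns the removed count. */
    int removeAll(CharPredicate predicate) {
        int kept = 0;
        for (char c : elements) {
            if (!predicate.apply(c)) {
                elements[kept++] = c; // compact survivors to the front
            }
        }
        int removed = elements.length - kept;
        elements = Arrays.copyOf(elements, kept);
        return removed;
    }

    /** Keeps only elements matching the predicate by removing all that do not match. */
    int retainAll(CharPredicate predicate) {
        return removeAll(value -> !predicate.apply(value));
    }

    String contents() {
        return new String(elements);
    }

    public static void main(String[] args) {
        RetainAllSketch chars = new RetainAllSketch('a', '1', 'b', '2');
        System.out.println(chars.retainAll(c -> Character.isDigit(c)));
        System.out.println(chars.contents());
    }
}
```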
-
toArray
public char[] toArray()
Default implementation of copying to an array.
- Specified by:
toArray in interface CharContainer
-