| Package | Description |
|---|---|
| org.simmetrics.builders | |
| org.simmetrics.metrics |
| Modifier and Type | Method and Description |
|---|---|
StringDistance |
StringDistanceBuilder.BuildStep.build()
Builds a distance with the given steps.
|
StringDistance |
StringDistanceBuilder.StringDistanceInitialSimplifierStep.build()
Builds a distance with the given simplifier.
|
StringDistance |
StringDistanceBuilder.StringDistanceSimplifierStep.build()
Builds a distance with the given simplifier.
|
StringDistance |
StringDistanceBuilder.CollectionDistanceTokenizerStep.build()
Builds a string distance metric that will use the given
simplification, tokenization and filtering steps.
|
| Modifier and Type | Method and Description |
|---|---|
static StringDistanceBuilder.StringDistanceInitialSimplifierStep |
StringDistanceBuilder.with(StringDistance distance)
Starts building a distance metric with a string distance metric.
|
| Modifier and Type | Class and Description |
|---|---|
class |
DamerauLevenshtein
Calculates the Damerau-Levenshtein similarity and distance measure between
two strings.
|
class |
Jaro
Calculates the Jaro distance (similarity) over two strings.
|
class |
JaroWinkler
Calculates the Jaro-Winkler distance (similarity) over two strings.
|
class |
Levenshtein
Calculates the Levenshtein distance and similarity over two strings.
|
class |
LongestCommonSubsequence
Applies the longest common subsequence algorithm to calculate the similarity
and distance between two strings.
|
class |
LongestCommonSubstring
Applies the longest common substring algorithm to calculate the similarity
and distance between two strings.
|
| Modifier and Type | Method and Description |
|---|---|
static StringDistance |
StringDistances.blockDistance()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the BlockDistance metric. |
static StringDistance |
StringDistances.cosineSimilarity()
Returns a distance metric that uses a
Tokenizers.whitespace() and
the CosineSimilarity metric. |
static StringDistance |
StringDistances.damerauLevenshtein()
Returns a string distance metric that uses a
DamerauLevenshtein metric. |
static StringDistance |
StringDistances.dice()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the Dice metric. |
static StringDistance |
StringDistances.euclideanDistance()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the EuclideanDistance metric. |
static StringDistance |
HammingDistance.forString()
Returns a Hamming distance to compare strings.
|
static StringDistance |
StringDistances.generalizedJaccard()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the GeneralizedJaccard metric. |
static StringDistance |
StringDistances.hammingDistance()
Returns a string distance metric that uses the
HammingDistance metric. |
static StringDistance |
StringDistances.identity()
Returns an string distance metric that uses the
Identity metric. |
static StringDistance |
StringDistances.jaccard()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the Jaccard metric. |
static StringDistance |
StringDistances.jaro()
Returns a string distance metric that uses the
Jaro metric. |
static StringDistance |
StringDistances.jaroWinkler()
Returns a string distance metric that uses the
JaroWinkler metric. |
static StringDistance |
StringDistances.levenshtein()
Returns a string distance metric that uses the
Levenshtein metric. |
static StringDistance |
StringDistances.longestCommonSubsequence()
Returns a string distance metric that uses the
LongestCommonSubsequence metric. |
static StringDistance |
StringDistances.longestCommonSubstring()
Returns a string distance metric that uses the
LongestCommonSubstring metric. |
static StringDistance |
StringDistances.overlapCoefficient()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the OverlapCoefficient distance metric. |
static StringDistance |
StringDistances.qGramsDistance()
Returns a string distance metric that uses a
Tokenizers.qGramWithPadding(int) for q=3 and the
BlockDistance metric. |
static StringDistance |
StringDistances.simonWhite()
Returns a string distance metric that uses a
Tokenizers.whitespace()
followed by a Tokenizers.qGramWithPadding(int) for q=2
and the SimonWhite metric. |
Copyright © 2014–2016. All rights reserved.