public final class StringDistances extends Object
Consists of well known metrics. All distance metrics are setup with sensible
defaults, to customize metrics use StringDistanceBuilder.
The created distance metrics are immutable and thread-safe.
| Constructor and Description |
|---|
StringDistances() |
| Modifier and Type | Method and Description |
|---|---|
static StringDistance |
blockDistance()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the BlockDistance metric. |
static StringDistance |
cosineSimilarity()
Returns a distance metric that uses a
Tokenizers.whitespace() and
the CosineSimilarity metric. |
static StringDistance |
damerauLevenshtein()
Returns a string distance metric that uses a
DamerauLevenshtein metric. |
static StringDistance |
dice()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the Dice metric. |
static StringDistance |
euclideanDistance()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the EuclideanDistance metric. |
static StringDistance |
generalizedJaccard()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the GeneralizedJaccard metric. |
static StringDistance |
hammingDistance()
Returns a string distance metric that uses the
HammingDistance metric. |
static StringDistance |
identity()
Returns an string distance metric that uses the
Identity metric. |
static StringDistance |
jaccard()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the Jaccard metric. |
static StringDistance |
jaro()
Returns a string distance metric that uses the
Jaro metric. |
static StringDistance |
jaroWinkler()
Returns a string distance metric that uses the
JaroWinkler metric. |
static StringDistance |
levenshtein()
Returns a string distance metric that uses the
Levenshtein metric. |
static StringDistance |
longestCommonSubsequence()
Returns a string distance metric that uses the
LongestCommonSubsequence metric. |
static StringDistance |
longestCommonSubstring()
Returns a string distance metric that uses the
LongestCommonSubstring metric. |
static StringDistance |
overlapCoefficient()
Returns a string distance metric that uses a
Tokenizers.whitespace() and
the OverlapCoefficient distance metric. |
static StringDistance |
qGramsDistance()
Returns a string distance metric that uses a
Tokenizers.qGramWithPadding(int) for q=3 and the
BlockDistance metric. |
static StringDistance |
simonWhite()
Returns a string distance metric that uses a
Tokenizers.whitespace()
followed by a Tokenizers.qGramWithPadding(int) for q=2
and the SimonWhite metric. |
public static StringDistance cosineSimilarity()
Tokenizers.whitespace() and
the CosineSimilarity metric.public static StringDistance blockDistance()
Tokenizers.whitespace() and
the BlockDistance metric.public static StringDistance damerauLevenshtein()
DamerauLevenshtein metric.public static StringDistance dice()
Tokenizers.whitespace() and
the Dice metric.public static StringDistance euclideanDistance()
Tokenizers.whitespace() and
the EuclideanDistance metric.public static StringDistance generalizedJaccard()
Tokenizers.whitespace() and
the GeneralizedJaccard metric.public static StringDistance identity()
Identity metric.public static StringDistance jaccard()
Tokenizers.whitespace() and
the Jaccard metric.public static StringDistance jaro()
Jaro metric.public static StringDistance jaroWinkler()
JaroWinkler metric.public static StringDistance levenshtein()
Levenshtein metric.public static StringDistance overlapCoefficient()
Tokenizers.whitespace() and
the OverlapCoefficient distance metric.public static StringDistance qGramsDistance()
Tokenizers.qGramWithPadding(int) for q=3 and the
BlockDistance metric.public static StringDistance simonWhite()
Tokenizers.whitespace()
followed by a Tokenizers.qGramWithPadding(int) for q=2
and the SimonWhite metric.public static StringDistance hammingDistance()
HammingDistance metric.public static StringDistance longestCommonSubsequence()
LongestCommonSubsequence metric.public static StringDistance longestCommonSubstring()
LongestCommonSubstring metric.Copyright © 2014–2016. All rights reserved.