com.cybozu.labs.langdetect.util
Class NGram

java.lang.Object
  extended by com.cybozu.labs.langdetect.util.NGram

public class NGram
extends Object

Cut out N-gram from text. Users don't use this class directly.

Author:
Nakatani Shuyo

Field Summary
static HashMap<Character,Character> cjk_map
           
static int N_GRAM
           
 
Constructor Summary
NGram()
          Constructor.
 
Method Summary
 void addChar(char ch)
          Append a character into ngram buffer.
 String get(int n)
          Get n-Gram
static String normalize_vi(String text)
          Normalizer for Vietnamese.
static char normalize(char ch)
          Character Normalization
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

N_GRAM

public static final int N_GRAM
See Also:
Constant Field Values

cjk_map

public static HashMap<Character,Character> cjk_map
Constructor Detail

NGram

public NGram()
Constructor.

Method Detail

addChar

public void addChar(char ch)
Append a character into ngram buffer.

Parameters:
ch -

get

public String get(int n)
Get n-Gram

Parameters:
n - length of n-gram
Returns:
n-Gram String (null if it is invalid)

normalize

public static char normalize(char ch)
Character Normalization

Parameters:
ch -
Returns:
Normalized character

normalize_vi

public static String normalize_vi(String text)
Normalizer for Vietnamese. Normalize Alphabet + Diacritical Mark(U+03xx) into U+1Exx .

Parameters:
text -
Returns:
normalized text


Copyright © 2012. All Rights Reserved.