java.lang.Object
  pt.tumba.ngram.NGramConstants
public class NGramConstants
Constant values used in the TCatNG package.
Field Summary | |
---|---|
static int | SIMILARITYJIANG: Use the similarity metric proposed by Jiang & Conrath in "Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy". |
static int | SIMILARITYLIN: Use the similarity metric proposed by Lin in "An information-theoretic definition of similarity". |
static int | SIMILARITYOUTOFPLACE: Use the similarity metric proposed by Cavnar & Trenkle. |
static byte[] | SKIPABLE: Bytes that can be skipped while building the profiles. |
static boolean | SMOOTHING: Use Good-Turing smoothing on the N-gram occurrence frequency. |
static int | USEDNGRAMSMAX: The lowest ranking position for storage in the N-gram profiles. |
static int | USEDNGRAMSMIN: The highest ranking position for storage in the N-gram profiles. |
Constructor Summary | |
---|---|
NGramConstants() |
Method Summary |
---|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final byte[] SKIPABLE
public static final int USEDNGRAMSMAX
USEDNGRAMSMAX=400
Only the 400 most frequently occurring N-grams will be stored.
public static final int USEDNGRAMSMIN
USEDNGRAMSMIN=200
The 200 most frequently occurring N-grams will be skipped.
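Taken together, USEDNGRAMSMIN and USEDNGRAMSMAX describe a rank window over the N-grams ordered by frequency. The following is a minimal sketch of that idea, not the package's actual code: the class RankWindowSketch and the method selectForProfile are hypothetical, and whether the real bounds are inclusive or exclusive is an assumption here (ranks are 0-based, min inclusive, max exclusive).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class RankWindowSketch {
    // Values copied from the field descriptions above.
    static final int USEDNGRAMSMIN = 200;
    static final int USEDNGRAMSMAX = 400;

    /**
     * Hypothetical helper: ranks N-grams by descending count and keeps
     * those whose 0-based rank r satisfies min <= r < max (assumed bounds).
     */
    static List<String> selectForProfile(Map<String, Integer> counts, int min, int max) {
        List<Map.Entry<String, Integer>> ranked = new ArrayList<>(counts.entrySet());
        // Most frequent first; ties broken alphabetically so the order is deterministic.
        ranked.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        List<String> kept = new ArrayList<>();
        int end = Math.min(max, ranked.size());
        for (int rank = min; rank < end; rank++) {
            kept.add(ranked.get(rank).getKey());
        }
        return kept;
    }
}
```

Calling selectForProfile(counts, USEDNGRAMSMIN, USEDNGRAMSMAX) on a count table with fewer than 200 distinct N-grams returns an empty list under these assumptions, since every N-gram falls inside the skipped range.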
public static final int SIMILARITYLIN
public static final int SIMILARITYJIANG
public static final int SIMILARITYOUTOFPLACE
public static final boolean SMOOTHING
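For SIMILARITYOUTOFPLACE, the Cavnar & Trenkle measure compares the rank of each N-gram in two frequency-ordered profiles and sums the rank differences. The sketch below illustrates that general technique only. The class OutOfPlaceSketch, the method distance, and the penalty used for N-grams missing from the category profile (the category profile's length) are assumptions, not taken from this package.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class OutOfPlaceSketch {
    /**
     * Sums, for every N-gram in the document profile, how far its rank is
     * from the rank of the same N-gram in the category profile.
     * Both lists are ordered from most to least frequent.
     */
    static int distance(List<String> documentProfile, List<String> categoryProfile) {
        // Index the category profile by N-gram so rank lookups are O(1).
        Map<String, Integer> categoryRank = new HashMap<>();
        for (int i = 0; i < categoryProfile.size(); i++) {
            categoryRank.put(categoryProfile.get(i), i);
        }
        // Assumed penalty for an N-gram absent from the category profile.
        int maxPenalty = categoryProfile.size();
        int total = 0;
        for (int i = 0; i < documentProfile.size(); i++) {
            Integer rank = categoryRank.get(documentProfile.get(i));
            total += (rank == null) ? maxPenalty : Math.abs(rank - i);
        }
        return total;
    }
}
```

A smaller result means the two profiles order their N-grams more similarly; identical profiles give 0.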
Constructor Detail |
---|
public NGramConstants()