Package org.languagetool.languagemodel
Class LuceneLanguageModel
java.lang.Object
org.languagetool.languagemodel.BaseLanguageModel
org.languagetool.languagemodel.LuceneLanguageModel
- All Implemented Interfaces:
AutoCloseable, LanguageModel
Like LuceneSingleIndexLanguageModel, but can merge the results of lookups in several independent indexes into one result.
- Since:
2.7
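The merging behaviour described above can be illustrated with a minimal sketch. It assumes that "merging" means summing the per-index occurrence counts; this page does not state the actual strategy. `CountSource` and `mergedCount` are hypothetical names used only for illustration and are not part of the LanguageTool API.

```java
import java.util.Arrays;
import java.util.List;

public class MergedCountSketch {

  /** Hypothetical stand-in for one independent n-gram index. */
  interface CountSource {
    long getCount(List<String> tokens);
  }

  /** Merges lookups by summing the per-index counts (an assumption, not confirmed by this page). */
  static long mergedCount(List<CountSource> sources, List<String> tokens) {
    long total = 0;
    for (CountSource source : sources) {
      total += source.getCount(tokens);
    }
    return total;
  }

  public static void main(String[] args) {
    List<String> phrase = Arrays.asList("the", "house");
    // Two fake indexes that each know a different count for the same phrase.
    CountSource index1 = tokens -> tokens.equals(phrase) ? 40L : 0L;
    CountSource index2 = tokens -> tokens.equals(phrase) ? 2L : 0L;
    System.out.println(mergedCount(Arrays.asList(index1, index2), phrase)); // prints 42
  }
}
```

With a real LuceneLanguageModel the caller never sees the individual indexes; a single getCount call returns the merged result.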
-
Field Summary
Fields
Fields inherited from interface org.languagetool.languagemodel.LanguageModel
GOOGLE_SENTENCE_END, GOOGLE_SENTENCE_START
-
Constructor Summary
Constructors
Method Summary
Modifier and Type                 Method                                      Description
void                              close()
long                              getCount                                    Get the occurrence count for token.
long                              getCount                                    Get the occurrence count for the given token sequence.
private static @Nullable File[]   getSubDirectoriesOrNull(File topIndexDir)
long                              getTotalTokenCount()
String                            toString()
static void                       validateDirectory(File topIndexDir)
Methods inherited from class org.languagetool.languagemodel.BaseLanguageModel
getPseudoProbability, getPseudoProbabilityStupidBackoff
-
Field Details
-
lms
-
-
Constructor Details
-
LuceneLanguageModel
- Parameters:
topIndexDir - a directory which contains either: 1) subdirectories called 1grams, 2grams, 3grams, which are Lucene indexes with ngram occurrences as created by org.languagetool.dev.FrequencyIndexCreator, or 2) subdirectories index-1, index-2, etc. that contain the subdirectories described under 1)
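The two accepted directory layouts can be told apart with a few file system checks. The following is a rough sketch only, not the library's validateDirectory or getSubDirectoriesOrNull implementation. The names `isNgramIndexRoot` and `resolveIndexRoots` are hypothetical, and the sketch assumes that all three of 1grams, 2grams and 3grams must be present, which this page does not confirm.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class IndexLayoutSketch {

  /** Layout 1: dir directly contains 1grams, 2grams and 3grams (assumed: all three required). */
  static boolean isNgramIndexRoot(File dir) {
    for (String name : Arrays.asList("1grams", "2grams", "3grams")) {
      if (!new File(dir, name).isDirectory()) {
        return false;
      }
    }
    return true;
  }

  /** Returns the directories that hold n-gram indexes, for either layout; empty if none match. */
  static List<File> resolveIndexRoots(File topIndexDir) {
    List<File> roots = new ArrayList<>();
    if (isNgramIndexRoot(topIndexDir)) {
      roots.add(topIndexDir);  // layout 1: a single index
      return roots;
    }
    File[] children = topIndexDir.listFiles();  // null if topIndexDir is not a directory
    if (children != null) {
      Arrays.sort(children);
      for (File child : children) {
        // layout 2: index-1, index-2, ... each containing the layout-1 subdirectories
        if (child.getName().matches("index-\\d+") && isNgramIndexRoot(child)) {
          roots.add(child);
        }
      }
    }
    return roots;
  }

  public static void main(String[] args) {
    File top = new File(args.length > 0 ? args[0] : ".");
    System.out.println(resolveIndexRoots(top));
  }
}
```

An empty result means the directory matches neither layout; a caller could treat that as a validation failure.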
-
-
Method Details
-
validateDirectory
-
getSubDirectoriesOrNull
-
getCount
Description copied from class: BaseLanguageModel
Get the occurrence count for the given token sequence.
- Specified by:
getCount in class BaseLanguageModel
-
getCount
Description copied from class: BaseLanguageModel
Get the occurrence count for token.
- Specified by:
getCount in class BaseLanguageModel
-
getTotalTokenCount
public long getTotalTokenCount()
- Specified by:
getTotalTokenCount in class BaseLanguageModel
-
close
public void close()
-
toString
-