Class UnknownDictionary
- java.lang.Object
-
- org.apache.lucene.analysis.ja.dict.BinaryDictionary
-
- org.apache.lucene.analysis.ja.dict.UnknownDictionary
-
- All Implemented Interfaces:
Dictionary
public final class UnknownDictionary extends BinaryDictionary
Dictionary for unknown-word handling.
-
-
Nested Class Summary
Nested Classes
Modifier and Type, Class - Description
private static class UnknownDictionary.SingletonHolder
Nested classes/interfaces inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
BinaryDictionary.ResourceScheme
-
-
Field Summary
Fields
Modifier and Type, Field - Description
private CharacterDefinition characterDefinition
Fields inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
DICT_FILENAME_SUFFIX, DICT_HEADER, HAS_BASEFORM, HAS_PRONUNCIATION, HAS_READING, POSDICT_FILENAME_SUFFIX, POSDICT_HEADER, TARGETMAP_FILENAME_SUFFIX, TARGETMAP_HEADER, VERSION
-
Fields inherited from interface org.apache.lucene.analysis.ja.dict.Dictionary
INTERNAL_SEPARATOR
-
-
Constructor Summary
Constructors
Modifier, Constructor - Description
private UnknownDictionary()
UnknownDictionary(java.net.URL targetMapUrl, java.net.URL posDictUrl, java.net.URL dictUrl) - Create an UnknownDictionary from an external resource URL (e.g. from the classpath with ClassLoader.getResource(String)).
UnknownDictionary(java.nio.file.Path targetMapFile, java.nio.file.Path posDictFile, java.nio.file.Path dictFile) - Create an UnknownDictionary from an external resource path.
UnknownDictionary(BinaryDictionary.ResourceScheme scheme, java.lang.String path) - Deprecated, for removal: This API element is subject to removal in a future version. Replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources.
-
Method Summary
Modifier and Type, Method - Description
CharacterDefinition getCharacterDefinition()
private static java.io.InputStream getClassResource(java.lang.String suffix)
java.lang.String getInflectionForm(int wordId) - Get inflection form of tokens
java.lang.String getInflectionType(int wordId) - Get inflection type of tokens
static UnknownDictionary getInstance()
java.lang.String getReading(int wordId, char[] surface, int off, int len) - Get reading of tokens
int lookup(char[] text, int offset, int len)
Methods inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getBaseForm, getLeftId, getPartOfSpeech, getPronunciation, getResource, getRightId, getWordCost, lookupWordIds
-
-
-
-
Field Detail
-
characterDefinition
private final CharacterDefinition characterDefinition
-
-
Constructor Detail
-
UnknownDictionary
@Deprecated(forRemoval=true, since="9.1")
public UnknownDictionary(BinaryDictionary.ResourceScheme scheme, java.lang.String path) throws java.io.IOException
Deprecated, for removal: This API element is subject to removal in a future version. Replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources.
Parameters:
scheme - scheme for loading resources (FILE or CLASSPATH).
path - where to load resources from; a path, including the file base name without extension; this is used to match multiple files with the same base name.
Throws:
java.io.IOException
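The deprecated constructor takes a single base path and matches several files that share it, while its replacements take one location per resource. The following is a minimal sketch of how a base path might be expanded into three explicit paths when migrating. DictionaryPaths and its suffix strings are hypothetical placeholders, not part of Lucene; real code would presumably use the inherited constants TARGETMAP_FILENAME_SUFFIX, POSDICT_FILENAME_SUFFIX and DICT_FILENAME_SUFFIX instead of the literal values shown here.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical helper for illustration; not part of Lucene. */
final class DictionaryPaths {
    // Placeholder suffixes (assumption). Substitute the BinaryDictionary
    // *_FILENAME_SUFFIX constants when working against the real library.
    static final String TARGET_MAP_SUFFIX = ".targetmap";
    static final String POS_DICT_SUFFIX = ".posdict";
    static final String DICT_SUFFIX = ".dict";

    /** Expands a base path (directory plus file base name, no extension) into three file paths. */
    static Path[] expand(String basePath) {
        return new Path[] {
            Paths.get(basePath + TARGET_MAP_SUFFIX),
            Paths.get(basePath + POS_DICT_SUFFIX),
            Paths.get(basePath + DICT_SUFFIX)
        };
    }

    public static void main(String[] args) {
        Path[] files = expand("dict/UnknownDictionary");
        for (Path file : files) {
            System.out.println(file);
        }
        // With Lucene on the classpath, the paths would be passed in this order:
        // new UnknownDictionary(files[0], files[1], files[2]);
    }
}
```

The order of the array mirrors the parameter order of UnknownDictionary(Path, Path, Path): target map, POS dictionary, dictionary entries.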
-
UnknownDictionary
public UnknownDictionary(java.nio.file.Path targetMapFile, java.nio.file.Path posDictFile, java.nio.file.Path dictFile) throws java.io.IOException
Create an UnknownDictionary from an external resource path.
Parameters:
targetMapFile - where to load target map resource
posDictFile - where to load POS dictionary resource
dictFile - where to load dictionary entries resource
Throws:
java.io.IOException - if a resource was not found or is broken
-
UnknownDictionary
public UnknownDictionary(java.net.URL targetMapUrl, java.net.URL posDictUrl, java.net.URL dictUrl) throws java.io.IOException
Create an UnknownDictionary from an external resource URL (e.g. from the classpath with ClassLoader.getResource(String)).
Parameters:
targetMapUrl - where to load target map resource
posDictUrl - where to load POS dictionary resource
dictUrl - where to load dictionary entries resource
Throws:
java.io.IOException - if a resource was not found or is broken
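ClassLoader.getResource(String) returns null rather than throwing when a resource is missing, so it can help to check each URL before passing it to the constructor. The following is a small sketch of such a check. ResourceUrls and the resource names are hypothetical and used only for illustration.

```java
import java.io.FileNotFoundException;
import java.io.IOException;
import java.net.URL;

/** Hypothetical helper for illustration; not part of Lucene. */
final class ResourceUrls {
    /** Returns true if the named resource is visible to the given class loader. */
    static boolean isPresent(ClassLoader loader, String name) {
        return loader.getResource(name) != null;
    }

    /** Resolves a classpath resource to a URL, failing early if it is missing. */
    static URL require(ClassLoader loader, String name) throws IOException {
        URL url = loader.getResource(name);
        if (url == null) {
            throw new FileNotFoundException("Resource not found on classpath: " + name);
        }
        return url;
    }

    public static void main(String[] args) {
        ClassLoader loader = ClassLoader.getSystemClassLoader();
        // "my/dict/targetMap.dat" is a made-up resource name.
        System.out.println(isPresent(loader, "my/dict/targetMap.dat"));
        // With Lucene and real resources on the classpath, usage might look like:
        // new UnknownDictionary(require(loader, targetMapName),
        //                       require(loader, posDictName),
        //                       require(loader, dictName));
    }
}
```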
-
UnknownDictionary
private UnknownDictionary() throws java.io.IOException
Throws:
java.io.IOException
-
-
Method Detail
-
getClassResource
private static java.io.InputStream getClassResource(java.lang.String suffix) throws java.io.IOException
Throws:
java.io.IOException
-
lookup
public int lookup(char[] text, int offset, int len)
-
getCharacterDefinition
public CharacterDefinition getCharacterDefinition()
-
getReading
public java.lang.String getReading(int wordId, char[] surface, int off, int len)
Description copied from interface: Dictionary
Get reading of tokens
Specified by:
getReading in interface Dictionary
Overrides:
getReading in class BinaryDictionary
Parameters:
wordId - word ID of token
Returns:
Reading of the token
-
getInflectionType
public java.lang.String getInflectionType(int wordId)
Description copied from interface: Dictionary
Get inflection type of tokens
Specified by:
getInflectionType in interface Dictionary
Overrides:
getInflectionType in class BinaryDictionary
Parameters:
wordId - word ID of token
Returns:
inflection type, or null
-
getInflectionForm
public java.lang.String getInflectionForm(int wordId)
Description copied from interface: Dictionary
Get inflection form of tokens
Specified by:
getInflectionForm in interface Dictionary
Overrides:
getInflectionForm in class BinaryDictionary
Parameters:
wordId - word ID of token
Returns:
inflection form, or null
-
getInstance
public static UnknownDictionary getInstance()
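The combination of a private no-argument constructor that declares IOException, a private static nested class named SingletonHolder, and a static getInstance() that declares no checked exception suggests the initialization-on-demand holder idiom. The sketch below illustrates that idiom in isolation. LazyDictionary is a hypothetical stand-in, and wrapping the IOException in an UncheckedIOException is an assumption about one plausible way to surface load failures, not a statement of what this class does.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical stand-in for a dictionary that is expensive to load. */
final class LazyDictionary {
    /** Counts constructor calls so the demo can show that loading happens once. */
    static final AtomicInteger LOADS = new AtomicInteger();

    private LazyDictionary() throws IOException {
        LOADS.incrementAndGet(); // a real dictionary would read its resources here
    }

    /** Initialized by the JVM on first access only, with thread safety guaranteed. */
    private static final class SingletonHolder {
        static final LazyDictionary INSTANCE;

        static {
            try {
                INSTANCE = new LazyDictionary();
            } catch (IOException e) {
                // Assumption: convert the checked exception so getInstance() stays unchecked.
                throw new UncheckedIOException("Cannot load dictionary", e);
            }
        }
    }

    static LazyDictionary getInstance() {
        return SingletonHolder.INSTANCE;
    }

    public static void main(String[] args) {
        LazyDictionary first = LazyDictionary.getInstance();
        LazyDictionary second = LazyDictionary.getInstance();
        System.out.println(first == second);
        System.out.println(LOADS.get());
    }
}
```

Because the holder class is not initialized until getInstance() is first called, callers that never need the shared instance pay no loading cost, and no explicit synchronization is required.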
-
-