LatinLanguageAnalyzer class Null safety
A TextAnalyzer implementation for Latin languages analysis.
Exposes a const default generative constructor.
- Implemented types
- Implementers
Constructors
- LatinLanguageAnalyzer()
-
Initializes a const LatinLanguageAnalyzer.
const
Properties
- characterFilter → AsyncTermModifier
-
A function that filters out unwanted characters or replaces them with
other characters.
read-only, inherited
- hashCode → int
-
The hash code for this object.
read-only, inherited
- jsonTokenizer → JsonTokenizer
-
Extracts tokens from the fields in a JSON document for use in full-text
search queries and indexes.
read-only, override
- nGrammer → NGrammer
-
A language-specific function that generates n-grams from text.
read-only, override
- paragraphSplitter → TextSplitter
-
Returns a list of paragraphs from text.
read-only, override
- phraseSplitter → PhraseSplitter
-
Returns a list of keywords from text.
read-only, override
- runtimeType → Type
-
A representation of the runtime type of the object.
read-only, inherited
- sentenceSplitter → TextSplitter
-
Returns a list of sentences from text.
read-only, override
- stemmer → TermModifier
-
Language-specific function that returns the stem of a term.
read-only, inherited
- syllableCounter → SyllableCounter
-
Returns the number of syllables in a string after stripping out all
white-space and punctuation.
read-only, override
-
termExceptions
→ Map<
String, String> -
A map of term exceptions
read-only, inherited
- termExpander → TermExpander?
-
Expands text to a collection of related Strings, e.g. synonyms,
abbreviations or spelling suggestions.
read-only, inherited
- termFilter → AsyncTermModifier
-
A filter function that returns a modified term or null.
read-only, override
- termSplitter → TextSplitter
-
Returns a list of words from text.
read-only, override
- tokenizer → Tokenizer
-
Extracts one or more tokens from text for use in full-text search queries
and indexes.
read-only, override
Methods
-
asNumber(
String term) → num? -
Attempts to parse the term to a number. Returns null if the term does not
represent a number, amount or percentage
inherited
-
isStopword(
String term) → bool -
Returns true if the
termis a stopword excluded from tokenization.inherited -
noSuchMethod(
Invocation invocation) → dynamic -
Invoked when a non-existent method or property is accessed.
inherited
-
toString(
) → String -
A string representation of this object.
inherited
Operators
-
operator ==(
Object other) → bool -
The equality operator.
inherited
Static Methods
-
isHashtag(
String term) → bool - Returns true if the String starts with "@" or "#" followed by one or more word-chacters only.
-
isHyphenated(
String term) → bool -
Returns true if the
termcontains one or more hyphens. -
isNumber(
String term) → bool - Returns true if the String contains digits and delimiters (periods or commas) where delimiters are not at the start or end of the String.
-
isNumberOrAmount(
String term) → bool - Returns true if the String contains digits and delimiters (periods or commas) where delimiters are not at the start or end of the String.
-
replaceHyphens(
String term, [String replace = ' ']) → String -
Replaces all hyphenations with
replace.
Constants
- kHypenations → const String
-
Selector for single hyphen characters preceded and followed by a word
boundary.
r'(?<=\b)-{1}(?=\b)' - kSentenceDelimiter → const String
-
The delimiter inserted at sentence endings to allow splitting of the text
into sentences.
r'%~%' - rBracketsAndCarets → const String
-
Matches all brackets and carets.
r'[\[\]\(\)\{\}\<\>]' - rEnclosingQuotes → const String
-
Selector for enclosing quote marks.
'(?<=^|$rNonWordChars)$rQuotes+|$rQuotes+(?=$rEndString|$rNonWordChars)' - rEndString → const String
-
Selects the end of a string.
r'\Z' - rHashtag → const String
-
Selects text that starts with "@" or "#" followed by one or more
word-chacters only.
r'(?<=^)[#@]{1}(\w)+(?=$)' -
Selects text that starts with "@" or "#" and is preceded or followed by
non-word characters or the start/end of the String.
r'(?<=^|\W)[#@]{1}(\w)+(?=\W|$)' - rLineEndingSelector → const String
-
Matches all line endings.
'[\u000A\u000B\u000C\u000D]+' - rNonWordChars → const String
-
Matches al characters except:
r"[^a-zA-Z0-9À-öø-ÿ¥Œ€@™#-\&_'-]" - rNumber → const String
-
Matches a String that includes digits and delimiters (periods and commas)
where delimiters are not at the start or end of the string
r'(?<=^)(\d|((?<=\d)[,.]{1}(?=\d)))+(?=$)' - rNumbers → const String
-
Matches all numbers, including those delimited with periods and or commas.
r'(?<=^|\W)(\d|((?<=\d)[,.]{1}(?=\d)))+(?=$|\W)' - rNumbersAndAmounts → const String
-
Matches all numbers and amounts, including:
r'(?<=^|\s)([$€£¥₣₹ك]{1}|[A-Z]{3})?(\d+|' r'((?<=\d)[,.]{1}(?=\d)))+([%]{1})?(?=$|\W)' - rPhraseDelimiterSelector → const String
-
Matches strings where text is split for keywording:
r'[\r\n]+|[\!:;,\-—."\' r"'}\)\]]+\W+|\W+[\!:;,\-—." r'"\' r"'}\)\]\(\[\{]+|\W(?=$)|(?<=^)\W" - rQuotes → const String
-
Selector for all single or double quotation marks and apostrophes.
'[\'"“”„‟’‘‛]' - rSentenceEndingSelector → const String
-
Matches all sentence endings.
'(?<=$rWordChars|\\s)(\\. )(?=([^a-z])|\\s+|\$)|(\\.)(?=\$)|' '(?<=[^([{])([?!])(?=([^)]}])|\\s+|\$)' - rWordChars → const String
-
Matches characters used to write words, including:
r"[a-zA-Z0-9À-öø-ÿ¥Œ€@™#-\&_'-]"