Lemmatization Strategy
This file defines the LemmatizationStrategy protocl class, which all lemmatization strategies should extend to be usable by the Simplemma library.
Classes
LemmatizationStrategy
Bases: Protocol
This protocol defines the interface for lemmatization strategies. Subclasses implementing this protocol
must provide an implementation for the get_lemma method.
Note
This protocol should be implemented by concrete lemmatization strategy classes.
Concrete implementations of this protocol should provide a concrete implementation for the get_lemma method.
Source code in simplemma/strategies/lemmatization_strategy.py
9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 | |
Functions
get_lemma(token, lang)
abstractmethod
This method receives a token and a language code and should return the lemma for the token in the specified language.
If the lemma is not found, it should return None.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
token |
str
|
The input token to lemmatize. |
required |
lang |
str
|
The language code for the token's language. |
required |
Returns:
| Type | Description |
|---|---|
Optional[str]
|
Optional[str]: The lemma for the token, or |
Raises:
| Type | Description |
|---|---|
NotImplementedError
|
If the method is not implemented by the subclass. |
Source code in simplemma/strategies/lemmatization_strategy.py
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 | |