- Name: kytea
- Version: 0.4.7
- Release: 2.mga9
- Epoch:
- Group: Text tools
- License: ASL 2.0
- Url: http://www.phontron.com/kytea/
- Summary: Toolkit for analyzing texts in Japanese, Chinese, and other languages
- Architecture: aarch64
- Size: 128861461
- Distribution: Mageia
- Vendor: Mageia.Org
- Packager: umeabot <umeabot>
Description:
General toolkit for analyzing text, with a focus on Japanese, Chinese
and other languages requiring word or morpheme segmentation.
KyTea is able to perform the following types of processing:
- Word Segmentation: it can separate an unsegmented text stream into
appropriate units (words or morphemes).
- Tagging: it can estimate the tags for words such as POS (part of
speech) tags.
- Pronunciation: it has the ability to estimate the pronunciation
of unknown words.
While KyTea comes with a default model, if you have your own annotated
text, it provides a tool to train your own model.
- OptFlags: -O2 -g -pipe -Wformat -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -fstack-protector --param=ssp-buffer-size=4 -fasynchronous-unwind-tables
- Cookie: localhost 1647333102
- Buildhost: localhost
Sources packages:
Other version of this rpm: