- Name: fastr
- Version: 2.04
- Release: 9mdv2008.1
- Epoch:
- Group: Sciences/Computer science
- License: GPL
- Url: http://www.limsi.fr/Individu/jacquemi/FASTR/
- Summary: A tool for automatic indexing
- Architecture: x86_64
- Size: 413896
- Distribution: Mandriva Linux
- Vendor: Mandriva
- Packager: Guillaume Rousse <guillomovitch@mandriva.org>
Description:
Fastr is a parser for term and variant recognition. Fastr take as input a
corpus and a list of terms and ouputs the indexed corpus in which terms and
variants are recognized.
Fastr can be used in two modes:
- controlled indexing: input consists of a corpus and a list of terms,
- free indexing: input only consists of a corpus, the list of terms is
automatically acquired from the corpus.
Fastr uses the following resources:
- the corpus and the list of terms are tagged by the TreeTagger:
http://www.ims.uni-stuttgart.de/Tools/DecisionTreeTagger.html
- if available, a list of morphological families and a list of semantic links
are used to calculate morphological and semantic variation. See sample files
- /usr/share/fastr/der-families-xx
- /usr/share/fastr/sem-classes-xx or ./lib/sem-links-xx
for the format (xx is the name of the language [en|fr]).
Perl modules are provided in order to generate these data from WordNet and
CELEXfor the English language.
The formalism of Fastr is close to PATR-II.
- OptFlags: -O2 -g -pipe -Wp,-D_FORTIFY_SOURCE=2 -fstack-protector --param=ssp-buffer-size=4 -fexceptions
- Cookie: seggie.mandriva.com 1197999442
- Buildhost: seggie.mandriva.com
Sources packages:
Other version of this rpm: