- Name: perl-Text-DeDuper
- Version: 1.01
- Release: 1
- Epoch: 0
- Group: Development/Languages/Perl
- License: GPL v1+ or Artistic
- Url:
- Summary: Text::DeDuper - near duplicates detection module
- Architecture: noarch
- Size: 14453
- Distribution: PLD
- Vendor:
- Packager:
Description:
This module uses the resemblance measure as proposed by Andrei Z. Broder at al
(http://www.ra.ethz.ch/CDstore/www6/Technical/Paper205/Paper205.html) to detect
similar (near-duplicate) documents based on their text.
Note of caution: The module only works correctly with languages where texts can
be tokenised to words by detecting alphabetical characters sequences. Therefore
it might not provide very good results for e.g. Chinese.
- OptFlags: -O2 -fno-strict-aliasing -fwrapv -march=x86-64
- Cookie:
- Buildhost: ymir-builder
Sources packages:
Other version of this rpm: