Description:agTagger is a keyword extractor that uses the AGROVOC thesaurus to extract keywords from the content of some URLs.
Abstract:At a high level of abstraction, agTagger (the piece of software behind the AgTagger module: ”AgTagger” and ”AgroTagger” can be used as synonyms in this context) is a keyword extractor that uses the AGROVOC thesaurus to extract keywords from the content of some URLs. Since AGROVOC is published as Linked Open Data, the agTagger can do more than extracting keywords, it can extract AGROVOC URIs. The agTagger is based on MAUI, a piece of software that automatically identifies main topics in text documents, using two different algorithms: the keyphrase extraction algorithm KEA, and the machine learning toolkit WEKA. To be used in the agTagger, MAUI was trained to work with AGROVOC (in English).
The application is available for download at: https://github.com/agrisfao/agrotagger/. It is a command line application, entirely based on JAVA, and provided with some bash scripts that can be executed in a Linux environment.