Agris4Drupal (A4D) : an integrated environment to index agricultural online content in Drupal
Agris4Drupal (A4D) is a RDF process-oriented module developed for Drupal users to index the contents against the integrated ontologies, say FAO’s Geopolitical ontology and AGRIVOC manual and automatic, for any online text that is of interest.
A4D provides fundamental two types of service:
1) Basic service:
n Importing, browsing, and retrieval functionality for ontology files
n Importing namespace files to help mark and exporting customized format
2) Auto Indexing service:
Figure.1 workflow for auto Indexing
The concept listed/managed in the Generated RDF file in Figure 1 could be used to index the original page content. The service is similar to OpenCalais, but goes further:
n extract or mark the terms that can characterizes the text through certain algorithm or service, output as tags;
n look up those tags in the ontology reservoir, which may stores AGROVOC, GEOPOLITICAL and other related ontologies, to retrieve the formally defined concepts and terms in their respective knowledge domain;
n output those retrieved concepts and terms as RDF format and display them as a marker/tags of the text for indexing.
3. A4D box
A4D consists of 4 modules:
1) agris: a module that provides the basic operations through which users can:
n import ontology,
n create new repository,
n add new namespace,
n select the method of keyword extraction, and
n query on rdf data.
2) rdf4Agris: the origial RDF module from Drupal community with an essential modifiation to make it work for agris.
3) opencalais4Agris: a module that is used for keyword extraction by calling service from OpenCalais , rewritten based on the OpenCalais module from Drupal community
4) generateRdf: a module for user to generate RDF file with indexed concepts from the content or the URL they input.
4. Deliverables planned
Figure 2. deliverables planned
As described in Figure 2, the blocks w/ yellow background are the outputs need to present to user, and the clouds filled by light blue indicate the service that the module need to provide. A good reference could be found at http://data.nytimes.com/.