Technology Choices and Tools

Notes for wiki editors:

- please only add, do not delete what others have written;

- you can add new rows by right clicking on a row/cell and select "Add row";

- you can add any comments by clicking on the "Add new comment" link at the bottom of the page.

Objective and scope

The objective of this document is to support decision making in the choice of tools for managing contents and making them available on the web. In the context of CIARD, the requirement of making contents also as accessible and re-usable as possible is given special consideration.

By “contents” in this document we mean anything that can be made available through the web or identified through metadata available on the web: texts, documents, multimedia, data of any kind.

By way of example, and because they represent the most frequent types of contents that agricultural Institutions want to make available on the web, this document will focus on the management of generic web contents (articles, web pages), news, events and documents, touching on often needed additional functionalities like community features and collaborative editing features.

As an example of specialized content management with special requirements in terms of metadata standards and management workflow, this document will use the case of document repository management. Tools for the specialized management of other types of contents (geographic data, statistical data) can be covered in the tools matrix at a later stage, but the criteria for the selection of this category of tools will in most cases coincide with the criteria for selecting a specialized document repository management tool versus a generic content management tool that can be customized for document management. 

The first table identifies, based on specific Institutional contexts and content management requirements,  the most important features that a tool must have and the generic typology of tool that may best meet those requirements.

The second table presents, starting from the features/requirements identified in the first table and also additional non-functional requirements, some tools that meet the single requirements. Documentation on these tools and their matching against  the requirements will be managed through a wiki, so that users/experts on specific tools (or their owners/authors) can contribute useful information.

A more narrative version of table 1 is being prepared here.

See definitions and classifications at the bottom of the tables for further clarifications.

Decision Tree

Table 1. "Decision tree": context > requirements > recommended tool typology

 
Context / needs / constraints
Notes
Requirements
Recommended typology
 
 
 
1. INSTITUTIONAL CONTEXT AND INFRASTRUCTURE
 
 
1.A
Key policies / strategies for information creation / sharing / dissemination
 
Open Access mandate
 For requirements, see here
 Not relevant
 
Copyright / licensing issues
 
- Ability to manage custom administrative metadata for the resources
- Ability to filter viewing / downloading according to administrative metadata
 
 
Preservation
 
- Ability to upload resources
- Ability to maintain stable URLs for resources
Preferably an on-premise tool
 
Any typology provided that it has these features
 
Partnerships and agreements: commitment to contribute data to specific service providers / engines / gateways
Examples: AGRIS, AgriFeeds, CABI Abstracts...
This requires the ability to produce data / metadata in the agreed format, with the agreed protocols.
Required features vary depending on the technical requirements of the service provider.
 
General requirements:
- Ability to define custom output formats
 
Possible specific requirements:
- Ability to implement an OAI data provider
- Ability to implement custom RSS feeds
 
 
 
 
 
 
 
 
 
 
 
1.B
IM/IT functional structure (roles and responsibilities, workflows)
 
Submission and editorial workflow
There may be different degrees of complexity in an editorial workflow, from permissions to subsequent steps in the approval chain to notifications
- Multi-author system
- Ability to set up granular permissions
- Ability to define an editorial workflow
 
 
IM staff / librarians responsible for the system
 
 
 
 
IT Unit responsible for the system
 
 
 
 
Communications staff responsible for the system staff
 
 
 
 
Research staff responsible for the system
 
 
 
 
 
 
 
 
 
Only internal service
 
 
 
 
Content selection criteria
 
 
 
 
 
 
 
 
 
 
 
 
 
 1.C
Human resources: IT & IM technical skills, Staff availability
 
IM staff / librarians (permanently) available to the project
This means the available staff have the competencies to either: a) evaluate a specialized system; b) elaborate a content strategy and a content model for a customizable CM tool.
This also means that the system should already be customized for IM functionalities familiar to IM experts / librarians
- Suitable content types (e.g. document, with suitable metadata) and workflow already in place
 
OR
 
- Customizable content types and workflow (defined through a GUI, not programming)
 - Specialized CMS (e.g. DMS)
 
- Pre-customized CMS with suitable features
 
IT staff (permanently) available to the project
This means that the tool can be highly customized by IT experts
- Customizable content types and workflow (even through programming)
 
 
Only content authors
This means there is no technical expertise and the system has to be installed and immediately used by content authors/managers
- Suitable content types and workflow already in place
- Good support tools
- Specialized CMS (e.g. DMS)
 
- Pre-customized CMS with suitable features
 
 
 
 
 
 
 
 
 
 
1.D
Funding / resources
 
Huge funding
 
- Customizable content types and workflow (even through programming)
Any typology
 
-- Time-bound, short-term
 
- Open source
- Good documentation
- Large community
- Free support tools
- Free help desk
 
 
-- Regular, long-term
 
 
 Any typology
 
Limited funding / resources
 
- Free
- Open source
- Good documentation
- Large community
- Free support tools
- Free help desk
- Free open source Specialized CMS (e.g. DMS)
- Free open source pre-customized CMS with suitable features
 
- Free service in the cloud
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1.E
Information Technology : Deployment environment
 
Highly performing servers
 
 
 
 
Ability to install and manage any web server
 
 
 
 Decent public server and ability to install a simple web server (e.g. Xampp)    
 
Good external hosting
 
 
 
 
Basic/poor external hosting
 
 
 
 
Operating system
 
 
 
 
Network connectivity
 
 
 
 
IT support
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2. SCOPE OF THE CONTENT MANAGEMENT TOOL
 
 
 2.A
Users / authors
 
 
 
 
- Good IT skills
 
 
 
 - Poor IT skills   
 -   
     
 2.B
Main function of the tool
 
 
 
 
- Managing contents
 
 
 
 
- Hosting a community
 
 
 
 
- Collaborative editing
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Functional requirements
 2.C
Content to be managed and content management needs
 
Only digital contents
 
 
 
 
Also printed material
 
 
 
 
Only new contents
 
 
 
 
Also contents already existing
 
 
 
 
- Digitalization
 
 
 
 
- Migration
 
 
 
 
Also contents from external sources
 
 
 
 
- Integrated information system
 
 
 
 - RSS aggregation   
 - XML imports   
 - Linked Data consumer   
     
 2.D
Type of contents to be managed
 
 
 
 
- Documents / DLIOs
 
 
 
 
- News / events
 
 
 
 
- Multimedia
 
 
 
 
 
 
 
 
     
     
 2.EContent organization   
 - Custom internal classifications   
 - Custom external vocabularies (e.g. Agrovoc)   
 - Authority lists: internal   
 - Authority lists: external   
     
 
Current number of objects
 
 
 
 
Estimated growth
 
 
 
 
 
 
 
 
 
Source / suppliers
 
 
 
 
Preservation
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 2.F
Content dissemination needs
 
Audience
 
 
 
 
- internal (researchers, experts, employees)
 
 
 
 
- external (specialists)
 
- Good search and browse capabilities
- Ability to use custom categorizations
- Ability to integrate scientific classifications / vocabularies
 
 
- external (farmers)
 
- Good search and browse capabilities
- Ability to create light pages
- Good performance
- Ability to deliver content through other media (e.g. phones)
 
     
 
 Accessibility
 
 
 
 - Web search / browse   
 
...
 
 
 
 
- General RSS feed
   
 
- Custom dynamic RSS feeds
 
 
 
 - Basic XML output (e.g. Dublin Core)   
 
- Custom XML exports (e.g. Agris AP, MODS...)
 
 
 
 - RDFa   
 - Custom RDF outputs (XML, Json...)   
 - Linked Data provider   
 - Email notifications   
 
 
 
 
 

 

Table 2. Software selection matrix: requirements > software tools

RequirementRecommended typologiesProductDetails  
Ability to provide access to full-text documents- Specialized CMS (e.g. DMS)
- Pre-customized generic CMS
DSpace

DSpace allows to do this by...

(or link to DSpace wiki page in the Tools Wiki)

 
  Drupal

 Drupal allows to do this thanks to... (or link)

 
   ... ... 
Ability to manage administrative metadata for the resources
 - Specialized CMS (e.g. DMS)
- Pre-customized generic CMS
DSpace  
  Drupal  
   ...  
     
     
     
     
     
     
     
     
     
     
     
     
     

Notes on the structure

Since the objective of this exercise is to provide a practical instrument for decision making, the structure wants to be as similar to a decision tree as possible.

Being practical, this structure may lack systematicity. Some requirements / constraints apply at different levels, e.g. the requirement to be able to contribute data to a global database may be a (binding) Institutional requirement subsequent to a partnership agreement, and/or a functional requirement defined only at the level of a project.

The definitions and classifications below are provided in order to give some coherence to the potentially infinite ways to classify/group different types of requirements, of Institutional contexts and of tools. 

Definitions and classifications

Definitions 

CMS, WCMS, DMS, specialized CMS

a)  CMS vs. Web Content Management System (WCMS):

while it is quite common to consider a CMS an equivalent of a WCMS, i.e. a tool to manage a website, the definitions found in Wikipedia show that there is an agreement in distinguishing among them (http://en.wikipedia.org/wiki/Content_management_system): a CMS manages virtually anything: "In a CMS, data can be defined as nearly anything: documents, movies, pictures, phone numbers, scientific data, and so forth. CMSs are frequently used for storing, controlling, revising, semantically enriching, and publishing documentation", while a WCMS "is a software system which provides website authoring and administration tools designed to allow users with little knowledge of web programming languages or markup languages to create and manage the site's content with relative ease".

In this document, we will refer to the CMS typology as a customizable generic CM tool, while WCMS are only considered a specialized tool for managing websites.

In case a CMS is going to be applied to the management of all Institutional contents and workflows, the CMS should also have the features of an Enterprise Content Management (ECM) tool.

b) Specialized CMS vs. generic CMS:

- specialized CMSs include document management systems (DMS), library management systems, project management systems, multimedia management systems, conference management systems and in general any management system that has a pre-defined content model and workflow optimized for the specific type of content managed: these tools have the advantage of being highly customized for specific needs and the disadvantage of usually not allowing to manage different types of contents in an integrated information system;

- generic CMSs (see definition above, point a) are generic CM tools that allow to define the content model and the workflows for potentially any kind of contents: these tools have the advantage of allowing to manage different types of contents in an integrated information system and the disadvantage of requiring some (or much, it depends on their flexibility) customization for highly specialized tasks;

- a third category could be considered: pre-customized CMS installations, which includes generic CMS platforms with custom installations that have a pre-defined content model and workflow optimized for one or more specific types of content, still allowing for the management of other content types; the level of specialization in these tools may of course not be as high as in the specialized CM tools mentioned above.



Content management

The process of managing your content ought to include structuring, organizing, searching on, filtering, and easily modifying your content. It ought to include being able to quickly define new types of content. It ought to facilitate establishing meaningful relationships between disparate pieces of content. It ought to make your content more useful simply by virtue of the content being in the system.” (Croft 2006

But most of all: when looking for a CMS, look for one that provides you a platform to do whatever you want. Don’t look for one that anticipates what you want and then makes you do it. Your CMS should be a platform upon which you can easily build and manage structured data of any type. You should never have to wedge a photo gallery into a blog entry form — you should be able to simply create a photo gallery object and associate with a particular blog entry. If your CMS can’t do this — or doesn’t at least provide enough of an extensible platform that you could add this kind of functionality with relative ease — then your CMS isn’t really managing your content at all. Instead, it’s getting in the way of you managing it.” (Croft 2006).

“a CMS is a bunch of powerful tools that add semantic richness to content, and that require customization to fit a specific project’s needs” (Kahn 2010

Resources 

...

“IM” and “IM staff”

...

  

Classification of tools

based on their MAIN scope - not secondary features - and their "packaging".

"Packaging": a) on-premise software / installation package: installed on the organization’s own network or by the organization on any hosting platform; b) Software as a Service (SaaS) / in the cloud: web access to the service managing the contents, which is made available and controlled by a 3rd-party provider.

- Specialized CMS

(document management systems (DM), library management systems, project management systems, multimedia management systems, conference management systems etc.)

---- on premise (DSpace, Fedora Commons, Jira, CiviCRM...)

---- in the cloud (Google Docs, Dropbox, ... [ask Meena?], Flickr...)

- Generic customizable CMS (or simply CMS)

---- on premise (Drupal, Wordpress, Joomla...)

---- in the cloud (Google Sites, Wordpress.com, ...)

- Pre-customized generic CMS

---- on premise (AgriDrupal, SADC AIMS, OneVision...)

---- in the cloud (AGRIS Metamaker, RSS generators...)

- Ad-hoc solutions

either programmed from scratch or based on frameworks (applications developed with Django, PHPnuke etc.) 

This classification has purely practical purposes, many tools may fall in different categories according to different perspectives and different use cases.  For instance, is Wordpress a generic CMS, a WCMS or a specialized blogging system with some additional features? Some tools that are usually considered WCMS are actually highly customizable generic CMSs. And so on.

 

Bibliography

[Kahn 2010] Jonathan Kahn, 2010, Strategic Content Management, http://www.alistapart.com/articles/strategic-content-management/ 


[Croft 2006] Jeff Croft, 2006, On personal content management, http://jeffcroft.com/blog/2006/sep/20/personal-content-management/