Chapter 12
Comparative Analysis of SoftwareEvaluation
Introduction
The evaluation of selected candidate software was carried out on different evaluation cri-
teria. This chapter deals with the comparative analysis of each software against the set of
evaluation criteria under broad and narrow subject categories.
Though the objective of the present study is to not compare one software with another
and decide the best among all, it is necessary to understand what are the general features
supported by each software under broad and narrow subject categories and will help users
to glance through different features in a tabular form. Thisanalysis may be helpful to
the Library Staff to know which area is more important and which software can be chosen
with a given set of environment. In the further sections of this chapter all the narrow criteria
which are defined for evaluation of OSS-DL are compared as a comparative table under the
broad subject categories.
12.1 Content Acquisition
Table 12.1 lists out all content acquisition related criteria in a tabular form. It is found
that all selected software support to upload digital documents through Web User Interface
except Fedora and Greenstone. Fedora & Greenstone has separate Librarian’s Interface
360
CHAPTER 12. COMPARATIVE ANALYSIS 361
through which documents can be uploaded into the repository. Duplicate checking of doc-
uments is supported only by EPrints and Greenstone. No othersoftware support duplicate
checking. One of the important feature of CDS-Invenio is it has capability to create bibli-
ographic database as well as has capability to manage digital documents.
EPrints and Fedora has unique capability to upload digital documents from existing URL.
Greenstone software also has this feature but this feature does not work properly with
Greenstone. Importing is handled by all software and all software support to import data if
it is available in XML format. All software have capability to export data into either XML,
DC format, METS, MARC format. Automatic extraction of keywords is at present handled
by only CDS-Invenio. No other software support to extract keywords from metadata or full
texts uploaded into the repository. CDS-Invenio, DoKS & Fedora supports to add different
versions of digital documents which is one of the important feature of digital library. Weed
out policy is at present supported by CDS-Invenio, DSpace, EPrints, Fedora & SOPS.
12.2 Content Management
Table 12.2 lists all evaluation criteria related to contentmanagement. CDS-Inveion &
DSpace software provide a good workflow management. CDS-Invenio can handle com-
plex workflow processess. EPrints, MyCoRe software to some extent provide workflow
management. DoKS and SOPS allow to create only submitters but does not support any
other work flow management such as creating reviewers, metadata reviwers, etc. Fedora &
Greenstone does not have any work flow management. Generating authority files is sup-
ported at present only by DSpace, DoKS, Greenstone & MyCoRe.DSpace & CDS-Invenio
has capabilities to show strength of each collection on web site.
12.3 Metadata Submission and Support
Table 12.3 lists all evaluation criteria related to metadata submission & support in a tabular
form. All the software have capability to upload descriptive, administrative & structural
metadata except SOPS. All software support UTF-8 hence it ispossible to add metadata
in any language. All software have capability to support Dublin Core metadata schema
except SOPS. CDS-Inveion, EPrints, Fedora & Greenstone supports to add metadata other
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
62
Table 12.1: Content Acquisition
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Software sup-ports to allowto add biblio-graphic dataof books, re-ports, journalsas well as allother types ofdigital mediasuch as text,audio, video,image, dataset files etc.which is oneof the uniquefeature ofCDS-Invenio.
DSpace sup-ports to allowto add all typesof digital docu-ments rangingfrom books,reports, journalarticles, lecturenotes, tech-nical reports,thesis, images,audio/videofiles to datasets.
DoKS supportsto speciallyadd electronicthesis of a stu-dent as well asit supports toadd CurriculumVitae, workexperience,referenceinformation,annotationdetails, Edu-cation etc. ofevery studentalong withthe electronicthesis.
EPrints sup-ports to addarticles, booksections,monographs,conferenceor workshopitems, books,thesis, patent,artefact,show/exhibition,composition,performance,image, video,audio, dataset,experiment,teaching re-source.
Fedora sup-ports to uploadconventionaldigital ob-jects (such asbooks, othertext docu-ments, learn-ing objects,geospatialdata, im-ages, maps,computerprograms),Complex,Compoundas well asdynamic ob-jects (suchas videos,numerical datasets, their as-sociated codebooks andaudios as well.
Greenstonesupportsto add anytype of doc-uments suchas from books,reports, jour-nal/newspaperarticles, lecturenotes, tech-nical reports,thesis, images,audio/video,visual artfiles as wellas any otherdocuments.
MyCoRe sup-ports to addvariety of doc-ument suchas article,abstract, bibli-ography, book,dataset, pro-gram, thesis,examination,journal, learn-ing material,manual, mu-sic, notes,person infor-mation, picture,preprint, pre-sentation,research pa-per, service,statistical data,video etc.
SOPS sup-ports to addonly electronicpublicationssuch as pa-pers, keynotepapers, shortpapers, ab-stracts, postersetc.
CDS-Invenioallow to de-fine the do-main/scope ofthe digital li-brary for whomit is intendedfor.
DSpace isdesigned forcreating insti-tutional repos-itory of anyorganization.
DoKS is spe-cially meantfor organisingelectronic the-sis/dissertationsfrom any uni-versity.
Software hascapability todefine thedomain of thedigital library.
Fedora doesnot supportto define do-main/scopeof the digitallibrary.
Greenstonehas capabilityto define do-main/scope ofdigital libraryfor whom it isintended for.
MyCoRe hasbeen espe-cially designedfor creatinginstitutionalcontent reposi-tory or archivalsolutions.
SOPS is spe-cially meantfor organis-ing electronicpublications.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
63
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
CDS-Inveniocan be usedto organiseany type ofdigital docu-ments and itis designedfor all types oflibraries.
Software is in-tended for thefaculty mem-bers of anyorganization.
DoKS is in-tended foruniversitystudents for or-ganising theirthesis and theirCV’s online.
EPrints is basi-cally intendedfor creatingpreprint/postprintarchives butthe currentversion sup-ports addingother types ofdocuments aswell such asaudio/videocollections.
Fedora doesnot supportto define do-main/scopeof the digitallibrary.
Software isintended tocreate digi-tal library ofany type ofdocuments.
MyCoRe isdesignedfor authors,editors, read-ers where itsupports todefine differ-ent workflowfor differentgroups andallow endusers to uploaddocumentsdirectly.
It is intendedfor use of con-ference organ-isers and or-ganising elec-tronic publica-tions.
Software hasthe ability toaccept doc-uments inall desiredformats. Doc-uments canconsist ofmultiple files.Software givesthe ability tosubmit a docu-ment with morethan one file.
DSpace bydefault sup-ports to uploadPDF, AIFF, au-dio/basic, BMP,CSS, FMP3,GIF, HTML,image/png,JPEG, La-tex, MARC,Mahematica,Microsoft Ex-cel, Microsoftpowerpoint,MicrosoftProject, Mi-crosoft visio,Microsoft word,etc.
DoKS supportsto add any typeof file formatswhich are ofMIME type.It allow endusers to defineany MIME typeformats andcan uploadany MIMEtype documentformats.
All text for-mats are bydefault sup-ported by thesoftware, butsoftware alsosupports toupload JPEG,PNG, GIF,BMP, TIF fileformats, AdobePostscript,Adobe PDF,HTML file for-mats, MP3,AVI, MPEGfile formats.Software alsosupports to up-load MicrosoftPower Pint andWord formats.
Fedora sup-ports to uploadthe followingMIME typesof file for-mats: text/xml,text/plain,text/html,text/html+xml,text/svg+xml,text/rft, im-age/jpeg,image/jg2,image/gif,image/bmp,image/png,image/tiff,audio/mpeg,audio/x-aiff,audio/x-wavetc.
Greenstonesupports toupload severaltypes of digitaldocumentsand supportedplugins areavailable inGreenstonesuch as zip,gap, text, html,text, pdf, rff,word, ps, im-age, isis, nul,metadata xml,mp3, opendocument,lom, bibtextetc.
MyCoRe sup-ports to uploadall types of fileformats. Bydefault sys-tem supportscollections,dataset, event,image, ser-vice, software,sound, text,unknown,video, etc.
Though SOPSis meant foradding onlyelectronic pub-lications it alsosupports toupload otherfile formats.
Software doesnot carryout duplicatechecking whileuploading thedocuments intothe repository.
Software doesnot carryout duplicatechecking whileuploading thedocuments intothe repository.
Software doesnot carry outduplicationchecking whileuploading thedocuments intothe repository.
EPrints carryout duplicatechecking onthe basis ofTitle.
Fedora carryout duplicatechecking viaits PID numberi.e. PersistentIdentificationNumber.
Greenstonesupports tocarry out dupli-cate checkingwhile upload-ing the data.
MyCoRe doesnot carryout duplicatechecking.
SOPS does notcarry out dupli-cate checking.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
64
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
CDS-Inveniosupports toupload digitalobjects in nor-mal formats aswell as com-pressed digitalformats suchas zip, tgz, gz,gzip etc.
DSpace sup-ports to uploaddigital objectswith normalsize as well ascompresseddigital objects.
DoKS supportsto upload digi-tal objects withnormal size aswell as com-pressed digitalformats.
EPrints sup-ports to uploadnormal digitalobjects as wellas it supportsto upload filesending with.zip and .tar.gz.
Fedora sup-ports to uploaddigital objectswith normalsize as well ascompresseddigital filessuch as zip,tar, gz, gtar,gzip.
Greenstonesupports toupload digitalobjects withnormal size aswell as com-pressed digitalfile formatssuch as zip,tar, gz, gzipetc.
MyCoRe sup-ports to uploaddigital objectswith normalsize as well ascompressedfile formats.
SOPS sup-ports to uploaddigital objectswith normalsize as well ascompressedfile formats.
Software doesnot support toupload digitalobjects fromexisting URL.
Software doesnot support toupload digitalobjects fromexisting URL.
Software doesnot support toupload digitalojbects fromexisisting URL.
EPrints sup-ports to uploaddigital objectsfrom existingURL.
Fedora sup-ports to uploaddigital docu-ments fromexisting URLas well as itcan redirectto existingURL insteadof actuallydownloadingcontents intothe repository.
Greenstonehas the fea-ture to uploaddigital docu-ments fromexisting URLbut this featureis not workingproperly.
MyCoRe doesnot support up-loading digitalobjects fromexisting URL.
SOPS doesnot support up-loading digitalobjects fromexisting URL.
CDS-Inveniosupports dis-tributed/ com-munity basedacquisition.
DSpace sup-ports dis-tributed/communitybased acquisi-tion.
DoKS supportsdistributed/communitybased acquisi-tion.
EPrints sup-ports dis-tributed/communitybased acquisi-tion.
Fedora doesnot supportdistributed/communitybased acquisi-tion.
ThoughGreeenstonesupports dis-tributed basedacquisitionthis featuredoes not workproperly.
Softwaresupportsdistributedacquisition.
SOPS sup-ports dis-tributed/ com-munity basedacquisition.
CDS-Inveniosupports meta-data/contentimport. Thedata has tobe in MARCXML to MARC 21 or DublinCore format.The metadataand full textshave to beseparatelyimported.
DSpace sup-ports con-tent/metadataimport. Thedata has to bein the DublinCore format.
DoKS sup-ports to importmetadata ifit is availableXML format.DoKS supports to importall folders,users, filefor-mats, scrips,and recordscreated inDoKS.
Software sup-ports to importmetadata ofitems via DOII(via CrossRef),XML, PubMEDID, PubMEDXML formats.
Fedora sup-ports import ofdigital objectsif data is avail-able in XMLformat.
Greenstonesupports im-port of digitalobjects if datais available inXML format.
MyCoRe sup-ports to importdata and datahas to be in theXML format.
SOPS sup-ports to importdata if it isavailable inCSV file formatsuch as Excelor Access.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
65
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Software sup-ports importingof bulk meta-data/content ofdigital objects.
Software sup-ports importingof bulk meta-data/contentsof digital ob-jects.
DoKS supportsto import bulkdigital objects.
EPrints sup-ports to importbulk digitalobjects.
Fedora sup-ports to importbulk digitalobjects.
Greenstonesupports toimport bulkdigital objects.
Software sup-ports importingof bulk digitalobjects.
SOPS sup-ports importingof bulk digitalobjects.
Software sup-ports to extractautomaticextraction ofkeywords fromthe full text ofthe documentsbased on thefrequency ofthe specificterms.
Software doesnot extract au-tomatic meta-data of digitaldocuments thatare imported.
Software doesnot extract au-tomatic meta-data of digitaldocuments thatare imported.
Software doesnot extractautomaticmetadata ofdigital docu-ments that areimported intothe repository.
Fedora au-tomaticallyextracts title ofthe documentand identifier ofthe document.
Greenstoneautomaticallrecognisesdate, encod-ing, file format,file size, lan-guage, numberof pages andsource detailsfor each object.
Software doesnot exatractautomaticmetadata ofdigital docu-ments that areimported.
SOPS doesnot importautomaticmetadata ofdigital docu-ments that areadded into therepository.
CDS-Inveniosupports meta-data exportin BibTex,Dublin Core,End Note,HTML, brief,HTML detailed,MARC, MAR-CXML, NLM,photocaptionsonly, portfolioformat.
Software sup-ports to exportdata into dublincore andMETS format.Generally datais exported asXML files.
DoKS supportsto export datainto XML for-mat.
EPrints sup-ports to exportmetadata intovariety of for-mats such asASCII Cita-tion, BibTex,DIDL, EP3XML, EndNote,METS, MODSetc.
Fedora sup-ports to exportcontents intoFOXML andMETS format.
Greenstonesupports to ex-port metadatainto METS,DSpace, MAR-CXML and GAformat.
MyCoRe sup-ports to exportmetadata inXML format.
SOPS sup-ports to exportdata intovariety of for-mats such asHaward StyleReference,BibTex, End-Note, ProCite,ReferenceManager, XMLetc.
Software sup-ports to exportbulk metadataof digital ob-jects.
Software sup-ports to exportbulk metadataof digital ob-jects.
Software sup-ports to exportbulk metadataof digital ob-jects.
Software sup-ports to exportbulk metadataof digital ob-jects.
Fedora sup-ports to exportbulk meta-data/contentsof digital ob-jects.
Greenstonesupports to ex-port bulk meta-data/contentsof digital ob-jects.
MyCoRe sup-ports to exportbulk digital ob-jects.
SOPS sup-ports to exportbulk metadata.
Software sup-ports to addpast versionsof each digitaldocument tocontrol differ-ent editionsof registeredobjects.
DSpace doesnot supportadding pastversions of dig-ital documents.
DoKS supportsadding pastversions of dig-ital documents.
EPrints doesnot supportadding pastversions of dig-ital documents.
Fedora has avery good fea-ture of addingdifferent ver-sions of thedigital docu-ments that areadded into therepository.
Greenstonedoes not sup-port addingpast versionsof digital docu-ments.
Software sup-ports to addpast versionsof digital docu-ments.
Software doesnot supportadding pastversions of dig-ital documents.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
66
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
CDS-Inveniosupport itsown accessionnumber foreach docu-ment that getsadded into therepository.
DSpace has itsown accessionnumber and itis called as aninternal ID.
DOKS create aunique id num-ber for everydocument, au-thor, CV, key-word etc. thatis added intothe repository.
EPrints createa unique nu-meric ID foreach docu-ment that getsadded into therepository.
Fedora sup-ports to createeither customaccessionnumber or de-fault accessionnumber andeach digital ob-ject is identifiedwith PID.
Greenstoneassigns OIDfor every digitaldocument thatis added intothe repository.
MyCoRe gen-erates its ownaccessionnumber forevery digitaldocument thatis added intothe repository.
Software gen-erates its ownaccessionnumber forevery digitaldocument thatis added intothe repository.
CDS-Inveniosupports weedout policy.
DSpace sup-ports weed outpolicy.
DoKS does notsupport weedout policy.
EPrints sup-port weed outpolicy.
Fedora sup-ports weed outpolicy.
Greenstonedoes not sup-port weed outpolicy.
MyCoRe doesnot supportweed outpolicy.
SOPS sup-ports weed outpolicy.
Softwaresupports todelete itemsfrom collec-tion/repositoryas well asmove objectsfrom onecollection toother collectioni.e. contentmapping.
Software sup-ports to deleteitems fromthe collectionbut supportto map itemsacross differentcollections.
DoKS sup-ports to deleteobjects fromthe reposi-tory. DoKSorganises alldocumentsadded intothe repositoryinto differentfolders.
EPrints sup-port to deleteitems fromthe repositrybut does notsupport anycross linkingof objects fromone collec-tion to othercollection.
Fedora sup-ports to deleteitems fromthe repository.Fedora addseach item intothe repositoryas a uniqueitem.
Greenstonesupports todelete itemsfrom the collec-tion but doesnot support tomove digitalobjects fromone collec-tion to othercollection.
MyCoRe sup-ports to deleteitems from therepository butdoes not sup-port to movedocumentsfrom one col-lection to othercollection.
SOPS sup-ports to deleteitems fromthe repositorybut each itemin SOPS isunique, it is notattached withany collection.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
67
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
No global per-sistent identi-fier scheme issupported bythe software.Software hasits own persis-tent ID numberfor all docu-ments that areadded into therepository.
DSpace sup-ports CNRIHandle systemidentifier forall the docu-ments that areadded into therepository.
Software doesnot have anypersistent iden-tifier schemebut it supportsto add anyoher persistentidentifer’s toevery thesisadded into therepository.
EPrints doesnot have anypersistent iden-tifier schemesupport.
Fedora sup-ports URI aswell as sup-ports GDRFregistry as wellas it supportsto add otherpersistentidentifier num-bers as well assuch as handleor DOI etc.
Greenstonedoes notsupport anypersistent iden-tifier scheme.
MyCoRe sup-ports UniformResourceName i.e. URNfor all docu-ments that areadded into therepository.
SOPS doesnot supportany persis-tent identifierscheme.
Software doesnot have anycapability tohandle an ob-ject’s changein the locationand state infuture withsame identifiernumber.
Software doesnot have anycapability tohandle an ob-ject’s changein the locationand state infuture withsame identifiernumber.
Software doesnot have anycapability tohandle an ob-ject’s changein the locationand state infuture withsame identifiernumber.
EPrints doesnot have anycapability tohandle an ob-ject’s changein the locationand state infuture with thesame identifier.
Fedora havecapability tohandle an ob-ject’s changein locationand state infuture withsame identifiernumber.
Greenstonedoes not haveany capabil-ity to handlean object’schange in thelocation andstate in futurewith the sameidentifier.
MyCoRe havecapability tohandle an ob-ject’s changein locationand state infuture withsame identifiernumber.
Software doesnot have capa-bility to handlean object’schange inthe locationand state infuture withsame identifernumber.
CHAPTER 12. COMPARATIVE ANALYSIS 368
Table 12.2: Content Management
Content Management CDS-Invenio
DSpace DoKS EPrints Fedora Greenstone
MyCoRe SOPS
Does the system have proper workflow in place which handles differentsubmission processes?
Yes Yes Yes EPrintssup-portslimitedsub-missionwork-flow
No No Yes No
Create overall system administrator Yes Yes Yes Yes Yes Yes Yes Yes
Allows to submit documents by usersvia web
Yes Yes Yes Yes No No Yes Yes
System allows to set up submissionrules
Yes Yes No No No No Yes No
Define accept/reject policy for docu-ments added into the repository
Yes Yes No Yes No No Yes No
Allows multiple collections withinsame installation
Yes Yes Yes No No Yes No No
Home page for each collec-tion/community
Yes Yes Yes No No Yes No No
Allows to set different policies for dif-ferent collections
Yes Yes No No No No Yes No
Allow to set different authorizationpolicies for different collection/items
Yes Yes No No No No Yes No
Allow to edit metadata submitted byusers
Yes Yes Yes Yes No No Yes Yes
Sends email notifications tousers/submitters/authors
Yes Yes No Yes No No No No
Sends email notifications to meta-data reviewer
Yes Yes No No No No No No
Sends email notification to reviewers Yes Yes No No No No No No
Sends email notification to adminis-trators
Yes Yes No Yes No No No No
Allow users to review completed con-tent
Yes Yes Yes Yes No No Yes Yes
Allow users to review uncompletedcontent
Yes Yes Yes Yes No No Yes Yes
Allow content administrator to reviewsubmissions
Yes Yes No Yes No No Yes No
View pending content administrationtasks
Yes Yes No Yes No No Yes No
CHAPTER 12. COMPARATIVE ANALYSIS 369
Content Management CDS-Invenio
DSpace DoKS EPrints Fedora Greenstone
MyCoRe SOPS
Does system support to upload doc-uments through Librarian’s interfaceas well?
No No No No Fedorasup-ports touploaddocu-mentsthroughLibrar-ian’sinter-faceonly.
Greenstonesup-ports tosubmititemsthroughLibrar-ian’sinter-faceonly
No No
Does system provide easy way foradding/editing of records for Librar-ian as well as for remote user?
Yes Yes Yes Yes No Yes Yes Yes
Are supported file formats well docu-mented in the system? Can new fileformats be added in the system?
Yes Yes No Yes Yes Yes No No
"Does the repository software haveany access control lists, Internet ad-dress filters etc. that limit who is al-lowed to submit documents into therepository?"
Yes Yes Yes Yes Yesonlyfedora-adminusercansubmitdocu-ments.
No Yes Yes
Does the repository software main-tain audit logs that identify by whomand when documents are submittedinto the repository?
No Yes Yes No Yes Yes No Yes
"Does system support knowledge or-ganization systems such as ontolo-gies, classification systems for userswhile adding documents via web?"
Yes Yes No No No Yes Yes No
Does the system support digital ob-jects to be grouped into organizedstructure?
Yes Yes Yes Yes No Yes Yes No
Does the system support to generateauthority files?
No Yes(Au-thors& Sub-jects)
Yes(Au-thorsandKey-words)
No No Yes Yes(Au-thors,insti-tutesandsub-jects)"
No
Does the system allow digital objectto a member of multiple collectionsi.e. Content mapping?
Yes Yes No No No No No No
Does system allow users to modify(add/edit/delete) metadata/completecontent added within the repository?
Yes Yes Yes Yes No Yes Yes Yes
Does the system show strength ofeach collection?
Yes Yes No No No No No No
CHAPTER 12. COMPARATIVE ANALYSIS 370
than Dublin Core such as MARC 21, METS, IEEE LOM etc. Default metadata entry
templates are supported in all software except Fedora. Realupdating and indexing is sup-
ported in all software except Greenstone. One has to rebuildthe collections in Greenstone
then only indexes gets updated. CDS-Invenio, DSpace, Fedora, Greenstone & MyCoRe
supports to upload provenance metadata. Provenance metadata is important from digital
preservation point of view. At present metadata crosswalk is supported by DSpace, Fedora
& Greenstone. Thesaurus building which is one of the important feature in digital libraries,
is supported in CDS-Invenio, DSpace & EPrints.
12.4 Classification
Table 12.4 lists all evaluation criteria related to classification. CDS-Invenio, Greenstone &
MyCoRe supports to enter classification number. DSpace software supports to group digi-
tal objects as per the keywords/thesaurus but does not support adding any class number to
each digital object. EPrints software supports to group digital objects as per the Library of
Congress subject heading lists. DoKS, Fedora & SOPS does notsupport any classification
system.
12.5 Information Search & Retrieval
Table 12.5 lists all evaluation criteria related to information search & retrieval. Meta-
data and full text searching is supported by all software. All software support boolean
search, basic & advanced search features. Truncation, Phrase searching, proximity search-
ing, stemming, fuzzy searching is supported in all softwareexcept EPrints. Thesaurus
based searching is available at present only in DSpace software.
12.6 Access Control Privacy and Management
Access control is very well handled inCDS-Invenio. It supports to define different roles for
different users. It records only name and email address of every user that is registered into
the repository. It has facility to show active, inactive users, guest users etc. Software allow
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
71
Table 12.3: Metadata Submission and Support
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Software sup-ports to adddescriptive,administrativeand structuralmetadata.
Software sup-ports to adddescriptive,administrativeand structuralmetadata.
DoKS supportsto add descrip-tive metadatabut not ad-ministrativeand structuralmetadata.
EPrints sup-ports to adddescriptive, tosome extentadministrativeand structuralmetadata.
Fedora sup-ports to adddescriptive,structural andadministrativemetadata.
Greenstoneby defaultsupports toadd descriptivemetadata butit can be con-figured to enterstructural andadministrativemetadata.
MyCoRe sup-ports to adddescriptive,administrativeand structuralmetadata.
SOPS doesnot support toadd in detailany of themetadata typeas there isno mentionof any meta-data schemasupport inSOPS.
Software sup-ports to addTitle, Author,Affiliation,Language,Number ofPages, Dateof Creation,Keywords, Ab-stract, Author’sComments,Other Refer-ence Number
DSpace by de-fault supportsadding Author,title, OtherTitle, Date ofIssue, Pub-lisher, Citation,Series/ ReportNo., Identifier,Type, Lan-guage, SubjectKeywords,Abstract,Sponsors,Descriptionfields
DoKS by de-fault supportsadding Ti-tle, Author,Abstract, Pub-lisher, Publica-tion Date, FullText, Appen-dices, Type,Language,Rights, DegreeName, DegreeLevel, Disci-pline, Grantorof the Degree,Department,Awards, Sta-tus, PersistentIdentifier,volumes, col-lation, subtitle,title, notes,copies etc.
EPrints by de-fault supportsto add itemtype, format,description,visible to,license, em-bargo expirydate, title,abstract, cre-ators, corpo-rate creators,Divisions,PublicationDetails, Fun-ders, Projects,Email, Ref-erences,UncontrolledKeywords,Comments,etc.
Fedora bydefault sup-ports to addID, state, Con-trol Group,Versionable,Created, La-bel, MIMEType, FormatURI, AlternateID, FedoraURL for everydocument.
Greenstone bydefault sup-ports to add all15 elementsof Dc such astitle, creator,format, date ofcreation, sub-ject, language,file format, filetype etc.
MyCoRe bydefault sup-ports addingTitle, Au-thor, Origin,Description,Publisher,Contributor,Date, Type,Format, Iden-tifier, Source,Language,Keywords,Coverage, Re-lation, Rights,Size, Con-tact, Note andCitation.
SOPS by de-fault supportsto enter id,authors, year,title, source,summary, key-words, series,type, email,more, content,discussionsand ratings.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatafields.
Software sup-ports adding/editing/ delet-ing of metadatarecords.
Help mes-sages aregiven in eachmetadata fieldswhile enteringdata.
Help mes-sages aregiven in eachmetadata fieldswhile enteringdata.
No help mes-sages aregiven in eachmetadata fieldswhile enteringdata.
For eachmetadata fieldproper helpis providedfor enteringdata along withexamples.
No help mes-sages aregiven in eachmetadata fieldswhile enteringdata.
Help mes-sages aregiven in eachmetadata fieldwhile enteringdata.
There are nohelp messagesgiven in eachmetadata fieldwhile enteringmetadata.
Help mes-sages aregiven in eachmetadata fieldswhile enteringdata.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
72
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Software hasTitle and Au-thor as amandatoryfields.
Software hasTitle and Yearas a manda-tory fields.
DoKS hasTitle asa mandatoryfield.
EPrints hasItem Type,Title, Creator,Referenceand Subjectas mandatoryfields.
Fedora hasonly one fieldmandatoryi..e. Title orit is called asLabel.
There areno manda-tory fields inGreenstone.It automati-cally extractssome of themetadata fromdigital files.
Software hasTitle, Author,Type, Format,Languageand Rightsas mandatoryfields.
Software hasauthor, titleand year asmandatoryfields.
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports UTF-8
Software sup-ports to allowto create andmanage vari-ety of differentmetadataschema.
DSpace cur-rently supportsonly DublinCore metadataschema.
DoKS currentlysupports onlyone metadataschema.
EPrints sup-ports onlyone metadataschema.
Yes, Fedoradefines cre-ating newmetadataschemas asbehaviourdefinition.
Software allowto create vari-ety of metadatasupport.
MyCoRe cur-rently supportsQualifiedDublin Coremetadataschema butit has capa-bility to adddifferent meta-data schemafor differentcollections.
There is nomention ofany meta-data schemasupported bySOPS.
Metadata fieldsand formatscan be cus-tomized.
DSpace hasinputforms.xmlfile which canbe used tocustomizemetadataforms andformats.
It is possibleto customizemetadata fieldsand formats.
It is possibleto customizemetadata fieldsand formats.
It is possibleto customizemetadata fieldsand formats.
It is possibleto customizemetadata fieldsand formats.
It is possibleto customizemetadata fieldsand formats.
Metadata fieldsand formatscan be cus-tomized.
Software sup-ports by defaultDublin Coreand MARC21 metadatastandard
Software bydefault sup-ports DublinCore metadatastandard.
DoKS supportsby defaultETD-MSDublin Coremetadataschema.
EPrints sup-ports onlyDublin Coremetadataschema.
Fedora sup-ports DC,METS, MPEG-21, DIDL, IEEELOM, MARCmetadataschema.
Greenstonesupports:New ZealandGovernmentLocator Ser-vice MetadataStandard(nzgls), RFC1807 Meta-data ElementSet, QualifiedDublin Core,DevelopmentLibrary Sub-set ExampleMetadata,GreenstoneMetadata Set,AustralianGovernmentLocator Ser-vice MetadataElement Set
Software atpresent sup-ports QualifiedDublin Coremetadatastandard.
No defaultmetadataschema sup-port for SOPS.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
73
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Software hasdefault meta-data entrytemplates
Software hasdefault meta-data entrytemplates
DoKS has de-fault metadataentry template.
EPrints has adefault meta-data entrytemplate.
Fedora doesnot have anydefault meta-data entry tem-plate. Softwareimports anyXML basedmetadata.
Software hasdefault meta-data entrytemplates
MyCoRe hassingle meta-data entrytemplate.
Software hasdefault meta-data entrytemplate.
Metadata for-mats are welldocumented inthe software.
Metadata for-mats are welldocumented inthe software.
Metadata for-mats are notdocumented inDoKS.
Metadata for-mats are welldocumented inEPrints.
Metadata for-mats are notdocumented inFedora.
Metadataformats aredocumented inthe system.
Metadataformats aredocumented inthe system.
Metadataformats aredocumented inSOPS.
Software sup-ports differentinterface formetadataentry and cus-tomized datagets added intothe system.
It is possible tocreate differentinterfaces formetadata entryand it can becustomizedto get dataadded into thesystem but itdemands moreknowledgeabout DSpacecoding.
DoKS does notsupport to cre-ate different in-terface for en-tering data.
Software sup-ports to createdifferent inter-face for enter-ing data.
It is not pos-sible to createdifferent inter-faces for enter-ing data.
It is possible tocreate differentinterfaces formetadata entryand softwarehas capa-bility to addcustomizeddata into thesystem.
It is possible tocreate differentinterfaces formetadata entryand softwarehas capa-bility to addcustomizeddata into thesystem.
Software doesnot support dif-ferent interfacefor metadataentry.
There are noways by whichsoftware canverify addeddata.
There areno automatedchecks to verifyadded data.
There areno automatedchecks to verifyadded data.
There areno automatedchecks to verifyadded data.
There areno automatedchecks to verifyadded data.
There areno automatedchecks to verifyadded data.
There areno automatedchecks to verifyadded data.
There areno automatedchecks to verifyadded data.
There are noautomatedchecks for dateentering etc.
There are noautomatedchecks for dateentering etc.
There are noautomatedchecks for dateentering etc.
There are noautomatedchecks for dateentering etc.
There are noautomatedchecks for dateentering etc.
There are noautomatedchecks for dateentering etc.
For date stringthere is auto-mated checkbut for otherfields there areno automatedchecks.
There are noautomatedchecks for dateentering etc.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
74
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software doesnot have anyways by whichdata tamper-ing can becontrolled.
Software sup-ports realtime updatingand indexingof updatedcontents.
Software sup-ports realtime updatingand indexingof updatedcontents.
DoKS supportsreal time up-dating and in-dexing of con-tents.
EPrints sup-ports real timeupdating andindexing ofcontents.
Fedora sup-ports real timeupdating andindexing ofcontents.
Greenstonedoes not sup-port real timeupdating andindexing. Forevery itemadded into therepository onehas to rebuildthe collectionwhich is one ofthe drawbackof Greenstone.
Software sup-ports realtime updatingand indexingof updatedcontents.
Software sup-ports realtime updatingand indexingof updatedcontents.
Software sup-ports metadataextensibilityand complex-ity.
Software sup-ports metadataextensibilityand complex-ity.
Software doesnot supportmetadata ex-tensibility andcomplexity.
Software doesnot supportmetadata ex-tensibility andcomplexity.
It is possibleto have com-plex metadatafor every digitalobject.
Yes Green-stone supportsmetadata ex-tensibility andcomplexity.
Software sup-ports metadataextensibilityand complex-ity.
Software doesnot supportmetadata ex-tensibility andcomplexity.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Metadata ver-sioning is notsupported inthe software.
Provenancemetadata issupported bythe software.
DSpace sup-ports prove-nance meta-data andfollowing meta-data is added: - Name andemail addressof the submit-ter, uploadingdate and time,number ofbitstreamsuploaded, totalnumber ofbytes of eachbitstream,checksum dataof uploadedbitstream
DoKS doesnot supportprovenancemetadata.
EPrints doesnot supportprovenancemetadata.
Fedora sup-ports to adddifferent infor-mation abouta digital objectsuch as size ofthe file, createdby, total sizeetc.
Greenstonesupports toadd prove-nance meta-data.
MyCoRerecords prove-nance meta-data such asdigital docu-ment createdat and lastchanged withtime and date.
SOPS does notsupport anyprovenancemetadata.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
75
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Metadatacrosswalk isnot supported.
DSpace sup-ports metadatacrosswalk.
Software doesnot supportmetadatacrosswalk.
EPrints doesnot supportmetadatacrosswalk.
Fedora sup-ports metadatacrosswalk.
Greenstonesupportsmetadatacrosswalk.
MyCoRe doesnot supportany metadatacrosswalk.
SOPS doesnot supportany metadatacrosswalk.
Thesaurusbuilding is sup-ported by thesoftware.
Thesaurusbuilding is sup-ported by thesoftware.
Software doesnot supportthesaurusbuilding.
EPrints sup-ports LC sub-ject headinglist.
Software doesnot supportany thesaurusbuilding.
Software doesnot supportany thesaurusbuilding.
Software doesnot supportany thesaurusbuilding.
SOPS doesnot supportany thesaurusbuilding.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
No ontologysupport avail-able in thesoftware.
There are noways by whichsubmitted filescan be verified.
DSpace veri-fies the listedfile formatsand checkswhether thosefiles are sup-ported byDSpace thenonly thosefiles will beuploaded intothe repository.
DoKS hasthe facility toidentify filetype. If the fileis not identi-fied it uploadsthe file withunknownstatus.
EPrints has fa-cility to identifysubmitted fileand upload re-spective pluginfor that file.
Fedora has fa-cility to identifyuploaded file. Ifthe file is notMIME type Fe-dora throws er-ror.
Greenstonehas facility toidentify theuploaded filethrough avail-able plugins inGreenstone.
MyCoRe hasfacility to iden-tify file formatssubmitted intothe repository.If the file formatis not identi-fied it uploadsthat file asunknownformat.
There are noways by whichit is possibleto verify sub-mitted file for-mats in SOPS.
While export-ing metadatasoftware doesnot supportchecksums.
While export-ing metadatasoftware doesnot supportchecksums.
While export-ing metadatasoftware doesnot supportchecksums.
While export-ing metadatasoftware doesnot supportchecksums.
While export-ing metadatasoftware doesnot supportchecksums.
While export-ing metadatasoftware doesnot supportchecksums.
While export-ing metadatasoftware doesnot supportchecksums.
Software doesnot supportchecksumswhile exportingmetadata.
Software sup-ports RAP pro-tocol.
DSpace sup-ports RAPprotocol.
DoKS does notsupport RAPprotocol.
EPrints sup-ports RAPprotocol.
Fedora sup-ports RAPprotocol.
Greenstonesupports RAPprotocol.
Software doesnot supportRAP protocol.
SOPS doesnot supportRAP protocol.
CHAPTER 12. COMPARATIVE ANALYSIS 376
Table 12.4: Classification
Classification CDS-Invenio DSpace DoKS EPrints Fedora Green stone MyCoRe SOPS
Entering ClassNumbers
Yes No No No No Yes Yes No
Classification Sys-tem Support
Supports toadd any classnumber such asUDC/ DDC/ BC
No No No No Supports toadd any classnumber such asUDC/ DDC/ BCetc.
Supportsto addDDC classnumbers.
No
Grouping of Docu-ments as per classNumber
Yes Yes, asper key-words
Yes Yes, asper LCsubjecthead-ings
No Yes Yes No
Hierarchical Brows-ing Support
Yes Yes Yes Yes No Yes Yes No
to create different groups for different users, allow to define different user groups, allow
to limit access by role, file/object level, collection level, user/group level etc. Software
keeps track of passwords submitted by users and it allow end users to select his/her own
passwords. Access levels in CDS-Invenio can be generated atdifferent levels. Software
does not support access level through IP filtering or proxy filtering or credential based
filtering. Software does not support to allow to access certain collection for a specific
period.
DSpacecreates e-persons for all the members who register themselves through the web
browser and it is called as My DSpace. For every member DSpaceholds email address, first
and last name, list of collections for which the e-person wishes to be notified for new items,
password details etc. Software supports to add/edit/delete user profiles. DSpace supports
to define different roles for different users. DSpace does not keep detailed information
of every user who is registered into the repository. In DSpace for every user different
permissions are given for different roles. DSpace does not keep track of active and inactive
members. Software allow to create different roles, for different uses with different levels.
All passwords are encrypted and are stored into the database. Software does not assign
password for every user rather it allow end user to select his/her own password. DSpace has
mechanisms to retrieve forgotten passwords. DSpace supports to provide different action
CHAPTER 12. COMPARATIVE ANALYSIS 377
Table 12.5: Information Search and Retrieval
Browsing CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Author/ Creator/Contributor
Yes Yes Yes Yes Yes Yes No Yes
Title of the docu-ment/ Title of thebook etc.
Yes Yes No Yes Yes Yes No Yes
Issue/ Date/ Date ofPublication
Yes Yes Yes Yes Yes Yes No Yes
Collection Yes Yes Yes No No Yes No NoCommunities No Yes No No No No No NoSubject Browsing Yes No Yes Yes Yes Yes Yes YesPublisher wiseBrowsing
Yes No No No Yes Yes No No
Table of contentsbrowsing
No No No No No Yes No No
Multi-dimentionalbrowsing
No No No No No No No No
Searching
Metadata fieldssearching
Yes Yes Yes Yes Yes Yes Yes Yes
Full Text Searching Yes Yes Yes Yes Yes Yes Yes YesBoolean (AND, OR,NOT) Searching
Yes Yes Yes Yes Yes Yes Yes Yes
Basic Search Yes Yes Yes Yes Yes Yes Yes YesAdvanced Search Yes Yes Yes Yes Yes Yes Yes YesPutting differentoperators whilesearching
Yes Yes Yes No Yes Yes No Yes
Truncation/ wild cardsearching
Yes Yes Yes No Yes Yes Yes Yes
Phrase Searching Yes Yes Yes No Yes Yes Yes YesProximity Searching Yes Yes Yes No Yes Yes Yes YesStemming Yes Yes Yes No Yes Yes Yes YesFuzzy Searching Yes Yes Yes No Yes Yes Yes YesPhonetic Searching Yes No No No No No No NoCase sensitive orcase insensitivesearching
Yes No No No Yes Yes No No
Term weighting Yes No Yes No Yes Yes No NoSearch history op-tion, reuse of query,query save option
Yes No No No No Yes No No
Boosting the term Yes Yes Yes No Yes Yes No NoRange searching Yes No No No No Yes No NoUse of Thesarus No Yes No No No No No NoExpand Search Yes No No No Yes Yes Yes NoLateral Searching No No No No No No No NoSearch support forspecial collections
No No No No No Yes No No
Searching via tableof contents and clas-sification codes
No No No No No Yes No No
Ability to browsesubject/ authorauthority files
No No No No No Yes No No
PIC variant search-ing
No No No No No No No No
CHAPTER 12. COMPARATIVE ANALYSIS 378
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Multilingual searchand retrieval
Yes Yes Yes Yes Yes Yes Yes No
Allow refinement ofsearches with quali-fiers
Yes No No No Yes Yes Yes No
Ability to group priorto search
Yes No No No No Yes No No
Spell checker sup-port
No No No No No No No No
Refine searches Yes No No No Yes Yes Yes No
Sorting
Author Yes Yes Yes Yes No Yes Yes NoTitle Yes Yes Yes Yes No Yes Yes NoIssue Date Yes Yes Yes Yes No Yes Yes NoRelevance Yes No No No No Yes No No
levels for different groups as well as for different communities, collections and items. By
default all items added into the repository have a policy of ANONYMOUS read. DSpace
cannot define access to a particular collection or item for a specific period.
DoKS has a very good user database management. It allows to createdatabase of dif-
ferent users. The system administrator or root only can add different users with different
permissions for every user and can assign password for everyuser. Every user have to be
a member of at least one group then only he/she is authorized to submit/edit documents
into the repository. Software supports to add/edit/deleteuser profiles. Software supports
to define different roles for different users. For every userDOKS keep details such as user
account, password, description, primary group, salutation, first name, last name, affiliation
details, institution name, faculty, department, email, telephone, FAX, Cell Number etc.
For every group DOKS allow to provide permissions based on different types of groups
such as each user can be either from admingroup, authorgroup, broadcastgroup, editor-
group etc. DoKS does not keep track of active and inactive users. System allow to create
different roles, for different groups with different actions. It has facility to control access
to users at collection level, file/object level as well as user/group level. All passwords
are encrypted into the system and stored in a database. For the first time software assign
passwords for each user and later on each user can change the password assigned by the
system. End user cannot change the password but only adminuser has privileges to change
the password. DoKS access levels are defined as per the different groups and for groups it
CHAPTER 12. COMPARATIVE ANALYSIS 379
allows to define access levels. Software does not have facility to allow to access a particular
collection for a specific period.
EPrints has a very good feature of keeping detailed information of every user who reg-
isters into the repository. For every user system keeps detailed information such as type
of user, email address, name, department, organization, address, country, homepage URL
and user preferences. System supports to add/edit/delete user profiles. EPrints have lim-
ited definition of defining roles. It allow to create user, editor and repository administrator
roles. No other roles such as metadata editor, metadata reviewer etc. can be created in
EPrints. EPrints store detailed information of every registered user. EPrints does not have
any provision to identify active and inactive members. Software does not create different
roles for different users and for different groups. EPrintsallow to give access to metadata
level information to all. The full text view can be controlled by allowing anyone, registered
users and repository staff only. EPrints supports to keep the items added into the reposi-
tory for a specific period. One can add embargo date for every item that is added into the
repository.
Fedora does not have any user management support. It supports to create only one user
account that is FedoraAdmin and only FedoraAdmin user is allowed to carry out different
transactions in Fedora. One of the important feature of Fedora is it allow to control access
to individual object through IP filtering as well as through proxy filtering. Other features
related to access control and privacy are not supported by Fedora. If Fez or Murador are
used Fedora supports to add different users etc.
Greenstonesupports adding different users through its web interface called as ’collector’.
When this feature was explored it did not work successfully.Since this function did not
work successfully with Greenstone’s version 2.75 many evaluation criteria related to access
control were not valid.
MyCoRe allows to create database of different users. The system administrator or root can
create different users with different roles and passwords.Software supports to add/edit/delete
user profiles. For every user MyCoRe records user account, password, description, primary
group, salutation, first name, last name, affiliation details, institution name, faculty, depart-
ment, email, telephone, FAX number and cell number details.MyCoRe allows to provide
permissions based on different types of groups such as each user can be either from admin-
CHAPTER 12. COMPARATIVE ANALYSIS 380
group, authorgroup, broadcastgroup, editorgroup, readergroup etc. Software does not keep
track of active/inactive groups. MyCoRe allow to create different groups for different uses
and allow each document to be customized for each role/group. Software supports access
limits at file/object level as well as at user/group level. All passwords are encrypted into the
system and stored in the database. For the first time system assign passwords for each use
and later on each user can change the password assigned by thesystem. MyCoRe has very
good access control mechanism. It allows to control access by individual user from the
repository as well as allow to control access by IP authentication at different levels. My-
CoRe supports to provide access control to every digital object added into the repository
with date stamp.
SOPSusers are managed in a WODA database. Software supports to add/edit/delete user
profiles. SOPS record user name, given name, family name, email, title, affiliation, country
and address for every user that is added into the repository.SOPS software does not have
any work flow. It does not give any permissions for different users. SOPS keep track
of active and inactive members. Software does not support tocreate different groups for
different users. Software does not allow access limits at different levels. Software assign
passwords for each user. System allow end users to select their own passwords. Software
supports to retrieve forgotten passwords. Software does not provide access to the digital
library through IP source filtering or through proxy filtering. System does not have any
feature to support access to the collections for a specific period.
12.7 Authentication and Authorization
CDS-Invenio has multiple authentication methods that let the repository administrator to
define different levels of access and restrictions according to the users. CDS-Invenio is well
designed for setting different authentication and authorization methods than DSpace. It has
a very good authentication & authorization support. Software supports to create complex
configurations for different roles, for different collections with different privileges. Soft-
ware supports to restrict access to contents through login name and password authentica-
tion. Software supports to have a super user/admin user who can perform any operation in
the system. CDS-Invenio also supports to have external authentication mechanisms. Soft-
ware supports to define different roles such as content developers role, administrator role,
CHAPTER 12. COMPARATIVE ANALYSIS 381
contributor role, general user’s role, metadata reviewer’s role etc.
DSpacesupports to set different authorization policies. Authorization system in DSpace
is based on associating actions with objects and the lists ofpeople who can perform them.
Software supports to set different roles for different members for different collections. Soft-
ware support to have a super user/admin for all actions in thesystem. Software supports to
use an external authentication mechanism such as LDAP. Software supports to have con-
tent developer role, administrator role, general user, metadata reviewer etc. Authentication
and authorization in DSpace is well designed and it is one of the best feature in DSpace.
DoKS software supports to provide setting different authorization policies. It supports to
restrict access only based on login and password. Software supports to restrict access on
different collections, items in collections viewing. Software supports to generate super
user/admin who is overall manger/administrator of the software. System can create differ-
ent roles within the system. It can be configured for externalauthentication mechanisms
such as LDAP. Software has capabilities to display only those pages that are pertinent to
each user that he/she is able to see. Software supports to create different roles such as
content developers role, administrator role, contributorrole, general user etc.
EPrints software supports setting authorization policies with limited support. Software
supports to restrict access based on login and password. EPrints software supports autho-
rization privileges based on item full view with permissions such as viewed by anyone,
registered users only and repository staff only. Software supports only three roles such as
users, editor and repository administrator. Software supports to generate super user/admin
for overall manger/administrator of the software. Software does not support to define dif-
ferent roles with the system. EPrints does not use any external authentication mechanism.
Fedora does not support any authentication and authorization. In Fedora only FedoraAd-
min can submit documents into the repository. None of the other features related to authen-
tication and authorization are supported by Fedora. Fedoracan be configured with external
authentication mechanism such as LDAP.
Greenstonedoes not support any authentication and authorization. In the documentation
manual there is mention of incorporating an authenticationscheme which can be used
to restrict people who are allowed to carry certain actions,for different collections for
different members but when this feature was explored it did not work successfully.
CHAPTER 12. COMPARATIVE ANALYSIS 382
MyCoRe supports to set different authorization policies. Software supports to restrict
access to contents through password authentication and through IP filtering as well. By
default, MyCoRe shows all documents added into the repository to end users but detailed
view is restricted based on different authorization policies. Software supports to generate
different roles for different members for different items/collections. Super Admin account
is created in the software. MyCoRe can also be configured withexternal authentication
mechanism such as LDAP. Software supports to create different roles such as content de-
veloper role, administrator role, contributor role etc.
SOPSdoes not support to set different authorization policies. Many features related to au-
thentication and authorization are not supported by SOPS. Software only supports to create
general user and allow end user to submit documents and software allow to create one ad-
ministrator account. None of the other features related to authentication and authorization
were valid for SOPS.
12.8 Interoperability
Table 12.6: Interoperability Support
Software Z39.50/OAI-PMH Support Harvesting Software used SRU/ SRWSupport
OpenURL Sup-port
CDS-Invenio OAI-PMH 2.0 protocol. It has its own harvesting soft-ware.
Yes Yes
DSpace OAI-PMH 2.0 protocol. PKP harvesting software Yes YesDoKS OAI-PMH 2.0 protocol. OAICAT harvesting software No YesEPrints OAI-PMH 2.0 protocol. PKP harvesting software No YesFedora OAI-PMH 2.0 protocol. No information about which
harvesting software is used isfound anywhere.
Yes Yes
Greenstone Software supports Z39.50 proto-col as well as OAI-PMH 2.0 pro-tocol.
Software uses its own OAIsoftware known as oaiserver.
Yes No
MyCoRe Software supports Z39.50 proto-col as well as OAI-PMH 2.0 pro-tocol.
Software uses its own harvest-ing software.
Yes Yes
SOPS OAI-PMH 2.0 protocol. No information about whichharvesting software is used isfound anywhere.
No No
Table 12.6 shows that all software support OAI-PMH protocol. Some of the software use
their own harvesting software. Greenstone & MyCoRe software support Z39.50 protocol
CHAPTER 12. COMPARATIVE ANALYSIS 383
as well. SRU/SRW support is provided by CDS-Invenio, DSpace, Fedora, Greenstone &
MyCoRe. OpenURL support is provided by all software except Greenstone & SOPS.
12.9 Ease of Deployment of Each Software
Table 12.7 lists all evaluation criteria related to installation of each software. Installation
of Greenstone is the easiest task only Greenstone software has automatic installation script
file. All other software demand high knowledge of configuringbackend database, web
server, file permissions, setting environments etc. There are no automatic installation script
files available for other software.
12.9.1 Hardware/Software Requirements
No specific hardware requirements are mentioned forCDS-Invenio. Though the first ver-
sion of software is yet to release, software has been bringing out new versions frequently
with several new features. Software does not have any procedure to monitor and receive
notifications when hardware technology changes. The software creators have process to
stay current with the latest operating security fixes.
There are no specific hardware requirements forDSpaceinstallation except UNIX. DSpace
has been widely used by a large community. Since it is jointlydeveloped by HP and MIT
regularly new versions of DSpace are available for download. Software does not have
any procedures in place to monitor and receive notificationswhen hardware technology is
changed.
There are no specific hardware requirements forDoKS installation.Software has brought
out sofar only three releases but the latest release is not working properly. Software does
not have any procedures in place to monitor and receive notifications when hardware tech-
nology is changed.
For installingEPrints atleast 1 GHz processor, 512 MB RAM and 40 GB disk space is
specified in documentation manual. EPrints has been actively used by a large community
among all the selected software. Software has been regularly bringing out new versions at
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
84
Table 12.7: Software Installation
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Automaticinstallationscript
No automaticinstallationscript avail-able.
No automaticinstallationscript avail-able.
No automaticinstallationscript avail-able.
No automaticinstallationscript avail-able.
No automaticinstallationscript avail-able.
Yes, singlefile is thereand it installsall relatedcomponents.
No automaticinstallationscript avail-able.
No Automaticinstallationscript avail-able.
Disk spaceused
700 MB 500 MB 100 MB 130 MB 349 MB 111 MB 268 MB 13 MB
Time requiredto install eachsoftware
It is difficult toinstall CDS-Invenio. Ittakes morethan a dayto installsuccessfullyCDS-Invenio.
It takes fewhours to suc-cessfully installDSpace.
It is tricky toinstall DoKS.Installation ofDoKs needsexpertise asthere is needto change dif-ferent file pathswhich is notmentioned inthe installationprocess. Ittakes a day tosuccessfullyinstall DoKS.
It is easy toinstall EPrints.It takes aroundtwo to fourhours tocompletelyconfigure thesoftware.
If all environ-ments are setproperly it iseasy to installFedora. Ittook aroundtwo hours tocompletelyconfigurefedora.
InstallingGreesnstonesoftware isvery easy. Ittakes hardlyany time toinstall thesoftware.
Installing My-CoRe is easy.It took abouttwo to threehours to dosuccessfulinstallation ofMyCoRe
Installation ofSOPS is noteasy. Onehas to changepaths in manyfiles. It requiresexpertise helpin handlingSOPS instal-lation. It takesaround four tofive hours toinstall SOPS.
Documentationhelp availableand useful forinstallation
Yes Yes Yes Yes Yes Yes Yes but avail-able in German
Yes but not indepth.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
85
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Environmentsneeded to setfor installa-tion of eachsoftware
No environ-ments areneeded to setfor installation.
JAVA, Tomcat,Ant environ-ments areneed to set forinstallation
JAVA and ANThome envi-ronments areneeded to set.
No environ-ments areneeded to setfor installation.
JAVA, Tomcat,Fedora homeenvironmentsare needed toset
JRE environ-ments arerequired tobe set forGreenstoneInstallation.
MYCORE_HOME, DOC-PORTAL_HOME, ANT_HOME, JAVA_HOME envi-ronments areneeded to beset.
No environ-ments areneeded to setfor installation.
Operating Sys-tems on whichsoftware canbe installed
POSIX,Linux, So-laris, SunOS
FreeBSD, De-bian, Ubuntu,Mandriva,OSX, MacOS X, RedHat, Gentoo,SLES9 as wellas windows(with limitedsupport)
Any Linux fla-vor or windows
Linux, Solaris,BSD, OSX aswell as Win-dows
Windows 2000,NT, XP, Solarisand any Linuxflavour or MacOS
Greenstonecan be in-stalled on all32-bit win-dows (95/ 98/2000/ XP/), allPOSIX(Linux/BSD/ UNIX-likeOSES), OS X
MyCoRe canbe installedon 32-bit MSWindows (95/99/ NT/ 2000/XP), All POSIX(Linux/ BSD/UNIX-like),Linux, Solaris,IBM, AIX andMacOS
Linux, Unixas well aswindows
Programminglanguagesused
Python, LISPand PL/ SQL
JAVA andserver sideJAVA technolo-gies includingJAVA servlets,JSP
Java frame-work tools
PERL CORBA andJAVA
C++ , JAVAand Perl
Java, XSL(XSLT/ XPath/XSL-FO)
Perl
Database sup-ported at backend
MySQL PostGreSQLand Oracle
MySQL MySQL MySQL, Post-GreSQL, Ora-cle and Mckoi
GDBMdatabase i.e.Gnu DatabaseManager
MySQL,HSQL, IBMContent Man-ager, Oracle,DB2
WODA
Web serverused
Apache WebServer
Apache WebServer
Apache WebServer
Apache WebServer
Apache WebServer
Apache WebServer/ IISWeb Server
Apache WeServer, JettyWeb Server
Apache WebServer
Java servletengine needed
No Apache Tom-cat or JakartaTomcat
Apache Tom-cat
No ApacheTomcat andApache Axis(SOAP) servlet
No Apache Tom-cat
No
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS3
86
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
System Sup-port/ Mainte-nance
Documentation/manuals
Online Doc-umentationavailable athttp:// cdsinve-nio. cern.ch/invenio/ docu-mentation.html
Online Doc-umentationavailable athttp://www.dspace.org
Online Doc-umentationavailablethroughwiki pagehttp://doks.khk.be/wiki/index.php/Main_Page
Online Doc-umentationavailablethroughwiki pagehttp://wiki.EPrints.org
Online Doc-umentationavailable athttp://www.fedora.info
Online Doc-umentationavailablethroughwiki pagehttp://wiki.green-stone.org/wiki/ index.php/Manual
Online Doc-umentationavailable athttp://www.mycore.de/content/ main/documenta-tion.xml
Documentationmanual avail-able but it isnot in depth.
Mailing lists/discussionforums
CDS-Inveniohas threemailing listssuch [email protected],[email protected],[email protected].
http://sourceforge.net/mail/ ?group_id=19984
No mailing listavailable
EPrints hasonly techni-cal mailinglist such ashttp://www.EPrints.org/tech.php/
Fedora hastwo mailinglists such ashttps://lists.source-forge.net/ lists/listinfo/ Fedora-commons-developers,https://lists.source-forge.net/ lists/listinfo/ Fedora-commons-users
Greenstonehas two mail-ing lists suchas [email protected]@ lists.scms.waikato.ac.nz
MyCoRe hasuser mailinglist and devel-opers mailinglist suchas [email protected], mycore-developers@lists. source-forge. net
No mailing listavailable.
Wiki pages https://twiki.cern.ch/ twiki/bin/ view/ CDS/Invenio
http://wiki.dspace.org
http://doks.khk.be/ wiki/index.php/Main_ Page
http://wiki.EPrints.org
http://Fedora.info/wiki
http://wiki.green-stone.org/ wiki/index.php/Green-stoneWiki
http://cmsobj.rrz.uni- ham-burg.de:50080/ cm-swiki/ My-CoReWikiWiki
No wiki pageavailable.
Help desk sup-port
No http://libraries.mit.edu/ dspace-mit/info/ dspace-help.html
No No http:// source-forge.net/forum/ fo-rum.php?forum_id=38148
No No help desksupport avail-able.
Ease of systemadministration
Software iscomplex toconfigure
System ad-ministrator caneasily config-ure softwarefor differentusers.
It is easyto configureDoKS fordifferent users.
EPrints canbe config-ured easily fordifferent users.
Fedora is notmeant for dif-ferent users.Only fedoraAd-min can submitdocuments tothe repository.
ThoughtGreenstonehas abilityto configurefor differentusers thisfeature doesnot work prop-erly through’collector’
It is easy toconfigure My-CoRe for differ-ent users.
SOPS is easyto configure fordifferent users.
CHAPTER 12. COMPARATIVE ANALYSIS 387
a frequent intervals. Software does not have any proceduresin place to monitor and receive
notifications when hardware technology is changed.
Fedora installation does not require any specific hardware configuration. It can be installed
on any PIV machine with 512 MB RAM. Fedora is well maintained by its developers and
regularly new versions are brought out. Software does not have any procedures in place to
monitor and receive notifications when hardware technologyis changed.
Greenstonerequires minimum P III or PIV machine for installation. Since 1998, Green-
stone developers are bringing out continuously new versions. A major change is made in
Greenstone version 2 and now Greenstone version 3 is available but it is not yet stable.
Software does not have any procedures in place to monitor andreceive notifications when
hardware technology is changed.
MyCoRe can be installed on any PIV machine though no specific hardware requirements
are mentioned on the MyCoRe site. In Germany MyCoRe has been actively used and
regularly new versions of MyCoRe are brought out. Software does not have any procedures
in place to monitor and receive notifications when hardware technology is changed.
SOPSworks well with Intel Pentium 133 MHz class machine. SOPS hasnot brought out
any new versions after version 1.0. Software does not have any mechanisms to monitor
and receive notifications when hardware technology changes.
12.9.2 Security
Table 12.8 lists evaluation criteria related to system security. All software have secure
database connection through login and password except Greenstone. Greenstone software
database access is not available to end users. DSpace, DoKS,Fedora & MyCoRe supports
data encrytion. Fedora also supports digital signature feature. DSpace, DoKS, EPrints
supports md5 checksum. Fedora supports data integrity stamp for the datastream which
can be calculated using one of many standard algorithms suchas MD5, SHA-1, SHA-256,
SHA-384, SHA-512.
CHAPTER 12. COMPARATIVE ANALYSIS 388
Table 12.8: Security & System Level Security
Security & SystemLevel Security
CDS-Invenio
DSpace DoKS EPrints Fedora Green-stone
MyCoRe SOPS
Data Encryption No Yes Yes No Yes No Yes No
Digital Signatures No No No Yes Yes No No No
Server Security Yes Yes Yes Yes Yes No Yes Yes
Fixity No Softwaresup-portsmd5check-sum.
No Softwaresup-portsmd5check-sum.
Fedora supports dataintegrity stamp for thedatastream which canbe calculated using oneof many standard algo-rithms such as MD5,SHA-1, SHA-256, SHA-384, SHA-512
No Softwaresupportsmd5 check-sum.
No
Firewall Support No No No No No No No No
12.9.3 General features supported by each software
12.9.3.1 Back end maintenance: None of the software requireany back end maintenance.
12.9.3.2 RSS Support: CDS-Invenio, DSpace, DoKS, EPrints,SOPS provide RSS support.
Greenstone, Fedora, MyCoRe does not provide any RSS support.
12.9.3.3 Upgrading Software: Upgrading CDS-Invenio, DoKS, EPrints, Fedora, MyCoRe,
Greenstone is trivial. DSpace upgrading is not very simple.If the major version re-
lease difference is there, one has to upgrade back end database as well as the DSpace
version which not very simple for the end user. Upgrading DSpace requires more
knowledge of backend technology. No new versions of SOPS aremade available
after the first release hence how to upgrade these software isnot required.
12.9.3.4 Data Migration: CDS-Invenio, DSpace, DoKS, EPrints, Fedora, Greenstone, My-
CoRe support migration of data from one repository softwareto another repository
software. No information available for data migration for SOPS software.
12.9.3.5 Ranking Algorithm: CDS-Invenio supports to rank all documents that are added into
the repository and SOPS software supports to rank each document that is added into
the repository in the scale of poor, average, good, very goodand excellent. No other
software support any ranking algorithm.
CHAPTER 12. COMPARATIVE ANALYSIS 389
12.9.3.6 Known issues/Bugs in the system
CDS-Invenio data entry has one bug, which is while submitting documents into the
repository one has to apply a patch which needs to be used to get this function work-
ing. The patch is websubmit_engine.py. There are no major issues in DSpace but
DSpace data import/export from different instances of DSpace is not very easy to
carry out. DoKS document submission is not very easy. Systemthrows error while
submitting documents into the repository. One of the main drawback of Greenstone
is, it does not have incremental indexing. For every item that is added into the reposi-
tory one has to rebuild the collection. If there are millionsof records then the rebuild
process takes lot of time, which is not the case with other software. The “collector”
option available with Greenstone does not work properly. There is no ’log out’ option
available for web interface which is one of the main bug in Greenstone. MyCoRe
installation has a small bug. While installing MyCoRe one has to rerun ’ant create’
and ’ant webapps’ commands twice and then only it gets successfully installed. One
of the major issue with SOPS is its installation. One has to change several paths in
many files while installing SOPS and then only it gets successfully installed.
12.9.3.7 Submitting Error Reports: CDS-Invenio, DSpace allow to submit error reports for
any error while using the software but DoKS, EPrints, Fedora, MyCoRe, SOPS does
not allow to submit any error reports while using the software.
12.9.3.8 Number of developers working for the software
Table 12.9: Number of Developers Working for Each Software
Name of the Software Number ofDevelopers
CDS-Invenio 2DSpace 14DoKS 6EPrints 6Fedora 16Greenstone 9MyCoRe 16SOPS No information
available.
CHAPTER 12. COMPARATIVE ANALYSIS 390
As shown in Table 12.9 maximum number of developers are working on Fedora and
MyCoRe followed by DSpace and Greenstone.
12.9.3.9 Total Number of Downloads of the Software
Table 12.10: Total Number of Downloads of Each Software
Software Total Number of Downloads
CDS-Invenio No information available.DSpace 3926 (7th March 2008)DoKS 173 (23rd February 2008)EPrints No information available.Fedora No information available.Greenstone No information available.MyCoRe 27 (3rd March 2008)SOPS No information available.
DSpace is downloaded maximum number of times as shown in Table 12.10. DSpace,
DoKS and MyCoRe are available through sourceforge site hences different statistics
are available but other software are not available through sourceforge site hence their
total number of downloads are not maintained by the softwarecreators.
12.9.3.10 Strength of the Community
Table 12.11: Strength of the Community of Each Software
Software Total Numberof downloads
CDS-Invenio 19DoKS 6DSpace 307EPrints 251Fedora 28Greenstone Not KnownMyCoRe 17SOPS 12
Table 12.11 shows that there are more number of installations of DSpace followed by
EPrints. Greenstone is heavily used in Asian countries but many of the installation of
CHAPTER 12. COMPARATIVE ANALYSIS 391
Greenstone are running on Intranet servers hence it is difficult to know total number
of users of Greenstone [1]. CDS-Invenio, Fedora, MyCoRe, SOPS and DoKS have
less user base.
12.9.3.11 License Terms and Conditions
Table 12.12: License Terms and Conditions
Software License Terms and Conditions
CDS-Invenio GNU General Public LicenseDSpace BSD Distribution LicenseDoKS GNU General Public LicenseEPrints GPL License terms and conditionsFedora Mozilla Public LicenseGreenstone GNU General Public LicenseMyCoRe GNU General Public LicenseSOPS No License specified
Table 12.12 shows license terms and conditions for each software. Each software is
available under different license terms and conditions. For SOPS software there is
no license terms and conditions specified.
12.9.3.12 Users Mailing List used by Each Software
Table 12.13: Mailing List of Users of Each Software
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
2002 132003 33 89 148 683 102004 147 378 269 899 662005 203 336 250 716 1015 1622006 87 463 2000 1202 1054 1922007 330 552 3000 835 1079 3182008 100 107 434 227 226 69
Total 913 1925 5684 3397 4956 817
Table 12.13 provide a tabular view of mailing list of users ofeach software. Though
CDS-Invenio was made available in 2002, the number of installations as well as
mailing list used by the user’s of CDS-Invenio is not as largeas compared to EPrints,
CHAPTER 12. COMPARATIVE ANALYSIS 392
Greenstone, Fedora. EPrints, Greenstore, Fedora mailing list is used by large number
of users. EPrints has the largest number of requests sent through its mailing list
followed by Greenstone. SOPS and DoKS software yet does not have any mailing
list due to small number of user community. DSpace software is also used by large
community and number of requests sent to DSpace are increasing day by day.
12.9.3.13 Training programs conducted by each software
DSpace user meeting is taking place every year since 2004. DSpace workshops
are held all over the world regularly. In India DRTC has takena leading role in
popularizing DSpace at various organizations and have conducted several workshops
on DSpace.
EPrints workshops are conducted regularly at local, regional, national and interna-
tional levels. NCSI, Bangalore has conducted several training programs on EPrints
in India.
Greenstone is one of the popular open source software. Thereare many workshops
held at national and international level for Greenstone. InIndia, IIM Kozhikode has
been identified as a Greenstone support team for South Asia.
There are no training programs conducted for CDS-Invenio, DoKS, Fedora, SOPS
and MyCoRe software.
12.9.4 Architecture of the Software
CDS-Invenio Architecture of CDS-Invenio is complex architecture. The key feature of
CDSware’s architecture is its modular logic where each module carries out a specific and
defined functionality. Software does not support distributed database management. There
is no mention about grid architecture found anywhere in the literature.
DSpace architecture is three layer architecture, including storage, business and applica-
tion layers. Software supports to store data and actual filesin different areas. DSpace also
supports grid architecture.
DoKS No information was found about architecture of DoKS.
CHAPTER 12. COMPARATIVE ANALYSIS 393
EPrints software has several data objects within the PERL modules torepresent runtime
entities such as users, e-prints, subjects etc. EPrints canbe configured as centralised/
distributed database.
Fedora architecture is based on the concept of object-oriented programming. Fedora
employs XML and METS to create digital objects by encapsulating content, along with
metadata about the content and actions that can be performedon the content. One of the
important feature of Fedora architecture is it has the flexibility to deal with the diversity of
electronic record types, supports versioning to preserve access to contents. Fedora supports
grid architecture.
Greenstone The key components in Greenstone architecture are the receptionist and the
collection server. The receptionist provides the user interface, dispatching requests to the
appropriate collection server or servers, and aggregatingresults for display back to the user.
Another significant part of Greenstone architecture is plug-ins and classifiers. Plug-ins
parse and extract metadata from imported documents. Classifiers create browse indexes.
MyCoRe No information about MyCoRe architecture was found any where.
SOPS architecture is modular architecture allowing different modules to be easily in-
cluded, left out, added or replaced relatively easily.
12.9.5 Scalability
CDS-Invenio, DSpace, Fedora and EPrints are scalable in terms of storage and retrieval.
These software have been tested with millions of records. But other software such as
Greenstone, MyCoRe, DoKS and SOPS does not have any information about testing of
millions of records uploaded into these software.
CHAPTER 12. COMPARATIVE ANALYSIS 394
12.9.6 Storage
CDS-Invenio, Fedora & Greenstone supports to store compressed file formats to save disk
space. It also supports compression/decompression techniques.
DSpace, DoKS, MyCoRe & SOPS does not support any compressionand decompression
technique. It does not support to store files if available in compressed file formats.
EPrints supports to store files in compressed file formats butdoes not support compression
and decompression techniques.
12.9.7 Backup/Restore Facility
None of the software selected for evaluation provide any backup/restore facility. All soft-
ware used for the study depend on the system level backup. Forbackup one has to take
backup of system files as well as database files and restore them accordingly.
12.10 User Friendly Interface
CDS-Invenio allow to modify the user interface through an API or through scripts. Soft-
ware supports to customize user interface. It also supportsto customize user interface in
any local language. At present user interface is available in 17 languages. User interface is
same for the Librarian and for the end user. All functions in CDS-Invenio are carried out
through web interface. System provide online help for the users. Software supports multi-
lingual access support and system allow users to submit feedback on system problems.
DSpace user interface can be easily customized if one has knowledgeof JSP. DSpace
user can change complete look of the user interface. DSpace supports to localise user
interface in any language. It is easy to use DSpace user interface for Librarian as well as
for end user. User interface is same for the use of Librarian and end user. All functions in
DSpace are carried out through web interface. DSpace provides online help for using the
software. Software has multilingual access support. System allow users to submit feedback
on system problems.
CHAPTER 12. COMPARATIVE ANALYSIS 395
DoKS It is possible to completely customize DoKS user interface.Software supports to
localise user interface in any language. DoKS user interface is not very easy for use either
for the Librarian or for the end user though user Interface issame for the Librarian and for
the end user. All functions in DoKS are carried out through web interface. DoKS supports
multilingual access support. System does not allow users tosubmit feedback on system
problems.
EPrints It is possible to completely customize EPrints user interface if the end user has
knowledge of PERL. Since EPrints supports UTF-8 it is possible to create user interface in
any localized language. User interface in EPrints is same for the Librarian and for the end
user. There is no online help provided for all operations that are carried out by the user or
by the Librarian. System allow users to submit feedback on system problems.
Fedora It is possible to customize user interface of Fedora. Fedorauser interface for
Librarian and for end user is different. It is not very easy touse Fedora Interface for
Librarian. The terminologies used in Fedora are complex hence it takes time to understand
how Fedora works. Software does not provide any help for end user. Software does not
provide multilingual access support. System does not allowusers to submit feedback on
system problems.
Greenstone user interface can easily be customized if the end user has knowledge of
PERL. Software allows to change header, footer and overall web interface of the system.
Greenstone also has one of the unique feature of creating a text only interface for visually
impaired users. Greenstone’s user interface at present is available in many languages such
as Arabic, Armenia, Bengali, Catalan etc. The Greenstone’suser interface is currently
available in more than 40 languages. Greenstone’s user interface is very easy to use for ex-
perienced and non experienced users. The user interface forLibrarian and for the end user
is different in Greenstone.Software does not allow to submit feedback on system problems.
MyCoRe It is possible to customize user interface of MyCoRe. MyCoResupports to
localize user interface in any language. MyCoRe user interface is very easy to use for the
experienced and non experienced staff. All functions in MyCoRe are carried out through
CHAPTER 12. COMPARATIVE ANALYSIS 396
web user interface. No help is provided for using the MyCoRe.MyCoRe supports mul-
tilingual access support. Software does not have any provision to provide feedback on
system problems.
SOPS It is possible to customize user interface of SOPS. Softwareallow to change
header, footer, theme etc. as per the end users requirements. Currently SOPS interface
is available in English, German and Slovenian Language. SOPS user interface is same for
Librarian and for the end user. Software provide web based interface for all its functions.
Software can support multilingual access support.
12.11 Usability
CDS-Invenio is not very easy to use for the Librarian as it has complex functionality.
One has to have proper command on using the software. Searching in CDS-Invenio is
easy and end user can find his/her way to information quickly.Help features in CDS-
Invenio are user friendly and they answer to how to search forthe information through
CDS-Invenio. Of course, these help feature do not provide technical answers. Software
supports to provide end users to give user opinions on the webas well as it has support
to fill up online user feedback form which is one of the unique feature of CDS-Invenio.
Software supports web usage analyzer based on general web site hits, site hits per day,
unique visitors per day, search collection usage analysis,search engine query analysis,
user basket statistics, user alert statistics etc. which are unique features of CDS-Invenio.
It also generates ranking data for ranking search results based on methods like : word
similarity, similar records, combined methods etc.
DSpace It is easy to use DSpace web interface for searching records as well as easy
to use for the Librarian/Author etc. End user can easily find his/her ways to information
quickly and easily. DSpace help features provide general help but software does not support
any technical help feature. DSpace supports to generate different usage statistics such as
number of item views, number of collection visits, number ofcommunity visits, number
of OAI requests, user logins, most popular searches etc. DSpace also supports to show
CHAPTER 12. COMPARATIVE ANALYSIS 397
results of statistical analysis on monthly basis or total statistical analysis and this statistical
analysis can be made available to public or restricted to system administrator access.
DoKS does not provide any help message while searching neither its documentation
available is very extensive. It is not very easy for the library staff to understand how DoKS
functions. DoKS does not provide any help features for searching documents. DoKS does
not provide any usage statistics reports. DoKS user interface for end users available is not
very easy to use.
EPrints is easy to use for the Librarian as well as for the end user for searching. User can
easily be able to find information through EPrints. EPrints provides general help features
for the end users but does not give any technical answers. EPrints supports to generate
all usage statistics such as usage patterns, use of materials, who uses what, when for what
reasons etc.
Fedora is not very easy to use for the Librarian as well as for the end users. The termi-
nologies used in Fedora are difficult for the end user to understand. For the end user it is
not very easy to search documents through Fedora. No help features are provided with the
user interface in Fedora. Fedora does not provide any usage statistics.
Greenstone librarian’s interface as well as user interface is easy to use. End user can
easily be able to find information from Greenstone digital library. Greenstone does not
provide extensive help feature. It gives general outline ofhelp but it does not cover all
aspects as required by a newcomer. Greenstone supports usage statistics at limited level
such as it keeps track of use of materials and who uses what andwhen.
MyCoRe is easy to use for the Librarian as well as for the end user. Enduser can eas-
ily navigate through different links and can find the information through MyCoRe portal.
MyCoRe does not provide any help feature for searching and retrieval which is one of the
drawback of MyCoRe. MyCoRe does not provide any usage statistics data.
CHAPTER 12. COMPARATIVE ANALYSIS 398
SOPS is easy for use either for the Librarian or for the end user. End user can easily
search through search interface provided with the SOPS. General help is provided with the
SOPS but it does not provide any technical help support. SOPShas very good feature of
keeping track of all activities carried out in SOPS and it is able to provide different usage
statistics such as how information is used, who uses what andwhen etc.
12.11.1 Transaction Log Analysis
CDS-Invenio store error log files in apache error log. It also stores all search history
requests carried out into the repository and stores that data in query log file.
DSpace keeps all log data into log directory where dspace.log file records all actions
carried out in DSpace. DSpace does not store any query log data.
DoKS keeps all logs data in doks.log file where it records error logs and actions carried
out by users. DoKS software keep track of all query’s submitted to the system and stores
it as a query log file.
EPrints uses Apache Server log file for recording all actions carriedout in EPrints.
EPrints software does not have any separate log directory/file in its area to track all ac-
tions. EPrints supports to store search history log data. Itstores all search history requests
carried out into the repository as per the user ID numbers.
Fedora keeps error logs in client as well as server log directory of Fedora. Fedora client
keeps detailed log of all items ingested into the repositoryas well as Fedora server keeps
log of services initialised by the Fedora repository. Fedora does not keep record details of
queries submitted to Fedora software.
Greenstone keeps usage logs, error logs files in /greenstone/etc directory. It does not
record details of queries submitted to Greenstone software.
CHAPTER 12. COMPARATIVE ANALYSIS 399
MyCoRe does not record any error, user, query logs.
SOPS keeps detailed log of all actions carried out in the software. It keeps usage and
error log files. It records all actions such as time duration,actions carried out by each user
etc. Software also records details of queries submitted to the SOPS repository by individual
users.
12.12 Copyright/Policy Issues
CDS-Invenio has capability to provide details about managing contract agreements for
all digital objects that are added into the repository. Software has capability to control
access to digital objects based on access control list to maintain copyright issue.
DSpace has the facility to keep track of all documents that are addedinto the repository
along with their copyright details. DSpace also supports tocontrol access based on access
control list based on login and password. Software also supports to record rights policies
for every document that is added into the repository.
DoKS does not have any provision to maintain contract agreement details for each dig-
ital document. Software supports to control access by IP address as well as by different
authorization procedures.
EPrints supports to allow to keep licensing conditions for individual digital objects with
following options as well as software supports to add any other third party license details.
Access to EPrints is controlled by user login and password.
License unspecifiedCreative Commons AttributionCreative Commons Attribution Non-commercialCreative Commons Attribution Non-commercial No DerivativesCreative Commons Attribution Non-commercial Share AlikeCreative Commons Attribution Public Domain DedicationCreative Commons Attribution GNU GPL (Software)Creative Commons Attribution GNU LGPL (Software)
CHAPTER 12. COMPARATIVE ANALYSIS 400
Fedora allows to maintain contract agreements for all the digital documents that are
added into the repository and supports to restrict access tothe individual objects through
IP filtering.
Greenstone does not have any facility to manage, store contract agreements for all the
digital documents that are added into the repository.
MyCoRe does not have any provision to maintain contract agreementsfor all the docu-
ments that are added into the repository, It supports to control access to digital documents
through IP filtering as well as through login and password.
SOPS does not have any provision to maintain contract agreement details for all the
documents that are added into the repository. It supports tocontrol access restrictions
based on login and password.
12.13 Advanced Features
Table 12.14 provide different advanced features related toeach software. All selected soft-
ware has long term leadership services except DoKS and SOPS.It is observed that all
software have proper support for their further development. CDS-Invenio at present sup-
port many advanced features such as providing citation data, providing virtual collection
support, page ranking support. CDS-Invenio also supports different personalization fea-
tures such as creating individual’s collection, generating discussion lists, writing reviews,
making different annoucements, ranking different digitalobjects available in the reposi-
tory. Similarly SOPS software supports to initiate discussion on particular digital docu-
ment available in the repository as well as allow to rank eachdigital document. DSpace &
DoKS software to some extent allow to create “My DSpace’ collections where individuals
can create their own collections and users can subscribe to aparticular collection.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS4
01
Table 12.14: Advanced Features
Advanced Fea-tures
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Long TermLeadership
CDS-Invenio ismaintained anddeveloped byCERN hence ithas long termleadershipand consultingservices.
DSpace ismaintainedand developedby HP and MIT,it has long termleadership andservices.
DoKS hasnot broughtout any newversions after2006, henceit is difficult todemonstratelong termavailability ofDoKS.
EPrints ismaintainedand developedby University ofSouthampton.It has longterm lead-ership andconsultingservices.
Fedora ismaintainedand developedby CornellUniversity andUniversity ofVirginia Li-brary. It haslong termleadershipand consultingservices.
Greenstonehas long termleadershipand consultingservices as itis maintainedand developedby Universityof Waikato andsupported byUNESCO.
MyCoRe haslong term lead-ership andservices as itis supportedby HamburghUniversity,Germany.
SOPS hasnot broughtout any newversions afterits first re-lease henceit is difficult todemonstratelong termavailability ofSOPS.
Citation Data Software pro-vides citationdata featuresupport.
No No No No No Yes No
Road Map It has definedroadmap for fu-ture releases.
It has definedroadmap for fu-ture releases.
No No Yes Yes Yes No
Virtual Collec-tion Support
CDS-Inveniosupports gen-erating virtualcollectionsupport.
No No No No No No No
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS4
02
Advanced Fea-tures
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
Load Balanc-ing
It can handlelarge volumesof data andhas capacity tohandle morerequests at atime.
No mentionabout loadbalancingof DSpacefound any-where thoughDSpace canhandle millionsof documentscurrently norepositoryholds millionsof records.
No EPrints canperform wellif millionsof recordsare addedin EPrints ormore numberof people try toaccess EPrintsdatabase.
Fedora hasbeen tested byadding millionsof recordsand it hasperformed wellthough num-ber of usersaccessingdatabase aremore.
No No No
Visualization No No No No No No No NoPersonalization Software
supports per-sonalizatonfeature andeach user cancreate theirown collec-tion baskets.Users canalso generatediscussionon any doc-ument whichis available inthe repository.They can alsowrite reviewsof any docu-ment that areavailable in therepository.
DSpace sup-ports ’MyDSpace’ fea-ture which al-low end usersto subscribeto a particu-lar collectionand receiveupdates.
DoKS sup-ports to addsearcheddocumentsthrough DoKSrepository intoindividualsprofile.
No No No No Software sup-ports to createpersonal col-lections.
CH
AP
TE
R12.
CO
MP
AR
ATIV
EA
NA
LYS
IS4
03
Advanced Fea-tures
CDS-Invenio DSpace DoKS EPrints Fedora Greenstone MyCoRe SOPS
TranslationService
No No No No No No No No
CommunityServices
Softwaresupports toprovide com-munity ser-vices such as itallow membersto exchangeideas, makeannoucementsand writereviews.
No No No No No No SOPS sup-ports to initiatediscussion ona particulardocumentavailable inSOPS repos-itory as wellas allow enduser to rateeach digitaldocument.
Page Ranking Software sup-ports pageranking algo-rithm for alldocumentsthat are addedinto the reposi-tory.
No No No No No No No
CHAPTER 12. COMPARATIVE ANALYSIS 404
12.14 Digital Preservation
CDS-Invenio uses automatic conversion of any text file format to PDF format for the
possibility of long-term preservation of digital documents and future conversion. It uses
MARC 21 format to ensure portability and preservation. Software does not yet support
several other features related to digital preservation. There is no mention of any digital
preservation strategy. Software preserves file’s originalidentity such as its name and size.
It supports to upload compound digital objects for e.g. it can upload postscript files and can
automatically convert postscript files into pdf format. It also supports to upload multiple
file formats of the same document. It has facility to keep licensing conditions for individual
images/objects. CDS-Invenio can handle variety of file formats but does not yet support
file format versioning. Digital preservation in CDS-Invenio is supported to very limited
extent.
DSpace supports bit level preservation, where a digital file is preserved exactly as it was
created without any change. While submitting documents in DSpace system keeps track of
known bitstream formats and their support level. The repository provides a list of supported
file formats. Supported formats include those that are documented standards (e.g., TIFF,
AIFF, XML) or have published specifications (e.g., PDF, RIFF).
The other two categories of support for DSpace are "known" and "unsupported". "Known"
formats are those that are common enough to be familiar and usually quite popular, but
which are proprietary and there are no published specifications of those formats. "Unsup-
ported" formats are those that are either unknown to the Libraries or are extremely rare (e.
g., a compiled program, a commercial CAD/CAM file, etc.).
If the software supports a particular file format it shows thelist of supported file formats
else, if format is not supported by DSpace it will give a message “DSpace could not iden-
tify the format of this file. Please describe the format file inthe input box below the list”.
DSpace has built in data integrity check using MD5 (a messagedigest algorithm for secu-
rity applications) to ensure the correctness of each file. DSpace doesnot have any feature
to preserve pre-existing persistent identifiers for submitted digital objects. The provenance
information in DSpace consists of information such as submitted by, email of the submitter,
date, total number of bitstreams submitted, date of making document available on DSpace
CHAPTER 12. COMPARATIVE ANALYSIS 405
web site, actual title of the file format uploaded, total bytes and checksum data. This data
is maintained in DC metadata.
DSpace has mechanisms to keep licensing conditions for eachindividual object as well as
software support to track and manage copyright and restriction issues. The software gen-
erates fixity data after submitting a file into the repositoryby the submitter. It is possible to
add new file formats into DSpace but they may not be supported by DSpace from preser-
vation point of view. DSpace can handle any type of MIME type of file format but may not
be able to preserve them if the format is not open standard.
DSpace version 1.4.2 does not support file format versioning. Software does not record
any representation information and system does not have anyway for automatic format
registration. While submitting any document into DSpace repository the software gives list
of DSpace supported file formats and if the particular file format is not in list it provides
option for the submitter to write details about the file format which is submitted by the
submitter.
EPrints at present does not support any digital preservation strategy but support some of
the features which would be required from digital preservation point of view and would be
useful in future. Software keeps file’s original identitiessuch as its name, size and created
date. Software supports checksum with MD5. EPrints version3 has introduced a new
’history’ function that documents all changes that are madeto records, from the point of
deposit onwards. This information will be useful in future,when there will be need to
modify the content of the repositories, e. g. to migrate file formats. To keep track of what
has happened to a record the repository will need to store both the object itself and all
actions that have been performed on it over time.
One of the important feature of EPrints is it preserves pre-existing persistent identifiers of
submitted digital objects. Software supports to link together compound digital objects and
supports to keep licensing conditions for individual digital objects in the repository with
different licensing conditions. EPrints uses MIME type identification of each file format,
but yet does not record any representation information of any file format added into the
repository.
CHAPTER 12. COMPARATIVE ANALYSIS 406
Fedora at present does not support any preservation strategy but all objects added into
Fedora repository are Internally represented in the file system such as open XML format.
These XML files include data and metadata for the objects plusrelationships to services
and other objects.
Fedora does not preserve file’s original identity. It changes file’s name into the PID number
assigned by Fedora. Fedora supports variety of data integrity checks while file being added
into the repository such as MD5, SHA-1, SHA-256, SHA-384, SHA-512. Fedora sup-
ports to upload alternate ID’s or alternate URI of the digital documents if they are already
existing.
Fedora documents provenance information of each object that is added into the repository.
Software supports to upload compound digital objects whichis one of the unique feature
of Fedora and useful from digital preservation point of view. Fedora has ability to handle
variety of file formats and it does support file format versioning. Fedora uses Internet
MIME type for representing file formats and supports automatic format registration. For
unknown file formats system converts those files to Fedora supported MIME type formats
and uploads the documents.
Fedora supports audit trail features where it provies an essential technology for managing
the life cycle of the digital object. If the descriptive metadata of a digital object is changed
Fedora keeps two versions of the metadata. Software has digital signature feature support.
Fedora has been found more useful from digital preservationpoint of view.
The other software such as DoKS, Greenstone, MyCoRe, SOPS has not yet implemented
any digital preservation support.