Semantic Assistants Wiki (SAW)
In the Context of the ETC Project
SAW Intro
• Wiki– Users collaboratively develop and organize content– Information Analysis is up to the user
• Goal: – “Self-aware wiki that can develop and organize its content”– Support users in information analysis
• Requires NLP to handle majority of content • Semantic Assistants Wiki: Integration of NLP with
Wikis
SAW in Action
Wiki Wiki-NLP Integration Semantic Assistants GATE NLP Pipeline:Names Entity Recognition
wiki.org/Mary “…Mary won…” “…Mary won…”
… Mary won the first prize...
… [[hasType::Person|Mary]] won… XMLXML
Example: Biomedical Literature Curation
• GenWiki: Filled with full text research papers• Entity Recognition: Enzymes, Organisms• Additional semantic information– Systematic name– Link to enzyme database entry
• Wiki is enriched with additional data e.g. using Semantic MediaWiki markup [[hasType::Enzyme]
• Time required to curate research papers reduced
Example: Wiktionary
• Automatically populate the wiki using computational linguistics
• Cross-link between different language entries• Manual work can be reduced
SAW for ETC
• Charaparser
Wiki Wiki-NLP Integration Semantic Assistants
etc-project.org/wiki/fna19 “…abaxial faces, without…”
GATE NLP Pipeline:Charaparser
“…abaxial faces, without…”
… abaxial faces, without
septate trichomes ...
SAW for ETC
• Charaparser <structure id="o0" name="face" constraint="abaxial“/><structure id="o1" name="trichome"> <character name="architecture" value="septate"/></structure><relation id="r0" name=“without" negation="false" from="o0" to="o1"/>
Faces* [[hasConstraint::Abaxial]] * [[without::Trichome]]
Trichome* [[hasArchitecture::septate]]
Wiki Wiki-NLP Integration Semantic Assistants
etc-project.org/wiki/fna19/superstructure/faces
Charaparser NLP Pipeline
etc-project.org/wiki/fna19/superstructure/trichome
XML
SAW for ETC
• Charaparser– GATE compliant NLP pipeline– Charaparser output to wiki markup translation module– Wiki as ‘User Interface’ of Charaparser
• Logic Reasoning, Information Theory, Ontology building can– Read and query charaparser results from wiki (RDF
triplets)– Be possibly integrated in wiki
Open Questions
• Charaparser as GATE compliant pipeline; Effort?• Alternatives to Semantic Assistant Wiki?
Apache Stanbol?• Representation of Structure, Character,
Relation in Wiki (e.g. duplicate structure names)
ReferencesBahar Sateli and René Witte. Natural Language Processing for MediaWiki: The Semantic Assistants Approach. WikiSym 2012.
Bahar Sateli, Marie-Jean Meurs, Greg Butler, Justin Powlowski, Adrian Tsang, René Witte. IntelliGenWiki: An Intelligent Semantic Wiki for Life Sciences. NETTAB 2012.
René Witte and Thomas Gitzinger. Connecting Wikis and Natural Language Processing Systems. WikiSym 2007.
http://www.semanticsoftware.info/first-open-source-release-semantic-assistants-wiki-nlp
Architecture