BioNLP is an initiative by the
Center for Computational Pharmacology
at the
University of Colorado Denver Health Sciences Center
to create and distribute code, software, and data for applying natural language
processing techniques to biomedical texts.
There are many projects associated with BioNLP.
Projects
- Parentheses Classifier: a classifier for the content of parenthesized text
- Simple Semantic Classifier: a text classifier for OBO domains
- BioLemmatizer: a biomedical literature specific lemmatizer.
- Knowtator: a Protege plug-in for text annotation.
- MutationFinder: an information extraction system for extracting descriptions of point mutations from free text.
- OpenDMAP: an ontology-driven, rule-based concept analysis and information extraction system
- BioNLP-Corpora: a repository of biologically and linguistically annotated corpora and biomedical datasets. This project includes
- Colorado Richly Annotated Full-Text Corpus (CRAFT)
- PICorpus
- GeneHomonym
- Annotation Projects
- MEDLINE Mining projects
- Anaphora Corpus
- TestSuite Corpora
- BioNLP-UIMA: Unstructured Information Management Architecture (UIMA) components geared towards the use and evaluation of tools for biomedical natural langauge processing, including tools for our own OpenDMAP
and MutationFinder use.
- OboAnalyzer: an analysis tool to detect
OBO ontology terms that use different linguistic conventions for expressing similar semantics.