Abstract
Discussion of point mutations is ubiquitous in biomedical literature, and manually compiling databases or literature on mutations in specific genes or proteins is tedious. We present an open-source, rule-based system, MutationFinder, for extracting point mutation mentions from text. On blind test data, it achieves nearly perfect precision and a markedly improved recall over a baseline.
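As a rough illustration of what rule-based extraction of point mutation mentions can look like, the sketch below matches two common notations with regular expressions and normalizes them to (wild-type residue, position, mutant residue) tuples. It is a minimal sketch under stated assumptions, not MutationFinder's actual rule set: the patterns, the `THREE_TO_ONE` table and the `extract_point_mutations` function are hypothetical names introduced here for illustration.

```python
import re

# Hypothetical lookup table: three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
ONE = "".join(sorted(THREE_TO_ONE.values()))
THREE = "|".join(THREE_TO_ONE)

# Illustrative rules only; these are assumptions, not the patterns
# distributed with MutationFinder.
# Rule 1: one-letter form such as "A42G" (uppercase residues only).
ONE_LETTER = re.compile(rf"\b([{ONE}])([1-9][0-9]*)([{ONE}])\b")
# Rule 2: three-letter form such as "Ala42Gly".
THREE_LETTER = re.compile(rf"\b({THREE})([1-9][0-9]*)({THREE})\b")


def extract_point_mutations(text):
    """Return unique (wild_type, position, mutant) tuples, sorted by position."""
    found = set()
    for m in ONE_LETTER.finditer(text):
        found.add((m.group(1), int(m.group(2)), m.group(3)))
    for m in THREE_LETTER.finditer(text):
        # Normalize three-letter residues to their one-letter codes.
        found.add((THREE_TO_ONE[m.group(1)], int(m.group(2)), THREE_TO_ONE[m.group(3)]))
    return sorted(found, key=lambda t: (t[1], t[0], t[2]))


if __name__ == "__main__":
    sentence = "The A42G and Ala42Gly variants, plus Thr790Met, were tested."
    print(extract_point_mutations(sentence))
```

Normalizing both notations to the same tuple means that "A42G" and "Ala42Gly" count as one mention, which is one plausible way to compare extracted mutations against a gold standard when measuring precision and recall.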
Original language | English (US) |
---|---|
Pages (from-to) | 1862-1865 |
Number of pages | 4 |
Journal | Bioinformatics |
Volume | 23 |
Issue number | 14 |
DOIs | 10.1093/bioinformatics/btm235 |
State | Published - Jul 15 2007 |
Externally published | Yes |
ASJC Scopus subject areas
- Clinical Biochemistry
- Computer Science Applications
- Computational Theory and Mathematics
Cite this
MutationFinder: A high-performance system for extracting point mutation mentions from text. / Caporaso, James G; Baumgartner, William A.; Randolph, David A.; Cohen, K. Bretonnel; Hunter, Lawrence.
In: Bioinformatics, Vol. 23, No. 14, 15.07.2007, p. 1862-1865.
Research output: Contribution to journal › Article
TY - JOUR
T1 - MutationFinder
T2 - A high-performance system for extracting point mutation mentions from text
AU - Caporaso, James G
AU - Baumgartner, William A.
AU - Randolph, David A.
AU - Cohen, K. Bretonnel
AU - Hunter, Lawrence
PY - 2007/7/15
Y1 - 2007/7/15
AB - Discussion of point mutations is ubiquitous in biomedical literature, and manually compiling databases or literature on mutations in specific genes or proteins is tedious. We present an open-source, rule-based system, MutationFinder, for extracting point mutation mentions from text. On blind test data, it achieves nearly perfect precision and a markedly improved recall over a baseline.
UR - http://www.scopus.com/inward/record.url?scp=34547879631&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547879631&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btm235
DO - 10.1093/bioinformatics/btm235
M3 - Article
C2 - 17495998
AN - SCOPUS:34547879631
VL - 23
SP - 1862
EP - 1865
JO - Bioinformatics
JF - Bioinformatics
SN - 1367-4803
IS - 14
ER -