Skip to main content
Premium Trial:

Request an Annual Quote

Website Serves as Home for Biological Natural Language Processing


Interest in natural language processing is steadily increasing in the bioinformatics community. Following on the attention devoted to the topic at the recent Pacific Symposium for Biocomputing, Northeastern University’s Bob Futrelle has launched a website devoted to the subject of extracting information from biological literature.

The site ( is meant to serve as the focal point for research in the field. Futrelle said it is designed to take account of the fact that, despite its growth, natural language technology is still a very new area for most biologists.

"The people at [PSB] working in this area in January all felt that the time was right to build some organizational structure into the world-wide efforts. Given my many years of experience in both NLP and biology, I felt that I could really contribute by building a site and setting up a mailing list,”Futrelle said.

Futrelle said that one of the greatest difficulties biological NLP faces is the dearth of people trained in both areas. However, factors such as the availability of Medline and other electronic biology text are spurring the growth of the field.

While the challenges of building systems that allow full natural language querying of text are formidable, Futrelle said that because biology text is generally focused and concise, "the NLP systems that are devised for biology text are not going to have to understand all the nuances of literature such as fiction, poetry, legal writing, and other broad and complex fields.”Current biological NLP approaches, however, are only able to extract information from abstracts. Full-text searching capability is still a long-term goal, according to Futrelle.

In addition to hosting the site and setting up a mailing list on the topic, Futrelle intends to continue his own research in the field. He is in the process of submitting an NSF proposal to further his research on a new system architecture that he said would be more advanced than current approaches to computational linguistics.

Futrelle said he is considering hosting a workshop on natural language processing at the Intelligent Systems for Molecular Biology conference this July in Copenhagen.

— BT

Filed under

The Scan

Expanded Genetic Testing Uncovers Hereditary Cancer Risk in Significant Subset of Cancer Patients

In Genome Medicine, researchers found pathogenic or likely pathogenic hereditary cancer risk variants in close to 17 percent of the 17,523 patients profiled with expanded germline genetic testing.

Mitochondrial Replacement Therapy Embryos Appear Largely Normal in Single-Cell 'Omics Analyses

Embryos produced with spindle transfer-based mitochondrial replacement had delayed demethylation, but typical aneuploidy and transcriptome features in a PLOS Biology study.

Cancer Patients Report Quality of Life Benefits for Immune Checkpoint Inhibitors

Immune checkpoint inhibitor immunotherapy was linked in JAMA Network Open to enhanced quality of life compared to other treatment types in cancer patients.

Researchers Compare WGS, Exome Sequencing-Based Mendelian Disease Diagnosis

Investigators find a diagnostic edge for whole-genome sequencing, while highlighting the cost advantages and improving diagnostic rate of exome sequencing in EJHG.