Skip to main content
Premium Trial:

Request an Annual Quote

TOOLKIT: GoMiner Tackles Genomics Big Question by Adding Function to Data


John Weinstein's microarray group at the National Cancer Institute, he recounted, was faced with a very big question: "Now that I've done the experimental work, and I've done the statistical work, and I have my list of changed genes, what the hell does that list mean biologically?" In the hope of getting at least a bit closer to the answer, a team of developers from NCI, Georgia Tech, Emory University, and SRA International developed a software tool called GoMiner that organizes genes according to biological function using the Gene Ontology.

The tool, a client-server application available at and, was developed with microarray data in mind, but co-developer Barry Zeeberg of the NCI said it works with any high-throughput genomic or proteomic technology. The key to the system is the GoMiner engine, which automatically assigns a gene association and a GO category to each gene in the input gene list.

Before GoMiner was developed, Weinstein said, the only available option was to manually look up every gene of interest one at a time in the literature and other data sources in order to gauge its biological function. GoMiner, on the other hand, categorizes the list of changed genes within the context of every gene on the entire array in a matter of minutes — a process that Zeeberg estimated would take "literally a lifetime" to do manually for some arrays.

The genes are displayed in a tree structure based on the GO hierarchy, as well as in the form of a directed acyclic graph (DAG) — a representation of hierarchical data that permits some categories to have more than one parent. The DAG visualization is programmed using scalable vector graphics, so users can mouse over nodes to view selected genes or click on any node to view its position in multiple pathways.

GoMiner also links each gene in the tree view to its corresponding page in LocusLink, PubMed, MedMiner, GeneCards, KEGG, and BioCarta. Zeeberg said this list would continue to grow.

In addition to the Java-based GUI, a command-line version of GoMiner is also available for developers who would like to use it for higher-throughput experiments or integrate it with other applications. The command-line option, as well as the DAG capability, set GoMiner apart from similar programs such as Onto-Express and MappFinder, according to Zeeberg.

Zeeberg said the development team of about a dozen programmers relied on "agile computing" — an iterative approach that is technically a subset of extreme programming — to write GoMiner in four months. The team plans to continue developing the software over the next nine months or so to hone its usability features as well as its statistical capabilities, and Zeeberg added that the developers welcome any feedback or suggestions for improvement.

A paper describing GoMiner in detail recently appeared in Genome Biology [], and nearly 1,600 readers had accessed the article as of April 10.

— BT

Filed under

The Scan

Genetic Risk Factors for Hypertension Can Help Identify Those at Risk for Cardiovascular Disease

Genetically predicted high blood pressure risk is also associated with increased cardiovascular disease risk, a new JAMA Cardiology study says.

Circulating Tumor DNA Linked to Post-Treatment Relapse in Breast Cancer

Post-treatment detection of circulating tumor DNA may identify breast cancer patients who are more likely to relapse, a new JCO Precision Oncology study finds.

Genetics Influence Level of Depression Tied to Trauma Exposure, Study Finds

Researchers examine the interplay of trauma, genetics, and major depressive disorder in JAMA Psychiatry.

UCLA Team Reports Cost-Effective Liquid Biopsy Approach for Cancer Detection

The researchers report in Nature Communications that their liquid biopsy approach has high specificity in detecting all- and early-stage cancers.