Journal of Undergraduate Research

Keywords

software program, DNA sequence data, GenBank

College

Life Sciences

Department

Biology

Abstract

Nucleotide sequence data in GenBank (a government-sponsored repository at http://www.ncbi.nlm.nih.gov/entrez/) are organized in a flat-file format that can be accessed by searches. Obtaining a large volume of sequences from GenBank is tedious because the sequence data must be extracted from the flat file of each sequence individually. The goal of this project was to develop a Java program that loads many GenBank sequences from a single file, extracts the gene names so that the user can group synonyms, and saves the data in either tab-delimited or FASTA format for easy import into other sequence-manipulation programs. This program was seen as a major step toward streamlining the analysis process in our lab.
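The workflow described in the abstract (read a multi-record flat file, pull out gene names, group synonyms, write FASTA or tab-delimited output) can be illustrated with a minimal Java sketch. This is not the program developed in the project, whose source is not reproduced here. The class name `GenBankSketch`, the `Entry` type, and the synonym map are hypothetical, and the parser assumes only the standard flat-file markers `ACCESSION`, `/gene="..."`, `ORIGIN`, and the `//` record terminator.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the program described in the abstract.
final class GenBankSketch {

    /** One parsed record: accession, first /gene qualifier found (or ""), and the sequence. */
    static final class Entry {
        final String accession;
        final String gene;
        final String sequence;

        Entry(String accession, String gene, String sequence) {
            this.accession = accession;
            this.gene = gene;
            this.sequence = sequence;
        }
    }

    private static final Pattern GENE = Pattern.compile("/gene=\"([^\"]+)\"");

    /** Splits a multi-record flat file into entries; each record ends with a "//" line. */
    static List<Entry> parse(String flatFile) {
        List<Entry> entries = new ArrayList<>();
        String accession = "";
        String gene = "";
        StringBuilder seq = new StringBuilder();
        boolean inOrigin = false;
        for (String line : flatFile.split("\\R")) {
            if (line.startsWith("//")) {
                // End of record: store it and reset the per-record state.
                entries.add(new Entry(accession, gene, seq.toString()));
                accession = "";
                gene = "";
                seq.setLength(0);
                inOrigin = false;
            } else if (inOrigin) {
                // Sequence lines carry position numbers and spaces; keep letters only.
                seq.append(line.replaceAll("[^A-Za-z]", ""));
            } else if (line.startsWith("ORIGIN")) {
                inOrigin = true;
            } else if (line.startsWith("ACCESSION")) {
                String[] parts = line.trim().split("\\s+");
                if (parts.length > 1) {
                    accession = parts[1];
                }
            } else if (gene.isEmpty()) {
                Matcher m = GENE.matcher(line);
                if (m.find()) {
                    gene = m.group(1);
                }
            }
        }
        return entries;
    }

    /** Maps a gene name to its user-chosen group name, or leaves it unchanged. */
    static String canonical(String gene, Map<String, String> synonyms) {
        return synonyms.getOrDefault(gene, gene);
    }

    /** Writes entries as FASTA: a ">" header line followed by the sequence. */
    static String toFasta(List<Entry> entries, Map<String, String> synonyms) {
        StringBuilder out = new StringBuilder();
        for (Entry e : entries) {
            out.append('>').append(e.accession).append(' ')
               .append(canonical(e.gene, synonyms)).append('\n')
               .append(e.sequence).append('\n');
        }
        return out.toString();
    }

    /** Writes entries as tab-delimited rows: accession, gene group, sequence. */
    static String toTabDelimited(List<Entry> entries, Map<String, String> synonyms) {
        StringBuilder out = new StringBuilder();
        for (Entry e : entries) {
            out.append(e.accession).append('\t')
               .append(canonical(e.gene, synonyms)).append('\t')
               .append(e.sequence).append('\n');
        }
        return out.toString();
    }
}
```

In this sketch the synonym grouping is a plain lookup table supplied by the caller, for example `Map.of("COI", "cox1")`, so that records annotated with different names for the same gene end up under one label in the output.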

Included in

Biology Commons
