Journal of Undergraduate Research
Keywords
software program, DNA sequence data, GenBank
College
Life Sciences
Department
Biology
Abstract
Nucleotide sequence data in GenBank (a government-sponsored repository located at http://www.ncbi.nlm.nih.gov/entrez/) are organized in a flat-file format that can be accessed by searches. Obtaining a large volume of sequences from GenBank is tedious because the sequence data must be extracted from the flat file of every sequence individually. The goal of this project was to develop a Java program that reads many GenBank sequences from a single file, extracts the gene names so that the user can group synonyms, and saves the data in either tab-delimited or FASTA format for easy import into other sequence-manipulation programs. This program was seen as a major step toward streamlining the analysis process in our lab.
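The workflow described in the abstract (read several flat-file records, collect gene names, merge synonyms, export) might be sketched in Java roughly as follows. This is an illustrative sketch only, not the program developed in the project: the class name GenBankSketch, its method names, and the user-supplied synonym map are all hypothetical. It also assumes that each record carries an ACCESSION line, gene names in /gene="..." qualifiers, the sequence after an ORIGIN line, and a closing // line, and it ignores qualifiers that wrap across lines.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; not the program described in the abstract.
class GenBankSketch {

    /** One parsed record: accession, gene names from /gene qualifiers, and the sequence. */
    static final class Entry {
        final String accession;
        final Set<String> genes;
        final String sequence;

        Entry(String accession, Set<String> genes, String sequence) {
            this.accession = accession;
            this.genes = genes;
            this.sequence = sequence;
        }
    }

    // Assumption: gene names appear as /gene="NAME" on a single line.
    private static final Pattern GENE = Pattern.compile("/gene=\"([^\"]+)\"");

    /** Splits a multi-record flat file on "//" lines and extracts the fields of interest. */
    static List<Entry> parse(String flatFile) {
        List<Entry> entries = new ArrayList<>();
        String accession = "";
        Set<String> genes = new LinkedHashSet<>();
        StringBuilder seq = new StringBuilder();
        boolean inOrigin = false;
        for (String line : flatFile.split("\\R")) {
            if (line.startsWith("//")) {
                // End of record: store it and reset the per-record state.
                if (!accession.isEmpty() || seq.length() > 0) {
                    entries.add(new Entry(accession, genes, seq.toString()));
                }
                accession = "";
                genes = new LinkedHashSet<>();
                seq = new StringBuilder();
                inOrigin = false;
            } else if (line.startsWith("ACCESSION")) {
                String[] parts = line.trim().split("\\s+");
                if (parts.length > 1) {
                    accession = parts[1];
                }
            } else if (line.startsWith("ORIGIN")) {
                inOrigin = true;
            } else if (inOrigin) {
                // Sequence lines hold position numbers and spaces; keep letters only.
                seq.append(line.replaceAll("[^A-Za-z]", ""));
            } else {
                Matcher m = GENE.matcher(line);
                while (m.find()) {
                    genes.add(m.group(1));
                }
            }
        }
        return entries;
    }

    /** Maps each gene name through the synonym table and joins the distinct results. */
    static String geneLabel(Entry e, Map<String, String> synonyms) {
        Set<String> names = new LinkedHashSet<>();
        for (String g : e.genes) {
            names.add(synonyms.getOrDefault(g, g));
        }
        return String.join(";", names);
    }

    /** Writes one FASTA record per entry: ">accession genes" followed by the sequence. */
    static String toFasta(List<Entry> entries, Map<String, String> synonyms) {
        StringBuilder out = new StringBuilder();
        for (Entry e : entries) {
            out.append('>').append(e.accession).append(' ')
               .append(geneLabel(e, synonyms)).append('\n')
               .append(e.sequence).append('\n');
        }
        return out.toString();
    }

    /** Writes one line per entry: accession, genes, and sequence separated by tabs. */
    static String toTabDelimited(List<Entry> entries, Map<String, String> synonyms) {
        StringBuilder out = new StringBuilder();
        for (Entry e : entries) {
            out.append(e.accession).append('\t')
               .append(geneLabel(e, synonyms)).append('\t')
               .append(e.sequence).append('\n');
        }
        return out.toString();
    }
}
```

In this sketch the synonym grouping is a plain lookup table from a variant name to the name the user prefers. For example, a map entry from "cox1" to "COI" would cause records annotated with either name to be exported under "COI".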
Recommended Citation
Vernon, Heather and McClellan, Dr. David (2013) "Development of Software Program for Organizing DNA Sequence Data from GenBank," Journal of Undergraduate Research: Vol. 2013: Iss. 1, Article 1009.
Available at: https://scholarsarchive.byu.edu/jur/vol2013/iss1/1009