GenBank Flat File Reader
Although the MATLAB Bioinformatics Toolbox includes a built-in GenBank file reader, genbankread(), it sometimes fails on flat files whose formatting is unexpected but still valid. This program, gbread(), is designed as a more versatile replacement. Whereas genbankread() expects fields to follow a strict format and order, gbread() blocks off each record as a cell array of text lines, searches for fields, and handles each one with a switch/case design. Unrecognized fields do not cause a fatal error; they are appended to the output structure as a block of text. Benchmarks show a ten-fold speed-up of gbread() over genbankread(). Unlike genbankread(), gbread() parses features automatically by default, but spends no time reading CDS entries when this option is turned off. Overall, gbread() provides enhanced versatility, reliability, speed, and convenience for reading GenBank flat files in MATLAB.
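The dispatch approach described above can be illustrated with a minimal sketch. This is not the gbread() source: the sample record, the output field names (LocusName, Definition, Accession, Sequence, Unparsed), and the rule that a field starts on any line with a non-blank first column are all assumptions made for illustration.

```matlab
% Hypothetical record held as a cell array of text lines (not from a real file).
lines = {
    'LOCUS       EXAMPLE01      24 bp    DNA     linear   SYN 01-JAN-2000'
    'DEFINITION  Hypothetical example record.'
    'ACCESSION   EXAMPLE01'
    'FOOBAR      a field this sketch does not recognize'
    '            continued on a second line'
    'ORIGIN'
    '        1 atgcatgcat gcatgcatgc atgc'
    '//'
    };
lines = lines(:);

% Output structure; Unparsed (a hypothetical name) collects unknown fields.
rec = struct('Unparsed', {{}});

% Assumption: a top-level field begins on any line whose first column is non-blank.
isStart = ~cellfun(@isempty, regexp(lines, '^\S', 'once'));
starts  = find(isStart);
stops   = [starts(2:end) - 1; numel(lines)];

for k = 1:numel(starts)
    block = lines(starts(k):stops(k));               % all lines of this field
    key   = regexp(block{1}, '^\S+', 'match', 'once');

    % Field text: remainder of the first line plus trimmed continuation lines.
    body    = strtrim(block);
    body{1} = strtrim(block{1}(numel(key) + 1:end));

    switch key
        case 'LOCUS'
            rec.LocusName = strtok(body{1});
        case {'DEFINITION', 'ACCESSION'}
            fieldName = [key(1) lower(key(2:end))];  % e.g. 'Definition'
            rec.(fieldName) = strjoin(body(:)', ' ');
        case 'ORIGIN'
            % Join the sequence lines, then drop position numbers and spaces.
            rec.Sequence = regexprep([block{2:end} ''], '[\s\d]', '');
        case '//'
            % Record terminator: nothing to store.
        otherwise
            % Unknown field: keep the raw text instead of raising an error.
            rec.Unparsed{end + 1} = strjoin(block(:)', char(10));
    end
end
```

Because the loop looks fields up by keyword rather than by position, reordering the lines of the sample record leaves the result unchanged, and the unknown FOOBAR field ends up in rec.Unparsed as raw text.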
Cite As
Turner Conrad (2024). GenBank Flat File Reader (https://www.mathworks.com/matlabcentral/fileexchange/48999-genbank-flat-file-reader), MATLAB Central File Exchange. Retrieved .
Platform Compatibility
Windows macOS Linux
Categories
- Industries > Biotech and Pharmaceutical > Genomics and Next Generation Sequencing
- Computational Biology > Bioinformatics Toolbox
Version | Published | Release Notes
---|---|---
1.0.0.0 | |