Proteome database of microorganisms
Proteome of an organism refers to the collection of all proteins in the species at the global level. Efforts have been expended to profile the proteome of many species in the past two decades. Such efforts have culminated in the collection of proteomes of many species in public databases. But, proteomic information remains hidden from view of most biologists given the need to parse the information contained in each FASTA file. Using an in-house MATLAB software (https://peerj.com/preprints/27856/), this work sought to parse individual proteome files of different microorganisms obtained from UniProt, and build an enhanced proteome database for each species. Each database contains protein name, amino acid sequence, number of residues in each protein, molecular weight of protein, and nucleotide sequence of each protein in the proteome. The presented database should serve as a useful resource for both fundamental and applied microbiological research. Herein is presented the proteome database of each species considered.
Bacteria and Archaea
Helicobacter pylori phage KHP30
Staphylococcus aureus Newman strain
Eukaryotes