NCBI (National Center for Biotechnology Information)
https://www.ncbi.nlm.nih.gov/
NCBI is a major repository for genomic, genetic, and biomedical data. It provides access to numerous databases and tools, including GenBank, PubMed, and BLAST, that are essential for biological research.
Source Names in tables:
- NCBI: The ID of the protein on the NCBI website.
Phytozome
https://phytozome-next.jgi.doe.gov/
Phytozome is a comparative platform for plant genomics that provides access to annotated plant genomes from the JGI Plant Program and other external sources.
Source Names in tables:
- phytozome13_pac: The Phytozome PAC Transcript ID, an internal unique ID for each protein sequence.
- phytozome13_name: The transcript name, whose format depends on the sequencing project.
UniProt
https://www.uniprot.org/
UniProt is a comprehensive resource for protein sequence and functional information with extensive cross-references to other databases.
Source Names in tables:
- uniprot: The unique ID of the protein in the UniProt database.
Sol Genomics
https://solgenomics.net/
The Sol Genomics Network (SGN) is a genomics and breeding database and resource for the Solanaceae family, including tomato, potato, pepper, eggplant, and related species.
Source Names in tables:
- sol_genomics: The unique ID of the protein in the Sol Genomics database.
- sol_genomics_corrected: Used for proteins from Sol Genomics that were manually curated and corrected. They originate from Sol Genomics, but the exact same sequence will not be found there.
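
The source names listed above can be resolved to their database with a small lookup table. The following is a minimal sketch, not part of the dataset tooling: the names SOURCES, CORRECTED_SOURCES, and describe_source are hypothetical and introduced only for illustration, and the URLs are the database home pages given above, not links to individual records.

```python
# Hypothetical lookup table: source name (as it appears in the tables)
# -> (database name, database home page).
SOURCES = {
    "NCBI": ("NCBI", "https://www.ncbi.nlm.nih.gov/"),
    "phytozome13_pac": ("Phytozome", "https://phytozome-next.jgi.doe.gov/"),
    "phytozome13_name": ("Phytozome", "https://phytozome-next.jgi.doe.gov/"),
    "uniprot": ("UniProt", "https://www.uniprot.org/"),
    "sol_genomics": ("Sol Genomics", "https://solgenomics.net/"),
    "sol_genomics_corrected": ("Sol Genomics", "https://solgenomics.net/"),
}

# Source names whose sequences were manually curated and corrected, so the
# exact sequence is not expected to be found in the upstream database.
CORRECTED_SOURCES = {"sol_genomics_corrected"}


def describe_source(source_name):
    """Return the database, home page, and correction flag for a source name."""
    if source_name not in SOURCES:
        raise KeyError(f"Unknown source name: {source_name!r}")
    database, url = SOURCES[source_name]
    return {
        "database": database,
        "url": url,
        "manually_corrected": source_name in CORRECTED_SOURCES,
    }
```

For example, describe_source("sol_genomics_corrected") reports Sol Genomics as the database and sets manually_corrected to True, which signals that the sequence should not be compared directly against the upstream record.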