Abstract
We propose an XML (eXtensible Markup Language) format for proteomics databases to exchange proteome analysis data. XML-based data is highly machine-readable and readily represents hierarchical information and relationships. Several XML formats for proteome data already exist, but they mainly represent the sequence information stored in the Protein Information Resource (PIR) and the Protein Data Bank (PDB). Our XML-based data format has a proteome-analysis-oriented structure and describes sample preparation, 2D gel electrophoresis images, the identification of spots in the gels, and the sequence information of those spots. The model is used to exchange both the preparation parameters and the results of 2D gel electrophoresis analysis. An internet platform for exchanging these data would accelerate collaboration among proteomics researchers. Using the XML-based data format for proteomics, we have developed an XML editor and a web-based prototype system consisting of an XML database, an agent, security, and a graphical user interface (GUI).
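As a rough illustration of the hierarchy described above (sample preparation, gel image, spots, and per-spot identification and sequence), the following minimal sketch builds one record with Python's standard `xml.etree.ElementTree` module. All element names, attribute names, and values here are hypothetical placeholders chosen for illustration; they are not the schema defined by the proposed format.

```python
import xml.etree.ElementTree as ET


def build_record():
    """Build one illustrative proteome-analysis record.

    Every tag, attribute, and value below is a made-up placeholder,
    not part of the actual proposed format.
    """
    root = ET.Element("proteomeAnalysis")

    # Sample preparation parameters (hypothetical names and values).
    sample = ET.SubElement(root, "samplePreparation")
    ET.SubElement(sample, "parameter", name="organism").text = "example organism"

    # A 2D gel image that contains the identified spots.
    gel = ET.SubElement(root, "gelImage", href="gel001.png")

    # One spot with its position, identification, and sequence.
    spot = ET.SubElement(gel, "spot", id="s1", x="120", y="340")
    ET.SubElement(spot, "identification").text = "example protein"
    ET.SubElement(spot, "sequence").text = "MKTAYIAK"  # placeholder sequence

    return root


# Serialize to text, as would be done before exchanging the record.
record = build_record()
xml_text = ET.tostring(record, encoding="unicode")

# Parse it back and read the spot identifiers, as a receiver might.
parsed = ET.fromstring(xml_text)
spot_ids = [s.get("id") for s in parsed.iter("spot")]
```

Nesting the spots under the gel image keeps each identification tied to the image it was read from, which is one plausible way to express the relationships the format is meant to carry.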