Cloning of the entire set of an organism's protein-coding open reading frames (ORFs), or 'ORFeome', is a means of connecting the genome to downstream 'omics' applications. Here we report a proteome-scale study of the fission yeast Schizosaccharomyces pombe based on cloning of the ORFeome. Taking advantage of a recombination-based cloning system, we obtained 4,910 ORFs in a form that is readily usable in various analyses. First, we evaluated ORF prediction in the fission yeast genome project by expressing each ORF tagged at the 3' terminus. Next, we determined the localization of 4,431 proteins, corresponding to approximately 90% of the fission yeast proteome, by tagging each ORF with the yellow fluorescent protein. Furthermore, using leptomycin B, an inhibitor of the nuclear export protein Crm1, we identified 285 proteins whose localization is regulated by Crm1.
We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization.
Schizosaccharomyces pombe has an open reading frame, which we named alr1(+), encoding a putative protein similar to bacterial alanine racemase. We cloned the alr1(+) gene in Escherichia coli and purified the gene product (Alr1p), with an M(r) of 41,590, to homogeneity. Alr1p contains pyridoxal 5'-phosphate as a coenzyme and catalyzes the racemization of alanine with apparent K(m) and V(max) values as follows: for L-alanine, 5.0 mM and 670 micromol/min/mg, respectively, and for D-alanine, 2.4 mM and 350 micromol/min/mg, respectively. The enzyme is almost specific to alanine, but L-serine and L-2-aminobutyrate are racemized slowly at rates 3.7 and 0.37% of that of L-alanine, respectively. S. pombe uses D-alanine as a sole nitrogen source, but deletion of the alr1(+) gene resulted in retarded growth on the same medium. This indicates that S. pombe has catabolic pathways for both enantiomers of alanine and that the pathway for L-alanine coupled with racemization plays a major role in the catabolism of D-alanine. Saccharomyces cerevisiae differs markedly from S. pombe: S. cerevisiae uses L-alanine but not D-alanine as a sole nitrogen source. Moreover, D-alanine is toxic to S. cerevisiae. However, heterologous expression of the alr1(+) gene enabled S. cerevisiae to grow efficiently on D-alanine as a sole nitrogen source. The recombinant yeast was relieved from the toxicity of D-alanine.