Curation of the genome annotation of Pichia pastoris (Komagataella phaffii) CBS7435 from gene level to protein function

Minoska Valli, Nadine E Tatto, Armin Peymann, Clemens Gruber, Nils Landes, Heinz Ekker, Gerhard G Thallinger, Diethard Mattanovich, Brigitte Gasser, Alexandra B Graf

Research output: Contribution to journalArticlepeer-review

Abstract

As manually curated and non-automated BLAST analysis of the published Pichia pastoris genome sequences revealed many differences between the gene annotations of the strains GS115 and CBS7435, RNA-Seq analysis, supported by proteomics, was performed to improve the genome annotation. Detailed analysis of sequence alignment and protein domain predictions were made to extend the functional genome annotation to all P. pastoris sequences. This allowed the identification of 492 new ORFs, 4916 hypothetical UTRs and the correction of 341 incorrect ORF predictions, which were mainly due to the presence of upstream ATG or erroneous intron predictions. Moreover, 175 previously erroneously annotated ORFs need to be removed from the annotation. In total, we have annotated 5325 ORFs. Regarding the functionality of those genes, we improved all gene and protein descriptions. Thereby, the percentage of ORFs with functional annotation was increased from 48% to 73%. Furthermore, we defined functional groups, covering 25 biological cellular processes of interest, by grouping all genes that are part of the defined process. All data are presented in the newly launched genome browser and database available at www.pichiagenome.org In summary, we present a wide spectrum of curation of the P. pastoris genome annotation from gene level to protein function.

Original languageEnglish
Article number fow051
JournalFEMS Yeast Research
Volume16
Issue number6
DOIs
Publication statusPublished - Sept 2016

Keywords

  • Journal Article

Fingerprint

Dive into the research topics of 'Curation of the genome annotation of Pichia pastoris (Komagataella phaffii) CBS7435 from gene level to protein function'. Together they form a unique fingerprint.

Cite this