Assessment of a complete and classified platelet proteome from genome-wide transcripts of human platelets and megakaryocytes covering platelet functions

Jingnan Huang; Frauke Swieringa; Fiorella A Solari; Isabella Provenzale; Luigi Grassi; Ilaria De Simone; Constance C F M J Baaten; Rachel Cavill; Albert Sickmann; Mattia Frontini; Johan W M Heemskerk

doi:10.1038/s41598-021-91661-x

Assessment of a complete and classified platelet proteome from genome-wide transcripts of human platelets and megakaryocytes covering platelet functions

Jingnan Huang^*, Frauke Swieringa, Fiorella A Solari, Isabella Provenzale, Luigi Grassi, Ilaria De Simone, Constance C F M J Baaten, Rachel Cavill, Albert Sickmann, Mattia Frontini, Johan W M Heemskerk^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Novel platelet and megakaryocyte transcriptome analysis allows prediction of the full or theoretical proteome of a representative human platelet. Here, we integrated the established platelet proteomes from six cohorts of healthy subjects, encompassing 5.2 k proteins, with two novel genome-wide transcriptomes (57.8 k mRNAs). For 14.8 k protein-coding transcripts, we assigned the proteins to 21 UniProt-based classes, based on their preferential intracellular localization and presumed function. This classified transcriptome-proteome profile of platelets revealed: (i) Absence of 37.2 k genome-wide transcripts. (ii) High quantitative similarity of platelet and megakaryocyte transcriptomes (R = 0.75) for 14.8 k protein-coding genes, but not for 3.8 k RNA genes or 1.9 k pseudogenes (R = 0.43-0.54), suggesting redistribution of mRNAs upon platelet shedding from megakaryocytes. (iii) Copy numbers of 3.5 k proteins that were restricted in size by the corresponding transcript levels (iv) Near complete coverage of identified proteins in the relevant transcriptome (log2fpkm > 0.20) except for plasma-derived secretory proteins, pointing to adhesion and uptake of such proteins. (v) Underrepresentation in the identified proteome of nuclear-related, membrane and signaling proteins, as well proteins with low-level transcripts. We then constructed a prediction model, based on protein function, transcript level and (peri)nuclear localization, and calculated the achievable proteome at ~ 10 k proteins. Model validation identified 1.0 k additional proteins in the predicted classes. Network and database analysis revealed the presence of 2.4 k proteins with a possible role in thrombosis and hemostasis, and 138 proteins linked to platelet-related disorders. This genome-wide platelet transcriptome and (non)identified proteome database thus provides a scaffold for discovering the roles of unknown platelet proteins in health and disease.

Original language	English
Article number	12358
Number of pages	18
Journal	Scientific Reports
Volume	11
Issue number	1
DOIs	https://doi.org/10.1038/s41598-021-91661-x
Publication status	Published - 11 Jun 2021

Keywords

BLOOD
F-ACTIN
INHIBITION
LANDSCAPE
METABOLISM
MUTATIONS
PHOSPHOPROTEOME
REVEALS

Access to Document

10.1038/s41598-021-91661-xLicence: CC BY

Cite this

Huang, J., Swieringa, F., Solari, F. A., Provenzale, I., Grassi, L., De Simone, I., Baaten, C. C. F. M. J., Cavill, R., Sickmann, A., Frontini, M., & Heemskerk, J. W. M. (2021). Assessment of a complete and classified platelet proteome from genome-wide transcripts of human platelets and megakaryocytes covering platelet functions. Scientific Reports, 11(1), Article 12358. https://doi.org/10.1038/s41598-021-91661-x

@article{8fc05d1f460348db8ca88cad07ecf83c,

title = "Assessment of a complete and classified platelet proteome from genome-wide transcripts of human platelets and megakaryocytes covering platelet functions",

abstract = "Novel platelet and megakaryocyte transcriptome analysis allows prediction of the full or theoretical proteome of a representative human platelet. Here, we integrated the established platelet proteomes from six cohorts of healthy subjects, encompassing 5.2 k proteins, with two novel genome-wide transcriptomes (57.8 k mRNAs). For 14.8 k protein-coding transcripts, we assigned the proteins to 21 UniProt-based classes, based on their preferential intracellular localization and presumed function. This classified transcriptome-proteome profile of platelets revealed: (i) Absence of 37.2 k genome-wide transcripts. (ii) High quantitative similarity of platelet and megakaryocyte transcriptomes (R = 0.75) for 14.8 k protein-coding genes, but not for 3.8 k RNA genes or 1.9 k pseudogenes (R = 0.43-0.54), suggesting redistribution of mRNAs upon platelet shedding from megakaryocytes. (iii) Copy numbers of 3.5 k proteins that were restricted in size by the corresponding transcript levels (iv) Near complete coverage of identified proteins in the relevant transcriptome (log2fpkm > 0.20) except for plasma-derived secretory proteins, pointing to adhesion and uptake of such proteins. (v) Underrepresentation in the identified proteome of nuclear-related, membrane and signaling proteins, as well proteins with low-level transcripts. We then constructed a prediction model, based on protein function, transcript level and (peri)nuclear localization, and calculated the achievable proteome at ~ 10 k proteins. Model validation identified 1.0 k additional proteins in the predicted classes. Network and database analysis revealed the presence of 2.4 k proteins with a possible role in thrombosis and hemostasis, and 138 proteins linked to platelet-related disorders. This genome-wide platelet transcriptome and (non)identified proteome database thus provides a scaffold for discovering the roles of unknown platelet proteins in health and disease.",

keywords = "BLOOD, F-ACTIN, INHIBITION, LANDSCAPE, METABOLISM, MUTATIONS, PHOSPHOPROTEOME, REVEALS",

author = "Jingnan Huang and Frauke Swieringa and Solari, {Fiorella A} and Isabella Provenzale and Luigi Grassi and {De Simone}, Ilaria and Baaten, {Constance C F M J} and Rachel Cavill and Albert Sickmann and Mattia Frontini and Heemskerk, {Johan W M}",

year = "2021",

month = jun,

day = "11",

doi = "10.1038/s41598-021-91661-x",

language = "English",

volume = "11",

journal = "Scientific Reports",

issn = "2045-2322",

publisher = "Nature Publishing Group",

number = "1",

}

Huang, J, Swieringa, F, Solari, FA, Provenzale, I, Grassi, L, De Simone, I, Baaten, CCFMJ , Cavill, R, Sickmann, A, Frontini, M & Heemskerk, JWM 2021, 'Assessment of a complete and classified platelet proteome from genome-wide transcripts of human platelets and megakaryocytes covering platelet functions', Scientific Reports, vol. 11, no. 1, 12358. https://doi.org/10.1038/s41598-021-91661-x

TY - JOUR

T1 - Assessment of a complete and classified platelet proteome from genome-wide transcripts of human platelets and megakaryocytes covering platelet functions

AU - Huang, Jingnan

AU - Swieringa, Frauke

AU - Solari, Fiorella A

AU - Provenzale, Isabella

AU - Grassi, Luigi

AU - De Simone, Ilaria

AU - Baaten, Constance C F M J

AU - Cavill, Rachel

AU - Sickmann, Albert

AU - Frontini, Mattia

AU - Heemskerk, Johan W M

PY - 2021/6/11

Y1 - 2021/6/11

N2 - Novel platelet and megakaryocyte transcriptome analysis allows prediction of the full or theoretical proteome of a representative human platelet. Here, we integrated the established platelet proteomes from six cohorts of healthy subjects, encompassing 5.2 k proteins, with two novel genome-wide transcriptomes (57.8 k mRNAs). For 14.8 k protein-coding transcripts, we assigned the proteins to 21 UniProt-based classes, based on their preferential intracellular localization and presumed function. This classified transcriptome-proteome profile of platelets revealed: (i) Absence of 37.2 k genome-wide transcripts. (ii) High quantitative similarity of platelet and megakaryocyte transcriptomes (R = 0.75) for 14.8 k protein-coding genes, but not for 3.8 k RNA genes or 1.9 k pseudogenes (R = 0.43-0.54), suggesting redistribution of mRNAs upon platelet shedding from megakaryocytes. (iii) Copy numbers of 3.5 k proteins that were restricted in size by the corresponding transcript levels (iv) Near complete coverage of identified proteins in the relevant transcriptome (log2fpkm > 0.20) except for plasma-derived secretory proteins, pointing to adhesion and uptake of such proteins. (v) Underrepresentation in the identified proteome of nuclear-related, membrane and signaling proteins, as well proteins with low-level transcripts. We then constructed a prediction model, based on protein function, transcript level and (peri)nuclear localization, and calculated the achievable proteome at ~ 10 k proteins. Model validation identified 1.0 k additional proteins in the predicted classes. Network and database analysis revealed the presence of 2.4 k proteins with a possible role in thrombosis and hemostasis, and 138 proteins linked to platelet-related disorders. This genome-wide platelet transcriptome and (non)identified proteome database thus provides a scaffold for discovering the roles of unknown platelet proteins in health and disease.

AB - Novel platelet and megakaryocyte transcriptome analysis allows prediction of the full or theoretical proteome of a representative human platelet. Here, we integrated the established platelet proteomes from six cohorts of healthy subjects, encompassing 5.2 k proteins, with two novel genome-wide transcriptomes (57.8 k mRNAs). For 14.8 k protein-coding transcripts, we assigned the proteins to 21 UniProt-based classes, based on their preferential intracellular localization and presumed function. This classified transcriptome-proteome profile of platelets revealed: (i) Absence of 37.2 k genome-wide transcripts. (ii) High quantitative similarity of platelet and megakaryocyte transcriptomes (R = 0.75) for 14.8 k protein-coding genes, but not for 3.8 k RNA genes or 1.9 k pseudogenes (R = 0.43-0.54), suggesting redistribution of mRNAs upon platelet shedding from megakaryocytes. (iii) Copy numbers of 3.5 k proteins that were restricted in size by the corresponding transcript levels (iv) Near complete coverage of identified proteins in the relevant transcriptome (log2fpkm > 0.20) except for plasma-derived secretory proteins, pointing to adhesion and uptake of such proteins. (v) Underrepresentation in the identified proteome of nuclear-related, membrane and signaling proteins, as well proteins with low-level transcripts. We then constructed a prediction model, based on protein function, transcript level and (peri)nuclear localization, and calculated the achievable proteome at ~ 10 k proteins. Model validation identified 1.0 k additional proteins in the predicted classes. Network and database analysis revealed the presence of 2.4 k proteins with a possible role in thrombosis and hemostasis, and 138 proteins linked to platelet-related disorders. This genome-wide platelet transcriptome and (non)identified proteome database thus provides a scaffold for discovering the roles of unknown platelet proteins in health and disease.

KW - BLOOD

KW - F-ACTIN

KW - INHIBITION

KW - LANDSCAPE

KW - METABOLISM

KW - MUTATIONS

KW - PHOSPHOPROTEOME

KW - REVEALS

U2 - 10.1038/s41598-021-91661-x

DO - 10.1038/s41598-021-91661-x

M3 - Article

C2 - 34117303

SN - 2045-2322

VL - 11

JO - Scientific Reports

JF - Scientific Reports

IS - 1

M1 - 12358

ER -