Multi-cohort proteogenomic analyses reveal genetic effects across the proteome and diseasome
Authors
- Mine Koprulu
- Karl Smith-Byrne
- Brian Richard Ferolito
- Erin Macdonald-Dunlop
- Jian'an Luan
- Åsa K. Hedman
- Chibuzor Franklin Ogamba
- Jurgis Kuliesius
- Linda Repetto
- Anna Ramisch
- Fahim Abbasi
- Johan Ärnlöv
- Themistocles L. Assimes
- Hanna M. Björck
- Sophia Björkander
- Morten Böttcher
- Adam Stuart Butterworth
- Zhengming Chen
- Kelly Cho
- Robert Joseph Clarke
- Simon Riddington Cox
- Kamila Czene
- John Danesh
- George Dedoussis
- Sölve Elmståhl
- Niclas Eriksson
- Per Eriksson
- Tõnu Esko
- Aida Ferreiro-Iglesias
- Paul William Franks
- Jingyuan Fu
- J Michael Gaziano
- Mohsen Ghanbari
- Christian Gieger
- Arthur Gilly
- Harald Grallert
- Marc James Gunter
- Stefan Gustafsson
- Andreas Göteson
- Per Frans Leonard Hall
- Oskar Hansson
- Sarah Elizabeth Harris
- Caroline Hayward
- Christian Herder
- Natalia Hernandez-Pacheco
- Ziad Hijazi
- Robert F. Hillary
- Jemma Caroline Hopewell
- Shixian Hu
- Shih-Jen Hwang
- Christina Jern
- Åsa Johansson
- Lina Jonsson
- Anette Kalnapenkis
- Nicola Dorothy Kerrison
- Pik Fang Kho
- Lucija Klaric
- Leonhard Kohleick
- Julia Kraft
- Mikael Landén
- Daniel Levy
- Liming Li
- Lars Lind
- Jirong Long
- Niklas Mattsson-Carlgren
- Erik Melén
- Simon Kebede Merid
- Philipp Mertins
- Karl Michaëlsson
- Peter Loof Møller
- Federico Murgia
- Mette Nyegaard
- Young-Chan Park
- Ewan Pearson
- James Peters
- John Ross Petrie
- Grace Png
- Ozren Polašek
- Bram Peter Prins
- Stephan Ripke
- Michael Roden
- Palle Duun Rohde
- Saredo Said
- Xia Shen
- Jochen M. Schwenk
- Agneta Siegbahn
- J. Gustav Smith
- Tara M Stanne
- Karsten Suhre
- Johan Sundström
- Barbara Thorand
- Elsa Valdes-Marquez
- Costanza L. Vallerga
- Joyce B.J. van Meurs
- Ana Viñuela
- Urmo Võsa
- Lars Wallentin
- Robin G. Walters
- Nicholas John Wareham
- Joachim Eduard Weber
- Rinse Karel Weersma
- James F. Wilson
- Simon Winther
- Summaira Yasmeen
- Daniela Zanetti
- Eleftheria Zeggini
- Jing Hua Zhao
- Alexandra Zhernakova
- Daria V. Zhernakova
- Matthias Ziehm
- Benedikt Mathias Kessler
- Alexandre C. Pereira
- Anders Mälarstig
- Maik Pietzner
- Claudia Langenberg
Journal
- Cell
Citation
- Cell
Abstract
Understanding the genetic regulation of circulating protein levels can provide new insights into disease mechanisms. Here, we present the largest proteogenomic study to date (n = 78,664 participants across 38 studies), identifying >24,000 protein quantitative trait loci (QTLs) associated with 1,116 proteins, acting near to (n = 5,040) or distant (n = 19,698) from the cognate gene. Using machine learning-guided effector gene assignment, we provide genetic evidence for pathways, cell types, and tissues that modulate circulating protein levels, highlighting N-linked glycosylation as an important regulatory pathway. We demonstrate that genetic instruments of protein production/function (“cis”) versus modulation (“trans”) reveal distinct phenotypic insights. We identify proteins as candidates for drug targets and engagement (e.g., plasma furin and cardiovascular diseases) by comparing cis-based genetic evidence with protein-disease associations. Systematic triangulation of trans-protein QTLs (pQTLs) with genetic and protein associations across many diseases highlights potential drug repurposing opportunities, e.g., tyrosine kinase 2 (TYK2) inhibitors for rheumatoid arthritis. Our multi-cohort meta-analyses generate proteogenomic insights into disease mechanisms and new treatment opportunities.