- Objective: Used machine learning (ML) to systematically identify and quantify the contribution of over 1,800 characteristics (health, genetic, technical) to the variation in approximately 3,000 plasma protein levels across 43,240 UK Biobank individuals.
- Key Result: A median of 20 factors explained an average of 19.4% of protein variance. Modifiable characteristics (median: 10.0%) were found to explain significantly more variation than genetic factors (median: 3.9%).
- Implication: The study provides a crucial resource (knowledge graph, R package) and framework for understanding protein origins, clustering proteins by their drivers (e.g., disease, pre-analytical factors), and guiding the identification of biologically relevant biomarkers and drug target engagement markers.
No matching items