The Tabloid Proteome: Orthogonal use of public proteomics data to derive biologically related protein network

Presented by Surya Gupta, Postdoctoral Researcher at VIB-UGent. Mass-spectrometry-based proteomics experiments produce large amounts of data. While typically acquired to answer specific biological questions, these data can also be reused in orthogonal ways to reveal biological knowledge. We have de...


Bibliographic Details
Main Author: Salon, Data
Format: eBook
Language: English
Published: Data Science Salon 2019
Edition: 1st edition
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 02912nmm a2200301 u 4500
001 EB001951435
003 EBX01000000000000001114337
005 00000000000000.0
007 cr|||||||||||||||||||||
008 210123 ||| eng
100 1 |a Salon, Data 
245 0 0 |a The Tabloid Proteome  |h [electronic resource]  |b Orthogonal use of public proteomics data to derive biologically related protein network  |c Salon, Data 
250 |a 1st edition 
260 |b Data Science Salon  |c 2019 
300 |a 1 video file, approximately 19 min. 
653 |a Vidéo en continu 
653 |a Vidéos sur Internet 
653 |a streaming video / aat 
653 |a Internet videos / http://id.loc.gov/authorities/subjects/sh2007001612 
653 |a Streaming video / http://id.loc.gov/authorities/subjects/sh2005005237 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
500 |a Mode of access: World Wide Web 
500 |a Made available through: Safari, an O'Reilly Media Company 
776 |z 00000MJHOD3TZICI 
856 4 0 |u https://learning.oreilly.com/videos/~/00000MJHOD3TZICI/?ar  |x Verlag  |3 Volltext 
082 0 |a E VIDEO 
520 |a Presented by Surya Gupta, Postdoctoral Researcher at VIB-UGent. Mass-spectrometry-based proteomics experiments produce large amounts of data. While typically acquired to answer specific biological questions, these data can also be reused in orthogonal ways to reveal biological knowledge. We have developed a novel method for such orthogonal reuse of public proteomics data to detect biologically associated protein pairs. Mass-spectrometry proteomics experiments were obtained from the PRIDE database and reprocessed. For the identified proteins, we calculated a co-occurrence score using Jaccard similarity. Protein pairs with a score of at least 0.4 were mapped to five knowledgebases (Reactome, Ensembl, IntAct, BioGRID, and CORUM) to assign potential biological relevance. Of the 2325 protein pairs that pass the Jaccard similarity threshold, we found 81% to have biological annotation (68% from the five knowledgebases and 13% from GO terms). By comparison, fewer than 2% of randomly selected protein pairs were found to be annotated. Furthermore, to extend the usability and accessibility of the detected protein pairs for the research community, an online database called Tabloid Proteome was established. Our approach shows that by reusing publicly available data in a fully orthogonal way, effectively treating these data as a proteome-wide association study, we can extract various biologically meaningful patterns, which, moreover, were quite complementary to associations detected by established protein-protein interaction techniques. Additionally, Tabloid Proteome features a simple yet powerful web interface that allows fast and easy access to all these protein associations, with their possible biological annotation.
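
The co-occurrence scoring mentioned in the abstract can be illustrated with a minimal sketch. It assumes each protein is represented by the set of experiments (assays) in which it was identified, and scores a pair as the Jaccard similarity J(A, B) = |A ∩ B| / |A ∪ B|, keeping pairs at or above the 0.4 threshold stated in the abstract. The function names, the protein labels, and the toy data are hypothetical and chosen for illustration only; this is not the presenter's actual pipeline, which is not described in this record.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets: |A intersect B| / |A union B|."""
    union = a | b
    # Two empty sets share nothing; define their similarity as 0.0.
    return len(a & b) / len(union) if union else 0.0


def cooccurring_pairs(protein_to_assays, threshold=0.4):
    """Return {(protein_1, protein_2): score} for pairs scoring >= threshold.

    protein_to_assays maps a protein label to the set of assay IDs in which
    that protein was identified (an assumed input layout).
    """
    pairs = {}
    # Sort the labels so each unordered pair has one stable key order.
    for p, q in combinations(sorted(protein_to_assays), 2):
        score = jaccard(protein_to_assays[p], protein_to_assays[q])
        if score >= threshold:
            pairs[(p, q)] = score
    return pairs


# Hypothetical toy data: protein label -> assay IDs where it was identified.
toy = {
    "PROT_A": {1, 2, 3, 4},
    "PROT_B": {2, 3, 4, 5},
    "PROT_C": {7, 8},
}

pairs = cooccurring_pairs(toy)
for (p, q), score in pairs.items():
    print(f"{p}\t{q}\t{score:.2f}")
```

In this toy input, PROT_A and PROT_B share three of five distinct assays and so pass the threshold, while PROT_C shares no assays with either and is dropped. The all-pairs loop is quadratic in the number of proteins, which is fine for a sketch but would likely need a sparser strategy at repository scale.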