Data Science

Our works spans three directions: Artificial Intelligence methods to refine knowledge extraction from proteomics data, high dimensional biostatistical tools to assess the significance of biological discoveries, and software development to apply these methodologies to our own research and share them in transparent and reproducible manner. 

Methodological works in artificial intelligence and biostatistics:

  • Multi-omics analysis (notably proteogenomics),
  • Missing values imputation,
  • False discovery rate (FDR) control

Software tools:

  • Proline 
  • Prostar
  • All our other projects are available on our GitHub page including those accessible via public repositories such as the CRAN or BioConductor.

People involved

Recent publications

238648 DataScience 1 apa 6 date desc 253 https://www.edyp.fr/web/wp-content/plugins/zotpress/
%7B%22status%22%3A%22success%22%2C%22updateneeded%22%3Afalse%2C%22instance%22%3Afalse%2C%22meta%22%3A%7B%22request_last%22%3A0%2C%22request_next%22%3A0%2C%22used_cache%22%3Atrue%7D%2C%22data%22%3A%5B%7B%22key%22%3A%22RT2FI58V%22%2C%22library%22%3A%7B%22id%22%3A238648%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1535877%2C%22username%22%3A%22Christophe%20Bruley%22%2C%22name%22%3A%22Christophe%20Bruley%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fchristophe_bruley%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Hijazi%20et%20al.%22%2C%22parsedDate%22%3A%222025-03-31%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%202%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BHijazi%2C%20H.%2C%20Manessier%2C%20J.%2C%20Brugiere%2C%20S.%2C%20Ravnsborg%2C%20T.%2C%20Cour%26%23xE7%3Bon%2C%20M.%2C%20Brule%2C%20B.%2C%20Merienne%2C%20K.%2C%20Jensen%2C%20O.%20N.%2C%20Hesse%2C%20A.-M.%2C%20Bruley%2C%20C.%2C%20%26amp%3B%20Pflieger%2C%20D.%20%282025%29.%20Mind%20Your%20Spectra%3A%20Points%20to%20be%20Aware%20of%20When%20Validating%20the%20Identification%20of%20Isobaric%20Histone%20Peptidoforms.%20%26lt%3Bi%26gt%3BJournal%20of%20Proteome%20Research%26lt%3B%5C%2Fi%26gt%3B.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1021%5C%2Facs.jproteome.4c01056%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1021%5C%2Facs.jproteome.4c01056%26lt%3B%5C%2Fa%26gt%3B%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Mind%20Your%20Spectra%3A%20Points%20to%20be%20Aware%20of%20When%20Validating%20the%20Identification%20of%20Isobaric%20Histone%20Peptidoforms%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Hassan%22%2C%22lastName%22%3A%22Hijazi%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Julie%22%2C%22lastName%22%3A%22Manessier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sabine%22%2C%22lastName%22%3A%22Brugiere%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Tina%22%2C%22lastName%22%3A%22Ravnsborg%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marie%22%2C%22lastName%22%3A%22Cour%5Cu00e7on%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Baptiste%22%2C%22lastName%22%3A%22Brule%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Karine%22%2C%22lastName%22%3A%22Merienne%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ole%20N.%22%2C%22lastName%22%3A%22Jensen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne-Marie%22%2C%22lastName%22%3A%22Hesse%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christophe%22%2C%22lastName%22%3A%22Bruley%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Delphine%22%2C%22lastName%22%3A%22Pflieger%22%7D%5D%2C%22abstractNote%22%3A%22Mass%20spectrometry%20has%20become%20central%20to%20identifying%20and%20quantifying%20histone%20post-translational%20modifications%20%28PTMs%29%2C%20surpassing%20limitations%20of%20antibody-based%20methods.%20Histones%20are%20dynamically%20modified%20by%20multiple%20structures%2C%20especially%20at%20lysine%20residues%20on%20their%20N-terminal%20tails%2C%20to%20regulate%20DNA-templated%20processes.%20Reliable%20identification%20of%20histone%20PTMs%20remains%20challenging%20and%20still%20requires%20manual%20curation.%20This%20study%20focused%20on%20the%20Lys27-Arg40%20stretch%20of%20histone%20H3%2C%20considered%20four%20sequence%20variants%2C%20an%20increasing%20number%20of%20lysine%20PTMs%20and%20artifacts%20coming%20from%20histone%20sample%20processing%2C%20which%20resulted%20in%20many%20isobaric%20peptides.%20Our%20analysis%20revealed%20the%20value%20of%20low-mass%20b1%20and%20cyclic%20immonium%20fragment%20ions%20to%20validate%20identification%20of%20the%20distinct%20peptidoforms.%20We%20examined%20how%20MS%5C%2FMS%20spectra%20are%20transformed%20by%20common%20identification%20software%20during%20the%20conversion%20of%20raw%20files%20into%20peak%20lists%2C%20and%20highlighted%20how%20some%20parameters%20may%20erase%20the%20informative%20low-mass%20fragments.%20We%20targeted%20the%20detection%20of%2040%20H3%20K27-R40%20variant%20%5Cu00d7%20PTM%20combinations%2C%20including%20the%20mouse-specific%20variants%20H3mm7%20and%20H3mm13%2C%20in%20histone%20samples%20extracted%20from%20mouse%20testis%20and%20brain%20via%20a%20parallel%20reaction%20monitoring%20analysis.%20We%20only%20detected%20very%20low%20levels%20of%20unmodified%20H3mm7.%20Our%20work%20contributes%20to%20reliably%20deciphering%20the%20histone%20code%20shaped%20by%20distinct%20sequence%20variants%20and%20numerous%20combinations%20of%20PTMs.%22%2C%22date%22%3A%222025-03-31%22%2C%22language%22%3A%22eng%22%2C%22DOI%22%3A%2210.1021%5C%2Facs.jproteome.4c01056%22%2C%22ISSN%22%3A%221535-3907%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22VIGCK6G3%22%2C%22TK2D2GKR%22%2C%223B3CGQ6V%22%5D%2C%22dateModified%22%3A%222025-06-06T06%3A44%3A30Z%22%7D%7D%2C%7B%22key%22%3A%227LBLELWK%22%2C%22library%22%3A%7B%22id%22%3A238648%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1535877%2C%22username%22%3A%22Christophe%20Bruley%22%2C%22name%22%3A%22Christophe%20Bruley%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fchristophe_bruley%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Etourneau%20et%20al.%22%2C%22parsedDate%22%3A%222025-03-22%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%202%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BEtourneau%2C%20L.%2C%20Fancello%2C%20L.%2C%20Wieczorek%2C%20S.%2C%20Varoquaux%2C%20N.%2C%20%26amp%3B%20Burger%2C%20T.%20%282025%29.%20Penalized%20likelihood%20optimization%20for%20censored%20missing%20value%20imputation%20in%20proteomics.%20%26lt%3Bi%26gt%3BBiostatistics%20%28Oxford%2C%20England%29%26lt%3B%5C%2Fi%26gt%3B%2C%20%26lt%3Bi%26gt%3B26%26lt%3B%5C%2Fi%26gt%3B%281%29%2C%20kxaf006.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fbiostatistics%5C%2Fkxaf006%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fbiostatistics%5C%2Fkxaf006%26lt%3B%5C%2Fa%26gt%3B%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Penalized%20likelihood%20optimization%20for%20censored%20missing%20value%20imputation%20in%20proteomics%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Lucas%22%2C%22lastName%22%3A%22Etourneau%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Laura%22%2C%22lastName%22%3A%22Fancello%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Samuel%22%2C%22lastName%22%3A%22Wieczorek%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nelle%22%2C%22lastName%22%3A%22Varoquaux%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Burger%22%7D%5D%2C%22abstractNote%22%3A%22Label-free%20bottom-up%20proteomics%20using%20mass%20spectrometry%20and%20liquid%20chromatography%20has%20long%20been%20established%20as%20one%20of%20the%20most%20popular%20high-throughput%20analysis%20workflows%20for%20proteome%20characterization.%20However%2C%20it%20produces%20data%20hindered%20by%20complex%20and%20heterogeneous%20missing%20values%2C%20which%20imputation%20has%20long%20remained%20problematic.%20To%20cope%20with%20this%2C%20we%20introduce%20Pirat%2C%20an%20algorithm%20that%20harnesses%20this%20challenge%20using%20an%20original%20likelihood%20maximization%20strategy.%20Notably%2C%20it%20models%20the%20instrument%20limit%20by%20learning%20a%20global%20censoring%20mechanism%20from%20the%20data%20available.%20Moreover%2C%20it%20estimates%20the%20covariance%20matrix%20between%20enzymatic%20cleavage%20products%20%28ie%20peptides%20or%20precursor%20ions%29%2C%20while%20offering%20a%20natural%20way%20to%20integrate%20complementary%20transcriptomic%20information%20when%20multi-omic%20assays%20are%20available.%20Our%20benchmarking%20on%20several%20datasets%20covering%20a%20variety%20of%20experimental%20designs%20%28number%20of%20samples%2C%20acquisition%20mode%2C%20missingness%20patterns%2C%20etc.%29%20and%20using%20a%20variety%20of%20metrics%20%28differential%20analysis%20ground%20truth%20or%20imputation%20errors%29%20shows%20that%20Pirat%20outperforms%20all%20pre-existing%20imputation%20methods.%20Beyond%20the%20interest%20of%20Pirat%20as%20an%20imputation%20tool%2C%20these%20results%20pinpoint%20the%20need%20for%20a%20paradigm%20change%20in%20proteomics%20imputation%2C%20as%20most%20pre-existing%20strategies%20could%20be%20boosted%20by%20incorporating%20similar%20models%20to%20account%20for%20the%20instrument%20censorship%20or%20for%20the%20correlation%20structures%2C%20either%20grounded%20to%20the%20analytical%20pipeline%20or%20arising%20from%20a%20multi-omic%20approach.%22%2C%22date%22%3A%222025-03-22%22%2C%22language%22%3A%22eng%22%2C%22DOI%22%3A%2210.1093%5C%2Fbiostatistics%5C%2Fkxaf006%22%2C%22ISSN%22%3A%221468-4357%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22VIGCK6G3%22%2C%22TK2D2GKR%22%2C%223B3CGQ6V%22%5D%2C%22dateModified%22%3A%222025-06-06T06%3A51%3A58Z%22%7D%7D%2C%7B%22key%22%3A%22MQLXLEPL%22%2C%22library%22%3A%7B%22id%22%3A238648%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1535877%2C%22username%22%3A%22Christophe%20Bruley%22%2C%22name%22%3A%22Christophe%20Bruley%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fchristophe_bruley%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22P%5Cu00e9rez%20Compte%20et%20al.%22%2C%22parsedDate%22%3A%222024-04-29%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%202%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BP%26%23xE9%3Brez%20Compte%2C%20D.%2C%20Etourneau%2C%20L.%2C%20Hesse%2C%20A.-M.%2C%20Kraut%2C%20A.%2C%20Barthelon%2C%20J.%2C%20Sturm%2C%20N.%2C%20Borges%2C%20H.%2C%20Biennier%2C%20S.%2C%20Cour%26%23xE7%3Bon%2C%20M.%2C%20de%20Saint%20Loup%2C%20M.%2C%20Mignot%2C%20V.%2C%20Costentin%2C%20C.%2C%20Burger%2C%20T.%2C%20Cout%26%23xE9%3B%2C%20Y.%2C%20Bruley%2C%20C.%2C%20Decaens%2C%20T.%2C%20Jaquinod%2C%20M.%2C%20Boursier%2C%20J.%2C%20%26amp%3B%20Brun%2C%20V.%20%282024%29.%20Plasma%20ALS%20and%20Gal-3BP%20differentiate%20early%20from%20advanced%20liver%20fibrosis%20in%20MASLD%20patients.%20%26lt%3Bi%26gt%3BBiomarker%20Research%26lt%3B%5C%2Fi%26gt%3B%2C%20%26lt%3Bi%26gt%3B12%26lt%3B%5C%2Fi%26gt%3B%281%29%2C%2044.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs40364-024-00583-z%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs40364-024-00583-z%26lt%3B%5C%2Fa%26gt%3B%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Plasma%20ALS%20and%20Gal-3BP%20differentiate%20early%20from%20advanced%20liver%20fibrosis%20in%20MASLD%20patients%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22P%5Cu00e9rez%20Compte%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Lucas%22%2C%22lastName%22%3A%22Etourneau%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne-Marie%22%2C%22lastName%22%3A%22Hesse%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alexandra%22%2C%22lastName%22%3A%22Kraut%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Justine%22%2C%22lastName%22%3A%22Barthelon%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nathalie%22%2C%22lastName%22%3A%22Sturm%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22H%5Cu00e9l%5Cu00e8ne%22%2C%22lastName%22%3A%22Borges%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Salom%5Cu00e9%22%2C%22lastName%22%3A%22Biennier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marie%22%2C%22lastName%22%3A%22Cour%5Cu00e7on%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marc%22%2C%22lastName%22%3A%22de%20Saint%20Loup%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Victoria%22%2C%22lastName%22%3A%22Mignot%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Charlotte%22%2C%22lastName%22%3A%22Costentin%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Burger%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yohann%22%2C%22lastName%22%3A%22Cout%5Cu00e9%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christophe%22%2C%22lastName%22%3A%22Bruley%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Decaens%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michel%22%2C%22lastName%22%3A%22Jaquinod%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22J%5Cu00e9r%5Cu00f4me%22%2C%22lastName%22%3A%22Boursier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Virginie%22%2C%22lastName%22%3A%22Brun%22%7D%5D%2C%22abstractNote%22%3A%22BACKGROUND%3A%20Metabolic%20dysfunction-associated%20steatotic%20liver%20disease%20%28MASLD%29%20is%20estimated%20to%20affect%2030%25%20of%20the%20world%26%23039%3Bs%20population%2C%20and%20its%20prevalence%20is%20increasing%20in%20line%20with%20obesity.%20Liver%20fibrosis%20is%20closely%20related%20to%20mortality%2C%20making%20it%20the%20most%20important%20clinical%20parameter%20for%20MASLD.%20It%20is%20currently%20assessed%20by%20liver%20biopsy%20-%20an%20invasive%20procedure%20that%20has%20some%20limitations.%20There%20is%20thus%20an%20urgent%20need%20for%20a%20reliable%20non-invasive%20means%20to%20diagnose%20earlier%20MASLD%20stages.%5CnMETHODS%3A%20A%20discovery%20study%20was%20performed%20on%20158%20plasma%20samples%20from%20histologically-characterised%20MASLD%20patients%20using%20mass%20spectrometry%20%28MS%29-based%20quantitative%20proteomics.%20Differentially%20abundant%20proteins%20were%20selected%20for%20verification%20by%20ELISA%20in%20the%20same%20cohort.%20They%20were%20subsequently%20validated%20in%20an%20independent%20MASLD%20cohort%20%28n%5Cu2009%3D%5Cu2009200%29.%5CnRESULTS%3A%20From%20the%2072%20proteins%20differentially%20abundant%20between%20patients%20with%20early%20%28F0-2%29%20and%20advanced%20fibrosis%20%28F3-4%29%2C%20we%20selected%20Insulin-like%20growth%20factor-binding%20protein%20complex%20acid%20labile%20subunit%20%28ALS%29%20and%20Galectin-3-binding%20protein%20%28Gal-3BP%29%20for%20further%20study.%20In%20our%20validation%20cohort%2C%20AUROCs%20with%2095%25%20CIs%20of%200.744%20%5B0.673%5Cu00a0-%5Cu00a00.816%5D%20and%200.735%20%5B0.661%5Cu00a0-%5Cu00a00.81%5D%20were%20obtained%20for%20ALS%20and%20Gal-3BP%2C%20respectively.%20Combining%20ALS%20and%20Gal-3BP%20improved%20the%20assessment%20of%20advanced%20liver%20fibrosis%2C%20giving%20an%20AUROC%20of%200.796%20%5B0.731.%200.862%5D.%20The%20%7BALS%3B%20Gal-3BP%7D%20model%20surpassed%20classic%20fibrosis%20panels%20in%20predicting%20advanced%20liver%20fibrosis.%5CnCONCLUSIONS%3A%20Further%20investigations%20with%20complementary%20cohorts%20will%20be%20needed%20to%20confirm%20the%20usefulness%20of%20ALS%20and%20Gal-3BP%20individually%20and%20in%20combination%20with%20other%20biomarkers%20for%20diagnosis%20of%20liver%20fibrosis.%20With%20the%20availability%20of%20ELISA%20assays%2C%20these%20findings%20could%20be%20rapidly%20clinically%20translated%2C%20providing%20direct%20benefits%20for%20patients.%22%2C%22date%22%3A%222024-04-29%22%2C%22language%22%3A%22eng%22%2C%22DOI%22%3A%2210.1186%5C%2Fs40364-024-00583-z%22%2C%22ISSN%22%3A%222050-7771%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22VIGCK6G3%22%2C%22TK2D2GKR%22%2C%223B3CGQ6V%22%5D%2C%22dateModified%22%3A%222025-06-06T06%3A51%3A51Z%22%7D%7D%2C%7B%22key%22%3A%22MJNTMVCZ%22%2C%22library%22%3A%7B%22id%22%3A238648%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1535877%2C%22username%22%3A%22Christophe%20Bruley%22%2C%22name%22%3A%22Christophe%20Bruley%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fchristophe_bruley%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Burger%22%2C%22parsedDate%22%3A%222023-09%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%202%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BBurger%2C%20T.%20%282023%29.%20Controlling%20for%20false%20discoveries%20subsequently%20to%20large%20scale%20one-way%20ANOVA%20testing%20in%20proteomics%3A%20Practical%20considerations.%20%26lt%3Bi%26gt%3BProteomics%26lt%3B%5C%2Fi%26gt%3B%2C%20%26lt%3Bi%26gt%3B23%26lt%3B%5C%2Fi%26gt%3B%2818%29%2C%20e2200406.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1002%5C%2Fpmic.202200406%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1002%5C%2Fpmic.202200406%26lt%3B%5C%2Fa%26gt%3B%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Controlling%20for%20false%20discoveries%20subsequently%20to%20large%20scale%20one-way%20ANOVA%20testing%20in%20proteomics%3A%20Practical%20considerations%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Burger%22%7D%5D%2C%22abstractNote%22%3A%22In%20discovery%20proteomics%2C%20as%20well%20as%20many%20other%20%26quot%3Bomic%26quot%3B%20approaches%2C%20the%20possibility%20to%20test%20for%20the%20differential%20abundance%20of%20hundreds%20%28or%20of%20thousands%29%20of%20features%20simultaneously%20is%20appealing%2C%20despite%20requiring%20specific%20statistical%20safeguards%2C%20among%20which%20controlling%20for%20the%20false%20discovery%20rate%20%28FDR%29%20has%20become%20standard.%20Moreover%2C%20when%20more%20than%20two%20biological%20conditions%20or%20group%20treatments%20are%20considered%2C%20it%20has%20become%20customary%20to%20rely%20on%20the%20one-way%20analysis%20of%20variance%20%28ANOVA%29%20framework%2C%20where%20a%20first%20global%20differential%20abundance%20landscape%20provided%20by%20an%20omnibus%20test%20can%20be%20subsequently%20refined%20using%20various%20post-hoc%20tests%20%28PHTs%29.%20However%2C%20the%20interactions%20between%20the%20FDR%20control%20procedures%20and%20the%20PHTs%20are%20complex%2C%20because%20both%20correspond%20to%20different%20types%20of%20multiple%20test%20corrections%20%28MTCs%29.%20This%20article%20surveys%20various%20ways%20to%20orchestrate%20them%20in%20a%20data%20processing%20workflow%20and%20discusses%20their%20pros%20and%20cons.%22%2C%22date%22%3A%222023-09%22%2C%22language%22%3A%22eng%22%2C%22DOI%22%3A%2210.1002%5C%2Fpmic.202200406%22%2C%22ISSN%22%3A%221615-9861%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22VIGCK6G3%22%2C%22TK2D2GKR%22%2C%223B3CGQ6V%22%5D%2C%22dateModified%22%3A%222025-06-06T06%3A40%3A58Z%22%7D%7D%2C%7B%22key%22%3A%22WDUVKNDT%22%2C%22library%22%3A%7B%22id%22%3A238648%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1535877%2C%22username%22%3A%22Christophe%20Bruley%22%2C%22name%22%3A%22Christophe%20Bruley%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fchristophe_bruley%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Dupierris%20et%20al.%22%2C%22parsedDate%22%3A%222023-01%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%202%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BDupierris%2C%20V.%2C%20Hesse%2C%20A.-M.%2C%20Menetrey%2C%20J.-P.%2C%20Bouyssi%26%23xE9%3B%2C%20D.%2C%20Burger%2C%20T.%2C%20Cout%26%23xE9%3B%2C%20Y.%2C%20%26amp%3B%20Bruley%2C%20C.%20%282023%29.%20Validation%20of%20MS%5C%2FMS%20Identifications%20and%20Label-Free%20Quantification%20Using%20Proline.%20%26lt%3Bi%26gt%3BMethods%20in%20Molecular%20Biology%20%28Clifton%2C%20N.J.%29%26lt%3B%5C%2Fi%26gt%3B%2C%20%26lt%3Bi%26gt%3B2426%26lt%3B%5C%2Fi%26gt%3B%2C%2067%26%23x2013%3B89.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-1-0716-1967-4_4%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-1-0716-1967-4_4%26lt%3B%5C%2Fa%26gt%3B%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Validation%20of%20MS%5C%2FMS%20Identifications%20and%20Label-Free%20Quantification%20Using%20Proline%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22V%5Cu00e9ronique%22%2C%22lastName%22%3A%22Dupierris%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne-Marie%22%2C%22lastName%22%3A%22Hesse%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jean-Philippe%22%2C%22lastName%22%3A%22Menetrey%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Bouyssi%5Cu00e9%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Burger%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yohann%22%2C%22lastName%22%3A%22Cout%5Cu00e9%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christophe%22%2C%22lastName%22%3A%22Bruley%22%7D%5D%2C%22abstractNote%22%3A%22In%20the%20proteomics%20field%2C%20the%20production%20and%20publication%20of%20reliable%20mass%20spectrometry%20%28MS%29-based%20label-free%20quantitative%20results%20is%20a%20major%20concern.%20Due%20to%20the%20intrinsic%20complexity%20of%20bottom-up%20proteomics%20experiments%20%28requiring%20aggregation%20of%20data%20relating%20to%20both%20precursor%20and%20fragment%20peptide%20ions%20into%20protein%20information%2C%20and%20matching%20this%20data%20across%20samples%29%2C%20inaccuracies%20and%20errors%20can%20occur%20throughout%20the%20data-processing%20pipeline.%20In%20a%20classical%20label-free%20quantification%20workflow%2C%20the%20validation%20of%20identification%20results%20is%20critical%20since%20errors%20made%20at%20this%20first%20stage%20of%20the%20workflow%20may%20have%20an%20impact%20on%20the%20following%20steps%20and%20therefore%20on%20the%20final%20result.%20Although%20false%20discovery%20rate%20%28FDR%29%20of%20the%20identification%20is%20usually%20controlled%20by%20using%20the%20popular%20target-decoy%20method%2C%20it%20has%20been%20demonstrated%20that%20this%20method%20can%20sometimes%20lead%20to%20inaccurate%20FDR%20estimates.%20This%20protocol%20shows%20how%20Proline%20can%20be%20used%20to%20validate%20identification%20results%20by%20using%20the%20method%20based%20on%20the%20Benjamini-Hochberg%20procedure%20and%20then%20quantify%20the%20identified%20ions%20and%20proteins%20in%20a%20single%20software%20environment%20providing%20data%20curation%20capabilities%20and%20computational%20efficiency.%22%2C%22date%22%3A%222023-01%22%2C%22language%22%3A%22eng%22%2C%22DOI%22%3A%2210.1007%5C%2F978-1-0716-1967-4_4%22%2C%22ISSN%22%3A%221940-6029%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22VIGCK6G3%22%2C%22TK2D2GKR%22%2C%227KBU675S%22%5D%2C%22dateModified%22%3A%222025-06-06T06%3A41%3A06Z%22%7D%7D%2C%7B%22key%22%3A%22548GBEHI%22%2C%22library%22%3A%7B%22id%22%3A238648%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1535877%2C%22username%22%3A%22Christophe%20Bruley%22%2C%22name%22%3A%22Christophe%20Bruley%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fchristophe_bruley%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Etourneau%20et%20al.%22%2C%22parsedDate%22%3A%222023-01%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%202%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BEtourneau%2C%20L.%2C%20Varoquaux%2C%20N.%2C%20%26amp%3B%20Burger%2C%20T.%20%282023%29.%20Unveiling%20the%20Links%20Between%20Peptide%20Identification%20and%20Differential%20Analysis%20FDR%20Controls%20by%20Means%20of%20a%20Practical%20Introduction%20to%20Knockoff%20Filters.%20%26lt%3Bi%26gt%3BMethods%20in%20Molecular%20Biology%20%28Clifton%2C%20N.J.%29%26lt%3B%5C%2Fi%26gt%3B%2C%20%26lt%3Bi%26gt%3B2426%26lt%3B%5C%2Fi%26gt%3B%2C%201%26%23x2013%3B24.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-1-0716-1967-4_1%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-1-0716-1967-4_1%26lt%3B%5C%2Fa%26gt%3B%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Unveiling%20the%20Links%20Between%20Peptide%20Identification%20and%20Differential%20Analysis%20FDR%20Controls%20by%20Means%20of%20a%20Practical%20Introduction%20to%20Knockoff%20Filters%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Lucas%22%2C%22lastName%22%3A%22Etourneau%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nelle%22%2C%22lastName%22%3A%22Varoquaux%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Burger%22%7D%5D%2C%22abstractNote%22%3A%22In%20proteomic%20differential%20analysis%2C%20FDR%20control%20is%20often%20performed%20through%20a%20multiple%20test%20correction%20%28i.e.%2C%20the%20adjustment%20of%20the%20original%20p-values%29.%20In%20this%20protocol%2C%20we%20apply%20a%20recent%20and%20alternative%20method%2C%20based%20on%20so-called%20knockoff%20filters.%20It%20shares%20interesting%20conceptual%20similarities%20with%20the%20target-decoy%20competition%20procedure%2C%20classically%20used%20in%20proteomics%20for%20FDR%20control%20at%20peptide%20identification.%20To%20provide%20practitioners%20with%20a%20unified%20understanding%20of%20FDR%20control%20in%20proteomics%2C%20we%20apply%20the%20knockoff%20procedure%20on%20real%20and%20simulated%20quantitative%20datasets.%20Leveraging%20these%20comparisons%2C%20we%20propose%20to%20adapt%20the%20knockoff%20procedure%20to%20better%20fit%20the%20specificities%20of%20quantitative%20proteomic%20data%20%28mainly%20very%20few%20samples%29.%20Performances%20of%20knockoff%20procedure%20are%20compared%20with%20those%20of%20the%20classical%20Benjamini-Hochberg%20procedure%2C%20hereby%20shedding%20a%20new%20light%20on%20the%20strengths%20and%20weaknesses%20of%20target-decoy%20competition.%22%2C%22date%22%3A%222023-01%22%2C%22language%22%3A%22eng%22%2C%22DOI%22%3A%2210.1007%5C%2F978-1-0716-1967-4_1%22%2C%22ISSN%22%3A%221940-6029%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22VIGCK6G3%22%2C%22TK2D2GKR%22%2C%227KBU675S%22%5D%2C%22dateModified%22%3A%222025-06-06T06%3A41%3A17Z%22%7D%7D%5D%7D
Hijazi, H., Manessier, J., Brugiere, S., Ravnsborg, T., Courçon, M., Brule, B., Merienne, K., Jensen, O. N., Hesse, A.-M., Bruley, C., & Pflieger, D. (2025). Mind Your Spectra: Points to be Aware of When Validating the Identification of Isobaric Histone Peptidoforms. Journal of Proteome Research. https://doi.org/10.1021/acs.jproteome.4c01056
Etourneau, L., Fancello, L., Wieczorek, S., Varoquaux, N., & Burger, T. (2025). Penalized likelihood optimization for censored missing value imputation in proteomics. Biostatistics (Oxford, England), 26(1), kxaf006. https://doi.org/10.1093/biostatistics/kxaf006
Pérez Compte, D., Etourneau, L., Hesse, A.-M., Kraut, A., Barthelon, J., Sturm, N., Borges, H., Biennier, S., Courçon, M., de Saint Loup, M., Mignot, V., Costentin, C., Burger, T., Couté, Y., Bruley, C., Decaens, T., Jaquinod, M., Boursier, J., & Brun, V. (2024). Plasma ALS and Gal-3BP differentiate early from advanced liver fibrosis in MASLD patients. Biomarker Research, 12(1), 44. https://doi.org/10.1186/s40364-024-00583-z
Burger, T. (2023). Controlling for false discoveries subsequently to large scale one-way ANOVA testing in proteomics: Practical considerations. Proteomics, 23(18), e2200406. https://doi.org/10.1002/pmic.202200406
Dupierris, V., Hesse, A.-M., Menetrey, J.-P., Bouyssié, D., Burger, T., Couté, Y., & Bruley, C. (2023). Validation of MS/MS Identifications and Label-Free Quantification Using Proline. Methods in Molecular Biology (Clifton, N.J.), 2426, 67–89. https://doi.org/10.1007/978-1-0716-1967-4_4
Etourneau, L., Varoquaux, N., & Burger, T. (2023). Unveiling the Links Between Peptide Identification and Differential Analysis FDR Controls by Means of a Practical Introduction to Knockoff Filters. Methods in Molecular Biology (Clifton, N.J.), 2426, 1–24. https://doi.org/10.1007/978-1-0716-1967-4_1

Highlights

Proline 2.3 Release

Proline Release 2.3 This new release focuses mainly on Isobaric labelling quantification in particular by taking into account purity correction...

Read More

ProteoRE version 2.1 is released

We are pleased to announce the release of ProteoRE 2.1, a user-oriented Galaxy-based instance, for the functional interpretation and exploration...

Read More

Projects

Proline

EDyP

Proline Proline is a landmark software suite for quantitative proteomics allowing to collect, process, visualize and publish proteomics datasets [Bouyssié et al., […]

Read More

ProStaR

EDyP

ProStaR ProStaR, is devoted to the statistical analysis of label-free quantitative proteomics data produced by bottom-up LC-MS/MS experiments. Prostar project has two […]

Read More