Package: wrProteo 2.0.2

wrProteo: Proteomics Data Analysis Functions

Data analysis of proteomics experiments by mass spectrometry is supported by this collection of functions mostly dedicated to the analysis of (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (eg from UniProt Consortium <doi:10.1093/nar/gky1049>) can be read with automatic parsing and multiple annotation types (like species origin, abbreviated gene names, etc) extracted. Initial results from multiple software for protein (and peptide) quantitation can be imported (to a common format): MaxQuant (Tyanova et al 2016 <doi:10.1038/nprot.2016.136>), Dia-NN (Demichev et al 2020 <doi:10.1038/s41592-019-0638-x>), Fragpipe (da Veiga et al 2020 <doi:10.1038/s41592-020-0912-y>), ionbot (Degroeve et al 2021 <doi:10.1101/2021.07.02.450686>), MassChroq (Valot et al 2011 <doi:10.1002/pmic.201100120>), OpenMS (Strauss et al 2021 <doi:10.1038/nmeth.3959>), ProteomeDiscoverer (Orsburn 2021 <doi:10.3390/proteomes9010015>), Proline (Bouyssie et al 2020 <doi:10.1093/bioinformatics/btaa118>), AlphaPept (preprint Strauss et al <doi:10.1101/2021.07.23.453379>) and Wombat-P (Bouyssie et al 2023 <doi:10.1021/acs.jproteome.3c00636>. Meta-data provided by initial analysis software and/or in sdrf format can be integrated to the analysis. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity or other reasons. Help is provided to inspect the data graphically to investigate the nature of NA-values via their respective replicate measurements and to help/confirm the choice of NA-replacement algorithms. Meta-data in sdrf-format (Perez-Riverol et al 2020 <doi:10.1021/acs.jproteome.0c00376>) or similar tabular formats can be imported and included. Missing values can be inspected and imputed based on the concept of NA-neighbours or other methods. Dedicated filtering and statistical testing using the framework of package 'limma' <doi:10.18129/B9.bioc.limma> can be run, enhanced by multiple rounds of NA-replacements to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark-tests (eg Navarro et al 2016 <doi:10.1038/nbt.3685>, Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>), can be run with special options considering such sub-groups during normalization and testing. Subsequently, ROC curves (Hand and Till 2001 <doi:10.1023/A:1010920819831>) can be constructed to compare multiple analysis approaches. As detailed example the data-set from Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>) quantified by MaxQuant, ProteomeDiscoverer, and Proline is provided with a detailed analysis of heterologous spike-in proteins.

Authors:Wolfgang Raffelsberger [aut, cre]

wrProteo_2.0.2.tar.gz
wrProteo_2.0.2.zip(r-4.7)wrProteo_2.0.2.zip(r-4.6)wrProteo_2.0.2.zip(r-4.5)
wrProteo_2.0.2.tgz(r-4.6-any)wrProteo_2.0.2.tgz(r-4.5-any)
wrProteo_2.0.2.tar.gz(r-4.7-any)wrProteo_2.0.2.tar.gz(r-4.6-any)
wrProteo_2.0.2.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION
card.svg |card.png
wrProteo/json (API)

# Install 'wrProteo' in R:
install.packages('wrProteo', repos = c('https://wraff.r-universe.dev', 'https://cloud.r-project.org'))

On CRAN:

Conda:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

3.49 score 1 packages 15 scripts 1.2k downloads 57 exports 9 dependencies

Last updated from:7bbf5f94df. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK173
source / vignettesOK288
linux-release-x86_64OK165
macos-release-arm64OK117
macos-oldrel-arm64OK116
windows-develOK95
windows-releaseOK149
windows-oldrelOK133
wasm-releaseOK157

Exports:.atomicMasses.checkKnitrProt.checkSetupGroups.commonSpecies.extrSpecPref.imputeNA.parseFastaHeader.plotQuantDistrAAmassAucROCcleanListCoNamescombineMultFilterNAimputconvAASeq2masscorColumnOrdercountNoOfCommonPeptidesexportAsWombatPexportSdrfDraftextractTestingResultsextrSpeciesAnnotfoldChangeArrow2fuseProteomicsProjectsgetUPS1accinspectSpeciesIndicisolNAneighbmassDeFormulamatrixNAinspectmatrixNAneighbourImputeplotROCrazorNoFilterreadAlphaPeptFilereadDiaNNFilereadDiaNNPeptidesreadFasta2readFragpipeFilereadIonbotPeptidesreadMassChroQFilereadMaxQuantFilereadMaxQuantPeptidesreadOpenMSFilereadProlineFilereadProtDiscovererPeptidesreadProtDiscovFilereadProteomeDiscovererFilereadProteomeDiscovererPeptidesreadSampleMetaDatareadSdrfreadUCSCtablereadUniProtExportreadWombatNormFileremoveSampleInListreplMissingProtNamesshortSoftwNamesummarizeForROCtest2grptestRobustToNAimputationVolcanoPlotW2writeFasta2

Dependencies:evaluatehighrknitrlimmaMASSstatmodwrMiscxfunyaml

Analyzing Proteomics UPS1 Spike-in Experiments (Example Ramus 2016 Dataset)
Introduction | Experimental Setup For Benchmark Tests | The Ramus Data-Set | Meta-Data Describing The Experiment (sdrf) | Key Elements And Additional Functions | Protein Identification and Initial Quantification | MaxQuant | ProteomeDiscoverer | Proline | Further Preparation Of Data | Basic Data Treatment | Structure of Experiment | Normalization | Presence of NA-values | NA-Imputation and Statistical Testing for Changes in Abundance | Analysis Using All Proteins Identified (Matrix + UPS1) | Pairwise Testing Summary | Volcano Plots | ROC for Multiple Pairs | Grouping of ROC Curves to Display Representative Ones | Plotting ROC Curves for the Best Cluster (the '+++++') | ROC Curves for 2nd Best Cluster (the '++++') | ROC Curves for the 3rd Best Cluster (the '+++') | ROC Curves for the 4th Best Cluster (the '++') | ROC Curves for the Weakest Cluster 1 (the '+') | Analysis Focussing on UPS1 Spike-In Proteins Only | Similarity by PCA (UPS1 proteins only) | PCA of UPS1 for ProteomeDiscoverer | PCA of UPS1 for MaxQuant | PCA of UPS1 for Proline | CV of Replicates | Testing All Individual UPS1 Proteins By Linear Regression | Linear Regression for each UPS1 : ProteomeDiscoverer | Linear Regression for each UPS1 : MaxQuant | Linear Regression for each UPS1 : Proline | Frequency Of Starting Levels For Regression | Global Comparison Of Regression Models | Summarize Linear Regression Results | Grouping of UPS1 Proteins to Display Representative Proteins | Representative UPS1-protein of the Best Group (the '+++++') | Representative UPS1-protein of the 2nd Best Group (the '++++') | Representative UPS1-protein of the 3rd Group (the '+++') | Representative UPS1-protein of the 4th Group (the '++') | Representative UPS1-protein of the 5th (And Last) Group (the '+') | Additional Comments | Acknowledgements | Session-Info

Last update: 2026-04-29
Started: 2020-10-18

Getting started with wrProteo
Introduction | Calculating Molecular Masses From Composition Formulas | Molecular masses based on (summed) chemical formulas | Molecular masses based on amino-acid sequence | Working With Fasta(Files) | Reading Fasta Files (from Uniprot) | Writing Sequences As Fasta Files | Analyzing Label-Free Quantitative Proteomics Data | Label-free Quantitative Proteomics Introduction | Import From Dedicated Quantification Algorithms/Software | MaxQuant: Import Protein Quantification Data | Adding Meta-Data at Import (Example MaxQuant) | Adding Experimental Setup (Sdrf) to Meta-Data at Import (Example MaxQuant) | Exporting Experimental Setup from MaxQuant to Draft-Sdrf | MaxQuant : Import Peptide Data | ProteomeDiscoverer : Import Protein Quantification | ProteomeDiscoverer : Import Peptide Data | DIA-NN: Import Protein Quantification Data | DIA-NN : Import Peptide Data | Proline : Import Protein Quantification Data | Fragpipe : Import Protein Quantification Data | MassChroQ : Import Protein Quantification Data | AlphaPept : Import Protein Quantification Data | Ionbot : Import Peptide Identification And Quantification Data | Wombat-P : Import Protein Quantification Data | OpenMS : Import Protein Quantification Data | Importing Sdrf Meta-Data | Removing Samples From Proteomics Projects | Combining Proteomics Projects | Normalization | Imputation of NA-values | Filtering | Statistical Testing | Data Exploration With Graphical Support | PCA | MA-plot | Volcano-Plot | Reporting Results | Further Steps | Protein Annotation | Appendix | Acknowledgements | Session-Info

Last update: 2026-04-29
Started: 2020-04-29

Readme and manuals

Help Manual

Help pageTopics
Molecular mass for Elements.atomicMasses
Checking presence of knitr and rmarkdown.checkKnitrProt
Additional/final Check And Adjustments To Sample-order After readSampleMetaData().checkSetupGroups
Get Matrix With UniProt Abbreviations For Selected Species As Well As Simple Names.commonSpecies
Extract Additional Information To Construct The Colum 'SpecType', Allows Adding Information From Fasta.extrSpecPref
Basic NA-imputaton (main).imputeNA
Parse Fasta Header.parseFastaHeader
Generic Plotting Of Density Distribution For Quantitation Import-functions.plotQuantDistr
Molecular mass for amino-acidsAAmass
AUC from ROC-curvesAucROC
Selective batch cleaning of sample- (ie column-) names in listcleanListCoNames
Combine Multiple Filters On NA-imputed DatacombineMultFilterNAimput
Molecular mass for amino-acidsconvAASeq2mass
Order Columns In List Of Matrixes, Data.frames And VectorscorColumnOrder
Compare in-silico digested proteomes for unique and shared peptides, counts per protein or as peptides Compare in-silico digested proteomes for unique and shared peptides, counts per protein or as peptides. The in-silico digestion may be performed separately using the package cleaver. Note: input must be list (or multiple names lists) of proteins with their respective peptides (eg by in-silico digestion).countNoOfCommonPeptides
Export As Wombat-P Set Of FilesexportAsWombatP
Export Sample Meta-data from Quantification-Software as Sdrf-draftexportSdrfDraft
Extract Results From Moderated t-testsextractTestingResults
Extract species annotationextrSpeciesAnnot
Add arrow for expected Fold-Change to VolcanoPlot or MA-plotfoldChangeArrow2
Combine Multiple Proteomics Data-SetsfuseProteomicsProjects
UniProt Accession-Numbers And Names Of UPS1 ProteinsgetUPS1acc
Inspect Species Indictaion Or Group of ProteinsinspectSpeciesIndic
Isolate NA-neighboursisolNAneighb
Molecular mass from chemical formulamassDeFormula
Histogram of content of NAs in matrixmatrixNAinspect
Imputation Of NA-Values Based On Non-NA ReplicatesmatrixNAneighbourImpute
Plot ROC curvesplotROC
Filter based on either number of total peptides and specific peptides or number of razor petidesrazorNoFilter
Read (Normalized) Quantitation Data Files Produced By AlphaPeptreadAlphaPeptFile
Read Tabulated Files Exported by DIA-NN At Protein LevelreadDiaNNFile
Read Tabulated Files Exported by DiaNN At Peptide LevelreadDiaNNPeptides
Read File Of Protein Sequences In Fasta FormatreadFasta2
Read Tabulated Files Exported by FragPipe At Protein LevelreadFragpipeFile
Read Tabulated Files Exported by Ionbot At Peptide LevelreadIonbotPeptides
Read tabulated files imported from MassChroQreadMassChroQFile
Read Quantitation Data-Files (proteinGroups.txt) Produced From MaxQuant At Protein LevelreadMaxQuantFile
Read Peptide Identification and Quantitation Data-Files (peptides.txt) Produced By MaxQuantreadMaxQuantPeptides
Read csv files exported by OpenMSreadOpenMSFile
Read xlsx, csv or tsv Files Exported From Proline And MS-AngelreadProlineFile
readProtDiscovererPeptides, deprecatedreadProtDiscovererPeptides
Read Tabulated Files Exported By ProteomeDiscoverer At Protein Level, DeprecatedreadProtDiscovFile
Read Tabulated Files Exported By ProteomeDiscoverer At Protein LevelreadProteomeDiscovererFile
Read Tabulated Files Exported By ProteomeDiscoverer At Peptide LevelreadProteomeDiscovererPeptides
Read Sample Meta-data from Quantification-Software And/Or Sdrf And Align To Experimental DatareadSampleMetaData
Read proteomics meta-data as sdrf filereadSdrf
Read annotation files from UCSCreadUCSCtable
Read protein annotation as exported from UniProt batch-conversionreadUniProtExport
Read (Normalized) Quantitation Data Files Produced By Wombat At Protein LevelreadWombatNormFile
Remove Samples/Columns From List Of MatrixesremoveSampleInList
Complement Missing EntryNames In AnnotationreplMissingProtNames
Get Short Names of Proteomics Quantitation SoftwareshortSoftwName
Summarize statistical test result for plotting ROC-curvessummarizeForROC
t-test each line of 2 groups of datatest2grp
Pairwise Testing Robust To NA-ImputationtestRobustToNAimputation
Deprecialed Volcano-plotVolcanoPlotW2
Write Sequences In Fasta Format To File This function writes sequences from character vector or parsed matrix to file as fasta formatted file (close to UniProt) This function also allows comparing the main vector of sequences with a reference vector 'ref' to check if any of the sequences therein are truncated.writeFasta2