Package: wrProteo 1.13.1

wrProteo: Proteomics Data Analysis Functions

Data analysis of proteomics experiments by mass spectrometry is supported by this collection of functions mostly dedicated to the analysis of (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (eg from UniProt Consortium <doi:10.1093/nar/gky1049>) can be read with automatic parsing and multiple annotation types (like species origin, abbreviated gene names, etc) extracted. Initial results from multiple software for protein (and peptide) quantitation can be imported (to a common format): MaxQuant (Tyanova et al 2016 <doi:10.1038/nprot.2016.136>), Dia-NN (Demichev et al 2020 <doi:10.1038/s41592-019-0638-x>), Fragpipe (da Veiga et al 2020 <doi:10.1038/s41592-020-0912-y>), ionbot (Degroeve et al 2021 <doi:10.1101/2021.07.02.450686>), MassChroq (Valot et al 2011 <doi:10.1002/pmic.201100120>), OpenMS (Strauss et al 2021 <doi:10.1038/nmeth.3959>), ProteomeDiscoverer (Orsburn 2021 <doi:10.3390/proteomes9010015>), Proline (Bouyssie et al 2020 <doi:10.1093/bioinformatics/btaa118>), AlphaPept (preprint Strauss et al <doi:10.1101/2021.07.23.453379>) and Wombat-P (Bouyssie et al 2023 <doi:10.1021/acs.jproteome.3c00636>. Meta-data provided by initial analysis software and/or in sdrf format can be integrated to the analysis. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity or other reasons. Help is provided to inspect the data graphically to investigate the nature of NA-values via their respective replicate measurements and to help/confirm the choice of NA-replacement algorithms. Meta-data in sdrf-format (Perez-Riverol et al 2020 <doi:10.1021/acs.jproteome.0c00376>) or similar tabular formats can be imported and included. Missing values can be inspected and imputed based on the concept of NA-neighbours or other methods. Dedicated filtering and statistical testing using the framework of package 'limma' <doi:10.18129/B9.bioc.limma> can be run, enhanced by multiple rounds of NA-replacements to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark-tests (eg Navarro et al 2016 <doi:10.1038/nbt.3685>, Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>), can be run with special options considering such sub-groups during normalization and testing. Subsequently, ROC curves (Hand and Till 2001 <doi:10.1023/A:1010920819831>) can be constructed to compare multiple analysis approaches. As detailed example the data-set from Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>) quantified by MaxQuant, ProteomeDiscoverer, and Proline is provided with a detailed analysis of heterologous spike-in proteins.

Authors:Wolfgang Raffelsberger [aut, cre]

wrProteo_1.13.1.tar.gz
wrProteo_1.13.1.zip(r-4.5)wrProteo_1.13.1.zip(r-4.4)wrProteo_1.13.1.zip(r-4.3)
wrProteo_1.13.1.tgz(r-4.5-any)wrProteo_1.13.1.tgz(r-4.4-any)wrProteo_1.13.1.tgz(r-4.3-any)
wrProteo_1.13.1.tar.gz(r-4.5-noble)wrProteo_1.13.1.tar.gz(r-4.4-noble)
wrProteo_1.13.1.tgz(r-4.4-emscripten)wrProteo_1.13.1.tgz(r-4.3-emscripten)
wrProteo.pdf |wrProteo.html✨
wrProteo/json (API)

# Install 'wrProteo' in R:

install.packages('wrProteo', repos = c('https://wraff.r-universe.dev', 'https://cloud.r-project.org'))

On CRAN:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

3.61 score 1 packages 17 scripts 1.3k downloads 57 exports 9 dependencies

Last updated 6 hours agofrom:0deaa71da3. Checks:9 OK. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Apr 02 2025
R-4.5-win	OK	Apr 02 2025
R-4.5-mac	OK	Apr 02 2025
R-4.5-linux	OK	Apr 02 2025
R-4.4-win	OK	Apr 02 2025
R-4.4-mac	OK	Apr 02 2025
R-4.4-linux	OK	Apr 02 2025
R-4.3-win	OK	Apr 02 2025
R-4.3-mac	OK	Apr 02 2025

Exports:.atomicMasses .checkKnitrProt .checkSetupGroups .commonSpecies .extrSpecPref .imputeNA .plotQuantDistr AAmass AucROC cleanListCoNames combineMultFilterNAimput convAASeq2mass corColumnOrder countNoOfCommonPeptides exportAsWombatP exportSdrfDraft extractTestingResults extrSpeciesAnnot foldChangeArrow2 fuseProteomicsProjects getUPS1acc inspectSpeciesIndic isolNAneighb massDeFormula matrixNAinspect matrixNAneighbourImpute plotROC razorNoFilter readAlphaPeptFile readDiaNNFile readDiaNNPeptides readFasta2 readFragpipeFile readIonbotPeptides readMassChroQFile readMaxQuantFile readMaxQuantPeptides readOpenMSFile readProlineFile readProtDiscovererPeptides readProtDiscovFile readProtDiscovPeptides readProteomeDiscovererFile readProteomeDiscovererPeptides readSampleMetaData readSdrf readUCSCtable readUniProtExport readWombatNormFile removeSampleInList replMissingProtNames shortSoftwName summarizeForROC test2grp testRobustToNAimputation VolcanoPlotW2 writeFasta2

Dependencies:evaluate highr knitr limma MASS statmod wrMisc xfun yaml

Analyzing Proteomics UPS1 Spike-in Experiments (Example Ramus 2016 Dataset)

Wolfgang Raffelsberger

Rendered fromwrProteoVignetteUPS1.Rmdusingknitr::rmarkdownon Apr 02 2025.

Last update: 2025-04-01
Started: 2020-10-18

Getting started with wrProteo

Wolfgang Raffelsberger

Rendered fromwrProteoVignette1.Rmdusingknitr::rmarkdownon Apr 02 2025.

Last update: 2025-04-01
Started: 2020-04-29

Help page	Topics
Molecular mass for Elements	.atomicMasses
Checking presence of knitr and rmarkdown	.checkKnitrProt
Additional/final Check And Adjustments To Sample-order After readSampleMetaData()	.checkSetupGroups
Get Matrix With UniProt Abbreviations For Selected Species As Well As Simple Names	.commonSpecies
Extract Additional Information To Construct The Colum 'SpecType'	.extrSpecPref
Basic NA-imputaton (main)	.imputeNA
Generic Plotting Of Density Distribution For Quantitation Import-functions	.plotQuantDistr
Molecular mass for amino-acids	AAmass
AUC from ROC-curves	AucROC
Selective batch cleaning of sample- (ie column-) names in list	cleanListCoNames
Combine Multiple Filters On NA-imputed Data	combineMultFilterNAimput
Molecular mass for amino-acids	convAASeq2mass
Order Columns In List Of Matrixes, Data.frames And Vectors	corColumnOrder
Compare in-silico digested proteomes for unique and shared peptides, counts per protein or as peptides Compare in-silico digested proteomes for unique and shared peptides, counts per protein or as peptides. The in-silico digestion may be performed separately using the package cleaver. Note: input must be list (or multiple names lists) of proteins with their respective peptides (eg by in-silico digestion).	countNoOfCommonPeptides
Export As Wombat-P Set Of Files	exportAsWombatP
Export Sample Meta-data from Quantification-Software as Sdrf-draft	exportSdrfDraft
Extract Results From Moderated t-tests	extractTestingResults
Extract species annotation	extrSpeciesAnnot
Add arrow for expected Fold-Change to VolcanoPlot or MA-plot	foldChangeArrow2
Combine Multiple Proteomics Data-Sets	fuseProteomicsProjects
Accession-Numbers And Names Of UPS1 Proteins	getUPS1acc
Inspect Species Indictaion Or Group of Proteins	inspectSpeciesIndic
Isolate NA-neighbours	isolNAneighb
Molecular mass from chemical formula	massDeFormula
Histogram of content of NAs in matrix	matrixNAinspect
Imputation of NA-values based on non-NA replicates	matrixNAneighbourImpute
Plot ROC curves	plotROC
Filter based on either number of total peptides and specific peptides or number of razor petides	razorNoFilter
Read (Normalized) Quantitation Data Files Produced By AlphaPept	readAlphaPeptFile
Read Tabulated Files Exported by DIA-NN At Protein Level	readDiaNNFile
Read Tabulated Files Exported by DiaNN At Peptide Level	readDiaNNPeptides
Read File Of Protein Sequences In Fasta Format	readFasta2
Read Tabulated Files Exported by FragPipe At Protein Level	readFragpipeFile
Read Tabulated Files Exported by Ionbot At Peptide Level	readIonbotPeptides
Read tabulated files imported from MassChroQ	readMassChroQFile
Read Quantitation Data-Files (proteinGroups.txt) Produced From MaxQuant At Protein Level	readMaxQuantFile
Read Peptide Identification and Quantitation Data-Files (peptides.txt) Produced By MaxQuant	readMaxQuantPeptides
Read csv files exported by OpenMS	readOpenMSFile
Read xlsx, csv or tsv files exported from Proline and MS-Angel	readProlineFile
readProtDiscovererPeptides, depreciated	readProtDiscovererPeptides
Read Tabulated Files Exported By ProteomeDiscoverer At Protein Level, Deprecated	readProtDiscovFile
Read Tabulated Files Exported by ProteomeDiscoverer At Peptide Level, Deprecated	readProtDiscovPeptides
Read Tabulated Files Exported By ProteomeDiscoverer At Protein Level	readProteomeDiscovererFile
Read Tabulated Files Exported by ProteomeDiscoverer At Peptide Level	readProteomeDiscovererPeptides
Read Sample Meta-data from Quantification-Software And/Or Sdrf And Align To Experimental Data	readSampleMetaData
Read proteomics meta-data as sdrf file	readSdrf
Read annotation files from UCSC	readUCSCtable
Read protein annotation as exported from UniProt batch-conversion	readUniProtExport
Read (Normalized) Quantitation Data Files Produced By Wombat At Protein Level	readWombatNormFile
Remove Samples/Columns From list of matrixes	removeSampleInList
Complement Missing EntryNames In Annotation	replMissingProtNames
Get Short Names of Proteomics Quantitation Software	shortSoftwName
Summarize statistical test result for plotting ROC-curves	summarizeForROC
t-test each line of 2 groups of data	test2grp
Pair-wise testing robust to NA-imputation	testRobustToNAimputation
Deprecialed Volcano-plot	VolcanoPlotW2
Write sequences in fasta format to file This function writes sequences from character vector as fasta formatted file (from UniProt) Line-headers are based on names of elements of input vector 'prot'. This function also allows comparing the main vector of sequences with a reference vector 'ref' to check if any of the sequences therein are truncated.	writeFasta2

Package: wrProteo 1.13.1

wrProteo: Proteomics Data Analysis Functions

Analyzing Proteomics UPS1 Spike-in Experiments (Example Ramus 2016 Dataset)

Getting started with wrProteo

Citation

Readme and manuals

Help Manual

Usage by other packages (reverse dependencies)