GenomeProt: an integrated proteogenomics data analysis platform for long-read RNA-Seq datasets
Developed by Hitesh Kore, Ching Yin Wan and Josie Gleeson at The University of Melbourne
Workflow steps:
Generate a custom proteogenomics database
Creates an amino acid FASTA of all ORFs in your data to use as input for FragPipe/MaxQuant etc.
Find short (10 to 'ORF length' amino acids) ORFs in UTRs of reference transcripts:
Integrate proteomics results with transcriptomics
Creates BED12s and GTFs of peptides, ORFs and transcripts for visualisation and produces summary data
Visualise results with IsoVis
The IsoVis website is displayed below for convenience. It is also accessible directly at: https://isomix.org/isovis/
Instructions for using IsoVis
Step 1: Click 'Upload data'.
Step 2: For the 'Transcript data' file, upload 'combined_annotations.gtf'. For the 'Transcript counts' file, upload 'bambu_transcript_counts.txt'.
Step 3: For the 'Peptide intensities' file, upload the peptide intensities file from the proteomics pipeline you used (e.g. 'report.pr_matrix.tsv'), then click 'Apply'.
Step 4: Type the symbol or ID of a gene to view, select it from the list of results displayed, then either press enter or click '>'.
Step 5: To see the mappings of peptides to open reading frames, click on the 'Stack options' dropdown menu and select 'Peptide mapping'.