Data preprocessing

by ndhieunguyen - opened Oct 15

Oct 15

I have a question about the steps needed to process RNA-seq data before applying the pretrained model for feature extraction. From what I understand, I need to utilize Transcripts Per Million (TPM) and log10 to process the RNA-seq data. However, I'm unsure if this approach is correct, as I could only find the processed data in the repository. Could you provide me with some pseudocode or a Python implementation for processing the raw count data?
Thank you very much.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment