Data preprocessing
#1
by
ndhieunguyen
- opened
I have a question about the steps needed to process RNA-seq data before applying the pretrained model for feature extraction. From what I understand, I need to utilize Transcripts Per Million (TPM) and log10 to process the RNA-seq data. However, I'm unsure if this approach is correct, as I could only find the processed data in the repository. Could you provide me with some pseudocode or a Python implementation for processing the raw count data?
Thank you very much.