Hi, I am processing the LICA-FR RNA-seq dataset and found that the raw counts of each gene appear to be the same across samples. The file has more than 9 million lines, but column 10 contains only 4353 distinct values:
wc -l exp_seq.LICA-FR.tsv
9309021 exp_seq.LICA-FR.tsv
cat exp_seq.LICA-FR.tsv | cut -f10 | sort -u | wc -l
4353
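The command above counts distinct values over the whole column, so it does not show what happens for each gene individually. A rough per-gene version of the check could look like the sketch below. It assumes a tab-separated file with a single header line and the gene identifier in column 8; those column positions and the function name `distinct_counts_per_gene` are assumptions for illustration, and only column 10 is taken from the command above.

```shell
# Sketch: for each gene, count how many distinct raw-count values occur across rows.
# Assumptions (not confirmed in this thread): tab-separated input, one header line,
# gene identifier in column 8. Column 10 is the column inspected in the post above.
distinct_counts_per_gene() {
    file=$1
    gene_col=${2:-8}     # hypothetical default for the gene identifier column
    count_col=${3:-10}   # column checked with cut -f10 in the original post
    awk -F'\t' -v g="$gene_col" -v c="$count_col" '
        NR == 1 { next }                      # skip the header line
        !seen[$g, $c]++ { distinct[$g]++ }    # first time this (gene, value) pair appears
        END { for (gene in distinct) print gene "\t" distinct[gene] }
    ' "$file" | sort
}

# Example call (file name taken from the post above):
#   distinct_counts_per_gene exp_seq.LICA-FR.tsv | head
```

A gene listed with a count of 1 has the same value in every row where it appears, which is the pattern described in this post.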
I have the same question. That’s really weird.
Hi All,
Please note the following corrections:
## LICA-FR sequencing-based expression data
### November 26, 2019
The current sequencing expression data (exp_seq.LICA-FR.tsv.gz) at https://dcc.icgc.org/releases/current/Projects/LICA-FR contains **incorrect** raw read count values. Please download the corrected "exp_seq.LICA-FR_corrected.tsv.gz" file at https://dcc.icgc.org/releases/Supplementary/LICA-FR/corrected_data. After decompressing it, the same check on column 10 now gives a much larger number of distinct values:
% cat $HOME/Downloads/exp_seq.LICA-FR_corrected.tsv | cut -f10 | sort -u | wc -l
46868
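If you would rather not decompress the download first, the same check can be run directly on the `.gz` file. This is only a sketch: the helper name `count_distinct_column` is made up for illustration, and it assumes a single header line, which it skips, so its result can differ by one from the plain `cut | sort -u | wc -l` pipeline above.

```shell
# Sketch: count distinct values in one column of a TSV, gzipped or plain.
# Assumptions: tab-separated input with exactly one header line (skipped here).
count_distinct_column() {
    file=$1
    col=${2:-10}   # column 10 is the one checked in this thread
    case "$file" in
        *.gz) gzip -dc "$file" ;;   # decompress to stdout, leave the file untouched
        *)    cat "$file" ;;
    esac | tail -n +2 | cut -f"$col" | sort -u | wc -l | tr -d ' '
}

# Example call (file name taken from the notice above):
#   count_distinct_column "$HOME/Downloads/exp_seq.LICA-FR_corrected.tsv.gz"
```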
Cheers,
Edmund