Repeated raw count values in LICA-FR dataset

Hi, I am processing this LICA-FR RNA-seq dataset and find that the raw counts of each gene are all the same across samples:

wc -l exp_seq.LICA-FR.tsv 
9309021 exp_seq.LICA-FR.tsv

cat exp_seq.LICA-FR.tsv | cut -f10 | sort -u | wc -l
4353

I have the same question. That’s really weird.

Hi All,

Please note the following corrections:

## LICA-FR sequencing-based expression data

### November 26, 2019

The current sequencing expression data (exp_seq.LICA-FR_corrected.tsv.gz) at https://dcc.icgc.org/releases/current/Projects/LICA-FR contains **incorrect** raw read count values. Please download the corrected "exp_seq.LICA-FR.tsv.gz" file at https://dcc.icgc.org/releases/Supplementary/LICA-FR/corrected_data.
% cat $HOME/Downloads/exp_seq.LICA-FR_corrected.tsv | cut -f10 | sort -u | wc -l
46868

Cheers,
Edmund