差异基因用p-vaule筛选可以吗?

老师,您好:转录组分析时因为DEGs个数较少,用的p-vaule筛选的DEGs,审稿人一直问为什么不用FDR,老师这个问题该怎么回答。

Point 6: - Line 600: DEGs were determined using p-value < 0.05 and fold change ≥ 1.5. What is the rationale behind using pvalues instead of adjusted pvalues/FDR. Multiple testing correction is a very crucial aspect of RNA-Seq data analysis since there are thousands of genes being tested and using p values instead of adjusted p values leads to a large number of false positives.

The authors explain the rationale as “in instances where differentially expressed genes obtained by p-value screening are relatively fewer, like in our present case, applying the FDR method would result in the loss of too many or all true positives.”

Using p-values may have increased the number of DEGs, but it has also added a lot of false positives to the results. Unless every DEG is verified by an independent method, there is no way of clearly differentiating between false positives and true positives.

Moreover, when the number of DEGs obtained is low, it makes sense to look at other methods of modeling gene count data or reducing the FDR cutoff, etc.

RNA-Seq being an exploratory technique for hypothesis generation, must be accompanied by multiple testing correction to avoid validation of genes which may turn out to be false positives.

请先 登录 后评论

1 个回答

生信老顽童

当差异基因很少的情况下,如果调整FDR和差异倍数都无法增加差异数量的话。退而求其次的方法就是直接使用pvalue值来筛选差异。这样差异的数量可能会多一些。不过相应的假阳性也会高一点。

不过有些样品,比如细胞样品,由于背景相似度非常高,很有可能出现差异很少的情况。这时候就可以酌情考虑直接使用pvalue值。通过pvalue筛选的差异,只要在后续的定量验证中没有问题,那么就可以说明问题的。

分析不是一层不变的,得灵活处理。

请先 登录 后评论