Some problems about 'maxGSSize' parameter in enricher_interval series function #28

Open
huangwb8 opened this issue Nov 4, 2019 · 1 comment


huangwb8 commented Nov 4, 2019

Hi~

Recently I realized that there is a less obvious parameter called maxGSSize, which really influences the results of enricher/GSEA analysis. Judging from the source code in DOSE, I think it is involved in selecting gene sets, based on the number of genes they contain, before the enrichment analysis (such as functional enrichment in GO/KEGG, or GSEA) is run.

In practice, a larger maxGSSize means more gene sets are evaluated, and sometimes better results are obtained.
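
To make the effect concrete, here is a minimal sketch of the kind of size filter I mean. It is written in Python for illustration only and is not DOSE's actual R code: the function name `filter_gene_sets`, the toy gene sets, the inclusive bounds, and the lower bound of 10 are all assumptions. Only the upper default of 500 comes from the package behaviour discussed here.

```python
def filter_gene_sets(gene_sets, min_size=10, max_size=500):
    """Keep only gene sets whose size lies in [min_size, max_size].

    Hypothetical helper: it mimics the idea of a maxGSSize-style filter
    applied before enrichment testing, not DOSE's implementation.
    """
    kept = {}
    for name, genes in gene_sets.items():
        size = len(set(genes))  # count unique genes only
        if min_size <= size <= max_size:
            kept[name] = genes
    return kept


# Toy gene sets (made-up names and sizes, for illustration only).
toy_sets = {
    "tiny_set": [f"G{i}" for i in range(5)],      # 5 genes
    "medium_set": [f"G{i}" for i in range(100)],  # 100 genes
    "large_set": [f"G{i}" for i in range(800)],   # 800 genes
}

kept_default = filter_gene_sets(toy_sets)                 # upper bound 500
kept_relaxed = filter_gene_sets(toy_sets, max_size=1000)  # upper bound raised

print(sorted(kept_default))
print(sorted(kept_relaxed))
```

With the upper bound at 500, the 800-gene set is dropped before any test is run, so it can never appear in the results. Raising the bound lets it through, which is why the output changes with maxGSSize.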

Here are my questions:

  • Why is the default of 'maxGSSize' 500? As far as I know, many gene sets (for example in MSigDB, where some contain thousands of genes) have more than 500 genes. Is it because larger gene sets are not suitable for this kind of analysis (GO/KEGG/GSEA)?
  • Is it reasonable/recommended to set a larger value for maxGSSize in practice?

Thanks~

@HuXuantao

me too
