Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split functionality of eea.corpus into multiple scripts #1

Open
tiberiuichim opened this issue Apr 26, 2017 · 1 comment
Open

Split functionality of eea.corpus into multiple scripts #1

tiberiuichim opened this issue Apr 26, 2017 · 1 comment

Comments

@tiberiuichim
Copy link
Member

I see several problems that we want to handle:

  1. corpus preparation (take text, transform it, save it)
  2. "generate products". For example, generate an LDA topic visualisation and save its html+js payload to a folder, generate the TMVA topics browsers, etc
  3. Browse and use the generated products.

We can do these as command line scripts, for point 3 we can use a simple http directory index listing (if generated products are all static files).

@demarant
Copy link
Member

demarant commented May 2, 2017

@tiberiuichim
Yes we can refactor as we go.
I generally agree on the split.
To be aware is that 1) preparation is not generic, it is highly dependent on the 2) product output.
We are not building a framework here I think, we are merely using existing framework and techniques to interpret and build smart products on top of EEA corpus as input data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants