ConvoKit data_directory configuration setting for corpora not aligned between configuration and download functions #225

Open
seanzhangkx8 opened this issue Jul 31, 2024 · 0 comments

Comments

@seanzhangkx8
Contributor

The configuration file has a data_directory field that lets users specify where their corpora are stored. However, when users download data with ConvoKit's download function, the corpus is always downloaded to the default directory ~/.convokit/downloads, regardless of the configured value. This happens because download never reads the data_directory setting. It would be good to align the two so that all corpora are kept together in one place.
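One possible way to align the two is to let the download location fall back from an explicit argument, to the configured data_directory, to the current default. The sketch below only illustrates that precedence. `resolve_download_dir` is a hypothetical helper, not part of ConvoKit, and it assumes the configuration has already been parsed into a plain mapping rather than reading config.yml itself.

```python
import os
from typing import Mapping, Optional

# Default download location named in this issue.
DEFAULT_DOWNLOAD_DIR = "~/.convokit/downloads"


def resolve_download_dir(
    config: Mapping[str, str],
    data_dir: Optional[str] = None,
) -> str:
    """Hypothetical helper: choose the directory a corpus is downloaded to.

    Precedence: explicit data_dir argument, then the configured
    data_directory, then the default download directory.
    """
    chosen = data_dir or config.get("data_directory") or DEFAULT_DOWNLOAD_DIR
    # Expand "~" and normalize so callers always get an absolute path.
    return os.path.abspath(os.path.expanduser(chosen))
```

With this precedence, a user who sets data_directory gets downloads there by default, while an explicit argument still overrides the configuration for a single call.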

Code

Please refer to util.py and convokitConfig.py for details.

Steps to reproduce

Change the data_directory setting in the configuration file, for example by editing it with:
vim ~/.convokit/config.yml

Then download a corpus that is not already in your local directory, for example:
from convokit import Corpus, download
corpus = Corpus(filename=download("friends-corpus"))

Observe that the corpus is still downloaded to ~/.convokit/downloads instead of the data directory specified in the configuration.
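To confirm where the corpus actually landed, a small check like the following may help. It is a sketch only: `find_corpus_dir` is a hypothetical helper, and it assumes a downloaded corpus is stored as a subdirectory named after the corpus, which may not hold for every ConvoKit version.

```python
import os
from typing import Iterable, Optional


def find_corpus_dir(corpus_name: str, candidate_dirs: Iterable[str]) -> Optional[str]:
    """Return the first candidate directory containing corpus_name, or None.

    Assumption: a corpus is stored as a subdirectory named corpus_name.
    """
    for directory in candidate_dirs:
        root = os.path.expanduser(directory)
        if os.path.isdir(os.path.join(root, corpus_name)):
            return root
    return None
```

For example, calling `find_corpus_dir("friends-corpus", ["~/.convokit/downloads", configured_dir])`, where `configured_dir` stands for whatever data_directory was set to, shows which of the two locations received the download.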
