This repository has been archived by the owner on Sep 1, 2022. It is now read-only.

Keep Old Student Submissions (Using Pandas) #108

Closed
wants to merge 2 commits into from


zkagin
Collaborator

@zkagin zkagin commented Oct 14, 2020

Fixes #91

Instead of dropping and rewriting the student submissions database, this endpoint now appends new files and then removes duplicates based on the updateTime. This PR also adds test functionality that allows alternate return values, so the tests can properly cover getting a value that needs to be updated.

Similar to #99, but deduplicates with pandas instead of SQL. This might cause memory issues. @dchess, can you try running this to see if it works for you?
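
For readers following along, here is a minimal sketch of what append-then-deduplicate can look like in pandas. It is not the code from this PR: the function name `append_and_dedupe`, the `id` and `state` columns, and the rule that the row with the newest `updateTime` wins are all assumptions made for illustration.

```python
import pandas as pd


def append_and_dedupe(existing: pd.DataFrame, incoming: pd.DataFrame) -> pd.DataFrame:
    """Append incoming rows, then keep only the newest row per submission id.

    Assumes (hypothetically) that duplicates share an `id` column and that
    the row with the latest `updateTime` should be kept.
    """
    combined = pd.concat([existing, incoming], ignore_index=True)
    combined["updateTime"] = pd.to_datetime(combined["updateTime"])
    # Sort so the most recent updateTime comes last within each id,
    # then keep that last row and drop the older duplicates.
    combined = combined.sort_values("updateTime", kind="stable")
    deduped = combined.drop_duplicates(subset="id", keep="last")
    return deduped.sort_values("id").reset_index(drop=True)


# Illustrative data only; not taken from the repository.
existing = pd.DataFrame({
    "id": ["a", "b"],
    "state": ["CREATED", "TURNED_IN"],
    "updateTime": ["2020-10-01T10:00:00Z", "2020-10-02T10:00:00Z"],
})
incoming = pd.DataFrame({
    "id": ["b", "c"],
    "state": ["RETURNED", "CREATED"],
    "updateTime": ["2020-10-05T10:00:00Z", "2020-10-03T10:00:00Z"],
})
result = append_and_dedupe(existing, incoming)
```

Note that this approach holds the existing and the incoming rows in memory at the same time, which is consistent with the memory concern raised above.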

@zkagin zkagin requested a review from dchess October 14, 2020 23:01
@zkagin zkagin force-pushed the keep_old_student_submissions_v2 branch from 3b6d255 to 4342097 Compare October 14, 2020 23:14
@zkagin
Collaborator Author

zkagin commented Oct 14, 2020

@dchess This does work for me, but it takes a lot of memory and about 2 extra minutes to run. I defer to you on which approach you would prefer.

@zkagin zkagin force-pushed the keep_old_student_submissions_v2 branch from 4342097 to ab77f96 Compare October 14, 2020 23:34
@zkagin
Collaborator Author

zkagin commented Oct 14, 2020

@dchess Never mind, mine actually ends silently, with what I assume is a memory error, once I pull the whole list. Unfortunately, I don't think this is a viable method.

@dchess dchess closed this Nov 19, 2020
@dchess dchess deleted the keep_old_student_submissions_v2 branch December 3, 2020 00:50
Development

Successfully merging this pull request may close these issues.

Archive StudentSubmissions before refresh
2 participants