Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Destroy command should unregister dataset from DOI / persistent ID provider #2471

Closed
scolapasta opened this issue Aug 19, 2015 · 8 comments · Fixed by #6060
Closed

Destroy command should unregister dataset from DOI / persistent ID provider #2471

scolapasta opened this issue Aug 19, 2015 · 8 comments · Fixed by #6060
Assignees
Labels
Feature: DOI & Handle Feature: Publishing & Versions User Role: Sysadmin Installs, upgrades, and configures the system, connects via ssh
Milestone

Comments

@scolapasta
Copy link
Contributor

No description provided.

@bencomp
Copy link
Contributor

bencomp commented Aug 20, 2015

Could you please explain the term "unregister"? The PID provider usually has rules for this and may not even support taking a PID out of the registry.
To my knowledge the best way to take a PID out of the universe is to update it to redirect to a page (perhaps a generic one) explaining what happened to the dataset. This is similar to what happens when you deaccession a dataset.

@mercecrosas mercecrosas modified the milestone: In Review Nov 30, 2015
@scolapasta scolapasta removed this from the Not Assigned to a Release milestone Jan 28, 2016
@pdurbin
Copy link
Member

pdurbin commented Jun 29, 2017

@scolapasta @sekmiller @kcondon do you know if this was fixed as part of pull request #3826?

@pdurbin pdurbin added the User Role: Sysadmin Installs, upgrades, and configures the system, connects via ssh label Jul 13, 2017
@djbrooke djbrooke self-assigned this May 8, 2019
@djbrooke
Copy link
Contributor

djbrooke commented May 8, 2019

  • we believe we have this in place, but the Datacite API may have not existed at the time.
  • we would want this to work for handles, and EZID-generated DOIs as well
  • we'd want this to work for files as well (this is implemented in the code as a try-catch already and we may want to refactor)

@djbrooke djbrooke removed their assignment May 8, 2019
@djbrooke djbrooke assigned djbrooke and unassigned djbrooke Jun 26, 2019
@sekmiller
Copy link
Contributor

Neither EZID nor DataCite allow for the deletion of PIDs once they have been made public. EZID allows reserved (non-published) PIDs to be deleted. The current implementation of DataCite does not register PIDs until publication.

So for EZID PIDs, when a dataset is destroyed its PID's status is set of "unavailable, withdrawn by author" and its target url is changed from the Dataverse dataset page to an EZID PID page where the "unavailable" status is displayed.

For DataCite PIDs, on dataset destroy the status (and for any published files is set to "registered". A "registered" status removes the PID from any DataCite searches. Also, the target url remains pointing at Dataverse so that anyone with the url will get a 404 - not found error. (This was deemed appropriate as the dataset is truly gone and not just deaccessioned.)

Handles for destroyed datasets are fully deleted.

@poikilotherm
Copy link
Contributor

A quick note from our onboarding session with Datacite: they provide a tombstone mechanism, which should be preferred over a 404.

@sekmiller
Copy link
Contributor

Is there documentation of DataCite's tombstone mechanism?

@landreev
Copy link
Contributor

landreev commented Aug 2, 2019

If the documentation for this is not readily google-able, we can always ask them. Martin Fenner has been very willing to answer any technical questions (he reads support@datacite.org).

I don't have a strong opinion though on whether this is something we need - specifically, in the context of this issue, as part of the "Destroy dataset" command. This is kind of a nuclear option, reserved for special situations. And I know in the past some of the people who wanted the functionality were actually interested in erasing all traces of the dataset.

@sekmiller
Copy link
Contributor

@landreev the DeleteIdentifier method was already implemented in the HandlenetServiceBean.

@pdurbin pdurbin added this to the 4.16 milestone Oct 12, 2019
qqmyers added a commit to GlobalDataverseCommunityConsortium/dataverse that referenced this issue Jul 17, 2020
kcondon added a commit that referenced this issue Jul 28, 2020
…/7078

restore code from #2471, changes to merge that with new pid api
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature: DOI & Handle Feature: Publishing & Versions User Role: Sysadmin Installs, upgrades, and configures the system, connects via ssh
Projects
None yet
Development

Successfully merging a pull request may close this issue.

9 participants