Anastasiia Iarkaeva, Evgeny Bobrov, Jan Taubitz, Benjamin Gregory Carlisle, Nico Riedel
The Openness extraction form complements the automated detection of Open Data statements by the ODDPub text-mining algorithm: it guides and documents the manual validation of those statements. The form is implemented in Numbat, a software tool for extracting and collecting information from articles for systematic reviews and for managing the resulting databases, including large ones.
The semi-automated extraction of information on open datasets thus currently consists of three steps:
- Automated detection of Open Data statements by ODDPub.
- Preparation of the data for manual validation.
- Manual validation in the Numbat Openness extraction form.
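As a rough illustration of the second step, the sketch below keeps only the articles that the automated detection flagged and writes them to a tab-separated table that a validator could work through. It is not the authors' actual preparation script. The column names (`article`, `is_open_data`, `open_data_statements`), the sample rows, and the choice of a tab-separated output are assumptions made for this example, and the function name `prepare_for_validation` is hypothetical.

```python
import csv
import io


def prepare_for_validation(rows):
    """Return a TSV string with only the rows flagged as containing Open Data.

    `rows` is a list of dicts; the keys used here are assumed column names,
    not a documented interface of ODDPub or Numbat.
    """
    # Keep rows whose flag reads as true, whatever the capitalisation.
    flagged = [
        r for r in rows
        if str(r.get("is_open_data", "")).strip().lower() == "true"
    ]
    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=["article", "open_data_statements"],
        delimiter="\t",
        extrasaction="ignore",  # drop columns not needed for validation
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(flagged)
    return out.getvalue()


# Hypothetical detection results, for illustration only.
detected = [
    {"article": "example-article-1", "is_open_data": "TRUE",
     "open_data_statements": "Data are deposited in a public repository."},
    {"article": "example-article-2", "is_open_data": "FALSE",
     "open_data_statements": ""},
]

tsv = prepare_for_validation(detected)
print(tsv)
```

Only flagged articles reach the manual step in this sketch, so validators would confirm or reject detected statements rather than screen every article. Whether unflagged articles should also be sampled for validation is a design decision this example does not address.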
The interactive protocol for using the Openness form is published on protocols.io.
Steps 4.1-4.5 of the protocol explain how to install and run the Numbat software for your own use.
The Openness extraction form is available under the MIT License.