Further adjustments and developments will be considered by observing and analysing best practice of the designated community and taking into account their feeedbackfeedback.
As stated in its mission statement the TextGrid Repository promotes free and open access to research data.
A further reuse scenario of data from the TextGridRep is related to the TextGridLab as further component of the virtual research environment. A researcher may make use of an already existimg existing XML encoded edition of a text and may add either further text, his annotation or further XML markup to the text. In this case a complete copy of the text with a new identifier (TextGrid URI) will be created, that can be finally published again in the TextGrid Repository. The metadata will automatically referee the new text or new edited text to its primary source (relation metadata field: refers to). The community may develop different digital editions with different XML schemas developed and adapted to their speciific research needs and later research generations may be able to analyse developments and approaches in text-based research related to certain topics, authors, genres or disciplines.
The lifecycle and its processes can be further specified for the TextGrid repository Repository in terms of a publication process. The publication process is a central part of scientific research in the humanities and for the TextGrid Repository and VRE. Publication is the basis for the dissemination of research results and data to allow all researchers to referee (cite) to each other’s work. New research can follow up on the basis of already existing scientific results and sources. The TextGrid Repository covers the fundamental processes of the DARIAH-DE research data lifecycle in its publication lifecycle within its framework of the virtual research environment as a whole.
Due to this interdisciplinarity the TextGrid metadata schema was developed as a chained system starting from a minimal set of mandatory metadata to ensure the reuse and evaluation of all data at a basic level. Depending on individual research needs the minimal mandatory metadata description can be expanded to a more complex schema. In addition, due to the xml encoding of textual data of the repository specific metadata can be researched through a full text search using TG-search. To evaluate data by accessing the (xml-)encoded texts is a usual procedure in the humanities and inherent to disciplines dealing with (digital) text based research. Text data can be enriched by expanding or enhancing the xml schema and adding of new marked up elements. This data can then be republished as a new related source for specific research needs. The repository supports format standards that ensure usability, access to data and its preservation for the designated community (see CollectionDevelopmentPolicyandDataQuality/Reuse 83230843). Within an ongoing collection development the repository stays in touch with the needs and the state of the art of the designated community and undertakes necessary steps including format changes or adding of new formats (see CollectionDevelopmentPolicyandDataQuality/Reuse83230843). Due to its commitment to open access and open science the repository supports in this context open formats in the sense of free file formats, that can be used by anyone at no monetary cost and whose specifications are visible and maintained by a standards organisation relevant for the designated community.
- The SUB and GWDG as two well recognised institutions with the respective relevant expertise declare their responsibility for the long-term operation of the repository through common founding of the Humanities Data Centre (HDC) as operator of the TextGrid Repository and to take care of providing all necessary ressources resources (technical, financial and in terms of knowledge and expertise of stuff members) – in addition to public project funding of associated projects and independantly whenever necessary. See in this context also the founding manifesto of the HDC.
- All phases of the TextGrid Repository's publication and preservation workflows are based on the Open Archiving Information System (see TextGrid and the Open Archival Information System (OAIS)),
- at bitstream preservation level the repository ensures data preservation in unchanged form for every item,
- for TEI/XML and TIFF as most suitable long-tem preservation formats according to the state of the art of the designated community and respective technical standards the repository offers support and will also undertake format changes in the future if necessary,
- further recommendations in terms of preferable long-term-preservation-formats are given,
- the data is accompanied by appropriate metadata standards for the professional cataloguing of the data and to enable use and reuse for research purposes,
- appropriate ingest procedures ensure that data are checked and validated according to community standards (such as mandatory metadata fields and generated additional administrative and technical metadata),
- the integrity and authenticity of data is regularly checked through a technical based routine,
- the repository has implemented periodical local and distributed backups (located in dedicated computing centers with strict access control) allowing to reinstall the repository data from backup and to recover data in case of technical failures,
- the infrastructure of the repository is regularly checked and maintained in its functionalities,
- security issues are covered through security and disaster plans including responsible persons and actions to undertake,
- documentation, data, metadata, and all related information are regularly maintained suitable to long-term archival storage,
- all involved entities and stuff members agree to regularly observe and evaluate if changes are to be considered necessary due to changing scientific practice or technical developments and how they are to be implemented (To see ongoing evaluations and planned actions that will be implemented see the section "Collection Development Policy and Data Quality/Reuse", and the wiki page "Digital Object MangementManagement"),
- also on an organisational and strategic level SUB and GWDG ensures that the repository stays closely related to its designated community and ongoing innovative developments through associated projects and engagement in new developments and initiatives at a national and international level.
for formatted text: PDF/A, ISO 19005-1: 2005 (http://www.iso.org/iso/catalogue_detail?csnumber=38920)
for audio formats: WAVE (http://msdn.microsoft.com/en-us/library/ms713498%28VS.85%29.asp)
for video files: MPEG 4 File Format, ISO/IEC 14496 (http://www.chiariglione.org/mpeg/standards/mpeg-4/mpeg-4.htm) All levels of the publication workflowa workflows of the TextGrid VRE and for the repository (as illustrated and described above) support and consultaion are given .
At all levels of the publication workflow and lifecycle of the TextGrid VRE and for the repository (as illustrated and described above in the section DataReuse 83230843) support and consultation are guranteed guaranteed and given by staff members of the SUB and DARIAH-DE dealing with the TextGrid VRE via:
- consultation of research projects
- user meetings and Workshops
- email for support
- user's mailing list
All ccoperating cooperating projects and research projects using the TextGrid VRE are entitled to receive at the beginning an initial consultaion consultation for starting using the TextGrid VRE and to be aware of relevant issues for data publication into the TextGrid Repository. Consultation and support is usually used by our designated community and highly recommended for the following issues:
The technical infrastructure of the TextGrid Repository runs on a well-supported operating system. The hardware, software and used technologies are appropriate to serve nationally and internationally research, teaching and learning by providing long term preservation, further processing, openly sharing and dissemination of digital research data according to ethical and scientific standards of the international research community. The Designated Community of the TextGrid Repository follows the good scientific practice as recommended by the German Research Association, which also does the University of Göttingen. In terms of practice this means, as highlightet highlighted publically by the University on the respective website (https://www.uni-goettingen.de/en/604506.html) as well as in related documents and listed here:
- General Data Protection Regulation (European Law, EUR-Lex)
- Data Protection in the EU (European CommisionCommission)
- Law topic data protection (European Commission)
- Legal grounds for processing data (European ComissionCommission)
- Legal grounds processing sensitive data (European ComissionCommission)
- Portal for licence information on research data (DARIAH-DE)
- Guide on legal issues for the humanities (DARIAH-DE Working paper by Paul Klimpel and John H. Weitzman, language: germann)
- Data licences for research data in the humanities (DARIAH-DE Working Paper by Beer, N. et al., language: german)
- Legal framework for research data (DataJus Project)