NDLTD - ETD-ms: an Interoperability Metadata Standard for Electronic Theses and Dissertations

ETD-ms: an Interoperability Metadata Standard for Electronic Theses and Dissertations -- version 1.01

This version

http://www.ndltd.org/standards/metadata/etd-ms-v1.01.html

Current version

http://www.thesis.org/standards/metadata/current.html

Previous versions

http://www.ndltd.org/standards/metadata/etd-ms-v1.00.html
http://www.ndltd.org/standards/metadata/draft-06-05-2001.html
http://www.ndltd.org/standards/metadata/draft-01-10-2001.html
http://www.ndltd.org/standards/metadata/draft-09-11-2000.html
http://www.ndltd.org/standards/metadata/draft-07-09-2000.html

Editors

Anthony Atkins

Ed Fox

Robert France

Hussein Saleman


Outline

1. Introduction

2. Authorities

3. Metadata Elements

3.1 dc.title
3.2 dc.creator
3.3 dc.subject
3.4 dc.description
3.5 dc.publisher
3.6 dc.contributor
3.7 dc.date
3.8 dc.type
3.9 dc.format
3.10 dc.identifier
3.11 dc.language
3.12 dc.coverage
3.13 dc.rights
3.14 thesis.degree

4.Global Qualifiers

5.Encodings

The abstract set of elements defined in this document can be encoded in a variety of standard formats. Two of particular interest to the digital library community are the MARC standard common to library catalog systems and the XML data markup language. The ETD-MS group provides a "Vanilla" encoding of the standard set into XML and a recommended encoding into MARC-21. The XML encoding follows Dublin Core standards everywhere that the abstract metadata set follows Dublin Core elements. THe MARC-21 recommendations follow AACR-2 policies and procedures for common bibliographic elements like title and publisher, with additions for elements particular to electronic theses and dissertations. A third encoding of the element set, into RDF, is envisioned. As this version goes public, that encoding has not yet been completed.
(5.1.1 is present now - rest of section 5 will be added later.)
5.1 "Vanilla" XML Encoding
5.1.1 Sample Record

5.2 MARC-21 Encoding
5.2.1 Sample Record

6.History

7.Contributors


1.Introduction

This document defines a standard set of metadata elements used to describe an electronic thesis or dissertation.

Institutions dealing with electronic theses and dissertations have all developed their own standards or adapted existing metadata standards. These metadata standards all attempt to describe the author, the work, and the context in which the work was produced in a way that will be useful to the researcher as well as the librarians and/or technical staff maintaining the work in its electronic form.

This document is not a replacement for the metadata schemes developed for a particular university or environment. Rather, this document should be used as a guideline to develop a faithful cross-walk between local metadata standards and a single standard used for sharing information about ETDs.


2. Authorities

Each reference to an individual or institution in any field should contain a string representing the name of the individual or institution as it appears in the work. The reference may also contain a URI which points to an authoritative record for that individual or institution.


3. Metadata Elements

The following is a description of the common Dublin Core metadata elements (and a new element specifically for theses). Guidelines are given as to which information related to an ETD belongs in each element.

If a more general element is described as mandatory, it should be specified fully, even if qualified elements are also specified. If a more general element is repeatable, it should be assumed that sub-elements are also repeatable. If an element contains free text, it must be repeatable to allow for ETDs that provide metadata in more than one language.


3.1 dc.title

Element Description Notes
dc.title A name given to the resource. In the case of theses and dissertations, this is the title of the work as it appears on the title page or equivalent. Mandatory
Repeatable
dc.title.alternative alternative title of the thesis or dissertation Optional,
Repeatable

3.2 dc.creator

Element Description Notes
dc.creator An entity primarily responsible for making the content of the resource. In the case of theses or dissertations, this field is appropriate for the author(s) of the work. Like other names and institutions, this field should be entered in free text form as it appears on the title page or equivalent, with a link to to an authority record if available. See "Authority" section for more information. Mandatory,
Repeatable

3.3 dc.subject

Element Description Notes
dc.subject The topic of the content of the resource. In the case of theses and dissertations, keywords or subjects listed on the title page can be entered as free text. The scheme qualifier should be used to indicate a controlled vocabulary. See Global Qualifiers for more information. Mandatory,
Repeatable

3.4 dc.description

Element Description Notes
dc.description An account of the content of the resource. In the case of theses and disserations, this is the full text of the abstract. Note: dc.description is interchangeable with the qualified element dc.description.abstract. Optional,
Repeatable
dc.description.abstract The full text of the abstract. Optional,
Repeatable
dc.description.note Additional information regarding the thesis or dissertation. Example: acceptance note of the department Optional,
Repeatable
dc.description.release Description of the version of the work. Should only be used for errata, etc.. Optional,
Repeatable

3.5 dc.publisher

Element Description Notes
dc.publisher An entity responsible for making the resource available. This is typically the group most directly responsible for digitizing and/or archiving the work. The publisher may or may not be exactly the same as thesis.degree.grantor. Like other names and institutions, this field should be entered in free text form as it appears on the title page or equivalent, with a link to to an authority record if available. See "Authority" section for more information. Optional,
Repeatable

3.6 dc.contributor

Element Description Notes
dc.contributor An entity responsible for making contributions to the content of the resource. Typical use would be for co-authors of parts of the work as well as advisors or committee members. Co-authors of the entire work would be more appropriate for the dc.creator field. Optional,
Repeatable
dc.contributor.role Role the person played in the creation or approval of the work. Examples: advisor, committee member, chair, co-chair, referree, juror. Optional

3.7 dc.date

Element Description Notes
dc.date A date associated with an event in the life cycle of the resource. In the case of theses and dissertations, this should be the date that appears on the title page or equivalent of the work. Should be recorded as defined in ISO 8601 and the profile recommended for implementing ISO 8601 dates in Dublin Core. Mandatory

3.8 dc.type

Element Description Notes
dc.type The nature or genre of the content of a resource. This field is used to distinguish the resource from works in other genres and to identify the types of content included in the resource. The string "Electronic Thesis or Dissertation" is recommended as one of the repeatable values for this element. In addition, specify types of content using the standard vocabulary found at: http://dublincore.org/documents/dcmi-type-vocabulary/. Degree and Education Level are now handled by the thesis.degree field. Mandatory
Repeatable

3.9 dc.format

Element Description Notes
dc.format The physical or digital manifestation of the resource. In the case of an electronic thesis or dissertation, this should contain a list of the electronic format(s) in which the work is stored and/or delivered. Use the standard MIME type whenever possible (for a list of "registered" MIME types, visit ftp://ftp.isi.edu/in-notes/iana/assignments/media-types/media-types). List as "unknown" if no format information is available, omit if the work is not available in electronic form. Optional,
Repeatable

3.10 dc.identifier

Element Description Notes
dc.identifier An unambiguous reference to the resource within a given context. This can and should be used to provide a URI where the work can be viewed or downloaded. Persistent URNs such as PURLs (http://purl.org/) or Handles (http://handle.net/) are recommended. Mandatory,
Repeatable

3.11 dc.language

Element Description Notes
dc.language A language of the intellectual content of the resource. This should be the primary language in which the work is recorded. Portions of the larger work that appear in other languages should use the lang qualifier. See Global Qualifiers. Language names themselves should be recorded using ISO 639-2 (or RFC 1766). If the language is not specified, it is assumed to be english (en). Optional,
Repeatable

3.12 dc.coverage

Element Description Notes
dc.coverage The extent or scope of the content of the resource. This element should be used for time periods or spatial regions. For any other type of "coverage", use dc.subject. Optional,
Repeatable

3.13 dc.rights

Element Description Notes
dc.rights Information about rights held in and over the resource. Typically, this describes the conditions under which the work may be distributed, reproduced, etc., how these conditions may change over time, and whom to contact regarding the copyright of the work. Optional,
Repeatable

3.14 thesis.degree

Element Description Notes
thesis.degree.name Name of the degree associated with the work as it appears within the work. (example: Masters in Operations Research) Optional,
Repeatable
thesis.degree.level Level of education associated with the document. Examples: bachelor's, master's, doctoral, post-doctoral, other. Optional,
Repeatable
thesis.degree.discipline Area of study of the intellectual content of the document. Usually, this will be the name of a program or department. Optional,
Repeatable
thesis.degree.grantor Institution granting the degree associated with the work. Like other names and institutions, this field should be entered in free text form as it appears on the title page or equivalent, with a link to to an authority record if available. See "Authority" section for more information. Optional,
Repeatable

4. Global Qualifiers

In addition to the attributes specified for various elements, there are also a handful of global attributes that can be specified for any element.

  1. lang
    An indication of the language in which the value of a field is written. This is entirely separate from the dc.language element, which indicates the primary language of the work itself.
  2. translated
    An empty element that indicates that the value of a field is a translation provided by someone other than the author. For translations provided by the author, simply indicate the language of the field itself (using the "lang" attribute).
  3. scheme
    Description of the vocabulary or scheme used to determine the subject(s)


5.1.1 Sample Record

The following is the metadata for an ETD from the Virginia Tech ETD collection presented in one possible XML encoding based on this standard. It would certainly be possible to develop alternate encodings in XML, or encodings in RDF. The XML schema used can be found at http://www.ndltd.org/standards/metadata/etdms/1.0/etdms.xsd.

<thesis xmlns="http://www.ndltd.org/standards/metadata/etdms/1.0/"
	xsi:schemaLocation="http://www.ndltd.org/standards/metadata/etdms/1.0/ 
	http://www.ndltd.org/standards/metadata/etdms/1.0/etdms.xsd">

  <title>Conceptual Development and Empirical Testing of an Outdoor
  Recreation Experience Model: The Recreation Experience Matrix
  (REM)</title>

  <creator>Walker, Gordon James</creator>

  <subject>outdoor recreation</subject>

  <subject>recreation experience preference scales</subject>

  <subject>recreation experience matrix</subject>

  <subject>recreation opportunity spectrum</subject>

  <description>This dissertation examines four issues, including:
  (a) whether outdoor recreation experiences not included in the
  Recreation Experience Preference (REP) scales exist; (b) whether
  these experiences can be categorized using a framework called the
  Recreation Experience Matrix (REM); (c) how well the Recreation
  Opportunity Spectrum (ROS) variables of activity, setting, and
  expertise explain the types of experiences outdoor recreationists
  receive; and (d) how well two new variables--primary mode and
  mode dependence--explain the types of experiences outdoor
  recreationists receive. In order to address these issues, an
  on-site questionnaire was distributed at Mount Rogers National
  Recreation Area in Virginia during October and November, 1995. A
  total of 410 people completed this questionnaire. Of these, 336
  provided useable addresses for a follow-up mail-out
  questionnaire, with 169 (50.3%) actually returning it. After
  performing a variety of statistical analyses, it was found that:
  (a) some outdoor recreationists did report having non-REP
  experiences involving identity, cognition, absorption, and
  self-concept; (b) indirect support does exist for classifying
  outdoor recreation experiences using the REM framework; and (c)
  the ROS variables of activity, setting, and expertise, do explain
  some outdoor recreation experiences, as do the new variables of
  primary mode and mode dependence.</description>

  <publisher>Virginia Polytechnic Institute and State
  University</publisher>

  <contributor role="committee_member">Daniel R.
  Williams</contributor>

  <contributor role="committee_member">K. Jill
  Kiecolt</contributor>

  <contributor role="committee_member">Bradley R.
  Hertel</contributor>

  <contributor role="chair">Joseph W. Roggenbuck</contributor>

  <contributor role="chair">R. Bruce Hull</contributor>

  <date>1997-03-31</date>

  <type>Electronic Thesis or Dissertation</type>

  <format>application/pdf</format>

  <identifier>
  http://scholar.lib.vt.edu/theses/available/etd-3345131939761081/</identifier>

  <language>en</language>

  <rights>unrestricted</rights>

  <rights>I hereby grant to Virginia Tech or its agents the right
  to archive and to make available my thesis or dissertation in
  whole or in part in the University Libraries in all forms of
  media, now or hereafter known. I retain all proprietary rights,
  such as patent rights. I also retain the right to use in future
  works (such as articles or books) all or part of this thesis or
  dissertation.</rights>

  <degree>
    <name>PHD</name>

    <level>doctoral</level>

    <discipline>Forestry</discipline>

    <grantor>Virginia Polytechnic Institute and State
    University</grantor>
  </degree>
</thesis>

	

6. Document History

This document is based primarly on the work of the Dublin Core Metadata Initiative and the proposed set of qualifiers put forward by the DC Date Working Group.

The version of this document dated 06-05-2001 was prepared in response to comments made by users of the ndltd-standards mailing list (see below for contributor information).

The version of this document dated 01-10-2001 was produced as a result of a two-day meeting that took place at OCLC in Dublin Ohio on January 9-10 of 2001 (see below for contributor information).

This document is also a distillation of the discussions which took place at the workshop "DTDs and the usage of new XML-technologies for electronic theses and dissertations" held in May of 2000 at Humboldt University in Berlin. A list of the participants is included in the contributors section. More information regarding this workshop can be found at: http://dochost.rz.hu-berlin.de/epdiss/dtd-workshop/.

This document is also based in part on a prior document entitled "NDLTD Metadata" drafted by Gail McMillan and Paul Mather.


7. Contributors

The version of this document dated 06-05-2001 was prepared in response to comments made by users of the ndltd-standards mailing list, including Tony Cargnelutti, Elaine Henjum, and Priscilla Caplan.

The version of this document dated 01-10-2001 was discussed and prepared as part of the "ETD Standards Meeting" held on January 9-10, 2001 at OCLC Online Computer Library Center, Inc. in Dublin, Ohio (USA) The following people attended the meeting:

  • Anthony Atkins, Virginia Tech
  • Thorsten Bahne, Gerhard-Mercator-Universitat (Germany)
  • Eric Childress, OCLC
  • John Espley, VTLS
  • Ed Fox, Virginia Tech
  • Robert France, Virginia Tech
  • Nuno Freire, Portuguese National Library
  • Thom Hickey, OCLC
  • Sarantos Kapidakis, National Documentation Centre (Greece)
  • Sunny Kim, Academy of Korean Studies / Virginia Tech
  • Mann-Ho Lee, Chungnam National University (Korea) / Virginia Tech
  • Akira Maeda, Nara Institute of Science & Technology (Japan) / Virginia Tech
  • Eric Miller, OCLC
  • Ed O'Neill, OCLC
  • Shalini Urs, University of Mysore (India) / Virginia Tech
  • Diane Vizine-Goetz, OCLC
  • Stu Weibel, OCLC

The following individuals and groups contributed to the previous draft of this document:

The following people participated in the metadata discussion led by Nuno Freire and Thorsten Bahne, which was part of the workshop "DTDs and the usage of new XML-technologies for electronic theses and dissertations" held at Humboldt University in Berlin in May of 2000:
  • Per Åkerlund, SLU (Swedish University of Agricultural Sciences) Libraries, SWEDEN, Per.Akerlund@bibul.slu.se
  • Anthony Atkins, Virginia Polytechnic Institute and State University, University Libraries, USA, anthony.atkins@vt.edu
  • Thorsten Bahne, Mathematics, University of Duisburg, Dissertation Online Project, GERMANY, bahne@math.uni-duisburg.de
  • Viviane Bouletreau, Sentiers, Université Lumiére, Lyon 2, FRANCE vboletr@univ-lyon2.fr
  • Wolfgang Coy, School of Informatics, Humboldt-University Berlin, GERMANY, coy@informatik.hu-berlin.de
  • Peter Diepold, Humboldt-University Berlin, School of Educational Science, GERMANY, diepold@educat.hu-berlin.de
  • Susanne Dobratz, Humboldt-University Berlin, Computing Center, GERMANY dobratz@rz.hu-berlin.de
  • Havard Fosseng, University of Oslo, Center for Information technology Services, NORWAY, havard.fosseng@usit.uio.no
  • Nuno Freire, National Library Portugal, PORTUGAL, nuno.freire@ext.bn.pt
  • Martin Hess, Informatik Uni Frankfurt: GERMANY hess@tm.informatik.uni-frankfurt.de
  • Uwe Müller, Humboldt-University Berlin, Computing Center GERMANY u.mueller@rz.hu-berlin.de
  • Phil Potter, University of Iowa, Academic Technologies (part of Information Technology Services), USA phil-potter@uiowa.edu
  • Sebastian Rahtz, TeX User Group, UK; sebastian.rahtz@computing-services.oxford.ac.uk
  • Paul W. Schaffner, University of Michigan at Ann Arbor, Library, USA, pfs@umich.edu
  • Matthias Schulz, Humboldt-University Berlin, Computing Center, GERMANY matthias.schulz.1@rz.hu-berlin.de
  • Peter Schirmbacher, Humboldt-University Berlin, Computing Center GERMANY schirmbacher@rz.hu-berlin.de
  • Tuija Sonkkila, Helsinki University of Technology, Library, FINLAND sonkkila@cc.hut.fi
  • Christof Steinbeck, Mack-Planck Institute of Chemical Ecology, Jena GERMANY, steinbeck@ice.mpg.de
  • Kerstin Zimmermann, University of Oldenburg; Department of Physics; GERMANY; kerstin@merlin.physik.uni-oldenburg.de

Last Updated on: by Anthony Atkins

upgrade your browser learn about Web standards