One document matched: draft-iab-doi-02.xml


<?xml version="1.0" encoding="US-ASCII"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd" [
  <!ENTITY rfc3650 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3650.xml">
  <!ENTITY rfc3651 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3651.xml">
  <!ENTITY rfc3652 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.3652.xml">
]>
<?xml-stylesheet type='text/xsl' href='rfc2629.xslt' ?>
<?rfc compact="yes" ?>
<?rfc subcompact="no" ?>
<?rfc toc="yes" ?>
<?rfc tocindent="yes" ?>
<?rfc tocdepth="2" ?>
<?rfc symrefs="yes" ?>
<?rfc sortrefs="yes"?>
<?rfc iprnotified="yes" ?>

<rfc category="info" docName="draft-iab-doi-02" ipr="trust200902">
  <front>
    <title abbrev="DOIs for RFCs">Assigning Digital Object Identifiers to RFCs</title>

    <author fullname="John Levine" initials="J." surname="Levine">
      <organization>Taughannock Networks</organization>

      <address>
        <postal>
          <street>PO Box 727</street>
          <city>Trumansburg</city>
          <code>14886</code>
          <region>NY</region>
        </postal>
        <phone>+1 831 480 2300</phone>
        <email>standards@taugh.com</email>
        <uri>http://jl.ly</uri>
      </address>
    </author>

    <date month="August" year="2014" />

    <area>RFC Editor</area>

    <keyword>DOI</keyword>
    <keyword>indexing</keyword>

    <abstract>
       <t>
	  The Digital Object Identifier (DOI) is a widely used system that
	  assigns unique identifiers to digital documents that can be queried
	  and managed in a consistent fashion.
	  We propose a method to assign DOIs to past and future RFCs.
       </t>
    </abstract>
  </front>
  <middle>
     <section title="Introduction">
	<t>
	  The Digital Object Identifier (DOI) is a widely used system that
	  assigns unique identifiers to digital documents that can be queried
	  and managed in a consistent fashion.  The structure of DOIs is
	  defined by <xref target="ISO-DOI">ISO 26324:2012</xref> and is implemented
	  by a group of registration agencies coordinated by the International DOI Foundation.
	</t>
	<t>
	   Each DOI is associated with bibliographic metadata about the object, including one or more URIs
	   where the object can be found.
	   The DOI system also provides many features not relevant to RFCs, such as the
	   ability to update the metadata after the DOI is assigned, and for organizations
	   to maintain local caches of metadata, e.g., a university or corporate
	   library that tracks its
	   copies of purchased documents so subsequent users don't buy them again.
	</t>
	<t>
	   The wide use of DOIs suggests that even though RFCs can be
	   downloaded directly from the IETF for free, organizations that use DOIs can
	   have trouble locating non-DOI documents.
	   DOIs with metadata that points to the
	   existing free online RFCs would make RFCs easier to find and use.
	   Some scholarly publishers accept DOIs as references in published documents,
	   and some versions of bibtex can automatically retrieve the bibliographic
	   data for a DOI and format it.
	   Hence DOIs would make RFCs easier to cite.
	</t>
	<t>
	   The benefits of DOIs apply equally to documents from all of the RFC submission streams,
	   so all RFCs would get DOIs.
	</t>
     </section>
     <section title="Structure and resolution of DOIs">
	<t>
	   DOIs are an application of the handle system defined by RFCs
	   <xref target="RFC3650" />, <xref target="RFC3651" />, and <xref target="RFC3652" />.
	   A DOI for an RFC might be
	</t>
	<figure>
	   <artwork>10.123456/rfc1149</artwork>
	</figure>
	<t>The first part of a DOI is the number 10, which means a DOI within the handle system,
	   a dot, and a unique number assigned to a publisher, in this example 123456.
	   This part is the DOI prefix.
	   Following that is a slash and a text string assigned by the publisher,
	   called the DOI suffix.
	   
	   Since the RFC Editor's series already
	   have numbers, it would be straightforward to use mnemonic suffixes.
	   DOIs might use the familiar series names and numbers, e.g., rfc1149.
	   (DOIs are case-insensitive.)
	   DOIs are treated as opaque identifiers, so the RFC Editor could use
	   some other naming scheme if it turned out to be advantageous to do
	   so.  The reliable way to find the DOI for an RFC would be to look it
	   up in the RFC index.
	</t>
	<t>
	   Although the handle system has its own protocol described in  <xref target="RFC3652" />,
	   the usual way to look up a DOI is to use web lookup.
	   A proposed "doi:" URN was never widely implemented, so the
	   standard way to look up a DOI is to use the public http proxy at http://dx.doi.org.
	   The sample DOI above could be looked up at:
	   </t>
	<figure>
	   <artwork>http://dx.doi.org/10.123456/rfc1149</artwork>
	</figure>
	<t>Whenever a publisher assigns a DOI, it provides the bibliographic metadata for the
	   object (henceforth called a document, since that what they are in this
	   context) to its registration agency which then makes it available to clients
	   that look up DOIs.
	   The document's metadata is typically uploaded to the registration agency in XML
	   using a web API.
	</t>
	<t>
	   Publishers have considerable flexibility as to what resides
	   at the URI(s) that a DOI refers to.
	   Sometimes it's the document itself, while for commercial publishers it's
	   typically a page with the abstract and bibliographic information, and some
	   way to buy the actual document.
	   Since some RFCs are in multiple formats (e.g., Postscript and text),
	   and some STDs and BCPs consist of multiple documents,
	   an appropriate URI would be that of the RFC Editor's info page that has
	   the document's abstract and links to the document(s) in various formats.
	   Hence the URI above would be set to redirect to
	</t>
	<figure>
	   <artwork>http://www.rfc-editor.org/info/rfc1149</artwork>
	</figure>
	<t>
	   More information on the structure and use of DOIs is in the
	   <xref target="DOI-HB">DOI Handbook</xref>.
	</t>
     </section>
     <section title="DOIs for RFCs">
	<t>Once the RFC series has DOIs assigned, it would be a good idea to include the DOI
	in the XML boilerplate of each RFC, which would both permit it to be displayed if
	a rendering engine wants to do that, and aid software that does things with DOIs.
	Online databases and indexes that include RFCs would be updated to include the DOI,
	e.g. the ACM Digital Library.
	(A practical advantage of this is that the DOI would link directly to the RFC
	Editor, rather
	than perhaps to a copy of an RFC behind a paywall.)
	</t>
	<t>
	   Since RFCs are immutable, existing RFCs still wouldn't mention their own DOIs within
	   the RFC itself, but putting the DOIs into indexes would still provide value.	   
	</t>
     </section>
     <section title="The process of assigning DOIs">
	<t>There are three phases to assigning DOIs to RFCs,
	   getting a DOI prefix, retroactively assigning DOIs to existing documents,
	   and updating the publication process to assign DOIs as new RFCs are published.
	</t>
	<section title="Getting a DOI prefix">
	   <t>
	   There are ten <xref target='DOI-RA'>registration agencies</xref> that assign
	   DOI prefixes.
	   Most of them serve specialized audiences or limited geographic areas, but there
	   are a few that handle scholarly and technical materials.
	   All registration agencies charge for DOIs to defray the cost of maintaining
	   the metadata databases.
	   The prices are fairly low, on the order of $660/year for membership,
	   15 cents per document deposit fees
	   for a bulk upload of the backfile (the existing RFCs), and $1/per document to deposit
	   them as they are published.
	   </t>
	</section>
	<section title="Retroactively assigning DOIs">
	   <t>
	      Other than paying the deposit fees,
	      assigning DOIs to all of the existing RFCs is primarily a software problem.
	      We'd need tools to extract or create the metadata for all of the RFCs and
	      submit it to the registration agency using the agency's online API.
	      Where we are aware of indexes and databases that include RFCs, we would
	      try to get them to include the DOI.
	   </t>
	</section>
	<section title="Assigning DOIs to new RFCs">
	   <t>
	      As new RFCs are published, the publication process will add steps to
	      collect and submit the bibliographic metadata to the registration agency.
	      This should be a largely or entirely automatic process, since it should be
	      possible to extract the metadata mechanically either from a document's
	      xml2rfc source, or from the existing catalogs of RFCs.
	   </t>
	</section>
	<section title="Use of DOIs in RFCs">
	   <t>
	      The DOI agency requests that documents that are assigned DOIs in turn
	      include DOIs when possible when referring to other organizations' documents.
	      Hence it would be a good idea to add a DOI field to the xml2rfc reference
	      entity, and request that authors provide DOIs when possible.
	      The RFC production center might add missing DOIs when it's easy to do so,
	      e.g., when the same reference with a DOI has appeared in a prior RFC, or
	      a quick online search finds the DOI.
	      With a DOI field in the xml2rfc reference structure, DOIs in references from citation
	      libraries would appear in the RFCs automatically once the DOIs are added to
	      the libraries.
	   </t>
	   <t>
	      Since it is usually possible to retrieve the bibliographic information
	      for a document from its DOI (see the discussion of bibtex above), it might
	      also be worth adding this feature to xml2rfc, so a reference with only a DOI
	      could be automatically fetched and expanded.
	   </t>
	</section>
     </section>
  </middle>
       
  <back>
    <references title="Informative References">
    &rfc3650;
    &rfc3651;
    &rfc3652;
    <reference anchor='ISO-DOI' target='http://www.iso.org/iso/catalogue_detail?csnumber=43506'>
       <front>
	  <title>ISO 26324:2012
	     Information and documentation -- Digital object identifier system</title>
	  <author><organization>International Organization for Standardization (ISO)</organization></author>
	  <date year='2012' />
       </front>
    </reference>
      
    <reference anchor='DOI-HB' target='http://www.doi.org/hb.html'>
       <front>
	  <title>DOI Handbook</title>
	  <author><organization>International DOI Foundation</organization></author>
	  <date year='2012' month='April' />
       </front>
    </reference>
    <reference anchor='DOI-RA' target='http://www.doi.org/registration_agencies.html'>
       <front>
	  <title>DOI Registration Agencies</title>
	  <author><organization>International DOI Foundation</organization></author>
	  <date year='2013' month='July' />
       </front>
    </reference>

    </references>
    <section title="Change Log">
      <section title="Changes from -01 to -02">
	 <t>Clarify submission process, multi-document DOIs.
	    Note all streams treated the same.
	    Remove unused reference.
	 </t>
      </section>
      <section title="Changes from -00 to -01">
        <t>DOI in the xml, not necessarily in the text
	</t>
        <t>Use of DOI in RFCs section.
	</t>
      </section>
    </section>
  </back>
</rfc>

PAFTECH AB 2003-20262026-04-24 08:53:21