Examining Attributes of Open Standard File Formats for Long-term Preservation and Open Access

Authors

  • Eun G Park McGill University
  • Sam Oh Sungkyunkwan University

DOI:

https://doi.org/10.6017/ital.v31i4.1946

Abstract

This study examines the attributes that have been used to assess file formats in literature and compiles the most frequently used attributes of file formats in order to establish open standard file format selection criteria.  A comprehensive review was undertaken to identify the current knowledge regarding file format selection criteria. The findings indicate that the most common criteria can be categorized into five major groups: functionality, metadata, openness, interoperability and independence. These attributes appear to be closely related. Additional attributes include presentation, authenticity, adoption, protection, preservation, reference and others. 

Author Biographies

Eun G Park, McGill University

associate professor in School of Information Studies, McGill University

Sam Oh, Sungkyunkwan University

Professor of Department of Library and Information Science, Sungkyunkwan University, Faculty Hall 40314
Seoul, 110-745, Korea

References

Library of Congress, “Sustainability of Digital Formats: Planning for Library of Congress Collections,” http://www.digitalpreservation.gov/formats/intro/intro.shtml (accessed November 21, 2011).

Global Digital Format Registry, http://www.gdfr.info/ (accessed November 21, 2011); The Technical Registry PRONOM, http://www.nationalarchives.gov.uk/aboutapps/pronom/ (accessed November 21, 2011).

Mike Folk and Bruce R. Barkstrom, “Attributes of File Formats for Long-Term Preservation of Scientific and Engineering Data in Digital Libraries” (paper presented at the Joint Conference on Digital Libraries (JCDL), Houston, TX, May 27-31, 2003), 1, http://www.larryblakeley.com/Articles/storage_archives_preservation/mike_folk_bruce_barkstrom200305.pdf (accessed November 21, 2011).

InterPARES 2 Project Glossary, p.24, http://www.interpares.org/ip2/ip2_term_pdf.cfm?pdf=glossary (accessed November 21, 2011).

Preservation Metadata: Implementation Strategies (PREMIS), PREMIS Data Dictionary for Preservation Metadata, 195, http://www.loc.gov/standards/premis/v2/premis-2-0.pdf (accessed November 21, 2011).

Ian Barnes, “Preservation of Word Processing Documents,” 4 (2006), http://www.apsr.edu.au/publications/word_processing_preservation.pdf (accessed November 21, 2011).

Ibid.

Gail Hodge and Nikkia Anderson, “Formats for Digital Preservation: A Review of Alternatives and Issues,” Information Services & Use 27 (2007): 46.

Folk and Barkstrom, “Attributes of File Formats for Long-Term Preservation of Scientific and Engineering Data in Digital Libraries.”

Barnes, “Preservation of Word Processing Documents.”

Carl Rauch, Harald Krottmaier, and Klaus Tochtermann, “File-Formats for Preservation: Evaluating the Long-Term Stability of File-Formats,” In Proceedings of the 11th International Conference on Electronic Publishing 2007 (Vienna, Austria, June 13-15, 2007): 101-106.

Susan J. Sullivan, “An Archival/Records Management Perspective on PDF/A,” Records Management Journal 16, no.1 (2006): 51-56.

Judith Rog and Caroline van Wijk, “Evaluating File Formats for Long-term Preservation,” 2008, http://www.kb.nl/hrd/dd/dd_links_en_publicaties/publicaties/KB_file_format_evaluation_method_27022008.pdf (accessed November 21, 2011).

D. K. Sahu, “Long Term Preservation: Which File Format to Use” (paper presented in Workshops on Open Access & Institutional Repository, Chennai, India, May 2-8, 2004), http://openmed.nic.in/1363/01/Long_term_preservation.pdf (accessed November 21, 2011).

CENDI Digital Preservation Task Group, “Formats for Digital Preservation: A Review of Alternatives and Issues,” http://www.cendi.gov/publications/CENDI_PresFormats_WhitePaper_03092007.pdf (accessed November 21, 2011).

Hodge and Anderson, “Formats for Digital Preservation: A Review of Alternatives and Issues.”

DAVID 4 Project (Digital ArchiVing, guIdeline and aDvice 4), “Standards for Fileformats,” 1, http://www.expertisecentrumdavid.be/davidproject/teksten/guideline4.pdf (accessed November 21, 2011).

Sullivan, “An Archival/Records Management Perspective on PDF/A.”

John Michael Potter, “Formats Conversion Technologies Set to Benefit Institutional Repositories,” http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.124.7881&rep=rep1&type=pdf (accessed November 21, 2011).

Eva Müller, et al., “Using XML for Long-Term Preservation: Experiences from the DiVA Project,” In Proceedings of the 6th International Symposium on Electronic Theses and Dissertations (May 20-24, 2003): 109-116, https://edoc.hu-berlin.de/conferences/etd2003/hansson-peter/HTML/index.html (accessed November 21, 2011).

Rene van Horik, “Image Formats: Practical Experiences” (paper presented in Erpanet Training, Vienna, May 10-11, 2004), 22, http://www.erpanet.org/events/2004/vienna/presentations/erpaTrainingVienna_Horik.pdf (accessed November 21, 2011).

Open standard is related to open access, which comes from the Open Access movement that allows resources to be freely available to the public and permits any user to use those resources (e.g. mainly electronic journals, repositories, databases, software applications, etc.) without financial, legal, or technical barriers to its access. See Amy E.C. Koehler, “Some Thoughts on the Meaning of Open Access for University Library Technical Services,” Serials Review 32, no. 1 (March 2006): 17 21; Budapest Open Access Initiative, “Budapest Open Access Initiative,” http://www.soros.org/openaccess/read.shtml (accessed November 21, 2011).

The National Archives, “Selecting File Formats for Long-term Preservation,” 6, http://www.kb.nl/hrd/dd/dd_links_en_publicaties/publicaties/KB_file_format_evaluation_method_27022008.pdf (accessed November 21, 2011).

Folk and Barkstrom, “Attributes of File Formats for Long-Term Preservation of Scientific and Engineering Data in Digital Libraries.”

Andreas Stanescu, “Assessing the Durability of Formats in a Digital Preservation Environment: the INFORM Methodology,” D-Lib Magazine 10, no. 11 (Nov. 2004), http://www.dlib.org/dlib/november04/stanescu/11stanescu.html (accessed November 21, 2011).

Malcolm Todd, “Technology Watch Report: File Formats for Preservation,” http://www.dpconline.org/advice/technology-watch-reports/ (accessed November 21, 2011).

Ibid.

Hodge and Anderson, “Formats for Digital Preservation: A Review of Alternatives and Issues.”

Edward M. Corrado, “The Importance of Open Access, Open Source, and Open Standards for Libraries,” Issues in Science and Technology Librarianship (Spring 2005), http://www.library.ucsb.edu/istl/05-spring/article2.html (accessed November 21, 2011).; Carl Vilbrandt, Galina Pasko, Alexander A. Pasko, Pierre-Alain Fayolle, Turlif Vilbrandt, Janet R. Goodwin, James M. Goodwin and Tosiyasu L. Kunii, “Cultural Heritage Preservation Using Constructive Shape Modeling,” Computer Graphics Forum 23, no. 1 (2004): 25-41; Marshall Breeding, “Preserving Digital Information,” Information Today 19, no. 5 (2002): 48-49.

Eun G. Park, “XML: Examining the Criteria to be Open Standard File Format,” (paper presented at the InterPARES 3 International Symposium, Oslo, Norway, September 17, 2010), http://www.interpares.org/display_file.cfm?doc=IP3_isym04_presentation_3-3_korea.pdf (accessed November 21, 2011).

Adrian Brown, “Digital Preservation Guidance Note: Selecting File Formats for Long-Term Preservation,” http://www.nationalarchives.gov.uk/documents/selecting-file-formats.pdf (accessed November 21, 2011).; Barnes (2006), Sahu (2006), and Potter (2006).

Stephen Abrams, et al., “PDF-A: The Development of a Digital Preservation Standard” (paper presented at the 69th Annual Meeting for the Society of American Archivists, New Orleans, Louisiana, August 14-21, 2005), http://www.aiim.org/documents/standards/PDF-A.ppt (accessed November 21, 2011).; Suillivan (2006), CENDI (2006), and Hodge & Anderson (2007).

The National Archives, http://www.kb.nl/hrd/dd/dd_links_en_publicaties/publicaties/KB_file_format_evaluation_method_27022008.pdf (accessed November 21, 2011).; ECMA International, “Office Open XML file formats – ECMA-376,” http://www.ecma-international.org/publications/standards/Ecma-376.htm (accessed November 21, 2011).

Christoph Becker, et al., “Systematic Characterisation of Objects in Digital Preservation: The Extensible Characterisation Languages,” http://www.jucs.org/jucs_14_18/systematic_characterisation_of_objects/jucs_14_18_2936_2952_becker.pdf (accessed November 21, 2011).; The National Archives, http://www.kb.nl/hrd/dd/dd_links_en_publicaties/publicaties/KB_file_format_evaluation_method_27022008.pdf (accessed November 21, 2011).

Folk and Barkstrom, “Attributes of File Formats for Long-Term Preservation of Scientific and Engineering Data in Digital Libraries.”

The National Archives, http://www.kb.nl/hrd/dd/dd_links_en_publicaties/publicaties/KB_file_format_evaluation_method_27022008.pdf (accessed November 21, 2011).; Rog and van Wijk, “Evaluating File Formats for Long-term Preservation.”

Rog and van Wijk, “Evaluating File Formats for Long-term Preservation.”

See Brown, “Digital Preservation Guidance Note: Selecting File Formats for Long-Term Preservation,” http://www.nationalarchives.gov.uk/documents/selecting-file-formats.pdf (accessed November 21, 2011).; Barnes (2006), Sahu (2006) and Potter (2006).

Stephen Abrams, et al., “PDF-A: The Development of a Digital Preservation Standard” (paper presented at the 69th Annual Meeting for the Society of American Archivists, New Orleans, Louisiana, August 14-21, 2005), http://www.aiim.org/documents/standards/PDF-A.ppt (accessed November 21, 2011).; Suillivan (2006), CENDI (2006), and Hodge & Anderson (2007).

Todd, “Technology Watch Report: File Formats for Preservation,” 33.

Evelyn Peters McLellan, “Selecting Digital File Formats for Long-Term Preservation: InterPARES 2 Project General Study 11 Final Report,” http://www.interpares.org/display_file.cfm?doc=ip2_file_formats(complete).pdf (accessed November 21, 2011).

Downloads

Published

2012-12-12

How to Cite

Park, E. G., & Oh, S. (2012). Examining Attributes of Open Standard File Formats for Long-term Preservation and Open Access. Information Technology and Libraries, 31(4), 46–67. https://doi.org/10.6017/ital.v31i4.1946

Issue

Section

Articles