ARMS Bulletin #7: December 2001


RM Home

ARMS Bulletins

RM Manual

General Records Schedules

Electronic Records Forum

Resources /
News / Links

Policies & Procedures

Contact &
Location Info

Site Search

Site Map


425 Steenbock Library
550 Babcock Drive
Madison, WI 53706-1201
Phone: (608) 262-5629
FAX: (608) 262-8899
Email: Records Management 

© 1996-2007 by the Regents of the University of Wisconsin.



Last updated:
January 4, 2002

Originally published:
January 4, 2002

 

Reproduction of University Records Using Digital Imaging Systems

Guidelines issued by the University Archives and Records Management Services and endorsed by the Campus Records Review Group.


University departments and units have access to a variety of information and records storage technologies. The choices they make have significant ramifications for the ongoing management of University records and information. Because of frequent technological change, including hardware and software obsolescence, University offices must take steps to manage and protect their records for as long as they are needed to meet operational needs and protect the legal, financial and historical interests in their records. For campus units in the process of deciding upon the appropriateness of digital imaging technology, this Bulletin answers several basic questions about this technology and provides a set of recommendations that address: project planning, systems specifications and selection, and system implementation.


Introduction

University departments have a wide variety of information and records storage choices. These choices bear directly upon how well a department will be able to access their information and records now and in the future. Increasingly, the following questions are being asked: "Is it acceptable to scan University records and store them on CD?" "Will the University Archives accept CDs for archival preservation?" "If I scan my documents and burn a CD, is there a set of criteria that I should follow?" "Is a CD a legally acceptable storage medium for official records?" This Bulletin responds to these questions and provides recommendations for the development and implementation of imaging systems for University records.

The Guidelines are advisory and not intended to serve as a set of requirements; however, the degree to which they are incorporated into system design will greatly affect the long-term accessibility of the electronic records involved. National technical standards and professional best practices form the background for the recommendations. Many of them are listed at the end of this Bulletin.

return to top


Background

What is meant by document scanning? What is meant by burning a CD?

There are two types of digital imaging: 1) imaging which is essentially taking a picture of a document and storing the image or 2) imaging to convert the image from analog to machine readable data.

In the context of these Guidelines digital imaging refers to the electronic technology that captures an image of a document and converts it to an electronic format for storage. It is a reproduction technology. It requires a scanner for image recognition; computer software, memory, and storage; and graphic terminals and laser printers to reproduce the stored images. Normally, some type of data compression software is used to compress the images prior to storage.

Compact Disc, or CD, technology is one of many information technologies that have become available for information storage. The technology is relatively easy to use and affordable. A CD-R is a type of compact disc technology. It is normally made of a rigid plastic and the more recent versions have high-density storage capacities. CDs are said to be "burned" because in placing data on the CD minute holes are burned into the plastic surface of the CD. CDs have become a popular storage medium because they can easily store large quantities of data and documents. The technology to scan and "burn" CDs can easily be configured into an office computer desktop. It permits offices to remove information from their computer hard drive or LAN and store it near line, keeping it accessible.

Is it acceptable to image University records and store them on CD?

ANSWER: Only if the unit responsible for the record has developed procedures to assure the CD will be readable during its required retention life.

University records regardless of their format are subject to Wisconsin Public Records laws and University rules with regard to records retention and disposition. University officials (including chairs, deans, and unit heads) are responsible for managing official records so that they remain accessible throughout their retention life. THe establishment of a records retention schedule determines the retention life; see ARMS Manual, Chapter 3 (http://www.library.wisc.edu/libraries/Archives/rm/manual/ManualRDA_ch3_.html) for instructions on developing a records schedule.

Wisconsin Public Records Laws do not require that records be created, maintained, or stored in any particular format. Administrative Rule 12, however, does require that records that are maintained ONLY in electronic media meet certain criteria to insure their access, retrievability, and authenticity. See the ARMS FAQ on Administrative Rule 12 (http://www.library.wisc.edu/libraries/Archives/rm/faq/faq_adm12.htm) or the Primer on Administrative Rule 12 (http://enterprise.state.wi.us/static/erecords/Primer.htm) for further description.

Are CDs acceptable as evidence in legal proceedings?

ANSWER: It is not the CD itself that will be legally acceptable; it is the information contained on the CD that may be utilized in a legal proceeding and therefore subject to the rules of evidence (see ARMS Bulletin #2 for a discussion of rules of evidence and electronic record keeping).

The issue of greatest concern is readability; can the information be accessed in the event of a court action? Rules about electronic records and record keeping are changing. Many State and Federal agencies have established rules and regulations about the maintenance of records. It is therefore advisable to monitor legal activities within your particular community or industry to determine what may or may not be legally acceptable.

It is also important to remember that all types of documentation can be subject to subpoena and discovery action. Courts are generally unsympathetic to a government office that cannot retrieve specific files or records and may rule unfavorably against the party who has lost the record.

What are the implications of storing University records exclusively on CD?

ANSWER: There are several implications for storing University records on CD, and University departments need to be aware of the implications of such a move and develop procedures to meet the rules.

First, it is important to remember that University records are in large part public records. State records law does not mandate that records be maintained in any particular format. Wisconsin's recent administrative rule on electronic records management does require, however, that in instances in which State agencies choose to retain their records in electronic formats ONLY, certain criteria must be met. In other words, if the image copy will be the only record retained or the paper original will be destroyed following the imaging process, the requirements outlined in Admin Rule 12 apply. In essence, the rule requires that records remain accessible, retrievable, and authentic throughout their retention life.

Electronic records require far more care and maintenance than their hard copy counterparts. For example, the signed, original time sheets must be maintained for a period of 5 years and then can be destroyed. Throughout that 5 years, the time sheets must remain accessible and retrievable if necessary to satisfy administrative need or in the case of audits. If the time sheets are imaged and stored on a CD, the department must make sure that the CD is readable throughout that 5-year retention life. To insure readability, the office will need to retain hardware and software that allow readability.

return to top


Recommended Procedures for University Departments and Other Units Considering Digital Imaging for Records Storage

The text of the following recommendations are extensively patterned after the "Digital Imaging Guidelines for State Government Records" - Kansas State Historical Society. We are grateful to them for allowing us to use them in preparing this Bulletin. Some language and references to specific processes have been modified to fit those of Wisconsin State government.

A. Project Planning

Recommendation 1: Prior to selecting a digital imaging system, conduct a records and workflow analysis to determine and document existing and planned agency information needs.

The examination of existing workflow patterns and records is the crucial first step in determining the need for a digital imaging system. A records analysis assesses existing operations to determine what records are best suited for digital imaging applications. A workflow analysis assesses the processes of records creation, access, and retrieval to determine areas where re-engineering can improve operational efficiency. This reorganization of business or work processes may be simple or extensive in approach. Implementing a digital imaging system significantly impacts the current work processes because personnel create, retrieve, use, and store documents in a different way. The detail and complexity of the process of re-engineering affects the project schedule, cost justification, and the bid specifications.

The University Archives and Records Management Service can assist in analyzing a department's record keeping systems.

Recommendation 2: Prior to selecting a digital imaging system, conduct a cost benefit analysis to determine the cost justification of a system purchase and to determine the possible benefits to the agency with its implementation.

Cost justifying a digital imaging system allows a financial comparison between the current and proposed record keeping systems to help in making a procurement decision. The cost-justification goal of a digital imaging system is to offset the cost of the equipment and software by reducing personnel and storage costs or allowing the existing staff to process more work through the improvement of work processes. To determine a cost estimate, the following components should be considered: system hardware, system software, application software, communications hardware and software, system maintenance, training, project management, facilities upgrades/site preparation, staffing costs, and other miscellaneous costs. A typical cost justification includes the following major areas: a study of current operations, a proposed system architecture, equipment pricing, and financial indicators, including payback period, net present value, and rate of return.

return to top

B. System Specifications and Selection

Recommendation 3: Require "open systems architecture" for digital imaging applications or require vendors to provide a bridge to systems with non-proprietary configurations.

Although the term "open systems architecture" is defined in various ways, public officials should follow a system design approach that permits future component upgrades with minimal degradation of system functions. This open system architecture allows the system to be upgraded over time without a significant risk of records loss. It also supports the importing and exporting of digital images to and from other sources. One key factor in achieving an open systems architecture is the adoption of non-proprietary standards. The flexibility of an open systems architecture helps enable long-term records to be accessed and transferred from one hardware or software platform to another.

Recommendation 4: Where data longevity or records integrity is a primary concern, use a recording media that is not rewritable.

The storage capacity of optical disks versus paper is a primary advantage to the use of digital imaging systems. However, optical disks are not the only option. Other storage solutions that can be used with digital imaging systems include output to microfiche or microfilm, digital tape, and magnetic disks. The selection of a storage media may depend on budget considerations for the unit.

When selecting optical storage media, the issues of data longevity and integrity must be considered. There are a variety of optical disks on the market today: Write Once Read Many (WORM), Rewritable, and Compact Disk-Read Only Memory (CD-ROM)/Compact Disk-Recordable (CD-R). Each has its own advantages and disadvantages. WORM and CD-R are not rewritable. Computer Output to Laser Disk (COLD) is also a non-rewritable option. These media offer a high level of data security because alteration of data is not achievable without destruction of the media itself. If a record is no longer needed, software may allow the pointer to the data to be disabled, preventing normal access. Because the data cannot truly be deleted, however, it may remain accessible by other means.

Recommendation 5: Use a non-proprietary digital image file format. If using a proprietary format, provide a bridge to a non-proprietary digital image file format.

A digital image file format is a structured container for information about each digital image and the image data. Information about the digital image file includes, but is not limited to, its name, width, length, resolution, and compression techniques. The computer requires this information to interpret the digital image. It is essential to use a non-proprietary image file format to ensure the ability to transfer successfully digital images between different systems or when a system is upgraded or modified.

American National Standards Institute (ANSI)/Association for Information and Image Management (AIIM) MS53-1993, Standard Recommended Practice - File Format for Storage and Exchange of Images - Bi-Level Image File Format: Part I details a standard definition for file formats. Despite the existence of a standard, there is not an agreed-upon, industry-wide image format standard. Many digital imaging systems use the Tagged Image File Format, or TIFF. Because different versions of TIFF exist (TIFF-5, TIFF-4, etc.), there is still no absolute guarantee that images can be transported seamlessly from one system to another. Comprehensive documentation of the digital image file format, including TIFF, is recommended.

A number of other file formats exist, such as Graphics Interchange Format (GIF), Joint Photographic Experts Group (JPEG), and Bitmap (BMP). These file formats are commonly used in conjunction with hypertext markup language (HTML) for Internet and intranet applications. Many systems or third-party graphics packages will convert images from one to another, although often with unpredictable results.

Recommendation 6: Use International Telecommunications Union (ITU) Group 3 and Group 4 compression techniques or have the vendor provide a bridge to these techniques.

The large file sizes of typical scanned documents require digital image compression to support data transmission and to promote storage efficiency. Today most digital imaging systems use standard compression algorithms to "shrink" images. Standard compression techniques are instrumental in ensuring a migration strategy for records needed for long-term use. Two international standards are currently available. Using compression techniques conforming to either of these specifications will increase the likelihood that the images can be used with other technologies or migrated between systems.

Recommendation 7: When determining document-scanning resolution, consider data storage requirements, document scanning throughput rates, and the accurate reproduction of the image. Validate vendor claims using a sampling of the unit's documents.

A digitized image consists of black and white dots or picture elements (pixels) measured in dots per inch (dpi). The higher the number of dpi, the higher the legibility of the reproduced image will be. Images scanned at higher dpi rates, however, use more storage space on the disk and may require longer scanning times. The selection of scanning density involves a trade-off between image clarity, storage capacity, and speed. When selecting a scanner, ask the vendor to perform a quality test on a broad sampling of documents at various dpi settings so that an appropriate end-to-end throughput rate and resolution can be determined.

For good quality images in imaging modern office records, use a scanning density of at least 300 dpi. A higher scanning density (600 dpi or higher) is appropriate for deteriorating documents, and documents with a visual element such as engineering drawings, maps, or documents with background detail. The display resolution of the inspection/verification monitor and printer should match the scanning density of the document scanner. When scanning continuous tone images, such as photographs, maps, and illustrations, use gray scale or color imaging technology.

Recommendation 8: Select equipment that conforms to the standard methodology for media error detection and correction. The system should provide techniques for monitoring and reporting verification of the records stored on a digital optical disk, and the system administrator should actively follow the status of the monitors.

Digital imaging technology uses two methods within the Error Detection and Correction (EDAC) system to minimize digital image recording and retrieval errors. The first method uses error correction codes to detect and correct data read errors automatically. The second employs correction code software to determine if and when the utilization of error correction codes is approaching a critical point. Monitoring the error correction status information provides an audit trail to measure the progress and degree of disk degradation. Tracking error correction trends will indicate an appropriate timetable for recopying disks.

The Association for Information and Image Management's (AIIM) Standards Committee has developed a standardized methodology for reporting the error rate data to the operating system for user evaluations. ANSI/AIIM MS 59-1996, Media Error Monitoring and Reporting Techniques for Verification of Stored Data on Optical Digital Data Disks, describes these standards.

Another precaution against losing long-term records because of defective disks is to require the use of digital optical disks with a guaranteed minimum shelf life of five years and a minimum post-write life of twenty years.

Recommendation 9: Specify that the Small Computer System Interface (SCSI) command "Write and Verify" be used when writing data to digital optical disks.

The "Write and Verify" command, available within the Small Computer System Interface (SCSI), is valuable for assessing how accurately the scanned information is transferred from the central processing unit of the computer to the digital optical disk. "Write and Verify" requires verification from the system that the digital image is correctly written to the disk and provides additional protection for continued access to long-term records.

Recommendation 10: Use an indexing database that provides for efficient retrieval, ease of use, and up-to-date information about the digital images stored in the system. The indexing database should be selected after an analysis of unit operations and user needs.

Reliable access to scanned images depends on an accurate, up-to-date index database. Indexing a digital image involves linking descriptive image information with header file information. Normally, index data is manually key-entered using the original documents or the scanned images, either at the time of image capture or later in the production process. Index data verification, in which database entries are compared with the original source documents for completeness and accuracy, is crucial because an erroneous index term may result in the inability to retrieve related images.

Recommendation 11: Provide specific plans for an ongoing process of migrating long-term and archival records from older to newer hardware and software platforms.

Agencies must ensure that their long-term and archival records are continually accessible, (Wisconsin Administrative Rule 12; see FAQ at http://www.library.wisc.edu/libraries/Archives/rm/faq/faq_adm12.htm). Systems as physical devices could be operational for ten years or more, but system technology will often be superseded within two to three years. If the system stores records with retention periods exceeding the life span of the hardware and software, it is essential that the administrator plan for future data migration. A migration strategy documents how an organization will transfer long-term and archival records from one generation of hardware and software to another generation without losing system functionality. The strategy should be written and available with current system documentation.

Current strategies for migrating digital imaging system records include: upgrading equipment and software as technology evolves and periodically recopying disks as required; recopying optical disks based upon projected longevity and/or periodic verification of the records; or, transferring the data from an obsolete generation of optical disks to a newly-emerging technology, in some cases bypassing the intermediate generation that is mature but at risk of becoming obsolete.

return to top

C. System Implementation

Recommendation 12: Assign a permanent staff member as systems administrator and require the vendor to provide a project director during the installation and training periods.

The assignment of a qualified staff member, preferably with systems administration experience, is critical to the effective implementation and maintenance of a digital imaging system. The systems administrator should be responsible for overall project management, and the development and maintenance of written system documentation that describes the requirements, capabilities, limitations, design, operation, and maintenance of the digital imaging system. Making a vendor representative responsible for installing the equipment and training the systems administrator and other appropriate unit staff will help to ensure successful implementation of the system.

Recommendation 13: Establish operational practices and provide technical and administrative documentation to ensure the future usability of the system, continued access to long-term records, and a sound foundation for assuring the system's legal integrity.

It is the responsibility of office administrators, rather than vendors and manufacturers, to maintain written documentation of system procedures, also called Standard Operating Procedures or SOPs, including access and security policies and procedures. Security and access policies should be developed to protect the system and the records from alteration or unauthorized use.

In regard to legal admissibility and trustworthiness, records stored on a digital imaging system should be treated no differently than records stored on magnetic disk or tape. The key is for the systems administrator to become familiar with how the rules of evidence apply to such records. Procedural controls should be established and followed to protect the integrity of the records.

These procedural controls should be documented and should reflect requirements for the legal acceptance of records as outlined in AIIM TR31-1992, Performance Guideline for the Admissibility of Records Produced by Information Technology Systems as Evidence. This AIIM performance guideline stresses the importance of specifying the processes used to create the records, demonstrating that records are produced and relied upon in the regular course of business, establishing quality control and audit procedures, conducting formal training programs, and providing written documentation for each procedure. Case histories indicate that system requirements for good archival maintenance are consistent with the requirements for the admission of records under the "rules of evidence" laws. Records administrators should be familiar with how the rules of evidence apply to Wisconsin's public records laws. Policies and procedures should be followed to protect the integrity of long-term records.

Recommendation 14: Perform a visual quality control evaluation of each scanned image and related index data. Write the scanned image to optical media only after the evaluation process is completed.

To help ensure the integrity of long-term and archival records stored on the system, staff members should perform a visual quality evaluation of each index entry and scanned image before writing the digital image to optical media. Overall system quality control is best when the scanned image is temporarily stored on magnetic media, permitting corrections through rescans as needed. Depending on the system configuration, corrections may be performed at the scanner capture station or at designated inspection/rescan workstations. Training and supervision of the operations staff is a key factor in maintaining acceptable image and index quality as well as user satisfaction with the system.

When the system is operational, a routine scanning quality test, as outlined in ANSI-AIIM MS44-1988 (R1993), Recommended Practice for Quality Control of Image Scanners, should be performed on a weekly or monthly basis.

Recommendation 15: Design backup procedures to create security copies of digitized images and their related index records.

System component reliability is critical to system success. Prolonged or repetitive downtime can seriously affect office operations. Creating a duplicate copy of records in another format or another system is an effective method of ensuring access to long-term information. Backup copies also support system integrity and legal admissibility requirements. The government office may select the backup storage media (optical, magnetic, paper, or microform) that best meets the office's records requirements. Security copies of the records should be stored in an offsite, environmentally controlled location.

Recommendation 16: Provide adequate environmental conditions for the digital optical disks.

Even in an optimum environment, digital optical disks are susceptible to deterioration. Adverse storage conditions, especially high humidity, can cause rapid deterioration of the media. A prudent storage guideline for digital optical disks is to adhere to the temperature and humidity levels recommended for magnetic media storage. Technical specialists recommend a stable environment, with a temperature between 65 and 75 degrees, and a relative humidity between 30 and 50 per cent. Digital optical disks should never be stored in direct sunlight nor placed near sources of heat.

Dust, debris, and fingerprints affect digital optical disks. Plastic cartridges should never be removed; nor should the cartridge shutter be opened to expose the digital optical disk's recording surface. To protect disks from warping, they should not be subject to pressure and should be stored in an upright position when not in the disk drive.

Agency officials should request that the vendor supply specifications for the storage of digital optical disks and ensure that office conditions meet these specifications during installation of a system.

Recommendation 17: Budget annually between fifteen and twenty per cent of the original system acquisition cost for upgrades, training, and maintenance.

Administrative managers should be aware of the high cost of maintaining and upgrading digital imaging systems. Unless these costs are factored into the continuing support of system maintenance and improvement, the system is in danger of becoming obsolete and requiring a far greater cost outlay to restore its effectiveness. Also, records stored in an outdated system tend to be at greater risk than those in a well-supported system. Continued planning and budgeting for the migration of long-term and archival records, as discussed in Recommendation 11, is essential for the success of any digital imaging project.

return to top


Appendix A - A Pre-Planning Checklist

The following checklist of questions can be used to guide your pre-planning efforts:

  • Who are your current and future users?
  • Will the digital images act only as "finding aids" (i.e., after locating a desired document through a search of the image database, will your users be referred to the original document for study/research)? Or, will the digital images serve as surrogates for the deteriorating originals, which will be stored or discarded?
  • Have the originals already been photographed? If so, can the slides/transparencies serve as source documents for digitization? Were color/gray-scale targets used during photography? How high is the photographic quality? What is the condition of the film?
  • What are your quality expactations? How do you define a "good image"? How much detail do you need to capture (e.g., do you need to capture the fine details such as the lace work around a woman's sleeve in a portrait, or fine lines in a pencil sketch?)? do digital images need to capture the colors faithfully, or can you (and your users) tolerate a minor color shift?
  • Will you provide Web access to the collection?
  • What are your users' hardware/software capabilities and network configurations? Will their equipment and network support the delivery of images in a timely fashion which retaining the image quality?

The above listing is taken from: http://www.library.cornell.edu/preservation/kodak/kodak-htm.htm#technology
Using Kodak Photo CD Technology for Preservation and Access: A Guide for Librarians, Archivists, and Curators by Anne R. Kenney and Oya Y. Rieger.

return to top


Appendix B - List of Sources

CD Technology Main
http://www.disctronics.co.uk/cdref/cdmain.htm

Digital Projects Guidelines (Arizona State Library, Archives and Public Records)
http://www.lib.az.us/digital/dg_a13.html

The following ANSI/AIIM standards can be purchased through http://www.ansi.org.

ANSI/AIIM MS44 (1988, R1993): Recommended Practice for Quality Control of Image Scanners

ANSI/AIIM MS54 (1993): Standard Recommended Practice - File Format for Storage and Exchange of Images - Bi-Level Image File Format: Part 1

ANSI/AIIM MS56 (1996): Media Error Monitoring and Reporting Techniques for Verification of Stored Data on Optical Digital Data Disks

ANSI/AIIM TR31 (1992): Performance Guideline for the Admissibility of Records Produced by Information Technology Systems as Evidence

return to top

ARMS Home | RM Home | RM Manual | General Schedules | ARMS Bulletins | Resources/News/Links | FAQ: Fact Sheets | Workshops & Training | Forms