ISO 28500

Information and documentation - WARC file format

active, Most Current
Organization: ISO
Publication Date: 1 August 2017
Status: active
Page Count: 34
ICS Code (IT applications in information, documentation and publishing): 35.240.30
Scope:

This document specifies the WARC file format:

- to store both the payload content and control information from mainstream Internet application layer protocols, such as HTTP, DNS, and FTP;

- to store arbitrary metadata linked to other stored data (e.g. subject classifier, discovered language, encoding);

- to support data compression and maintain data record integrity;

- to store all control information from the harvesting protocol (e.g. request headers), not just response information;

- to store the results of data transformations linked to other stored data;

- to store a duplicate detection event linked to other stored data (to reduce storage in the presence of identical or substantially similar resources);

- to be extended without disruption to existing functionality;

- to support handling of overly long records by truncation or segmentation, where desired.
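This listing does not reproduce the record layout itself. As a rough illustration of the kind of record the scope describes, the Python sketch below assumes the commonly documented WARC layout: a version line, named header fields, a blank line, a content block of exactly Content-Length bytes, and two trailing CRLFs. The function names build_record and parse_record are hypothetical, the "resource" record type and header set are chosen only for the example, and compression, digests, truncation, and segmentation are all omitted.

```python
import uuid
from datetime import datetime, timezone

CRLF = b"\r\n"


def build_record(record_type: str, target_uri: str, payload: bytes,
                 version: str = "WARC/1.1") -> bytes:
    """Serialize one uncompressed record (hypothetical helper, not a library API)."""
    headers = [
        ("WARC-Type", record_type),
        ("WARC-Record-ID", f"<urn:uuid:{uuid.uuid4()}>"),
        ("WARC-Date", datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("WARC-Target-URI", target_uri),
        ("Content-Type", "application/octet-stream"),
        ("Content-Length", str(len(payload))),
    ]
    head = version.encode("ascii") + CRLF
    for name, value in headers:
        head += f"{name}: {value}".encode("utf-8") + CRLF
    # Blank line ends the header; two CRLFs follow the content block.
    return head + CRLF + payload + CRLF + CRLF


def parse_record(data: bytes):
    """Parse a single record produced by build_record (assumes no folded header lines)."""
    head, _, rest = data.partition(CRLF + CRLF)
    lines = head.split(CRLF)
    version = lines[0].decode("ascii")
    fields = {}
    for line in lines[1:]:
        # Split on the first colon only; values such as URIs contain colons.
        name, _, value = line.decode("utf-8").partition(":")
        fields[name.strip()] = value.strip()
    length = int(fields["Content-Length"])
    # Content-Length, not a delimiter search, decides where the block ends.
    return version, fields, rest[:length]


if __name__ == "__main__":
    raw = build_record("resource", "http://example.com/", b"hello")
    version, fields, payload = parse_record(raw)
    print(version, fields["WARC-Type"], payload)
```

Reading the content block by its declared length rather than by searching for a terminator is what lets a record carry arbitrary binary payloads, including bytes that happen to look like a record boundary.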

Document History

ISO 28500
August 1, 2017
Information and documentation - WARC file format
May 15, 2009
Information and documentation — WARC file format
This International Standard specifies the WARC file format: to store both the payload content and control information from mainstream Internet application layer protocols, such as HTTP, DNS,...
