NOTE: inner whitespace is significant.
(*) Some derivatives only apply to certain collections.
Derivatives for Movies Items
If your source file is format: | . . . then we will try to derive the following formats: | |||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AVIF Thumbnails ZIP | Censor as Music JSON | Cinepack | Closed Caption Text | Closed Caption Text Disc | Closed Caption Text OCR | Frictionless Data Package Descriptor JSON | h.264 | h.264 720P | h.264 HD | h.264 IA | h.264 popcorn | Image-Only PDF Metadata JSON | JSON | JSON SRT | MP3 | MPEG1 | MPEG2 | MPEGTS/Thumbnail | NRT Processed | Page Numbers JSON | Speech Confidence JSON | Speech VS Music JSON | SubRip | Text PDF Metadata JSON | Thumbnail | TV Align SubRip | VOB | |
3GP | movies | movies (*) | movies | |||||||||||||||||||||||||
56Kb QuickTime | movies | movies (*) | movies | |||||||||||||||||||||||||
64Kb MPEG4 | movies (*) | movies | ||||||||||||||||||||||||||
64Kb QuickTime | movies | movies (*) | movies | |||||||||||||||||||||||||
256Kb MPEG4 | movies (*) | movies | ||||||||||||||||||||||||||
256Kb QuickTime | movies | movies (*) | movies | |||||||||||||||||||||||||
512Kb MPEG4 | movies | |||||||||||||||||||||||||||
Closed Caption Text | movies (*) | movies (*) | ||||||||||||||||||||||||||
Democracy Now Text | movies (*) | |||||||||||||||||||||||||||
DivX | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
DVD Info | movies (*) | |||||||||||||||||||||||||||
DV Video | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
Flash Video | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
FLV 400k | movies | movies | ||||||||||||||||||||||||||
h.264 HD | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
h.264 MPEG4 | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
h.264 TV | movies (*) | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
h.264/MPEG2-TS | movies (*) | movies (*) | movies | movies (*) | movies (*) | movies (*) | movies | |||||||||||||||||||||
ISO Image | movies (*) | movies | movies (*) | movies (*) | movies | |||||||||||||||||||||||
IV50 | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
LinkTV FLV (512k) | movies | movies | ||||||||||||||||||||||||||
Matroska | movies | movies (*) | movies (*) | movies (*) | movies | |||||||||||||||||||||||
Metadata | movies (*) | |||||||||||||||||||||||||||
Micro Cards JP2 ZIP | movies | |||||||||||||||||||||||||||
Movie Frames | movies | |||||||||||||||||||||||||||
MPEG-4 Audio | movies (*) | |||||||||||||||||||||||||||
MPEG1 | movies (*) | movies (*) | movies (*) | movies | movies (*) | movies (*) | movies | |||||||||||||||||||||
MPEG2 | movies (*) | movies (*) | movies (*) | movies | movies (*) | movies (*) | movies (*) | movies | ||||||||||||||||||||
MPEG2-TS | movies (*) | movies | movies (*) | movies (*) | movies | |||||||||||||||||||||||
MPEG4 1.5Mbps | movies | movies | ||||||||||||||||||||||||||
MPEGTS | movies (*) | movies | ||||||||||||||||||||||||||
NRT | movies (*) | movies | ||||||||||||||||||||||||||
Ogg Theora | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
Ogg Video | movies | movies | ||||||||||||||||||||||||||
QuickTime | movies | movies (*) | movies (*) | movies (*) | movies | |||||||||||||||||||||||
QuickTime 1.5Mbps | movies | movies | ||||||||||||||||||||||||||
QuickTime 1Mbps | movies | movies | ||||||||||||||||||||||||||
Real Media | movies | movies (*) | movies (*) | movies | ||||||||||||||||||||||||
Speech Confidence JSON | movies | |||||||||||||||||||||||||||
Speech VS Music JSON | movies (*) | |||||||||||||||||||||||||||
SubRip | movies (*) | |||||||||||||||||||||||||||
VBR MP3 | movies (*) | |||||||||||||||||||||||||||
Web Archive Collection Zipped | movies | |||||||||||||||||||||||||||
WebM | movies | movies (*) | movies (*) | movies (*) | movies | |||||||||||||||||||||||
Web Video Text Tracks | movies | |||||||||||||||||||||||||||
Windows Media | movies | movies (*) | movies (*) | movies (*) | movies |
Derivatives for Audio Items
If your source file is format: | . . . then we will try to derive the following formats: | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
64Kbps MP3 | ASR | Columbia Peaks | Flac | Intermediate ASR JSON | LCP Encrypted Audiobook | MP3 Sample | MPEG-4 Audio | PNG | Spectrogram | VBR MP3 | Whisper ASR JSON | |
3GP Audio | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
8Kbps MP3 | audio (*) | |||||||||||
16Kbps MP3 | audio (*) | |||||||||||
24bit Flac | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
24Kbps MP3 | audio (*) | |||||||||||
32Kbps MP3 | audio (*) | audio (*) | ||||||||||
40Kbps MP3 | audio (*) | |||||||||||
48Kbps MP3 | audio (*) | audio (*) | ||||||||||
56Kbps MP3 | audio (*) | |||||||||||
64Kbps MP3 | audio (*) | |||||||||||
72Kbps MP3 | audio (*) | |||||||||||
80Kbps MP3 | audio (*) | |||||||||||
96Kbps MP3 | audio (*) | audio (*) | audio (*) | |||||||||
104Kbps MP3 | audio (*) | |||||||||||
112Kbps MP3 | audio (*) | |||||||||||
120Kbps MP3 | audio (*) | |||||||||||
128Kbps MP3 | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
136Kbps MP3 | audio (*) | |||||||||||
144Kbps MP3 | audio (*) | |||||||||||
152Kbps MP3 | audio (*) | |||||||||||
160Kbps MP3 | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
168Kbps MP3 | audio (*) | |||||||||||
176Kbps MP3 | audio (*) | |||||||||||
184Kbps MP3 | audio (*) | |||||||||||
192Kbps MP3 | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
200Kbps MP3 | audio (*) | |||||||||||
208Kbps MP3 | audio (*) | |||||||||||
216Kbps MP3 | audio (*) | |||||||||||
224Kbps MP3 | audio (*) | |||||||||||
232Kbps MP3 | audio (*) | |||||||||||
240Kbps MP3 | audio (*) | |||||||||||
256Kbps MP3 | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
288Kbps MP3 | audio (*) | |||||||||||
320Kbps MP3 | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
Advanced Audio Coding | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | |||||
AIFF | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
Apple Lossless Audio | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | |||||||
Digital Theater Systems Audio | audio (*) | audio (*) | audio | audio (*) | audio (*) | audio (*) | audio (*) | |||||
Flac | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
h.264 | audio (*) | |||||||||||
h.264 720P | audio (*) | |||||||||||
Kbps MP3 | audio (*) | |||||||||||
MIDI | audio (*) | audio (*) | audio (*) | |||||||||
MP3 | audio (*) | audio (*) | ||||||||||
MP3 (other) | audio (*) | |||||||||||
MPEG4 | audio (*) | |||||||||||
MPEG Audio | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
Ogg Vorbis | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
Real Audio | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | |||||||
Segment Data | audio (*) | audio (*) | ||||||||||
Shorten | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
WAVE | audio (*) | audio (*) | audio | audio (*) | audio (*) | audio (*) | audio (*) | |||||
WebA | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | ||||||
Whisper ASR JSON | audio | |||||||||||
Windows Media Audio | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) | audio (*) |
Derivatives for Texts Items
If your source file is format: | . . . then we will try to derive the following formats: | ||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Additional Text PDF | Book Cover | chOCR | Cloth Cover Detection Log | Content Addressable aRchive Log | Contents | DjVuTXT | Djvu XML | Grayscale PDF | HEVC | hOCR | JPEG from Video | LCP Encrypted EPUB | LCP Encrypted PDF | List Inversion Log | Metadata Log | Microfiche Partition Log | Microfilm Issues Log | OCR Page Index | OCR Search Text | Parsed GZ | PersonalArchiveLog | RePublisher Corrections Processing Log | RePublisher Final Processing Log | RePublisher Foldouts Processing Log | RePublisher Initial Processing Log | RePublisher Reprocessing Log | Scandata | Single Page Processed JP2 ZIP | Single Page Processed TIFF ZIP | Text PDF | Title Page Detection Log | Web Video Text Tracks | |
Abbyy GZ | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Additional Text PDF | texts (*) | ||||||||||||||||||||||||||||||||
Book Metadata | texts | ||||||||||||||||||||||||||||||||
chOCR | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Cinepack | texts (*) | texts | |||||||||||||||||||||||||||||||
Comic Book RAR | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Comic Book TAR | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Comic Book ZIP | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Content Addressable aRchive Trigger | texts | ||||||||||||||||||||||||||||||||
DjVu | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Djvu XML | texts | texts (*) | texts (*) | texts | texts | texts | texts (*) | texts (*) | |||||||||||||||||||||||||
EPUB | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | ||||||||||||||
Extra Metadata JSON | texts | ||||||||||||||||||||||||||||||||
Generic Raw Book Tar | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Generic Raw Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
hOCR | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Image-Only PDF Metadata JSON | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Image Container PDF | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Intermediate ASR JSON | texts | ||||||||||||||||||||||||||||||||
Micro Cards Data JSON | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Microfiche Partition JSON | texts | ||||||||||||||||||||||||||||||||
Microfilm Issues JSON | texts | ||||||||||||||||||||||||||||||||
Motion JPEG | texts (*) | texts | |||||||||||||||||||||||||||||||
MusicBrainz Metadata | texts | ||||||||||||||||||||||||||||||||
OCLC xISBN JSON | texts | ||||||||||||||||||||||||||||||||
OCLC xISBN ZIP | texts | ||||||||||||||||||||||||||||||||
OpenDocument Presentation | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | ||||||||||||||
OpenDocument Text Document | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | ||||||||||||||
Original DjVu | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||||||||||||
Page Numbers JSON | texts (*) | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | ||||||||||||||
PersonalArchive | texts | ||||||||||||||||||||||||||||||||
Powerpoint | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | ||||||||||||||
Raw BDRC Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Raw China Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | |||||||||||||
Raw Cornell Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Raw Michigan Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | |||||||||||||
Raw NIH Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Raw Yale Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Raw Yale Medical Book Zip | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Remediated EPUB | texts (*) | ||||||||||||||||||||||||||||||||
Rich Text Format | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | ||||||||||||||
Scandata | texts (*) | texts (*) | texts (*) | texts (*) | |||||||||||||||||||||||||||||
Scribe Scandata ZIP | texts (*) | ||||||||||||||||||||||||||||||||
Simple List JSONL | texts | ||||||||||||||||||||||||||||||||
Single Page FIXME JPEG ZIP | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Original JP2 Tar | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | |||||||||||||
Single Page Original JP2 ZIP | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Original TIFF ZIP | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | texts | ||||||||||||
Single Page Processed JP2 Tar | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Processed JP2 ZIP | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Processed JPEG Tar | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Processed JPEG ZIP | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Processed TIFF ZIP | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Raw JP2 Tar | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Raw JP2 ZIP | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Raw JPEG Tar | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Single Page Raw JPEG ZIP | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Text PDF | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Text PDF Metadata JSON | texts | texts (*) | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
Texts Transclusion Contents | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | ||||||||||||||
TTScribe Preimage ZIP | texts (*) | ||||||||||||||||||||||||||||||||
TTScribe RAW Preimage ZIP | texts (*) | ||||||||||||||||||||||||||||||||
Word Document | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts | texts (*) | texts |
Derivatives for Other Items
If your source file is format: | . . . then we will try to derive the following formats: | |||||
---|---|---|---|---|---|---|
ARC CDX Index | ARC Transformation GZ | HeritrixCrawlLog | JPEG Thumb | WARC CDX Index | WARC Transformation GZ | |
Animated GIF | question | |||||
Bitmap Image | question | |||||
HEIF | question | |||||
HeritrixJob | question | |||||
Internet Archive ARC | question | question (*) | ||||
Internet Archive ARC GZ | question | question (*) | ||||
JPEG | question | |||||
JPEG 2000 | question | |||||
PNG | question | |||||
Web ARChive | question | question (*) | ||||
Web ARChive GZ | question | question (*) | ||||
Web ARChive ZST | question |
Additional info on audio/video derivatives
- h.264 / mp4 derivatives:
  video (h.264): 640x480 (width scaled if wider/narrower than 4x3 ratio), ~768 kb/sec (maxrate)
  audio (AAC): stereo, 128 kb/sec, 44.1 kHz sampling
- mp3 derivatives:
  MP3 format, stereo, lame compressor "-standard" parameter/setting, which targets ~140 kb/sec
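As a rough illustration of the h.264 figures above, the following Python sketch estimates the frame size and an approximate upper bound on file size for an h.264 derivative. It is not the deriver's actual code: the function names are hypothetical, and rounding the width up to an even number is an assumption (many h.264 encoders require even dimensions), not something stated here.

```python
def derivative_frame_size(src_width: int, src_height: int,
                          target_height: int = 480) -> tuple[int, int]:
    """Scale a source frame to the target height, keeping its aspect ratio.

    A 4:3 source yields 640x480; wider or narrower sources get a
    different width. Rounding the width to an even number is an
    assumption made for this sketch.
    """
    if src_width <= 0 or src_height <= 0:
        raise ValueError("source dimensions must be positive")
    width = round(src_width * target_height / src_height)
    width += width % 2  # bump odd widths up to the next even number
    return width, target_height


def estimated_size_mb(duration_seconds: float,
                      video_kbps: float = 768,
                      audio_kbps: float = 128) -> float:
    """Approximate upper bound on file size in (decimal) megabytes.

    768 kb/sec is a maximum video rate, so real files are often smaller.
    Container overhead is ignored.
    """
    total_kilobits = (video_kbps + audio_kbps) * duration_seconds
    return total_kilobits / 8 / 1000  # kilobits -> kilobytes -> megabytes


if __name__ == "__main__":
    print(derivative_frame_size(640, 480))
    print(derivative_frame_size(1920, 1080))
    print(estimated_size_mb(60))
```

Under these assumptions, a 16:9 source is scaled to a width a little over 850 pixels at 480 lines, and one minute of video comes to at most roughly 7 MB.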
Advanced techniques and help
To remove and/or prevent a particular audio or video derivative format
If, for some reason, you would prefer that your item not create one or more of the audio or video derivative formats (mp3, mp4), you can upload to your item a special "rules" file named "_rules.conf". That file should be a plain text file, with one line for each format to disallow.
So, for example, to make (all videos in) a video item *not* create our
"h.264"/mp4 formats, you would upload "_rules.conf"
containing the following:
h.264
To make all audio files in an item *not* create our
mp3 formats, you would upload "_rules.conf"
containing the following:
MP3
To prohibit *just* video and audio derived formats that are "lossy"
(e.g. mp3, mp4),
you would upload "_rules.conf" containing the following:
CAT.lossy
NOTE: We now only allow prohibiting lossy derivatives,
so CAT.ALL is now the same as CAT.lossy.
To prohibit *all* derived formats,
you would upload "_rules.conf" containing the following:
CAT.ALL
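The "_rules.conf" examples above can also be generated with a short script. This is a minimal sketch, assuming the file is plain UTF-8 text with one entry per line; the helper names `build_rules_conf` and `write_rules_conf` are hypothetical, and uploading the resulting file to your item is not shown.

```python
from pathlib import Path
from typing import Iterable


def build_rules_conf(disallowed: Iterable[str]) -> str:
    """Return the text of a _rules.conf file, one entry per line.

    Surrounding whitespace is trimmed, blank entries are skipped and
    duplicates are dropped. Whitespace inside an entry is left alone.
    """
    lines: list[str] = []
    for entry in disallowed:
        name = entry.strip()
        if name and name not in lines:
            lines.append(name)
    if not lines:
        return ""
    return "\n".join(lines) + "\n"


def write_rules_conf(directory: str, disallowed: Iterable[str]) -> Path:
    """Write _rules.conf into the given directory and return its path."""
    path = Path(directory) / "_rules.conf"
    path.write_text(build_rules_conf(disallowed), encoding="utf-8")
    return path


if __name__ == "__main__":
    # Disallow the h.264 and MP3 derivatives for an item.
    print(build_rules_conf(["h.264", "MP3"]), end="")
```

To prohibit lossy derivatives instead, pass `["CAT.lossy"]`; the written file is then uploaded to the item like any other file.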
To make any previously created derivatives "disappear" after adding a "_rules.conf" file to an item or updating an existing one, use the "Item Manager" link to submit a "derive" task (which will remove the undesired derivatives). To get there, find the "Edit Item" link in the upper right of your item while you are logged in, click the "change the information" link, click the "Item Manager" link near the top of that page, then hit the "derive" button.