]> Dogcows Code - chaz/tar/blob - doc/tar.texi
Fix typo
[chaz/tar] / doc / tar.texi
1 \input texinfo @c -*-texinfo-*-
2 @comment %**start of header
3 @setfilename tar.info
4 @include version.texi
5 @settitle GNU tar @value{VERSION}
6 @setchapternewpage odd
7
8 @finalout
9
10 @smallbook
11 @c %**end of header
12
13 @include rendition.texi
14 @include value.texi
15
16 @c Put everything in one index (arbitrarily chosen to be the concept index).
17 @syncodeindex fn cp
18 @syncodeindex ky cp
19 @syncodeindex pg cp
20 @syncodeindex vr cp
21
22 @defindex op
23
24 @copying
25
26 This manual is for @acronym{GNU} @command{tar} (version
27 @value{VERSION}, @value{UPDATED}), which creates and extracts files
28 from archives.
29
30 Copyright @copyright{} 1992, 1994, 1995, 1996, 1997, 1999, 2000, 2001,
31 2003, 2004 Free Software Foundation, Inc.
32
33 @quotation
34 Permission is granted to copy, distribute and/or modify this document
35 under the terms of the GNU Free Documentation License, Version 1.1 or
36 any later version published by the Free Software Foundation; with the
37 Invariant Sections being "GNU General Public License", with the
38 Front-Cover Texts being ``A GNU Manual,'' and with the Back-Cover Texts
39 as in (a) below. A copy of the license is included in the section
40 entitled "GNU Free Documentation License".
41
42 (a) The FSF's Back-Cover Text is: ``You are free to copy and modify
43 this GNU Manual. Buying copies from GNU Press supports the FSF in
44 developing GNU and promoting software freedom.''
45 @end quotation
46 @end copying
47
48 @dircategory Archiving
49 @direntry
50 * Tar: (tar). Making tape (or disk) archives.
51 @end direntry
52
53 @dircategory Individual utilities
54 @direntry
55 * tar: (tar)tar invocation. Invoking @GNUTAR{}.
56 @end direntry
57
58 @shorttitlepage @acronym{GNU} @command{tar}
59
60 @titlepage
61 @title @acronym{GNU} tar: an archiver tool
62 @subtitle @value{RENDITION} @value{VERSION}, @value{UPDATED}
63 @author John Gilmore, Jay Fenlason et al.
64
65 @page
66 @vskip 0pt plus 1filll
67 @insertcopying
68 @end titlepage
69
70 @ifnottex
71 @node Top
72 @top @acronym{GNU} tar: an archiver tool
73
74 @insertcopying
75
76 @cindex file archival
77 @cindex archiving files
78
79 The first part of this master menu lists the major nodes in this Info
80 document. The rest of the menu lists all the lower level nodes.
81 @end ifnottex
82
83 @c The master menu, created with texinfo-master-menu, goes here.
84 @c (However, getdate.texi's menu is interpolated by hand.)
85
86 @menu
87 * Introduction::
88 * Tutorial::
89 * tar invocation::
90 * operations::
91 * Backups::
92 * Choosing::
93 * Date input formats::
94 * Formats::
95 * Media::
96
97 Appendices
98
99 * Changes::
100 * Genfile::
101 * Snapshot Files::
102 * Free Software Needs Free Documentation::
103 * Copying This Manual::
104 * Index of Command Line Options::
105 * Index::
106
107 @detailmenu
108 --- The Detailed Node Listing ---
109
110 Introduction
111
112 * Book Contents:: What this Book Contains
113 * Definitions:: Some Definitions
114 * What tar Does:: What @command{tar} Does
115 * Naming tar Archives:: How @command{tar} Archives are Named
116 * Authors:: @GNUTAR{} Authors
117 * Reports:: Reporting bugs or suggestions
118
119 Tutorial Introduction to @command{tar}
120
121 * assumptions::
122 * stylistic conventions::
123 * basic tar options:: Basic @command{tar} Operations and Options
124 * frequent operations::
125 * Two Frequent Options::
126 * create:: How to Create Archives
127 * list:: How to List Archives
128 * extract:: How to Extract Members from an Archive
129 * going further::
130
131 Two Frequently Used Options
132
133 * file tutorial::
134 * verbose tutorial::
135 * help tutorial::
136
137 How to Create Archives
138
139 * prepare for examples::
140 * Creating the archive::
141 * create verbose::
142 * short create::
143 * create dir::
144
145 How to List Archives
146
147 * list dir::
148
149 How to Extract Members from an Archive
150
151 * extracting archives::
152 * extracting files::
153 * extract dir::
154 * failing commands::
155
156 Invoking @GNUTAR{}
157
158 * Synopsis::
159 * using tar options::
160 * Styles::
161 * All Options::
162 * help::
163 * defaults::
164 * verbose::
165 * interactive::
166
167 The Three Option Styles
168
169 * Mnemonic Options:: Mnemonic Option Style
170 * Short Options:: Short Option Style
171 * Old Options:: Old Option Style
172 * Mixing:: Mixing Option Styles
173
174 All @command{tar} Options
175
176 * Operation Summary::
177 * Option Summary::
178 * Short Option Summary::
179
180 @GNUTAR{} Operations
181
182 * Basic tar::
183 * Advanced tar::
184 * create options::
185 * extract options::
186 * backup::
187 * Applications::
188 * looking ahead::
189
190 Advanced @GNUTAR{} Operations
191
192 * Operations::
193 * append::
194 * update::
195 * concatenate::
196 * delete::
197 * compare::
198
199 How to Add Files to Existing Archives: @option{--append}
200
201 * appending files:: Appending Files to an Archive
202 * multiple::
203
204 Updating an Archive
205
206 * how to update::
207
208 Options Used by @option{--create}
209
210 * Ignore Failed Read::
211
212 Options Used by @option{--extract}
213
214 * Reading:: Options to Help Read Archives
215 * Writing:: Changing How @command{tar} Writes Files
216 * Scarce:: Coping with Scarce Resources
217
218 Options to Help Read Archives
219
220 * read full records::
221 * Ignore Zeros::
222
223 Changing How @command{tar} Writes Files
224
225 * Dealing with Old Files::
226 * Overwrite Old Files::
227 * Keep Old Files::
228 * Keep Newer Files::
229 * Unlink First::
230 * Recursive Unlink::
231 * Data Modification Times::
232 * Setting Access Permissions::
233 * Writing to Standard Output::
234 * remove files::
235
236 Coping with Scarce Resources
237
238 * Starting File::
239 * Same Order::
240
241 Performing Backups and Restoring Files
242
243 * Full Dumps:: Using @command{tar} to Perform Full Dumps
244 * Incremental Dumps:: Using @command{tar} to Perform Incremental Dumps
245 * Backup Levels:: Levels of Backups
246 * Backup Parameters:: Setting Parameters for Backups and Restoration
247 * Scripted Backups:: Using the Backup Scripts
248 * Scripted Restoration:: Using the Restore Script
249
250 Setting Parameters for Backups and Restoration
251
252 * General-Purpose Variables::
253 * Magnetic Tape Control::
254 * User Hooks::
255 * backup-specs example:: An Example Text of @file{Backup-specs}
256
257 Choosing Files and Names for @command{tar}
258
259 * file:: Choosing the Archive's Name
260 * Selecting Archive Members::
261 * files:: Reading Names from a File
262 * exclude:: Excluding Some Files
263 * Wildcards::
264 * after:: Operating Only on New Files
265 * recurse:: Descending into Directories
266 * one:: Crossing File System Boundaries
267
268 Reading Names from a File
269
270 * nul::
271
272 Excluding Some Files
273
274 * controlling pattern-matching with exclude::
275 * problems with exclude::
276
277 Crossing File System Boundaries
278
279 * directory:: Changing Directory
280 * absolute:: Absolute File Names
281
282 Date input formats
283
284 * General date syntax:: Common rules.
285 * Calendar date items:: 19 Dec 1994.
286 * Time of day items:: 9:20pm.
287 * Time zone items:: @sc{est}, @sc{pdt}, @sc{gmt}, ...
288 * Day of week items:: Monday and others.
289 * Relative items in date strings:: next tuesday, 2 years ago.
290 * Pure numbers in date strings:: 19931219, 1440.
291 * Seconds since the Epoch:: @@1078100502.
292 * Authors of get_date:: Bellovin, Eggert, Salz, Berets, et al.
293
294 Controlling the Archive Format
295
296 * Portability:: Making @command{tar} Archives More Portable
297 * Compression:: Using Less Space through Compression
298 * Attributes:: Handling File Attributes
299 * Standard:: The Standard Format
300 * Extensions:: @acronym{GNU} Extensions to the Archive Format
301 * cpio:: Comparison of @command{tar} and @command{cpio}
302
303 Making @command{tar} Archives More Portable
304
305 * Portable Names:: Portable Names
306 * dereference:: Symbolic Links
307 * old:: Old V7 Archives
308 * posix:: @acronym{POSIX} archives
309 * Checksumming:: Checksumming Problems
310 * Large or Negative Values:: Large files, negative time stamps, etc.
311
312 Using Less Space through Compression
313
314 * gzip:: Creating and Reading Compressed Archives
315 * sparse:: Archiving Sparse Files
316
317 Tapes and Other Archive Media
318
319 * Device:: Device selection and switching
320 * Remote Tape Server::
321 * Common Problems and Solutions::
322 * Blocking:: Blocking
323 * Many:: Many archives on one tape
324 * Using Multiple Tapes:: Using Multiple Tapes
325 * label:: Including a Label in the Archive
326 * verify::
327 * Write Protection::
328
329 Blocking
330
331 * Format Variations:: Format Variations
332 * Blocking Factor:: The Blocking Factor of an Archive
333
334 Many Archives on One Tape
335
336 * Tape Positioning:: Tape Positions and Tape Marks
337 * mt:: The @command{mt} Utility
338
339 Using Multiple Tapes
340
341 * Multi-Volume Archives:: Archives Longer than One Tape or Disk
342 * Tape Files:: Tape Files
343 * Tarcat:: Concatenate Volumes into a Single Archive
344
345 GNU tar internals and development
346
347 * Genfile::
348 * Snapshot Files::
349
350 Copying This Manual
351
352 * Free Software Needs Free Documentation::
353 * GNU Free Documentation License:: License for copying this manual
354
355 @end detailmenu
356 @end menu
357
358 @node Introduction
359 @chapter Introduction
360
361 @GNUTAR{} creates
362 and manipulates @dfn{archives} which are actually collections of
363 many other files; the program provides users with an organized and
364 systematic method for controlling a large amount of data.
365 The name ``tar'' originally came from the phrase ``Tape ARchive'', but
366 archives need not (and these days, typically do not) reside on tapes.
367
368 @menu
369 * Book Contents:: What this Book Contains
370 * Definitions:: Some Definitions
371 * What tar Does:: What @command{tar} Does
372 * Naming tar Archives:: How @command{tar} Archives are Named
373 * Authors:: @GNUTAR{} Authors
374 * Reports:: Reporting bugs or suggestions
375 @end menu
376
377 @node Book Contents
378 @section What this Book Contains
379
380 The first part of this chapter introduces you to various terms that will
381 recur throughout the book. It also tells you who has worked on @GNUTAR{}
382 and its documentation, and where you should send bug reports
383 or comments.
384
385 The second chapter is a tutorial (@pxref{Tutorial}) which provides a
386 gentle introduction for people who are new to using @command{tar}. It is
387 meant to be self contained, not requiring any reading from subsequent
388 chapters to make sense. It moves from topic to topic in a logical,
389 progressive order, building on information already explained.
390
391 Although the tutorial is paced and structured to allow beginners to
392 learn how to use @command{tar}, it is not intended solely for beginners.
393 The tutorial explains how to use the three most frequently used
394 operations (@samp{create}, @samp{list}, and @samp{extract}) as well as
395 two frequently used options (@samp{file} and @samp{verbose}). The other
396 chapters do not refer to the tutorial frequently; however, if a section
397 discusses something which is a complex variant of a basic concept, there
398 may be a cross reference to that basic concept. (The entire book,
399 including the tutorial, assumes that the reader understands some basic
400 concepts of using a Unix-type operating system; @pxref{Tutorial}.)
401
402 The third chapter presents the remaining five operations, and
403 information about using @command{tar} options and option syntax.
404
405 @FIXME{this sounds more like a @acronym{GNU} Project Manuals Concept [tm] more
406 than the reality. should think about whether this makes sense to say
407 here, or not.} The other chapters are meant to be used as a
408 reference. Each chapter presents everything that needs to be said
409 about a specific topic.
410
411 One of the chapters (@pxref{Date input formats}) exists in its
412 entirety in other @acronym{GNU} manuals, and is mostly self-contained.
413 In addition, one section of this manual (@pxref{Standard}) contains a
414 big quote which is taken directly from @command{tar} sources.
415
416 In general, we give both long and short (abbreviated) option names
417 at least once in each section where the relevant option is covered, so
418 that novice readers will become familiar with both styles. (A few
419 options have no short versions, and the relevant sections will
420 indicate this.)
421
422 @node Definitions
423 @section Some Definitions
424
425 @cindex archive
426 @cindex tar archive
427 The @command{tar} program is used to create and manipulate @command{tar}
428 archives. An @dfn{archive} is a single file which contains the contents
429 of many files, while still identifying the names of the files, their
430 owner(s), and so forth. (In addition, archives record access
431 permissions, user and group, size in bytes, and data modification time.
432 Some archives also record the file names in each archived directory, as
433 well as other file and directory information.) You can use @command{tar}
434 to @dfn{create} a new archive in a specified directory.
435
436 @cindex member
437 @cindex archive member
438 @cindex file name
439 @cindex member name
440 The files inside an archive are called @dfn{members}. Within this
441 manual, we use the term @dfn{file} to refer only to files accessible in
442 the normal ways (by @command{ls}, @command{cat}, and so forth), and the term
443 @dfn{member} to refer only to the members of an archive. Similarly, a
444 @dfn{file name} is the name of a file, as it resides in the file system,
445 and a @dfn{member name} is the name of an archive member within the
446 archive.
447
448 @cindex extraction
449 @cindex unpacking
450 The term @dfn{extraction} refers to the process of copying an archive
451 member (or multiple members) into a file in the file system. Extracting
452 all the members of an archive is often called @dfn{extracting the
453 archive}. The term @dfn{unpack} can also be used to refer to the
454 extraction of many or all the members of an archive. Extracting an
455 archive does not destroy the archive's structure, just as creating an
456 archive does not destroy the copies of the files that exist outside of
457 the archive. You may also @dfn{list} the members in a given archive
458 (this is often thought of as ``printing'' them to the standard output,
459 or the command line), or @dfn{append} members to a pre-existing archive.
460 All of these operations can be performed using @command{tar}.
461
462 @node What tar Does
463 @section What @command{tar} Does
464
465 @cindex tar
466 The @command{tar} program provides the ability to create @command{tar}
467 archives, as well as various other kinds of manipulation. For example,
468 you can use @command{tar} on previously created archives to extract files,
469 to store additional files, or to update or list files which were already
470 stored.
471
472 Initially, @command{tar} archives were used to store files conveniently on
473 magnetic tape. The name @command{tar} comes from this use; it stands for
474 @code{t}ape @code{ar}chiver. Despite the utility's name, @command{tar} can
475 direct its output to available devices, files, or other programs (using
476 pipes). @command{tar} may even access remote devices or files (as archives).
477
478 @FIXME{the following table entries need a bit of work..}
479
480 You can use @command{tar} archives in many ways. We want to stress a few
481 of them: storage, backup, and transportation.
482
483 @table @asis
484 @item Storage
485 Often, @command{tar} archives are used to store related files for
486 convenient file transfer over a network. For example, the
487 @acronym{GNU} Project distributes its software bundled into
488 @command{tar} archives, so that all the files relating to a particular
489 program (or set of related programs) can be transferred as a single
490 unit.
491
492 A magnetic tape can store several files in sequence. However, the tape
493 has no names for these files; it only knows their relative position on
494 the tape. One way to store several files on one tape and retain their
495 names is by creating a @command{tar} archive. Even when the basic transfer
496 mechanism can keep track of names, as FTP can, the nuisance of handling
497 multiple files, directories, and multiple links makes @command{tar}
498 archives useful.
499
500 Archive files are also used for long-term storage. You can think of
501 this as transportation from the present into the future. (It is a
502 science-fiction idiom that you can move through time as well as in
503 space; the idea here is that @command{tar} can be used to move archives in
504 all dimensions, even time!)
505
506 @item Backup
507 Because the archive created by @command{tar} is capable of preserving
508 file information and directory structure, @command{tar} is commonly
509 used for performing full and incremental backups of disks. A backup
510 puts a collection of files (possibly pertaining to many users and
511 projects) together on a disk or a tape. This guards against
512 accidental destruction of the information in those files.
513 @GNUTAR{} has special features that allow it to be
514 used to make incremental and full dumps of all the files in a
515 file system.
516
517 @item Transportation
518 You can create an archive on one system, transfer it to another system,
519 and extract the contents there. This allows you to transport a group of
520 files from one system to another.
521 @end table
522
523 @node Naming tar Archives
524 @section How @command{tar} Archives are Named
525
526 Conventionally, @command{tar} archives are given names ending with
527 @samp{.tar}. This is not necessary for @command{tar} to operate properly,
528 but this manual follows that convention in order to accustom readers to
529 it and to make examples more clear.
530
531 @cindex tar file
532 @cindex entry
533 @cindex tar entry
534 Often, people refer to @command{tar} archives as ``@command{tar} files,'' and
535 archive members as ``files'' or ``entries''. For people familiar with
536 the operation of @command{tar}, this causes no difficulty. However, in
537 this manual, we consistently refer to ``archives'' and ``archive
538 members'' to make learning to use @command{tar} easier for novice users.
539
540 @node Authors
541 @section @GNUTAR{} Authors
542
543 @GNUTAR{} was originally written by John Gilmore,
544 and modified by many people. The @acronym{GNU} enhancements were
545 written by Jay Fenlason, then Joy Kendall, and the whole package has
546 been further maintained by Thomas Bushnell, n/BSG, Fran@,{c}ois
547 Pinard, Paul Eggert, and finally Sergey Poznyakoff with the help of
548 numerous and kind users.
549
550 We wish to stress that @command{tar} is a collective work, and owes much to
551 all those people who reported problems, offered solutions and other
552 insights, or shared their thoughts and suggestions. An impressive, yet
553 partial list of those contributors can be found in the @file{THANKS}
554 file from the @GNUTAR{} distribution.
555
556 @FIXME{i want all of these names mentioned, Absolutely. BUT, i'm not
557 sure i want to spell out the history in this detail, at least not for
558 the printed book. i'm just not sure it needs to be said this way.
559 i'll think about it.}
560
561 @FIXME{History is more important, and surely more interesting, than
562 actual names. Quoting names without history would be meaningless. FP}
563
564 Jay Fenlason put together a draft of a @GNUTAR{}
565 manual, borrowing notes from the original man page from John Gilmore.
566 This was withdrawn in version 1.11. Thomas Bushnell, n/BSG and Amy
567 Gorin worked on a tutorial and manual for @GNUTAR{}.
568 Fran@,{c}ois Pinard put version 1.11.8 of the manual together by
569 taking information from all these sources and merging them. Melissa
570 Weisshaus finally edited and redesigned the book to create version
571 1.12. @FIXME{update version number as necessary; i'm being
572 optimistic!} @FIXME{Someone [maybe karl berry? maybe bob chassell?
573 maybe melissa? maybe julie sussman?] needs to properly index the
574 thing.}
575
576 For version 1.12, Daniel Hagerty contributed a great deal of technical
577 consulting. In particular, he is the primary author of @ref{Backups}.
578
579 In July, 2003 @GNUTAR{} was put on CVS at savannah.gnu.org
580 (see @url{http://savannah.gnu.org/projects/tar}), and
581 active development and maintenance work has started
582 again. Currently @GNUTAR{} is being maintained by Paul Eggert, Sergey
583 Poznyakoff and Jeff Bailey.
584
585 Support for @acronym{POSIX} archives was added by Sergey Poznyakoff.
586
587 @node Reports
588 @section Reporting bugs or suggestions
589
590 @cindex bug reports
591 @cindex reporting bugs
592 If you find problems or have suggestions about this program or manual,
593 please report them to @file{bug-tar@@gnu.org}.
594
595 When reporting a bug, please be sure to include as much detail as
596 possible, in order to reproduce it. @FIXME{Be more specific, I'd
597 like to make this node as detailed as 'Bug reporting' node in Emacs
598 manual}.
599
600 @node Tutorial
601 @chapter Tutorial Introduction to @command{tar}
602
603 This chapter guides you through some basic examples of three @command{tar}
604 operations: @option{--create}, @option{--list}, and @option{--extract}. If
605 you already know how to use some other version of @command{tar}, then you
606 may not need to read this chapter. This chapter omits most complicated
607 details about how @command{tar} works.
608
609 @menu
610 * assumptions::
611 * stylistic conventions::
612 * basic tar options:: Basic @command{tar} Operations and Options
613 * frequent operations::
614 * Two Frequent Options::
615 * create:: How to Create Archives
616 * list:: How to List Archives
617 * extract:: How to Extract Members from an Archive
618 * going further::
619 @end menu
620
621 @node assumptions
622 @section Assumptions this Tutorial Makes
623
624 This chapter is paced to allow beginners to learn about @command{tar}
625 slowly. At the same time, we will try to cover all the basic aspects of
626 these three operations. In order to accomplish both of these tasks, we
627 have made certain assumptions about your knowledge before reading this
628 manual, and the hardware you will be using:
629
630 @itemize @bullet
631 @item
632 Before you start to work through this tutorial, you should understand
633 what the terms ``archive'' and ``archive member'' mean
634 (@pxref{Definitions}). In addition, you should understand something
635 about how Unix-type operating systems work, and you should know how to
636 use some basic utilities. For example, you should know how to create,
637 list, copy, rename, edit, and delete files and directories; how to
638 change between directories; and how to figure out where you are in the
639 file system. You should have some basic understanding of directory
640 structure and how files are named according to which directory they are
641 in. You should understand concepts such as standard output and standard
642 input, what various definitions of the term ``argument'' mean, and the
643 differences between relative and absolute path names. @FIXME{and what
644 else?}
645
646 @item
647 This manual assumes that you are working from your own home directory
648 (unless we state otherwise). In this tutorial, you will create a
649 directory to practice @command{tar} commands in. When we show path names,
650 we will assume that those paths are relative to your home directory.
651 For example, my home directory path is @file{/home/fsf/melissa}. All of
652 my examples are in a subdirectory of the directory named by that path
653 name; the subdirectory is called @file{practice}.
654
655 @item
656 In general, we show examples of archives which exist on (or can be
657 written to, or worked with from) a directory on a hard disk. In most
658 cases, you could write those archives to, or work with them on any other
659 device, such as a tape drive. However, some of the later examples in
660 the tutorial and next chapter will not work on tape drives.
661 Additionally, working with tapes is much more complicated than working
662 with hard disks. For these reasons, the tutorial does not cover working
663 with tape drives. @xref{Media}, for complete information on using
664 @command{tar} archives with tape drives.
665
666 @FIXME{this is a cop out. need to add some simple tape drive info.}
667 @end itemize
668
669 @node stylistic conventions
670 @section Stylistic Conventions
671
672 In the examples, @samp{$} represents a typical shell prompt. It
673 precedes lines you should type; to make this more clear, those lines are
674 shown in @kbd{this font}, as opposed to lines which represent the
675 computer's response; those lines are shown in @code{this font}, or
676 sometimes @samp{like this}.
677
678 @c When we have lines which are too long to be
679 @c displayed in any other way, we will show them like this:
680
681 @node basic tar options
682 @section Basic @command{tar} Operations and Options
683
684 @command{tar} can take a wide variety of arguments which specify and define
685 the actions it will have on the particular set of files or the archive.
686 The main types of arguments to @command{tar} fall into one of two classes:
687 operations, and options.
688
689 Some arguments fall into a class called @dfn{operations}; exactly one of
690 these is both allowed and required for any instance of using @command{tar};
691 you may @emph{not} specify more than one. People sometimes speak of
692 @dfn{operating modes}. You are in a particular operating mode when you
693 have specified the operation which specifies it; there are eight
694 operations in total, and thus there are eight operating modes.
695
696 The other arguments fall into the class known as @dfn{options}. You are
697 not required to specify any options, and you are allowed to specify more
698 than one at a time (depending on the way you are using @command{tar} at
699 that time). Some options are used so frequently, and are so useful for
700 helping you type commands more carefully that they are effectively
701 ``required''. We will discuss them in this chapter.
702
703 You can write most of the @command{tar} operations and options in any
704 of three forms: long (mnemonic) form, short form, and old style. Some
705 of the operations and options have no short or ``old'' forms; however,
706 the operations and options which we will cover in this tutorial have
707 corresponding abbreviations. @FIXME{make sure this is still the case,
708 at the end}We will indicate those abbreviations appropriately to get
709 you used to seeing them. (Note that the ``old style'' option forms
710 exist in @GNUTAR{} for compatibility with Unix
711 @command{tar}. In this book we present a full discussion of this way
712 of writing options and operations (@pxref{Old Options}), and we discuss
713 the other two styles of writing options (@xref{Mnemonic Options}, and
714 @pxref{Short Options}).
715
716 In the examples and in the text of this tutorial, we usually use the
717 long forms of operations and options; but the ``short'' forms produce
718 the same result and can make typing long @command{tar} commands easier.
719 For example, instead of typing
720
721 @smallexample
722 @kbd{tar --create --verbose --file=afiles.tar apple angst aspic}
723 @end smallexample
724
725 @noindent
726 you can type
727 @smallexample
728 @kbd{tar -c -v -f afiles.tar apple angst aspic}
729 @end smallexample
730
731 @noindent
732 or even
733 @smallexample
734 @kbd{tar -cvf afiles.tar apple angst aspic}
735 @end smallexample
736
737 @noindent
738 For more information on option syntax, see @ref{Advanced tar}. In
739 discussions in the text, when we name an option by its long form, we
740 also give the corresponding short option in parentheses.
741
742 The term, ``option'', can be confusing at times, since ``operations''
743 are often lumped in with the actual, @emph{optional} ``options'' in certain
744 general class statements. For example, we just talked about ``short and
745 long forms of options and operations''. However, experienced @command{tar}
746 users often refer to these by shorthand terms such as, ``short and long
747 options''. This term assumes that the ``operations'' are included, also.
748 Context will help you determine which definition of ``options'' to use.
749
750 Similarly, the term ``command'' can be confusing, as it is often used in
751 two different ways. People sometimes refer to @command{tar} ``commands''.
752 A @command{tar} @dfn{command} is the entire command line of user input
753 which tells @command{tar} what to do --- including the operation, options,
754 and any arguments (file names, pipes, other commands, etc). However,
755 you will also sometimes hear the term ``the @command{tar} command''. When
756 the word ``command'' is used specifically like this, a person is usually
757 referring to the @command{tar} @emph{operation}, not the whole line.
758 Again, use context to figure out which of the meanings the speaker
759 intends.
760
761 @node frequent operations
762 @section The Three Most Frequently Used Operations
763
764 Here are the three most frequently used operations (both short and long
765 forms), as well as a brief description of their meanings. The rest of
766 this chapter will cover how to use these operations in detail. We will
767 present the rest of the operations in the next chapter.
768
769 @table @option
770 @item --create
771 @itemx -c
772 Create a new @command{tar} archive.
773 @item --list
774 @itemx -t
775 List the contents of an archive.
776 @item --extract
777 @itemx -x
778 Extract one or more members from an archive.
779 @end table
780
781 @node Two Frequent Options
782 @section Two Frequently Used Options
783
784 To understand how to run @command{tar} in the three operating modes listed
785 previously, you also need to understand how to use two of the options to
786 @command{tar}: @option{--file} (which takes an archive file as an argument)
787 and @option{--verbose}. (You are usually not @emph{required} to specify
788 either of these options when you run @command{tar}, but they can be very
789 useful in making things more clear and helping you avoid errors.)
790
791 @menu
792 * file tutorial::
793 * verbose tutorial::
794 * help tutorial::
795 @end menu
796
797 @node file tutorial
798 @unnumberedsubsec The @option{--file} Option
799
800 @table @option
801 @opindex file, tutorial
802 @item --file=@var{archive-name}
803 @itemx -f @var{archive-name}
804 Specify the name of an archive file.
805 @end table
806
807 You can specify an argument for the @option{--file=@var{archive-name}} (@option{-f @var{archive-name}}) option whenever you
808 use @command{tar}; this option determines the name of the archive file
809 that @command{tar} will work on.
810
811 @vrindex TAPE
812 If you don't specify this argument, then @command{tar} will examine
813 the environment variable @env{TAPE}. If it is set, its value will be
814 used as the archive name. Otherwise, @command{tar} will use the
815 default archive, determined at the compile time. Usually it is
816 standard output or some physical tape drive attached to your machine
817 (you can verify what the default is by running @kbd{tar
818 --show-defaults}, @pxref{defaults}). If there is no tape drive
819 attached, or the default is not meaningful, then @command{tar} will
820 print an error message. The error message might look roughly like one
821 of the following:
822
823 @smallexample
824 tar: can't open /dev/rmt8 : No such device or address
825 tar: can't open /dev/rsmt0 : I/O error
826 @end smallexample
827
828 @noindent
829 To avoid confusion, we recommend that you always specify an archive file
830 name by using @option{--file=@var{archive-name}} (@option{-f @var{archive-name}}) when writing your @command{tar} commands.
831 For more information on using the @option{--file=@var{archive-name}} (@option{-f @var{archive-name}}) option, see
832 @ref{file}.
833
834 @node verbose tutorial
835 @unnumberedsubsec The @option{--verbose} Option
836
837 @table @option
838 @opindex verbose, introduced
839 @item --verbose
840 @itemx -v
841 Show the files being worked on as @command{tar} is running.
842 @end table
843
844 @option{--verbose} (@option{-v}) shows details about the results of running
845 @command{tar}. This can be especially useful when the results might not be
846 obvious. For example, if you want to see the progress of @command{tar} as
847 it writes files into the archive, you can use the @option{--verbose}
848 option. In the beginning, you may find it useful to use
849 @option{--verbose} at all times; when you are more accustomed to
850 @command{tar}, you will likely want to use it at certain times but not at
851 others. We will use @option{--verbose} at times to help make something
852 clear, and we will give many examples both using and not using
853 @option{--verbose} to show the differences.
854
855 Sometimes, a single instance of @option{--verbose} on the command line
856 will show a full, @samp{ls} style listing of an archive or files,
857 giving sizes, owners, and similar information. @FIXME{Describe the
858 exact output format, e.g., how hard links are displayed.}
859 Other times, @option{--verbose} will only show files or members that the particular
860 operation is operating on at the time. In the latter case, you can
861 use @option{--verbose} twice in a command to get a listing such as that
862 in the former case. For example, instead of saying
863
864 @smallexample
865 @kbd{tar -cvf afiles.tar apple angst aspic}
866 @end smallexample
867
868 @noindent
869 above, you might say
870
871 @smallexample
872 @kbd{tar -cvvf afiles.tar apple angst aspic}
873 @end smallexample
874
875 @noindent
876 This works equally well using short or long forms of options. Using
877 long forms, you would simply write out the mnemonic form of the option
878 twice, like this:
879
880 @smallexample
881 $ @kbd{tar --create --verbose --verbose @dots{}}
882 @end smallexample
883
884 @noindent
885 Note that you must double the hyphens properly each time.
886
887 Later in the tutorial, we will give examples using @w{@option{--verbose
888 --verbose}}.
889
890 @node help tutorial
891 @unnumberedsubsec Getting Help: Using the @option{--help} Option
892
893 @table @option
894 @opindex help
895 @item --help
896
897 The @option{--help} option to @command{tar} prints out a very brief list of
898 all operations and option available for the current version of
899 @command{tar} available on your system.
900 @end table
901
902 @node create
903 @section How to Create Archives
904 @UNREVISED
905
906 @cindex Creation of the archive
907 @cindex Archive, creation of
908 One of the basic operations of @command{tar} is @option{--create} (@option{-c}), which
909 you use to create a @command{tar} archive. We will explain
910 @option{--create} first because, in order to learn about the other
911 operations, you will find it useful to have an archive available to
912 practice on.
913
914 To make this easier, in this section you will first create a directory
915 containing three files. Then, we will show you how to create an
916 @emph{archive} (inside the new directory). Both the directory, and
917 the archive are specifically for you to practice on. The rest of this
918 chapter and the next chapter will show many examples using this
919 directory and the files you will create: some of those files may be
920 other directories and other archives.
921
922 The three files you will archive in this example are called
923 @file{blues}, @file{folk}, and @file{jazz}. The archive is called
924 @file{collection.tar}.
925
926 This section will proceed slowly, detailing how to use @option{--create}
927 in @code{verbose} mode, and showing examples using both short and long
928 forms. In the rest of the tutorial, and in the examples in the next
929 chapter, we will proceed at a slightly quicker pace. This section
930 moves more slowly to allow beginning users to understand how
931 @command{tar} works.
932
933 @menu
934 * prepare for examples::
935 * Creating the archive::
936 * create verbose::
937 * short create::
938 * create dir::
939 @end menu
940
941 @node prepare for examples
942 @subsection Preparing a Practice Directory for Examples
943
944 To follow along with this and future examples, create a new directory
945 called @file{practice} containing files called @file{blues}, @file{folk}
946 and @file{jazz}. The files can contain any information you like:
947 ideally, they should contain information which relates to their names,
948 and be of different lengths. Our examples assume that @file{practice}
949 is a subdirectory of your home directory.
950
951 Now @command{cd} to the directory named @file{practice}; @file{practice}
952 is now your @dfn{working directory}. (@emph{Please note}: Although
953 the full path name of this directory is
954 @file{/@var{homedir}/practice}, in our examples we will refer to
955 this directory as @file{practice}; the @var{homedir} is presumed.
956
957 In general, you should check that the files to be archived exist where
958 you think they do (in the working directory) by running @command{ls}.
959 Because you just created the directory and the files and have changed to
960 that directory, you probably don't need to do that this time.
961
962 It is very important to make sure there isn't already a file in the
963 working directory with the archive name you intend to use (in this case,
964 @samp{collection.tar}), or that you don't care about its contents.
965 Whenever you use @samp{create}, @command{tar} will erase the current
966 contents of the file named by @option{--file=@var{archive-name}} (@option{-f @var{archive-name}}) if it exists. @command{tar}
967 will not tell you if you are about to overwrite an archive unless you
968 specify an option which does this (@pxref{backup}, for the
969 information on how to do so). To add files to an existing archive,
970 you need to use a different option, such as @option{--append} (@option{-r}); see
971 @ref{append} for information on how to do this.
972
973 @node Creating the archive
974 @subsection Creating the Archive
975
976 @opindex create, introduced
977 To place the files @file{blues}, @file{folk}, and @file{jazz} into an
978 archive named @file{collection.tar}, use the following command:
979
980 @smallexample
981 $ @kbd{tar --create --file=collection.tar blues folk jazz}
982 @end smallexample
983
984 The order of the arguments is not very important, @emph{when using long
985 option forms}. You could also say:
986
987 @smallexample
988 $ @kbd{tar blues --create folk --file=collection.tar jazz}
989 @end smallexample
990
991 @noindent
992 However, you can see that this order is harder to understand; this is
993 why we will list the arguments in the order that makes the commands
994 easiest to understand (and we encourage you to do the same when you use
995 @command{tar}, to avoid errors).
996
997 Note that the part of the command which says,
998 @w{@option{--file=collection.tar}} is considered to be @emph{one} argument.
999 If you substituted any other string of characters for
1000 @kbd{collection.tar}, then that string would become the name of the
1001 archive file you create.
1002
1003 The order of the options becomes more important when you begin to use
1004 short forms. With short forms, if you type commands in the wrong order
1005 (even if you type them correctly in all other ways), you may end up with
1006 results you don't expect. For this reason, it is a good idea to get
1007 into the habit of typing options in the order that makes inherent sense.
1008 @xref{short create}, for more information on this.
1009
1010 In this example, you type the command as shown above: @option{--create}
1011 is the operation which creates the new archive
1012 (@file{collection.tar}), and @option{--file} is the option which lets
1013 you give it the name you chose. The files, @file{blues}, @file{folk},
1014 and @file{jazz}, are now members of the archive, @file{collection.tar}
1015 (they are @dfn{file name arguments} to the @option{--create} operation).
1016 @FIXME{xref here to the discussion of file name args?}Now that they are
1017 in the archive, they are called @emph{archive members}, not files.
1018 (@pxref{Definitions,members}).
1019
1020 When you create an archive, you @emph{must} specify which files you
1021 want placed in the archive. If you do not specify any archive
1022 members, @GNUTAR{} will complain.
1023
1024 If you now list the contents of the working directory (@command{ls}), you will
1025 find the archive file listed as well as the files you saw previously:
1026
1027 @smallexample
1028 blues folk jazz collection.tar
1029 @end smallexample
1030
1031 @noindent
1032 Creating the archive @samp{collection.tar} did not destroy the copies of
1033 the files in the directory.
1034
1035 Keep in mind that if you don't indicate an operation, @command{tar} will not
1036 run and will prompt you for one. If you don't name any files, @command{tar}
1037 will complain. You must have write access to the working directory,
1038 or else you will not be able to create an archive in that directory.
1039
1040 @emph{Caution}: Do not attempt to use @option{--create} (@option{-c}) to add files to
1041 an existing archive; it will delete the archive and write a new one.
1042 Use @option{--append} (@option{-r}) instead. @xref{append}.
1043
1044 @node create verbose
1045 @subsection Running @option{--create} with @option{--verbose}
1046
1047 @opindex create, using with @option{--verbose}
1048 @opindex verbose, using with @option{--create}
1049 If you include the @option{--verbose} (@option{-v}) option on the command line,
1050 @command{tar} will list the files it is acting on as it is working. In
1051 verbose mode, the @code{create} example above would appear as:
1052
1053 @smallexample
1054 $ @kbd{tar --create --verbose --file=collection.tar blues folk jazz}
1055 blues
1056 folk
1057 jazz
1058 @end smallexample
1059
1060 This example is just like the example we showed which did not use
1061 @option{--verbose}, except that @command{tar} generated the remaining lines
1062 @iftex
1063 (note the different font styles).
1064 @end iftex
1065 @ifinfo
1066 .
1067 @end ifinfo
1068
1069 In the rest of the examples in this chapter, we will frequently use
1070 @code{verbose} mode so we can show actions or @command{tar} responses that
1071 you would otherwise not see, and which are important for you to
1072 understand.
1073
1074 @node short create
1075 @subsection Short Forms with @samp{create}
1076
1077 As we said before, the @option{--create} (@option{-c}) operation is one of the most
1078 basic uses of @command{tar}, and you will use it countless times.
1079 Eventually, you will probably want to use abbreviated (or ``short'')
1080 forms of options. A full discussion of the three different forms that
1081 options can take appears in @ref{Styles}; for now, here is what the
1082 previous example (including the @option{--verbose} (@option{-v}) option) looks like
1083 using short option forms:
1084
1085 @smallexample
1086 $ @kbd{tar -cvf collection.tar blues folk jazz}
1087 blues
1088 folk
1089 jazz
1090 @end smallexample
1091
1092 @noindent
1093 As you can see, the system responds the same no matter whether you use
1094 long or short option forms.
1095
1096 @FIXME{i don't like how this is worded:} One difference between using
1097 short and long option forms is that, although the exact placement of
1098 arguments following options is no more specific when using short forms,
1099 it is easier to become confused and make a mistake when using short
1100 forms. For example, suppose you attempted the above example in the
1101 following way:
1102
1103 @smallexample
1104 $ @kbd{tar -cfv collection.tar blues folk jazz}
1105 @end smallexample
1106
1107 @noindent
1108 In this case, @command{tar} will make an archive file called @file{v},
1109 containing the files @file{blues}, @file{folk}, and @file{jazz}, because
1110 the @samp{v} is the closest ``file name'' to the @option{-f} option, and
1111 is thus taken to be the chosen archive file name. @command{tar} will try
1112 to add a file called @file{collection.tar} to the @file{v} archive file;
1113 if the file @file{collection.tar} did not already exist, @command{tar} will
1114 report an error indicating that this file does not exist. If the file
1115 @file{collection.tar} does already exist (e.g., from a previous command
1116 you may have run), then @command{tar} will add this file to the archive.
1117 Because the @option{-v} option did not get registered, @command{tar} will not
1118 run under @samp{verbose} mode, and will not report its progress.
1119
1120 The end result is that you may be quite confused about what happened,
1121 and possibly overwrite a file. To illustrate this further, we will show
1122 you how an example we showed previously would look using short forms.
1123
1124 This example,
1125
1126 @smallexample
1127 $ @kbd{tar blues --create folk --file=collection.tar jazz}
1128 @end smallexample
1129
1130 @noindent
1131 is confusing as it is. When shown using short forms, however, it
1132 becomes much more so:
1133
1134 @smallexample
1135 $ @kbd{tar blues -c folk -f collection.tar jazz}
1136 @end smallexample
1137
1138 @noindent
1139 It would be very easy to put the wrong string of characters
1140 immediately following the @option{-f}, but doing that could sacrifice
1141 valuable data.
1142
1143 For this reason, we recommend that you pay very careful attention to
1144 the order of options and placement of file and archive names,
1145 especially when using short option forms. Not having the option name
1146 written out mnemonically can affect how well you remember which option
1147 does what, and therefore where different names have to be placed.
1148
1149 @node create dir
1150 @subsection Archiving Directories
1151
1152 @cindex Archiving Directories
1153 @cindex Directories, Archiving
1154 You can archive a directory by specifying its directory name as a
1155 file name argument to @command{tar}. The files in the directory will be
1156 archived relative to the working directory, and the directory will be
1157 re-created along with its contents when the archive is extracted.
1158
1159 To archive a directory, first move to its superior directory. If you
1160 have followed the previous instructions in this tutorial, you should
1161 type:
1162
1163 @smallexample
1164 $ @kbd{cd ..}
1165 $
1166 @end smallexample
1167
1168 @noindent
1169 This will put you into the directory which contains @file{practice},
1170 i.e., your home directory. Once in the superior directory, you can
1171 specify the subdirectory, @file{practice}, as a file name argument. To
1172 store @file{practice} in the new archive file @file{music.tar}, type:
1173
1174 @smallexample
1175 $ @kbd{tar --create --verbose --file=music.tar practice}
1176 @end smallexample
1177
1178 @noindent
1179 @command{tar} should output:
1180
1181 @smallexample
1182 practice/
1183 practice/blues
1184 practice/folk
1185 practice/jazz
1186 practice/collection.tar
1187 @end smallexample
1188
1189 Note that the archive thus created is not in the subdirectory
1190 @file{practice}, but rather in the current working directory---the
1191 directory from which @command{tar} was invoked. Before trying to archive a
1192 directory from its superior directory, you should make sure you have
1193 write access to the superior directory itself, not only the directory
1194 you are trying archive with @command{tar}. For example, you will probably
1195 not be able to store your home directory in an archive by invoking
1196 @command{tar} from the root directory; @xref{absolute}. (Note
1197 also that @file{collection.tar}, the original archive file, has itself
1198 been archived. @command{tar} will accept any file as a file to be
1199 archived, regardless of its content. When @file{music.tar} is
1200 extracted, the archive file @file{collection.tar} will be re-written
1201 into the file system).
1202
1203 If you give @command{tar} a command such as
1204
1205 @smallexample
1206 $ @kbd{tar --create --file=foo.tar .}
1207 @end smallexample
1208
1209 @noindent
1210 @command{tar} will report @samp{tar: ./foo.tar is the archive; not
1211 dumped}. This happens because @command{tar} creates the archive
1212 @file{foo.tar} in the current directory before putting any files into
1213 it. Then, when @command{tar} attempts to add all the files in the
1214 directory @file{.} to the archive, it notices that the file
1215 @file{./foo.tar} is the same as the archive @file{foo.tar}, and skips
1216 it. (It makes no sense to put an archive into itself.) @GNUTAR{}
1217 will continue in this case, and create the archive
1218 normally, except for the exclusion of that one file. (@emph{Please
1219 note:} Other versions of @command{tar} are not so clever; they will
1220 enter an infinite loop when this happens, so you should not depend on
1221 this behavior unless you are certain you are running @GNUTAR{}.)
1222 @FIXME{bob doesn't like this sentence, since he does
1223 it all the time, and we've been doing it in the editing passes for
1224 this manual: In general, make sure that the archive is not inside a
1225 directory being dumped.}
1226
1227 @node list
1228 @section How to List Archives
1229
1230 @opindex list
1231 Frequently, you will find yourself wanting to determine exactly what a
1232 particular archive contains. You can use the @option{--list} (@option{-t}) operation
1233 to get the member names as they currently appear in the archive, as well
1234 as various attributes of the files at the time they were archived. For
1235 example, you can examine the archive @file{collection.tar} that you
1236 created in the last section with the command,
1237
1238 @smallexample
1239 $ @kbd{tar --list --file=collection.tar}
1240 @end smallexample
1241
1242 @noindent
1243 The output of @command{tar} would then be:
1244
1245 @smallexample
1246 blues
1247 folk
1248 jazz
1249 @end smallexample
1250
1251 @FIXME{we hope this will change. if it doesn't, need to show the
1252 creation of bfiles somewhere above!!! : }
1253
1254 @noindent
1255 The archive @file{bfiles.tar} would list as follows:
1256
1257 @smallexample
1258 ./birds
1259 baboon
1260 ./box
1261 @end smallexample
1262
1263 @noindent
1264 Be sure to use a @option{--file=@var{archive-name}} (@option{-f @var{archive-name}}) option just as with @option{--create} (@option{-c})
1265 to specify the name of the archive.
1266
1267 @opindex list, using with @option{--verbose}
1268 @opindex verbose, using with @option{--list}
1269 If you use the @option{--verbose} (@option{-v}) option with @option{--list}, then
1270 @command{tar} will print out a listing reminiscent of @w{@samp{ls -l}},
1271 showing owner, file size, and so forth.
1272
1273 If you had used @option{--verbose} (@option{-v}) mode, the example above would look
1274 like:
1275
1276 @smallexample
1277 $ @kbd{tar --list --verbose --file=collection.tar folk}
1278 -rw-r--r-- myself user 62 1990-05-23 10:55 folk
1279 @end smallexample
1280
1281 @cindex listing member and file names
1282 @anchor{listing member and file names}
1283 It is important to notice that the output of @kbd{tar --list
1284 --verbose} does not necessarily match that produced by @kbd{tar
1285 --create --verbose} while creating the archive. It is because
1286 @GNUTAR{}, unless told explicitly not to do so, removes some directory
1287 prefixes from file names before storing them in the archive
1288 (@xref{absolute}, for more information). In other
1289 words, in verbose mode @GNUTAR{} shows @dfn{file names} when creating
1290 an archive and @dfn{member names} when listing it. Consider this
1291 example:
1292
1293 @smallexample
1294 @group
1295 $ @kbd{tar cfv archive /etc/mail}
1296 tar: Removing leading `/' from member names
1297 /etc/mail/
1298 /etc/mail/sendmail.cf
1299 /etc/mail/aliases
1300 $ @kbd{tar tf archive}
1301 etc/mail/
1302 etc/mail/sendmail.cf
1303 etc/mail/aliases
1304 @end group
1305 @end smallexample
1306
1307 @opindex show-stored-names
1308 This default behavior can sometimes be inconvenient. You can force
1309 @GNUTAR{} show member names when creating archive by supplying
1310 @option{--show-stored-names} option.
1311
1312 @table @option
1313 @item --show-stored-names
1314 Print member (not @emph{file}) names when creating the archive.
1315 @end table
1316
1317 @cindex File name arguments, using @option{--list} with
1318 @opindex list, using with file name arguments
1319 You can specify one or more individual member names as arguments when
1320 using @samp{list}. In this case, @command{tar} will only list the
1321 names of members you identify. For example, @w{@kbd{tar --list
1322 --file=afiles.tar apple}} would only print @file{apple}.
1323
1324 @FIXME{we hope the relevant aspects of this will change:}Because
1325 @command{tar} preserves paths, file names must be specified as they appear
1326 in the archive (ie., relative to the directory from which the archive
1327 was created). Therefore, it is essential when specifying member names
1328 to @command{tar} that you give the exact member names. For example,
1329 @w{@kbd{tar --list --file=bfiles birds}} would produce an error message
1330 something like @samp{tar: birds: Not found in archive}, because there is
1331 no member named @file{birds}, only one named @file{./birds}. While the
1332 names @file{birds} and @file{./birds} name the same file, @emph{member}
1333 names are compared using a simplistic name comparison, in which an exact
1334 match is necessary. @xref{absolute}.
1335
1336 However, @w{@kbd{tar --list --file=collection.tar folk}} would respond
1337 with @file{folk}, because @file{folk} is in the archive file
1338 @file{collection.tar}. If you are not sure of the exact file name, try
1339 listing all the files in the archive and searching for the one you
1340 expect to find; remember that if you use @option{--list} with no file
1341 names as arguments, @command{tar} will print the names of all the members
1342 stored in the specified archive.
1343
1344 @menu
1345 * list dir::
1346 @end menu
1347
1348 @node list dir
1349 @unnumberedsubsec Listing the Contents of a Stored Directory
1350
1351 To get information about the contents of an archived directory,
1352 use the directory name as a file name argument in conjunction with
1353 @option{--list} (@option{-t}). To find out file attributes, include the
1354 @option{--verbose} (@option{-v}) option.
1355
1356 For example, to find out about files in the directory @file{practice}, in
1357 the archive file @file{music.tar}, type:
1358
1359 @smallexample
1360 $ @kbd{tar --list --verbose --file=music.tar practice}
1361 @end smallexample
1362
1363 @command{tar} responds:
1364
1365 @smallexample
1366 drwxrwxrwx myself user 0 1990-05-31 21:49 practice/
1367 -rw-r--r-- myself user 42 1990-05-21 13:29 practice/blues
1368 -rw-r--r-- myself user 62 1990-05-23 10:55 practice/folk
1369 -rw-r--r-- myself user 40 1990-05-21 13:30 practice/jazz
1370 -rw-r--r-- myself user 10240 1990-05-31 21:49 practice/collection.tar
1371 @end smallexample
1372
1373 When you use a directory name as a file name argument, @command{tar} acts on
1374 all the files (including sub-directories) in that directory.
1375
1376 @node extract
1377 @section How to Extract Members from an Archive
1378 @UNREVISED
1379 @cindex Extraction
1380 @cindex Retrieving files from an archive
1381 @cindex Resurrecting files from an archive
1382
1383 @opindex extract
1384 Creating an archive is only half the job---there is no point in storing
1385 files in an archive if you can't retrieve them. The act of retrieving
1386 members from an archive so they can be used and manipulated as
1387 unarchived files again is called @dfn{extraction}. To extract files
1388 from an archive, use the @option{--extract} (@option{--get} or
1389 @option{-x}) operation. As with @option{--create}, specify the name
1390 of the archive with @option{--file} (@option{-f}) option. Extracting
1391 an archive does not modify the archive in any way; you can extract it
1392 multiple times if you want or need to.
1393
1394 Using @option{--extract}, you can extract an entire archive, or specific
1395 files. The files can be directories containing other files, or not. As
1396 with @option{--create} (@option{-c}) and @option{--list} (@option{-t}), you may use the short or the
1397 long form of the operation without affecting the performance.
1398
1399 @menu
1400 * extracting archives::
1401 * extracting files::
1402 * extract dir::
1403 * extracting untrusted archives::
1404 * failing commands::
1405 @end menu
1406
1407 @node extracting archives
1408 @subsection Extracting an Entire Archive
1409
1410 To extract an entire archive, specify the archive file name only, with
1411 no individual file names as arguments. For example,
1412
1413 @smallexample
1414 $ @kbd{tar -xvf collection.tar}
1415 @end smallexample
1416
1417 @noindent
1418 produces this:
1419
1420 @smallexample
1421 -rw-r--r-- me user 28 1996-10-18 16:31 jazz
1422 -rw-r--r-- me user 21 1996-09-23 16:44 blues
1423 -rw-r--r-- me user 20 1996-09-23 16:44 folk
1424 @end smallexample
1425
1426 @node extracting files
1427 @subsection Extracting Specific Files
1428
1429 To extract specific archive members, give their exact member names as
1430 arguments, as printed by @option{--list} (@option{-t}). If you had mistakenly deleted
1431 one of the files you had placed in the archive @file{collection.tar}
1432 earlier (say, @file{blues}), you can extract it from the archive without
1433 changing the archive's structure. Its contents will be identical to the
1434 original file @file{blues} that you deleted.
1435
1436 First, make sure you are in the @file{practice} directory, and list the
1437 files in the directory. Now, delete the file, @samp{blues}, and list
1438 the files in the directory again.
1439
1440 You can now extract the member @file{blues} from the archive file
1441 @file{collection.tar} like this:
1442
1443 @smallexample
1444 $ @kbd{tar --extract --file=collection.tar blues}
1445 @end smallexample
1446
1447 @noindent
1448 If you list the files in the directory again, you will see that the file
1449 @file{blues} has been restored, with its original permissions, data modification
1450 times, and owner.@FIXME{This is only accidentally true, but not in
1451 general. In most cases, one has to be root for restoring the owner, and
1452 use a special option for restoring permissions. Here, it just happens
1453 that the restoring user is also the owner of the archived members, and
1454 that the current @code{umask} is compatible with original permissions.}
1455 (These parameters will be identical to those which
1456 the file had when you originally placed it in the archive; any changes
1457 you may have made before deleting the file from the file system,
1458 however, will @emph{not} have been made to the archive member.) The
1459 archive file, @samp{collection.tar}, is the same as it was before you
1460 extracted @samp{blues}. You can confirm this by running @command{tar} with
1461 @option{--list} (@option{-t}).
1462
1463 @FIXME{we hope this will change:}Remember that as with other operations,
1464 specifying the exact member name is important. @w{@kbd{tar --extract
1465 --file=bfiles.tar birds}} will fail, because there is no member named
1466 @file{birds}. To extract the member named @file{./birds}, you must
1467 specify @w{@kbd{tar --extract --file=bfiles.tar ./birds}}. To find the
1468 exact member names of the members of an archive, use @option{--list} (@option{-t})
1469 (@pxref{list}).
1470
1471 You can extract a file to standard output by combining the above options
1472 with the @option{--to-stdout} (@option{-O}) option (@pxref{Writing to Standard
1473 Output}).
1474
1475 If you give the @option{--verbose} option, then @option{--extract}
1476 will print the names of the archive members as it extracts them.
1477
1478 @node extract dir
1479 @subsection Extracting Files that are Directories
1480
1481 Extracting directories which are members of an archive is similar to
1482 extracting other files. The main difference to be aware of is that if
1483 the extracted directory has the same name as any directory already in
1484 the working directory, then files in the extracted directory will be
1485 placed into the directory of the same name. Likewise, if there are
1486 files in the pre-existing directory with the same names as the members
1487 which you extract, the files from the extracted archive will replace
1488 the files already in the working directory (and possible
1489 subdirectories). This will happen regardless of whether or not the
1490 files in the working directory were more recent than those extracted
1491 (there exist, however, special options that alter this behavior
1492 @pxref{Writing}).
1493
1494 However, if a file was stored with a directory name as part of its file
1495 name, and that directory does not exist under the working directory when
1496 the file is extracted, @command{tar} will create the directory.
1497
1498 We can demonstrate how to use @option{--extract} to extract a directory
1499 file with an example. Change to the @file{practice} directory if you
1500 weren't there, and remove the files @file{folk} and @file{jazz}. Then,
1501 go back to the parent directory and extract the archive
1502 @file{music.tar}. You may either extract the entire archive, or you may
1503 extract only the files you just deleted. To extract the entire archive,
1504 don't give any file names as arguments after the archive name
1505 @file{music.tar}. To extract only the files you deleted, use the
1506 following command:
1507
1508 @smallexample
1509 $ @kbd{tar -xvf music.tar practice/folk practice/jazz}
1510 practice/folk
1511 practice/jazz
1512 @end smallexample
1513
1514 @noindent
1515 If you were to specify two @option{--verbose} (@option{-v}) options, @command{tar}
1516 would have displayed more detail about the extracted files, as shown
1517 in the example below:
1518
1519 @smallexample
1520 $ @kbd{tar -xvvf music.tar practice/folk practice/jazz}
1521 -rw-r--r-- me user 28 1996-10-18 16:31 practice/jazz
1522 -rw-r--r-- me user 20 1996-09-23 16:44 practice/folk
1523 @end smallexample
1524
1525 @noindent
1526 Because you created the directory with @file{practice} as part of the
1527 file names of each of the files by archiving the @file{practice}
1528 directory as @file{practice}, you must give @file{practice} as part
1529 of the file names when you extract those files from the archive.
1530
1531 @FIXME{IMPORTANT! show the final structure, here. figure out what it
1532 will be.}
1533
1534 @node extracting untrusted archives
1535 @subsection Extracting Archives from Untrusted Sources
1536
1537 Extracting files from archives can overwrite files that already exist.
1538 If you receive an archive from an untrusted source, you should make a
1539 new directory and extract into that directory, so that you don't have
1540 to worry about the extraction overwriting one of your existing files.
1541 For example, if @file{untrusted.tar} came from somewhere else on the
1542 Internet, and you don't necessarily trust its contents, you can
1543 extract it as follows:
1544
1545 @smallexample
1546 $ @kbd{mkdir newdir}
1547 $ @kbd{cd newdir}
1548 $ @kbd{tar -xvf ../untrusted.tar}
1549 @end smallexample
1550
1551 It is also a good practice to examine contents of the archive
1552 before extracting it, using @option{--list} (@option{-t}) option, possibly combined
1553 with @option{--verbose} (@option{-v}).
1554
1555 @node failing commands
1556 @subsection Commands That Will Fail
1557
1558 Here are some sample commands you might try which will not work, and why
1559 they won't work.
1560
1561 If you try to use this command,
1562
1563 @smallexample
1564 $ @kbd{tar -xvf music.tar folk jazz}
1565 @end smallexample
1566
1567 @noindent
1568 you will get the following response:
1569
1570 @smallexample
1571 tar: folk: Not found in archive
1572 tar: jazz: Not found in archive
1573 $
1574 @end smallexample
1575
1576 @noindent
1577 This is because these files were not originally @emph{in} the parent
1578 directory @file{..}, where the archive is located; they were in the
1579 @file{practice} directory, and their file names reflect this:
1580
1581 @smallexample
1582 $ @kbd{tar -tvf music.tar}
1583 practice/folk
1584 practice/jazz
1585 practice/rock
1586 @end smallexample
1587
1588 @FIXME{make sure the above works when going through the examples in
1589 order...}
1590
1591 @noindent
1592 Likewise, if you try to use this command,
1593
1594 @smallexample
1595 $ @kbd{tar -tvf music.tar folk jazz}
1596 @end smallexample
1597
1598 @noindent
1599 you would get a similar response. Members with those names are not in the
1600 archive. You must use the correct member names in order to extract the
1601 files from the archive.
1602
1603 If you have forgotten the correct names of the files in the archive,
1604 use @w{@kbd{tar --list --verbose}} to list them correctly.
1605
1606 @FIXME{more examples, here? hag thinks it's a good idea.}
1607
1608 @node going further
1609 @section Going Further Ahead in this Manual
1610
1611 @FIXME{need to write up a node here about the things that are going to
1612 be in the rest of the manual.}
1613
1614 @node tar invocation
1615 @chapter Invoking @GNUTAR{}
1616 @UNREVISED
1617
1618 This chapter is about how one invokes the @GNUTAR{}
1619 command, from the command synopsis (@pxref{Synopsis}). There are
1620 numerous options, and many styles for writing them. One mandatory
1621 option specifies the operation @command{tar} should perform
1622 (@pxref{Operation Summary}), other options are meant to detail how
1623 this operation should be performed (@pxref{Option Summary}).
1624 Non-option arguments are not always interpreted the same way,
1625 depending on what the operation is.
1626
1627 You will find in this chapter everything about option styles and rules for
1628 writing them (@pxref{Styles}). On the other hand, operations and options
1629 are fully described elsewhere, in other chapters. Here, you will find
1630 only synthetic descriptions for operations and options, together with
1631 pointers to other parts of the @command{tar} manual.
1632
1633 Some options are so special they are fully described right in this
1634 chapter. They have the effect of inhibiting the normal operation of
1635 @command{tar} or else, they globally alter the amount of feedback the user
1636 receives about what is going on. These are the @option{--help} and
1637 @option{--version} (@pxref{help}), @option{--verbose} (@pxref{verbose})
1638 and @option{--interactive} options (@pxref{interactive}).
1639
1640 @menu
1641 * Synopsis::
1642 * using tar options::
1643 * Styles::
1644 * All Options::
1645 * help::
1646 * defaults::
1647 * verbose::
1648 * interactive::
1649 @end menu
1650
1651 @node Synopsis
1652 @section General Synopsis of @command{tar}
1653
1654 The @GNUTAR{} program is invoked as either one of:
1655
1656 @smallexample
1657 @kbd{tar @var{option}@dots{} [@var{name}]@dots{}}
1658 @kbd{tar @var{letter}@dots{} [@var{argument}]@dots{} [@var{option}]@dots{} [@var{name}]@dots{}}
1659 @end smallexample
1660
1661 The second form is for when old options are being used.
1662
1663 You can use @command{tar} to store files in an archive, to extract them from
1664 an archive, and to do other types of archive manipulation. The primary
1665 argument to @command{tar}, which is called the @dfn{operation}, specifies
1666 which action to take. The other arguments to @command{tar} are either
1667 @dfn{options}, which change the way @command{tar} performs an operation,
1668 or file names or archive members, which specify the files or members
1669 @command{tar} is to act on.
1670
1671 You can actually type in arguments in any order, even if in this manual
1672 the options always precede the other arguments, to make examples easier
1673 to understand. Further, the option stating the main operation mode
1674 (the @command{tar} main command) is usually given first.
1675
1676 Each @var{name} in the synopsis above is interpreted as an archive member
1677 name when the main command is one of @option{--compare}
1678 (@option{--diff}, @option{-d}), @option{--delete}, @option{--extract}
1679 (@option{--get}, @option{-x}), @option{--list} (@option{-t}) or
1680 @option{--update} (@option{-u}). When naming archive members, you
1681 must give the exact name of the member in the archive, as it is
1682 printed by @option{--list}. For @option{--append} (@option{-r}) and
1683 @option{--create} (@option{-c}), these @var{name} arguments specify
1684 the names of either files or directory hierarchies to place in the archive.
1685 These files or hierarchies should already exist in the file system,
1686 prior to the execution of the @command{tar} command.
1687
1688 @command{tar} interprets relative file names as being relative to the
1689 working directory. @command{tar} will make all file names relative
1690 (by removing leading slashes when archiving or restoring files),
1691 unless you specify otherwise (using the @option{--absolute-names}
1692 option). @xref{absolute}, for more information about
1693 @option{--absolute-names}.
1694
1695 If you give the name of a directory as either a file name or a member
1696 name, then @command{tar} acts recursively on all the files and directories
1697 beneath that directory. For example, the name @file{/} identifies all
1698 the files in the file system to @command{tar}.
1699
1700 The distinction between file names and archive member names is especially
1701 important when shell globbing is used, and sometimes a source of confusion
1702 for newcomers. @xref{Wildcards}, for more information about globbing.
1703 The problem is that shells may only glob using existing files in the
1704 file system. Only @command{tar} itself may glob on archive members, so when
1705 needed, you must ensure that wildcard characters reach @command{tar} without
1706 being interpreted by the shell first. Using a backslash before @samp{*}
1707 or @samp{?}, or putting the whole argument between quotes, is usually
1708 sufficient for this.
1709
1710 Even if @var{name}s are often specified on the command line, they
1711 can also be read from a text file in the file system, using the
1712 @option{--files-from=@var{file-of-names}} (@option{-T @var{file-of-names}}) option.
1713
1714 If you don't use any file name arguments, @option{--append} (@option{-r}),
1715 @option{--delete} and @option{--concatenate} (@option{--catenate},
1716 @option{-A}) will do nothing, while @option{--create} (@option{-c})
1717 will usually yield a diagnostic and inhibit @command{tar} execution.
1718 The other operations of @command{tar} (@option{--list},
1719 @option{--extract}, @option{--compare}, and @option{--update})
1720 will act on the entire contents of the archive.
1721
1722 @cindex exit status
1723 @cindex return status
1724 Besides successful exits, @GNUTAR{} may fail for
1725 many reasons. Some reasons correspond to bad usage, that is, when the
1726 @command{tar} command is improperly written. Errors may be
1727 encountered later, while encountering an error processing the archive
1728 or the files. Some errors are recoverable, in which case the failure
1729 is delayed until @command{tar} has completed all its work. Some
1730 errors are such that it would not meaningful, or at least risky, to
1731 continue processing: @command{tar} then aborts processing immediately.
1732 All abnormal exits, whether immediate or delayed, should always be
1733 clearly diagnosed on @code{stderr}, after a line stating the nature of
1734 the error.
1735
1736 @GNUTAR{} returns only a few exit statuses. I'm really
1737 aiming simplicity in that area, for now. If you are not using the
1738 @option{--compare} @option{--diff}, @option{-d}) option, zero means
1739 that everything went well, besides maybe innocuous warnings. Nonzero
1740 means that something went wrong. Right now, as of today, ``nonzero''
1741 is almost always 2, except for remote operations, where it may be
1742 128.
1743
1744 @node using tar options
1745 @section Using @command{tar} Options
1746
1747 @GNUTAR{} has a total of eight operating modes which
1748 allow you to perform a variety of tasks. You are required to choose
1749 one operating mode each time you employ the @command{tar} program by
1750 specifying one, and only one operation as an argument to the
1751 @command{tar} command (two lists of four operations each may be found
1752 at @ref{frequent operations} and @ref{Operations}). Depending on
1753 circumstances, you may also wish to customize how the chosen operating
1754 mode behaves. For example, you may wish to change the way the output
1755 looks, or the format of the files that you wish to archive may require
1756 you to do something special in order to make the archive look right.
1757
1758 You can customize and control @command{tar}'s performance by running
1759 @command{tar} with one or more options (such as @option{--verbose}
1760 (@option{-v}), which we used in the tutorial). As we said in the
1761 tutorial, @dfn{options} are arguments to @command{tar} which are (as
1762 their name suggests) optional. Depending on the operating mode, you
1763 may specify one or more options. Different options will have different
1764 effects, but in general they all change details of the operation, such
1765 as archive format, archive name, or level of user interaction. Some
1766 options make sense with all operating modes, while others are
1767 meaningful only with particular modes. You will likely use some
1768 options frequently, while you will only use others infrequently, or
1769 not at all. (A full list of options is available in @pxref{All Options}.)
1770
1771 @vrindex TAR_OPTIONS, environment variable
1772 @anchor{TAR_OPTIONS}
1773 The @env{TAR_OPTIONS} environment variable specifies default options to
1774 be placed in front of any explicit options. For example, if
1775 @code{TAR_OPTIONS} is @samp{-v --unlink-first}, @command{tar} behaves as
1776 if the two options @option{-v} and @option{--unlink-first} had been
1777 specified before any explicit options. Option specifications are
1778 separated by whitespace. A backslash escapes the next character, so it
1779 can be used to specify an option containing whitespace or a backslash.
1780
1781 Note that @command{tar} options are case sensitive. For example, the
1782 options @option{-T} and @option{-t} are different; the first requires an
1783 argument for stating the name of a file providing a list of @var{name}s,
1784 while the second does not require an argument and is another way to
1785 write @option{--list} (@option{-t}).
1786
1787 In addition to the eight operations, there are many options to
1788 @command{tar}, and three different styles for writing both: long (mnemonic)
1789 form, short form, and old style. These styles are discussed below.
1790 Both the options and the operations can be written in any of these three
1791 styles.
1792
1793 @FIXME{menu at end of this node. need to think of an actual outline
1794 for this chapter; probably do that after stuff from chapter 4 is
1795 incorporated.}
1796
1797 @node Styles
1798 @section The Three Option Styles
1799
1800 There are three styles for writing operations and options to the command
1801 line invoking @command{tar}. The different styles were developed at
1802 different times during the history of @command{tar}. These styles will be
1803 presented below, from the most recent to the oldest.
1804
1805 Some options must take an argument. (For example, @option{--file}
1806 (@option{-f})) takes the name of an archive file as an argument. If
1807 you do not supply an archive file name, @command{tar} will use a
1808 default, but this can be confusing; thus, we recommend that you always
1809 supply a specific archive file name.) Where you @emph{place} the
1810 arguments generally depends on which style of options you choose. We
1811 will detail specific information relevant to each option style in the
1812 sections on the different option styles, below. The differences are
1813 subtle, yet can often be very important; incorrect option placement
1814 can cause you to overwrite a number of important files. We urge you
1815 to note these differences, and only use the option style(s) which
1816 makes the most sense to you until you feel comfortable with the others.
1817
1818 Some options @emph{may} take an argument (currently, there are
1819 two such options: @option{--backup} and @option{--occurrence}). Such
1820 options may have at most long and short forms, they do not have old style
1821 equivalent. The rules for specifying an argument for such options
1822 are stricter than those for specifying mandatory arguments. Please,
1823 pay special attention to them.
1824
1825 @menu
1826 * Mnemonic Options:: Mnemonic Option Style
1827 * Short Options:: Short Option Style
1828 * Old Options:: Old Option Style
1829 * Mixing:: Mixing Option Styles
1830 @end menu
1831
1832 @node Mnemonic Options
1833 @subsection Mnemonic Option Style
1834
1835 @FIXME{have to decide whether or not to replace other occurrences of
1836 "mnemonic" with "long", or *ugh* vice versa.}
1837
1838 Each option has at least one long (or mnemonic) name starting with two
1839 dashes in a row, e.g., @option{--list}. The long names are more clear than
1840 their corresponding short or old names. It sometimes happens that a
1841 single mnemonic option has many different different names which are
1842 synonymous, such as @option{--compare} and @option{--diff}. In addition,
1843 long option names can be given unique abbreviations. For example,
1844 @option{--cre} can be used in place of @option{--create} because there is no
1845 other mnemonic option which begins with @samp{cre}. (One way to find
1846 this out is by trying it and seeing what happens; if a particular
1847 abbreviation could represent more than one option, @command{tar} will tell
1848 you that that abbreviation is ambiguous and you'll know that that
1849 abbreviation won't work. You may also choose to run @samp{tar --help}
1850 to see a list of options. Be aware that if you run @command{tar} with a
1851 unique abbreviation for the long name of an option you didn't want to
1852 use, you are stuck; @command{tar} will perform the command as ordered.)
1853
1854 Mnemonic options are meant to be obvious and easy to remember, and their
1855 meanings are generally easier to discern than those of their
1856 corresponding short options (see below). For example:
1857
1858 @smallexample
1859 $ @kbd{tar --create --verbose --blocking-factor=20 --file=/dev/rmt0}
1860 @end smallexample
1861
1862 @noindent
1863 gives a fairly good set of hints about what the command does, even
1864 for those not fully acquainted with @command{tar}.
1865
1866 Mnemonic options which require arguments take those arguments
1867 immediately following the option name. There are two ways of
1868 specifying a mandatory argument. It can be separated from the
1869 option name either by an equal sign, or by any amount of
1870 white space characters. For example, the @option{--file} option (which
1871 tells the name of the @command{tar} archive) is given a file such as
1872 @file{archive.tar} as argument by using any of the following notations:
1873 @option{--file=archive.tar} or @option{--file archive.tar}.
1874
1875 In contrast, optional arguments must always be introduced using
1876 an equal sign. For example, the @option{--backup} option takes
1877 an optional argument specifying backup type. It must be used
1878 as @option{--backup=@var{backup-type}}.
1879
1880 @node Short Options
1881 @subsection Short Option Style
1882
1883 Most options also have a short option name. Short options start with
1884 a single dash, and are followed by a single character, e.g., @option{-t}
1885 (which is equivalent to @option{--list}). The forms are absolutely
1886 identical in function; they are interchangeable.
1887
1888 The short option names are faster to type than long option names.
1889
1890 Short options which require arguments take their arguments immediately
1891 following the option, usually separated by white space. It is also
1892 possible to stick the argument right after the short option name, using
1893 no intervening space. For example, you might write @w{@option{-f
1894 archive.tar}} or @option{-farchive.tar} instead of using
1895 @option{--file=archive.tar}. Both @option{--file=@var{archive-name}} and
1896 @w{@option{-f @var{archive-name}}} denote the option which indicates a
1897 specific archive, here named @file{archive.tar}.
1898
1899 Short options which take optional arguments take their arguments
1900 immediately following the option letter, @emph{without any intervening
1901 white space characters}.
1902
1903 Short options' letters may be clumped together, but you are not
1904 required to do this (as compared to old options; see below). When
1905 short options are clumped as a set, use one (single) dash for them
1906 all, e.g., @w{@samp{@command{tar} -cvf}}. Only the last option in
1907 such a set is allowed to have an argument@footnote{Clustering many
1908 options, the last of which has an argument, is a rather opaque way to
1909 write options. Some wonder if @acronym{GNU} @code{getopt} should not
1910 even be made helpful enough for considering such usages as invalid.}.
1911
1912 When the options are separated, the argument for each option which requires
1913 an argument directly follows that option, as is usual for Unix programs.
1914 For example:
1915
1916 @smallexample
1917 $ @kbd{tar -c -v -b 20 -f /dev/rmt0}
1918 @end smallexample
1919
1920 If you reorder short options' locations, be sure to move any arguments
1921 that belong to them. If you do not move the arguments properly, you may
1922 end up overwriting files.
1923
1924 @node Old Options
1925 @subsection Old Option Style
1926 @UNREVISED
1927
1928 Like short options, old options are single letters. However, old options
1929 must be written together as a single clumped set, without spaces separating
1930 them or dashes preceding them@footnote{Beware that if you precede options
1931 with a dash, you are announcing the short option style instead of the
1932 old option style; short options are decoded differently.}. This set
1933 of letters must be the first to appear on the command line, after the
1934 @command{tar} program name and some white space; old options cannot appear
1935 anywhere else. The letter of an old option is exactly the same letter as
1936 the corresponding short option. For example, the old option @samp{t} is
1937 the same as the short option @option{-t}, and consequently, the same as the
1938 mnemonic option @option{--list}. So for example, the command @w{@samp{tar
1939 cv}} specifies the option @option{-v} in addition to the operation @option{-c}.
1940
1941 @FIXME{bob suggests having an uglier example. :-) }
1942
1943 When options that need arguments are given together with the command,
1944 all the associated arguments follow, in the same order as the options.
1945 Thus, the example given previously could also be written in the old
1946 style as follows:
1947
1948 @smallexample
1949 $ @kbd{tar cvbf 20 /dev/rmt0}
1950 @end smallexample
1951
1952 @noindent
1953 Here, @samp{20} is the argument of @option{-b} and @samp{/dev/rmt0} is
1954 the argument of @option{-f}.
1955
1956 On the other hand, this old style syntax makes it difficult to match
1957 option letters with their corresponding arguments, and is often
1958 confusing. In the command @w{@samp{tar cvbf 20 /dev/rmt0}}, for example,
1959 @samp{20} is the argument for @option{-b}, @samp{/dev/rmt0} is the
1960 argument for @option{-f}, and @option{-v} does not have a corresponding
1961 argument. Even using short options like in @w{@samp{tar -c -v -b 20 -f
1962 /dev/rmt0}} is clearer, putting all arguments next to the option they
1963 pertain to.
1964
1965 If you want to reorder the letters in the old option argument, be
1966 sure to reorder any corresponding argument appropriately.
1967
1968 This old way of writing @command{tar} options can surprise even experienced
1969 users. For example, the two commands:
1970
1971 @smallexample
1972 @kbd{tar cfz archive.tar.gz file}
1973 @kbd{tar -cfz archive.tar.gz file}
1974 @end smallexample
1975
1976 @noindent
1977 are quite different. The first example uses @file{archive.tar.gz} as
1978 the value for option @samp{f} and recognizes the option @samp{z}. The
1979 second example, however, uses @file{z} as the value for option
1980 @samp{f} --- probably not what was intended.
1981
1982 Old options are kept for compatibility with old versions of @command{tar}.
1983
1984 This second example could be corrected in many ways, among which the
1985 following are equivalent:
1986
1987 @smallexample
1988 @kbd{tar -czf archive.tar.gz file}
1989 @kbd{tar -cf archive.tar.gz -z file}
1990 @kbd{tar cf archive.tar.gz -z file}
1991 @end smallexample
1992
1993 @FIXME{still could explain this better; it's redundant:}
1994
1995 @cindex option syntax, traditional
1996 As far as we know, all @command{tar} programs, @acronym{GNU} and
1997 non-@acronym{GNU}, support old options. @GNUTAR{}
1998 supports them not only for historical reasons, but also because many
1999 people are used to them. For compatibility with Unix @command{tar},
2000 the first argument is always treated as containing command and option
2001 letters even if it doesn't start with @samp{-}. Thus, @samp{tar c} is
2002 equivalent to @w{@samp{tar -c}:} both of them specify the
2003 @option{--create} (@option{-c}) command to create an archive.
2004
2005 @node Mixing
2006 @subsection Mixing Option Styles
2007
2008 All three styles may be intermixed in a single @command{tar} command,
2009 so long as the rules for each style are fully
2010 respected@footnote{Before @GNUTAR{} version 1.11.6,
2011 a bug prevented intermixing old style options with mnemonic options in
2012 some cases.}. Old style options and either of the modern styles of
2013 options may be mixed within a single @command{tar} command. However,
2014 old style options must be introduced as the first arguments only,
2015 following the rule for old options (old options must appear directly
2016 after the @command{tar} command and some white space). Modern options
2017 may be given only after all arguments to the old options have been
2018 collected. If this rule is not respected, a modern option might be
2019 falsely interpreted as the value of the argument to one of the old
2020 style options.
2021
2022 For example, all the following commands are wholly equivalent, and
2023 illustrate the many combinations and orderings of option styles.
2024
2025 @smallexample
2026 @kbd{tar --create --file=archive.tar}
2027 @kbd{tar --create -f archive.tar}
2028 @kbd{tar --create -farchive.tar}
2029 @kbd{tar --file=archive.tar --create}
2030 @kbd{tar --file=archive.tar -c}
2031 @kbd{tar -c --file=archive.tar}
2032 @kbd{tar -c -f archive.tar}
2033 @kbd{tar -c -farchive.tar}
2034 @kbd{tar -cf archive.tar}
2035 @kbd{tar -cfarchive.tar}
2036 @kbd{tar -f archive.tar --create}
2037 @kbd{tar -f archive.tar -c}
2038 @kbd{tar -farchive.tar --create}
2039 @kbd{tar -farchive.tar -c}
2040 @kbd{tar c --file=archive.tar}
2041 @kbd{tar c -f archive.tar}
2042 @kbd{tar c -farchive.tar}
2043 @kbd{tar cf archive.tar}
2044 @kbd{tar f archive.tar --create}
2045 @kbd{tar f archive.tar -c}
2046 @kbd{tar fc archive.tar}
2047 @end smallexample
2048
2049 On the other hand, the following commands are @emph{not} equivalent to
2050 the previous set:
2051
2052 @smallexample
2053 @kbd{tar -f -c archive.tar}
2054 @kbd{tar -fc archive.tar}
2055 @kbd{tar -fcarchive.tar}
2056 @kbd{tar -farchive.tarc}
2057 @kbd{tar cfarchive.tar}
2058 @end smallexample
2059
2060 @noindent
2061 These last examples mean something completely different from what the
2062 user intended (judging based on the example in the previous set which
2063 uses long options, whose intent is therefore very clear). The first
2064 four specify that the @command{tar} archive would be a file named
2065 @option{-c}, @samp{c}, @samp{carchive.tar} or @samp{archive.tarc},
2066 respectively. The first two examples also specify a single non-option,
2067 @var{name} argument having the value @samp{archive.tar}. The last
2068 example contains only old style option letters (repeating option
2069 @samp{c} twice), not all of which are meaningful (eg., @samp{.},
2070 @samp{h}, or @samp{i}), with no argument value. @FIXME{not sure i liked
2071 the first sentence of this paragraph..}
2072
2073 @node All Options
2074 @section All @command{tar} Options
2075
2076 The coming manual sections contain an alphabetical listing of all
2077 @command{tar} operations and options, with brief descriptions and cross
2078 references to more in-depth explanations in the body of the manual.
2079 They also contain an alphabetically arranged table of the short option
2080 forms with their corresponding long option. You can use this table as
2081 a reference for deciphering @command{tar} commands in scripts.
2082
2083 @menu
2084 * Operation Summary::
2085 * Option Summary::
2086 * Short Option Summary::
2087 @end menu
2088
2089 @node Operation Summary
2090 @subsection Operations
2091
2092 @table @option
2093
2094 @opindex append, summary
2095 @item --append
2096 @itemx -r
2097
2098 Appends files to the end of the archive. @xref{append}.
2099
2100 @opindex catenate, summary
2101 @item --catenate
2102 @itemx -A
2103
2104 Same as @option{--concatenate}. @xref{concatenate}.
2105
2106 @opindex compare, summary
2107 @item --compare
2108 @itemx -d
2109
2110 Compares archive members with their counterparts in the file
2111 system, and reports differences in file size, mode, owner,
2112 modification date and contents. @xref{compare}.
2113
2114 @opindex concatenate, summary
2115 @item --concatenate
2116 @itemx -A
2117
2118 Appends other @command{tar} archives to the end of the archive.
2119 @xref{concatenate}.
2120
2121 @opindex create, summary
2122 @item --create
2123 @itemx -c
2124
2125 Creates a new @command{tar} archive. @xref{create}.
2126
2127 @opindex delete, summary
2128 @item --delete
2129
2130 Deletes members from the archive. Don't try this on a archive on a
2131 tape! @xref{delete}.
2132
2133 @opindex diff, summary
2134 @item --diff
2135 @itemx -d
2136
2137 Same @option{--compare}. @xref{compare}.
2138
2139 @opindex extract, summary
2140 @item --extract
2141 @itemx -x
2142
2143 Extracts members from the archive into the file system. @xref{extract}.
2144
2145 @opindex get, summary
2146 @item --get
2147 @itemx -x
2148
2149 Same as @option{--extract}. @xref{extract}.
2150
2151 @opindex list, summary
2152 @item --list
2153 @itemx -t
2154
2155 Lists the members in an archive. @xref{list}.
2156
2157 @opindex update, summary
2158 @item --update
2159 @itemx -u
2160
2161 @FIXME{It was: A combination of the @option{--compare} and
2162 @option{--append} operations. This is not true and rather misleading,
2163 as @option{--compare} (@option{--diff}, @option{-d}) does a lot more than @option{--update} (@option{-u}) for
2164 ensuring files are identical.} Adds files to the end of the archive,
2165 but only if they are newer than their counterparts already in the
2166 archive, or if they do not already exist in the archive.
2167 @xref{update}.
2168
2169 @end table
2170
2171 @node Option Summary
2172 @subsection @command{tar} Options
2173
2174 @table @option
2175
2176 @opindex absolute-names, summary
2177 @item --absolute-names
2178 @itemx -P
2179
2180 Normally when creating an archive, @command{tar} strips an initial
2181 @samp{/} from member names. This option disables that behavior.
2182 @xref{absolute}.
2183
2184 @opindex after-date, summary
2185 @item --after-date
2186
2187 (See @option{--newer}, @pxref{after})
2188
2189 @opindex anchored, summary
2190 @item --anchored
2191 An exclude pattern must match an initial subsequence of the name's components.
2192 @xref{controlling pattern-matching with exclude}.
2193
2194 @opindex atime-preserve, summary
2195 @item --atime-preserve
2196 @itemx --atime-preserve=replace
2197 @itemx --atime-preserve=system
2198
2199 Attempt to preserve the access time of files when reading them. This
2200 option currently is effective only on files that you own, unless you
2201 have superuser privileges.
2202
2203 @option{--atime-preserve=replace} remembers the access time of a file
2204 before reading it, and then restores the access time afterwards. This
2205 may cause problems if other programs are reading the file at the same
2206 time, as the times of their accesses will be lost. On most platforms
2207 restoring the access time also requires @command{tar} to restore the
2208 data modification time too, so this option may also cause problems if
2209 other programs are writing the file at the same time. (Tar attempts
2210 to detect this situation, but cannot do so reliably due to race
2211 conditions.) Worse, on most platforms restoring the access time also
2212 updates the status change time, which means that this option is
2213 incompatible with incremental backups.
2214
2215 @option{--atime-preserve=system} avoids changing time stamps on files,
2216 without interfering with time stamp updates
2217 caused by other programs, so it works better with incremental backups.
2218 However, it requires a special @code{O_NOATIME} option from the
2219 underlying operating and file system implementation, and it also requires
2220 that searching directories does not update their access times. As of
2221 this writing (November 2005) this works only with Linux, and only with
2222 Linux kernels 2.6.8 and later. Worse, there is currently no reliable
2223 way to know whether this feature actually works. Sometimes
2224 @command{tar} knows that it does not work, and if you use
2225 @option{--atime-preserve=system} then @command{tar} complains and
2226 exits right away. But other times @command{tar} might think that the
2227 option works when it actually does not.
2228
2229 Currently @option{--atime-preserve} with no operand defaults to
2230 @option{--atime-preserve=replace}, but this may change in the future
2231 as support for @option{--atime-preserve=system} improves.
2232
2233 If your operating system does not support
2234 @option{--atime-preserve=system}, you might be able to preserve access
2235 times reliably by by using the @command{mount} command. For example,
2236 you can mount the file system read-only, or access the file system via
2237 a read-only loopback mount, or use the @samp{noatime} mount option
2238 available on some systems. However, mounting typically requires
2239 superuser privileges and can be a pain to manage.
2240
2241 @opindex backup, summary
2242 @item --backup=@var{backup-type}
2243
2244 Rather than deleting files from the file system, @command{tar} will
2245 back them up using simple or numbered backups, depending upon
2246 @var{backup-type}. @xref{backup}.
2247
2248 @opindex block-number, summary
2249 @item --block-number
2250 @itemx -R
2251
2252 With this option present, @command{tar} prints error messages for read errors
2253 with the block number in the archive file. @xref{block-number}.
2254
2255 @opindex blocking-factor, summary
2256 @item --blocking-factor=@var{blocking}
2257 @itemx -b @var{blocking}
2258
2259 Sets the blocking factor @command{tar} uses to @var{blocking} x 512 bytes per
2260 record. @xref{Blocking Factor}.
2261
2262 @opindex bzip2, summary
2263 @item --bzip2
2264 @itemx -j
2265
2266 This option tells @command{tar} to read or write archives through
2267 @code{bzip2}. @xref{gzip}.
2268
2269 @opindex checkpoint, summary
2270 @item --checkpoint
2271
2272 This option directs @command{tar} to print periodic checkpoint messages as it
2273 reads through the archive. Its intended for when you want a visual
2274 indication that @command{tar} is still running, but don't want to see
2275 @option{--verbose} output. @FIXME-xref{}
2276
2277 @opindex check-links, summary
2278 @item --check-links
2279 @itemx -l
2280 If this option was given, @command{tar} will check the number of links
2281 dumped for each processed file. If this number does not match the
2282 total number of hard links for the file, a warning message will be
2283 output.
2284
2285 Future versions will take @option{-l} as a short version of
2286 @option{--check-links}. However, current release still retains the old
2287 semantics for @option{-l}.
2288
2289 @xref{Changes}, for more information.
2290
2291 @opindex compress, summary
2292 @opindex uncompress, summary
2293 @item --compress
2294 @itemx --uncompress
2295 @itemx -Z
2296
2297 @command{tar} will use the @command{compress} program when reading or
2298 writing the archive. This allows you to directly act on archives
2299 while saving space. @xref{gzip}.
2300
2301 @opindex confirmation, summary
2302 @item --confirmation
2303
2304 (See @option{--interactive}.) @xref{interactive}.
2305
2306 @opindex dereference, summary
2307 @item --dereference
2308 @itemx -h
2309
2310 When creating a @command{tar} archive, @command{tar} will archive the
2311 file that a symbolic link points to, rather than archiving the
2312 symlink. @xref{dereference}.
2313
2314 @opindex directory, summary
2315 @item --directory=@var{dir}
2316 @itemx -C @var{dir}
2317
2318 When this option is specified, @command{tar} will change its current directory
2319 to @var{dir} before performing any operations. When this option is used
2320 during archive creation, it is order sensitive. @xref{directory}.
2321
2322 @opindex exclude, summary
2323 @item --exclude=@var{pattern}
2324
2325 When performing operations, @command{tar} will skip files that match
2326 @var{pattern}. @xref{exclude}.
2327
2328 @opindex exclude-from, summary
2329 @item --exclude-from=@var{file}
2330 @itemx -X @var{file}
2331
2332 Similar to @option{--exclude}, except @command{tar} will use the list of
2333 patterns in the file @var{file}. @xref{exclude}.
2334
2335 @opindex exclude-caches, summary
2336 @item --exclude-caches
2337
2338 Automatically excludes all directories
2339 containing a cache directory tag. @xref{exclude}.
2340
2341 @opindex file, summary
2342 @item --file=@var{archive}
2343 @itemx -f @var{archive}
2344
2345 @command{tar} will use the file @var{archive} as the @command{tar} archive it
2346 performs operations on, rather than @command{tar}'s compilation dependent
2347 default. @xref{file tutorial}.
2348
2349 @opindex files-from, summary
2350 @item --files-from=@var{file}
2351 @itemx -T @var{file}
2352
2353 @command{tar} will use the contents of @var{file} as a list of archive members
2354 or files to operate on, in addition to those specified on the
2355 command-line. @xref{files}.
2356
2357 @opindex force-local, summary
2358 @item --force-local
2359
2360 Forces @command{tar} to interpret the filename given to @option{--file}
2361 as a local file, even if it looks like a remote tape drive name.
2362 @xref{local and remote archives}.
2363
2364 @opindex format, summary
2365 @item --format=@var{format}
2366
2367 Selects output archive format. @var{Format} may be one of the
2368 following:
2369
2370 @table @samp
2371 @item v7
2372 Creates an archive that is compatible with Unix V7 @command{tar}.
2373
2374 @item oldgnu
2375 Creates an archive that is compatible with GNU @command{tar} version
2376 1.12 or earlier.
2377
2378 @item gnu
2379 Creates archive in GNU tar 1.13 format. Basically it is the same as
2380 @samp{oldgnu} with the only difference in the way it handles long
2381 numeric fields.
2382
2383 @item ustar
2384 Creates a @acronym{POSIX.1-1988} compatible archive.
2385
2386 @item posix
2387 Creates a @acronym{POSIX.1-2001 archive}.
2388
2389 @end table
2390
2391 @xref{Formats}, for a detailed discussion of these formats.
2392
2393 @opindex group, summary
2394 @item --group=@var{group}
2395
2396 Files added to the @command{tar} archive will have a group id of @var{group},
2397 rather than the group from the source file. @var{group} is first decoded
2398 as a group symbolic name, but if this interpretation fails, it has to be
2399 a decimal numeric group ID. @FIXME-xref{}
2400
2401 Also see the comments for the @option{--owner=@var{user}} option.
2402
2403 @opindex gzip, summary
2404 @opindex gunzip, summary
2405 @opindex ungzip, summary
2406 @item --gzip
2407 @itemx --gunzip
2408 @itemx --ungzip
2409 @itemx -z
2410
2411 This option tells @command{tar} to read or write archives through
2412 @command{gzip}, allowing @command{tar} to directly operate on several
2413 kinds of compressed archives transparently. @xref{gzip}.
2414
2415 @opindex help, summary
2416 @item --help
2417
2418 @command{tar} will print out a short message summarizing the operations and
2419 options to @command{tar} and exit. @xref{help}.
2420
2421 @opindex ignore-case, summary
2422 @item --ignore-case
2423 Ignore case when excluding files. @xref{controlling pattern-matching
2424 with exclude}.
2425
2426 @opindex ignore-command-error, summary
2427 @item --ignore-command-error
2428 Ignore exit codes of subprocesses. @xref{Writing to an External Program}.
2429
2430 @opindex ignore-failed-read, summary
2431 @item --ignore-failed-read
2432
2433 Do not exit unsuccessfully merely because an unreadable file was encountered.
2434 @xref{Reading}.
2435
2436 @opindex ignore-zeros, summary
2437 @item --ignore-zeros
2438 @itemx -i
2439
2440 With this option, @command{tar} will ignore zeroed blocks in the
2441 archive, which normally signals EOF. @xref{Reading}.
2442
2443 @opindex incremental, summary
2444 @item --incremental
2445 @itemx -G
2446
2447 Used to inform @command{tar} that it is working with an old
2448 @acronym{GNU}-format incremental backup archive. It is intended
2449 primarily for backwards compatibility only. @FIXME{incremental and
2450 listed-incremental}.
2451
2452 @opindex index-file, summary
2453 @item --index-file=@var{file}
2454
2455 Send verbose output to @var{file} instead of to standard output.
2456
2457 @opindex info-script, summary
2458 @opindex new-volume-script, summary
2459 @item --info-script=@var{script-file}
2460 @itemx --new-volume-script=@var{script-file}
2461 @itemx -F @var{script-file}
2462
2463 When @command{tar} is performing multi-tape backups, @var{script-file} is run
2464 at the end of each tape. If @var{script-file} exits with nonzero status,
2465 @command{tar} fails immediately. @xref{info-script}, for a detailed
2466 discussion of @var{script-file}.
2467
2468 @opindex interactive, summary
2469 @item --interactive
2470 @itemx --confirmation
2471 @itemx -w
2472
2473 Specifies that @command{tar} should ask the user for confirmation before
2474 performing potentially destructive options, such as overwriting files.
2475 @xref{interactive}.
2476
2477 @opindex keep-newer-files, summary
2478 @item --keep-newer-files
2479
2480 Do not replace existing files that are newer than their archive copies
2481 when extracting files from an archive.
2482
2483 @opindex keep-old-files, summary
2484 @item --keep-old-files
2485 @itemx -k
2486
2487 Do not overwrite existing files when extracting files from an archive.
2488 @xref{Keep Old Files}.
2489
2490 @opindex label, summary
2491 @item --label=@var{name}
2492 @itemx -V @var{name}
2493
2494 When creating an archive, instructs @command{tar} to write @var{name}
2495 as a name record in the archive. When extracting or listing archives,
2496 @command{tar} will only operate on archives that have a label matching
2497 the pattern specified in @var{name}. @xref{Tape Files}.
2498
2499 @opindex listed-incremental, summary
2500 @item --listed-incremental=@var{snapshot-file}
2501 @itemx -g @var{snapshot-file}
2502
2503 During a @option{--create} operation, specifies that the archive that
2504 @command{tar} creates is a new @acronym{GNU}-format incremental
2505 backup, using @var{snapshot-file} to determine which files to backup.
2506 With other operations, informs @command{tar} that the archive is in
2507 incremental format. @FIXME{incremental and listed-incremental}.
2508
2509 @opindex mode, summary
2510 @item --mode=@var{permissions}
2511
2512 When adding files to an archive, @command{tar} will use
2513 @var{permissions} for the archive members, rather than the permissions
2514 from the files. The program @command{chmod} and this @command{tar}
2515 option share the same syntax for what @var{permissions} might be.
2516 @xref{File permissions, Permissions, File permissions, fileutils,
2517 @acronym{GNU} file utilities}. This reference also has useful
2518 information for those not being overly familiar with the Unix
2519 permission system.
2520
2521 Of course, @var{permissions} might be plainly specified as an octal number.
2522 However, by using generic symbolic modifications to mode bits, this allows
2523 more flexibility. For example, the value @samp{a+rw} adds read and write
2524 permissions for everybody, while retaining executable bits on directories
2525 or on any other file already marked as executable.
2526
2527 @opindex multi-volume, summary
2528 @item --multi-volume
2529 @itemx -M
2530
2531 Informs @command{tar} that it should create or otherwise operate on a
2532 multi-volume @command{tar} archive. @xref{Using Multiple Tapes}.
2533
2534 @opindex new-volume-script, summary
2535 @item --new-volume-script
2536
2537 (see --info-script)
2538
2539 @opindex seek, summary
2540 @item --seek
2541 @itemx -n
2542
2543 Assume that the archive media supports seeks to arbitrary
2544 locations. Usually @command{tar} determines automatically whether
2545 the archive can be seeked or not. This option is intended for use
2546 in cases when such recognition fails.
2547
2548 @opindex newer, summary
2549 @item --newer=@var{date}
2550 @itemx --after-date=@var{date}
2551 @itemx -N
2552
2553 When creating an archive, @command{tar} will only add files that have changed
2554 since @var{date}. If @var{date} begins with @samp{/} or @samp{.}, it
2555 is taken to be the name of a file whose data modification time specifies
2556 the date. @xref{after}.
2557
2558 @opindex newer-mtime, summary
2559 @item --newer-mtime=@var{date}
2560
2561 Like @option{--newer}, but add only files whose
2562 contents have changed (as opposed to just @option{--newer}, which will
2563 also back up files for which any status information has changed).
2564
2565 @opindex no-anchored, summary
2566 @item --no-anchored
2567 An exclude pattern can match any subsequence of the name's components.
2568 @xref{controlling pattern-matching with exclude}.
2569
2570 @opindex no-ignore-case, summary
2571 @item --no-ignore-case
2572 Use case-sensitive matching when excluding files.
2573 @xref{controlling pattern-matching with exclude}.
2574
2575 @opindex no-ignore-command-error, summary
2576 @item --no-ignore-command-error
2577 Print warnings about subprocesses terminated with a non-zero exit
2578 code. @xref{Writing to an External Program}.
2579
2580 @opindex no-recursion, summary
2581 @item --no-recursion
2582
2583 With this option, @command{tar} will not recurse into directories.
2584 @xref{recurse}.
2585
2586 @opindex no-same-owner, summary
2587 @item --no-same-owner
2588 @itemx -o
2589
2590 When extracting an archive, do not attempt to preserve the owner
2591 specified in the @command{tar} archive. This the default behavior
2592 for ordinary users.
2593
2594 @opindex no-same-permissions, summary
2595 @item --no-same-permissions
2596
2597 When extracting an archive, subtract the user's umask from files from
2598 the permissions specified in the archive. This is the default behavior
2599 for ordinary users.
2600
2601 @opindex no-wildcards, summary
2602 @item --no-wildcards
2603 Do not use wildcards when excluding files.
2604 @xref{controlling pattern-matching with exclude}.
2605
2606 @opindex no-wildcards-match-slash, summary
2607 @item --no-wildcards-match-slash
2608 Wildcards do not match @samp{/} when excluding files.
2609 @xref{controlling pattern-matching with exclude}.
2610
2611 @opindex null, summary
2612 @item --null
2613
2614 When @command{tar} is using the @option{--files-from} option, this option
2615 instructs @command{tar} to expect filenames terminated with @option{NUL}, so
2616 @command{tar} can correctly work with file names that contain newlines.
2617 @xref{nul}.
2618
2619 @opindex numeric-owner, summary
2620 @item --numeric-owner
2621
2622 This option will notify @command{tar} that it should use numeric user
2623 and group IDs when creating a @command{tar} file, rather than names.
2624 @xref{Attributes}.
2625
2626 @item -o
2627 When extracting files, this option is a synonym for
2628 @option{--no-same-owner}, i.e. it prevents @command{tar} from
2629 restoring ownership of files being extracted.
2630
2631 When creating an archive, @option{-o} is a synonym for
2632 @option{--old-archive}. This behavior is for compatibility
2633 with previous versions of @GNUTAR{}, and will be
2634 removed in the future releases.
2635
2636 @xref{Changes}, for more information.
2637
2638 @opindex occurrence, summary
2639 @item --occurrence[=@var{number}]
2640
2641 This option can be used in conjunction with one of the subcommands
2642 @option{--delete}, @option{--diff}, @option{--extract} or
2643 @option{--list} when a list of files is given either on the command
2644 line or via @option{-T} option.
2645
2646 This option instructs @command{tar} to process only the @var{number}th
2647 occurrence of each named file. @var{Number} defaults to 1, so
2648
2649 @smallexample
2650 tar -x -f archive.tar --occurrence filename
2651 @end smallexample
2652
2653 @noindent
2654 will extract the first occurrence of @file{filename} from @file{archive.tar}
2655 and will terminate without scanning to the end of the archive.
2656
2657 @opindex old-archive, summary
2658 @item --old-archive
2659 Synonym for @option{--format=v7}.
2660
2661 @opindex one-file-system, summary
2662 @item --one-file-system
2663 @itemx -l
2664 Used when creating an archive. Prevents @command{tar} from recursing into
2665 directories that are on different file systems from the current
2666 directory.
2667
2668 Earlier versions of @GNUTAR{} understood @option{-l} as a
2669 synonym for @option{--one-file-system}. Although such usage is still
2670 allowed in the present version, it is @emph{strongly discouraged}.
2671 The future versions of @GNUTAR{} will use @option{-l} as
2672 a synonym for @option{--check-links}.
2673
2674 @xref{Changes}, for more information.
2675
2676 @opindex overwrite, summary
2677 @item --overwrite
2678
2679 Overwrite existing files and directory metadata when extracting files
2680 from an archive. @xref{Overwrite Old Files}.
2681
2682 @opindex overwrite-dir, summary
2683 @item --overwrite-dir
2684
2685 Overwrite the metadata of existing directories when extracting files
2686 from an archive. @xref{Overwrite Old Files}.
2687
2688 @opindex owner, summary
2689 @item --owner=@var{user}
2690
2691 Specifies that @command{tar} should use @var{user} as the owner of members
2692 when creating archives, instead of the user associated with the source
2693 file. @var{user} is first decoded as a user symbolic name, but if
2694 this interpretation fails, it has to be a decimal numeric user ID.
2695 @FIXME-xref{}
2696
2697 There is no value indicating a missing number, and @samp{0} usually means
2698 @code{root}. Some people like to force @samp{0} as the value to offer in
2699 their distributions for the owner of files, because the @code{root} user is
2700 anonymous anyway, so that might as well be the owner of anonymous archives.
2701
2702 This option does not affect extraction from archives.
2703
2704 @opindex pax-option, summary
2705 @item --pax-option=@var{keyword-list}
2706 @FIXME{Such a detailed description does not belong there, move it elsewhere.}
2707 This option is meaningful only with @acronym{POSIX.1-2001} archives
2708 (@pxref{posix}). It modifies the way @command{tar} handles the
2709 extended header keywords. @var{Keyword-list} is a comma-separated
2710 list of keyword options, each keyword option taking one of
2711 the following forms:
2712
2713 @table @asis
2714 @item delete=@var{pattern}
2715 When used with one of archive-creation commands,
2716 this option instructs @command{tar} to omit from extended header records
2717 that it produces any keywords matching the string @var{pattern}.
2718
2719 When used in extract or list mode, this option instructs tar
2720 to ignore any keywords matching the given @var{pattern} in the extended
2721 header records. In both cases, matching is performed using the pattern
2722 matching notation described in @acronym{POSIX 1003.2}, 3.13
2723 (See @cite{glob(7)}). For example:
2724
2725 @smallexample
2726 --pax-option delete=security.*
2727 @end smallexample
2728
2729 would suppress security-related information.
2730
2731 @item exthdr.name=@var{string}
2732
2733 This keyword allows user control over the name that is written into the
2734 ustar header blocks for the extended headers. The name is obtained
2735 from @var{string} after substituting the following meta-characters:
2736
2737 @multitable @columnfractions .30 .70
2738 @headitem Meta-character @tab Replaced By
2739 @item %d @tab The directory name of the file, equivalent to the
2740 result of the @command{dirname} utility on the translated pathname.
2741 @item %f @tab The filename of the file, equivalent to the result
2742 of the @command{basename} utility on the translated pathname.
2743 @item %p @tab The process ID of the @command{tar} process.
2744 @item %% @tab A @samp{%} character.
2745 @end multitable
2746
2747 Any other @samp{%} characters in @var{string} produce undefined
2748 results.
2749
2750 If no option @samp{exthdr.name=string} is specified, @command{tar}
2751 will use the following default value:
2752
2753 @smallexample
2754 %d/PaxHeaders.%p/%f
2755 @end smallexample
2756
2757 @item globexthdr.name=@var{string}
2758 This keyword allows user control over the name that is written into
2759 the ustar header blocks for global extended header records. The name
2760 shall will be obtained from the contents of @var{string}, after the
2761 following character substitutions have been made:
2762
2763 @multitable @columnfractions .30 .70
2764 @headitem Meta-character @tab Replaced By
2765 @item %n @tab An integer that represents the
2766 sequence number of the global extended header record in the archive,
2767 starting at 1.
2768 @item %p @tab The process ID of the @command{tar} process.
2769 @item %% @tab A @samp{%} character.
2770 @end multitable
2771
2772 Any other @samp{%} characters in string produce undefined results.
2773
2774 If no option @samp{globexthdr.name=string} is specified, @command{tar}
2775 will use the following default value:
2776
2777 @smallexample
2778 $TMPDIR/GlobalHead.%p.%n
2779 @end smallexample
2780
2781 @noindent
2782 where @samp{$TMPDIR} represents the value of the @var{TMPDIR}
2783 environment variable. If @var{TMPDIR} is not set, @command{tar}
2784 uses @samp{/tmp}.
2785
2786 @item @var{keyword}=@var{value}
2787 When used with one of archive-creation commands, these keyword/value pairs
2788 will be included at the beginning of the archive in a global extended
2789 header record. When used with one of archive-reading commands,
2790 @command{tar} will behave as if it has encountered these keyword/value
2791 pairs at the beginning of the archive in a global extended header
2792 record.
2793
2794 @item @var{keyword}:=@var{value}
2795 When used with one of archive-creation commands, these keyword/value pairs
2796 will be included as records at the beginning of an extended header for
2797 each file. This is effectively equivalent to @var{keyword}=@var{value}
2798 form except that it creates no global extended header records.
2799
2800 When used with one of archive-reading commands, @command{tar} will
2801 behave as if these keyword/value pairs were included as records at the
2802 end of each extended header; thus, they will override any global or
2803 file-specific extended header record keywords of the same names.
2804 For example, in the command:
2805
2806 @smallexample
2807 tar --format=posix --create \
2808 --file archive --pax-option gname:=user .
2809 @end smallexample
2810
2811 the group name will be forced to a new value for all files
2812 stored in the archive.
2813 @end table
2814
2815 @opindex portability, summary
2816 @item --portability
2817 @itemx --old-archive
2818 Synonym for @option{--format=v7}.
2819
2820 @opindex posix, summary
2821 @item --posix
2822 Same as @option{--format=posix}.
2823
2824 @opindex preserve, summary
2825 @item --preserve
2826
2827 Synonymous with specifying both @option{--preserve-permissions} and
2828 @option{--same-order}. @xref{Setting Access Permissions}.
2829
2830 @opindex preserve-order, summary
2831 @item --preserve-order
2832
2833 (See @option{--same-order}; @pxref{Reading}.)
2834
2835 @opindex preserve-permissions, summary
2836 @opindex same-permissions, summary
2837 @item --preserve-permissions
2838 @itemx --same-permissions
2839 @itemx -p
2840
2841 When @command{tar} is extracting an archive, it normally subtracts the
2842 users' umask from the permissions specified in the archive and uses
2843 that number as the permissions to create the destination file.
2844 Specifying this option instructs @command{tar} that it should use the
2845 permissions directly from the archive. @xref{Setting Access Permissions}.
2846
2847 @opindex read-full-records, summary
2848 @item --read-full-records
2849 @itemx -B
2850
2851 Specifies that @command{tar} should reblock its input, for reading
2852 from pipes on systems with buggy implementations. @xref{Reading}.
2853
2854 @opindex record-size, summary
2855 @item --record-size=@var{size}
2856
2857 Instructs @command{tar} to use @var{size} bytes per record when accessing the
2858 archive. @xref{Blocking Factor}.
2859
2860 @opindex recursion, summary
2861 @item --recursion
2862
2863 With this option, @command{tar} recurses into directories.
2864 @xref{recurse}.
2865
2866 @opindex recursive-unlink, summary
2867 @item --recursive-unlink
2868
2869 Remove existing
2870 directory hierarchies before extracting directories of the same name
2871 from the archive. @xref{Recursive Unlink}.
2872
2873 @opindex remove-files, summary
2874 @item --remove-files
2875
2876 Directs @command{tar} to remove the source file from the file system after
2877 appending it to an archive. @xref{remove files}.
2878
2879 @opindex restrict, summary
2880 @item --restrict
2881
2882 Disable use of some potentially harmful @command{tar} options.
2883 Currently this option disables shell invocaton from multi-volume menu
2884 (@pxref{Using Multiple Tapes}).
2885
2886 @opindex rmt-command, summary
2887 @item --rmt-command=@var{cmd}
2888
2889 Notifies @command{tar} that it should use @var{cmd} instead of
2890 the default @file{/usr/libexec/rmt} (@pxref{Remote Tape Server}).
2891
2892 @opindex rsh-command, summary
2893 @item --rsh-command=@var{cmd}
2894
2895 Notifies @command{tar} that is should use @var{cmd} to communicate with remote
2896 devices. @xref{Device}.
2897
2898 @opindex same-order, summary
2899 @item --same-order
2900 @itemx --preserve-order
2901 @itemx -s
2902
2903 This option is an optimization for @command{tar} when running on machines with
2904 small amounts of memory. It informs @command{tar} that the list of file
2905 arguments has already been sorted to match the order of files in the
2906 archive. @xref{Reading}.
2907
2908 @opindex same-owner, summary
2909 @item --same-owner
2910
2911 When extracting an archive, @command{tar} will attempt to preserve the owner
2912 specified in the @command{tar} archive with this option present.
2913 This is the default behavior for the superuser; this option has an
2914 effect only for ordinary users. @xref{Attributes}.
2915
2916 @opindex same-permissions, summary
2917 @item --same-permissions
2918
2919 (See @option{--preserve-permissions}; @pxref{Setting Access Permissions}.)
2920
2921 @opindex show-defaults, summary
2922 @item --show-defaults
2923
2924 Displays the default options used by @command{tar} and exits
2925 successfully. This option is intended for use in shell scripts.
2926 Here is an example of what you can see using this option:
2927
2928 @smallexample
2929 $ tar --show-defaults
2930 --format=gnu -f- -b20
2931 @end smallexample
2932
2933 @opindex show-omitted-dirs, summary
2934 @item --show-omitted-dirs
2935
2936 Instructs @command{tar} to mention directories its skipping over when
2937 operating on a @command{tar} archive. @xref{show-omitted-dirs}.
2938
2939 @opindex show-stored-names, summary
2940 @item --show-stored-names
2941
2942 This option has effect only when used in conjunction with one of
2943 archive creation operations. It instructs tar to list the member names
2944 stored in the archive, as opposed to the actual file
2945 names. @xref{listing member and file names}.
2946
2947 @opindex sparse, summary
2948 @item --sparse
2949 @itemx -S
2950
2951 Invokes a @acronym{GNU} extension when adding files to an archive that handles
2952 sparse files efficiently. @xref{sparse}.
2953
2954 @opindex starting-file, summary
2955 @item --starting-file=@var{name}
2956 @itemx -K @var{name}
2957
2958 This option affects extraction only; @command{tar} will skip extracting
2959 files in the archive until it finds one that matches @var{name}.
2960 @xref{Scarce}.
2961
2962 @opindex strip-components, summary
2963 @item --strip-components=@var{number}
2964 Strip given @var{number} of leading components from file names before
2965 extraction.@footnote{This option was called @option{--strip-path} in
2966 version 1.14.} For example, if archive @file{archive.tar} contained
2967 @file{/some/file/name}, then running
2968
2969 @smallexample
2970 tar --extract --file archive.tar --strip-components=2
2971 @end smallexample
2972
2973 @noindent
2974 would extracted this file to file @file{name}.
2975
2976 @opindex suffix, summary
2977 @item --suffix=@var{suffix}
2978
2979 Alters the suffix @command{tar} uses when backing up files from the default
2980 @samp{~}. @xref{backup}.
2981
2982 @opindex tape-length, summary
2983 @item --tape-length=@var{num}
2984 @itemx -L @var{num}
2985
2986 Specifies the length of tapes that @command{tar} is writing as being
2987 @w{@var{num} x 1024} bytes long. @xref{Using Multiple Tapes}.
2988
2989 @opindex test-label, summary
2990 @item --test-label
2991
2992 Reads the volume label. If an argument is specified, test whether it
2993 matches the volume label. @xref{--test-label option}.
2994
2995 @opindex to-command, summary
2996 @item --to-command=@var{command}
2997
2998 During extraction @command{tar} will pipe extracted files to the
2999 standard input of @var{command}. @xref{Writing to an External Program}.
3000
3001 @opindex to-stdout, summary
3002 @item --to-stdout
3003 @itemx -O
3004
3005 During extraction, @command{tar} will extract files to stdout rather
3006 than to the file system. @xref{Writing to Standard Output}.
3007
3008 @opindex totals, summary
3009 @item --totals
3010
3011 Displays the total number of bytes written after creating an archive.
3012 @xref{verbose}.
3013
3014 @opindex touch, summary
3015 @item --touch
3016 @itemx -m
3017
3018 Sets the data modification time of extracted files to the extraction time,
3019 rather than the data modification time stored in the archive.
3020 @xref{Data Modification Times}.
3021
3022 @opindex uncompress, summary
3023 @item --uncompress
3024
3025 (See @option{--compress}. @pxref{gzip})
3026
3027 @opindex ungzip, summary
3028 @item --ungzip
3029
3030 (See @option{--gzip}. @pxref{gzip})
3031
3032 @opindex unlink-first, summary
3033 @item --unlink-first
3034 @itemx -U
3035
3036 Directs @command{tar} to remove the corresponding file from the file
3037 system before extracting it from the archive. @xref{Unlink First}.
3038
3039 @opindex use-compress-program, summary
3040 @item --use-compress-program=@var{prog}
3041
3042 Instructs @command{tar} to access the archive through @var{prog}, which is
3043 presumed to be a compression program of some sort. @xref{gzip}.
3044
3045 @opindex utc, summary
3046 @item --utc
3047
3048 Display file modification dates in @acronym{UTC}. This option implies
3049 @option{--verbose}.
3050
3051 @opindex verbose, summary
3052 @item --verbose
3053 @itemx -v
3054
3055 Specifies that @command{tar} should be more verbose about the operations its
3056 performing. This option can be specified multiple times for some
3057 operations to increase the amount of information displayed.
3058 @xref{verbose}.
3059
3060 @opindex verify, summary
3061 @item --verify
3062 @itemx -W
3063
3064 Verifies that the archive was correctly written when creating an
3065 archive. @xref{verify}.
3066
3067 @opindex version, summary
3068 @item --version
3069
3070 @command{tar} will print an informational message about what version
3071 it is and a copyright message, some credits, and then exit.
3072 @xref{help}.
3073
3074 @opindex volno-file, summary
3075 @item --volno-file=@var{file}
3076
3077 Used in conjunction with @option{--multi-volume}. @command{tar} will keep track
3078 of which volume of a multi-volume archive its working in @var{file}.
3079 @xref{volno-file}.
3080
3081 @opindex wildcards, summary
3082 @item --wildcards
3083 Use wildcards when excluding files.
3084 @xref{controlling pattern-matching with exclude}.
3085
3086 @opindex wildcards-match-slash, summary
3087 @item --wildcards-match-slash
3088 Wildcards match @samp{/} when excluding files.
3089 @xref{controlling pattern-matching with exclude}.
3090 @end table
3091
3092 @node Short Option Summary
3093 @subsection Short Options Cross Reference
3094
3095 Here is an alphabetized list of all of the short option forms, matching
3096 them with the equivalent long option.
3097
3098 @table @option
3099
3100 @item -A
3101
3102 @option{--concatenate}
3103
3104 @item -B
3105
3106 @option{--read-full-records}
3107
3108 @item -C
3109
3110 @option{--directory}
3111
3112 @item -F
3113
3114 @option{--info-script}
3115
3116 @item -G
3117
3118 @option{--incremental}
3119
3120 @item -K
3121
3122 @option{--starting-file}
3123
3124 @item -L
3125
3126 @option{--tape-length}
3127
3128 @item -M
3129
3130 @option{--multi-volume}
3131
3132 @item -N
3133
3134 @option{--newer}
3135
3136 @item -O
3137
3138 @option{--to-stdout}
3139
3140 @item -P
3141
3142 @option{--absolute-names}
3143
3144 @item -R
3145
3146 @option{--block-number}
3147
3148 @item -S
3149
3150 @option{--sparse}
3151
3152 @item -T
3153
3154 @option{--files-from}
3155
3156 @item -U
3157
3158 @option{--unlink-first}
3159
3160 @item -V
3161
3162 @option{--label}
3163
3164 @item -W
3165
3166 @option{--verify}
3167
3168 @item -X
3169
3170 @option{--exclude-from}
3171
3172 @item -Z
3173
3174 @option{--compress}
3175
3176 @item -b
3177
3178 @option{--blocking-factor}
3179
3180 @item -c
3181
3182 @option{--create}
3183
3184 @item -d
3185
3186 @option{--compare}
3187
3188 @item -f
3189
3190 @option{--file}
3191
3192 @item -g
3193
3194 @option{--listed-incremental}
3195
3196 @item -h
3197
3198 @option{--dereference}
3199
3200 @item -i
3201
3202 @option{--ignore-zeros}
3203
3204 @item -j
3205
3206 @option{--bzip2}
3207
3208 @item -k
3209
3210 @option{--keep-old-files}
3211
3212 @item -l
3213
3214 @option{--one-file-system}. Use of this short option is deprecated. It
3215 is retained for compatibility with the earlier versions of GNU
3216 @command{tar}, and will be changed in future releases.
3217
3218 @xref{Changes}, for more information.
3219
3220 @item -m
3221
3222 @option{--touch}
3223
3224 @item -o
3225
3226 When creating --- @option{--no-same-owner}, when extracting ---
3227 @option{--portability}.
3228
3229 The later usage is deprecated. It is retained for compatibility with
3230 the earlier versions of @GNUTAR{}. In the future releases
3231 @option{-o} will be equivalent to @option{--no-same-owner} only.
3232
3233 @item -p
3234
3235 @option{--preserve-permissions}
3236
3237 @item -r
3238
3239 @option{--append}
3240
3241 @item -s
3242
3243 @option{--same-order}
3244
3245 @item -t
3246
3247 @option{--list}
3248
3249 @item -u
3250
3251 @option{--update}
3252
3253 @item -v
3254
3255 @option{--verbose}
3256
3257 @item -w
3258
3259 @option{--interactive}
3260
3261 @item -x
3262
3263 @option{--extract}
3264
3265 @item -z
3266
3267 @option{--gzip}
3268
3269 @end table
3270
3271 @node help
3272 @section @GNUTAR{} documentation
3273
3274 @cindex Getting program version number
3275 @opindex version
3276 @cindex Version of the @command{tar} program
3277 Being careful, the first thing is really checking that you are using
3278 @GNUTAR{}, indeed. The @option{--version} option
3279 will generate a message giving confirmation that you are using
3280 @GNUTAR{}, with the precise version of @GNUTAR{}
3281 you are using. @command{tar} identifies itself and
3282 prints the version number to the standard output, then immediately
3283 exits successfully, without doing anything else, ignoring all other
3284 options. For example, @w{@samp{tar --version}} might return:
3285
3286 @smallexample
3287 tar (@acronym{GNU} tar) @value{VERSION}
3288 @end smallexample
3289
3290 @noindent
3291 The first occurrence of @samp{tar} in the result above is the program
3292 name in the package (for example, @command{rmt} is another program),
3293 while the second occurrence of @samp{tar} is the name of the package
3294 itself, containing possibly many programs. The package is currently
3295 named @samp{tar}, after the name of the main program it
3296 contains@footnote{There are plans to merge the @command{cpio} and
3297 @command{tar} packages into a single one which would be called
3298 @code{paxutils}. So, who knows if, one of this days, the
3299 @option{--version} would not yield @w{@samp{tar (@acronym{GNU}
3300 paxutils) 3.2}}}.
3301
3302 @cindex Obtaining help
3303 @cindex Listing all @command{tar} options
3304 @opindex help, introduction
3305 Another thing you might want to do is checking the spelling or meaning
3306 of some particular @command{tar} option, without resorting to this
3307 manual, for once you have carefully read it. @GNUTAR{}
3308 has a short help feature, triggerable through the
3309 @option{--help} option. By using this option, @command{tar} will
3310 print a usage message listing all available options on standard
3311 output, then exit successfully, without doing anything else and
3312 ignoring all other options. Even if this is only a brief summary, it
3313 may be several screens long. So, if you are not using some kind of
3314 scrollable window, you might prefer to use something like:
3315
3316 @smallexample
3317 $ @kbd{tar --help | less}
3318 @end smallexample
3319
3320 @noindent
3321 presuming, here, that you like using @command{less} for a pager. Other
3322 popular pagers are @command{more} and @command{pg}. If you know about some
3323 @var{keyword} which interests you and do not want to read all the
3324 @option{--help} output, another common idiom is doing:
3325
3326 @smallexample
3327 tar --help | grep @var{keyword}
3328 @end smallexample
3329
3330 @noindent
3331 for getting only the pertinent lines. Notice, however, that some
3332 @command{tar} options have long description lines and the above
3333 command will list only the first of them.
3334
3335 @opindex usage
3336 If you only wish to check the spelling of an option, running @kbd{tar
3337 --usage} may be a better choice. This will display a terse list of
3338 @command{tar} option without accompanying explanations.
3339
3340 The short help output is quite succinct, and you might have to get
3341 back to the full documentation for precise points. If you are reading
3342 this paragraph, you already have the @command{tar} manual in some
3343 form. This manual is available in a variety of forms from
3344 @url{http://www.gnu.org/software/tar/manual}. It may printed out of the @GNUTAR{}
3345 distribution, provided you have @TeX{} already installed somewhere,
3346 and a laser printer around. Just configure the distribution, execute
3347 the command @w{@samp{make dvi}}, then print @file{doc/tar.dvi} the
3348 usual way (contact your local guru to know how). If @GNUTAR{}
3349 has been conveniently installed at your place, this
3350 manual is also available in interactive, hypertextual form as an Info
3351 file. Just call @w{@samp{info tar}} or, if you do not have the
3352 @command{info} program handy, use the Info reader provided within
3353 @acronym{GNU} Emacs, calling @samp{tar} from the main Info menu.
3354
3355 There is currently no @code{man} page for @GNUTAR{}.
3356 If you observe such a @code{man} page on the system you are running,
3357 either it does not belong to @GNUTAR{}, or it has not
3358 been produced by @acronym{GNU}. Some package maintainers convert
3359 @kbd{tar --help} output to a man page, using @command{help2man}. In
3360 any case, please bear in mind that the authoritative source of
3361 information about @GNUTAR{} is this Texinfo documentation.
3362
3363 @node defaults
3364 @section Obtaining @GNUTAR{} default values
3365
3366 @opindex show-defaults
3367 @GNUTAR{} has some predefined defaults that are used when you do not
3368 explicitely specify another values. To obtain a list of such
3369 defaults, use @option{--show-defaults} option. This will output the
3370 values in the form of @command{tar} command line options:
3371
3372 @smallexample
3373 @group
3374 @kbd{tar --show-defaults}
3375 --format=gnu -f- -b20 --rmt-command=/etc/rmt --rsh-command=/usr/bin/rsh
3376 @end group
3377 @end smallexample
3378
3379 @noindent
3380 The above output shows that this version of @GNUTAR{} defaults to
3381 using @samp{gnu} archive format (@pxref{Formats}), it uses standard
3382 output as the archive, if no @option{--file} option has been given
3383 (@pxref{file tutorial}), the default blocking factor is 20
3384 (@pxref{Blocking Factor}). It also shows the default locations where
3385 @command{tar} will look for @command{rmt} and @command{rsh} binaries.
3386
3387 @node verbose
3388 @section Checking @command{tar} progress
3389
3390 Typically, @command{tar} performs most operations without reporting any
3391 information to the user except error messages. When using @command{tar}
3392 with many options, particularly ones with complicated or
3393 difficult-to-predict behavior, it is possible to make serious mistakes.
3394 @command{tar} provides several options that make observing @command{tar}
3395 easier. These options cause @command{tar} to print information as it
3396 progresses in its job, and you might want to use them just for being
3397 more careful about what is going on, or merely for entertaining
3398 yourself. If you have encountered a problem when operating on an
3399 archive, however, you may need more information than just an error
3400 message in order to solve the problem. The following options can be
3401 helpful diagnostic tools.
3402
3403 @cindex Verbose operation
3404 @opindex verbose
3405 Normally, the @option{--list} (@option{-t}) command to list an archive
3406 prints just the file names (one per line) and the other commands are
3407 silent. When used with most operations, the @option{--verbose}
3408 (@option{-v}) option causes @command{tar} to print the name of each
3409 file or archive member as it is processed. This and the other options
3410 which make @command{tar} print status information can be useful in
3411 monitoring @command{tar}.
3412
3413 With @option{--create} or @option{--extract}, @option{--verbose} used
3414 once just prints the names of the files or members as they are processed.
3415 Using it twice causes @command{tar} to print a longer listing
3416 (reminiscent of @samp{ls -l}) for each member. Since @option{--list}
3417 already prints the names of the members, @option{--verbose} used once
3418 with @option{--list} causes @command{tar} to print an @samp{ls -l}
3419 type listing of the files in the archive. The following examples both
3420 extract members with long list output:
3421
3422 @smallexample
3423 $ @kbd{tar --extract --file=archive.tar --verbose --verbose}
3424 $ @kbd{tar xvvf archive.tar}
3425 @end smallexample
3426
3427 Verbose output appears on the standard output except when an archive is
3428 being written to the standard output, as with @samp{tar --create
3429 --file=- --verbose} (@samp{tar cfv -}, or even @samp{tar cv}---if the
3430 installer let standard output be the default archive). In that case
3431 @command{tar} writes verbose output to the standard error stream.
3432
3433 If @option{--index-file=@var{file}} is specified, @command{tar} sends
3434 verbose output to @var{file} rather than to standard output or standard
3435 error.
3436
3437 @cindex Obtaining total status information
3438 @opindex totals
3439 The @option{--totals} option---which is only meaningful when used with
3440 @option{--create} (@option{-c})---causes @command{tar} to print the total
3441 amount written to the archive, after it has been fully created.
3442
3443 @cindex Progress information
3444 @opindex checkpoint
3445 The @option{--checkpoint} option prints an occasional message
3446 as @command{tar} reads or writes the archive. In fact, it prints
3447 a message each 10 records read or written. It is designed for
3448 those who don't need the more detailed (and voluminous) output of
3449 @option{--block-number} (@option{-R}), but do want visual confirmation that @command{tar}
3450 is actually making forward progress.
3451
3452 @FIXME{There is some confusion here. It seems that -R once wrote a
3453 message at @samp{every} record read or written.}
3454
3455 @opindex show-omitted-dirs
3456 @anchor{show-omitted-dirs}
3457 The @option{--show-omitted-dirs} option, when reading an archive---with
3458 @option{--list} or @option{--extract}, for example---causes a message
3459 to be printed for each directory in the archive which is skipped.
3460 This happens regardless of the reason for skipping: the directory might
3461 not have been named on the command line (implicitly or explicitly),
3462 it might be excluded by the use of the @option{--exclude=@var{pattern}} option, or
3463 some other reason.
3464
3465 @opindex block-number
3466 @cindex Block number where error occurred
3467 @anchor{block-number}
3468 If @option{--block-number} (@option{-R}) is used, @command{tar} prints, along with
3469 every message it would normally produce, the block number within the
3470 archive where the message was triggered. Also, supplementary messages
3471 are triggered when reading blocks full of NULs, or when hitting end of
3472 file on the archive. As of now, if the archive if properly terminated
3473 with a NUL block, the reading of the file may stop before end of file
3474 is met, so the position of end of file will not usually show when
3475 @option{--block-number} (@option{-R}) is used. Note that @GNUTAR{}
3476 drains the archive before exiting when reading the
3477 archive from a pipe.
3478
3479 @cindex Error message, block number of
3480 This option is especially useful when reading damaged archives, since
3481 it helps pinpoint the damaged sections. It can also be used with
3482 @option{--list} (@option{-t}) when listing a file-system backup tape, allowing you to
3483 choose among several backup tapes when retrieving a file later, in
3484 favor of the tape where the file appears earliest (closest to the
3485 front of the tape). @xref{backup}.
3486
3487 @node interactive
3488 @section Asking for Confirmation During Operations
3489 @cindex Interactive operation
3490
3491 Typically, @command{tar} carries out a command without stopping for
3492 further instructions. In some situations however, you may want to
3493 exclude some files and archive members from the operation (for instance
3494 if disk or storage space is tight). You can do this by excluding
3495 certain files automatically (@pxref{Choosing}), or by performing
3496 an operation interactively, using the @option{--interactive} (@option{-w}) option.
3497 @command{tar} also accepts @option{--confirmation} for this option.
3498
3499 @opindex interactive
3500 When the @option{--interactive} (@option{-w}) option is specified, before
3501 reading, writing, or deleting files, @command{tar} first prints a message
3502 for each such file, telling what operation it intends to take, then asks
3503 for confirmation on the terminal. The actions which require
3504 confirmation include adding a file to the archive, extracting a file
3505 from the archive, deleting a file from the archive, and deleting a file
3506 from disk. To confirm the action, you must type a line of input
3507 beginning with @samp{y}. If your input line begins with anything other
3508 than @samp{y}, @command{tar} skips that file.
3509
3510 If @command{tar} is reading the archive from the standard input,
3511 @command{tar} opens the file @file{/dev/tty} to support the interactive
3512 communications.
3513
3514 Verbose output is normally sent to standard output, separate from
3515 other error messages. However, if the archive is produced directly
3516 on standard output, then verbose output is mixed with errors on
3517 @code{stderr}. Producing the archive on standard output may be used
3518 as a way to avoid using disk space, when the archive is soon to be
3519 consumed by another process reading it, say. Some people felt the need
3520 of producing an archive on stdout, still willing to segregate between
3521 verbose output and error output. A possible approach would be using a
3522 named pipe to receive the archive, and having the consumer process to
3523 read from that named pipe. This has the advantage of letting standard
3524 output free to receive verbose output, all separate from errors.
3525
3526 @node operations
3527 @chapter @GNUTAR{} Operations
3528
3529 @menu
3530 * Basic tar::
3531 * Advanced tar::
3532 * create options::
3533 * extract options::
3534 * backup::
3535 * Applications::
3536 * looking ahead::
3537 @end menu
3538
3539 @node Basic tar
3540 @section Basic @GNUTAR{} Operations
3541
3542 The basic @command{tar} operations, @option{--create} (@option{-c}),
3543 @option{--list} (@option{-t}) and @option{--extract} (@option{--get},
3544 @option{-x}), are currently presented and described in the tutorial
3545 chapter of this manual. This section provides some complementary notes
3546 for these operations.
3547
3548 @table @option
3549 @opindex create, complementary notes
3550 @item --create
3551 @itemx -c
3552
3553 Creating an empty archive would have some kind of elegance. One can
3554 initialize an empty archive and later use @option{--append}
3555 (@option{-r}) for adding all members. Some applications would not
3556 welcome making an exception in the way of adding the first archive
3557 member. On the other hand, many people reported that it is
3558 dangerously too easy for @command{tar} to destroy a magnetic tape with
3559 an empty archive@footnote{This is well described in @cite{Unix-haters
3560 Handbook}, by Simson Garfinkel, Daniel Weise & Steven Strassmann, IDG
3561 Books, ISBN 1-56884-203-1.}. The two most common errors are:
3562
3563 @enumerate
3564 @item
3565 Mistakingly using @code{create} instead of @code{extract}, when the
3566 intent was to extract the full contents of an archive. This error
3567 is likely: keys @kbd{c} and @kbd{x} are right next to each other on
3568 the QWERTY keyboard. Instead of being unpacked, the archive then
3569 gets wholly destroyed. When users speak about @dfn{exploding} an
3570 archive, they usually mean something else :-).
3571
3572 @item
3573 Forgetting the argument to @code{file}, when the intent was to create
3574 an archive with a single file in it. This error is likely because a
3575 tired user can easily add the @kbd{f} key to the cluster of option
3576 letters, by the mere force of habit, without realizing the full
3577 consequence of doing so. The usual consequence is that the single
3578 file, which was meant to be saved, is rather destroyed.
3579 @end enumerate
3580
3581 So, recognizing the likelihood and the catastrophical nature of these
3582 errors, @GNUTAR{} now takes some distance from elegance, and
3583 cowardly refuses to create an archive when @option{--create} option is
3584 given, there are no arguments besides options, and
3585 @option{--files-from} (@option{-T}) option is @emph{not} used. To get
3586 around the cautiousness of @GNUTAR{} and nevertheless create an
3587 archive with nothing in it, one may still use, as the value for the
3588 @option{--files-from} option, a file with no names in it, as shown in
3589 the following commands:
3590
3591 @smallexample
3592 @kbd{tar --create --file=empty-archive.tar --files-from=/dev/null}
3593 @kbd{tar cfT empty-archive.tar /dev/null}
3594 @end smallexample
3595
3596 @opindex extract, complementary notes
3597 @item --extract
3598 @itemx --get
3599 @itemx -x
3600
3601 A socket is stored, within a @GNUTAR{} archive, as a pipe.
3602
3603 @item @option{--list} (@option{-t})
3604
3605 @GNUTAR{} now shows dates as @samp{1996-08-30},
3606 while it used to show them as @samp{Aug 30 1996}. Preferably,
3607 people should get used to ISO 8601 dates. Local American dates should
3608 be made available again with full date localization support, once
3609 ready. In the meantime, programs not being localizable for dates
3610 should prefer international dates, that's really the way to go.
3611
3612 Look up @url{http://www.ft.uni-erlangen.de/~mskuhn/iso-time.html} if you
3613 are curious, it contains a detailed explanation of the ISO 8601 standard.
3614
3615 @end table
3616
3617 @node Advanced tar
3618 @section Advanced @GNUTAR{} Operations
3619
3620 Now that you have learned the basics of using @GNUTAR{}, you may want
3621 to learn about further ways in which @command{tar} can help you.
3622
3623 This chapter presents five, more advanced operations which you probably
3624 won't use on a daily basis, but which serve more specialized functions.
3625 We also explain the different styles of options and why you might want
3626 to use one or another, or a combination of them in your @command{tar}
3627 commands. Additionally, this chapter includes options which allow you to
3628 define the output from @command{tar} more carefully, and provide help and
3629 error correction in special circumstances.
3630
3631 @FIXME{check this after the chapter is actually revised to make sure
3632 it still introduces the info in the chapter correctly : ).}
3633
3634 @menu
3635 * Operations::
3636 * append::
3637 * update::
3638 * concatenate::
3639 * delete::
3640 * compare::
3641 @end menu
3642
3643 @node Operations
3644 @subsection The Five Advanced @command{tar} Operations
3645 @UNREVISED
3646
3647 In the last chapter, you learned about the first three operations to
3648 @command{tar}. This chapter presents the remaining five operations to
3649 @command{tar}: @option{--append}, @option{--update}, @option{--concatenate},
3650 @option{--delete}, and @option{--compare}.
3651
3652 You are not likely to use these operations as frequently as those
3653 covered in the last chapter; however, since they perform specialized
3654 functions, they are quite useful when you do need to use them. We
3655 will give examples using the same directory and files that you created
3656 in the last chapter. As you may recall, the directory is called
3657 @file{practice}, the files are @samp{jazz}, @samp{blues}, @samp{folk},
3658 @samp{rock}, and the two archive files you created are
3659 @samp{collection.tar} and @samp{music.tar}.
3660
3661 We will also use the archive files @samp{afiles.tar} and
3662 @samp{bfiles.tar}. @samp{afiles.tar} contains the members @samp{apple},
3663 @samp{angst}, and @samp{aspic}. @samp{bfiles.tar} contains the members
3664 @samp{./birds}, @samp{baboon}, and @samp{./box}.
3665
3666 Unless we state otherwise, all practicing you do and examples you follow
3667 in this chapter will take place in the @file{practice} directory that
3668 you created in the previous chapter; see @ref{prepare for examples}.
3669 (Below in this section, we will remind you of the state of the examples
3670 where the last chapter left them.)
3671
3672 The five operations that we will cover in this chapter are:
3673
3674 @table @option
3675 @item --append
3676 @itemx -r
3677 Add new entries to an archive that already exists.
3678 @item --update
3679 @itemx -r
3680 Add more recent copies of archive members to the end of an archive, if
3681 they exist.
3682 @item --concatenate
3683 @itemx --catenate
3684 @itemx -A
3685 Add one or more pre-existing archives to the end of another archive.
3686 @item --delete
3687 Delete items from an archive (does not work on tapes).
3688 @item --compare
3689 @itemx --diff
3690 @itemx -d
3691 Compare archive members to their counterparts in the file system.
3692 @end table
3693
3694 @node append
3695 @subsection How to Add Files to Existing Archives: @option{--append}
3696 @UNREVISED
3697
3698 @opindex append
3699 If you want to add files to an existing archive, you don't need to
3700 create a new archive; you can use @option{--append} (@option{-r}).
3701 The archive must already exist in order to use @option{--append}. (A
3702 related operation is the @option{--update} operation; you can use this
3703 to add newer versions of archive members to an existing archive. To learn how to
3704 do this with @option{--update}, @pxref{update}.)
3705
3706 If you use @option{--append} to add a file that has the same name as an
3707 archive member to an archive containing that archive member, then the
3708 old member is not deleted. What does happen, however, is somewhat
3709 complex. @command{tar} @emph{allows} you to have infinite number of files
3710 with the same name. Some operations treat these same-named members no
3711 differently than any other set of archive members: for example, if you
3712 view an archive with @option{--list} (@option{-t}), you will see all
3713 of those members listed, with their data modification times, owners, etc.
3714
3715 Other operations don't deal with these members as perfectly as you might
3716 prefer; if you were to use @option{--extract} to extract the archive,
3717 only the most recently added copy of a member with the same name as four
3718 other members would end up in the working directory. This is because
3719 @option{--extract} extracts an archive in the order the members appeared
3720 in the archive; the most recently archived members will be extracted
3721 last. Additionally, an extracted member will @emph{replace} a file of
3722 the same name which existed in the directory already, and @command{tar}
3723 will not prompt you about this@footnote{Unless you give it
3724 @option{--keep-old-files} option, or the disk copy is newer than the
3725 the one in the archive and you invoke @command{tar} with
3726 @option{--keep-newer-files} option}. Thus, only the most recently archived
3727 member will end up being extracted, as it will replace the one
3728 extracted before it, and so on.
3729
3730 There exists a special option that allows you to get around this
3731 behavior and extract (or list) only a particular copy of the file.
3732 This is @option{--occurrence} option. If you run @command{tar} with
3733 this option, it will extract only the first copy of the file. You
3734 may also give this option an argument specifying the number of
3735 copy to be extracted. Thus, for example if the archive
3736 @file{archive.tar} contained three copies of file @file{myfile}, then
3737 the command
3738
3739 @smallexample
3740 tar --extract --file archive.tar --occurrence=2 myfile
3741 @end smallexample
3742
3743 @noindent
3744 would extract only the second copy. @xref{Option
3745 Summary,---occurrence}, for the description of @option{--occurrence}
3746 option.
3747
3748 @FIXME{ hag -- you might want to incorporate some of the above into the
3749 MMwtSN node; not sure. i didn't know how to make it simpler...
3750
3751 There are a few ways to get around this. (maybe xref Multiple Members
3752 with the Same Name.}
3753
3754 @cindex Members, replacing with other members
3755 @cindex Replacing members with other members
3756 If you want to replace an archive member, use @option{--delete} to
3757 delete the member you want to remove from the archive, , and then use
3758 @option{--append} to add the member you want to be in the archive. Note
3759 that you can not change the order of the archive; the most recently
3760 added member will still appear last. In this sense, you cannot truly
3761 ``replace'' one member with another. (Replacing one member with another
3762 will not work on certain types of media, such as tapes; see @ref{delete}
3763 and @ref{Media}, for more information.)
3764
3765 @menu
3766 * appending files:: Appending Files to an Archive
3767 * multiple::
3768 @end menu
3769
3770 @node appending files
3771 @subsubsection Appending Files to an Archive
3772 @UNREVISED
3773 @cindex Adding files to an Archive
3774 @cindex Appending files to an Archive
3775 @cindex Archives, Appending files to
3776
3777 The simplest way to add a file to an already existing archive is the
3778 @option{--append} (@option{-r}) operation, which writes specified files into the
3779 archive whether or not they are already among the archived files.
3780 When you use @option{--append}, you @emph{must} specify file name
3781 arguments, as there is no default. If you specify a file that already
3782 exists in the archive, another copy of the file will be added to the
3783 end of the archive. As with other operations, the member names of the
3784 newly added files will be exactly the same as their names given on the
3785 command line. The @option{--verbose} (@option{-v}) option will print
3786 out the names of the files as they are written into the archive.
3787
3788 @option{--append} cannot be performed on some tape drives, unfortunately,
3789 due to deficiencies in the formats those tape drives use. The archive
3790 must be a valid @command{tar} archive, or else the results of using this
3791 operation will be unpredictable. @xref{Media}.
3792
3793 To demonstrate using @option{--append} to add a file to an archive,
3794 create a file called @file{rock} in the @file{practice} directory.
3795 Make sure you are in the @file{practice} directory. Then, run the
3796 following @command{tar} command to add @file{rock} to
3797 @file{collection.tar}:
3798
3799 @smallexample
3800 $ @kbd{tar --append --file=collection.tar rock}
3801 @end smallexample
3802
3803 @noindent
3804 If you now use the @option{--list} (@option{-t}) operation, you will see that
3805 @file{rock} has been added to the archive:
3806
3807 @smallexample
3808 $ @kbd{tar --list --file=collection.tar}
3809 -rw-r--r-- me user 28 1996-10-18 16:31 jazz
3810 -rw-r--r-- me user 21 1996-09-23 16:44 blues
3811 -rw-r--r-- me user 20 1996-09-23 16:44 folk
3812 -rw-r--r-- me user 20 1996-09-23 16:44 rock
3813 @end smallexample
3814
3815 @FIXME{in theory, dan will (soon) try to turn this node into what it's
3816 title claims it will become...}
3817
3818 @node multiple
3819 @subsubsection Multiple Files with the Same Name
3820
3821 You can use @option{--append} (@option{-r}) to add copies of files which have been
3822 updated since the archive was created. (However, we do not recommend
3823 doing this since there is another @command{tar} option called
3824 @option{--update}; @pxref{update} for more information. We describe this
3825 use of @option{--append} here for the sake of completeness.) @FIXME{is
3826 this really a good idea, to give this whole description for something
3827 which i believe is basically a Stupid way of doing something? certain
3828 aspects of it show ways in which tar is more broken than i'd personally
3829 like to admit to, specifically the last sentence. On the other hand, i
3830 don't think it's a good idea to be saying that we explicitly don't
3831 recommend using something, but i can't see any better way to deal with
3832 the situation.}When you extract the archive, the older version will be
3833 effectively lost. This works because files are extracted from an
3834 archive in the order in which they were archived. Thus, when the
3835 archive is extracted, a file archived later in time will replace a
3836 file of the same name which was archived earlier, even though the older
3837 version of the file will remain in the archive unless you delete all
3838 versions of the file.
3839
3840 Supposing you change the file @file{blues} and then append the changed
3841 version to @file{collection.tar}. As you saw above, the original
3842 @file{blues} is in the archive @file{collection.tar}. If you change the
3843 file and append the new version of the file to the archive, there will
3844 be two copies in the archive. When you extract the archive, the older
3845 version of the file will be extracted first, and then replaced by the
3846 newer version when it is extracted.
3847
3848 You can append the new, changed copy of the file @file{blues} to the
3849 archive in this way:
3850
3851 @smallexample
3852 $ @kbd{tar --append --verbose --file=collection.tar blues}
3853 blues
3854 @end smallexample
3855
3856 @noindent
3857 Because you specified the @option{--verbose} option, @command{tar} has
3858 printed the name of the file being appended as it was acted on. Now
3859 list the contents of the archive:
3860
3861 @smallexample
3862 $ @kbd{tar --list --verbose --file=collection.tar}
3863 -rw-r--r-- me user 28 1996-10-18 16:31 jazz
3864 -rw-r--r-- me user 21 1996-09-23 16:44 blues
3865 -rw-r--r-- me user 20 1996-09-23 16:44 folk
3866 -rw-r--r-- me user 20 1996-09-23 16:44 rock
3867 -rw-r--r-- me user 58 1996-10-24 18:30 blues
3868 @end smallexample
3869
3870 @noindent
3871 The newest version of @file{blues} is now at the end of the archive
3872 (note the different creation dates and file sizes). If you extract
3873 the archive, the older version of the file @file{blues} will be
3874 replaced by the newer version. You can confirm this by extracting
3875 the archive and running @samp{ls} on the directory.
3876
3877 If you wish to extract the first occurrence of the file @file{blues}
3878 from the archive, use @option{--occurrence} option, as shown in
3879 the following example:
3880
3881 @smallexample
3882 $ @kbd{tar --extract -vv --occurrence --file=collection.tar blues}
3883 -rw-r--r-- me user 21 1996-09-23 16:44 blues
3884 @end smallexample
3885
3886 @xref{Writing}, for more information on @option{--extract} and
3887 @xref{Option Summary, --occurrence}, for the description of
3888 @option{--occurrence} option.
3889
3890 @node update
3891 @subsection Updating an Archive
3892 @UNREVISED
3893 @cindex Updating an archive
3894
3895 @opindex update
3896 In the previous section, you learned how to use @option{--append} to
3897 add a file to an existing archive. A related operation is
3898 @option{--update} (@option{-u}). The @option{--update} operation
3899 updates a @command{tar} archive by comparing the date of the specified
3900 archive members against the date of the file with the same name. If
3901 the file has been modified more recently than the archive member, then
3902 the newer version of the file is added to the archive (as with
3903 @option{--append}).
3904
3905 Unfortunately, you cannot use @option{--update} with magnetic tape drives.
3906 The operation will fail.
3907
3908 @FIXME{other examples of media on which --update will fail? need to ask
3909 charles and/or mib/thomas/dave shevett..}
3910
3911 Both @option{--update} and @option{--append} work by adding to the end
3912 of the archive. When you extract a file from the archive, only the
3913 version stored last will wind up in the file system, unless you use
3914 the @option{--backup} option. @FIXME-ref{Multiple Members with the
3915 Same Name}
3916
3917 @menu
3918 * how to update::
3919 @end menu
3920
3921 @node how to update
3922 @subsubsection How to Update an Archive Using @option{--update}
3923
3924 You must use file name arguments with the @option{--update} (@option{-u}) operation.
3925 If you don't specify any files, @command{tar} won't act on any files and
3926 won't tell you that it didn't do anything (which may end up confusing
3927 you).
3928
3929 @FIXME{note: the above parenthetical added because in fact, this
3930 behavior just confused the author. :-) }
3931
3932 To see the @option{--update} option at work, create a new file,
3933 @file{classical}, in your practice directory, and some extra text to the
3934 file @file{blues}, using any text editor. Then invoke @command{tar} with
3935 the @samp{update} operation and the @option{--verbose} (@option{-v}) option specified,
3936 using the names of all the files in the practice directory as file name
3937 arguments:
3938
3939 @smallexample
3940 $ @kbd{tar --update -v -f collection.tar blues folk rock classical}
3941 blues
3942 classical
3943 $
3944 @end smallexample
3945
3946 @noindent
3947 Because we have specified verbose mode, @command{tar} prints out the names
3948 of the files it is working on, which in this case are the names of the
3949 files that needed to be updated. If you run @samp{tar --list} and look
3950 at the archive, you will see @file{blues} and @file{classical} at its
3951 end. There will be a total of two versions of the member @samp{blues};
3952 the one at the end will be newer and larger, since you added text before
3953 updating it.
3954
3955 (The reason @command{tar} does not overwrite the older file when updating
3956 it is because writing to the middle of a section of tape is a difficult
3957 process. Tapes are not designed to go backward. @xref{Media}, for more
3958 information about tapes.
3959
3960 @option{--update} (@option{-u}) is not suitable for performing backups for two
3961 reasons: it does not change directory content entries, and it
3962 lengthens the archive every time it is used. The @GNUTAR{}
3963 options intended specifically for backups are more
3964 efficient. If you need to run backups, please consult @ref{Backups}.
3965
3966 @node concatenate
3967 @subsection Combining Archives with @option{--concatenate}
3968
3969 @cindex Adding archives to an archive
3970 @cindex Concatenating Archives
3971 @opindex concatenate
3972 @opindex catenate
3973 @c @cindex @option{-A} described
3974 Sometimes it may be convenient to add a second archive onto the end of
3975 an archive rather than adding individual files to the archive. To add
3976 one or more archives to the end of another archive, you should use the
3977 @option{--concatenate} (@option{--catenate}, @option{-A}) operation.
3978
3979 To use @option{--concatenate}, give the first archive with
3980 @option{--file} option and name the rest of archives to be
3981 concatenated on the command line. The members, and their member
3982 names, will be copied verbatim from those archives to the first one.
3983 @FIXME-ref{This can cause multiple members to have the same name, for
3984 information on how this affects reading the archive, Multiple
3985 Members with the Same Name.}
3986 The new, concatenated archive will be called by the same name as the
3987 one given with the @option{--file} option. As usual, if you omit
3988 @option{--file}, @command{tar} will use the value of the environment
3989 variable @env{TAPE}, or, if this has not been set, the default archive name.
3990
3991 @FIXME{There is no way to specify a new name...}
3992
3993 To demonstrate how @option{--concatenate} works, create two small archives
3994 called @file{bluesrock.tar} and @file{folkjazz.tar}, using the relevant
3995 files from @file{practice}:
3996
3997 @smallexample
3998 $ @kbd{tar -cvf bluesrock.tar blues rock}
3999 blues
4000 classical
4001 $ @kbd{tar -cvf folkjazz.tar folk jazz}
4002 folk
4003 jazz
4004 @end smallexample
4005
4006 @noindent
4007 If you like, You can run @samp{tar --list} to make sure the archives
4008 contain what they are supposed to:
4009
4010 @smallexample
4011 $ @kbd{tar -tvf bluesrock.tar}
4012 -rw-r--r-- melissa user 105 1997-01-21 19:42 blues
4013 -rw-r--r-- melissa user 33 1997-01-20 15:34 rock
4014 $ @kbd{tar -tvf folkjazz.tar}
4015 -rw-r--r-- melissa user 20 1996-09-23 16:44 folk
4016 -rw-r--r-- melissa user 65 1997-01-30 14:15 jazz
4017 @end smallexample
4018
4019 We can concatenate these two archives with @command{tar}:
4020
4021 @smallexample
4022 $ @kbd{cd ..}
4023 $ @kbd{tar --concatenate --file=bluesrock.tar jazzfolk.tar}
4024 @end smallexample
4025
4026 If you now list the contents of the @file{bluesclass.tar}, you will see
4027 that now it also contains the archive members of @file{jazzfolk.tar}:
4028
4029 @smallexample
4030 $ @kbd{tar --list --file=bluesrock.tar}
4031 blues
4032 rock
4033 jazz
4034 folk
4035 @end smallexample
4036
4037 When you use @option{--concatenate}, the source and target archives must
4038 already exist and must have been created using compatible format
4039 parameters. Notice, that @command{tar} does not check whether the
4040 archives it concatenates have compatible formats, it does not
4041 even check if the files are really tar archives.
4042
4043 Like @option{--append} (@option{-r}), this operation cannot be performed on some
4044 tape drives, due to deficiencies in the formats those tape drives use.
4045
4046 @cindex @code{concatenate} vs @command{cat}
4047 @cindex @command{cat} vs @code{concatenate}
4048 It may seem more intuitive to you to want or try to use @command{cat} to
4049 concatenate two archives instead of using the @option{--concatenate}
4050 operation; after all, @command{cat} is the utility for combining files.
4051
4052 However, @command{tar} archives incorporate an end-of-file marker which
4053 must be removed if the concatenated archives are to be read properly as
4054 one archive. @option{--concatenate} removes the end-of-archive marker
4055 from the target archive before each new archive is appended. If you use
4056 @command{cat} to combine the archives, the result will not be a valid
4057 @command{tar} format archive. If you need to retrieve files from an
4058 archive that was added to using the @command{cat} utility, use the
4059 @option{--ignore-zeros} (@option{-i}) option. @xref{Ignore Zeros}, for further
4060 information on dealing with archives improperly combined using the
4061 @command{cat} shell utility.
4062
4063 @node delete
4064 @subsection Removing Archive Members Using @option{--delete}
4065 @UNREVISED
4066 @cindex Deleting files from an archive
4067 @cindex Removing files from an archive
4068
4069 @opindex delete
4070 You can remove members from an archive by using the @option{--delete}
4071 option. Specify the name of the archive with @option{--file}
4072 (@option{-f}) and then specify the names of the members to be deleted;
4073 if you list no member names, nothing will be deleted. The
4074 @option{--verbose} option will cause @command{tar} to print the names
4075 of the members as they are deleted. As with @option{--extract}, you
4076 must give the exact member names when using @samp{tar --delete}.
4077 @option{--delete} will remove all versions of the named file from the
4078 archive. The @option{--delete} operation can run very slowly.
4079
4080 Unlike other operations, @option{--delete} has no short form.
4081
4082 @cindex Tapes, using @option{--delete} and
4083 @cindex Deleting from tape archives
4084 This operation will rewrite the archive. You can only use
4085 @option{--delete} on an archive if the archive device allows you to
4086 write to any point on the media, such as a disk; because of this, it
4087 does not work on magnetic tapes. Do not try to delete an archive member
4088 from a magnetic tape; the action will not succeed, and you will be
4089 likely to scramble the archive and damage your tape. There is no safe
4090 way (except by completely re-writing the archive) to delete files from
4091 most kinds of magnetic tape. @xref{Media}.
4092
4093 To delete all versions of the file @file{blues} from the archive
4094 @file{collection.tar} in the @file{practice} directory, make sure you
4095 are in that directory, and then,
4096
4097 @smallexample
4098 $ @kbd{tar --list --file=collection.tar}
4099 blues
4100 folk
4101 jazz
4102 rock
4103 practice/blues
4104 practice/folk
4105 practice/jazz
4106 practice/rock
4107 practice/blues
4108 $ @kbd{tar --delete --file=collection.tar blues}
4109 $ @kbd{tar --list --file=collection.tar}
4110 folk
4111 jazz
4112 rock
4113 $
4114 @end smallexample
4115
4116 @FIXME{I changed the order of these nodes around and haven't had a chance
4117 to fix the above example's results, yet. I have to play with this and
4118 follow it and see what it actually does!}
4119
4120 The @option{--delete} option has been reported to work properly when
4121 @command{tar} acts as a filter from @code{stdin} to @code{stdout}.
4122
4123 @node compare
4124 @subsection Comparing Archive Members with the File System
4125 @cindex Verifying the currency of an archive
4126 @UNREVISED
4127
4128 @opindex compare
4129 The @option{--compare} (@option{-d}), or @option{--diff} operation compares
4130 specified archive members against files with the same names, and then
4131 reports differences in file size, mode, owner, modification date and
4132 contents. You should @emph{only} specify archive member names, not file
4133 names. If you do not name any members, then @command{tar} will compare the
4134 entire archive. If a file is represented in the archive but does not
4135 exist in the file system, @command{tar} reports a difference.
4136
4137 You have to specify the record size of the archive when modifying an
4138 archive with a non-default record size.
4139
4140 @command{tar} ignores files in the file system that do not have
4141 corresponding members in the archive.
4142
4143 The following example compares the archive members @file{rock},
4144 @file{blues} and @file{funk} in the archive @file{bluesrock.tar} with
4145 files of the same name in the file system. (Note that there is no file,
4146 @file{funk}; @command{tar} will report an error message.)
4147
4148 @smallexample
4149 $ @kbd{tar --compare --file=bluesrock.tar rock blues funk}
4150 rock
4151 blues
4152 tar: funk not found in archive
4153 @end smallexample
4154
4155 The spirit behind the @option{--compare} (@option{--diff}, @option{-d}) option is to check whether the
4156 archive represents the current state of files on disk, more than validating
4157 the integrity of the archive media. For this later goal, @xref{verify}.
4158
4159 @node create options
4160 @section Options Used by @option{--create}
4161
4162 @opindex create, additional options
4163 The previous chapter described the basics of how to use
4164 @option{--create} (@option{-c}) to create an archive from a set of files.
4165 @xref{create}. This section described advanced options to be used with
4166 @option{--create}.
4167
4168 @menu
4169 * Ignore Failed Read::
4170 @end menu
4171
4172 @node Ignore Failed Read
4173 @subsection Ignore Fail Read
4174
4175 @table @option
4176 @item --ignore-failed-read
4177 Do not exit with nonzero on unreadable files or directories.
4178 @end table
4179
4180 @node extract options
4181 @section Options Used by @option{--extract}
4182 @UNREVISED
4183
4184 @FIXME{i need to get dan to go over these options with me and see if
4185 there's a better way of organizing them.}
4186
4187 @opindex extract, additional options
4188 The previous chapter showed how to use @option{--extract} to extract
4189 an archive into the file system. Various options cause @command{tar} to
4190 extract more information than just file contents, such as the owner,
4191 the permissions, the modification date, and so forth. This section
4192 presents options to be used with @option{--extract} when certain special
4193 considerations arise. You may review the information presented in
4194 @ref{extract} for more basic information about the
4195 @option{--extract} operation.
4196
4197 @menu
4198 * Reading:: Options to Help Read Archives
4199 * Writing:: Changing How @command{tar} Writes Files
4200 * Scarce:: Coping with Scarce Resources
4201 @end menu
4202
4203 @node Reading
4204 @subsection Options to Help Read Archives
4205 @cindex Options when reading archives
4206 @UNREVISED
4207
4208 @cindex Reading incomplete records
4209 @cindex Records, incomplete
4210 @opindex read-full-records
4211 Normally, @command{tar} will request data in full record increments from
4212 an archive storage device. If the device cannot return a full record,
4213 @command{tar} will report an error. However, some devices do not always
4214 return full records, or do not require the last record of an archive to
4215 be padded out to the next record boundary. To keep reading until you
4216 obtain a full record, or to accept an incomplete record if it contains
4217 an end-of-archive marker, specify the @option{--read-full-records} (@option{-B}) option
4218 in conjunction with the @option{--extract} or @option{--list} operations.
4219 @xref{Blocking}.
4220
4221 The @option{--read-full-records} (@option{-B}) option is turned on by default when
4222 @command{tar} reads an archive from standard input, or from a remote
4223 machine. This is because on BSD Unix systems, attempting to read a
4224 pipe returns however much happens to be in the pipe, even if it is
4225 less than was requested. If this option were not enabled, @command{tar}
4226 would fail as soon as it read an incomplete record from the pipe.
4227
4228 If you're not sure of the blocking factor of an archive, you can
4229 read the archive by specifying @option{--read-full-records} (@option{-B}) and
4230 @option{--blocking-factor=@var{512-size}} (@option{-b
4231 @var{512-size}}), using a blocking factor larger than what the archive
4232 uses. This lets you avoid having to determine the blocking factor
4233 of an archive. @xref{Blocking Factor}.
4234
4235 @menu
4236 * read full records::
4237 * Ignore Zeros::
4238 @end menu
4239
4240 @node read full records
4241 @unnumberedsubsubsec Reading Full Records
4242
4243 @FIXME{need sentence or so of intro here}
4244
4245 @table @option
4246 @opindex read-full-records
4247 @item --read-full-records
4248 @item -B
4249 Use in conjunction with @option{--extract} (@option{--get},
4250 @option{-x}) to read an archive which contains incomplete records, or
4251 one which has a blocking factor less than the one specified.
4252 @end table
4253
4254 @node Ignore Zeros
4255 @unnumberedsubsubsec Ignoring Blocks of Zeros
4256
4257 @cindex End-of-archive blocks, ignoring
4258 @cindex Ignoring end-of-archive blocks
4259 @opindex ignore-zeros
4260 Normally, @command{tar} stops reading when it encounters a block of zeros
4261 between file entries (which usually indicates the end of the archive).
4262 @option{--ignore-zeros} (@option{-i}) allows @command{tar} to
4263 completely read an archive which contains a block of zeros before the
4264 end (i.e., a damaged archive, or one that was created by concatenating
4265 several archives together).
4266
4267 The @option{--ignore-zeros} (@option{-i}) option is turned off by default because many
4268 versions of @command{tar} write garbage after the end-of-archive entry,
4269 since that part of the media is never supposed to be read. @GNUTAR{}
4270 does not write after the end of an archive, but seeks to
4271 maintain compatiblity among archiving utilities.
4272
4273 @table @option
4274 @item --ignore-zeros
4275 @itemx -i
4276 To ignore blocks of zeros (i.e., end-of-archive entries) which may be
4277 encountered while reading an archive. Use in conjunction with
4278 @option{--extract} or @option{--list}.
4279 @end table
4280
4281 @node Writing
4282 @subsection Changing How @command{tar} Writes Files
4283 @UNREVISED
4284
4285 @FIXME{Introductory paragraph}
4286
4287 @menu
4288 * Dealing with Old Files::
4289 * Overwrite Old Files::
4290 * Keep Old Files::
4291 * Keep Newer Files::
4292 * Unlink First::
4293 * Recursive Unlink::
4294 * Data Modification Times::
4295 * Setting Access Permissions::
4296 * Writing to Standard Output::
4297 * Writing to an External Program::
4298 * remove files::
4299 @end menu
4300
4301 @node Dealing with Old Files
4302 @unnumberedsubsubsec Options Controlling the Overwriting of Existing Files
4303
4304 @opindex overwrite-dir, introduced
4305 When extracting files, if @command{tar} discovers that the extracted
4306 file already exists, it normally replaces the file by removing it before
4307 extracting it, to prevent confusion in the presence of hard or symbolic
4308 links. (If the existing file is a symbolic link, it is removed, not
4309 followed.) However, if a directory cannot be removed because it is
4310 nonempty, @command{tar} normally overwrites its metadata (ownership,
4311 permission, etc.). The @option{--overwrite-dir} option enables this
4312 default behavior. To be more cautious and preserve the metadata of
4313 such a directory, use the @option{--no-overwrite-dir} option.
4314
4315 @cindex Overwriting old files, prevention
4316 @opindex keep-old-files, introduced
4317 To be even more cautious and prevent existing files from being replaced, use
4318 the @option{--keep-old-files} (@option{-k}) option. It causes @command{tar} to refuse
4319 to replace or update a file that already exists, i.e., a file with the
4320 same name as an archive member prevents extraction of that archive
4321 member. Instead, it reports an error.
4322
4323 @opindex overwrite, introduced
4324 To be more aggressive about altering existing files, use the
4325 @option{--overwrite} option. It causes @command{tar} to overwrite
4326 existing files and to follow existing symbolic links when extracting.
4327
4328 @cindex Protecting old files
4329 Some people argue that @GNUTAR{} should not hesitate
4330 to overwrite files with other files when extracting. When extracting
4331 a @command{tar} archive, they expect to see a faithful copy of the
4332 state of the file system when the archive was created. It is debatable
4333 that this would always be a proper behavior. For example, suppose one
4334 has an archive in which @file{usr/local} is a link to
4335 @file{usr/local2}. Since then, maybe the site removed the link and
4336 renamed the whole hierarchy from @file{/usr/local2} to
4337 @file{/usr/local}. Such things happen all the time. I guess it would
4338 not be welcome at all that @GNUTAR{} removes the
4339 whole hierarchy just to make room for the link to be reinstated
4340 (unless it @emph{also} simultaneously restores the full
4341 @file{/usr/local2}, of course!) @GNUTAR{} is indeed
4342 able to remove a whole hierarchy to reestablish a symbolic link, for
4343 example, but @emph{only if} @option{--recursive-unlink} is specified
4344 to allow this behavior. In any case, single files are silently
4345 removed.
4346
4347 @opindex unlink-first, introduced
4348 Finally, the @option{--unlink-first} (@option{-U}) option can improve performance in
4349 some cases by causing @command{tar} to remove files unconditionally
4350 before extracting them.
4351
4352 @node Overwrite Old Files
4353 @unnumberedsubsubsec Overwrite Old Files
4354
4355 @table @option
4356 @opindex overwrite
4357 @item --overwrite
4358 Overwrite existing files and directory metadata when extracting files
4359 from an archive.
4360
4361 This causes @command{tar} to write extracted files into the file system without
4362 regard to the files already on the system; i.e., files with the same
4363 names as archive members are overwritten when the archive is extracted.
4364 It also causes @command{tar} to extract the ownership, permissions,
4365 and time stamps onto any preexisting files or directories.
4366 If the name of a corresponding file name is a symbolic link, the file
4367 pointed to by the symbolic link will be overwritten instead of the
4368 symbolic link itself (if this is possible). Moreover, special devices,
4369 empty directories and even symbolic links are automatically removed if
4370 they are in the way of extraction.
4371
4372 Be careful when using the @option{--overwrite} option, particularly when
4373 combined with the @option{--absolute-names} (@option{-P}) option, as this combination
4374 can change the contents, ownership or permissions of any file on your
4375 system. Also, many systems do not take kindly to overwriting files that
4376 are currently being executed.
4377
4378 @opindex overwrite-dir
4379 @item --overwrite-dir
4380 Overwrite the metadata of directories when extracting files from an
4381 archive, but remove other files before extracting.
4382 @end table
4383
4384 @node Keep Old Files
4385 @unnumberedsubsubsec Keep Old Files
4386
4387 @table @option
4388 @opindex keep-old-files
4389 @item --keep-old-files
4390 @itemx -k
4391 Do not replace existing files from archive. The
4392 @option{--keep-old-files} (@option{-k}) option prevents @command{tar}
4393 from replacing existing files with files with the same name from the
4394 archive. The @option{--keep-old-files} option is meaningless with
4395 @option{--list} (@option{-t}). Prevents @command{tar} from replacing
4396 files in the file system during extraction.
4397 @end table
4398
4399 @node Keep Newer Files
4400 @unnumberedsubsubsec Keep Newer Files
4401
4402 @table @option
4403 @opindex keep-newer-files
4404 @item --keep-newer-files
4405 Do not replace existing files that are newer than their archive
4406 copies. This option is meaningless with @option{--list} (@option{-t}).
4407 @end table
4408
4409 @node Unlink First
4410 @unnumberedsubsubsec Unlink First
4411
4412 @table @option
4413 @opindex unlink-first
4414 @item --unlink-first
4415 @itemx -U
4416 Remove files before extracting over them.
4417 This can make @command{tar} run a bit faster if you know in advance
4418 that the extracted files all need to be removed. Normally this option
4419 slows @command{tar} down slightly, so it is disabled by default.
4420 @end table
4421
4422 @node Recursive Unlink
4423 @unnumberedsubsubsec Recursive Unlink
4424
4425 @table @option
4426 @opindex recursive-unlink
4427 @item --recursive-unlink
4428 When this option is specified, try removing files and directory hierarchies
4429 before extracting over them. @emph{This is a dangerous option!}
4430 @end table
4431
4432 If you specify the @option{--recursive-unlink} option,
4433 @command{tar} removes @emph{anything} that keeps you from extracting a file
4434 as far as current permissions will allow it. This could include removal
4435 of the contents of a full directory hierarchy.
4436
4437 @node Data Modification Times
4438 @unnumberedsubsubsec Setting Data Modification Times
4439
4440 @cindex Data modification times of extracted files
4441 @cindex Modification times of extracted files
4442 Normally, @command{tar} sets the data modification times of extracted
4443 files to the corresponding times recorded for the files in the archive, but
4444 limits the permissions of extracted files by the current @code{umask}
4445 setting.
4446
4447 To set the data modification times of extracted files to the time when
4448 the files were extracted, use the @option{--touch} (@option{-m}) option in
4449 conjunction with @option{--extract} (@option{--get}, @option{-x}).
4450
4451 @table @option
4452 @opindex touch
4453 @item --touch
4454 @itemx -m
4455 Sets the data modification time of extracted archive members to the time
4456 they were extracted, not the time recorded for them in the archive.
4457 Use in conjunction with @option{--extract} (@option{--get}, @option{-x}).
4458 @end table
4459
4460 @node Setting Access Permissions
4461 @unnumberedsubsubsec Setting Access Permissions
4462
4463 @cindex Permissions of extracted files
4464 @cindex Modes of extracted files
4465 To set the modes (access permissions) of extracted files to those
4466 recorded for those files in the archive, use @option{--same-permissions}
4467 in conjunction with the @option{--extract} (@option{--get},
4468 @option{-x}) operation. @FIXME{Should be aliased to ignore-umask.}
4469
4470 @table @option
4471 @opindex preserve-permission
4472 @opindex same-permission
4473 @item --preserve-permission
4474 @itemx --same-permission
4475 @c @itemx --ignore-umask
4476 @itemx -p
4477 Set modes of extracted archive members to those recorded in the
4478 archive, instead of current umask settings. Use in conjunction with
4479 @option{--extract} (@option{--get}, @option{-x}).
4480 @end table
4481
4482 @node Writing to Standard Output
4483 @unnumberedsubsubsec Writing to Standard Output
4484
4485 @cindex Writing extracted files to standard output
4486 @cindex Standard output, writing extracted files to
4487 To write the extracted files to the standard output, instead of
4488 creating the files on the file system, use @option{--to-stdout} (@option{-O}) in
4489 conjunction with @option{--extract} (@option{--get}, @option{-x}). This option is useful if you are
4490 extracting files to send them through a pipe, and do not need to
4491 preserve them in the file system. If you extract multiple members,
4492 they appear on standard output concatenated, in the order they are
4493 found in the archive.
4494
4495 @table @option
4496 @opindex to-stdout
4497 @item --to-stdout
4498 @itemx -O
4499 Writes files to the standard output. Use only in conjunction with
4500 @option{--extract} (@option{--get}, @option{-x}). When this option is
4501 used, instead of creating the files specified, @command{tar} writes
4502 the contents of the files extracted to its standard output. This may
4503 be useful if you are only extracting the files in order to send them
4504 through a pipe. This option is meaningless with @option{--list}
4505 (@option{-t}).
4506 @end table
4507
4508 This can be useful, for example, if you have a tar archive containing
4509 a big file and don't want to store the file on disk before processing
4510 it. You can use a command like this:
4511
4512 @smallexample
4513 tar -xOzf foo.tgz bigfile | process
4514 @end smallexample
4515
4516 or even like this if you want to process the concatenation of the files:
4517
4518 @smallexample
4519 tar -xOzf foo.tgz bigfile1 bigfile2 | process
4520 @end smallexample
4521
4522 Hovewer, @option{--to-command} may be more convenient for use with
4523 multiple files. See the next section.
4524
4525 @node Writing to an External Program
4526 @unnumberedsubsubsec Writing to an External Program
4527
4528 You can instruct @command{tar} to send the contents of each extracted
4529 file to the standard input of an external program:
4530
4531 @table @option
4532 @opindex to-program
4533 @item --to-program=@var{command}
4534 Extract files and pipe their contents to the standard input of
4535 @var{command}. When this option is used, instead of creating the
4536 files specified, @command{tar} invokes @var{command} and pipes the
4537 contents of the files to its standard output. @var{Command} may
4538 contain command line arguments. The program is executed via
4539 @code{sh -c}. Notice, that @var{command} is executed once for each regular file
4540 extracted. Non-regular files (directories, etc.) are ignored when this
4541 option is used.
4542 @end table
4543
4544 The command can obtain the information about the file it processes
4545 from the following environment variables:
4546
4547 @table @var
4548 @vrindex TAR_FILETYPE, to-command environment
4549 @item TAR_FILETYPE
4550 Type of the file. It is a single letter with the following meaning:
4551
4552 @multitable @columnfractions 0.10 0.90
4553 @item f @tab Regular file
4554 @item d @tab Directory
4555 @item l @tab Symbolic link
4556 @item h @tab Hard link
4557 @item b @tab Block device
4558 @item c @tab Character device
4559 @end multitable
4560
4561 Currently only regular files are supported.
4562
4563 @vrindex TAR_MODE, to-command environment
4564 @item TAR_MODE
4565 File mode, an octal number.
4566
4567 @vrindex TAR_FILENAME, to-command environment
4568 @item TAR_FILENAME
4569 The name of the file.
4570
4571 @vrindex TAR_REALNAME, to-command environment
4572 @item TAR_REALNAME
4573 Name of the file as stored in the archive.
4574
4575 @vrindex TAR_UNAME, to-command environment
4576 @item TAR_UNAME
4577 Name of the file owner.
4578
4579 @vrindex TAR_GNAME, to-command environment
4580 @item TAR_GNAME
4581 Name of the file owner group.
4582
4583 @vrindex TAR_ATIME, to-command environment
4584 @item TAR_ATIME
4585 Time of last access. It is a decimal number, representing seconds
4586 since the epoch. If the archive provides times with nanosecond
4587 precision, the nanoseconds are appended to the timestamp after a
4588 decimal point.
4589
4590 @vrindex TAR_MTIME, to-command environment
4591 @item TAR_MTIME
4592 Time of last modification.
4593
4594 @vrindex TAR_CTIME, to-command environment
4595 @item TAR_CTIME
4596 Time of last status change.
4597
4598 @vrindex TAR_SIZE, to-command environment
4599 @item TAR_SIZE
4600 Size of the file.
4601
4602 @vrindex TAR_UID, to-command environment
4603 @item TAR_UID
4604 UID of the file owner.
4605
4606 @vrindex TAR_GID, to-command environment
4607 @item TAR_GID
4608 GID of the file owner.
4609 @end table
4610
4611 In addition to these variables, @env{TAR_VERSION} contains the
4612 @GNUTAR{} version number.
4613
4614 If @var{command} exits with a non-0 status, @command{tar} will print
4615 an error message similar to the following:
4616
4617 @smallexample
4618 tar: 2345: Child returned status 1
4619 @end smallexample
4620
4621 Here, @samp{2345} is the PID of the finished process.
4622
4623 If this behavior is not wanted, use @option{--ignore-command-error}:
4624
4625 @table @option
4626 @opindex ignore-command-error
4627 @item --ignore-command-error
4628 Ignore exit codes of subprocesses. Notice that if the program
4629 exits on signal or otherwise terminates abnormally, the error message
4630 will be printed even if this option is used.
4631
4632 @opindex no-ignore-command-error
4633 @item --no-ignore-command-error
4634 Cancel the effect of any previous @option{--ignore-command-error}
4635 option. This option is useful if you have set
4636 @option{--ignore-command-error} in @env{TAR_OPTIONS}
4637 (@pxref{TAR_OPTIONS}) and wish to temporarily cancel it.
4638 @end table
4639
4640 @node remove files
4641 @unnumberedsubsubsec Removing Files
4642
4643 @FIXME{the various macros in the front of the manual think that this
4644 option goes in this section. i have no idea; i only know it's nowhere
4645 else in the book...}
4646
4647 @table @option
4648 @opindex remove-files
4649 @item --remove-files
4650 Remove files after adding them to the archive.
4651 @end table
4652
4653 @node Scarce
4654 @subsection Coping with Scarce Resources
4655 @UNREVISED
4656
4657 @cindex Small memory
4658 @cindex Running out of space
4659
4660 @menu
4661 * Starting File::
4662 * Same Order::
4663 @end menu
4664
4665 @node Starting File
4666 @unnumberedsubsubsec Starting File
4667
4668 @table @option
4669 @opindex starting-file
4670 @item --starting-file=@var{name}
4671 @itemx -K @var{name}
4672 Starts an operation in the middle of an archive. Use in conjunction
4673 with @option{--extract} (@option{--get}, @option{-x}) or @option{--list} (@option{-t}).
4674 @end table
4675
4676 @cindex Middle of the archive, starting in the
4677 If a previous attempt to extract files failed due to lack of disk
4678 space, you can use @option{--starting-file=@var{name}} (@option{-K
4679 @var{name}}) to start extracting only after member @var{name} of the
4680 archive. This assumes, of course, that there is now free space, or
4681 that you are now extracting into a different file system. (You could
4682 also choose to suspend @command{tar}, remove unnecessary files from
4683 the file system, and then restart the same @command{tar} operation.
4684 In this case, @option{--starting-file} is not necessary.
4685 @xref{Incremental Dumps}, @xref{interactive}, and @ref{exclude}.)
4686
4687 @node Same Order
4688 @unnumberedsubsubsec Same Order
4689
4690 @table @option
4691 @cindex Large lists of file names on small machines
4692 @opindex same-order
4693 @opindex preserve-order
4694 @item --same-order
4695 @itemx --preserve-order
4696 @itemx -s
4697 To process large lists of file names on machines with small amounts of
4698 memory. Use in conjunction with @option{--compare} (@option{--diff},
4699 @option{-d}), @option{--list} (@option{-t}) or @option{--extract}
4700 (@option{--get}, @option{-x}).
4701 @end table
4702
4703 The @option{--same-order} (@option{--preserve-order}, @option{-s}) option tells @command{tar} that the list of file
4704 names to be listed or extracted is sorted in the same order as the
4705 files in the archive. This allows a large list of names to be used,
4706 even on a small machine that would not otherwise be able to hold all
4707 the names in memory at the same time. Such a sorted list can easily be
4708 created by running @samp{tar -t} on the archive and editing its output.
4709
4710 This option is probably never needed on modern computer systems.
4711
4712 @node backup
4713 @section Backup options
4714
4715 @cindex backup options
4716
4717 @GNUTAR{} offers options for making backups of files
4718 before writing new versions. These options control the details of
4719 these backups. They may apply to the archive itself before it is
4720 created or rewritten, as well as individual extracted members. Other
4721 @acronym{GNU} programs (@command{cp}, @command{install}, @command{ln},
4722 and @command{mv}, for example) offer similar options.
4723
4724 Backup options may prove unexpectedly useful when extracting archives
4725 containing many members having identical name, or when extracting archives
4726 on systems having file name limitations, making different members appear
4727 has having similar names through the side-effect of name truncation.
4728 (This is true only if we have a good scheme for truncated backup names,
4729 which I'm not sure at all: I suspect work is needed in this area.)
4730 When any existing file is backed up before being overwritten by extraction,
4731 then clashing files are automatically be renamed to be unique, and the
4732 true name is kept for only the last file of a series of clashing files.
4733 By using verbose mode, users may track exactly what happens.
4734
4735 At the detail level, some decisions are still experimental, and may
4736 change in the future, we are waiting comments from our users. So, please
4737 do not learn to depend blindly on the details of the backup features.
4738 For example, currently, directories themselves are never renamed through
4739 using these options, so, extracting a file over a directory still has
4740 good chances to fail. Also, backup options apply to created archives,
4741 not only to extracted members. For created archives, backups will not
4742 be attempted when the archive is a block or character device, or when it
4743 refers to a remote file.
4744
4745 For the sake of simplicity and efficiency, backups are made by renaming old
4746 files prior to creation or extraction, and not by copying. The original
4747 name is restored if the file creation fails. If a failure occurs after a
4748 partial extraction of a file, both the backup and the partially extracted
4749 file are kept.
4750
4751 @table @samp
4752 @item --backup[=@var{method}]
4753 @opindex backup
4754 @vindex VERSION_CONTROL
4755 @cindex backups
4756 Back up files that are about to be overwritten or removed.
4757 Without this option, the original versions are destroyed.
4758
4759 Use @var{method} to determine the type of backups made.
4760 If @var{method} is not specified, use the value of the @env{VERSION_CONTROL}
4761 environment variable. And if @env{VERSION_CONTROL} is not set,
4762 use the @samp{existing} method.
4763
4764 @vindex version-control @r{Emacs variable}
4765 This option corresponds to the Emacs variable @samp{version-control};
4766 the same values for @var{method} are accepted as in Emacs. This option
4767 also allows more descriptive names. The valid @var{method}s are:
4768
4769 @table @samp
4770 @item t
4771 @itemx numbered
4772 @cindex numbered @r{backup method}
4773 Always make numbered backups.
4774
4775 @item nil
4776 @itemx existing
4777 @cindex existing @r{backup method}
4778 Make numbered backups of files that already have them, simple backups
4779 of the others.
4780
4781 @item never
4782 @itemx simple
4783 @cindex simple @r{backup method}
4784 Always make simple backups.
4785
4786 @end table
4787
4788 @item --suffix=@var{suffix}
4789 @opindex suffix
4790 @cindex backup suffix
4791 @vindex SIMPLE_BACKUP_SUFFIX
4792 Append @var{suffix} to each backup file made with @option{--backup}. If this
4793 option is not specified, the value of the @env{SIMPLE_BACKUP_SUFFIX}
4794 environment variable is used. And if @env{SIMPLE_BACKUP_SUFFIX} is not
4795 set, the default is @samp{~}, just as in Emacs.
4796
4797 @end table
4798
4799 Some people express the desire to @emph{always} use the @option{--backup}
4800 option, by defining some kind of alias or script. This is not as easy
4801 as one may think, due to the fact that old style options should appear first
4802 and consume arguments a bit unpredictably for an alias or script. But,
4803 if you are ready to give up using old style options, you may resort to
4804 using something like (a Bourne shell function here):
4805
4806 @smallexample
4807 tar () @{ /usr/local/bin/tar --backup $*; @}
4808 @end smallexample
4809
4810 @node Applications
4811 @section Notable @command{tar} Usages
4812 @UNREVISED
4813
4814 @FIXME{Using Unix file linking capability to recreate directory
4815 structures---linking files into one subdirectory and then
4816 @command{tar}ring that directory.}
4817
4818 @FIXME{Nice hairy example using absolute-names, newer, etc.}
4819
4820 @findex uuencode
4821 You can easily use archive files to transport a group of files from
4822 one system to another: put all relevant files into an archive on one
4823 computer system, transfer the archive to another system, and extract
4824 the contents there. The basic transfer medium might be magnetic tape,
4825 Internet FTP, or even electronic mail (though you must encode the
4826 archive with @command{uuencode} in order to transport it properly by
4827 mail). Both machines do not have to use the same operating system, as
4828 long as they both support the @command{tar} program.
4829
4830 For example, here is how you might copy a directory's contents from
4831 one disk to another, while preserving the dates, modes, owners and
4832 link-structure of all the files therein. In this case, the transfer
4833 medium is a @dfn{pipe}, which is one a Unix redirection mechanism:
4834
4835 @smallexample
4836 $ @kbd{cd sourcedir; tar -cf - . | (cd targetdir; tar -xf -)}
4837 @end smallexample
4838
4839 @noindent
4840 The command also works using short option forms:
4841
4842 @smallexample
4843 $ @w{@kbd{cd sourcedir; tar --create --file=- . | (cd targetdir; tar --extract --file=-)}}
4844 @end smallexample
4845
4846 @noindent
4847 This is one of the easiest methods to transfer a @command{tar} archive.
4848
4849 @node looking ahead
4850 @section Looking Ahead: The Rest of this Manual
4851
4852 You have now seen how to use all eight of the operations available to
4853 @command{tar}, and a number of the possible options. The next chapter
4854 explains how to choose and change file and archive names, how to use
4855 files to store names of other files which you can then call as
4856 arguments to @command{tar} (this can help you save time if you expect to
4857 archive the same list of files a number of times), and so forth.
4858 @FIXME{in case it's not obvious, i'm making this up in some sense
4859 based on my limited memory of what the next chapter *really* does. i
4860 just wanted to flesh out this final section a little bit so i'd
4861 remember to stick it in here. :-)}
4862
4863 If there are too many files to conveniently list on the command line,
4864 you can list the names in a file, and @command{tar} will read that file.
4865 @xref{files}.
4866
4867 There are various ways of causing @command{tar} to skip over some files,
4868 and not archive them. @xref{Choosing}.
4869
4870 @node Backups
4871 @chapter Performing Backups and Restoring Files
4872 @UNREVISED
4873
4874 @GNUTAR{} is distributed along with the scripts
4875 which the Free Software Foundation uses for performing backups. There
4876 is no corresponding scripts available yet for doing restoration of
4877 files. Even if there is a good chance those scripts may be satisfying
4878 to you, they are not the only scripts or methods available for doing
4879 backups and restore. You may well create your own, or use more
4880 sophisticated packages dedicated to that purpose.
4881
4882 Some users are enthusiastic about @code{Amanda} (The Advanced Maryland
4883 Automatic Network Disk Archiver), a backup system developed by James
4884 da Silva @file{jds@@cs.umd.edu} and available on many Unix systems.
4885 This is free software, and it is available at these places:
4886
4887 @smallexample
4888 http://www.cs.umd.edu/projects/amanda/amanda.html
4889 ftp://ftp.cs.umd.edu/pub/amanda
4890 @end smallexample
4891
4892 @FIXME{
4893
4894 Here is a possible plan for a future documentation about the backuping
4895 scripts which are provided within the @GNUTAR{}
4896 distribution.
4897
4898 @itemize @bullet
4899 @item dumps
4900 @itemize @minus
4901 @item what are dumps
4902 @item different levels of dumps
4903 @itemize +
4904 @item full dump = dump everything
4905 @item level 1, level 2 dumps etc
4906 A level @var{n} dump dumps everything changed since the last level
4907 @var{n}-1 dump (?)
4908 @end itemize
4909 @item how to use scripts for dumps (ie, the concept)
4910 @itemize +
4911 @item scripts to run after editing backup specs (details)
4912 @end itemize
4913 @item Backup Specs, what is it.
4914 @itemize +
4915 @item how to customize
4916 @item actual text of script [/sp/dump/backup-specs]
4917 @end itemize
4918 @item Problems
4919 @itemize +
4920 @item rsh doesn't work
4921 @item rtape isn't installed
4922 @item (others?)
4923 @end itemize
4924 @item the @option{--incremental} option of tar
4925 @item tapes
4926 @itemize +
4927 @item write protection
4928 @item types of media, different sizes and types, useful for different things
4929 @item files and tape marks
4930 one tape mark between files, two at end.
4931 @item positioning the tape
4932 MT writes two at end of write,
4933 backspaces over one when writing again.
4934 @end itemize
4935 @end itemize
4936 @end itemize
4937 }
4938
4939 This chapter documents both the provided shell scripts and @command{tar}
4940 options which are more specific to usage as a backup tool.
4941
4942 To @dfn{back up} a file system means to create archives that contain
4943 all the files in that file system. Those archives can then be used to
4944 restore any or all of those files (for instance if a disk crashes or a
4945 file is accidentally deleted). File system @dfn{backups} are also
4946 called @dfn{dumps}.
4947
4948 @menu
4949 * Full Dumps:: Using @command{tar} to Perform Full Dumps
4950 * Incremental Dumps:: Using @command{tar} to Perform Incremental Dumps
4951 * Backup Levels:: Levels of Backups
4952 * Backup Parameters:: Setting Parameters for Backups and Restoration
4953 * Scripted Backups:: Using the Backup Scripts
4954 * Scripted Restoration:: Using the Restore Script
4955 @end menu
4956
4957 @node Full Dumps
4958 @section Using @command{tar} to Perform Full Dumps
4959 @UNREVISED
4960
4961 @cindex full dumps
4962 @cindex dumps, full
4963
4964 @cindex corrupted archives
4965 Full dumps should only be made when no other people or programs
4966 are modifying files in the file system. If files are modified while
4967 @command{tar} is making the backup, they may not be stored properly in
4968 the archive, in which case you won't be able to restore them if you
4969 have to. (Files not being modified are written with no trouble, and do
4970 not corrupt the entire archive.)
4971
4972 You will want to use the @option{--label=@var{archive-label}}
4973 (@option{-V @var{archive-label}}) option to give the archive a
4974 volume label, so you can tell what this archive is even if the label
4975 falls off the tape, or anything like that.
4976
4977 Unless the file system you are dumping is guaranteed to fit on
4978 one volume, you will need to use the @option{--multi-volume} (@option{-M}) option.
4979 Make sure you have enough tapes on hand to complete the backup.
4980
4981 If you want to dump each file system separately you will need to use
4982 the @option{--one-file-system} (@option{-l}) option to prevent
4983 @command{tar} from crossing file system boundaries when storing
4984 (sub)directories.
4985
4986 The @option{--incremental} (@option{-G}) (@pxref{Incremental Dumps})
4987 option is not needed, since this is a complete copy of everything in
4988 the file system, and a full restore from this backup would only be
4989 done onto a completely
4990 empty disk.
4991
4992 Unless you are in a hurry, and trust the @command{tar} program (and your
4993 tapes), it is a good idea to use the @option{--verify} (@option{-W})
4994 option, to make sure your files really made it onto the dump properly.
4995 This will also detect cases where the file was modified while (or just
4996 after) it was being archived. Not all media (notably cartridge tapes)
4997 are capable of being verified, unfortunately.
4998
4999 @node Incremental Dumps
5000 @section Using @command{tar} to Perform Incremental Dumps
5001
5002 @dfn{Incremental backup} is a special form of @GNUTAR{} archive that
5003 stores additional metadata so that exact state of the file system
5004 can be restored when extracting the archive.
5005
5006 @GNUTAR{} currently offers two options for handling incremental
5007 backups: @option{--listed-incremental=@var{snapshot-file}} (@option{-g
5008 @var{snapshot-file}}) and @option{--incremental} (@option{-G}).
5009
5010 @opindex listed-incremental
5011 The option @option{--listed-incremental} instructs tar to operate on
5012 an incremental archive with additional metadata stored in a standalone
5013 file, called a @dfn{snapshot file}. The purpose of this file is to help
5014 determine which files have been changed, added or deleted since the
5015 last backup, so that the next incremental backup will contain only
5016 modified files. The name of the snapshot file is given as an argument
5017 to the option:
5018
5019 @table @option
5020 @item --listed-incremental=@var{file}
5021 @itemx -g @var{file}
5022 Handle incremental backups with snapshot data in @var{file}.
5023 @end table
5024
5025 To create an incremental backup, you would use
5026 @option{--listed-incremental} together with @option{--create}
5027 (@pxref{create}). For example:
5028
5029 @smallexample
5030 $ @kbd{tar --create \
5031 --file=archive.1.tar \
5032 --listed-incremental=/var/log/usr.snar \
5033 /usr}
5034 @end smallexample
5035
5036 This will create in @file{archive.1.tar} an incremental backup of
5037 the @file{/usr} file system, storing additional metadata in the file
5038 @file{/var/log/usr.snar}. If this file does not exist, it will be
5039 created. The created archive will then be a @dfn{level 0 backup};
5040 please see the next section for more on backup levels.
5041
5042 Otherwise, if the file @file{/var/log/usr.snar} exists, it
5043 determines which files are modified. In this case only these files will be
5044 stored in the archive. Suppose, for example, that after running the
5045 above command, you delete file @file{/usr/doc/old} and create
5046 directory @file{/usr/local/db} with the following contents:
5047
5048 @smallexample
5049 $ @kbd{ls /usr/local/db}
5050 /usr/local/db/data
5051 /usr/local/db/index
5052 @end smallexample
5053
5054 Some time later you create another incremental backup. You will
5055 then see:
5056
5057 @smallexample
5058 $ @kbd{tar --create \
5059 --file=archive.2.tar \
5060 --listed-incremental=/var/log/usr.snar \
5061 /usr}
5062 tar: usr/local/db: Directory is new
5063 usr/local/db/
5064 usr/local/db/data
5065 usr/local/db/index
5066 @end smallexample
5067
5068 @noindent
5069 The created archive @file{archive.2.tar} will contain only these
5070 three members. This archive is called a @dfn{level 1 backup}. Notice
5071 that @file{/var/log/usr.snar} will be updated with the new data, so if
5072 you plan to create more @samp{level 1} backups, it is necessary to
5073 create a working copy of the snapshot file before running
5074 @command{tar}. The above example will then be modified as follows:
5075
5076 @smallexample
5077 $ @kbd{cp /var/log/usr.snar /var/log/usr.snar-1}
5078 $ @kbd{tar --create \
5079 --file=archive.2.tar \
5080 --listed-incremental=/var/log/usr.snar-1 \
5081 /usr}
5082 @end smallexample
5083
5084 Incremental dumps depend crucially on time stamps, so the results are
5085 unreliable if you modify a file's time stamps during dumping (e.g.,
5086 with the @option{--atime-preserve=replace} option), or if you set the clock
5087 backwards.
5088
5089 Metadata stored in snapshot files include device numbers, which,
5090 obviously is supposed to be a non-volatile value. However, it turns
5091 out that NFS devices have undependable values when an automounter
5092 gets in the picture. This can lead to a great deal of spurious
5093 redumping in incremental dumps, so it is somewhat useless to compare
5094 two NFS devices numbers over time. The solution implemented currently
5095 is to considers all NFS devices as being equal when it comes to
5096 comparing directories; this is fairly gross, but there does not seem
5097 to be a better way to go.
5098
5099 Note that incremental archives use @command{tar} extensions and may
5100 not be readable by non-@acronym{GNU} versions of the @command{tar} program.
5101
5102 @opindex listed-incremental, using with @option{--extract}
5103 @opindex extract, using with @option{--listed-incremental}
5104 To extract from the incremental dumps, use
5105 @option{--listed-incremental} together with @option{--extract}
5106 option (@pxref{extracting files}). In this case, @command{tar} does
5107 not need to access snapshot file, since all the data necessary for
5108 extraction are stored in the archive itself. So, when extracting, you
5109 can give whatever argument to @option{--listed-incremental}, the usual
5110 practice is to use @option{--listed-incremental=/dev/null}.
5111 Alternatively, you can use @option{--incremental}, which needs no
5112 arguments. In general, @option{--incremental} (@option{-G}) can be
5113 used as a shortcut for @option{--listed-incremental} when listing or
5114 extracting incremental backups (for more information, regarding this
5115 option, @pxref{incremental-op}).
5116
5117 When extracting from the incremental backup @GNUTAR{} attempts to
5118 restore the exact state the file system had when the archive was
5119 created. In particular, it will @emph{delete} those files in the file
5120 system that did not exist in their directories when the archive was
5121 created. If you have created several levels of incremental files,
5122 then in order to restore the exact contents the file system had when
5123 the last level was created, you will need to restore from all backups
5124 in turn. Continuing our example, to restore the state of @file{/usr}
5125 file system, one would do@footnote{Notice, that since both archives
5126 were created withouth @option{-P} option (@pxref{absolute}), these
5127 commands should be run from the root file system.}:
5128
5129 @smallexample
5130 $ @kbd{tar --extract \
5131 --listed-incremental=/dev/null \
5132 --file archive.1.tar}
5133 $ @kbd{tar --extract \
5134 --listed-incremental=/dev/null \
5135 --file archive.2.tar}
5136 @end smallexample
5137
5138 To list the contents of an incremental archive, use @option{--list}
5139 (@pxref{list}), as usual. To obtain more information about the
5140 archive, use @option{--listed-incremental} or @option{--incremental}
5141 combined with two @option{--verbose} options@footnote{Two
5142 @option{--verbose} options were selected to avoid breaking usual
5143 verbose listing output (@option{--list --verbose}) when using in
5144 scripts.
5145
5146 @opindex incremental, using with @option{--list}
5147 @opindex listed-incremental, using with @option{--list}
5148 @opindex list, using with @option{--incremental}
5149 @opindex list, using with @option{--listed-incremental}
5150 Versions of @GNUTAR{} up to 1.15.1 used to dump verbatim binary
5151 contents of the DUMPDIR header (with terminating nulls) when
5152 @option{--incremental} or @option{--listed-incremental} option was
5153 given, no matter what the verbosity level. This behavior, and,
5154 especially, the binary output it produced were considered incovenient
5155 and were changed in version 1.16}:
5156
5157 @smallexample
5158 @kbd{tar --list --incremental --verbose --verbose archive.tar}
5159 @end smallexample
5160
5161 This command will print, for each directory in the archive, the list
5162 of files in that directory at the time the archive was created. This
5163 information is put out in a format which is both human-readable and
5164 unambiguous for a program: each file name is printed as
5165
5166 @smallexample
5167 @var{x} @var{file}
5168 @end smallexample
5169
5170 @noindent
5171 where @var{x} is a letter describing the status of the file: @samp{Y}
5172 if the file is present in the archive, @samp{N} if the file is not
5173 included in the archive, or a @samp{D} if the file is a directory (and
5174 is included in the archive).@FIXME-xref{dumpdir format}. Each such
5175 line is terminated by a newline character. The last line is followed
5176 by an additional newline to indicate the end of the data.
5177
5178 @anchor{incremental-op}The option @option{--incremental} (@option{-G})
5179 gives the same behavior as @option{--listed-incremental} when used
5180 with @option{--list} and @option{--extract} options. When used with
5181 @option{--create} option, it creates an incremental archive without
5182 creating snapshot file. Thus, it is impossible to create several
5183 levels of incremental backups with @option{--incremental} option.
5184
5185 @node Backup Levels
5186 @section Levels of Backups
5187
5188 An archive containing all the files in the file system is called a
5189 @dfn{full backup} or @dfn{full dump}. You could insure your data by
5190 creating a full dump every day. This strategy, however, would waste a
5191 substantial amount of archive media and user time, as unchanged files
5192 are daily re-archived.
5193
5194 It is more efficient to do a full dump only occasionally. To back up
5195 files between full dumps, you can use @dfn{incremental dumps}. A @dfn{level
5196 one} dump archives all the files that have changed since the last full
5197 dump.
5198
5199 A typical dump strategy would be to perform a full dump once a week,
5200 and a level one dump once a day. This means some versions of files
5201 will in fact be archived more than once, but this dump strategy makes
5202 it possible to restore a file system to within one day of accuracy by
5203 only extracting two archives---the last weekly (full) dump and the
5204 last daily (level one) dump. The only information lost would be in
5205 files changed or created since the last daily backup. (Doing dumps
5206 more than once a day is usually not worth the trouble).
5207
5208 @GNUTAR{} comes with scripts you can use to do full
5209 and level-one (actually, even level-two and so on) dumps. Using
5210 scripts (shell programs) to perform backups and restoration is a
5211 convenient and reliable alternative to typing out file name lists
5212 and @command{tar} commands by hand.
5213
5214 Before you use these scripts, you need to edit the file
5215 @file{backup-specs}, which specifies parameters used by the backup
5216 scripts and by the restore script. This file is usually located
5217 in @file{/etc/backup} directory. @xref{Backup Parameters}, for its
5218 detailed description. Once the backup parameters are set, you can
5219 perform backups or restoration by running the appropriate script.
5220
5221 The name of the backup script is @code{backup}. The name of the
5222 restore script is @code{restore}. The following sections describe
5223 their use in detail.
5224
5225 @emph{Please Note:} The backup and restoration scripts are
5226 designed to be used together. While it is possible to restore files by
5227 hand from an archive which was created using a backup script, and to create
5228 an archive by hand which could then be extracted using the restore script,
5229 it is easier to use the scripts. @xref{Incremental Dumps}, before
5230 making such an attempt.
5231
5232 @node Backup Parameters
5233 @section Setting Parameters for Backups and Restoration
5234
5235 The file @file{backup-specs} specifies backup parameters for the
5236 backup and restoration scripts provided with @command{tar}. You must
5237 edit @file{backup-specs} to fit your system configuration and schedule
5238 before using these scripts.
5239
5240 Syntactically, @file{backup-specs} is a shell script, containing
5241 mainly variable assignments. However, any valid shell construct
5242 is allowed in this file. Particularly, you may wish to define
5243 functions within that script (e.g., see @code{RESTORE_BEGIN} below).
5244 For more information about shell script syntax, please refer to
5245 @url{http://www.opengroup.org/onlinepubs/009695399/utilities/xcu_chap02.html#ta
5246 g_02, the definition of the Shell Command Language}. See also
5247 @ref{Top,,Bash Features,bashref,Bash Reference Manual}.
5248
5249 The shell variables controlling behavior of @code{backup} and
5250 @code{restore} are described in the following subsections.
5251
5252 @menu
5253 * General-Purpose Variables::
5254 * Magnetic Tape Control::
5255 * User Hooks::
5256 * backup-specs example:: An Example Text of @file{Backup-specs}
5257 @end menu
5258
5259 @node General-Purpose Variables
5260 @subsection General-Purpose Variables
5261
5262 @defvr {Backup variable} ADMINISTRATOR
5263 The user name of the backup administrator. @code{Backup} scripts
5264 sends a backup report to this address.
5265 @end defvr
5266
5267 @defvr {Backup variable} BACKUP_HOUR
5268 The hour at which the backups are done. This can be a number from 0
5269 to 23, or the time specification in form @var{hours}:@var{minutes},
5270 or the string @samp{now}.
5271
5272 This variable is used by @code{backup}. Its value may be overridden
5273 using @option{--time} option (@pxref{Scripted Backups}).
5274 @end defvr
5275
5276 @defvr {Backup variable} TAPE_FILE
5277
5278 The device @command{tar} writes the archive to. If @var{TAPE_FILE}
5279 is a remote archive (@pxref{remote-dev}), backup script will suppose
5280 that your @command{mt} is able to access remote devices. If @var{RSH}
5281 (@pxref{RSH}) is set, @option{--rsh-command} option will be added to
5282 invocations of @command{mt}.
5283 @end defvr
5284
5285 @defvr {Backup variable} BLOCKING
5286
5287 The blocking factor @command{tar} will use when writing the dump archive.
5288 @xref{Blocking Factor}.
5289 @end defvr
5290
5291 @defvr {Backup variable} BACKUP_DIRS
5292
5293 A list of file systems to be dumped (for @code{backup}), or restored
5294 (for @code{restore}). You can include any directory
5295 name in the list --- subdirectories on that file system will be
5296 included, regardless of how they may look to other networked machines.
5297 Subdirectories on other file systems will be ignored.
5298
5299 The host name specifies which host to run @command{tar} on, and should
5300 normally be the host that actually contains the file system. However,
5301 the host machine must have @GNUTAR{} installed, and
5302 must be able to access the directory containing the backup scripts and
5303 their support files using the same file name that is used on the
5304 machine where the scripts are run (ie. what @command{pwd} will print
5305 when in that directory on that machine). If the host that contains
5306 the file system does not have this capability, you can specify another
5307 host as long as it can access the file system through NFS.
5308
5309 If the list of file systems is very long you may wish to put it
5310 in a separate file. This file is usually named
5311 @file{/etc/backup/dirs}, but this name may be overridden in
5312 @file{backup-specs} using @code{DIRLIST} variable.
5313 @end defvr
5314
5315 @defvr {Backup variable} DIRLIST
5316
5317 A path to the file containing the list of the file systems to backup
5318 or restore. By default it is @file{/etc/backup/dirs}.
5319 @end defvr
5320
5321 @defvr {Backup variable} BACKUP_FILES
5322
5323 A list of individual files to be dumped (for @code{backup}), or restored
5324 (for @code{restore}). These should be accessible from the machine on
5325 which the backup script is run.
5326
5327 If the list of file systems is very long you may wish to store it
5328 in a separate file. This file is usually named
5329 @file{/etc/backup/files}, but this name may be overridden in
5330 @file{backup-specs} using @code{FILELIST} variable.
5331 @end defvr
5332
5333 @defvr {Backup variable} FILELIST
5334
5335 A path to the file containing the list of the individual files to backup
5336 or restore. By default it is @file{/etc/backup/files}.
5337 @end defvr
5338
5339 @defvr {Backup variable} MT
5340
5341 Full file name of @command{mt} binary.
5342 @end defvr
5343
5344 @defvr {Backup variable} RSH
5345 @anchor{RSH}
5346 Full file name of @command{rsh} binary or its equivalent. You may wish to
5347 set it to @code{ssh}, to improve security. In this case you will have
5348 to use public key authentication.
5349 @end defvr
5350
5351 @defvr {Backup variable} RSH_COMMAND
5352
5353 Full file name of @command{rsh} binary on remote mashines. This will
5354 be passed via @option{--rsh-command} option to the remote invocation
5355 of @GNUTAR{}.
5356 @end defvr
5357
5358 @defvr {Backup variable} VOLNO_FILE
5359
5360 Name of temporary file to hold volume numbers. This needs to be accessible
5361 by all the machines which have file systems to be dumped.
5362 @end defvr
5363
5364 @defvr {Backup variable} XLIST
5365
5366 Name of @dfn{exclude file list}. An @dfn{exclude file list} is a file
5367 located on the remote machine and containing the list of files to
5368 be excluded from the backup. Exclude file lists are searched in
5369 /etc/tar-backup directory. A common use for exclude file lists
5370 is to exclude files containing security-sensitive information
5371 (e.g., @file{/etc/shadow} from backups).
5372
5373 This variable affects only @code{backup}.
5374 @end defvr
5375
5376 @defvr {Backup variable} SLEEP_TIME
5377
5378 Time to sleep between dumps of any two successive file systems
5379
5380 This variable affects only @code{backup}.
5381 @end defvr
5382
5383 @defvr {Backup variable} DUMP_REMIND_SCRIPT
5384
5385 Script to be run when it's time to insert a new tape in for the next
5386 volume. Administrators may want to tailor this script for their site.
5387 If this variable isn't set, @GNUTAR{} will display its built-in prompt
5388 @FIXME-xref{describe it somewhere!}, and will expect confirmation from
5389 the console.
5390 @end defvr
5391
5392 @defvr {Backup variable} SLEEP_MESSAGE
5393
5394 Message to display on the terminal while waiting for dump time. Usually
5395 this will just be some literal text.
5396 @end defvr
5397
5398 @defvr {Backup variable} TAR
5399
5400 Full file name of the @GNUTAR{} executable. If this is not set, backup
5401 scripts will search @command{tar} in the current shell path.
5402 @end defvr
5403
5404 @node Magnetic Tape Control
5405 @subsection Magnetic Tape Control
5406
5407 Backup scripts access tape device using special @dfn{hook functions}.
5408 These functions take a single argument -- the name of the tape
5409 device. Their names are kept in the following variables:
5410
5411 @defvr {Backup variable} MT_BEGIN
5412 The name of @dfn{begin} function. This function is called before
5413 accessing the drive. By default it retensions the tape:
5414
5415 @smallexample
5416 MT_BEGIN=mt_begin
5417
5418 mt_begin() @{
5419 mt -f "$1" retension
5420 @}
5421 @end smallexample
5422 @end defvr
5423
5424 @defvr {Backup variable} MT_REWIND
5425 The name of @dfn{rewind} function. The default definition is as
5426 follows:
5427
5428 @smallexample
5429 MT_REWIND=mt_rewind
5430
5431 mt_rewind() @{
5432 mt -f "$1" rewind
5433 @}
5434 @end smallexample
5435
5436 @end defvr
5437
5438 @defvr {Backup variable} MT_OFFLINE
5439 The name of the function switching the tape off line. By default
5440 it is defined as follows:
5441
5442 @smallexample
5443 MT_OFFLINE=mt_offline
5444
5445 mt_offline() @{
5446 mt -f "$1" offl
5447 @}
5448 @end smallexample
5449 @end defvr
5450
5451 @defvr {Backup variable} MT_STATUS
5452 The name of the function used to obtain the status of the archive device,
5453 including error count. Default definition:
5454
5455 @smallexample
5456 MT_STATUS=mt_status
5457
5458 mt_status() @{
5459 mt -f "$1" status
5460 @}
5461 @end smallexample
5462 @end defvr
5463
5464 @node User Hooks
5465 @subsection User Hooks
5466
5467 @dfn{User hooks} are shell functions executed before and after
5468 each @command{tar} invocation. Thus, there are @dfn{backup
5469 hooks}, which are executed before and after dumping each file
5470 system, and @dfn{restore hooks}, executed before and
5471 after restoring a file system. Each user hook is a shell function
5472 taking four arguments:
5473
5474 @deffn {User Hook Function} hook @var{level} @var{host} @var{fs} @var{fsname}
5475 Its arguments are:
5476
5477 @table @var
5478 @item level
5479 Current backup or restore level.
5480
5481 @item host
5482 Name or IP address of the host machine being dumped or restored.
5483
5484 @item fs
5485 Full path name to the file system being dumped or restored.
5486
5487 @item fsname
5488 File system name with directory separators replaced with colons. This
5489 is useful, e.g., for creating unique files.
5490 @end table
5491 @end deffn
5492
5493 Following variables keep the names of user hook functions
5494
5495 @defvr {Backup variable} DUMP_BEGIN
5496 Dump begin function. It is executed before dumping the file system.
5497 @end defvr
5498
5499 @defvr {Backup variable} DUMP_END
5500 Executed after dumping the file system.
5501 @end defvr
5502
5503 @defvr {Backup variable} RESTORE_BEGIN
5504 Executed before restoring the file system.
5505 @end defvr
5506
5507 @defvr {Backup variable} RESTORE_END
5508 Executed after restoring the file system.
5509 @end defvr
5510
5511 @node backup-specs example
5512 @subsection An Example Text of @file{Backup-specs}
5513
5514 The following is an example of @file{backup-specs}:
5515
5516 @smallexample
5517 # site-specific parameters for file system backup.
5518
5519 ADMINISTRATOR=friedman
5520 BACKUP_HOUR=1
5521 TAPE_FILE=/dev/nrsmt0
5522
5523 # Use @code{ssh} instead of the less secure @code{rsh}
5524 RSH=/usr/bin/ssh
5525 RSH_COMMAND=/usr/bin/ssh
5526
5527 # Override MT_STATUS function:
5528 my_status() @{
5529 mts -t $TAPE_FILE
5530 @}
5531 MT_STATUS=my_status
5532
5533 # Disable MT_OFFLINE function
5534 MT_OFFLINE=:
5535
5536 BLOCKING=124
5537 BACKUP_DIRS="
5538 albert:/fs/fsf
5539 apple-gunkies:/gd
5540 albert:/fs/gd2
5541 albert:/fs/gp
5542 geech:/usr/jla
5543 churchy:/usr/roland
5544 albert:/
5545 albert:/usr
5546 apple-gunkies:/
5547 apple-gunkies:/usr
5548 gnu:/hack
5549 gnu:/u
5550 apple-gunkies:/com/mailer/gnu
5551 apple-gunkies:/com/archive/gnu"
5552
5553 BACKUP_FILES="/com/mailer/aliases /com/mailer/league*[a-z]"
5554
5555 @end smallexample
5556
5557 @node Scripted Backups
5558 @section Using the Backup Scripts
5559
5560 The syntax for running a backup script is:
5561
5562 @smallexample
5563 backup --level=@var{level} --time=@var{time}
5564 @end smallexample
5565
5566 The @option{level} option requests the dump level. Thus, to produce
5567 a full dump, specify @code{--level=0} (this is the default, so
5568 @option{--level} may be omitted if its value is @code{0}).
5569 @footnote{For backward compatibility, the @code{backup} will also
5570 try to deduce the requested dump level from the name of the
5571 script itself. If the name consists of a string @samp{level-}
5572 followed by a single decimal digit, that digit is taken as
5573 the dump level number. Thus, you may create a link from @code{backup}
5574 to @code{level-1} and then run @code{level-1} whenever you need to
5575 create a level one dump.}
5576
5577 The @option{--time} option determines when should the backup be
5578 run. @var{Time} may take three forms:
5579
5580 @table @asis
5581 @item @var{hh}:@var{mm}
5582
5583 The dump must be run at @var{hh} hours @var{mm} minutes.
5584
5585 @item @var{hh}
5586
5587 The dump must be run at @var{hh} hours
5588
5589 @item now
5590
5591 The dump must be run immediately.
5592 @end table
5593
5594 You should start a script with a tape or disk mounted. Once you
5595 start a script, it prompts you for new tapes or disks as it
5596 needs them. Media volumes don't have to correspond to archive
5597 files --- a multi-volume archive can be started in the middle of a
5598 tape that already contains the end of another multi-volume archive.
5599 The @code{restore} script prompts for media by its archive volume,
5600 so to avoid an error message you should keep track of which tape
5601 (or disk) contains which volume of the archive (@pxref{Scripted
5602 Restoration}).
5603
5604 The backup scripts write two files on the file system. The first is a
5605 record file in @file{/etc/tar-backup/}, which is used by the scripts
5606 to store and retrieve information about which files were dumped. This
5607 file is not meant to be read by humans, and should not be deleted by
5608 them. @xref{Snapshot Files}, for a more detailed explanation of this
5609 file.
5610
5611 The second file is a log file containing the names of the file systems
5612 and files dumped, what time the backup was made, and any error
5613 messages that were generated, as well as how much space was left in
5614 the media volume after the last volume of the archive was written.
5615 You should check this log file after every backup. The file name is
5616 @file{log-@var{mm-dd-yyyy}-level-@var{n}}, where @var{mm-dd-yyyy}
5617 represents current date, and @var{n} represents current dump level number.
5618
5619 The script also prints the name of each system being dumped to the
5620 standard output.
5621
5622 Following is the full list of options accepted by @code{backup}
5623 script:
5624
5625 @table @option
5626 @item -l @var{level}
5627 @itemx --level=@var{level}
5628 Do backup level @var{level} (default 0).
5629
5630 @item -f
5631 @itemx --force
5632 Force backup even if today's log file already exists.
5633
5634 @item -v[@var{level}]
5635 @itemx --verbose[=@var{level}]
5636 Set verbosity level. The higher the level is, the more debugging
5637 information will be output during execution. Devault @var{level}
5638 is 100, which means the highest debugging level.
5639
5640 @item -t @var{start-time}
5641 @itemx --time=@var{start-time}
5642 Wait till @var{time}, then do backup.
5643
5644 @item -h
5645 @itemx --help
5646 Display short help message and exit.
5647
5648 @item -L
5649 @itemx --license
5650 Display program license and exit.
5651
5652 @item -V
5653 @itemx --version
5654 Display program version and exit.
5655 @end table
5656
5657
5658 @node Scripted Restoration
5659 @section Using the Restore Script
5660
5661 To restore files that were archived using a scripted backup, use the
5662 @code{restore} script. Its usage is quite straightforward. In the
5663 simplest form, invoke @code{restore --all}, it will
5664 then restore all the file systems and files specified in
5665 @file{backup-specs} (@pxref{General-Purpose Variables,BACKUP_DIRS}).
5666
5667 You may select the file systems (and/or files) to restore by
5668 giving @code{restore} list of @dfn{patterns} in its command
5669 line. For example, running
5670
5671 @smallexample
5672 restore 'albert:*'
5673 @end smallexample
5674
5675 @noindent
5676 will restore all file systems on the machine @samp{albert}. A more
5677 complicated example:
5678
5679 @smallexample
5680 restore 'albert:*' '*:/var'
5681 @end smallexample
5682
5683 @noindent
5684 This command will restore all file systems on the machine @samp{albert}
5685 as well as @file{/var} file system on all machines.
5686
5687 By default @code{restore} will start restoring files from the lowest
5688 available dump level (usually zero) and will continue through
5689 all available dump levels. There may be situations where such a
5690 thorough restore is not necessary. For example, you may wish to
5691 restore only files from the recent level one backup. To do so,
5692 use @option{--level} option, as shown in the example below:
5693
5694 @smallexample
5695 restore --level=1
5696 @end smallexample
5697
5698 The full list of options accepted by @code{restore} follows:
5699
5700 @table @option
5701 @item -a
5702 @itemx --all
5703 Restore all file systems and files specified in @file{backup-specs}
5704
5705 @item -l @var{level}
5706 @itemx --level=@var{level}
5707 Start restoring from the given backup level, instead of the default 0.
5708
5709 @item -v[@var{level}]
5710 @itemx --verbose[=@var{level}]
5711 Set verbosity level. The higher the level is, the more debugging
5712 information will be output during execution. Devault @var{level}
5713 is 100, which means the highest debugging level.
5714
5715 @item -h
5716 @itemx --help
5717 Display short help message and exit.
5718
5719 @item -L
5720 @itemx --license
5721 Display program license and exit.
5722
5723 @item -V
5724 @itemx --version
5725 Display program version and exit.
5726 @end table
5727
5728 You should start the restore script with the media containing the
5729 first volume of the archive mounted. The script will prompt for other
5730 volumes as they are needed. If the archive is on tape, you don't need
5731 to rewind the tape to to its beginning---if the tape head is
5732 positioned past the beginning of the archive, the script will rewind
5733 the tape as needed. @FIXME-xref{Media, for a discussion of tape
5734 positioning.}
5735
5736 @quotation
5737 @strong{Warning:} The script will delete files from the active file
5738 system if they were not in the file system when the archive was made.
5739 @end quotation
5740
5741 @xref{Incremental Dumps}, for an explanation of how the script makes
5742 that determination.
5743
5744 @node Choosing
5745 @chapter Choosing Files and Names for @command{tar}
5746 @UNREVISED
5747
5748 @FIXME{Melissa (still) Doesn't Really Like This ``Intro'' Paragraph!!!}
5749
5750 Certain options to @command{tar} enable you to specify a name for your
5751 archive. Other options let you decide which files to include or exclude
5752 from the archive, based on when or whether files were modified, whether
5753 the file names do or don't match specified patterns, or whether files
5754 are in specified directories.
5755
5756 @menu
5757 * file:: Choosing the Archive's Name
5758 * Selecting Archive Members::
5759 * files:: Reading Names from a File
5760 * exclude:: Excluding Some Files
5761 * Wildcards::
5762 * after:: Operating Only on New Files
5763 * recurse:: Descending into Directories
5764 * one:: Crossing File System Boundaries
5765 @end menu
5766
5767 @node file
5768 @section Choosing and Naming Archive Files
5769 @UNREVISED
5770
5771 @FIXME{should the title of this section actually be, "naming an
5772 archive"?}
5773
5774 @cindex Naming an archive
5775 @cindex Archive Name
5776 @cindex Choosing an archive file
5777 @cindex Where is the archive?
5778 By default, @command{tar} uses an archive file name that was compiled when
5779 it was built on the system; usually this name refers to some physical
5780 tape drive on the machine. However, the person who installed @command{tar}
5781 on the system may not set the default to a meaningful value as far as
5782 most users are concerned. As a result, you will usually want to tell
5783 @command{tar} where to find (or create) the archive. The @option{--file=@var{archive-name}} (@option{-f @var{archive-name}})
5784 option allows you to either specify or name a file to use as the archive
5785 instead of the default archive file location.
5786
5787 @table @option
5788 @opindex file, short description
5789 @item --file=@var{archive-name}
5790 @itemx -f @var{archive-name}
5791 Name the archive to create or operate on. Use in conjunction with
5792 any operation.
5793 @end table
5794
5795 For example, in this @command{tar} command,
5796
5797 @smallexample
5798 $ @kbd{tar -cvf collection.tar blues folk jazz}
5799 @end smallexample
5800
5801 @noindent
5802 @file{collection.tar} is the name of the archive. It must directly
5803 follow the @option{-f} option, since whatever directly follows @option{-f}
5804 @emph{will} end up naming the archive. If you neglect to specify an
5805 archive name, you may end up overwriting a file in the working directory
5806 with the archive you create since @command{tar} will use this file's name
5807 for the archive name.
5808
5809 An archive can be saved as a file in the file system, sent through a
5810 pipe or over a network, or written to an I/O device such as a tape,
5811 floppy disk, or CD write drive.
5812
5813 @cindex Writing new archives
5814 @cindex Archive creation
5815 If you do not name the archive, @command{tar} uses the value of the
5816 environment variable @env{TAPE} as the file name for the archive. If
5817 that is not available, @command{tar} uses a default, compiled-in archive
5818 name, usually that for tape unit zero (ie. @file{/dev/tu00}).
5819 @command{tar} always needs an archive name.
5820
5821 If you use @file{-} as an @var{archive-name}, @command{tar} reads the
5822 archive from standard input (when listing or extracting files), or
5823 writes it to standard output (when creating an archive). If you use
5824 @file{-} as an @var{archive-name} when modifying an archive,
5825 @command{tar} reads the original archive from its standard input and
5826 writes the entire new archive to its standard output.
5827
5828 @FIXME{might want a different example here; this is already used in
5829 "notable tar usages".}
5830
5831 @smallexample
5832 $ @kbd{cd sourcedir; tar -cf - . | (cd targetdir; tar -xf -)}
5833 @end smallexample
5834
5835 @FIXME{help!}
5836
5837 @cindex Standard input and output
5838 @cindex tar to standard input and output
5839 @anchor{remote-dev}
5840 To specify an archive file on a device attached to a remote machine,
5841 use the following:
5842
5843 @smallexample
5844 @kbd{--file=@var{hostname}:/@var{dev}/@var{file name}}
5845 @end smallexample
5846
5847 @noindent
5848 @command{tar} will complete the remote connection, if possible, and
5849 prompt you for a username and password. If you use
5850 @option{--file=@@@var{hostname}:/@var{dev}/@var{file name}}, @command{tar}
5851 will complete the remote connection, if possible, using your username
5852 as the username on the remote machine.
5853
5854 @cindex Local and remote archives
5855 @anchor{local and remote archives}
5856 If the archive file name includes a colon (@samp{:}), then it is assumed
5857 to be a file on another machine. If the archive file is
5858 @samp{@var{user}@@@var{host}:@var{file}}, then @var{file} is used on the
5859 host @var{host}. The remote host is accessed using the @command{rsh}
5860 program, with a username of @var{user}. If the username is omitted
5861 (along with the @samp{@@} sign), then your user name will be used.
5862 (This is the normal @command{rsh} behavior.) It is necessary for the
5863 remote machine, in addition to permitting your @command{rsh} access, to
5864 have the @file{rmt} program installed (This command is included in
5865 the @GNUTAR{} distribution and by default is installed under
5866 @file{@var{prefix}/libexec/rmt}, were @var{prefix} means your
5867 installation prefix). If you need to use a file whose name includes a
5868 colon, then the remote tape drive behavior
5869 can be inhibited by using the @option{--force-local} option.
5870
5871 @FIXME{i know we went over this yesterday, but bob (and now i do again,
5872 too) thinks it's out of the middle of nowhere. it doesn't seem to tie
5873 into what came before it well enough <<i moved it now, is it better
5874 here?>>. bob also comments that if Amanda isn't free software, we
5875 shouldn't mention it..}
5876
5877 When the archive is being created to @file{/dev/null}, @GNUTAR{}
5878 tries to minimize input and output operations. The
5879 Amanda backup system, when used with @GNUTAR{}, has
5880 an initial sizing pass which uses this feature.
5881
5882 @node Selecting Archive Members
5883 @section Selecting Archive Members
5884 @cindex Specifying files to act on
5885 @cindex Specifying archive members
5886
5887 @dfn{File Name arguments} specify which files in the file system
5888 @command{tar} operates on, when creating or adding to an archive, or which
5889 archive members @command{tar} operates on, when reading or deleting from
5890 an archive. @xref{Operations}.
5891
5892 To specify file names, you can include them as the last arguments on
5893 the command line, as follows:
5894 @smallexample
5895 @kbd{tar} @var{operation} [@var{option1} @var{option2} @dots{}] [@var{file name-1} @var{file name-2} @dots{}]
5896 @end smallexample
5897
5898 If a file name begins with dash (@samp{-}), preceede it with
5899 @option{--add-file} option to preventit from being treated as an
5900 option.
5901
5902 If you specify a directory name as a file name argument, all the files
5903 in that directory are operated on by @command{tar}.
5904
5905 If you do not specify files when @command{tar} is invoked with
5906 @option{--create} (@option{-c}), @command{tar} operates on all the non-directory files in
5907 the working directory. If you specify either @option{--list} (@option{-t}) or
5908 @option{--extract} (@option{--get}, @option{-x}), @command{tar} operates on all the archive members in the
5909 archive. If you specify any operation other than one of these three,
5910 @command{tar} does nothing.
5911
5912 By default, @command{tar} takes file names from the command line. However,
5913 there are other ways to specify file or member names, or to modify the
5914 manner in which @command{tar} selects the files or members upon which to
5915 operate. @FIXME{add xref here}In general, these methods work both for
5916 specifying the names of files and archive members.
5917
5918 @node files
5919 @section Reading Names from a File
5920
5921 @cindex Reading file names from a file
5922 @cindex Lists of file names
5923 @cindex File Name arguments, alternatives
5924 Instead of giving the names of files or archive members on the command
5925 line, you can put the names into a file, and then use the
5926 @option{--files-from=@var{file-of-names}} (@option{-T @var{file-of-names}}) option to @command{tar}. Give the name of the file
5927 which contains the list of files to include as the argument to
5928 @option{--files-from}. In the list, the file names should be separated by
5929 newlines. You will frequently use this option when you have generated
5930 the list of files to archive with the @command{find} utility.
5931
5932 @table @option
5933 @opindex files-from
5934 @item --files-from=@var{file name}
5935 @itemx -T @var{file name}
5936 Get names to extract or create from file @var{file name}.
5937 @end table
5938
5939 If you give a single dash as a file name for @option{--files-from}, (i.e.,
5940 you specify either @code{--files-from=-} or @code{-T -}), then the file
5941 names are read from standard input.
5942
5943 Unless you are running @command{tar} with @option{--create}, you can not use
5944 both @code{--files-from=-} and @code{--file=-} (@code{-f -}) in the same
5945 command.
5946
5947 Any number of @option{-T} options can be given in the command line.
5948
5949 @FIXME{add bob's example, from his message on 2-10-97}
5950
5951 The following example shows how to use @command{find} to generate a list of
5952 files smaller than 400K in length and put that list into a file
5953 called @file{small-files}. You can then use the @option{-T} option to
5954 @command{tar} to specify the files from that file, @file{small-files}, to
5955 create the archive @file{little.tgz}. (The @option{-z} option to
5956 @command{tar} compresses the archive with @command{gzip}; @pxref{gzip} for
5957 more information.)
5958
5959 @smallexample
5960 $ @kbd{find . -size -400 -print > small-files}
5961 $ @kbd{tar -c -v -z -T small-files -f little.tgz}
5962 @end smallexample
5963
5964 @noindent
5965 In the file list given by @option{-T} option, any file name beginning
5966 with @samp{-} character is considered a @command{tar} option and is
5967 processed accordingly.@footnote{Versions of @GNUTAR{} up to 1.15.1
5968 recognized only @option{-C} option in file lists, and only if the
5969 option and its argument occupied two consecutive lines.} For example,
5970 the common use of this feature is to change to another directory by
5971 specifying @option{-C} option:
5972
5973 @smallexample
5974 @group
5975 $ @kbd{cat list}
5976 -C/etc
5977 passwd
5978 hosts
5979 -C/lib
5980 libc.a
5981 $ @kbd{tar -c -f foo.tar --files-from list}
5982 @end group
5983 @end smallexample
5984
5985 @noindent
5986 In this example, @command{tar} will first switch to @file{/etc}
5987 directory and add files @file{passwd} and @file{hosts} to the
5988 archive. Then it will change to @file{/lib} directory and will archive
5989 the file @file{libc.a}. Thus, the resulting archive @file{foo.tar} will
5990 contain:
5991
5992 @smallexample
5993 @group
5994 $ @kbd{tar tf foo.tar}
5995 passwd
5996 hosts
5997 libc.a
5998 @end group
5999 @end smallexample
6000
6001 @noindent
6002 @opindex directory, using in @option{--files-from} argument
6003 Notice that the option parsing algorithm used with @option{-T} is
6004 stricter than the one used by shell. Namely, when specifying option
6005 arguments, you should observe the following rules:
6006
6007 @itemize @bullet
6008 @item
6009 When using short (single-letter) option form, its argument must
6010 immediately follow the option letter, without any intervening
6011 whitespace. For example: @code{-Cdir}.
6012
6013 @item
6014 When using long option form, the option argument must be separated
6015 from the option by a single equal sign. No whitespace is allowed on
6016 any side of the equal sign. For example: @code{--directory=dir}.
6017
6018 @item
6019 For both short and long option forms, the option argument can be given
6020 on the next line after the option name, e.g.:
6021
6022 @smallexample
6023 @group
6024 --directory
6025 dir
6026 @end group
6027 @end smallexample
6028
6029 @noindent
6030 and
6031
6032 @smallexample
6033 @group
6034 -C
6035 dir
6036 @end group
6037 @end smallexample
6038 @end itemize
6039
6040 @opindex add-file
6041 If you happen to have a file whose name starts with @samp{-},
6042 precede it with @option{--add-file} option to prevent it from
6043 being recognized as an option. For example: @code{--add-file --my-file}.
6044
6045 @menu
6046 * nul::
6047 @end menu
6048
6049 @node nul
6050 @subsection @code{NUL} Terminated File Names
6051
6052 @cindex File names, terminated by @code{NUL}
6053 @cindex @code{NUL} terminated file names
6054 The @option{--null} option causes @option{--files-from=@var{file-of-names}} (@option{-T @var{file-of-names}}) to read file
6055 names terminated by a @code{NUL} instead of a newline, so files whose
6056 names contain newlines can be archived using @option{--files-from}.
6057
6058 @table @option
6059 @opindex null
6060 @item --null
6061 Only consider @code{NUL} terminated file names, instead of files that
6062 terminate in a newline.
6063 @end table
6064
6065 The @option{--null} option is just like the one in @acronym{GNU}
6066 @command{xargs} and @command{cpio}, and is useful with the
6067 @option{-print0} predicate of @acronym{GNU} @command{find}. In
6068 @command{tar}, @option{--null} also disables special handling for
6069 file names that begin with dash.
6070
6071 This example shows how to use @command{find} to generate a list of files
6072 larger than 800K in length and put that list into a file called
6073 @file{long-files}. The @option{-print0} option to @command{find} is just
6074 like @option{-print}, except that it separates files with a @code{NUL}
6075 rather than with a newline. You can then run @command{tar} with both the
6076 @option{--null} and @option{-T} options to specify that @command{tar} get the
6077 files from that file, @file{long-files}, to create the archive
6078 @file{big.tgz}. The @option{--null} option to @command{tar} will cause
6079 @command{tar} to recognize the @code{NUL} separator between files.
6080
6081 @smallexample
6082 $ @kbd{find . -size +800 -print0 > long-files}
6083 $ @kbd{tar -c -v --null --files-from=long-files --file=big.tar}
6084 @end smallexample
6085
6086 @FIXME{say anything else here to conclude the section?}
6087
6088 @node exclude
6089 @section Excluding Some Files
6090 @UNREVISED
6091
6092 @cindex File names, excluding files by
6093 @cindex Excluding files by name and pattern
6094 @cindex Excluding files by file system
6095 To avoid operating on files whose names match a particular pattern,
6096 use the @option{--exclude} or @option{--exclude-from} options.
6097
6098 @table @option
6099 @opindex exclude
6100 @item --exclude=@var{pattern}
6101 Causes @command{tar} to ignore files that match the @var{pattern}.
6102 @end table
6103
6104 @findex exclude
6105 The @option{--exclude=@var{pattern}} option prevents any file or member whose name
6106 matches the shell wildcard (@var{pattern}) from being operated on.
6107 For example, to create an archive with all the contents of the directory
6108 @file{src} except for files whose names end in @file{.o}, use the
6109 command @samp{tar -cf src.tar --exclude='*.o' src}.
6110
6111 You may give multiple @option{--exclude} options.
6112
6113 @table @option
6114 @opindex exclude-from
6115 @item --exclude-from=@var{file}
6116 @itemx -X @var{file}
6117 Causes @command{tar} to ignore files that match the patterns listed in
6118 @var{file}.
6119 @end table
6120
6121 @findex exclude-from
6122 Use the @option{--exclude-from} option to read a
6123 list of patterns, one per line, from @var{file}; @command{tar} will
6124 ignore files matching those patterns. Thus if @command{tar} is
6125 called as @w{@samp{tar -c -X foo .}} and the file @file{foo} contains a
6126 single line @file{*.o}, no files whose names end in @file{.o} will be
6127 added to the archive.
6128
6129 @FIXME{do the exclude options files need to have stuff separated by
6130 newlines the same as the files-from option does?}
6131
6132 @table @option
6133 @opindex exclude-caches
6134 @item --exclude-caches
6135 Causes @command{tar} to ignore directories containing a cache directory tag.
6136 @end table
6137
6138 @findex exclude-caches
6139 When creating an archive, the @option{--exclude-caches} option causes
6140 @command{tar} to exclude all directories that contain a @dfn{cache
6141 directory tag}. A cache directory tag is a short file with the
6142 well-known name @file{CACHEDIR.TAG} and having a standard header
6143 specified in @url{http://www.brynosaurus.com/cachedir/spec.html}.
6144 Various applications write cache directory tags into directories they
6145 use to hold regenerable, non-precious data, so that such data can be
6146 more easily excluded from backups.
6147
6148 @menu
6149 * controlling pattern-matching with exclude::
6150 * problems with exclude::
6151 @end menu
6152
6153 @node controlling pattern-matching with exclude
6154 @unnumberedsubsec Controlling Pattern-Matching with the @code{exclude} Options
6155
6156 Normally, a pattern matches a name if an initial subsequence of the
6157 name's components matches the pattern, where @samp{*}, @samp{?}, and
6158 @samp{[...]} are the usual shell wildcards, @samp{\} escapes wildcards,
6159 and wildcards can match @samp{/}.
6160
6161 Other than optionally stripping leading @samp{/} from names
6162 (@pxref{absolute}), patterns and names are used as-is. For
6163 example, trailing @samp{/} is not trimmed from a user-specified name
6164 before deciding whether to exclude it.
6165
6166 However, this matching procedure can be altered by the options listed
6167 below. These options accumulate. For example:
6168
6169 @smallexample
6170 --ignore-case --exclude='makefile' --no-ignore-case ---exclude='readme'
6171 @end smallexample
6172
6173 ignores case when excluding @samp{makefile}, but not when excluding
6174 @samp{readme}.
6175
6176 @table @option
6177 @opindex anchored
6178 @opindex no-anchored
6179 @item --anchored
6180 @itemx --no-anchored
6181 If anchored, a pattern must match an initial subsequence
6182 of the name's components. Otherwise, the pattern can match any
6183 subsequence. Default is @option{--no-anchored}.
6184
6185 @opindex ignore-case
6186 @opindex no-ignore-case
6187 @item --ignore-case
6188 @itemx --no-ignore-case
6189 When ignoring case, upper-case patterns match lower-case names and vice versa.
6190 When not ignoring case (the default), matching is case-sensitive.
6191
6192 @opindex wildcards
6193 @opindex no-wildcards
6194 @item --wildcards
6195 @itemx --no-wildcards
6196 When using wildcards (the default), @samp{*}, @samp{?}, and @samp{[...]}
6197 are the usual shell wildcards, and @samp{\} escapes wildcards.
6198 Otherwise, none of these characters are special, and patterns must match
6199 names literally.
6200
6201 @opindex wildcards-match-slash
6202 @opindex no-wildcards-match-slash
6203 @item --wildcards-match-slash
6204 @itemx --no-wildcards-match-slash
6205 When wildcards match slash (the default), a wildcard like @samp{*} in
6206 the pattern can match a @samp{/} in the name. Otherwise, @samp{/} is
6207 matched only by @samp{/}.
6208
6209 @end table
6210
6211 The @option{--recursion} and @option{--no-recursion} options
6212 (@pxref{recurse}) also affect how exclude patterns are interpreted. If
6213 recursion is in effect, a pattern excludes a name if it matches any of
6214 the name's parent directories.
6215
6216 @node problems with exclude
6217 @unnumberedsubsec Problems with Using the @code{exclude} Options
6218
6219 @opindex exclude, potential problems with
6220 Some users find @samp{exclude} options confusing. Here are some common
6221 pitfalls:
6222
6223 @itemize @bullet
6224 @item
6225 The main operating mode of @command{tar} does not act on a path name
6226 explicitly listed on the command line if one of its file name
6227 components is excluded. In the example above, if
6228 you create an archive and exclude files that end with @samp{*.o}, but
6229 explicitly name the file @samp{dir.o/foo} after all the options have been
6230 listed, @samp{dir.o/foo} will be excluded from the archive.
6231
6232 @item
6233 You can sometimes confuse the meanings of @option{--exclude} and
6234 @option{--exclude-from}. Be careful: use @option{--exclude} when files
6235 to be excluded are given as a pattern on the command line. Use
6236 @option{--exclude-from} to introduce the name of a file which contains
6237 a list of patterns, one per line; each of these patterns can exclude
6238 zero, one, or many files.
6239
6240 @item
6241 When you use @option{--exclude=@var{pattern}}, be sure to quote the @var{pattern}
6242 parameter, so @GNUTAR{} sees wildcard characters
6243 like @samp{*}. If you do not do this, the shell might expand the
6244 @samp{*} itself using files at hand, so @command{tar} might receive a
6245 list of files instead of one pattern, or none at all, making the
6246 command somewhat illegal. This might not correspond to what you want.
6247
6248 For example, write:
6249
6250 @smallexample
6251 $ @kbd{tar -c -f @var{archive.tar} --exclude '*.o' @var{directory}}
6252 @end smallexample
6253
6254 @noindent
6255 rather than:
6256
6257 @smallexample
6258 $ @kbd{tar -c -f @var{archive.tar} --exclude *.o @var{directory}}
6259 @end smallexample
6260
6261 @item
6262 You must use use shell syntax, or globbing, rather than @code{regexp}
6263 syntax, when using exclude options in @command{tar}. If you try to use
6264 @code{regexp} syntax to describe files to be excluded, your command
6265 might fail.
6266
6267 @item
6268 In earlier versions of @command{tar}, what is now the
6269 @option{--exclude-from} option was called @option{--exclude} instead.
6270 Now, @option{--exclude} applies to patterns listed on the command
6271 line and @option{--exclude-from} applies to patterns listed in a
6272 file.
6273
6274 @end itemize
6275
6276 @node Wildcards
6277 @section Wildcards Patterns and Matching
6278
6279 @dfn{Globbing} is the operation by which @dfn{wildcard} characters,
6280 @samp{*} or @samp{?} for example, are replaced and expanded into all
6281 existing files matching the given pattern. However, @command{tar} often
6282 uses wildcard patterns for matching (or globbing) archive members instead
6283 of actual files in the file system. Wildcard patterns are also used for
6284 verifying volume labels of @command{tar} archives. This section has the
6285 purpose of explaining wildcard syntax for @command{tar}.
6286
6287 @FIXME{the next few paragraphs need work.}
6288
6289 A @var{pattern} should be written according to shell syntax, using wildcard
6290 characters to effect globbing. Most characters in the pattern stand
6291 for themselves in the matched string, and case is significant: @samp{a}
6292 will match only @samp{a}, and not @samp{A}. The character @samp{?} in the
6293 pattern matches any single character in the matched string. The character
6294 @samp{*} in the pattern matches zero, one, or more single characters in
6295 the matched string. The character @samp{\} says to take the following
6296 character of the pattern @emph{literally}; it is useful when one needs to
6297 match the @samp{?}, @samp{*}, @samp{[} or @samp{\} characters, themselves.
6298
6299 The character @samp{[}, up to the matching @samp{]}, introduces a character
6300 class. A @dfn{character class} is a list of acceptable characters
6301 for the next single character of the matched string. For example,
6302 @samp{[abcde]} would match any of the first five letters of the alphabet.
6303 Note that within a character class, all of the ``special characters''
6304 listed above other than @samp{\} lose their special meaning; for example,
6305 @samp{[-\\[*?]]} would match any of the characters, @samp{-}, @samp{\},
6306 @samp{[}, @samp{*}, @samp{?}, or @samp{]}. (Due to parsing constraints,
6307 the characters @samp{-} and @samp{]} must either come @emph{first} or
6308 @emph{last} in a character class.)
6309
6310 @cindex Excluding characters from a character class
6311 @cindex Character class, excluding characters from
6312 If the first character of the class after the opening @samp{[}
6313 is @samp{!} or @samp{^}, then the meaning of the class is reversed.
6314 Rather than listing character to match, it lists those characters which
6315 are @emph{forbidden} as the next single character of the matched string.
6316
6317 Other characters of the class stand for themselves. The special
6318 construction @samp{[@var{a}-@var{e}]}, using an hyphen between two
6319 letters, is meant to represent all characters between @var{a} and
6320 @var{e}, inclusive.
6321
6322 @FIXME{need to add a sentence or so here to make this clear for those
6323 who don't have dan around.}
6324
6325 Periods (@samp{.}) or forward slashes (@samp{/}) are not considered
6326 special for wildcard matches. However, if a pattern completely matches
6327 a directory prefix of a matched string, then it matches the full matched
6328 string: excluding a directory also excludes all the files beneath it.
6329
6330 @node after
6331 @section Operating Only on New Files
6332 @UNREVISED
6333
6334 @cindex Excluding file by age
6335 @cindex Data Modification time, excluding files by
6336 @cindex Modification time, excluding files by
6337 @cindex Age, excluding files by
6338 The @option{--after-date=@var{date}} (@option{--newer=@var{date}},
6339 @option{-N @var{date}}) option causes @command{tar} to only work on
6340 files whose data modification or status change times are newer than
6341 the @var{date} given. If @var{date} starts with @samp{/} or @samp{.},
6342 it is taken to be a file name; the data modification time of that file
6343 is used as the date. If you use this option when creating or appending
6344 to an archive, the archive will only include new files. If you use
6345 @option{--after-date} when extracting an archive, @command{tar} will
6346 only extract files newer than the @var{date} you specify.
6347
6348 If you only want @command{tar} to make the date comparison based on
6349 modification of the file's data (rather than status
6350 changes), then use the @option{--newer-mtime=@var{date}} option.
6351
6352 You may use these options with any operation. Note that these options
6353 differ from the @option{--update} (@option{-u}) operation in that they
6354 allow you to specify a particular date against which @command{tar} can
6355 compare when deciding whether or not to archive the files.
6356
6357 @table @option
6358 @opindex after-date
6359 @opindex newer
6360 @item --after-date=@var{date}
6361 @itemx --newer=@var{date}
6362 @itemx -N @var{date}
6363 Only store files newer than @var{date}.
6364
6365 Acts on files only if their data modification or status change times are
6366 later than @var{date}. Use in conjunction with any operation.
6367
6368 If @var{date} starts with @samp{/} or @samp{.}, it is taken to be a file
6369 name; the data modification time of that file is used as the date.
6370
6371 @opindex newer-mtime
6372 @item --newer-mtime=@var{date}
6373 Acts like @option{--after-date}, but only looks at data modification times.
6374 @end table
6375
6376 These options limit @command{tar} to operate only on files which have
6377 been modified after the date specified. A file's status is considered to have
6378 changed if its contents have been modified, or if its owner,
6379 permissions, and so forth, have been changed. (For more information on
6380 how to specify a date, see @ref{Date input formats}; remember that the
6381 entire date argument must be quoted if it contains any spaces.)
6382
6383 Gurus would say that @option{--after-date} tests both the data
6384 modification time (@code{mtime}, the time the contents of the file
6385 were last modified) and the status change time (@code{ctime}, the time
6386 the file's status was last changed: owner, permissions, etc.@:)
6387 fields, while @option{--newer-mtime} tests only the @code{mtime}
6388 field.
6389
6390 To be precise, @option{--after-date} checks @emph{both} @code{mtime} and
6391 @code{ctime} and processes the file if either one is more recent than
6392 @var{date}, while @option{--newer-mtime} only checks @code{mtime} and
6393 disregards @code{ctime}. Neither does it use @code{atime} (the last time the
6394 contents of the file were looked at).
6395
6396 Date specifiers can have embedded spaces. Because of this, you may need
6397 to quote date arguments to keep the shell from parsing them as separate
6398 arguments.
6399
6400 @FIXME{Need example of --newer-mtime with quoted argument.}
6401
6402 @quotation
6403 @strong{Please Note:} @option{--after-date} and @option{--newer-mtime}
6404 should not be used for incremental backups. Some files (such as those
6405 in renamed directories) are not selected properly by these options.
6406 @xref{Incremental Dumps}.
6407 @end quotation
6408
6409 @noindent
6410 @FIXME{which tells -- need to fill this in!}
6411
6412 @node recurse
6413 @section Descending into Directories
6414 @UNREVISED
6415 @cindex Avoiding recursion in directories
6416 @cindex Descending directories, avoiding
6417 @cindex Directories, avoiding recursion
6418 @cindex Recursion in directories, avoiding
6419
6420 @FIXME{arrggh! this is still somewhat confusing to me. :-< }
6421
6422 @FIXME{show dan bob's comments, from 2-10-97}
6423
6424 Usually, @command{tar} will recursively explore all directories (either
6425 those given on the command line or through the @option{--files-from}
6426 option) for the various files they contain. However, you may not always
6427 want @command{tar} to act this way.
6428
6429 @opindex no-recursion
6430 The @option{--no-recursion} option inhibits @command{tar}'s recursive descent
6431 into specified directories. If you specify @option{--no-recursion}, you can
6432 use the @command{find} utility for hunting through levels of directories to
6433 construct a list of file names which you could then pass to @command{tar}.
6434 @command{find} allows you to be more selective when choosing which files to
6435 archive; see @ref{files} for more information on using @command{find} with
6436 @command{tar}, or look.
6437
6438 @table @option
6439 @item --no-recursion
6440 Prevents @command{tar} from recursively descending directories.
6441
6442 @opindex recursion
6443 @item --recursion
6444 Requires @command{tar} to recursively descend directories.
6445 This is the default.
6446 @end table
6447
6448 When you use @option{--no-recursion}, @GNUTAR{} grabs
6449 directory entries themselves, but does not descend on them
6450 recursively. Many people use @command{find} for locating files they
6451 want to back up, and since @command{tar} @emph{usually} recursively
6452 descends on directories, they have to use the @samp{@w{! -d}} option
6453 to @command{find} @FIXME{needs more explanation or a cite to another
6454 info file}as they usually do not want all the files in a directory.
6455 They then use the @option{--files-from} option to archive the files
6456 located via @command{find}.
6457
6458 The problem when restoring files archived in this manner is that the
6459 directories themselves are not in the archive; so the
6460 @option{--same-permissions} (@option{--preserve-permissions},
6461 @option{-p}) option does not affect them---while users might really
6462 like it to. Specifying @option{--no-recursion} is a way to tell
6463 @command{tar} to grab only the directory entries given to it, adding
6464 no new files on its own.
6465
6466 The @option{--no-recursion} option also applies when extracting: it
6467 causes @command{tar} to extract only the matched directory entries, not
6468 the files under those directories.
6469
6470 The @option{--no-recursion} option also affects how exclude patterns
6471 are interpreted (@pxref{controlling pattern-matching with exclude}).
6472
6473 The @option{--no-recursion} and @option{--recursion} options apply to
6474 later options and operands, and can be overridden by later occurrences
6475 of @option{--no-recursion} and @option{--recursion}. For example:
6476
6477 @smallexample
6478 $ @kbd{tar -cf jams.tar --no-recursion grape --recursion grape/concord}
6479 @end smallexample
6480
6481 @noindent
6482 creates an archive with one entry for @file{grape}, and the recursive
6483 contents of @file{grape/concord}, but no entries under @file{grape}
6484 other than @file{grape/concord}.
6485
6486 @node one
6487 @section Crossing File System Boundaries
6488 @cindex File system boundaries, not crossing
6489 @UNREVISED
6490
6491 @command{tar} will normally automatically cross file system boundaries in
6492 order to archive files which are part of a directory tree. You can
6493 change this behavior by running @command{tar} and specifying
6494 @option{--one-file-system} (@option{-l}). This option only affects files that are
6495 archived because they are in a directory that is being archived;
6496 @command{tar} will still archive files explicitly named on the command line
6497 or through @option{--files-from}, regardless of where they reside.
6498
6499 @table @option
6500 @opindex one-file-system
6501 @item --one-file-system
6502 @itemx -l
6503 Prevents @command{tar} from crossing file system boundaries when
6504 archiving. Use in conjunction with any write operation.
6505 @end table
6506
6507 The @option{--one-file-system} option causes @command{tar} to modify its
6508 normal behavior in archiving the contents of directories. If a file in
6509 a directory is not on the same file system as the directory itself, then
6510 @command{tar} will not archive that file. If the file is a directory
6511 itself, @command{tar} will not archive anything beneath it; in other words,
6512 @command{tar} will not cross mount points.
6513
6514 It is reported that using this option, the mount point is is archived,
6515 but nothing under it.
6516
6517 This option is useful for making full or incremental archival backups of
6518 a file system. If this option is used in conjunction with
6519 @option{--verbose} (@option{-v}), files that are excluded are mentioned by name on the
6520 standard error.
6521
6522 @menu
6523 * directory:: Changing Directory
6524 * absolute:: Absolute File Names
6525 @end menu
6526
6527 @node directory
6528 @subsection Changing the Working Directory
6529 @UNREVISED
6530
6531 @FIXME{need to read over this node now for continuity; i've switched
6532 things around some.}
6533
6534 @cindex Changing directory mid-stream
6535 @cindex Directory, changing mid-stream
6536 @cindex Working directory, specifying
6537 To change the working directory in the middle of a list of file names,
6538 either on the command line or in a file specified using
6539 @option{--files-from} (@option{-T}), use @option{--directory} (@option{-C}).
6540 This will change the working directory to the specified directory
6541 after that point in the list.
6542
6543 @table @option
6544 @opindex directory
6545 @item --directory=@var{directory}
6546 @itemx -C @var{directory}
6547 Changes the working directory in the middle of a command line.
6548 @end table
6549
6550 For example,
6551
6552 @smallexample
6553 $ @kbd{tar -c -f jams.tar grape prune -C food cherry}
6554 @end smallexample
6555
6556 @noindent
6557 will place the files @file{grape} and @file{prune} from the current
6558 directory into the archive @file{jams.tar}, followed by the file
6559 @file{cherry} from the directory @file{food}. This option is especially
6560 useful when you have several widely separated files that you want to
6561 store in the same archive.
6562
6563 Note that the file @file{cherry} is recorded in the archive under the
6564 precise name @file{cherry}, @emph{not} @file{food/cherry}. Thus, the
6565 archive will contain three files that all appear to have come from the
6566 same directory; if the archive is extracted with plain @samp{tar
6567 --extract}, all three files will be written in the current directory.
6568
6569 Contrast this with the command,
6570
6571 @smallexample
6572 $ @kbd{tar -c -f jams.tar grape prune -C food red/cherry}
6573 @end smallexample
6574
6575 @noindent
6576 which records the third file in the archive under the name
6577 @file{red/cherry} so that, if the archive is extracted using
6578 @samp{tar --extract}, the third file will be written in a subdirectory
6579 named @file{orange-colored}.
6580
6581 You can use the @option{--directory} option to make the archive
6582 independent of the original name of the directory holding the files.
6583 The following command places the files @file{/etc/passwd},
6584 @file{/etc/hosts}, and @file{/lib/libc.a} into the archive
6585 @file{foo.tar}:
6586
6587 @smallexample
6588 $ @kbd{tar -c -f foo.tar -C /etc passwd hosts -C /lib libc.a}
6589 @end smallexample
6590
6591 @noindent
6592 However, the names of the archive members will be exactly what they were
6593 on the command line: @file{passwd}, @file{hosts}, and @file{libc.a}.
6594 They will not appear to be related by file name to the original
6595 directories where those files were located.
6596
6597 Note that @option{--directory} options are interpreted consecutively. If
6598 @option{--directory} specifies a relative file name, it is interpreted
6599 relative to the then current directory, which might not be the same as
6600 the original current working directory of @command{tar}, due to a previous
6601 @option{--directory} option.
6602
6603 When using @option{--files-from} (@pxref{files}), you can put various
6604 @command{tar} options (including @option{-C}) in the file list. Notice,
6605 however, that in this case the option and its argument may not be
6606 separated by whitespace. If you use short option, its argument must
6607 either follow the option letter immediately, without any intervening
6608 whitespace, or occupy the next line. Otherwise, if you use long
6609 option, separate its argument by an equal sign.
6610
6611 For instance, the file list for the above example will be:
6612
6613 @smallexample
6614 @group
6615 -C
6616 /etc
6617 passwd
6618 hosts
6619 -C
6620 /lib
6621 libc.a
6622 @end group
6623 @end smallexample
6624
6625 @noindent
6626 To use it, you would invoke @command{tar} as follows:
6627
6628 @smallexample
6629 $ @kbd{tar -c -f foo.tar --files-from list}
6630 @end smallexample
6631
6632 Notice also that you can only use the short option variant in the file
6633 list, i.e., always use @option{-C}, not @option{--directory}.
6634
6635 The interpretation of @option{--directory} is disabled by
6636 @option{--null} option.
6637
6638 @node absolute
6639 @subsection Absolute File Names
6640 @UNREVISED
6641
6642 @table @option
6643 @opindex absolute-names
6644 @item --absolute-names
6645 @itemx -P
6646 Do not strip leading slashes from file names, and permit file names
6647 containing a @file{..} file name component.
6648 @end table
6649
6650 By default, @GNUTAR{} drops a leading @samp{/} on
6651 input or output, and complains about file names containing a @file{..}
6652 component. This option turns off this behavior.
6653
6654 When @command{tar} extracts archive members from an archive, it strips any
6655 leading slashes (@samp{/}) from the member name. This causes absolute
6656 member names in the archive to be treated as relative file names. This
6657 allows you to have such members extracted wherever you want, instead of
6658 being restricted to extracting the member in the exact directory named
6659 in the archive. For example, if the archive member has the name
6660 @file{/etc/passwd}, @command{tar} will extract it as if the name were
6661 really @file{etc/passwd}.
6662
6663 File names containing @file{..} can cause problems when extracting, so
6664 @command{tar} normally warns you about such files when creating an
6665 archive, and rejects attempts to extracts such files.
6666
6667 Other @command{tar} programs do not do this. As a result, if you
6668 create an archive whose member names start with a slash, they will be
6669 difficult for other people with a non-@GNUTAR{}
6670 program to use. Therefore, @GNUTAR{} also strips
6671 leading slashes from member names when putting members into the
6672 archive. For example, if you ask @command{tar} to add the file
6673 @file{/bin/ls} to an archive, it will do so, but the member name will
6674 be @file{bin/ls}.@footnote{A side effect of this is that when
6675 @option{--create} is used with @option{--verbose} the resulting output
6676 is not, generally speaking, the same as the one you'd get running
6677 @kbd{tar --list} command. This may be important if you use some
6678 scripts for comparing both outputs. @xref{listing member and file names},
6679 for the information on how to handle this case.}
6680
6681 If you use the @option{--absolute-names} (@option{-P}) option,
6682 @command{tar} will do none of these transformations.
6683
6684 To archive or extract files relative to the root directory, specify
6685 the @option{--absolute-names} (@option{-P}) option.
6686
6687 Normally, @command{tar} acts on files relative to the working
6688 directory---ignoring superior directory names when archiving, and
6689 ignoring leading slashes when extracting.
6690
6691 When you specify @option{--absolute-names} (@option{-P}),
6692 @command{tar} stores file names including all superior directory
6693 names, and preserves leading slashes. If you only invoked
6694 @command{tar} from the root directory you would never need the
6695 @option{--absolute-names} option, but using this option
6696 may be more convenient than switching to root.
6697
6698 @FIXME{Should be an example in the tutorial/wizardry section using this
6699 to transfer files between systems.}
6700
6701 @FIXME{Is write access an issue?}
6702
6703 @table @option
6704 @item --absolute-names
6705 Preserves full file names (including superior directory names) when
6706 archiving files. Preserves leading slash when extracting files.
6707
6708 @end table
6709
6710 @FIXME{this is still horrible; need to talk with dan on monday.}
6711
6712 @command{tar} prints out a message about removing the @samp{/} from
6713 file names. This message appears once per @GNUTAR{}
6714 invocation. It represents something which ought to be told; ignoring
6715 what it means can cause very serious surprises, later.
6716
6717 Some people, nevertheless, do not want to see this message. Wanting to
6718 play really dangerously, one may of course redirect @command{tar} standard
6719 error to the sink. For example, under @command{sh}:
6720
6721 @smallexample
6722 $ @kbd{tar -c -f archive.tar /home 2> /dev/null}
6723 @end smallexample
6724
6725 @noindent
6726 Another solution, both nicer and simpler, would be to change to
6727 the @file{/} directory first, and then avoid absolute notation.
6728 For example:
6729
6730 @smallexample
6731 $ @kbd{(cd / && tar -c -f archive.tar home)}
6732 $ @kbd{tar -c -f archive.tar -C / home}
6733 @end smallexample
6734
6735 @include getdate.texi
6736
6737 @node Formats
6738 @chapter Controlling the Archive Format
6739
6740 @cindex Tar archive formats
6741 Due to historical reasons, there are several formats of tar archives.
6742 All of them are based on the same principles, but have some subtle
6743 differences that often make them incompatible with each other.
6744
6745 GNU tar is able to create and handle archives in a variety of formats.
6746 The most frequently used formats are (in alphabetical order):
6747
6748 @table @asis
6749 @item gnu
6750 Format used by @GNUTAR{} versions up to 1.13.25. This format derived
6751 from an early @acronym{POSIX} standard, adding some improvements such as
6752 sparse file handling and incremental archives. Unfortunately these
6753 features were implemented in a way incompatible with other archive
6754 formats.
6755
6756 Archives in @samp{gnu} format are able to hold pathnames of unlimited
6757 length.
6758
6759 @item oldgnu
6760 Format used by @GNUTAR{} of versions prior to 1.12.
6761
6762 @item v7
6763 Archive format, compatible with the V7 implementation of tar. This
6764 format imposes a number of limitations. The most important of them
6765 are:
6766
6767 @enumerate
6768 @item The maximum length of a file name is limited to 99 characters.
6769 @item The maximum length of a symbolic link is limited to 99 characters.
6770 @item It is impossible to store special files (block and character
6771 devices, fifos etc.)
6772 @item Maximum value of user or group ID is limited to 2097151 (7777777
6773 octal)
6774 @item V7 archives do not contain symbolic ownership information (user
6775 and group name of the file owner).
6776 @end enumerate
6777
6778 This format has traditionally been used by Automake when producing
6779 Makefiles. This practice will change in the future, in the meantime,
6780 however this means that projects containing filenames more than 99
6781 characters long will not be able to use @GNUTAR{} @value{VERSION} and
6782 Automake prior to 1.9.
6783
6784 @item ustar
6785 Archive format defined by @acronym{POSIX.1-1988} specification. It stores
6786 symbolic ownership information. It is also able to store
6787 special files. However, it imposes several restrictions as well:
6788
6789 @enumerate
6790 @item The maximum length of a file name is limited to 256 characters,
6791 provided that the filename can be split at directory separator in
6792 two parts, first of them being at most 155 bytes long. So, in most
6793 cases the maximum file name length will be shorter than 256
6794 characters.
6795 @item The maximum length of a symbolic link name is limited to
6796 100 characters.
6797 @item Maximum size of a file the archive is able to accomodate
6798 is 8GB
6799 @item Maximum value of UID/GID is 2097151.
6800 @item Maximum number of bits in device major and minor numbers is 21.
6801 @end enumerate
6802
6803 @item star
6804 Format used by J@"org Schilling @command{star}
6805 implementation. @GNUTAR{} is able to read @samp{star} archives but
6806 currently does not produce them.
6807
6808 @item posix
6809 Archive format defined by @acronym{POSIX.1-2001} specification. This is the
6810 most flexible and feature-rich format. It does not impose any
6811 restrictions on file sizes or filename lengths. This format is quite
6812 recent, so not all tar implementations are able to handle it properly.
6813 However, this format is designed in such a way that any tar
6814 implementation able to read @samp{ustar} archives will be able to read
6815 most @samp{posix} archives as well, with the only exception that any
6816 additional information (such as long file names etc.) will in such
6817 case be extracted as plain text files along with the files it refers to.
6818
6819 This archive format will be the default format for future versions
6820 of @GNUTAR{}.
6821
6822 @end table
6823
6824 The following table summarizes the limitations of each of these
6825 formats:
6826
6827 @multitable @columnfractions .10 .20 .20 .20 .20
6828 @headitem Format @tab UID @tab File Size @tab Path Name @tab Devn
6829 @item gnu @tab 1.8e19 @tab Unlimited @tab Unlimited @tab 63
6830 @item oldgnu @tab 1.8e19 @tab Unlimited @tab Unlimited @tab 63
6831 @item v7 @tab 2097151 @tab 8GB @tab 99 @tab n/a
6832 @item ustar @tab 2097151 @tab 8GB @tab 256 @tab 21
6833 @item posix @tab Unlimited @tab Unlimited @tab Unlimited @tab Unlimited
6834 @end multitable
6835
6836 The default format for @GNUTAR{} is defined at compilation
6837 time. You may check it by running @command{tar --help}, and examining
6838 the last lines of its output. Usually, @GNUTAR{} is configured
6839 to create archives in @samp{gnu} format, however, future version will
6840 switch to @samp{posix}.
6841
6842 @menu
6843 * Portability:: Making @command{tar} Archives More Portable
6844 * Compression:: Using Less Space through Compression
6845 * Attributes:: Handling File Attributes
6846 * Standard:: The Standard Format
6847 * Extensions:: @acronym{GNU} Extensions to the Archive Format
6848 * cpio:: Comparison of @command{tar} and @command{cpio}
6849 @end menu
6850
6851 @node Portability
6852 @section Making @command{tar} Archives More Portable
6853
6854 Creating a @command{tar} archive on a particular system that is meant to be
6855 useful later on many other machines and with other versions of @command{tar}
6856 is more challenging than you might think. @command{tar} archive formats
6857 have been evolving since the first versions of Unix. Many such formats
6858 are around, and are not always compatible with each other. This section
6859 discusses a few problems, and gives some advice about making @command{tar}
6860 archives more portable.
6861
6862 One golden rule is simplicity. For example, limit your @command{tar}
6863 archives to contain only regular files and directories, avoiding
6864 other kind of special files. Do not attempt to save sparse files or
6865 contiguous files as such. Let's discuss a few more problems, in turn.
6866
6867 @FIXME{Discuss GNU extensions (incremental backups, multi-volume
6868 archives and archive labels) in GNU and PAX formats.}
6869
6870 @menu
6871 * Portable Names:: Portable Names
6872 * dereference:: Symbolic Links
6873 * old:: Old V7 Archives
6874 * ustar:: Ustar Archives
6875 * gnu:: GNU and old GNU format archives.
6876 * posix:: @acronym{POSIX} archives
6877 * Checksumming:: Checksumming Problems
6878 * Large or Negative Values:: Large files, negative time stamps, etc.
6879 @end menu
6880
6881 @node Portable Names
6882 @subsection Portable Names
6883
6884 Use portable file and member names. A name is portable if it contains
6885 only ASCII letters and digits, @samp{/}, @samp{.}, @samp{_}, and
6886 @samp{-}; it cannot be empty, start with @samp{-} or @samp{//}, or
6887 contain @samp{/-}. Avoid deep directory nesting. For portability to
6888 old Unix hosts, limit your file name components to 14 characters or
6889 less.
6890
6891 If you intend to have your @command{tar} archives to be read under
6892 MSDOS, you should not rely on case distinction for file names, and you
6893 might use the @acronym{GNU} @command{doschk} program for helping you
6894 further diagnosing illegal MSDOS names, which are even more limited
6895 than System V's.
6896
6897 @node dereference
6898 @subsection Symbolic Links
6899 @cindex File names, using symbolic links
6900 @cindex Symbolic link as file name
6901
6902 @opindex dereference
6903 Normally, when @command{tar} archives a symbolic link, it writes a
6904 block to the archive naming the target of the link. In that way, the
6905 @command{tar} archive is a faithful record of the file system contents.
6906 @option{--dereference} (@option{-h}) is used with @option{--create} (@option{-c}), and causes
6907 @command{tar} to archive the files symbolic links point to, instead of
6908 the links themselves. When this option is used, when @command{tar}
6909 encounters a symbolic link, it will archive the linked-to file,
6910 instead of simply recording the presence of a symbolic link.
6911
6912 The name under which the file is stored in the file system is not
6913 recorded in the archive. To record both the symbolic link name and
6914 the file name in the system, archive the file under both names. If
6915 all links were recorded automatically by @command{tar}, an extracted file
6916 might be linked to a file name that no longer exists in the file
6917 system.
6918
6919 If a linked-to file is encountered again by @command{tar} while creating
6920 the same archive, an entire second copy of it will be stored. (This
6921 @emph{might} be considered a bug.)
6922
6923 So, for portable archives, do not archive symbolic links as such,
6924 and use @option{--dereference} (@option{-h}): many systems do not support
6925 symbolic links, and moreover, your distribution might be unusable if
6926 it contains unresolved symbolic links.
6927
6928 @node old
6929 @subsection Old V7 Archives
6930 @cindex Format, old style
6931 @cindex Old style format
6932 @cindex Old style archives
6933 @cindex v7 archive format
6934
6935 Certain old versions of @command{tar} cannot handle additional
6936 information recorded by newer @command{tar} programs. To create an
6937 archive in V7 format (not ANSI), which can be read by these old
6938 versions, specify the @option{--format=v7} option in
6939 conjunction with the @option{--create} (@option{-c}) (@command{tar} also
6940 accepts @option{--portability} or @samp{op-old-archive} for this
6941 option). When you specify it,
6942 @command{tar} leaves out information about directories, pipes, fifos,
6943 contiguous files, and device files, and specifies file ownership by
6944 group and user IDs instead of group and user names.
6945
6946 When updating an archive, do not use @option{--format=v7}
6947 unless the archive was created using this option.
6948
6949 In most cases, a @emph{new} format archive can be read by an @emph{old}
6950 @command{tar} program without serious trouble, so this option should
6951 seldom be needed. On the other hand, most modern @command{tar}s are
6952 able to read old format archives, so it might be safer for you to
6953 always use @option{--format=v7} for your distributions.
6954
6955 @node ustar
6956 @subsection Ustar Archive Format
6957
6958 @cindex ustar archive format
6959 Archive format defined by @acronym{POSIX}.1-1988 specification is called
6960 @code{ustar}. Although it is more flexible than the V7 format, it
6961 still has many restrictions (@xref{Formats,ustar}, for the detailed
6962 description of @code{ustar} format). Along with V7 format,
6963 @code{ustar} format is a good choice for archives intended to be read
6964 with other implementations of @command{tar}.
6965
6966 To create archive in @code{ustar} format, use @option{--format=ustar}
6967 option in conjunction with the @option{--create} (@option{-c}).
6968
6969 @node gnu
6970 @subsection @acronym{GNU} and old @GNUTAR{} format
6971
6972 @cindex GNU archive format
6973 @cindex Old GNU archive format
6974 @GNUTAR{} was based on an early draft of the
6975 @acronym{POSIX} 1003.1 @code{ustar} standard. @acronym{GNU} extensions to
6976 @command{tar}, such as the support for file names longer than 100
6977 characters, use portions of the @command{tar} header record which were
6978 specified in that @acronym{POSIX} draft as unused. Subsequent changes in
6979 @acronym{POSIX} have allocated the same parts of the header record for
6980 other purposes. As a result, @GNUTAR{} format is
6981 incompatible with the current @acronym{POSIX} specification, and with
6982 @command{tar} programs that follow it.
6983
6984 In the majority of cases, @command{tar} will be configured to create
6985 this format by default. This will change in the future releases, since
6986 we plan to make @samp{posix} format the default.
6987
6988 To force creation a @GNUTAR{} archive, use option
6989 @option{--format=gnu}.
6990
6991 @node posix
6992 @subsection @GNUTAR{} and @acronym{POSIX} @command{tar}
6993
6994 @cindex POSIX archive format
6995 @cindex PAX archive format
6996 The version @value{VERSION} of @GNUTAR{} is able
6997 to read and create archives conforming to @acronym{POSIX.1-2001} standard.
6998
6999 A @acronym{POSIX} conformant archive will be created if @command{tar}
7000 was given @option{--format=posix} option.
7001
7002 @node Checksumming
7003 @subsection Checksumming Problems
7004
7005 SunOS and HP-UX @command{tar} fail to accept archives created using
7006 @GNUTAR{} and containing non-ASCII file names, that
7007 is, file names having characters with the eight bit set, because they
7008 use signed checksums, while @GNUTAR{} uses unsigned
7009 checksums while creating archives, as per @acronym{POSIX} standards. On
7010 reading, @GNUTAR{} computes both checksums and
7011 accept any. It is somewhat worrying that a lot of people may go
7012 around doing backup of their files using faulty (or at least
7013 non-standard) software, not learning about it until it's time to
7014 restore their missing files with an incompatible file extractor, or
7015 vice versa.
7016
7017 @GNUTAR{} compute checksums both ways, and accept
7018 any on read, so @acronym{GNU} tar can read Sun tapes even with their
7019 wrong checksums. @GNUTAR{} produces the standard
7020 checksum, however, raising incompatibilities with Sun. That is to
7021 say, @GNUTAR{} has not been modified to
7022 @emph{produce} incorrect archives to be read by buggy @command{tar}'s.
7023 I've been told that more recent Sun @command{tar} now read standard
7024 archives, so maybe Sun did a similar patch, after all?
7025
7026 The story seems to be that when Sun first imported @command{tar}
7027 sources on their system, they recompiled it without realizing that
7028 the checksums were computed differently, because of a change in
7029 the default signing of @code{char}'s in their compiler. So they
7030 started computing checksums wrongly. When they later realized their
7031 mistake, they merely decided to stay compatible with it, and with
7032 themselves afterwards. Presumably, but I do not really know, HP-UX
7033 has chosen that their @command{tar} archives to be compatible with Sun's.
7034 The current standards do not favor Sun @command{tar} format. In any
7035 case, it now falls on the shoulders of SunOS and HP-UX users to get
7036 a @command{tar} able to read the good archives they receive.
7037
7038 @node Large or Negative Values
7039 @subsection Large or Negative Values
7040 @cindex large values
7041 @cindex future time stamps
7042 @cindex negative time stamps
7043 @UNREVISED{}
7044
7045 The above sections suggest to use @samp{oldest possible} archive
7046 format if in doubt. However, sometimes it is not possible. If you
7047 attempt to archive a file whose metadata cannot be represented using
7048 required format, @GNUTAR{} will print error message and ignore such a
7049 file. You will than have to switch to a format that is able to
7050 handle such values. The format summary table (@pxref{Formats}) will
7051 help you to do so.
7052
7053 In particular, when trying to archive files larger than 8GB or with
7054 timestamps not in the range 1970-01-01 00:00:00 through 2242-03-16
7055 12:56:31 @sc{utc}, you will have to chose between @acronym{GNU} and
7056 @acronym{POSIX} archive formats. When considering which format to
7057 choose, bear in mind that the @acronym{GNU} format uses
7058 two's-complement base-256 notation to store values that do not fit
7059 into standard @acronym{ustar} range. Such archives can generally be
7060 read only by a @GNUTAR{} implementation. Moreover, they sometimes
7061 cannot be correctly restored on another hosts even by @GNUTAR{}. For
7062 example, using two's complement representation for negative time
7063 stamps that assumes a signed 32-bit @code{time_t} generates archives
7064 that are not portable to hosts with differing @code{time_t}
7065 representations.
7066
7067 On the other hand, @acronym{POSIX} archives, generally speaking, can
7068 be extracted by any tar implementation that understands older
7069 @acronym{ustar} format. The only exception are files larger than 8GB.
7070
7071 @FIXME{Describe how @acronym{POSIX} archives are extracted by non
7072 POSIX-aware tars.}
7073
7074 @node Compression
7075 @section Using Less Space through Compression
7076
7077 @menu
7078 * gzip:: Creating and Reading Compressed Archives
7079 * sparse:: Archiving Sparse Files
7080 @end menu
7081
7082 @node gzip
7083 @subsection Creating and Reading Compressed Archives
7084 @cindex Compressed archives
7085 @cindex Storing archives in compressed format
7086
7087 @GNUTAR{} is able to create and read compressed archives. It supports
7088 @command{gzip} and @command{bzip2} compression programs. For backward
7089 compatibilty, it also supports @command{compress} command, although
7090 we strongly recommend against using it, since there is a patent
7091 covering the algorithm it uses and you could be sued for patent
7092 infringement merely by running @command{compress}! Besides, it is less
7093 effective than @command{gzip} and @command{bzip2}.
7094
7095 Creating a compressed archive is simple: you just specify a
7096 @dfn{compression option} along with the usual archive creation
7097 commands. The compression option is @option{-z} (@option{--gzip}) to
7098 create a @command{gzip} compressed archive, @option{-j}
7099 (@option{--bzip2}) to create a @command{bzip2} compressed archive, and
7100 @option{-Z} (@option{--compress}) to use @command{compress} program.
7101 For example:
7102
7103 @smallexample
7104 $ @kbd{tar cfz archive.tar.gz .}
7105 @end smallexample
7106
7107 Reading compressed archive is even simpler: you don't need to specify
7108 any additional options as @GNUTAR{} recognizes its format
7109 automatically. Thus, the following commands will list and extract the
7110 archive created in previous example:
7111
7112 @smallexample
7113 # List the compressed archive
7114 $ @kbd{tar tf archive.tar.gz}
7115 # Extract the compressed archive
7116 $ @kbd{tar xf archive.tar.gz}
7117 @end smallexample
7118
7119 The only case when you have to specify a decompression option while
7120 reading the archive is when reading from a pipe or from a tape drive
7121 that does not support random access. However, in this case @GNUTAR{}
7122 will indicate which option you should use. For example:
7123
7124 @smallexample
7125 $ @kbd{cat archive.tar.gz | tar tf -}
7126 tar: Archive is compressed. Use -z option
7127 tar: Error is not recoverable: exiting now
7128 @end smallexample
7129
7130 If you see such diagnostics, just add the suggested option to the
7131 invocation of @GNUTAR{}:
7132
7133 @smallexample
7134 $ @kbd{cat archive.tar.gz | tar tfz -}
7135 @end smallexample
7136
7137 Notice also, that there are several restrictions on operations on
7138 compressed archives. First of all, compressed archives cannot be
7139 modified, i.e., you cannot update (@option{--update} (@option{-u})) them or delete
7140 (@option{--delete}) members from them. Likewise, you cannot append
7141 another @command{tar} archive to a compressed archive using
7142 @option{--append} (@option{-r})). Secondly, multi-volume archives cannot be
7143 compressed.
7144
7145 The following table summarizes compression options used by @GNUTAR{}.
7146
7147 @table @option
7148 @opindex gzip
7149 @opindex ungzip
7150 @item -z
7151 @itemx --gzip
7152 @itemx --ungzip
7153 Filter the archive through @command{gzip}.
7154
7155 You can use @option{--gzip} and @option{--gunzip} on physical devices
7156 (tape drives, etc.) and remote files as well as on normal files; data
7157 to or from such devices or remote files is reblocked by another copy
7158 of the @command{tar} program to enforce the specified (or default) record
7159 size. The default compression parameters are used; if you need to
7160 override them, set @env{GZIP} environment variable, e.g.:
7161
7162 @smallexample
7163 $ @kbd{GZIP=--best tar cfz archive.tar.gz subdir}
7164 @end smallexample
7165
7166 @noindent
7167 Another way would be to avoid the @option{--gzip} (@option{--gunzip}, @option{--ungzip}, @option{-z}) option and run
7168 @command{gzip} explicitly:
7169
7170 @smallexample
7171 $ @kbd{tar cf - subdir | gzip --best -c - > archive.tar.gz}
7172 @end smallexample
7173
7174 @cindex corrupted archives
7175 About corrupted compressed archives: @command{gzip}'ed files have no
7176 redundancy, for maximum compression. The adaptive nature of the
7177 compression scheme means that the compression tables are implicitly
7178 spread all over the archive. If you lose a few blocks, the dynamic
7179 construction of the compression tables becomes unsynchronized, and there
7180 is little chance that you could recover later in the archive.
7181
7182 There are pending suggestions for having a per-volume or per-file
7183 compression in @GNUTAR{}. This would allow for viewing the
7184 contents without decompression, and for resynchronizing decompression at
7185 every volume or file, in case of corrupted archives. Doing so, we might
7186 lose some compressibility. But this would have make recovering easier.
7187 So, there are pros and cons. We'll see!
7188
7189 @opindex bzip2
7190 @item -j
7191 @itemx --bzip2
7192 Filter the archive through @code{bzip2}. Otherwise like @option{--gzip}.
7193
7194 @opindex compress
7195 @opindex uncompress
7196 @item -Z
7197 @itemx --compress
7198 @itemx --uncompress
7199 Filter the archive through @command{compress}. Otherwise like @option{--gzip}.
7200
7201 The @acronym{GNU} Project recommends you not use
7202 @command{compress}, because there is a patent covering the algorithm it
7203 uses. You could be sued for patent infringement merely by running
7204 @command{compress}.
7205
7206 @opindex use-compress-program
7207 @item --use-compress-program=@var{prog}
7208 Use external compression program @var{prog}. Use this option if you
7209 have a compression program that @GNUTAR{} does not support. There
7210 are two requirements to which @var{prog} should comply:
7211
7212 First, when called without options, it should read data from standard
7213 input, compress it and output it on standard output.
7214
7215 Secondly, if called with @option{-d} argument, it should do exactly
7216 the opposite, i.e., read the compressed data from the standard input
7217 and produce uncompressed data on the standard output.
7218 @end table
7219
7220 @FIXME{I have one question, or maybe it's a suggestion if there isn't a way
7221 to do it now. I would like to use @option{--gzip}, but I'd also like
7222 the output to be fed through a program like @acronym{GNU}
7223 @command{ecc} (actually, right now that's @samp{exactly} what I'd like
7224 to use :-)), basically adding ECC protection on top of compression.
7225 It seems as if this should be quite easy to do, but I can't work out
7226 exactly how to go about it. Of course, I can pipe the standard output
7227 of @command{tar} through @command{ecc}, but then I lose (though I
7228 haven't started using it yet, I confess) the ability to have
7229 @command{tar} use @command{rmt} for it's I/O (I think).
7230
7231 I think the most straightforward thing would be to let me specify a
7232 general set of filters outboard of compression (preferably ordered,
7233 so the order can be automatically reversed on input operations, and
7234 with the options they require specifiable), but beggars shouldn't be
7235 choosers and anything you decide on would be fine with me.
7236
7237 By the way, I like @command{ecc} but if (as the comments say) it can't
7238 deal with loss of block sync, I'm tempted to throw some time at adding
7239 that capability. Supposing I were to actually do such a thing and
7240 get it (apparently) working, do you accept contributed changes to
7241 utilities like that? (Leigh Clayton @file{loc@@soliton.com}, May 1995).
7242
7243 Isn't that exactly the role of the @option{--use-compress-prog=@var{program}} option?
7244 I never tried it myself, but I suspect you may want to write a
7245 @var{prog} script or program able to filter stdin to stdout to
7246 way you want. It should recognize the @option{-d} option, for when
7247 extraction is needed rather than creation.
7248
7249 It has been reported that if one writes compressed data (through the
7250 @option{--gzip} or @option{--compress} options) to a DLT and tries to use
7251 the DLT compression mode, the data will actually get bigger and one will
7252 end up with less space on the tape.}
7253
7254 @node sparse
7255 @subsection Archiving Sparse Files
7256 @cindex Sparse Files
7257 @UNREVISED
7258
7259 @table @option
7260 @opindex sparse
7261 @item -S
7262 @itemx --sparse
7263 Handle sparse files efficiently.
7264 @end table
7265
7266 This option causes all files to be put in the archive to be tested for
7267 sparseness, and handled specially if they are. The @option{--sparse}
7268 (@option{-S}) option is useful when many @code{dbm} files, for example, are being
7269 backed up. Using this option dramatically decreases the amount of
7270 space needed to store such a file.
7271
7272 In later versions, this option may be removed, and the testing and
7273 treatment of sparse files may be done automatically with any special
7274 @acronym{GNU} options. For now, it is an option needing to be specified on
7275 the command line with the creation or updating of an archive.
7276
7277 Files in the file system occasionally have ``holes.'' A hole in a file
7278 is a section of the file's contents which was never written. The
7279 contents of a hole read as all zeros. On many operating systems,
7280 actual disk storage is not allocated for holes, but they are counted
7281 in the length of the file. If you archive such a file, @command{tar}
7282 could create an archive longer than the original. To have @command{tar}
7283 attempt to recognize the holes in a file, use @option{--sparse} (@option{-S}). When
7284 you use this option, then, for any file using less disk space than
7285 would be expected from its length, @command{tar} searches the file for
7286 consecutive stretches of zeros. It then records in the archive for
7287 the file where the consecutive stretches of zeros are, and only
7288 archives the ``real contents'' of the file. On extraction (using
7289 @option{--sparse} is not needed on extraction) any such
7290 files have holes created wherever the continuous stretches of zeros
7291 were found. Thus, if you use @option{--sparse}, @command{tar} archives
7292 won't take more space than the original.
7293
7294 A file is sparse if it contains blocks of zeros whose existence is
7295 recorded, but that have no space allocated on disk. When you specify
7296 the @option{--sparse} option in conjunction with the @option{--create}
7297 (@option{-c}) operation, @command{tar} tests all files for sparseness
7298 while archiving. If @command{tar} finds a file to be sparse, it uses a
7299 sparse representation of the file in the archive. @xref{create}, for
7300 more information about creating archives.
7301
7302 @option{--sparse} is useful when archiving files, such as dbm files,
7303 likely to contain many nulls. This option dramatically
7304 decreases the amount of space needed to store such an archive.
7305
7306 @quotation
7307 @strong{Please Note:} Always use @option{--sparse} when performing file
7308 system backups, to avoid archiving the expanded forms of files stored
7309 sparsely in the system.
7310
7311 Even if your system has no sparse files currently, some may be
7312 created in the future. If you use @option{--sparse} while making file
7313 system backups as a matter of course, you can be assured the archive
7314 will never take more space on the media than the files take on disk
7315 (otherwise, archiving a disk filled with sparse files might take
7316 hundreds of tapes). @FIXME-xref{incremental when node name is set.}
7317 @end quotation
7318
7319 @command{tar} ignores the @option{--sparse} option when reading an archive.
7320
7321 @table @option
7322 @item --sparse
7323 @itemx -S
7324 Files stored sparsely in the file system are represented sparsely in
7325 the archive. Use in conjunction with write operations.
7326 @end table
7327
7328 However, users should be well aware that at archive creation time,
7329 @GNUTAR{} still has to read whole disk file to
7330 locate the @dfn{holes}, and so, even if sparse files use little space
7331 on disk and in the archive, they may sometimes require inordinate
7332 amount of time for reading and examining all-zero blocks of a file.
7333 Although it works, it's painfully slow for a large (sparse) file, even
7334 though the resulting tar archive may be small. (One user reports that
7335 dumping a @file{core} file of over 400 megabytes, but with only about
7336 3 megabytes of actual data, took about 9 minutes on a Sun Sparcstation
7337 ELC, with full CPU utilization.)
7338
7339 This reading is required in all cases and is not related to the fact
7340 the @option{--sparse} option is used or not, so by merely @emph{not}
7341 using the option, you are not saving time@footnote{Well! We should say
7342 the whole truth, here. When @option{--sparse} is selected while creating
7343 an archive, the current @command{tar} algorithm requires sparse files to be
7344 read twice, not once. We hope to develop a new archive format for saving
7345 sparse files in which one pass will be sufficient.}.
7346
7347 Programs like @command{dump} do not have to read the entire file; by
7348 examining the file system directly, they can determine in advance
7349 exactly where the holes are and thus avoid reading through them. The
7350 only data it need read are the actual allocated data blocks.
7351 @GNUTAR{} uses a more portable and straightforward
7352 archiving approach, it would be fairly difficult that it does
7353 otherwise. Elizabeth Zwicky writes to @file{comp.unix.internals}, on
7354 1990-12-10:
7355
7356 @quotation
7357 What I did say is that you cannot tell the difference between a hole and an
7358 equivalent number of nulls without reading raw blocks. @code{st_blocks} at
7359 best tells you how many holes there are; it doesn't tell you @emph{where}.
7360 Just as programs may, conceivably, care what @code{st_blocks} is (care
7361 to name one that does?), they may also care where the holes are (I have
7362 no examples of this one either, but it's equally imaginable).
7363
7364 I conclude from this that good archivers are not portable. One can
7365 arguably conclude that if you want a portable program, you can in good
7366 conscience restore files with as many holes as possible, since you can't
7367 get it right.
7368 @end quotation
7369
7370 @node Attributes
7371 @section Handling File Attributes
7372 @UNREVISED
7373
7374 When @command{tar} reads files, it updates their access times. To
7375 avoid this, use the @option{--atime-preserve[=METHOD]} option, which can either
7376 reset the access time retroactively or avoid changing it in the first
7377 place.
7378
7379 Handling of file attributes
7380
7381 @table @option
7382 @opindex atime-preserve
7383 @item --atime-preserve
7384 @itemx --atime-preserve=replace
7385 @itemx --atime-preserve=system
7386 Preserve the access times of files that are read. This works only for
7387 files that you own, unless you have superuser privileges.
7388
7389 @option{--atime-preserve=replace} works on most systems, but it also
7390 restores the data modification time and updates the status change
7391 time. Hence it doesn't interact with incremental dumps nicely
7392 (@pxref{Backups}), and it can set access or data modification times
7393 incorrectly if other programs access the file while @command{tar} is
7394 running.
7395
7396 @option{--atime-preserve=system} avoids changing the access time in
7397 the first place, if the operating system supports this.
7398 Unfortunately, this may or may not work on any given operating system
7399 or file system. If @command{tar} knows for sure it won't work, it
7400 complains right away.
7401
7402 Currently @option{--atime-preserve} with no operand defaults to
7403 @option{--atime-preserve=replace}, but this is intended to change to
7404 @option{--atime-preserve=system} when the latter is better-supported.
7405
7406 @opindex touch
7407 @item -m
7408 @itemx --touch
7409 Do not extract data modification time.
7410
7411 When this option is used, @command{tar} leaves the data modification times
7412 of the files it extracts as the times when the files were extracted,
7413 instead of setting it to the times recorded in the archive.
7414
7415 This option is meaningless with @option{--list} (@option{-t}).
7416
7417 @opindex same-owner
7418 @item --same-owner
7419 Create extracted files with the same ownership they have in the
7420 archive.
7421
7422 This is the default behavior for the superuser,
7423 so this option is meaningful only for non-root users, when @command{tar}
7424 is executed on those systems able to give files away. This is
7425 considered as a security flaw by many people, at least because it
7426 makes quite difficult to correctly account users for the disk space
7427 they occupy. Also, the @code{suid} or @code{sgid} attributes of
7428 files are easily and silently lost when files are given away.
7429
7430 When writing an archive, @command{tar} writes the user id and user name
7431 separately. If it can't find a user name (because the user id is not
7432 in @file{/etc/passwd}), then it does not write one. When restoring,
7433 and doing a @code{chmod} like when you use @option{--same-permissions},
7434 @FIXME{same-owner?}it tries to look the name (if one was written)
7435 up in @file{/etc/passwd}. If it fails, then it uses the user id
7436 stored in the archive instead.
7437
7438 @opindex no-same-owner
7439 @item --no-same-owner
7440 @itemx -o
7441 Do not attempt to restore ownership when extracting. This is the
7442 default behavior for ordinary users, so this option has an effect
7443 only for the superuser.
7444
7445 @opindex numeric-owner
7446 @item --numeric-owner
7447 The @option{--numeric-owner} option allows (ANSI) archives to be written
7448 without user/group name information or such information to be ignored
7449 when extracting. It effectively disables the generation and/or use
7450 of user/group name information. This option forces extraction using
7451 the numeric ids from the archive, ignoring the names.
7452
7453 This is useful in certain circumstances, when restoring a backup from
7454 an emergency floppy with different passwd/group files for example.
7455 It is otherwise impossible to extract files with the right ownerships
7456 if the password file in use during the extraction does not match the
7457 one belonging to the file system(s) being extracted. This occurs,
7458 for example, if you are restoring your files after a major crash and
7459 had booted from an emergency floppy with no password file or put your
7460 disk into another machine to do the restore.
7461
7462 The numeric ids are @emph{always} saved into @command{tar} archives.
7463 The identifying names are added at create time when provided by the
7464 system, unless @option{--old-archive} (@option{-o}) is used. Numeric ids could be
7465 used when moving archives between a collection of machines using
7466 a centralized management for attribution of numeric ids to users
7467 and groups. This is often made through using the NIS capabilities.
7468
7469 When making a @command{tar} file for distribution to other sites, it
7470 is sometimes cleaner to use a single owner for all files in the
7471 distribution, and nicer to specify the write permission bits of the
7472 files as stored in the archive independently of their actual value on
7473 the file system. The way to prepare a clean distribution is usually
7474 to have some Makefile rule creating a directory, copying all needed
7475 files in that directory, then setting ownership and permissions as
7476 wanted (there are a lot of possible schemes), and only then making a
7477 @command{tar} archive out of this directory, before cleaning
7478 everything out. Of course, we could add a lot of options to
7479 @GNUTAR{} for fine tuning permissions and ownership.
7480 This is not the good way, I think. @GNUTAR{} is
7481 already crowded with options and moreover, the approach just explained
7482 gives you a great deal of control already.
7483
7484 @opindex same-permissions, short description
7485 @opindex preserve-permissions, short description
7486 @item -p
7487 @itemx --same-permissions
7488 @itemx --preserve-permissions
7489 Extract all protection information.
7490
7491 This option causes @command{tar} to set the modes (access permissions) of
7492 extracted files exactly as recorded in the archive. If this option
7493 is not used, the current @code{umask} setting limits the permissions
7494 on extracted files. This option is by default enabled when
7495 @command{tar} is executed by a superuser.
7496
7497
7498 This option is meaningless with @option{--list} (@option{-t}).
7499
7500 @opindex preserve
7501 @item --preserve
7502 Same as both @option{--same-permissions} and @option{--same-order}.
7503
7504 The @option{--preserve} option has no equivalent short option name.
7505 It is equivalent to @option{--same-permissions} plus @option{--same-order}.
7506
7507 @FIXME{I do not see the purpose of such an option. (Neither I. FP.)}
7508
7509 @end table
7510
7511 @node Standard
7512 @section Basic Tar Format
7513 @UNREVISED
7514
7515 While an archive may contain many files, the archive itself is a
7516 single ordinary file. Like any other file, an archive file can be
7517 written to a storage device such as a tape or disk, sent through a
7518 pipe or over a network, saved on the active file system, or even
7519 stored in another archive. An archive file is not easy to read or
7520 manipulate without using the @command{tar} utility or Tar mode in
7521 @acronym{GNU} Emacs.
7522
7523 Physically, an archive consists of a series of file entries terminated
7524 by an end-of-archive entry, which consists of two 512 blocks of zero
7525 bytes. A file
7526 entry usually describes one of the files in the archive (an
7527 @dfn{archive member}), and consists of a file header and the contents
7528 of the file. File headers contain file names and statistics, checksum
7529 information which @command{tar} uses to detect file corruption, and
7530 information about file types.
7531
7532 Archives are permitted to have more than one member with the same
7533 member name. One way this situation can occur is if more than one
7534 version of a file has been stored in the archive. For information
7535 about adding new versions of a file to an archive, see @ref{update}.
7536 @FIXME-xref{To learn more about having more than one archive member with the
7537 same name, see -backup node, when it's written.}
7538
7539 In addition to entries describing archive members, an archive may
7540 contain entries which @command{tar} itself uses to store information.
7541 @xref{label}, for an example of such an archive entry.
7542
7543 A @command{tar} archive file contains a series of blocks. Each block
7544 contains @code{BLOCKSIZE} bytes. Although this format may be thought
7545 of as being on magnetic tape, other media are often used.
7546
7547 Each file archived is represented by a header block which describes
7548 the file, followed by zero or more blocks which give the contents
7549 of the file. At the end of the archive file there are two 512-byte blocks
7550 filled with binary zeros as an end-of-file marker. A reasonable system
7551 should write such end-of-file marker at the end of an archive, but
7552 must not assume that such a block exists when reading an archive. In
7553 particular @GNUTAR{} always issues a warning if it does not encounter it.
7554
7555 The blocks may be @dfn{blocked} for physical I/O operations.
7556 Each record of @var{n} blocks (where @var{n} is set by the
7557 @option{--blocking-factor=@var{512-size}} (@option{-b @var{512-size}}) option to @command{tar}) is written with a single
7558 @w{@samp{write ()}} operation. On magnetic tapes, the result of
7559 such a write is a single record. When writing an archive,
7560 the last record of blocks should be written at the full size, with
7561 blocks after the zero block containing all zeros. When reading
7562 an archive, a reasonable system should properly handle an archive
7563 whose last record is shorter than the rest, or which contains garbage
7564 records after a zero block.
7565
7566 The header block is defined in C as follows. In the @GNUTAR{}
7567 distribution, this is part of file @file{src/tar.h}:
7568
7569 @smallexample
7570 @include header.texi
7571 @end smallexample
7572
7573 All characters in header blocks are represented by using 8-bit
7574 characters in the local variant of ASCII. Each field within the
7575 structure is contiguous; that is, there is no padding used within
7576 the structure. Each character on the archive medium is stored
7577 contiguously.
7578
7579 Bytes representing the contents of files (after the header block
7580 of each file) are not translated in any way and are not constrained
7581 to represent characters in any character set. The @command{tar} format
7582 does not distinguish text files from binary files, and no translation
7583 of file contents is performed.
7584
7585 The @code{name}, @code{linkname}, @code{magic}, @code{uname}, and
7586 @code{gname} are null-terminated character strings. All other fields
7587 are zero-filled octal numbers in ASCII. Each numeric field of width
7588 @var{w} contains @var{w} minus 1 digits, and a null.
7589
7590 The @code{name} field is the file name of the file, with directory names
7591 (if any) preceding the file name, separated by slashes.
7592
7593 @FIXME{how big a name before field overflows?}
7594
7595 The @code{mode} field provides nine bits specifying file permissions
7596 and three bits to specify the Set UID, Set GID, and Save Text
7597 (@dfn{sticky}) modes. Values for these bits are defined above.
7598 When special permissions are required to create a file with a given
7599 mode, and the user restoring files from the archive does not hold such
7600 permissions, the mode bit(s) specifying those special permissions
7601 are ignored. Modes which are not supported by the operating system
7602 restoring files from the archive will be ignored. Unsupported modes
7603 should be faked up when creating or updating an archive; e.g., the
7604 group permission could be copied from the @emph{other} permission.
7605
7606 The @code{uid} and @code{gid} fields are the numeric user and group
7607 ID of the file owners, respectively. If the operating system does
7608 not support numeric user or group IDs, these fields should be ignored.
7609
7610 The @code{size} field is the size of the file in bytes; linked files
7611 are archived with this field specified as zero. @FIXME-xref{Modifiers, in
7612 particular the @option{--incremental} (@option{-G}) option.}
7613
7614 The @code{mtime} field is the data modification time of the file at
7615 the time it was archived. It is the ASCII representation of the octal
7616 value of the last time the file's contents were modified, represented
7617 as an integer number of
7618 seconds since January 1, 1970, 00:00 Coordinated Universal Time.
7619
7620 The @code{chksum} field is the ASCII representation of the octal value
7621 of the simple sum of all bytes in the header block. Each 8-bit
7622 byte in the header is added to an unsigned integer, initialized to
7623 zero, the precision of which shall be no less than seventeen bits.
7624 When calculating the checksum, the @code{chksum} field is treated as
7625 if it were all blanks.
7626
7627 The @code{typeflag} field specifies the type of file archived. If a
7628 particular implementation does not recognize or permit the specified
7629 type, the file will be extracted as if it were a regular file. As this
7630 action occurs, @command{tar} issues a warning to the standard error.
7631
7632 The @code{atime} and @code{ctime} fields are used in making incremental
7633 backups; they store, respectively, the particular file's access and
7634 status change times.
7635
7636 The @code{offset} is used by the @option{--multi-volume} (@option{-M}) option, when
7637 making a multi-volume archive. The offset is number of bytes into
7638 the file that we need to restart at to continue the file on the next
7639 tape, i.e., where we store the location that a continued file is
7640 continued at.
7641
7642 The following fields were added to deal with sparse files. A file
7643 is @dfn{sparse} if it takes in unallocated blocks which end up being
7644 represented as zeros, i.e., no useful data. A test to see if a file
7645 is sparse is to look at the number blocks allocated for it versus the
7646 number of characters in the file; if there are fewer blocks allocated
7647 for the file than would normally be allocated for a file of that
7648 size, then the file is sparse. This is the method @command{tar} uses to
7649 detect a sparse file, and once such a file is detected, it is treated
7650 differently from non-sparse files.
7651
7652 Sparse files are often @code{dbm} files, or other database-type files
7653 which have data at some points and emptiness in the greater part of
7654 the file. Such files can appear to be very large when an @samp{ls
7655 -l} is done on them, when in truth, there may be a very small amount
7656 of important data contained in the file. It is thus undesirable
7657 to have @command{tar} think that it must back up this entire file, as
7658 great quantities of room are wasted on empty blocks, which can lead
7659 to running out of room on a tape far earlier than is necessary.
7660 Thus, sparse files are dealt with so that these empty blocks are
7661 not written to the tape. Instead, what is written to the tape is a
7662 description, of sorts, of the sparse file: where the holes are, how
7663 big the holes are, and how much data is found at the end of the hole.
7664 This way, the file takes up potentially far less room on the tape,
7665 and when the file is extracted later on, it will look exactly the way
7666 it looked beforehand. The following is a description of the fields
7667 used to handle a sparse file:
7668
7669 The @code{sp} is an array of @code{struct sparse}. Each @code{struct
7670 sparse} contains two 12-character strings which represent an offset
7671 into the file and a number of bytes to be written at that offset.
7672 The offset is absolute, and not relative to the offset in preceding
7673 array element.
7674
7675 The header can hold four of these @code{struct sparse} at the moment;
7676 if more are needed, they are not stored in the header.
7677
7678 The @code{isextended} flag is set when an @code{extended_header}
7679 is needed to deal with a file. Note that this means that this flag
7680 can only be set when dealing with a sparse file, and it is only set
7681 in the event that the description of the file will not fit in the
7682 allotted room for sparse structures in the header. In other words,
7683 an extended_header is needed.
7684
7685 The @code{extended_header} structure is used for sparse files which
7686 need more sparse structures than can fit in the header. The header can
7687 fit 4 such structures; if more are needed, the flag @code{isextended}
7688 gets set and the next block is an @code{extended_header}.
7689
7690 Each @code{extended_header} structure contains an array of 21
7691 sparse structures, along with a similar @code{isextended} flag
7692 that the header had. There can be an indeterminate number of such
7693 @code{extended_header}s to describe a sparse file.
7694
7695 @table @asis
7696
7697 @item @code{REGTYPE}
7698 @itemx @code{AREGTYPE}
7699 These flags represent a regular file. In order to be compatible
7700 with older versions of @command{tar}, a @code{typeflag} value of
7701 @code{AREGTYPE} should be silently recognized as a regular file.
7702 New archives should be created using @code{REGTYPE}. Also, for
7703 backward compatibility, @command{tar} treats a regular file whose name
7704 ends with a slash as a directory.
7705
7706 @item @code{LNKTYPE}
7707 This flag represents a file linked to another file, of any type,
7708 previously archived. Such files are identified in Unix by each
7709 file having the same device and inode number. The linked-to name is
7710 specified in the @code{linkname} field with a trailing null.
7711
7712 @item @code{SYMTYPE}
7713 This represents a symbolic link to another file. The linked-to name
7714 is specified in the @code{linkname} field with a trailing null.
7715
7716 @item @code{CHRTYPE}
7717 @itemx @code{BLKTYPE}
7718 These represent character special files and block special files
7719 respectively. In this case the @code{devmajor} and @code{devminor}
7720 fields will contain the major and minor device numbers respectively.
7721 Operating systems may map the device specifications to their own
7722 local specification, or may ignore the entry.
7723
7724 @item @code{DIRTYPE}
7725 This flag specifies a directory or sub-directory. The directory
7726 name in the @code{name} field should end with a slash. On systems where
7727 disk allocation is performed on a directory basis, the @code{size} field
7728 will contain the maximum number of bytes (which may be rounded to
7729 the nearest disk block allocation unit) which the directory may
7730 hold. A @code{size} field of zero indicates no such limiting. Systems
7731 which do not support limiting in this manner should ignore the
7732 @code{size} field.
7733
7734 @item @code{FIFOTYPE}
7735 This specifies a FIFO special file. Note that the archiving of a
7736 FIFO file archives the existence of this file and not its contents.
7737
7738 @item @code{CONTTYPE}
7739 This specifies a contiguous file, which is the same as a normal
7740 file except that, in operating systems which support it, all its
7741 space is allocated contiguously on the disk. Operating systems
7742 which do not allow contiguous allocation should silently treat this
7743 type as a normal file.
7744
7745 @item @code{A} @dots{} @code{Z}
7746 These are reserved for custom implementations. Some of these are
7747 used in the @acronym{GNU} modified format, as described below.
7748
7749 @end table
7750
7751 Other values are reserved for specification in future revisions of
7752 the P1003 standard, and should not be used by any @command{tar} program.
7753
7754 The @code{magic} field indicates that this archive was output in
7755 the P1003 archive format. If this field contains @code{TMAGIC},
7756 the @code{uname} and @code{gname} fields will contain the ASCII
7757 representation of the owner and group of the file respectively.
7758 If found, the user and group IDs are used rather than the values in
7759 the @code{uid} and @code{gid} fields.
7760
7761 For references, see ISO/IEC 9945-1:1990 or IEEE Std 1003.1-1990, pages
7762 169-173 (section 10.1) for @cite{Archive/Interchange File Format}; and
7763 IEEE Std 1003.2-1992, pages 380-388 (section 4.48) and pages 936-940
7764 (section E.4.48) for @cite{pax - Portable archive interchange}.
7765
7766 @node Extensions
7767 @section @acronym{GNU} Extensions to the Archive Format
7768 @UNREVISED
7769
7770 The @acronym{GNU} format uses additional file types to describe new types of
7771 files in an archive. These are listed below.
7772
7773 @table @code
7774 @item GNUTYPE_DUMPDIR
7775 @itemx 'D'
7776 This represents a directory and a list of files created by the
7777 @option{--incremental} (@option{-G}) option. The @code{size} field gives the total
7778 size of the associated list of files. Each file name is preceded by
7779 either a @samp{Y} (the file should be in this archive) or an @samp{N}.
7780 (The file is a directory, or is not stored in the archive.) Each file
7781 name is terminated by a null. There is an additional null after the
7782 last file name.
7783
7784 @item GNUTYPE_MULTIVOL
7785 @itemx 'M'
7786 This represents a file continued from another volume of a multi-volume
7787 archive created with the @option{--multi-volume} (@option{-M}) option. The original
7788 type of the file is not given here. The @code{size} field gives the
7789 maximum size of this piece of the file (assuming the volume does
7790 not end before the file is written out). The @code{offset} field
7791 gives the offset from the beginning of the file where this part of
7792 the file begins. Thus @code{size} plus @code{offset} should equal
7793 the original size of the file.
7794
7795 @item GNUTYPE_SPARSE
7796 @itemx 'S'
7797 This flag indicates that we are dealing with a sparse file. Note
7798 that archiving a sparse file requires special operations to find
7799 holes in the file, which mark the positions of these holes, along
7800 with the number of bytes of data to be found after the hole.
7801
7802 @item GNUTYPE_VOLHDR
7803 @itemx 'V'
7804 This file type is used to mark the volume header that was given with
7805 the @option{--label=@var{archive-label}} (@option{-V @var{archive-label}}) option when the archive was created. The @code{name}
7806 field contains the @code{name} given after the @option{--label=@var{archive-label}} (@option{-V @var{archive-label}}) option.
7807 The @code{size} field is zero. Only the first file in each volume
7808 of an archive should have this type.
7809
7810 @end table
7811
7812 You may have trouble reading a @acronym{GNU} format archive on a
7813 non-@acronym{GNU} system if the options @option{--incremental} (@option{-G}),
7814 @option{--multi-volume} (@option{-M}), @option{--sparse} (@option{-S}), or @option{--label=@var{archive-label}} (@option{-V @var{archive-label}}) were
7815 used when writing the archive. In general, if @command{tar} does not
7816 use the @acronym{GNU}-added fields of the header, other versions of
7817 @command{tar} should be able to read the archive. Otherwise, the
7818 @command{tar} program will give an error, the most likely one being a
7819 checksum error.
7820
7821 @node cpio
7822 @section Comparison of @command{tar} and @command{cpio}
7823 @UNREVISED
7824
7825 @FIXME{Reorganize the following material}
7826
7827 The @command{cpio} archive formats, like @command{tar}, do have maximum
7828 pathname lengths. The binary and old ASCII formats have a max path
7829 length of 256, and the new ASCII and CRC ASCII formats have a max
7830 path length of 1024. @acronym{GNU} @command{cpio} can read and write archives
7831 with arbitrary pathname lengths, but other @command{cpio} implementations
7832 may crash unexplainedly trying to read them.
7833
7834 @command{tar} handles symbolic links in the form in which it comes in BSD;
7835 @command{cpio} doesn't handle symbolic links in the form in which it comes
7836 in System V prior to SVR4, and some vendors may have added symlinks
7837 to their system without enhancing @command{cpio} to know about them.
7838 Others may have enhanced it in a way other than the way I did it
7839 at Sun, and which was adopted by AT&T (and which is, I think, also
7840 present in the @command{cpio} that Berkeley picked up from AT&T and put
7841 into a later BSD release---I think I gave them my changes).
7842
7843 (SVR4 does some funny stuff with @command{tar}; basically, its @command{cpio}
7844 can handle @command{tar} format input, and write it on output, and it
7845 probably handles symbolic links. They may not have bothered doing
7846 anything to enhance @command{tar} as a result.)
7847
7848 @command{cpio} handles special files; traditional @command{tar} doesn't.
7849
7850 @command{tar} comes with V7, System III, System V, and BSD source;
7851 @command{cpio} comes only with System III, System V, and later BSD
7852 (4.3-tahoe and later).
7853
7854 @command{tar}'s way of handling multiple hard links to a file can handle
7855 file systems that support 32-bit inumbers (e.g., the BSD file system);
7856 @command{cpio}s way requires you to play some games (in its "binary"
7857 format, i-numbers are only 16 bits, and in its "portable ASCII" format,
7858 they're 18 bits---it would have to play games with the "file system ID"
7859 field of the header to make sure that the file system ID/i-number pairs
7860 of different files were always different), and I don't know which
7861 @command{cpio}s, if any, play those games. Those that don't might get
7862 confused and think two files are the same file when they're not, and
7863 make hard links between them.
7864
7865 @command{tar}s way of handling multiple hard links to a file places only
7866 one copy of the link on the tape, but the name attached to that copy
7867 is the @emph{only} one you can use to retrieve the file; @command{cpio}s
7868 way puts one copy for every link, but you can retrieve it using any
7869 of the names.
7870
7871 @quotation
7872 What type of check sum (if any) is used, and how is this calculated.
7873 @end quotation
7874
7875 See the attached manual pages for @command{tar} and @command{cpio} format.
7876 @command{tar} uses a checksum which is the sum of all the bytes in the
7877 @command{tar} header for a file; @command{cpio} uses no checksum.
7878
7879 @quotation
7880 If anyone knows why @command{cpio} was made when @command{tar} was present
7881 at the unix scene,
7882 @end quotation
7883
7884 It wasn't. @command{cpio} first showed up in PWB/UNIX 1.0; no
7885 generally-available version of UNIX had @command{tar} at the time. I don't
7886 know whether any version that was generally available @emph{within AT&T}
7887 had @command{tar}, or, if so, whether the people within AT&T who did
7888 @command{cpio} knew about it.
7889
7890 On restore, if there is a corruption on a tape @command{tar} will stop at
7891 that point, while @command{cpio} will skip over it and try to restore the
7892 rest of the files.
7893
7894 The main difference is just in the command syntax and header format.
7895
7896 @command{tar} is a little more tape-oriented in that everything is blocked
7897 to start on a record boundary.
7898
7899 @quotation
7900 Is there any differences between the ability to recover crashed
7901 archives between the two of them. (Is there any chance of recovering
7902 crashed archives at all.)
7903 @end quotation
7904
7905 Theoretically it should be easier under @command{tar} since the blocking
7906 lets you find a header with some variation of @samp{dd skip=@var{nn}}.
7907 However, modern @command{cpio}'s and variations have an option to just
7908 search for the next file header after an error with a reasonable chance
7909 of resyncing. However, lots of tape driver software won't allow you to
7910 continue past a media error which should be the only reason for getting
7911 out of sync unless a file changed sizes while you were writing the
7912 archive.
7913
7914 @quotation
7915 If anyone knows why @command{cpio} was made when @command{tar} was present
7916 at the unix scene, please tell me about this too.
7917 @end quotation
7918
7919 Probably because it is more media efficient (by not blocking everything
7920 and using only the space needed for the headers where @command{tar}
7921 always uses 512 bytes per file header) and it knows how to archive
7922 special files.
7923
7924 You might want to look at the freely available alternatives. The
7925 major ones are @command{afio}, @GNUTAR{}, and
7926 @command{pax}, each of which have their own extensions with some
7927 backwards compatibility.
7928
7929 Sparse files were @command{tar}red as sparse files (which you can
7930 easily test, because the resulting archive gets smaller, and
7931 @acronym{GNU} @command{cpio} can no longer read it).
7932
7933 @node Media
7934 @chapter Tapes and Other Archive Media
7935 @UNREVISED
7936
7937 A few special cases about tape handling warrant more detailed
7938 description. These special cases are discussed below.
7939
7940 Many complexities surround the use of @command{tar} on tape drives. Since
7941 the creation and manipulation of archives located on magnetic tape was
7942 the original purpose of @command{tar}, it contains many features making
7943 such manipulation easier.
7944
7945 Archives are usually written on dismountable media---tape cartridges,
7946 mag tapes, or floppy disks.
7947
7948 The amount of data a tape or disk holds depends not only on its size,
7949 but also on how it is formatted. A 2400 foot long reel of mag tape
7950 holds 40 megabytes of data when formatted at 1600 bits per inch. The
7951 physically smaller EXABYTE tape cartridge holds 2.3 gigabytes.
7952
7953 Magnetic media are re-usable---once the archive on a tape is no longer
7954 needed, the archive can be erased and the tape or disk used over.
7955 Media quality does deteriorate with use, however. Most tapes or disks
7956 should be discarded when they begin to produce data errors. EXABYTE
7957 tape cartridges should be discarded when they generate an @dfn{error
7958 count} (number of non-usable bits) of more than 10k.
7959
7960 Magnetic media are written and erased using magnetic fields, and
7961 should be protected from such fields to avoid damage to stored data.
7962 Sticking a floppy disk to a filing cabinet using a magnet is probably
7963 not a good idea.
7964
7965 @menu
7966 * Device:: Device selection and switching
7967 * Remote Tape Server::
7968 * Common Problems and Solutions::
7969 * Blocking:: Blocking
7970 * Many:: Many archives on one tape
7971 * Using Multiple Tapes:: Using Multiple Tapes
7972 * label:: Including a Label in the Archive
7973 * verify::
7974 * Write Protection::
7975 @end menu
7976
7977 @node Device
7978 @section Device Selection and Switching
7979 @UNREVISED
7980
7981 @table @option
7982 @item -f [@var{hostname}:]@var{file}
7983 @itemx --file=[@var{hostname}:]@var{file}
7984 Use archive file or device @var{file} on @var{hostname}.
7985 @end table
7986
7987 This option is used to specify the file name of the archive @command{tar}
7988 works on.
7989
7990 If the file name is @samp{-}, @command{tar} reads the archive from standard
7991 input (when listing or extracting), or writes it to standard output
7992 (when creating). If the @samp{-} file name is given when updating an
7993 archive, @command{tar} will read the original archive from its standard
7994 input, and will write the entire new archive to its standard output.
7995
7996 If the file name contains a @samp{:}, it is interpreted as
7997 @samp{hostname:file name}. If the @var{hostname} contains an @dfn{at}
7998 sign (@samp{@@}), it is treated as @samp{user@@hostname:file name}. In
7999 either case, @command{tar} will invoke the command @command{rsh} (or
8000 @command{remsh}) to start up an @command{/usr/libexec/rmt} on the remote
8001 machine. If you give an alternate login name, it will be given to the
8002 @command{rsh}.
8003 Naturally, the remote machine must have an executable
8004 @command{/usr/libexec/rmt}. This program is free software from the
8005 University of California, and a copy of the source code can be found
8006 with the sources for @command{tar}; it's compiled and installed by default.
8007 The exact path to this utility is determined when configuring the package.
8008 It is @file{@var{prefix}/libexec/rmt}, where @var{prefix} stands for
8009 your installation prefix. This location may also be overridden at
8010 runtime by using @option{rmt-command=@var{command}} option (@xref{Option Summary,
8011 ---rmt-command}, for detailed description of this option. @xref{Remote
8012 Tape Server}, for the description of @command{rmt} command).
8013
8014 If this option is not given, but the environment variable @env{TAPE}
8015 is set, its value is used; otherwise, old versions of @command{tar}
8016 used a default archive name (which was picked when @command{tar} was
8017 compiled). The default is normally set up to be the @dfn{first} tape
8018 drive or other transportable I/O medium on the system.
8019
8020 Starting with version 1.11.5, @GNUTAR{} uses
8021 standard input and standard output as the default device, and I will
8022 not try anymore supporting automatic device detection at installation
8023 time. This was failing really in too many cases, it was hopeless.
8024 This is now completely left to the installer to override standard
8025 input and standard output for default device, if this seems
8026 preferable. Further, I think @emph{most} actual usages of
8027 @command{tar} are done with pipes or disks, not really tapes,
8028 cartridges or diskettes.
8029
8030 Some users think that using standard input and output is running
8031 after trouble. This could lead to a nasty surprise on your screen if
8032 you forget to specify an output file name---especially if you are going
8033 through a network or terminal server capable of buffering large amounts
8034 of output. We had so many bug reports in that area of configuring
8035 default tapes automatically, and so many contradicting requests, that
8036 we finally consider the problem to be portably intractable. We could
8037 of course use something like @samp{/dev/tape} as a default, but this
8038 is @emph{also} running after various kind of trouble, going from hung
8039 processes to accidental destruction of real tapes. After having seen
8040 all this mess, using standard input and output as a default really
8041 sounds like the only clean choice left, and a very useful one too.
8042
8043 @GNUTAR{} reads and writes archive in records, I
8044 suspect this is the main reason why block devices are preferred over
8045 character devices. Most probably, block devices are more efficient
8046 too. The installer could also check for @samp{DEFTAPE} in
8047 @file{<sys/mtio.h>}.
8048
8049 @table @option
8050 @opindex force-local, short description
8051 @item --force-local
8052 Archive file is local even if it contains a colon.
8053
8054 @opindex rsh-command
8055 @item --rsh-command=@var{command}
8056 Use remote @var{command} instead of @command{rsh}. This option exists
8057 so that people who use something other than the standard @command{rsh}
8058 (e.g., a Kerberized @command{rsh}) can access a remote device.
8059
8060 When this command is not used, the shell command found when
8061 the @command{tar} program was installed is used instead. This is
8062 the first found of @file{/usr/ucb/rsh}, @file{/usr/bin/remsh},
8063 @file{/usr/bin/rsh}, @file{/usr/bsd/rsh} or @file{/usr/bin/nsh}.
8064 The installer may have overridden this by defining the environment
8065 variable @env{RSH} @emph{at installation time}.
8066
8067 @item -[0-7][lmh]
8068 Specify drive and density.
8069
8070 @opindex multi-volume, short description
8071 @item -M
8072 @itemx --multi-volume
8073 Create/list/extract multi-volume archive.
8074
8075 This option causes @command{tar} to write a @dfn{multi-volume} archive---one
8076 that may be larger than will fit on the medium used to hold it.
8077 @xref{Multi-Volume Archives}.
8078
8079 @opindex tape-length, short description
8080 @item -L @var{num}
8081 @itemx --tape-length=@var{num}
8082 Change tape after writing @var{num} x 1024 bytes.
8083
8084 This option might be useful when your tape drivers do not properly
8085 detect end of physical tapes. By being slightly conservative on the
8086 maximum tape length, you might avoid the problem entirely.
8087
8088 @opindex info-script, short description
8089 @opindex new-volume-script, short description
8090 @item -F @var{file}
8091 @itemx --info-script=@var{file}
8092 @itemx --new-volume-script=@var{file}
8093 Execute @file{file} at end of each tape. This implies
8094 @option{--multi-volume} (@option{-M}). @xref{info-script}, for a detailed
8095 description of this option.
8096 @end table
8097
8098 @node Remote Tape Server
8099 @section The Remote Tape Server
8100
8101 @cindex remote tape drive
8102 @pindex rmt
8103 In order to access the tape drive on a remote machine, @command{tar}
8104 uses the remote tape server written at the University of California at
8105 Berkeley. The remote tape server must be installed as
8106 @file{@var{prefix}/libexec/rmt} on any machine whose tape drive you
8107 want to use. @command{tar} calls @command{rmt} by running an
8108 @command{rsh} or @command{remsh} to the remote machine, optionally
8109 using a different login name if one is supplied.
8110
8111 A copy of the source for the remote tape server is provided. It is
8112 Copyright @copyright{} 1983 by the Regents of the University of
8113 California, but can be freely distributed. It is compiled and
8114 installed by default.
8115
8116 @cindex absolute file names
8117 Unless you use the @option{--absolute-names} (@option{-P}) option,
8118 @GNUTAR{} will not allow you to create an archive that contains
8119 absolute file names (a file name beginning with @samp{/}.) If you try,
8120 @command{tar} will automatically remove the leading @samp{/} from the
8121 file names it stores in the archive. It will also type a warning
8122 message telling you what it is doing.
8123
8124 When reading an archive that was created with a different
8125 @command{tar} program, @GNUTAR{} automatically
8126 extracts entries in the archive which have absolute file names as if
8127 the file names were not absolute. This is an important feature. A
8128 visitor here once gave a @command{tar} tape to an operator to restore;
8129 the operator used Sun @command{tar} instead of @GNUTAR{},
8130 and the result was that it replaced large portions of
8131 our @file{/bin} and friends with versions from the tape; needless to
8132 say, we were unhappy about having to recover the file system from
8133 backup tapes.
8134
8135 For example, if the archive contained a file @file{/usr/bin/computoy},
8136 @GNUTAR{} would extract the file to @file{usr/bin/computoy},
8137 relative to the current directory. If you want to extract the files in
8138 an archive to the same absolute names that they had when the archive
8139 was created, you should do a @samp{cd /} before extracting the files
8140 from the archive, or you should either use the @option{--absolute-names}
8141 option, or use the command @samp{tar -C / @dots{}}.
8142
8143 @cindex Ultrix 3.1 and write failure
8144 Some versions of Unix (Ultrix 3.1 is known to have this problem),
8145 can claim that a short write near the end of a tape succeeded,
8146 when it actually failed. This will result in the -M option not
8147 working correctly. The best workaround at the moment is to use a
8148 significantly larger blocking factor than the default 20.
8149
8150 In order to update an archive, @command{tar} must be able to backspace the
8151 archive in order to reread or rewrite a record that was just read (or
8152 written). This is currently possible only on two kinds of files: normal
8153 disk files (or any other file that can be backspaced with @samp{lseek}),
8154 and industry-standard 9-track magnetic tape (or any other kind of tape
8155 that can be backspaced with the @code{MTIOCTOP} @code{ioctl}.
8156
8157 This means that the @option{--append}, @option{--concatenate}, and
8158 @option{--delete} commands will not work on any other kind of file.
8159 Some media simply cannot be backspaced, which means these commands and
8160 options will never be able to work on them. These non-backspacing
8161 media include pipes and cartridge tape drives.
8162
8163 Some other media can be backspaced, and @command{tar} will work on them
8164 once @command{tar} is modified to do so.
8165
8166 Archives created with the @option{--multi-volume}, @option{--label}, and
8167 @option{--incremental} (@option{-G}) options may not be readable by other version
8168 of @command{tar}. In particular, restoring a file that was split over
8169 a volume boundary will require some careful work with @command{dd}, if
8170 it can be done at all. Other versions of @command{tar} may also create
8171 an empty file whose name is that of the volume header. Some versions
8172 of @command{tar} may create normal files instead of directories archived
8173 with the @option{--incremental} (@option{-G}) option.
8174
8175 @node Common Problems and Solutions
8176 @section Some Common Problems and their Solutions
8177
8178 @ifclear PUBLISH
8179
8180 @format
8181 errors from system:
8182 permission denied
8183 no such file or directory
8184 not owner
8185
8186 errors from @command{tar}:
8187 directory checksum error
8188 header format error
8189
8190 errors from media/system:
8191 i/o error
8192 device busy
8193 @end format
8194
8195 @end ifclear
8196
8197 @node Blocking
8198 @section Blocking
8199 @UNREVISED
8200
8201 @dfn{Block} and @dfn{record} terminology is rather confused, and it
8202 is also confusing to the expert reader. On the other hand, readers
8203 who are new to the field have a fresh mind, and they may safely skip
8204 the next two paragraphs, as the remainder of this manual uses those
8205 two terms in a quite consistent way.
8206
8207 John Gilmore, the writer of the public domain @command{tar} from which
8208 @GNUTAR{} was originally derived, wrote (June 1995):
8209
8210 @quotation
8211 The nomenclature of tape drives comes from IBM, where I believe
8212 they were invented for the IBM 650 or so. On IBM mainframes, what
8213 is recorded on tape are tape blocks. The logical organization of
8214 data is into records. There are various ways of putting records into
8215 blocks, including @code{F} (fixed sized records), @code{V} (variable
8216 sized records), @code{FB} (fixed blocked: fixed size records, @var{n}
8217 to a block), @code{VB} (variable size records, @var{n} to a block),
8218 @code{VSB} (variable spanned blocked: variable sized records that can
8219 occupy more than one block), etc. The @code{JCL} @samp{DD RECFORM=}
8220 parameter specified this to the operating system.
8221
8222 The Unix man page on @command{tar} was totally confused about this.
8223 When I wrote @code{PD TAR}, I used the historically correct terminology
8224 (@command{tar} writes data records, which are grouped into blocks).
8225 It appears that the bogus terminology made it into @acronym{POSIX} (no surprise
8226 here), and now Fran@,{c}ois has migrated that terminology back
8227 into the source code too.
8228 @end quotation
8229
8230 The term @dfn{physical block} means the basic transfer chunk from or
8231 to a device, after which reading or writing may stop without anything
8232 being lost. In this manual, the term @dfn{block} usually refers to
8233 a disk physical block, @emph{assuming} that each disk block is 512
8234 bytes in length. It is true that some disk devices have different
8235 physical blocks, but @command{tar} ignore these differences in its own
8236 format, which is meant to be portable, so a @command{tar} block is always
8237 512 bytes in length, and @dfn{block} always mean a @command{tar} block.
8238 The term @dfn{logical block} often represents the basic chunk of
8239 allocation of many disk blocks as a single entity, which the operating
8240 system treats somewhat atomically; this concept is only barely used
8241 in @GNUTAR{}.
8242
8243 The term @dfn{physical record} is another way to speak of a physical
8244 block, those two terms are somewhat interchangeable. In this manual,
8245 the term @dfn{record} usually refers to a tape physical block,
8246 @emph{assuming} that the @command{tar} archive is kept on magnetic tape.
8247 It is true that archives may be put on disk or used with pipes,
8248 but nevertheless, @command{tar} tries to read and write the archive one
8249 @dfn{record} at a time, whatever the medium in use. One record is made
8250 up of an integral number of blocks, and this operation of putting many
8251 disk blocks into a single tape block is called @dfn{reblocking}, or
8252 more simply, @dfn{blocking}. The term @dfn{logical record} refers to
8253 the logical organization of many characters into something meaningful
8254 to the application. The term @dfn{unit record} describes a small set
8255 of characters which are transmitted whole to or by the application,
8256 and often refers to a line of text. Those two last terms are unrelated
8257 to what we call a @dfn{record} in @GNUTAR{}.
8258
8259 When writing to tapes, @command{tar} writes the contents of the archive
8260 in chunks known as @dfn{records}. To change the default blocking
8261 factor, use the @option{--blocking-factor=@var{512-size}} (@option{-b
8262 @var{512-size}}) option. Each record will then be composed of
8263 @var{512-size} blocks. (Each @command{tar} block is 512 bytes.
8264 @xref{Standard}.) Each file written to the archive uses at least one
8265 full record. As a result, using a larger record size can result in
8266 more wasted space for small files. On the other hand, a larger record
8267 size can often be read and written much more efficiently.
8268
8269 Further complicating the problem is that some tape drives ignore the
8270 blocking entirely. For these, a larger record size can still improve
8271 performance (because the software layers above the tape drive still
8272 honor the blocking), but not as dramatically as on tape drives that
8273 honor blocking.
8274
8275 When reading an archive, @command{tar} can usually figure out the
8276 record size on itself. When this is the case, and a non-standard
8277 record size was used when the archive was created, @command{tar} will
8278 print a message about a non-standard blocking factor, and then operate
8279 normally. On some tape devices, however, @command{tar} cannot figure
8280 out the record size itself. On most of those, you can specify a
8281 blocking factor (with @option{--blocking-factor}) larger than the
8282 actual blocking factor, and then use the @option{--read-full-records}
8283 (@option{-B}) option. (If you specify a blocking factor with
8284 @option{--blocking-factor} and don't use the
8285 @option{--read-full-records} option, then @command{tar} will not
8286 attempt to figure out the recording size itself.) On some devices,
8287 you must always specify the record size exactly with
8288 @option{--blocking-factor} when reading, because @command{tar} cannot
8289 figure it out. In any case, use @option{--list} (@option{-t}) before
8290 doing any extractions to see whether @command{tar} is reading the archive
8291 correctly.
8292
8293 @command{tar} blocks are all fixed size (512 bytes), and its scheme for
8294 putting them into records is to put a whole number of them (one or
8295 more) into each record. @command{tar} records are all the same size;
8296 at the end of the file there's a block containing all zeros, which
8297 is how you tell that the remainder of the last record(s) are garbage.
8298
8299 In a standard @command{tar} file (no options), the block size is 512
8300 and the record size is 10240, for a blocking factor of 20. What the
8301 @option{--blocking-factor} option does is sets the blocking factor,
8302 changing the record size while leaving the block size at 512 bytes.
8303 20 was fine for ancient 800 or 1600 bpi reel-to-reel tape drives;
8304 most tape drives these days prefer much bigger records in order to
8305 stream and not waste tape. When writing tapes for myself, some tend
8306 to use a factor of the order of 2048, say, giving a record size of
8307 around one megabyte.
8308
8309 If you use a blocking factor larger than 20, older @command{tar}
8310 programs might not be able to read the archive, so we recommend this
8311 as a limit to use in practice. @GNUTAR{}, however,
8312 will support arbitrarily large record sizes, limited only by the
8313 amount of virtual memory or the physical characteristics of the tape
8314 device.
8315
8316 @menu
8317 * Format Variations:: Format Variations
8318 * Blocking Factor:: The Blocking Factor of an Archive
8319 @end menu
8320
8321 @node Format Variations
8322 @subsection Format Variations
8323 @cindex Format Parameters
8324 @cindex Format Options
8325 @cindex Options, archive format specifying
8326 @cindex Options, format specifying
8327 @UNREVISED
8328
8329 Format parameters specify how an archive is written on the archive
8330 media. The best choice of format parameters will vary depending on
8331 the type and number of files being archived, and on the media used to
8332 store the archive.
8333
8334 To specify format parameters when accessing or creating an archive,
8335 you can use the options described in the following sections.
8336 If you do not specify any format parameters, @command{tar} uses
8337 default parameters. You cannot modify a compressed archive.
8338 If you create an archive with the @option{--blocking-factor} option
8339 specified (@pxref{Blocking Factor}), you must specify that
8340 blocking-factor when operating on the archive. @xref{Formats}, for other
8341 examples of format parameter considerations.
8342
8343 @node Blocking Factor
8344 @subsection The Blocking Factor of an Archive
8345 @cindex Blocking Factor
8346 @cindex Record Size
8347 @cindex Number of blocks per record
8348 @cindex Number of bytes per record
8349 @cindex Bytes per record
8350 @cindex Blocks per record
8351 @UNREVISED
8352
8353 @opindex blocking-factor
8354 The data in an archive is grouped into blocks, which are 512 bytes.
8355 Blocks are read and written in whole number multiples called
8356 @dfn{records}. The number of blocks in a record (ie. the size of a
8357 record in units of 512 bytes) is called the @dfn{blocking factor}.
8358 The @option{--blocking-factor=@var{512-size}} (@option{-b
8359 @var{512-size}}) option specifies the blocking factor of an archive.
8360 The default blocking factor is typically 20 (i.e., 10240 bytes), but
8361 can be specified at installation. To find out the blocking factor of
8362 an existing archive, use @samp{tar --list --file=@var{archive-name}}.
8363 This may not work on some devices.
8364
8365 Records are separated by gaps, which waste space on the archive media.
8366 If you are archiving on magnetic tape, using a larger blocking factor
8367 (and therefore larger records) provides faster throughput and allows you
8368 to fit more data on a tape (because there are fewer gaps). If you are
8369 archiving on cartridge, a very large blocking factor (say 126 or more)
8370 greatly increases performance. A smaller blocking factor, on the other
8371 hand, may be useful when archiving small files, to avoid archiving lots
8372 of nulls as @command{tar} fills out the archive to the end of the record.
8373 In general, the ideal record size depends on the size of the
8374 inter-record gaps on the tape you are using, and the average size of the
8375 files you are archiving. @xref{create}, for information on
8376 writing archives.
8377
8378 @FIXME{Need example of using a cartridge with blocking factor=126 or more.}
8379
8380 Archives with blocking factors larger than 20 cannot be read
8381 by very old versions of @command{tar}, or by some newer versions
8382 of @command{tar} running on old machines with small address spaces.
8383 With @GNUTAR{}, the blocking factor of an archive is limited
8384 only by the maximum record size of the device containing the archive,
8385 or by the amount of available virtual memory.
8386
8387 Also, on some systems, not using adequate blocking factors, as sometimes
8388 imposed by the device drivers, may yield unexpected diagnostics. For
8389 example, this has been reported:
8390
8391 @smallexample
8392 Cannot write to /dev/dlt: Invalid argument
8393 @end smallexample
8394
8395 @noindent
8396 In such cases, it sometimes happen that the @command{tar} bundled by
8397 the system is aware of block size idiosyncrasies, while @GNUTAR{}
8398 requires an explicit specification for the block size,
8399 which it cannot guess. This yields some people to consider
8400 @GNUTAR{} is misbehaving, because by comparison,
8401 @cite{the bundle @command{tar} works OK}. Adding @w{@kbd{-b 256}},
8402 for example, might resolve the problem.
8403
8404 If you use a non-default blocking factor when you create an archive, you
8405 must specify the same blocking factor when you modify that archive. Some
8406 archive devices will also require you to specify the blocking factor when
8407 reading that archive, however this is not typically the case. Usually, you
8408 can use @option{--list} (@option{-t}) without specifying a blocking factor---@command{tar}
8409 reports a non-default record size and then lists the archive members as
8410 it would normally. To extract files from an archive with a non-standard
8411 blocking factor (particularly if you're not sure what the blocking factor
8412 is), you can usually use the @option{--read-full-records} (@option{-B}) option while
8413 specifying a blocking factor larger then the blocking factor of the archive
8414 (ie. @samp{tar --extract --read-full-records --blocking-factor=300}.
8415 @xref{list}, for more information on the @option{--list} (@option{-t})
8416 operation. @xref{Reading}, for a more detailed explanation of that option.
8417
8418 @table @option
8419 @item --blocking-factor=@var{number}
8420 @itemx -b @var{number}
8421 Specifies the blocking factor of an archive. Can be used with any
8422 operation, but is usually not necessary with @option{--list} (@option{-t}).
8423 @end table
8424
8425 Device blocking
8426
8427 @table @option
8428 @item -b @var{blocks}
8429 @itemx --blocking-factor=@var{blocks}
8430 Set record size to @math{@var{blocks} * 512} bytes.
8431
8432 This option is used to specify a @dfn{blocking factor} for the archive.
8433 When reading or writing the archive, @command{tar}, will do reads and writes
8434 of the archive in records of @math{@var{block}*512} bytes. This is true
8435 even when the archive is compressed. Some devices requires that all
8436 write operations be a multiple of a certain size, and so, @command{tar}
8437 pads the archive out to the next record boundary.
8438
8439 The default blocking factor is set when @command{tar} is compiled, and is
8440 typically 20. Blocking factors larger than 20 cannot be read by very
8441 old versions of @command{tar}, or by some newer versions of @command{tar}
8442 running on old machines with small address spaces.
8443
8444 With a magnetic tape, larger records give faster throughput and fit
8445 more data on a tape (because there are fewer inter-record gaps).
8446 If the archive is in a disk file or a pipe, you may want to specify
8447 a smaller blocking factor, since a large one will result in a large
8448 number of null bytes at the end of the archive.
8449
8450 When writing cartridge or other streaming tapes, a much larger
8451 blocking factor (say 126 or more) will greatly increase performance.
8452 However, you must specify the same blocking factor when reading or
8453 updating the archive.
8454
8455 Apparently, Exabyte drives have a physical block size of 8K bytes.
8456 If we choose our blocksize as a multiple of 8k bytes, then the problem
8457 seems to disappear. Id est, we are using block size of 112 right
8458 now, and we haven't had the problem since we switched@dots{}
8459
8460 With @GNUTAR{} the blocking factor is limited only
8461 by the maximum record size of the device containing the archive, or by
8462 the amount of available virtual memory.
8463
8464 However, deblocking or reblocking is virtually avoided in a special
8465 case which often occurs in practice, but which requires all the
8466 following conditions to be simultaneously true:
8467 @itemize @bullet
8468 @item
8469 the archive is subject to a compression option,
8470 @item
8471 the archive is not handled through standard input or output, nor
8472 redirected nor piped,
8473 @item
8474 the archive is directly handled to a local disk, instead of any special
8475 device,
8476 @item
8477 @option{--blocking-factor} is not explicitly specified on the @command{tar}
8478 invocation.
8479 @end itemize
8480
8481 If the output goes directly to a local disk, and not through
8482 stdout, then the last write is not extended to a full record size.
8483 Otherwise, reblocking occurs. Here are a few other remarks on this
8484 topic:
8485
8486 @itemize @bullet
8487
8488 @item
8489 @command{gzip} will complain about trailing garbage if asked to
8490 uncompress a compressed archive on tape, there is an option to turn
8491 the message off, but it breaks the regularity of simply having to use
8492 @samp{@var{prog} -d} for decompression. It would be nice if gzip was
8493 silently ignoring any number of trailing zeros. I'll ask Jean-loup
8494 Gailly, by sending a copy of this message to him.
8495
8496 @item
8497 @command{compress} does not show this problem, but as Jean-loup pointed
8498 out to Michael, @samp{compress -d} silently adds garbage after
8499 the result of decompression, which tar ignores because it already
8500 recognized its end-of-file indicator. So this bug may be safely
8501 ignored.
8502
8503 @item
8504 @samp{gzip -d -q} will be silent about the trailing zeros indeed,
8505 but will still return an exit status of 2 which tar reports in turn.
8506 @command{tar} might ignore the exit status returned, but I hate doing
8507 that, as it weakens the protection @command{tar} offers users against
8508 other possible problems at decompression time. If @command{gzip} was
8509 silently skipping trailing zeros @emph{and} also avoiding setting the
8510 exit status in this innocuous case, that would solve this situation.
8511
8512 @item
8513 @command{tar} should become more solid at not stopping to read a pipe at
8514 the first null block encountered. This inelegantly breaks the pipe.
8515 @command{tar} should rather drain the pipe out before exiting itself.
8516 @end itemize
8517
8518 @opindex ignore-zeros, short description
8519 @item -i
8520 @itemx --ignore-zeros
8521 Ignore blocks of zeros in archive (means EOF).
8522
8523 The @option{--ignore-zeros} (@option{-i}) option causes @command{tar} to ignore blocks
8524 of zeros in the archive. Normally a block of zeros indicates the
8525 end of the archive, but when reading a damaged archive, or one which
8526 was created by concatenating several archives together, this option
8527 allows @command{tar} to read the entire archive. This option is not on
8528 by default because many versions of @command{tar} write garbage after
8529 the zeroed blocks.
8530
8531 Note that this option causes @command{tar} to read to the end of the
8532 archive file, which may sometimes avoid problems when multiple files
8533 are stored on a single physical tape.
8534
8535 @opindex read-full-records, short description
8536 @item -B
8537 @itemx --read-full-records
8538 Reblock as we read (for reading 4.2BSD pipes).
8539
8540 If @option{--read-full-records} is used, @command{tar}
8541 will not panic if an attempt to read a record from the archive does
8542 not return a full record. Instead, @command{tar} will keep reading
8543 until it has obtained a full
8544 record.
8545
8546 This option is turned on by default when @command{tar} is reading
8547 an archive from standard input, or from a remote machine. This is
8548 because on BSD Unix systems, a read of a pipe will return however
8549 much happens to be in the pipe, even if it is less than @command{tar}
8550 requested. If this option was not used, @command{tar} would fail as
8551 soon as it read an incomplete record from the pipe.
8552
8553 This option is also useful with the commands for updating an archive.
8554
8555 @end table
8556
8557 Tape blocking
8558
8559 @FIXME{Appropriate options should be moved here from elsewhere.}
8560
8561 @cindex blocking factor
8562 @cindex tape blocking
8563
8564 When handling various tapes or cartridges, you have to take care of
8565 selecting a proper blocking, that is, the number of disk blocks you
8566 put together as a single tape block on the tape, without intervening
8567 tape gaps. A @dfn{tape gap} is a small landing area on the tape
8568 with no information on it, used for decelerating the tape to a
8569 full stop, and for later regaining the reading or writing speed.
8570 When the tape driver starts reading a record, the record has to
8571 be read whole without stopping, as a tape gap is needed to stop the
8572 tape motion without loosing information.
8573
8574 @cindex Exabyte blocking
8575 @cindex DAT blocking
8576 Using higher blocking (putting more disk blocks per tape block) will use
8577 the tape more efficiently as there will be less tape gaps. But reading
8578 such tapes may be more difficult for the system, as more memory will be
8579 required to receive at once the whole record. Further, if there is a
8580 reading error on a huge record, this is less likely that the system will
8581 succeed in recovering the information. So, blocking should not be too
8582 low, nor it should be too high. @command{tar} uses by default a blocking of
8583 20 for historical reasons, and it does not really matter when reading or
8584 writing to disk. Current tape technology would easily accommodate higher
8585 blockings. Sun recommends a blocking of 126 for Exabytes and 96 for DATs.
8586 We were told that for some DLT drives, the blocking should be a multiple
8587 of 4Kb, preferably 64Kb (@w{@kbd{-b 128}}) or 256 for decent performance.
8588 Other manufacturers may use different recommendations for the same tapes.
8589 This might also depends of the buffering techniques used inside modern
8590 tape controllers. Some imposes a minimum blocking, or a maximum blocking.
8591 Others request blocking to be some exponent of two.
8592
8593 So, there is no fixed rule for blocking. But blocking at read time
8594 should ideally be the same as blocking used at write time. At one place
8595 I know, with a wide variety of equipment, they found it best to use a
8596 blocking of 32 to guarantee that their tapes are fully interchangeable.
8597
8598 I was also told that, for recycled tapes, prior erasure (by the same
8599 drive unit that will be used to create the archives) sometimes lowers
8600 the error rates observed at rewriting time.
8601
8602 I might also use @option{--number-blocks} instead of
8603 @option{--block-number}, so @option{--block} will then expand to
8604 @option{--blocking-factor} unambiguously.
8605
8606 @node Many
8607 @section Many Archives on One Tape
8608
8609 @FIXME{Appropriate options should be moved here from elsewhere.}
8610
8611 @findex ntape @r{device}
8612 Most tape devices have two entries in the @file{/dev} directory, or
8613 entries that come in pairs, which differ only in the minor number for
8614 this device. Let's take for example @file{/dev/tape}, which often
8615 points to the only or usual tape device of a given system. There might
8616 be a corresponding @file{/dev/nrtape} or @file{/dev/ntape}. The simpler
8617 name is the @emph{rewinding} version of the device, while the name
8618 having @samp{nr} in it is the @emph{no rewinding} version of the same
8619 device.
8620
8621 A rewinding tape device will bring back the tape to its beginning point
8622 automatically when this device is opened or closed. Since @command{tar}
8623 opens the archive file before using it and closes it afterwards, this
8624 means that a simple:
8625
8626 @smallexample
8627 $ @kbd{tar cf /dev/tape @var{directory}}
8628 @end smallexample
8629
8630 @noindent
8631 will reposition the tape to its beginning both prior and after saving
8632 @var{directory} contents to it, thus erasing prior tape contents and
8633 making it so that any subsequent write operation will destroy what has
8634 just been saved.
8635
8636 @cindex tape positioning
8637 So, a rewinding device is normally meant to hold one and only one file.
8638 If you want to put more than one @command{tar} archive on a given tape, you
8639 will need to avoid using the rewinding version of the tape device. You
8640 will also have to pay special attention to tape positioning. Errors in
8641 positioning may overwrite the valuable data already on your tape. Many
8642 people, burnt by past experiences, will only use rewinding devices and
8643 limit themselves to one file per tape, precisely to avoid the risk of
8644 such errors. Be fully aware that writing at the wrong position on a
8645 tape loses all information past this point and most probably until the
8646 end of the tape, and this destroyed information @emph{cannot} be
8647 recovered.
8648
8649 To save @var{directory-1} as a first archive at the beginning of a
8650 tape, and leave that tape ready for a second archive, you should use:
8651
8652 @smallexample
8653 $ @kbd{mt -f /dev/nrtape rewind}
8654 $ @kbd{tar cf /dev/nrtape @var{directory-1}}
8655 @end smallexample
8656
8657 @cindex tape marks
8658 @dfn{Tape marks} are special magnetic patterns written on the tape
8659 media, which are later recognizable by the reading hardware. These
8660 marks are used after each file, when there are many on a single tape.
8661 An empty file (that is to say, two tape marks in a row) signal the
8662 logical end of the tape, after which no file exist. Usually,
8663 non-rewinding tape device drivers will react to the close request issued
8664 by @command{tar} by first writing two tape marks after your archive, and by
8665 backspacing over one of these. So, if you remove the tape at that time
8666 from the tape drive, it is properly terminated. But if you write
8667 another file at the current position, the second tape mark will be
8668 erased by the new information, leaving only one tape mark between files.
8669
8670 So, you may now save @var{directory-2} as a second archive after the
8671 first on the same tape by issuing the command:
8672
8673 @smallexample
8674 $ @kbd{tar cf /dev/nrtape @var{directory-2}}
8675 @end smallexample
8676
8677 @noindent
8678 and so on for all the archives you want to put on the same tape.
8679
8680 Another usual case is that you do not write all the archives the same
8681 day, and you need to remove and store the tape between two archive
8682 sessions. In general, you must remember how many files are already
8683 saved on your tape. Suppose your tape already has 16 files on it, and
8684 that you are ready to write the 17th. You have to take care of skipping
8685 the first 16 tape marks before saving @var{directory-17}, say, by using
8686 these commands:
8687
8688 @smallexample
8689 $ @kbd{mt -f /dev/nrtape rewind}
8690 $ @kbd{mt -f /dev/nrtape fsf 16}
8691 $ @kbd{tar cf /dev/nrtape @var{directory-17}}
8692 @end smallexample
8693
8694 In all the previous examples, we put aside blocking considerations, but
8695 you should do the proper things for that as well. @xref{Blocking}.
8696
8697 @menu
8698 * Tape Positioning:: Tape Positions and Tape Marks
8699 * mt:: The @command{mt} Utility
8700 @end menu
8701
8702 @node Tape Positioning
8703 @subsection Tape Positions and Tape Marks
8704 @UNREVISED
8705
8706 Just as archives can store more than one file from the file system,
8707 tapes can store more than one archive file. To keep track of where
8708 archive files (or any other type of file stored on tape) begin and
8709 end, tape archive devices write magnetic @dfn{tape marks} on the
8710 archive media. Tape drives write one tape mark between files,
8711 two at the end of all the file entries.
8712
8713 If you think of data as a series of records "rrrr"'s, and tape marks as
8714 "*"'s, a tape might look like the following:
8715
8716 @smallexample
8717 rrrr*rrrrrr*rrrrr*rr*rrrrr**-------------------------
8718 @end smallexample
8719
8720 Tape devices read and write tapes using a read/write @dfn{tape
8721 head}---a physical part of the device which can only access one
8722 point on the tape at a time. When you use @command{tar} to read or
8723 write archive data from a tape device, the device will begin reading
8724 or writing from wherever on the tape the tape head happens to be,
8725 regardless of which archive or what part of the archive the tape
8726 head is on. Before writing an archive, you should make sure that no
8727 data on the tape will be overwritten (unless it is no longer needed).
8728 Before reading an archive, you should make sure the tape head is at
8729 the beginning of the archive you want to read. You can do it manually
8730 via @code{mt} utility (@pxref{mt}). The @code{restore} script does
8731 that automatically (@pxref{Scripted Restoration}).
8732
8733 If you want to add new archive file entries to a tape, you should
8734 advance the tape to the end of the existing file entries, backspace
8735 over the last tape mark, and write the new archive file. If you were
8736 to add two archives to the example above, the tape might look like the
8737 following:
8738
8739 @smallexample
8740 rrrr*rrrrrr*rrrrr*rr*rrrrr*rrr*rrrr**----------------
8741 @end smallexample
8742
8743 @node mt
8744 @subsection The @command{mt} Utility
8745 @UNREVISED
8746
8747 @FIXME{Is it true that this only works on non-block devices?
8748 should explain the difference, (fixed or variable).}
8749 @xref{Blocking Factor}.
8750
8751 You can use the @command{mt} utility to advance or rewind a tape past a
8752 specified number of archive files on the tape. This will allow you
8753 to move to the beginning of an archive before extracting or reading
8754 it, or to the end of all the archives before writing a new one.
8755 @FIXME{Why isn't there an "advance 'til you find two tape marks
8756 together"?}
8757
8758 The syntax of the @command{mt} command is:
8759
8760 @smallexample
8761 @kbd{mt [-f @var{tapename}] @var{operation} [@var{number}]}
8762 @end smallexample
8763
8764 where @var{tapename} is the name of the tape device, @var{number} is
8765 the number of times an operation is performed (with a default of one),
8766 and @var{operation} is one of the following:
8767
8768 @FIXME{is there any use for record operations?}
8769
8770 @table @option
8771 @item eof
8772 @itemx weof
8773 Writes @var{number} tape marks at the current position on the tape.
8774
8775 @item fsf
8776 Moves tape position forward @var{number} files.
8777
8778 @item bsf
8779 Moves tape position back @var{number} files.
8780
8781 @item rewind
8782 Rewinds the tape. (Ignores @var{number}).
8783
8784 @item offline
8785 @itemx rewoff1
8786 Rewinds the tape and takes the tape device off-line. (Ignores @var{number}).
8787
8788 @item status
8789 Prints status information about the tape unit.
8790
8791 @end table
8792
8793 @FIXME{Is there a better way to frob the spacing on the list?}
8794
8795 If you don't specify a @var{tapename}, @command{mt} uses the environment
8796 variable @env{TAPE}; if @env{TAPE} is not set, @command{mt} uses the device
8797 @file{/dev/rmt12}.
8798
8799 @command{mt} returns a 0 exit status when the operation(s) were
8800 successful, 1 if the command was unrecognized, and 2 if an operation
8801 failed.
8802
8803 @node Using Multiple Tapes
8804 @section Using Multiple Tapes
8805 @UNREVISED
8806
8807 Often you might want to write a large archive, one larger than will fit
8808 on the actual tape you are using. In such a case, you can run multiple
8809 @command{tar} commands, but this can be inconvenient, particularly if you
8810 are using options like @option{--exclude=@var{pattern}} or dumping entire file systems.
8811 Therefore, @command{tar} supports multiple tapes automatically.
8812
8813 Use @option{--multi-volume} (@option{-M}) on the command line, and
8814 then @command{tar} will, when it reaches the end of the tape, prompt
8815 for another tape, and continue the archive. Each tape will have an
8816 independent archive, and can be read without needing the other. (As
8817 an exception to this, the file that @command{tar} was archiving when
8818 it ran out of tape will usually be split between the two archives; in
8819 this case you need to extract from the first archive, using
8820 @option{--multi-volume}, and then put in the second tape when
8821 prompted, so @command{tar} can restore both halves of the file.)
8822
8823 @GNUTAR{} multi-volume archives do not use a truly portable format.
8824 You need @GNUTAR{} at both ends to process them properly.
8825
8826 When prompting for a new tape, @command{tar} accepts any of the following
8827 responses:
8828
8829 @table @kbd
8830 @item ?
8831 Request @command{tar} to explain possible responses
8832 @item q
8833 Request @command{tar} to exit immediately.
8834 @item n @var{file name}
8835 Request @command{tar} to write the next volume on the file @var{file name}.
8836 @item !
8837 Request @command{tar} to run a subshell. This option can be disabled
8838 by giving @option{--restrict} command line option to @command{tar}.
8839 @item y
8840 Request @command{tar} to begin writing the next volume.
8841 @end table
8842
8843 (You should only type @samp{y} after you have changed the tape;
8844 otherwise @command{tar} will write over the volume it just finished.)
8845
8846 @cindex End-of-archive info script
8847 @cindex Info script
8848 @anchor{info-script}
8849 @opindex info-script
8850 @opindex new-volume-script
8851 If you want more elaborate behavior than this, give @command{tar} the
8852 @option{--info-script=@var{script-name}}
8853 (@option{--new-volume-script=@var{script-name}}, @option{-F
8854 @var{script-name}}) option. The file @var{script-name} is expected to
8855 be a program (or shell script) to be run instead of the normal
8856 prompting procedure. It is executed without any command line
8857 arguments. Additional data is passed to it via the following
8858 environment variables:
8859
8860 @table @env
8861 @vrindex TAR_VERSION, info script environment variable
8862 @item TAR_VERSION
8863 @GNUTAR{} version number.
8864
8865 @vrindex TAR_ARCHIVE, info script environment variable
8866 @item TAR_ARCHIVE
8867 The name of the archive @command{tar} is processing.
8868
8869 @vrindex TAR_VOLUME, info script environment variable
8870 @item TAR_VOLUME
8871 Ordinal number of the volume @command{tar} is about to start.
8872
8873 @vrindex TAR_SUBCOMMAND, info script environment variable
8874 @item TAR_SUBCOMMAND
8875 Short option describing the operation @command{tar} is executed.
8876 @xref{Operations}, for a complete list of subcommand options.
8877
8878 @vrindex TAR_FORMAT, info script environment variable
8879 @item TAR_FORMAT
8880 Format of the archive being processed. @xref{Formats}, for a complete
8881 list of archive format names.
8882 @end table
8883
8884 The info script can instruct @command{tar} to use new archive name,
8885 by writing in to file descriptor 3 (see below for an
8886 example).
8887
8888 If the info script fails, @command{tar} exits; otherwise, it begins
8889 writing the next volume.
8890
8891 The method @command{tar} uses to detect end of tape is not perfect, and
8892 fails on some operating systems or on some devices. You can use the
8893 @option{--tape-length=@var{size}} (@option{-L @var{size}}) option if
8894 @command{tar} can't detect the end of the tape itself. This option
8895 selects @option{--multi-volume} (@option{-M}) automatically. The
8896 @var{size} argument should then be the usable size of the tape in
8897 units of 1024 bytes. But for many devices, and floppy disks in
8898 particular, this option is never required for real, as far as we know.
8899
8900 @cindex Volume number file
8901 @cindex volno file
8902 @anchor{volno-file}
8903 @opindex volno-file
8904 The volume number used by @command{tar} in its tape-change prompt
8905 can be changed; if you give the
8906 @option{--volno-file=@var{file-of-number}} option, then
8907 @var{file-of-number} should be an unexisting file to be created, or
8908 else, a file already containing a decimal number. That number will be
8909 used as the volume number of the first volume written. When
8910 @command{tar} is finished, it will rewrite the file with the
8911 now-current volume number. (This does not change the volume number
8912 written on a tape label, as per @ref{label}, it @emph{only} affects
8913 the number used in the prompt.)
8914
8915 If you want @command{tar} to cycle through a series of files or tape
8916 drives, there are three approaches to choose from. First of all, you
8917 can give @command{tar} multiple @option{--file} options. In this case
8918 the specified files will be used, in sequence, as the successive
8919 volumes of the archive. Only when the first one in the sequence needs
8920 to be used again will @command{tar} prompt for a tape change (or run
8921 the info script). Secondly, you can use the @samp{n} response to the
8922 tape-change prompt, and, finally, you can use an info script, that
8923 writes new archive name to file descriptor. The following example
8924 illustrates this approach:
8925
8926 @smallexample
8927 @group
8928 #! /bin/sh
8929 echo Preparing volume $TAR_VOLUME of $TAR_ARCHIVE.
8930
8931 name=`expr $TAR_ARCHIVE : '\(.*\)-.*'`
8932 case $TAR_SUBCOMMAND in
8933 -c) ;;
8934 -d|-x|-t) test -r $@{name:-$TAR_ARCHIVE@}-$TAR_VOLUME || exit 1
8935 ;;
8936 *) exit 1
8937 esac
8938
8939 echo $@{name:-$TAR_ARCHIVE@}-$TAR_VOLUME >&3
8940 @end group
8941 @end smallexample
8942
8943 Each volume of a multi-volume archive is an independent @command{tar}
8944 archive, complete in itself. For example, you can list or extract any
8945 volume alone; just don't specify @option{--multi-volume}
8946 (@option{-M}). However, if one file in the archive is split across
8947 volumes, the only way to extract it successfully is with a
8948 multi-volume extract command @option{--extract --multi-volume}
8949 (@option{-xM}) starting on or before the volume where the file begins.
8950
8951 For example, let's presume someone has two tape drives on a system
8952 named @file{/dev/tape0} and @file{/dev/tape1}. For having @GNUTAR{}
8953 to switch to the second drive when it needs to write the
8954 second tape, and then back to the first tape, etc., just do either of:
8955
8956 @smallexample
8957 $ @kbd{tar --create --multi-volume --file=/dev/tape0 --file=/dev/tape1 @var{files}}
8958 $ @kbd{tar cMff /dev/tape0 /dev/tape1 @var{files}}
8959 @end smallexample
8960
8961 @menu
8962 * Multi-Volume Archives:: Archives Longer than One Tape or Disk
8963 * Tape Files:: Tape Files
8964 * Tarcat:: Concatenate Volumes into a Single Archive
8965
8966 @end menu
8967
8968 @node Multi-Volume Archives
8969 @subsection Archives Longer than One Tape or Disk
8970 @cindex Multi-volume archives
8971 @UNREVISED
8972
8973 @opindex multi-volume
8974 To create an archive that is larger than will fit on a single unit of
8975 the media, use the @option{--multi-volume} (@option{-M}) option in conjunction with
8976 the @option{--create} option (@pxref{create}). A @dfn{multi-volume}
8977 archive can be manipulated like any other archive (provided the
8978 @option{--multi-volume} option is specified), but is stored on more
8979 than one tape or disk.
8980
8981 When you specify @option{--multi-volume}, @command{tar} does not report an
8982 error when it comes to the end of an archive volume (when reading), or
8983 the end of the media (when writing). Instead, it prompts you to load
8984 a new storage volume. If the archive is on a magnetic tape, you
8985 should change tapes when you see the prompt; if the archive is on a
8986 floppy disk, you should change disks; etc.
8987
8988 You can read each individual volume of a multi-volume archive as if it
8989 were an archive by itself. For example, to list the contents of one
8990 volume, use @option{--list}, without @option{--multi-volume} specified.
8991 To extract an archive member from one volume (assuming it is described
8992 that volume), use @option{--extract}, again without
8993 @option{--multi-volume}.
8994
8995 If an archive member is split across volumes (ie. its entry begins on
8996 one volume of the media and ends on another), you need to specify
8997 @option{--multi-volume} to extract it successfully. In this case, you
8998 should load the volume where the archive member starts, and use
8999 @samp{tar --extract --multi-volume}---@command{tar} will prompt for later
9000 volumes as it needs them. @xref{extracting archives}, for more
9001 information about extracting archives.
9002
9003 @option{--info-script=@var{script-name}}
9004 (@option{--new-volume-script=@var{script-name}}, @option{-F
9005 @var{script-name}}) (@pxref{info-script}) is like
9006 @option{--multi-volume} (@option{-M}), except that @command{tar} does
9007 not prompt you directly to change media volumes when a volume is
9008 full---instead, @command{tar} runs commands you have stored in
9009 @var{script-name}. For example, this option can be used to eject
9010 cassettes, or to broadcast messages such as @samp{Someone please come
9011 change my tape} when performing unattended backups. When
9012 @var{script-name} is done, @command{tar} will assume that the media
9013 has been changed.
9014
9015 Multi-volume archives can be modified like any other archive. To add
9016 files to a multi-volume archive, you need to only mount the last
9017 volume of the archive media (and new volumes, if needed). For all
9018 other operations, you need to use the entire archive.
9019
9020 If a multi-volume archive was labeled using
9021 @option{--label=@var{archive-label}} (@option{-V @var{archive-label}})
9022 (@pxref{label}) when it was created, @command{tar} will not
9023 automatically label volumes which are added later. To label
9024 subsequent volumes, specify @option{--label=@var{archive-label}} again
9025 in conjunction with the @option{--append}, @option{--update} or
9026 @option{--concatenate} operation.
9027
9028 @cindex Labeling multi-volume archives
9029 @FIXME{example}
9030
9031 @FIXME{There should be a sample program here, including an exit
9032 before end. Is the exit status even checked in tar? :-(}
9033
9034 @table @option
9035 @item --multi-volume
9036 @itemx -M
9037 Creates a multi-volume archive, when used in conjunction with
9038 @option{--create} (@option{-c}). To perform any other operation on a multi-volume
9039 archive, specify @option{--multi-volume} in conjunction with that
9040 operation.
9041
9042 @item --info-script=@var{program-file}
9043 @itemx --new-volume-script=@var{program-file}
9044 @itemx -F @var{program-file}
9045 Creates a multi-volume archive via a script. Used in conjunction with
9046 @option{--create} (@option{-c}). @xref{info-script}, dor a detailed discussion.
9047 @end table
9048
9049 Beware that there is @emph{no} real standard about the proper way, for
9050 a @command{tar} archive, to span volume boundaries. If you have a
9051 multi-volume created by some vendor's @command{tar}, there is almost
9052 no chance you could read all the volumes with @GNUTAR{}.
9053 The converse is also true: you may not expect
9054 multi-volume archives created by @GNUTAR{} to be
9055 fully recovered by vendor's @command{tar}. Since there is little
9056 chance that, in mixed system configurations, some vendor's
9057 @command{tar} will work on another vendor's machine, and there is a
9058 great chance that @GNUTAR{} will work on most of
9059 them, your best bet is to install @GNUTAR{} on all
9060 machines between which you know exchange of files is possible.
9061
9062 @node Tape Files
9063 @subsection Tape Files
9064 @UNREVISED
9065
9066 To give the archive a name which will be recorded in it, use the
9067 @option{--label=@var{volume-label}} (@option{-V @var{volume-label}})
9068 option. This will write a special block identifying
9069 @var{volume-label} as the name of the archive to the front of the
9070 archive which will be displayed when the archive is listed with
9071 @option{--list}. If you are creating a multi-volume archive with
9072 @option{--multi-volume} (@pxref{Using Multiple Tapes}), then the
9073 volume label will have @samp{Volume @var{nnn}} appended to the name
9074 you give, where @var{nnn} is the number of the volume of the archive.
9075 (If you use the @option{--label=@var{volume-label}}) option when
9076 reading an archive, it checks to make sure the label on the tape
9077 matches the one you give. @xref{label}.
9078
9079 When @command{tar} writes an archive to tape, it creates a single
9080 tape file. If multiple archives are written to the same tape, one
9081 after the other, they each get written as separate tape files. When
9082 extracting, it is necessary to position the tape at the right place
9083 before running @command{tar}. To do this, use the @command{mt} command.
9084 For more information on the @command{mt} command and on the organization
9085 of tapes into a sequence of tape files, see @ref{mt}.
9086
9087 People seem to often do:
9088
9089 @smallexample
9090 @kbd{--label="@var{some-prefix} `date +@var{some-format}`"}
9091 @end smallexample
9092
9093 or such, for pushing a common date in all volumes or an archive set.
9094
9095 @node Tarcat
9096 @subsection Concatenate Volumes into a Single Archive
9097
9098 @pindex tarcat
9099 Sometimes it is necessary to convert existing @GNUTAR{} multi-volume
9100 archive to a single @command{tar} archive. Simply concatenating all
9101 volumes into one will not work, since each volume carries an additional
9102 information at the beginning. @GNUTAR{} is shipped with the shell
9103 script @command{tarcat} designed for this purpose.
9104
9105 The script takes a list of files comprising a multi-volume archive
9106 and creates the resulting archive at the standard output. For example:
9107
9108 @smallexample
9109 @kbd{tarcat vol.1 vol.2 vol.3 | tar tf -}
9110 @end smallexample
9111
9112 The script implements a simple heuristics to determine the format of
9113 the first volume file and to decide how to process the rest of the
9114 files. However, it makes no attempt to verify whether the files are
9115 given in order or even if they are valid @command{tar} archives.
9116 It uses @command{dd} and does not filter its standard error, so you
9117 will usually see lots of spurious messages.
9118
9119 @FIXME{The script is not installed. Should we install it?}
9120
9121 @node label
9122 @section Including a Label in the Archive
9123 @cindex Labeling an archive
9124 @cindex Labels on the archive media
9125 @UNREVISED
9126
9127 @opindex label
9128 To avoid problems caused by misplaced paper labels on the archive
9129 media, you can include a @dfn{label} entry---an archive member which
9130 contains the name of the archive---in the archive itself. Use the
9131 @option{--label=@var{archive-label}} (@option{-V @var{archive-label}})
9132 option in conjunction with the @option{--create} operation to include
9133 a label entry in the archive as it is being created.
9134
9135 @table @option
9136 @item --label=@var{archive-label}
9137 @itemx -V @var{archive-label}
9138 Includes an @dfn{archive-label} at the beginning of the archive when
9139 the archive is being created, when used in conjunction with the
9140 @option{--create} operation. Checks to make sure the archive label
9141 matches the one specified (when used in conjunction with any other
9142 operation.
9143 @end table
9144
9145 If you create an archive using both
9146 @option{--label=@var{archive-label}} (@option{-V @var{archive-label}})
9147 and @option{--multi-volume} (@option{-M}), each volume of the archive
9148 will have an archive label of the form @samp{@var{archive-label}
9149 Volume @var{n}}, where @var{n} is 1 for the first volume, 2 for the
9150 next, and so on. @xref{Using Multiple Tapes}, for information on
9151 creating multiple volume archives.
9152
9153 @cindex Volume label, listing
9154 @cindex Listing volume label
9155 The volume label will be displayed by @option{--list} along with
9156 the file contents. If verbose display is requested, it will also be
9157 explicitely marked as in the example below:
9158
9159 @smallexample
9160 @group
9161 $ @kbd{tar --verbose --list --file=iamanarchive}
9162 V--------- 0 0 0 1992-03-07 12:01 iamalabel--Volume Header--
9163 -rw-r--r-- ringo user 40 1990-05-21 13:30 iamafilename
9164 @end group
9165 @end smallexample
9166
9167 @opindex test-label
9168 @anchor{--test-label option}
9169 However, @option{--list} option will cause listing entire
9170 contents of the archive, which may be undesirable (for example, if the
9171 archive is stored on a tape). You can request checking only the volume
9172 by specifying @option{--test-label} option. This option reads only the
9173 first block of an archive, so it can be used with slow storage
9174 devices. For example:
9175
9176 @smallexample
9177 @group
9178 $ @kbd{tar --test-label --file=iamanarchive}
9179 iamalabel
9180 @end group
9181 @end smallexample
9182
9183 If @option{--test-label} is used with a single command line
9184 argument, @command{tar} compares the volume label with the
9185 argument. It exits with code 0 if the two strings match, and with code
9186 2 otherwise. In this case no output is displayed. For example:
9187
9188 @smallexample
9189 @group
9190 $ @kbd{tar --test-label --file=iamanarchive 'iamalable'}
9191 @result{} 0
9192 $ @kbd{tar --test-label --file=iamanarchive 'iamalable' alabel}
9193 @result{} 1
9194 @end group
9195 @end smallexample
9196
9197 If you request any operation, other than @option{--create}, along
9198 with using @option{--label} option, @command{tar} will first check if
9199 the archive label matches the one specified and will refuse to proceed
9200 if it does not. Use this as a safety precaution to avoid accidentally
9201 overwriting existing archives. For example, if you wish to add files
9202 to @file{archive}, presumably labelled with string @samp{My volume},
9203 you will get:
9204
9205 @smallexample
9206 @group
9207 $ @kbd{tar -rf archive --label 'My volume' .}
9208 tar: Archive not labeled to match `My volume'
9209 @end group
9210 @end smallexample
9211
9212 @noindent
9213 in case its label does not match. This will work even if
9214 @file{archive} is not labelled at all.
9215
9216 Similarly, @command{tar} will refuse to list or extract the
9217 archive if its label doesn't match the @var{archive-label}
9218 specified. In those cases, @var{archive-label} argument is interpreted
9219 as a globbing-style pattern which must match the actual magnetic
9220 volume label. @xref{exclude}, for a precise description of how match
9221 is attempted@footnote{Previous versions of @command{tar} used full
9222 regular expression matching, or before that, only exact string
9223 matching, instead of wildcard matchers. We decided for the sake of
9224 simplicity to use a uniform matching device through
9225 @command{tar}.}. If the switch @option{--multi-volume} (@option{-M}) is being used,
9226 the volume label matcher will also suffix @var{archive-label} by
9227 @w{@samp{ Volume [1-9]*}} if the initial match fails, before giving
9228 up. Since the volume numbering is automatically added in labels at
9229 creation time, it sounded logical to equally help the user taking care
9230 of it when the archive is being read.
9231
9232 The @option{--label} was once called @option{--volume}, but is not
9233 available under that name anymore.
9234
9235 You can also use @option{--label} to get a common information on
9236 all tapes of a series. For having this information different in each
9237 series created through a single script used on a regular basis, just
9238 manage to get some date string as part of the label. For example:
9239
9240 @smallexample
9241 @group
9242 $ @kbd{tar cfMV /dev/tape "Daily backup for `date +%Y-%m-%d`"}
9243 $ @kbd{tar --create --file=/dev/tape --multi-volume \
9244 --volume="Daily backup for `date +%Y-%m-%d`"}
9245 @end group
9246 @end smallexample
9247
9248 Also note that each label has its own date and time, which corresponds
9249 to when @GNUTAR{} initially attempted to write it,
9250 often soon after the operator launches @command{tar} or types the
9251 carriage return telling that the next tape is ready. Comparing date
9252 labels does give an idea of tape throughput only if the delays for
9253 rewinding tapes and the operator switching them were negligible, which
9254 is usually not the case.
9255
9256 @node verify
9257 @section Verifying Data as It is Stored
9258 @cindex Verifying a write operation
9259 @cindex Double-checking a write operation
9260
9261 @table @option
9262 @item -W
9263 @itemx --verify
9264 @opindex verify, short description
9265 Attempt to verify the archive after writing.
9266 @end table
9267
9268 This option causes @command{tar} to verify the archive after writing it.
9269 Each volume is checked after it is written, and any discrepancies
9270 are recorded on the standard error output.
9271
9272 Verification requires that the archive be on a back-space-able medium.
9273 This means pipes, some cartridge tape drives, and some other devices
9274 cannot be verified.
9275
9276 You can insure the accuracy of an archive by comparing files in the
9277 system with archive members. @command{tar} can compare an archive to the
9278 file system as the archive is being written, to verify a write
9279 operation, or can compare a previously written archive, to insure that
9280 it is up to date.
9281
9282 @opindex verify, using with @option{--create}
9283 @opindex create, using with @option{--verify}
9284 To check for discrepancies in an archive immediately after it is
9285 written, use the @option{--verify} (@option{-W}) option in conjunction with
9286 the @option{--create} operation. When this option is
9287 specified, @command{tar} checks archive members against their counterparts
9288 in the file system, and reports discrepancies on the standard error.
9289
9290 To verify an archive, you must be able to read it from before the end
9291 of the last written entry. This option is useful for detecting data
9292 errors on some tapes. Archives written to pipes, some cartridge tape
9293 drives, and some other devices cannot be verified.
9294
9295 One can explicitly compare an already made archive with the file
9296 system by using the @option{--compare} (@option{--diff}, @option{-d})
9297 option, instead of using the more automatic @option{--verify} option.
9298 @xref{compare}.
9299
9300 Note that these two options have a slightly different intent. The
9301 @option{--compare} option checks how identical are the logical contents of some
9302 archive with what is on your disks, while the @option{--verify} option is
9303 really for checking if the physical contents agree and if the recording
9304 media itself is of dependable quality. So, for the @option{--verify}
9305 operation, @command{tar} tries to defeat all in-memory cache pertaining to
9306 the archive, while it lets the speed optimization undisturbed for the
9307 @option{--compare} option. If you nevertheless use @option{--compare} for
9308 media verification, you may have to defeat the in-memory cache yourself,
9309 maybe by opening and reclosing the door latch of your recording unit,
9310 forcing some doubt in your operating system about the fact this is really
9311 the same volume as the one just written or read.
9312
9313 The @option{--verify} option would not be necessary if drivers were indeed
9314 able to detect dependably all write failures. This sometimes require many
9315 magnetic heads, some able to read after the writes occurred. One would
9316 not say that drivers unable to detect all cases are necessarily flawed,
9317 as long as programming is concerned.
9318
9319 The @option{--verify} (@option{-W}) option will not work in
9320 conjunction with the @option{--multi-volume} (@option{-M}) option or
9321 the @option{--append} (@option{-r}), @option{--update} (@option{-u})
9322 and @option{--delete} operations. @xref{Operations}, for more
9323 information on these operations.
9324
9325 Also, since @command{tar} normally strips leading @samp{/} from file
9326 names (@pxref{absolute}), a command like @samp{tar --verify -cf
9327 /tmp/foo.tar /etc} will work as desired only if the working directory is
9328 @file{/}, as @command{tar} uses the archive's relative member names
9329 (e.g., @file{etc/motd}) when verifying the archive.
9330
9331 @node Write Protection
9332 @section Write Protection
9333
9334 Almost all tapes and diskettes, and in a few rare cases, even disks can
9335 be @dfn{write protected}, to protect data on them from being changed.
9336 Once an archive is written, you should write protect the media to prevent
9337 the archive from being accidentally overwritten or deleted. (This will
9338 protect the archive from being changed with a tape or floppy drive---it
9339 will not protect it from magnet fields or other physical hazards).
9340
9341 The write protection device itself is usually an integral part of the
9342 physical media, and can be a two position (write enabled/write
9343 disabled) switch, a notch which can be popped out or covered, a ring
9344 which can be removed from the center of a tape reel, or some other
9345 changeable feature.
9346
9347 @node Changes
9348 @appendix Changes
9349
9350 This appendix lists some important user-visible changes between
9351 version @GNUTAR{} @value{VERSION} and previous versions. An up-to-date
9352 version of this document is available at
9353 @uref{http://www.gnu.org/@/software/@/tar/manual/changes.html,the
9354 @GNUTAR{} documentation page}.
9355
9356 @table @asis
9357 @item Use of short option @option{-o}.
9358
9359 Earlier versions of @GNUTAR{} understood @option{-o} command line
9360 option as a synonym for @option{--old-archive}.
9361
9362 @GNUTAR{} starting from version 1.13.90 understands this option as
9363 a synonym for @option{--no-same-owner}. This is compatible with
9364 UNIX98 @command{tar} implementations.
9365
9366 However, to facilitate transition, @option{-o} option retains its
9367 old semantics when it is used with one of archive-creation commands.
9368 Users are encouraged to use @option{--format=oldgnu} instead.
9369
9370 It is especially important, since versions of @acronym{GNU} Automake
9371 up to and including 1.8.4 invoke tar with this option to produce
9372 distribution tarballs. @xref{Formats,v7}, for the detailed discussion
9373 of this issue and its implications.
9374
9375 Future versions of @GNUTAR{} will understand @option{-o} only as a
9376 synonym for @option{--no-same-owner}.
9377
9378 @item Use of short option @option{-l}
9379
9380 Earlier versions of @GNUTAR{} understood @option{-l} option as a
9381 synonym for @option{--one-file-system}. Such usage is deprecated.
9382 For compatibility with other implementations future versions of
9383 @GNUTAR{} will understand this option as a synonym for
9384 @option{--check-links}.
9385
9386 @item Use of options @option{--portability} and @option{--old-archive}
9387
9388 These options are deprecated. Please use @option{--format=v7} instead.
9389
9390 @item Use of option @option{--posix}
9391
9392 This option is deprecated. Please use @option{--format=posix} instead.
9393 @end table
9394
9395 @node Genfile
9396 @appendix Genfile
9397 @include genfile.texi
9398
9399 @node Snapshot Files
9400 @appendix Format of the Incremental Snapshot Files
9401 @include snapshot.texi
9402
9403 @node Free Software Needs Free Documentation
9404 @appendix Free Software Needs Free Documentation
9405 @include freemanuals.texi
9406
9407 @node Copying This Manual
9408 @appendix Copying This Manual
9409
9410 @menu
9411 * GNU Free Documentation License:: License for copying this manual
9412 @end menu
9413
9414 @include fdl.texi
9415
9416 @node Index of Command Line Options
9417 @appendix Index of Command Line Options
9418
9419 This appendix contains an index of all @GNUTAR{} long command line
9420 options. The options are listed without the preceeding double-dash.
9421
9422 @FIXME{@itemize
9423 @item Make sure @emph{all} options are indexed.
9424 @item Provide an index of short options
9425 @end itemize}
9426
9427 @printindex op
9428
9429 @node Index
9430 @appendix Index
9431
9432 @printindex cp
9433
9434 @summarycontents
9435 @contents
9436 @bye
9437
9438 @c Local variables:
9439 @c texinfo-column-for-description: 32
9440 @c End:
9441
This page took 0.439724 seconds and 5 git commands to generate.