mirror of
https://github.com/Stichting-MINIX-Research-Foundation/netbsd.git
synced 2025-09-09 15:18:01 -04:00
184 lines
5.8 KiB
Groff
184 lines
5.8 KiB
Groff
.TH agrep 1 "November 21, 2004" "TRE agrep @PACKAGE_VERSION@"
|
|
.SH NAME
|
|
agrep \- print lines approximately matching a pattern
|
|
.SH SYNOPSIS
|
|
.B agrep
|
|
.RI [ OPTION ]...
|
|
.I PATTERN
|
|
.RI [ FILE ]...
|
|
.SH DESCRIPTION
|
|
Searches for approximate matches of PATTERN in each FILE or standard
|
|
input. Example: `agrep \-2 optimize foo.txt' outputs all lines in
|
|
file `foo.txt' that match "optimize" within two errors. E.g. lines
|
|
which contain "optimise", "optmise", and "opitmize" all match.
|
|
.SH OPTIONS
|
|
.SS "Regexp selection and interpretation:"
|
|
.TP
|
|
.BI \-e " PATTERN" "\fR,\fP \-\^\-regexp=" PATTERN
|
|
Use
|
|
.I PATTERN
|
|
as a regular expression; useful to protect patterns beginning with
|
|
.BR \- .
|
|
.TP
|
|
.BR \-i ", " \-\^\-ignore\-case
|
|
Ignore case distinctions (as defined by the current locale) in
|
|
.I PATTERN
|
|
and input files.
|
|
.TP
|
|
.BR \-k ", " \-\^\-literal
|
|
Treat
|
|
.I PATTERN
|
|
as a literal string, that is, a fixed string with no special
|
|
characters.
|
|
.TP
|
|
.BR \-w ", " \-\^\-word\-regexp
|
|
Force
|
|
.I PATTERN
|
|
to match only whole words. A "whole word" is a substring which either
|
|
starts at the beginning or the record or is preceded by a non-word
|
|
constituent character. Similarly, the substring must either end at
|
|
the end of the record or be followed by a non-word constituent
|
|
character. Word-constituent characters are alphanumerics (as
|
|
defined by the current locale) and the underscore character. Note
|
|
that the non-word constituent characters
|
|
.B must
|
|
surround the match; they cannot be counted as errors.
|
|
.SS "Approximate matching settings:"
|
|
.TP
|
|
.BI \-D " NUM" "\fR,\fP \-\^\-delete\-cost=" NUM
|
|
Set cost of missing characters to
|
|
.IR NUM .
|
|
.TP
|
|
.BI \-I " NUM" "\fR,\fP \-\^\-insert\-cost=" NUM
|
|
Set cost of extra characters to
|
|
.IR NUM .
|
|
.TP
|
|
.BI \-S " NUM" "\fR,\fP \-\^\-substitute\-cost=" NUM
|
|
Set cost of incorrect characters to
|
|
.IR NUM .
|
|
Note that a deletion (a missing character) and an insertion (an extra
|
|
character) together constitute a substituted character, but the cost
|
|
will be the that of a deletion and an insertion added together. Thus,
|
|
if the const of a substitution is set to be larger than the sum
|
|
of the costs of deletion and insertion, direct substitutions will
|
|
never be done.
|
|
.TP
|
|
.BI \-E " NUM" "\fR,\fP \-\^\-max\-errors=" NUM
|
|
Select records that have at most
|
|
.I NUM
|
|
errors.
|
|
.TP
|
|
-\fI#\fR
|
|
Select records that have at most \fI#\fR errors (\fI#\fR is a
|
|
digit between 0 and 9).
|
|
.SS "Miscellaneous:"
|
|
.TP
|
|
.BI \-d " PATTERN" "\fR,\fP \-\^\-delimiter=" PATTERN
|
|
Set the record delimiter regular expression to
|
|
.IR PATTERN .
|
|
The text between two delimiters, before the first delimiter, and after
|
|
the last delimiter is considered to be a record. The default record
|
|
delimiter is the regexp "\\n", so by default a record is a line.
|
|
.I PATTERN
|
|
can be any regular expression that does not match the empty string.
|
|
For example, using
|
|
.B \-d
|
|
"^From "
|
|
defines mail messages as records in a Mailbox format file.
|
|
.TP
|
|
.BR \-v ", " \-\^\-invert\-match
|
|
Select non-matching records instead of matching records.
|
|
.TP
|
|
.BR \-V ", " \-\^\-version
|
|
Print version information and exit.
|
|
.TP
|
|
.BR \-y ", " \-\^\-nothing
|
|
Does nothing. This options exists only for compatibility with the
|
|
non-free agrep program.
|
|
.TP
|
|
.BR \-\^\-help
|
|
Display a brief help message and exit.
|
|
.SS "Output control:"
|
|
.TP
|
|
.BR \-B ", " \-\^\-best\-match
|
|
Only output the best matching records, that is, the records with the
|
|
lowest cost. This is currently implemented by making two passes over
|
|
the input files and cannot be used when reading from standard input.
|
|
.TP
|
|
.BR \-\^\-color ", " \-\^\-colour
|
|
Highlight the matching strings in the output with a color marker. The
|
|
color string is taken from the
|
|
.BI GREP_COLOR
|
|
environment variable. The default color is red.
|
|
.TP
|
|
.BR \-c ", " \-\^\-count
|
|
Only print a count of matching records per each input file,
|
|
suppressing normal output.
|
|
.TP
|
|
.BR \-h ", " \-\^\-no\-filename
|
|
Suppress the prefixing filename on output when multiple files are
|
|
searched.
|
|
.TP
|
|
.BR \-H ", " \-\^\-with\-filename
|
|
Prefix each output record with the name of the input file where the
|
|
record was read from.
|
|
.TP
|
|
.BR \-l ", " \-\^\-files\-with\-matches
|
|
Only print the name of each input file which contains at least one
|
|
match, suppressing normal output. The scanning for each file will
|
|
stop on the first match.
|
|
.TP
|
|
.BR \-n ", " \-\^\-record\-number
|
|
Prefix each output record with its sequence number in the input file.
|
|
The number of the first record is 1.
|
|
.TP
|
|
.BR \-q ", " \-\^\-quiet ", " \-\^\-silent
|
|
Do not write anything to standard output. Exit immediately with zero
|
|
exit status if a match is found.
|
|
.TP
|
|
.BR \-s ", " \-\^\-show\-cost
|
|
Print match cost with output.
|
|
.TP
|
|
.BR \-\^\-show\-position
|
|
Prefix each output record with the start and end offset of the first
|
|
match within the record. The offset of the first character of the
|
|
record is 0. The end position is given as the offset of the first
|
|
character after the match.
|
|
.TP
|
|
.BR \-M ", " \-\^\-delimiter\-after
|
|
By default, the record delimiter is the newline character and is
|
|
output after the matching record. If
|
|
.B \-d
|
|
is used, the record delimiter will be output before the matching
|
|
record. This option causes the delimiter to be output after the
|
|
matching record.
|
|
.PP
|
|
With no
|
|
.IR FILE ,
|
|
or when
|
|
.I FILE
|
|
is -, reads standard input. If less than two
|
|
.IR FILE s
|
|
are given
|
|
.B \-h
|
|
is assumed, otherwise
|
|
.B \-H
|
|
is the default.
|
|
.SH DIAGNOSTICS
|
|
Exit status is 0 if a match is found, 1 for no match, and 2 if there
|
|
were errors. If
|
|
.B \-E
|
|
or -\fI#\fR is not specified, only exact matches are selected.
|
|
.PP
|
|
.I PATTERN
|
|
is a POSIX extended regular expression (ERE) with the TRE extensions.
|
|
.SH "REPORTING BUGS"
|
|
Report bugs to the TRE mailing list
|
|
.BR "" < tre-general@lists.laurikari.net >.
|
|
.SH COPYRIGHT
|
|
Copyright \(co 2002-2004 Ville Laurikari.
|
|
.br
|
|
This is free software, and comes with ABSOLUTELY NO WARRANTY.
|
|
You are welcome to redistribute this software under certain
|
|
conditions; see the source for the full license text.
|