# lesspipe.sh, a preprocessor for less

![GitHub Release](https://img.shields.io/github/v/release/wofr06/lesspipe) ![GitHub License](https://img.shields.io/github/license/wofr06/lesspipe) [![Ask DeepWiki](https://deepwiki.com/badge.svg)](https://deepwiki.com/wofr06/lesspipe) [![CodeFactor](https://www.codefactor.io/repository/github/wofr06/lesspipe/badge)](https://www.codefactor.io/repository/github/wofr06/lesspipe) ![Static Badge](https://img.shields.io/badge/Author-Wolfgang_Friebel-purple)


Latest version available as:
 [zip file on GitHub](https://github.com/wofr06/lesspipe.sh/archive/lesspipe.zip)
and the [repository on GitHub](https://github.com/wofr06/lesspipe)

The main web page is on [lesspipe.org](https://lesspipe.org)

A thorough analysis of the program has been generated by Devin at [deepwiki](https://deepwiki.com/wofr06/lesspipe/1-overview)

The development version can be cloned using git:
 `git clone https://github.com/wofr06/lesspipe.git`
You can report bugs or make proposals to improve lesspipe on GitHub or contact
the author by email [(wp.friebel@gmail.com)](mailto:wp.friebel@gmail.com).

## Contents

 0. Motivation
 1. Introduction
 2. Usage
 3. Required programs
 4. Supported file formats
    - Supported compression methods and archive formats
    - List of preprocessed file types
    - Conversion of files with alternate character encoding
 5. Colorizing the output
    - Syntax highlighting
      - Syntax highlighting choices
      - List of supported languages
    - Colored Directory listing
    - Colored listing of tar file contents
 6. Calling less from standard input
 7. Displaying files with special characters in the file name
 8. Tab completion for zsh and bash
 9. User defined filtering
 10. Debugging
 11. (Old) documentation about lesspipe
 12. External links
    -   URLs to some utilities
    -  References
 13. Contributors

## 0. Motivation

 If you use

 - the pager `less` in the command line,
 - the version control system `git`,
 - the text editor `Vim` or
 - the mail client `mutt`,

 then lesspipe.sh enables these programs to *read* non-text files, such as:

 - PDFs,
 - (Microsoft or LibreOffice) Office documents, or even
 - media (such as JPG or PNG images, MP3 audio or video) files

 where *read* means,

 - (format and) show the contained text (of a document or tag in a media file),
   or
 - show sensible file information (such as length of the video).

 To enable `less` respectively `git`, `Vim` or `mutt` to read non-text files by
 lesspipe.sh, see

 - Section 2 on the Usage of lesspipe.sh, respectively
 - the Wiki at [GitHub](https://github.com/wofr06/lesspipe/wiki)

  For the text and info extraction, lesspipe.sh will depend on external tools,
  but many use cases are covered by an installation of

  - LibreOffice and a common text browser (such as `lynx`),
  - pdftotext, and
  - mediainfo (or exiftool).

## 1. Introduction

 To browse files under UNIX the excellent viewer less [1] can be used. By
 setting the environment variable **LESSOPEN**, less can be enhanced by external
 filters to become even more powerful. Most Linux distributions come already
 with a "lesspipe.sh" that covers the most common situations.

 The input filter for less described here is called "lesspipe.sh". It is able
 to process a wide variety of file formats. It enables users to deeply inspect
 archives and to display the contents of files in archives without having to
 unpack them before. That means file contents can be properly interpreted even
 if the files are compressed and contained in a hierarchy of archives (often
 found in RPM or DEB archives containing source tarballs). The filter is easily
 extensible for new formats.

 The input filter which is also called "lesspipe.sh" is a bash script, but
 works as well as a zsh script.

 The filter does different things depending on the file format. In most cases
 it is determined on the output of the `file --mime` command [2], that
 returns the mime type. In some cases the mime type is too vague and then
 the `file` command yielding a textual description or the file suffix is used
 to determine what to display. As a last resort the file extension is consulted.

 By default less wraps long lines unless called with the option -S or
 --chop-long-lines. That can be changed interactively by typing -S followed by
 ENTER when viewing files with long lines. It is e.g. quite useful for tabular
 display of csv files with many columns.

 Auxiliary programs are provided to enhance the TAB completion mechanism in
 bash and zsh shells and to help colorizing the filtered output.

## 2. Usage

 (see also the man page lesspipe.1)

 To activate lesspipe.sh the environment variable **LESSOPEN** has to be defined
 in the following way:
```
        LESSOPEN="|lesspipe.sh %s"; export LESSOPEN  # (sh like shells)
```
 If `lesspipe.sh` is not in the UNIX search path or if the wrong `lesspipe.sh`
 is found in the search path, then the full path to `lesspipe.sh` should be
 given in the above commands and the invocations below. The commands work only
 in the described manner if the file name is lesspipe.sh.

 If it is installed under a different name then calling it without an argument
 will work as a filter with LESSQUIET set and expecting input from STDIN.

 The command to set **LESSOPEN** can also be displayed by calling `lesspipe.sh`
 without arguments. This can even be used to set **LESSOPEN** directly:
```
        eval "$(lesspipe.sh)"             # (bash) or
        lesspipe.sh | source /dev/stdin   # (zsh)
```
 or add the settings to .bashrc and .zshrc:
```
        lesspipe.sh >> ~/.bashrc        # (bash) and
        lesspipe.sh >> ~/.zshrc         # (zsh) and
```
 Several Linux distributions do now set **LESSOPEN** by default and if the
 contents of the variable is not referring to this lesspipe.sh version, it
 has to be redefined to get the functionality described here.

 As `lesspipe.sh` is accepting only a single argument, a hierarchical list of
 file names has to be separated by a non-blank character. A colon is rarely
 found in file names, therefore it has been chosen as the separator character.
 If a file name does however contain at least one isolated colon, the equal
 sign (=) can be used as an alternate separator character. At each stage in
 extracting files from such a hierarchy, the file type is determined. This
 guarantees a correct processing and display of the last file in the list.

 To view files in archives, the following command can be used:
```
        less archive_file:contained_file
```
 This can be used to extract files from an archive:
```
        less archive_file:contained_file > extracted_file
```
 For extracting files less is not required, that can be done also using:
```
        lesspipe.sh archive_file:contained_file > extracted_file
```
 Even a file in an archive, that itself is contained in yet
 another archive can be viewed this way:
```
        less super_archive:archive_file:contained_file
```
 The script is able to extract files up to a depth of 6 where applying a
 decompression algorithm counts as a separate level. In a few rare cases, the
 file command does not recognize the correct format.
 In such cases, the filtering can be suppressed by a trailing colon on the file
 name. That can also be used to output the original unmodified file or to
 suppress syntax highlighting (see below).

 Several environment variables can influence the behavior of lesspipe.sh.

 **LESSQUIET** will suppress additional output not belonging to the file contents
 if set to a non-empty value.

 **LESS** can be used to switch on colored less output (should contain -R).

 **LESSCOLORIZER** can be set to prefer a highlighting program from the following
 choices (`nvimpager` `batcat` `bat` `pygmentize` `e2ansi-cat` `source-highlight` `vimcolor` `code2color`).
 Otherwise the first program in that list that is installed will be used, with
 the caveat that `bat` will use the [ansi theme](https://github.com/wofr06/lesspipe/issues/155#issuecomment-2312972276) instead of its default colors.

## 3. Required programs

 Most of the programs are checked for its existence before they get called
 in lesspipe.sh. However some of the programs are assumed to always be
 installed. That is foremost `bash` or `zsh` (have the appropriate first line
 in the script), then `file` and other utilities like `cat`,
 `grep`, `ln`, `ls`, `mkdir`, `rm`, `strings`, `tar` and `tr`.
 For testing lesspipe.sh, the scripts `test.sh` and `vimcolor` can be used.
 The `bash`/`zsh` script vimcolor requires a working `vim` or `nvim`.

## 4. Supported file formats

 Currently `lesspipe.sh` [3] supports the following compression methods
 and file types (i.e. the file contents gets transformed by `lesspipe.sh`):

### 4.1 Supported compression methods and archive formats
- gzip, compress	requires `gzip`
- bzip2			requires `bzip2`
- lzma			requires `lzma` or `7z`
- xz			requires `xz` or `7z`
- zstd			requires `zstd`
- brotli		requires `bro`
- lz4			requires `lz4`
- zlib			requires `pigz` or `zlib-flate` (plain compressed files)
- dmg			requires `7z` (zlib compressed archives)
- tar			requires optionally `archive_color` for colorizing
- ar library		requires `bsdtar` or `ar`
- zip archive		requires `bsdtar` or `unzip`
- jar archive		requires `bsdtar` or `unzip`
- rar archive		requires `bsdtar` or `unrar` or `rar`
- 7-zip archive		requires `7zz` or `7zr` or `7z` or `7za`
- lzip archive		requires `lzip`
- iso images		requires `bsdtar` or `isoinfo` or `7z`
- rpm			requires `rpm2cpio` and `cpio` or `bsdtar`
- Debian		requires `bsdtar` or `ar`
- cab			requires `cabextract` or `7z`
- cpio			requires `cpio` or `bsdtar` or `7z`
- appimage		requires `unsquashfs`
- snap			requires `snap` and `unsquashfs`

### 4.2 List of preprocessed file types
- directory		displayed using `ls -lA`
- nroff(man)		requires `mandoc` or `man` or `groff`
- shared library	requires `nm`
- MS Word (doc)		requires `wvText` or `catdoc` or `libreoffice`
- Powerpoint (ppt)	requires `libreoffice`
- Excel (xls)		requires `in2csv` (csvkit) or `xls2csv`
- odt			requires `pandoc` or `odt2txt` or `libreoffice`
- odp			requires `libreoffice`
- ods			requires `xlscat` or `libreoffice`
- MS Word (docx)	requires `pandoc` or `docx2txt` or `libreoffice`
- Powerpoint (pptx)	requires `pptx2md` or `libreoffice`
- Excel (xlsx)		requires `xlsx2csv` (>= 0.8.3) or `in2csv` or `xlscat` or `excel2csv` or `libreoffice`
- csv			requires `csvtable` or `csvlook` or `column` or `pandoc`
- rtf			requires `unrtf` or `libreoffice`
- epub			requires `pandoc`
- html,xml		requires one of `xmq`, `w3m`, `lynx`, `elinks` or `html2text`
- pdf			requires `pdftotext` or `pdftohtml`
- perl pod		requires `pod2text` or `perldoc`
- dvi			requires `dvi2tty`
- djvu			requires `djvutxt`
- ps			requires `ps2ascii` (from the gs package)
- mp3			requires `ffprobe` or `eyeD3` or `id3v2`
- multimedia formats	requires `ffprobe` or `mediainfo` or `exiftools`
- image formats		requires `mediainfo` or `exiftools` or `identify`
- hdf, nc4		requires `h5dump` or `ncdump` (NetCDF format)
- crt, pem, csr, crl	requires `openssl`
- matlab		requires `matdump`
- Jupyter notebook	requires `pandoc`
- markdown		requires `pandoc` or a suitable colorizer (e.g. `pygmentize`)
- java.class		requires `procyon`
- MacOS X plist		requires `plistutil`
- binary data		requires `strings`
- json			requires `jq`
- device tree blobs	requires `dtc` (extension dtb or dts)

 Files in HTML and Perl POD format are always rendered. Sometimes
 however the original contents of the file should be viewed instead.
 That can be achieved by appending a colon to the file name. If the correct
 file type (html, pod) follows, the output can get colorized (see also
 the section below).

 If the binary xmq is installed, then xml is rendered differently, so that
 the xml structure is better recognized. A similar display for html contents
 using xmq is achieved by appending a colon to the file name. To get the
 original html file contents, two colons are required in this case.
 The color scheme used is for a light background. To select the dark color
 scheme the ENV variable XMQ_THEME has to be set to the value dark.

### 4.3 Conversion of files with alternate character encoding
 If the file utility reports text with an encoding different from the one
 used in the terminal, then the text will be transformed using `iconv` into
 the default encoding. This does assume the file command gets the file
 encoding right, which can be wrong in some situations. An appended colon
 to the file name does suppress the conversion.

## 5. Colorizing the output

 Syntax highlighting and other methods of colorizing the output
 is only activated if the environment variable **LESS** is existing and contains
 the option -R (or -r) or less is called with one of these options.

 The display of wrapped long lines and moving backward in a file using the
 option -r can give weird output and is not recommended. For an explanation see
 [the FAQ on www.greenwoodsoftware.com](https://www.greenwoodsoftware.com/less/faq.html#dashr)

### 5.1 Syntax highlighting
 Syntax highlighting is not always wanted and can slow down the file display
 considerably. Therefore, an ENV variable **LESS_MAXSIZE_COLOR** (default: 200000)
 has been introduced. Larger files do not get colorized. The highlighting can
 be switched off as well by appending a colon after the file name. This is also
 true for colored archive listings.
 If the wrong language was chosen for syntax highlighting or no language was
 recognized, then the correct one can be forced by appending a colon and a
 suffix to the file name (example for a file with perl syntax):
```
        less plfile:pl or less plfile:perl (depending on the colorizer)
```
#### 5.1.1 Syntax highlighting choices
 The filter is able to do syntax highlighting for a wide variety of file
 types. If installed, `nvimpager` is used for colorizing the output. If
 not, `bat`/`batcat`, `pygmentize`, `e2ansi-cat`, `source-highlight`, `vimcolor`
 and `code2color` are
 tried. Among these colorizers a preferred one can be forced for colorizing
 by setting the ENV variable **LESSCOLORIZER** to the name of the colorizer.
 For `pygmentize` and `bat/batcat` a restricted set of options can be added:
```
        LESSCOLORIZER='pygmentize -O style=foo'
        LESSCOLORIZER='bat --style=foo --theme=bar' # --theme=default for default theme
```
 Much better syntax highlighting is obtained using the `less` emulation of `vim`:
 The editor `vim` comes with a file `less.sh`, e.g. on Ubuntu located in
 /usr/share/vim/vimXX/macros (XX being the version number). Assuming that file
 location, a function `lessc` (bash, zsh, ksh users)
```
        lessc () { /usr/share/vim/vimXX/macros/less.sh "$@"}
```
 is defined and `lessc filename` is used to view the colorful file contents.
 The same can be achieved using less and `vimcolor`, but that is much slower.

#### 5.1.2 List of supported languages
 To see which languages are supported the list can be printed using the
following colorizer commands:

```
bat --list-languages
batcat --list-languages
pygmentize -L lexers
M-x -mode [TAB] within emacs
source-highlight --lang-list
code2color -h
vimcolor -L (this command is valid both for vimcolor and nvimpager)
```

### 5.2 Colored Directory listing
 Depending on the operating system ls is called with appropriate options to
 produce colored output.

### 5.3 Colored listing of tar file contents
 If the executable `archive_color` is installed, then the listing of tar file
 contents is colored in a similar fashion as directory contents.

## 6. Log file handling
 Log files are typically recognized as plain text files only. Within
 `lesspipe.sh` a file is recognized as a log file if it has the extension .log
 or the string :log is appended to the file name.

### 6.1 Watching growing files
 For all plain text files not recognized as log files a colorizer is tried
 nevertheless as it sometimes can detect the correct file type and color the
 file aproppriately. This does however destroy the abilty to watch growing
 files using the F command within less.
 To retain the ability to watch growing files without the .log extension
 or when the log file colorizer `tspin` was called for log files,
 a single colon has to be given to the less call as the second argument.
 It can also be achieved by listing such files in a file `.lessignore` in the
 users home directory. The entries are fully specified path names or contain
 globbing characters. Such a file could look as follows:

```
 syslog
 */syslog
 *.syslog
```
 This would allow watching growing files with the name syslog, all syslog
 files in arbitrary directories and all files with the ending .syslog.

### 6.2 Colorizing log files
 If the file is recognized as log file (see above), the program `tspin` is
 installed and the file size is below the value **LESS_MAXSIZE_COLOR**
 (default 200000 bytes) then file gets colorized.

 As already described, colorizing can be suppressed by appending a colon to
 the file name. To colorize e.g the syslog the following command can be used:
```
        less /var/log/syslog:log
```

 To colorize growing files you have to use `tspin` directly such as:
```
        tspin -f /var/log/syslog
```
## 7. Calling less from standard input

 Normally `lesspipe.sh` is not called when less is used within a pipe, such as
```
        cat somefile | less
```
 This restriction is removed when the **LESSOPEN** variable starts with the
 characters |- or ||-.
 Then the colon notation for extracting and displaying files in archives
 does not work. As a way out `lesspipe.sh` analyses the command line and looks
 for the last argument given to less. If it starts with a colon, it is
 interpreted from `lesspipe.sh` as a continuation of the first parameter.
 Examples:
```
        cat some_c_file | less - :c          # equivalent to less some_c_file:c
        cat archive | less - :contained_file # extracts a file from the archive
```
## 8. Displaying files with special characters in the file name

 Shell meta characters in file names: space (frequently used in windows
 file names),

 the characters | & ; ( ) ` < > " ' # ~ = $ * ? [ ] or \\

 must be escaped by a \ when used in the shell, e.g. `less a\ b.tar.gz:a\\"b`
 will display the file a"b contained in the gzipped tar archive a b.tar.gz.

## 9. Tab completion for zsh and bash

 An existing `zsh` completion script has been enhanced to provide tab completion
 within archives, similar to what is possible with the `tar` command completion.
 A `bash` completion script has been modeled loosely after the `zsh` completion.

 In both shells it is now possible to complete contents of archive format files
 such as tar, zip, rpm, deb files etc. This works as well in compressed files
 (e.g. tar.gz) and in chained archives, e.g.in source rpm files containing
 tar.gz files.

 To make it work, the script `lesscomplete` has to be executable and must be
 found in one of the directories listed in the `$PATH` environment variable.

 In bash, the bash-completion (usually a package with that name) has to be
 installed.
 In zsh, the completion system has to be loaded and the menucomplete option
 has to be set. That can be achieved with
```
        autoload compinit
        compinit -Uz
        setopt menucomplete
```
 The completion mechanism is triggered after entering a colon or an equal sign
 as for example in

```
        less archive_file:<TAB>                   # and then
        less archive_file:partial_result<TAB>
        less archive_file:contained_archive:<TAB> # etc.
```

 In bash, the completion mechanism has to be initialized as well. This is done
 sourcing the bash-completion script usually found in /usr/share/bash-completion
 or /etc. On many system this is already done in the system initialisation
 scripts.
## 10. User defined filtering

 The lesspipe.sh filtering can be replaced or enhanced by a user defined
 program. Such a program has to be called either `.lessfilter` (and be placed in
 the user's home directory), or `lessfilter` (and be accessible from a directory
 mentioned in the environment variable `PATH`).
 That program has to be executable and has to end with an exit code 0, if the
 filtering was done within that script. Otherwise, a nonzero exit code means
 the filtering is left to lesspipe.sh. For an example see the man page.

 This mechanism can be used to add filtering for new formats or e.g. inhibit
 filtering for certain file types.

## 11. Debugging

 If the script does not work as expected for a given file contents, one could
 try to output the commands executed by lesspipe.sh. That is achieved by

```
        bash -x lesspipe.sh file_name > /dev/null # or zsh -x
```
 It is also possible setting temporarily the **LESSOPEN** variable to e.g.
```
        LESSOPEN='|bash -x /usr/local/bin/lesspipe.sh %s'
```
 and then use `less` with the file to be displayed. The normal output goes to
 STDOUT and the commands executed to STDERR.

## 12. (Old) documentation about lesspipe

 In English

- [CERN Computer Newsletter 2002/1](https://ref.web.cern.ch/CERN/CNL/2002/001/unix-less/)
- [mentioned in Bash Cookbook, O'Reilly 2007](https://www.oreilly.com/library/view/bash-cookbook/0596526784/ch08s15.html)
- [Article on GitHub, 2025](https://github.com/wofr06/lesspipe/wiki/article_en)

 In German:

- german.txt (distributed with lesspipe, not updated)
- [Bessere Sicht (Linux Magazin 01/2001)](https://www.linux-magazin.de/ausgaben/2001/01/bessere-sicht/)
- [lesspipe (Linux User 04/2002](https://www.linux-community.de/ausgaben/linuxuser/2002/04/lesspipe/)
- [lesspipe 2.0 (Linux Magazin 07/2022)](https://www.linux-magazin.de/ausgaben/2022/07/lesspipe-2-0/)
- [Artikel auf GitHub, 2025](https://github.com/wofr06/lesspipe/wiki/article_de)

## 13. External links

(last checked: Mar 22 2026):

### 13.1 URLs to some utilities (with last known release)

- [7z, 7zr](https://github.com/ip7z/7zip) (2026)
- [7zz](https://sourceforge.net/projects/sevenzip/) (2026)
- archive_color - modified version of tarcolor (contained in this package)
- [cabextract](https://www.cabextract.org.uk/) (2023)
- [catdoc,xls2csv](https://www.wagner.pp.ru/~vitus/software/catdoc/) (2016)
- [code2color](https://github.com/wofr06/code2color) (2026)
- [csvtable](https://github.com/wofr06/csvtable) (2024)
- [djvutxt](https://djvu.sourceforge.net/) (2025)
- [docx2txt](https://docx2txt.sourceforge.net/) (2014)
- [dtc](https://git.kernel.org/cgit/utils/dtc/dtc.git) (2026)
- [dvi2tty](https://www.ctan.org/tex-archive/dviware/dvi2tty/) (2016)
- [e2ansi-cat](https://github.com/Lindydancer/e2ansi) (2026)
- [excel2csv](https://github.com/informationsea/excel2csv) (2018)
- [html2text](https://github.com/grobian/html2text) (2025)
- [id3v2](https://id3v2.sourceforge.net/) (2010)
- [lzip](https://www.nongnu.org/lzip/lzip.html) (2026)
- [matdump](https://sourceforge.net/projects/matio/) (2026)
- [mediainfo](https://mediaarea.net/MediaInfo/) (2026)
- [nvimpager](https://github.com/lucc/nvimpager) (2026)
- [odt2txt](https://github.com/dstosberg/odt2txt) (2017)
- [pandoc](https://pandoc.org/) (2026)
- [pigz](https://github.com/madler/pigz) (2023)
- [pptx2md](https://github.com/ssine/pptx2md) (2024)
- [tspin](https://github.com/bensadeh/tailspin) (2026)
- [unrtf](https://www.gnu.org/software/unrtf/) (2018)
- [vimcolor](https://github.com/wofr06/vimcolor) (2026)
- [wvText](https://github.com/AbiWord/wv/) (2014)
- [xlscat](https://metacpan.org/dist/Spreadsheet-Read) (2026)
- [xlsx2csv (>=0.8.3)](https://github.com/dilshod/xlsx2csv) (2026)
- [xmq](https://github.com/libxmq/xmq/releases/latest) (2026)
- [zlib-flate](https://github.com/qpdf/qpdf)(2026)

### 13.2 References
- [1] [www.greenwoodsoftware.com/less](https://www.greenwoodsoftware.com/less) (less)
- [2] [www.darwinsys.com/file](https://www.darwinsys.com/file/) (file)
- [3] [github.com/wofr06/lesspipe](https://github.com/wofr06/lesspipe) (lesspipe)
- [5] [www.palfrader.org/code2html](https://www.palfrader.org/code2html/)	(code2html)

## 14. Contributors

 The script lesspipe.sh is constantly enhanced by suggestions from users and
 reporting bugs or deficiencies. Thanks to (in alphabetical order):
 (contributors after Sep 2015 see GitHub history)

 Marc Abramowitz, James Ahlborn, Sören Andersen, Andrew Barnert,
 Peter D. Barnes, Jr., Eduard Bloch, Mathieu Bouillaguet, Florian Cramer,
 Philippe Defert, Antonio Diaz Diaz, Bastian Fuchs, Matt Ghali, Carl Greco,
 Stephan Hegel, Michel Hermier, Tobias Hoffmann, Christian Höltje,
 Jürgen Kahnert, Sebastian Kayser, Ben Kibbey, Peter Kostka,
 Heinrich Kuettler, Antony Lee, Vincent Lefèvre, David Leverton, Jay Levitt,
 Vladimir Linek, Oliver Mangold, Istvan Marko, Markus Meyer, Remi Mommsen,
 Derek B. Noonburg, Martin Otte, Jim Pryor, Slaven Rezic, Daniel Risacher,
 Jens Schleusener, Ken Teague, Matt Thompson, Paul Townsend, Petr Uzel,
 Chelban Vasile, Götz Waschk, Michael Wiedmann, Dale Wijnand, Peter Wu.
