mirror of
https://github.com/python/cpython.git
synced 2024-12-04 07:15:09 +08:00
836f5433f7
`glob.glob()` currently calls itself recursively to build a list of matches of the dirname part of the pattern and then filters by the basename part. This is effectively BFS. ``glob.glob('*/*/*/*/*/foo')`` will build a huge list of all directories 5 levels deep even if only a handful of them contain a ``foo`` entry. A generator-based recusion would never have to store these list at once by implementing DFS. This patch converts the `glob` function to an `iglob` recursive generator . `glob()` now just returns ``list(iglob(pattern))``. I also cleaned up the code a bit (reduced duplicate `has_magic()` checks and created a second `glob0` helper func so that the main loop need not be duplicated). Thanks to Cherniavsky Beni for the patch!
52 lines
1.8 KiB
TeX
52 lines
1.8 KiB
TeX
\section{\module{glob} ---
|
|
\UNIX{} style pathname pattern expansion}
|
|
|
|
\declaremodule{standard}{glob}
|
|
\modulesynopsis{\UNIX\ shell style pathname pattern expansion.}
|
|
|
|
|
|
The \module{glob} module finds all the pathnames matching a specified
|
|
pattern according to the rules used by the \UNIX{} shell. No tilde
|
|
expansion is done, but \code{*}, \code{?}, and character ranges
|
|
expressed with \code{[]} will be correctly matched. This is done by
|
|
using the \function{os.listdir()} and \function{fnmatch.fnmatch()}
|
|
functions in concert, and not by actually invoking a subshell. (For
|
|
tilde and shell variable expansion, use \function{os.path.expanduser()}
|
|
and \function{os.path.expandvars()}.)
|
|
\index{filenames!pathname expansion}
|
|
|
|
\begin{funcdesc}{glob}{pathname}
|
|
Return a possibly-empty list of path names that match \var{pathname},
|
|
which must be a string containing a path specification.
|
|
\var{pathname} can be either absolute (like
|
|
\file{/usr/src/Python-1.5/Makefile}) or relative (like
|
|
\file{../../Tools/*/*.gif}), and can contain shell-style wildcards.
|
|
Broken symlinks are included in the results (as in the shell).
|
|
\end{funcdesc}
|
|
|
|
\begin{funcdesc}{iglob}{pathname}
|
|
Return an iterator which yields the same values as \function{glob()}
|
|
without actually storing them all simultaneously.
|
|
\versionadded{2.5}
|
|
\end{funcdesc}
|
|
|
|
For example, consider a directory containing only the following files:
|
|
\file{1.gif}, \file{2.txt}, and \file{card.gif}. \function{glob()}
|
|
will produce the following results. Notice how any leading components
|
|
of the path are preserved.
|
|
|
|
\begin{verbatim}
|
|
>>> import glob
|
|
>>> glob.glob('./[0-9].*')
|
|
['./1.gif', './2.txt']
|
|
>>> glob.glob('*.gif')
|
|
['1.gif', 'card.gif']
|
|
>>> glob.glob('?.gif')
|
|
['1.gif']
|
|
\end{verbatim}
|
|
|
|
|
|
\begin{seealso}
|
|
\seemodule{fnmatch}{Shell-style filename (not path) expansion}
|
|
\end{seealso}
|