Guides / Wget

wget has 09287310928723874018274 options and I can never remember the ones I actually use


  • --wait=seconds -- set this to at least 1, maybe higher, so you don't DDoS the website

  • --random-wait -- randomly varies each pause between 0.5*wait and 1.5*wait (where wait is the --wait value) if you wanna be sneakier

  • -r -- turn on recursive retrieving. You almost certainly want to limit this in some capacity, such as

    • -l depth -- limit recursion to depth levels (the default maximum is 5)

  • this whole section of the man page is worth copying in full

   Recursive Accept/Reject Options
     -A acclist --accept acclist
     -R rejlist --reject rejlist
         Specify comma-separated lists of file name suffixes or patterns to
         accept or reject. Note that if any of the wildcard characters, *, ?,
         [ or ], appear in an element of acclist or rejlist, it will be
         treated as a pattern, rather than a suffix.  In this case, you have
         to enclose the pattern into quotes to prevent your shell from
         expanding it, like in -A "*.mp3" or -A '*.mp3'.

     --accept-regex urlregex
     --reject-regex urlregex
         Specify a regular expression to accept or reject the complete URL.

     --regex-type regextype
         Specify the regular expression type.  Possible types are posix or
         pcre.  Note that to be able to use pcre type, wget has to be compiled
         with libpcre support.

     -np
     --no-parent
         Do not ever ascend to the parent directory when retrieving
         recursively.  This is a useful option, since it guarantees that only
         the files below a certain hierarchy will be downloaded.
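
Putting the options above together, here is a rough sketch of a polite recursive download. Only the wget flags come from the notes above. The function name polite_fetch, the WGET override, the depth of 2, the '*.mp3' pattern and the example.com URL are all made up for illustration, so adjust them to taste.

```shell
# Sketch only: polite_fetch and the WGET override are hypothetical helpers,
# not part of wget itself.
#
#   --wait=1       pause 1 second between requests
#   --random-wait  vary each pause between 0.5 and 1.5 times the --wait value
#   -r -l 2        recurse, but at most 2 levels deep (assumed depth)
#   -np            never ascend to the parent directory
#   -A '*.mp3'     accept only this pattern (assumed); quoted so the shell
#                  does not expand it
#
# $1 is the start URL. Set WGET=echo to preview the arguments
# instead of downloading anything.
polite_fetch() {
  "${WGET:-wget}" \
    --wait=1 \
    --random-wait \
    -r -l 2 \
    -np \
    -A '*.mp3' \
    "$1"
}

# Example (hypothetical URL):
#   polite_fetch "https://example.com/music/"
```

Wrapping the command in a function is just a way to avoid retyping the flags; the same options work fine pasted directly on the command line.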