dan [Fri, 20 Oct 2000 07:28:57 +0000 (00:28 -0700)]
[svn] ftp.c (getftp): Applied Piotr Sulecki <Piotr.Sulecki@ios.krakow.pl>'s
patch to work around FTP servers that incorrectly respond to the
"REST" command with the remaining size rather than the total
file size.
dan [Fri, 20 Oct 2000 06:59:30 +0000 (23:59 -0700)]
[svn] Applied John Daily <jdaily@cyberdude.com>'s patch for his "quad" commands (which
I renamed to "lockable_boolean") in the .wgetrc (currently just passive_ftp).
Wrote documentation for his changes and added the missing "referer" to the
.wgetrc section (making mention of the issue of "referrer" being the correct
spelling).
dan [Mon, 9 Oct 2000 22:43:11 +0000 (15:43 -0700)]
[svn] --retr-symlinks was not previously documented properly. Based on my newfound
understanding of what its limitations are, added a TODO item. Also made a minor
tweak in html.c to silence a warning.
dan [Wed, 30 Aug 2000 11:26:21 +0000 (04:26 -0700)]
[svn] * *.{gmo,po,pot}: Regenerated after modifying wget --help output.
* ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.
* html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
Wrapped some > 80-column lines. When -p is specified and we're at a
leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
<LINK REL="stylesheet">.
* html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.
* init.c: Added new -p / --page-requisites / page_requisites option.
* main.c (print_help): Clarified that -l inf and -l 0 both allow
infinite recursion. Changed the unhelpful --mirrior description
to simply give the options it's equivalent to. Added new -p option.
(main): Added some comments; handle new -p / --page-requisites.
* options.h (struct options): Added new page_requisites field.
* recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
Calculate and pass down new `dash_p_leaf_HTML' parameter to
get_urls_html(). Use new INFINITE_RECURSION #define.
* retr.c: Changed "URL-s" to "URLs". get_urls_html() now takes
final `dash_p_leaf_HTML' parameter.
* url.c: get_urls_html() and htmlfindurl() now take final
`dash_p_leaf_HTML' parameter.
* url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.
* wget.h: Added some comments and new INFINITE_RECURSION #define.
* wget.texi (Recursive Retrieval Options): Documented new -p option.
dan [Fri, 25 Aug 2000 22:23:22 +0000 (15:23 -0700)]
[svn] * MACHINES: Alphabetized, changed "architectures" to "OSes and architecures",
added missing company names, removed needless ^L, made AIX entry more general
to reflect my testing, removed the non-factual "this version of", and fixed
some grammatical errors.
dan [Wed, 23 Aug 2000 03:11:55 +0000 (20:11 -0700)]
[svn] * main.c (print_help): Modified -nc description to mention that it also prevents
the creation of multiple versions of the same file with ".<number>" suffixes.
dan [Wed, 23 Aug 2000 03:04:20 +0000 (20:04 -0700)]
[svn] * wget.texi (Download Options): --no-clobber's documentation was
severely lacking -- ameliorated the situation. Some of the
previously-undocumented stuff (like the multiple-file-version numeric-suffixing)
that's now mentioned for the first (and only) time in the -nc documentation
should probably be mentioned elsewhere, but due to the way that wget.texi's
hierarchy is laid out, I had a hard time finding anywhere else appropriate.
dan [Fri, 21 Jul 2000 23:16:10 +0000 (16:16 -0700)]
[svn] TODO: -k should convert "hostless absolute" URLs, like <A HREF="/index.html">.
However, Brian McMahon <bm@iucr.org> wants the old incorrect behavior to still
be available as an option, as he depends on it to allow mirrors of his site to
send CGI queries to his original site, but still get graphics off of the mirror
site. Perhaps this would be better dealt with by adding an option to tell -k
not to convert certain URLs patterns?
dan [Fri, 9 Jun 2000 21:40:26 +0000 (14:40 -0700)]
[svn] Oops. Meant to check this change in with my last one, but the commit wouldn't
go through without doing an update first, and I forgot to make the change the
second time. Just changed an erroneous main.c (main) to main.c (print_help).
dan [Fri, 9 Jun 2000 20:59:56 +0000 (13:59 -0700)]
[svn] Heiko's --help output for --waitretry was over 80 columns. Shortened. It also
said that 0 seconds are waited after the first retry, which I believe is
incorrect and does not match what's written elsewhere (e.g. wget.texi). Changed
to 1.
dan [Tue, 23 May 2000 02:29:38 +0000 (19:29 -0700)]
[svn] Damir Dzeko <ddzeko@zesoi.fer.hr> did not document his new --referer option.
Did so (--help output and wget.texi). Also tweaked --help output for --execute.
dan [Thu, 20 Apr 2000 22:06:43 +0000 (15:06 -0700)]
[svn] Really using "stepwise refinement" on this file, aren't I? Realized during the
usual shower meditation session this morning that I hadn't changed the text 'The
"wait" command above' when I moved waitretry up so "wait" no longer _is_ above.
Fixed to say "below" and got a little more wordy on the "linear backoff".
dan [Wed, 19 Apr 2000 02:09:28 +0000 (19:09 -0700)]
[svn] Realized this morning in the shower that I put the (uncommented) waitretry
setting in the local section, which is prefaced by a comment saying that stuff
in there shouldn't be set in the global file. Moved the setting to the global
section.
dan [Thu, 13 Apr 2000 19:37:52 +0000 (12:37 -0700)]
[svn] * Makefile.in (sample.wgetrc.munged_for_texi_inclusion): Added build,
dependencies, and distclean cleanup of this new file.
* sample.wgetrc: Uncommented waitretry and set it to 10, clarified some wording,
and re-wrapped some text to 71 columns due to @sample indentation in
wget.texi.
* wget.texi: Herold further expounded on the behavior of waitretry -- reworded
docs again. Changed note saying _all_ lines in sample.wgetrc are commented
out. Don't have an entire hand- cut-and-pasted copy of sample.wgetrc in this
file -- use @include.
dan [Thu, 13 Apr 2000 04:37:51 +0000 (21:37 -0700)]
[svn] Makefile (install.wgetrc): I completely missed the message that the new wgetrc
wasn't being installed the first couple of times I ran `make install' after
changing sample.wgetrc. Added blank lines around the message and a "<Hit RETURN
to acknowledge>", and reworded the message to be a bit more clear.
dan [Thu, 13 Apr 2000 03:41:58 +0000 (20:41 -0700)]
[svn] Oops. I intentionally did my "cvs diff" before regenerating the .info* files
to make it easy to send a patch to the list, and thus those files weren't noted
as having been changed and I forgot to regenerate and commit them.
dan [Thu, 13 Apr 2000 01:42:34 +0000 (18:42 -0700)]
[svn] * sample.wgetrc: Added entries for backup_converted and waitretry.
* wget.texi (waitretry): Herold Heiko <Heiko.Herold@previnet.it>'s
new option was undocumented until now. Reworded the suggested documentation he
sent to the list.
dan [Thu, 6 Apr 2000 03:36:28 +0000 (20:36 -0700)]
[svn] Reworded the opening paragraph to reflect that there are now more developers
besides Hrvoje, and added the following three items I've been meaning to get to:
* Make -K compare X.orig to X and move the former on top of the latter if
they're the same, rather than leaving identical .orig files laying around.
* Add an option to save all text/html files with a .html extension so that when
grabbing the output of a dynamically-generated remote page, you'll end up with
a filename that will cause _your_ webserver to realize the saved static HTML
file isn't text/plain.
* Allow mirroring of FTP URLs where logging in puts you somewhere else besides
'/'.
dan [Wed, 5 Apr 2000 03:08:10 +0000 (20:08 -0700)]
[svn] * host.c (store_hostaddress): R. K. Owen's patch introduces a "left shift count
>= width of type" warning on 32-bit architectures. Got rid of it by tricking
the compiler w/ a variable.
* url.c (UNSAFE_CHAR): The macro didn't include all the illegal characters per
RFC1738, namely everything above '~'. It also generated a warning on OSes
where char =~ unsigned char. Fixed.
dan [Sat, 11 Mar 2000 06:48:06 +0000 (22:48 -0800)]
[svn] * TODO: Removed done item: we now have an option (-G) that makes it easy to
download a single HTML document and all its constituents.
* po/*.{gmo,po,pot}: Regenerated after adding new options.
* po/hr.po: Hrvoje forgot '\n's on his translations of my altered messages,
causing msgfmt to balk and `make install' to fail.
* wget.texi (Recursive Retrieval Options): In -K description, added a link to
the discussion of interaction with -N.
(Recursive Accept/Reject Options): Did some alphabetizing and added descriptions
of new --follow-tags and -G / --ignore-tags options.
(Following Links): Changed "the loads of" to "loads of".
(Wgetrc Commands): Added descriptions of new follow_tags and ignore_tags
commands.
* html.c (idmatch): Implemented checking of my new --follow-tags and
--ignore-tags options.
* init.c (commands): Added comment reminding people adding new entries doing
allocation to add corresponding freeing in cleanup().
(commands): Added new followtags and ignoretags commands.
(cleanup): Free storage for new followtags and ignoretags.
* main.c: Use of "comma-separated list" was random -- normalized it. Did some
alphabetization. Added comments pointing out "Options without arguments" and
"Options accepting an argument" sections of long_options[]. Added new options
--follow-tags and -G / --ignore-tags. Added comment that Damir's --referer is
currently undocumented. Added comment that Heiko's --waitretry is partially
undocumented (mentioned in --help but not in wget.texi). Moved improperly
sorted 24, 129, and 'G' cases.
* options.h (struct options): Added new fields follow_tags and ignore_tags.
* wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
dan [Thu, 2 Mar 2000 23:49:37 +0000 (15:49 -0800)]
[svn] Dan Berger responded to my email. Added his explanation of what his patch was
coded for (downloading StarOffice from Sun's website). He says he doesn't use
wget any more, so he won't be writing a patch that allows downloading that
without breaking anything (such a patch would apparently involve stopping
certain characters in the URL from being escaped).
dan [Thu, 2 Mar 2000 22:48:07 +0000 (14:48 -0800)]
[svn] Dan Berger's query string patch is totally bogus. If you have two different
URLs, gen_page.cgi?page1 and get_page.cgi?page2, they'll both be saved as
get_page.cgi and the second will overwrite the first. Also, parameters to
implicit CGIs, like "http://www.host.com/db/?2000-03-02" cause the URLs to be
printed with trailing garbage characters, and could seg fault. I'm not sure
what Dan had in mind with this patch (no explanatory comments), but I'm removing
it for now. If he can rewrite it so it doesn't break stuff, okay.
dan [Thu, 2 Mar 2000 21:17:47 +0000 (13:17 -0800)]
[svn] Hrvoje didn't regenerate the .info files after changing wget.texi.
Got rid of newly-introduced nested-if warnings in ftp.c and http.c. Fixed
apparently completely untested code in main.c that was trying to provide --wait
/ --waitretry backwards compatibility, but had multiple fundamental bugs.
dan [Thu, 2 Mar 2000 09:17:57 +0000 (01:17 -0800)]
[svn] Did a `make update-po` in here after finding out that I needed the GNU gettext
package to regenerate these files (after changing a couple of messages).
dan [Thu, 2 Mar 2000 06:33:48 +0000 (22:33 -0800)]
[svn] Implemented the item I formerly had in the TODO: When -K and -N are used
together, we compare local file X.orig (if extant) against server file X.
Previously -k and -N were worthless in combination because the local converted
files always differed from the server versions.
dan [Wed, 1 Mar 2000 01:03:39 +0000 (17:03 -0800)]
[svn] Upped version number from 1.5.3. to 1.5.3+dev. Because the development source
is available via anonymous CVS and desirable features are being added, it's
quite possible for end-users to be getting their hands on development versions.
They may report bugs, so if we don't change the version number, we'll have to
continually followup the statement "I'm using version 1.5.3" with the question
"The FTP archive or the CVS source?" Better to just make this development
version have a unique number. Once we're ready to actually release the next
version, we can up the version from 1.5.3+dev to 1.5.4, or 1.6, or whatever it
turns out to be (depending on how much development gets done).
Also made minor updates (dates, email addresses) to wget.texi.
dan [Wed, 1 Mar 2000 00:50:52 +0000 (16:50 -0800)]
[svn] Upped version number from 1.5.3. to 1.5.3+dev. Because the development source
is available via anonymous CVS and desirable features are being added, it's
quite possible for end-users to be getting their hands on development versions.
They may report bugs, so if we don't change the version number, we'll have to
continually followup the statement "I'm using version 1.5.3" with the question
"The FTP archive or the CVS source?" Better to just make this development
version have a unique number. Once we're ready to actually release the next
version, we can up the version from 1.5.3+dev to 1.5.4, or 1.6, or whatever it
turns out to be (depending on how much development gets done).
Also made minor updates (dates, email addresses) to wget.texi.
dan [Tue, 29 Feb 2000 19:24:17 +0000 (11:24 -0800)]
[svn] Really just a test to see if my write access works. Changed "through SSLeay" to
"through SSLeay or OpenSSL" (I believe someone's actually already gotten the
latter working, and hopefully they'll delete this item when they commit their
changes).