1 2000-11-01 Hrvoje Niksic <hniksic@arsdigita.com>
3 * url.c (path_simplify_with_kludge): New function.
4 (path_simplify_with_kludge): Disable it. Instead...
5 (parse_dir): ...make sure that at this point the right thing is
6 done, i.e. that "query" part of the URL (?...) is always assigned
7 to the file, never to the directory portion of the path.
9 2000-11-01 Hrvoje Niksic <hniksic@arsdigita.com>
11 * retr.c (retrieve_url): Detect redirection cycles.
13 2000-11-01 Hrvoje Niksic <hniksic@arsdigita.com>
15 * url.c (get_urls_html): Decode HTML entities using
18 * html.c (htmlfindurl): Don't count the `#' in numeric entities
19 (&#NNN;) as an HTML fragemnt.
20 (html_decode_entities): New function.
22 2000-11-01 Hrvoje Niksic <hniksic@arsdigita.com>
24 * html.c (htmlfindurl): Fix recognition of # HTML fragments.
26 2000-11-01 Hrvoje Niksic <hniksic@arsdigita.com>
28 * url.c (construct): Rewritten for clarity. Avoids the
29 unnecessary copying and stack-allocation the old version
32 2000-10-31 Hrvoje Niksic <hniksic@arsdigita.com>
34 * ftp.c (getftp): Ditto.
36 * http.c (gethttp): Rewind the stream when retrying from scratch.
38 2000-10-31 Hrvoje Niksic <hniksic@arsdigita.com>
40 * retr.c (retrieve_url): Use url_concat() to handle relative
41 redirections instead of /ad hoc/ code.
43 * url.c (url_concat): New function encapsulating weird
45 (urllen_http_hack): New function.
46 (construct): When constructing new URLs, recognize that `?' does
47 not form part of the file name in HTTP.
49 2000-10-13 Adrian Aichner <adrian@xemacs.org>
51 * retr.c: Add msec timing support for WINDOWS.
52 * retr.c (reset_timer): GetSystemTime() on WINDOWS.
53 * retr.c (elapsed_time): Calculate delta time to msec on WINDOWS.
55 2000-10-27 Dan Harkless <dan-wget@dilvish.speed.net>
57 * retr.c (retrieve_url): Manually applied T. Bharath
58 <TBharath@responsenetworks.com>'s patch to get wget to grok
59 illegal relative URL redirects. Reformatted and re-commented it.
61 2000-10-23 Dan Harkless <dan-wget@dilvish.speed.net>
63 * connect.c (make_connection and bindport): Manually applied Rob
64 Mayoff <mayoff@dqd.com>'s 1.5.3 patch to add --bind-address,
65 changing coding style to GNU's.
67 * ftp.c (ftp_loop_internal): --delete-after wasn't implemented for
68 files downloaded via FTP. Per a comment, .listing files were not
69 counted towards number of bytes and files downloaded because they're
70 deleted anyway. Well, they aren't under -nr, so count them then.
72 * init.c: Manually applied Rob Mayoff's 1.5.3 patch to add
73 --bind-address, alphabetizing, changing coding style to GNU's,
74 commenting, and renaming cmd_ip_address() to cmd_address() to
75 imply hostnames also okay.
77 * main.c (main): --delete-after didn't delete the root of the
78 tree. Ignore --convert-links if --delete-after was specified.
79 Manually applied Rob Mayoff's 1.5.3 patch to add --bind-address,
80 fixing duplicate use of added-since-1.5.3 case value.
81 (print_help): Clarified that --delete-after deletes local files.
82 Rob forgot to add a line for his new --bind-address option.
84 * options.h (struct options): Manually applied Rob Mayoff's patch
85 to add --bind-address (bind_address structure member).
87 * recur.c (recursive_retrieve): Improved comment; added DEBUGP().
88 Ignore --convert-links if --delete-after was specified.
90 * retr.c (retrieve_from_file): Just added a DEBUGP().
92 2000-10-19 Dan Harkless <dan-wget@dilvish.speed.net>
94 * ftp.c (ftp_loop_internal): downloaded_file() enumerators changed.
95 (getftp): Applied Piotr Sulecki <Piotr.Sulecki@ios.krakow.pl>'s
96 patch to work around FTP servers that incorrectly respond to the
97 "REST" command with the remaining size rather than the total file size.
99 * http.c (gethttp): Improved a comment and added code to tack on
100 ".html" to text/html files without that extension when -E specified.
101 (http_loop): Use new downloaded_file() enumerators and deal with
102 the case of gethttp() called xrealloc() on u->local.
104 * init.c (commands): Added new "htmlextension" command.
105 Also renamed John Daily's cmd_quad() to the more descriptive
106 cmd_lockable_boolean(), alpha-sorted the CMD_DECLARE()s and
107 removed duplicate cmd_boolean() declaration.
109 * main.c (print_help): Added my new -E / --html-extension option.
110 (main): Undocumented --email-address option previously used -E synonym.
111 Stole it away for the much more deserving --html-extension's use.
113 * options.h (struct options): Added html_extension field.
115 * url.c (convert_links): URL X that we saved as X.html locally due
116 to -E needs to be backed up as X.orig, not X.html.orig. Added comments.
117 (downloaded_file): Now remembers if we added .html extension to a file.
119 * url.h (downloaded_file_t): Added extra enumerators to support above.
120 (downloaded_file): Now takes and returns a downloaded_file_t.
122 * wget.h (unnamed "dt" enum): Added ADDED_HTML_EXTENSION enumerator.
124 2000-10-09 Dan Harkless <dan-wget@dilvish.speed.net>
126 * html.c (htmlfindurl): Added unneeded initialization to quiet warning.
128 * main.c (print_help): Clarified what --retr-symlinks does.
130 2000-09-15 John Daily <jdaily@cyberdude.com>
132 * init.c: Add support for "always" and "never" values to allow
133 .wgetrc to override commandline (useful e.g. with .pm files
134 calling `wget --passive-ftp' when your firewall doesn't allow that).
136 * ftp.c (getftp): passive_ftp is first option to support always/never.
138 2000-08-30 Dan Harkless <dan-wget@dilvish.speed.net>
140 * ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.
142 * html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
143 Wrapped some > 80-column lines. When -p is specified and we're at a
144 leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
145 <LINK REL="stylesheet">.
147 * html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.
149 * init.c: Added new -p / --page-requisites / page_requisites option.
151 * main.c (print_help): Clarified that -l inf and -l 0 both allow
152 infinite recursion. Changed the unhelpful --mirrior description
153 to simply give the options it's equivalent to. Added new -p option.
154 (main): Added some comments; handle new -p / --page-requisites.
156 * options.h (struct options): Added new page_requisites field.
158 * recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
159 Calculate and pass down new `dash_p_leaf_HTML' parameter to
160 get_urls_html(). Use new INFINITE_RECURSION #define.
162 * retr.c: Changed "URL-s" to "URLs". get_urls_html() now takes
163 final `dash_p_leaf_HTML' parameter.
165 * url.c: get_urls_html() and htmlfindurl() now take final
166 `dash_p_leaf_HTML' parameter.
168 * url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.
170 * wget.h: Added some comments and new INFINITE_RECURSION #define.
172 2000-08-23 Dan Harkless <dan-wget@dilvish.speed.net>
174 * main.c (print_help): -B / --base was not mentioned.
176 2000-08-22 Dan Harkless <dan-wget@dilvish.speed.net>
178 * main.c (print_help): Modified -nc description to mention that it
179 also prevents the creation of multiple versions of the same file
180 with ".<number>" suffixes.
182 2000-07-14 Jan Prikryl <prikryl@cg.tuwien.ac.at>
184 * retr.c (retrieve_url): Consistently strdup opt.referer when
187 2000-06-09 Dan Harkless <dan-wget@dilvish.speed.net>
189 * main.c (print_help): --help output for --waitretry was over 80 cols.
191 2000-06-09 Hrvoje Niksic <hniksic@iskon.hr>
193 * url.c (encode_string): Fix comment.
194 Suggested by Herold Heiko <Heiko.Herold@previnet.it>.
196 2000-06-01 Const Kaplinsky <const@ce.cctpu.edu.ru>
198 * ftp.c (ftp_retrieve_list): Change permissions only on plain
201 2000-06-01 Hrvoje Niksic <hniksic@iskon.hr>
203 * url.c (str_url): Print the port number only if it's different
204 from the default port number for that protocol.
206 2000-05-22 Dan Harkless <dan-wget@dilvish.speed.net>
208 * main.c (print_help): Added --help line for Damir Dzeko
209 <ddzeko@zesoi.fer.hr>'s until-now-undocumented --referer option.
210 Removed comments that --referer and --waitretry were undocumented.
211 Changed "`.wgetrc' command" to "`.wgetrc'-style command" on --help
214 2000-05-18 Hrvoje Niksic <hniksic@iskon.hr>
216 * ftp.c (getftp): Ditto.
218 * http.c (gethttp): Check for return value of fclose/fflush.
220 2000-04-12 Hrvoje Niksic <hniksic@iskon.hr>
222 * host.c (store_hostaddress): Instead of shifting ADDR, start
223 copying from the correct address.
225 2000-04-12 Hrvoje Niksic <hniksic@iskon.hr>
227 * http.c (gethttp): Don't free REQUEST -- it was allocated with
229 Pointed out by Gisle Vanem <gvanem@eunet.no>.
231 2000-04-04 Dan Harkless <dan-wget@dilvish.speed.net>
233 * host.c (store_hostaddress): R. K. Owen's patch introduces a
234 "left shift count >= width of type" warning on 32-bit
235 architectures. Got rid of it by tricking the compiler w/ a variable.
237 * url.c (UNSAFE_CHAR): The macro didn't include all the illegal
238 characters per RFC1738, namely everything above '~'. It also
239 generated a warning on OSes where char =~ unsigned char. Fixed.
241 1998-10-17 Hrvoje Niksic <hniksic@srce.hr>
243 * http.c (http_process_type): Removed needless strdup(), a memory
246 1998-09-25 Hrvoje Niksic <hniksic@srce.hr>
248 * html.c (htmlfindurl): Set PH to the first occurrence of `#'.
250 1998-09-25 Simon Munton <simonm@m4data.co.uk>
252 * init.c (wgetrc_file_name): Don't free HOME under Windows.
254 1998-12-01 "R. K. Owen" <rkowen@Nersc.GOV>
256 * host.c (store_hostaddress): Fix for big endian 64-bit machines.
258 1998-12-01 Hrvoje Niksic <hniksic@srce.hr>
260 * url.c (UNSAFE_CHAR): New macro.
261 (contains_unsafe): Use it.
262 (encode_string): Ditto.
264 1998-12-01 Hrvoje Niksic <hniksic@srce.hr>
266 * main.c (i18n_initialize): Use LC_MESSAGES only if available.
268 2000-03-31 Hrvoje Niksic <hniksic@srce.hr>
270 * Use TOUPPER/TOLOWER.
272 1998-12-22 Alexander V. Lukyanov <lav@yars.free.net>
274 * ftp-opie.c (btoe): Zero-terminate OSTORE.
276 2000-03-21 Hrvoje Niksic <hniksic@iskon.hr>
278 * wget.h (DO_REALLOC_FROM_ALLOCA): Ditto.
280 * sysdep.h (ISALNUM): New macro.
284 2000-03-10 Dan Harkless <dan-wget@dilvish.speed.net>
286 * html.c (idmatch): Implemented checking of my new --follow-tags
287 and --ignore-tags options.
289 * init.c (commands): Added comment reminding people adding new
290 entries doing allocation to add corresponding freeing in cleanup().
291 (commands): Added new followtags and ignoretags commands.
292 (cleanup): Free storage for new followtags and ignoretags.
294 * main.c: Use of "comma-separated list" was random -- normalized
295 it. Did some alphabetization. Added comments pointing out
296 "Options without arguments" and "Options accepting an argument"
297 sections of long_options[]. Added new options --follow-tags and
298 -G / --ignore-tags. Added comment that Damir's --referer is
299 currently undocumented. Added comment that Heiko's --waitretry is
300 partially undocumented (mentioned in --help but not in
301 wget.texi). Moved improperly sorted 24, 129, and 'G' cases.
303 * options.h (struct options): Added new fields follow_tags and
306 * wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
308 2000-03-02 Dan Harkless <dan-wget@dilvish.speed.net>
310 * ftp.c (ftp_loop_internal): Heiko introduced "suggest explicit
311 braces to avoid ambiguous `else'" warnings. Eliminated them.
313 * http.c (gethttp): Dan Berger's query string patch is totally
314 bogus. If you have two different URLs, gen_page.cgi?page1 and
315 get_page.cgi?page2, they'll both be saved as get_page.cgi and the
316 second will overwrite the first. Also, parameters to implicit
317 CGIs, like "http://www.host.com/db/?2000-03-02" cause the URLs to
318 be printed with trailing garbage characters, and could seg fault.
319 Backing out the patch, which Dan B. informed me by email was just
320 a kludge to download StarOffice from Sun made necessary due to
321 wget's unconditional escaping of certain characters (room for an
323 (http_loop): Heiko introduced "suggest explicit braces to avoid
324 ambiguous `else'" warnings. Eliminated them.
326 * main.c: Heiko's --wait / --waitretry backwards compatibility
327 code looks to have been totally untested -- automatic variable
328 'wr' was used without being initialized, and a long int was passed
329 into setval()'s char* val parameter.
331 * recur.c (parse_robots): Applied Edward J. Sabol
332 <sabol@alderaan.gsfc.nasa.gov>'s patch for Guan Yang's reported
333 problem with "User-agent:<space>*<space>" lines in robots.txt.
335 * url.c (parseurl, str_url): Removing Dan Berger's code (see
336 http.c above for explanation).
338 1999-08-25 Heiko Herold <Heiko.Herold@previnet.it>
340 * ftp.c: Respect new option waitretry.
342 2000-01-30 Damir Dzeko <ddzeko@zesoi.fer.hr>
344 * http.c (gethttp): Send custom Referer, if required.
346 1999-09-24 Charles G Waldman <cgw@fnal.gov>
348 * netrc.c (parse_netrc): Allow passwords to contain spaces.
350 * netrc.c (parse_netrc): New function.
352 1999-09-17 Dan Berger <dberger@ix.netcom.com>
354 * http.c (gethttp): Send it.
356 * url.c (parseurl): Detect query string in HTTP URL-s.
359 2000-03-02 HIROSE Masaaki <hirose31@t3.rim.or.jp>
361 * html.c (html_allow): Add <link href=...> and <script src=...>.
363 1999-05-02 andrew deryabin <djsf@softhome.net>
365 * http.c (gethttp): Specify port in `Host' header only if it's
368 1998-11-03 Edward J. Sabol <sabol@alderaan.gsfc.nasa.gov>
370 * recur.c (recursive_retrieve): If a finite maximum depth is
371 specified, and we're are already at that depth, don't download the
372 HTML file for parsing.
374 2000-03-01 Dan Harkless <dan-wget@dilvish.speed.net>
376 * ftp.c (ftp_loop_internal): Call new downloaded_file() function,
377 even though we don't do conversion on HTML files retrieved via
378 FTP, so _current_ usage of downloaded_file() makes this call unneeded.
379 (ftp_retrieve_list): Added a comment saying where we need to
380 stat() a .orig file if FTP'd HTML file conversion is ever implemented.
381 (ftp_retrieve_list): "Local file '%s' is more recent," is sometimes
382 a lie -- reworded as "Server file no newer than local file '%s' --".
384 * http.c (http_loop): Fixed a typo and clarified a comment.
385 (http_loop): When -K and -N are specified together, compare size
386 and timestamp of server file X against local file X.orig (if
387 extant) rather than converted local file X.
388 (http_loop): "Local file '%s' is more recent," is sometimes a lie
389 -- reworded as "Server file no newer than local file '%s' --".
390 (http_loop): Call new downloaded_file() function to prevent
391 wrongful overwriting of .orig file when -N is specified.
393 * url.c (convert_links): When -K specified, only rename X to
394 X.orig if downloaded_file() returns TRUE. Otherwise when we skip
395 file X due to -N, we clobber an X.orig from a previous invocation.
396 (convert_links): Call the failsafe xstrdup(), not the real strdup().
397 (convert_links): Added a note asking anyone who understands how
398 multiple URLs can correspond to a single file to comment it.
399 (downloaded_file): Added this new function.
401 * url.h (downloaded_file): Added prototype for this new function
402 as well as its downloaded_file_t enum type.
404 * wget.h (boolean): Added this new typedef and TRUE and FALSE #defines.
406 2000-02-29 Dan Harkless <dan-wget@dilvish.speed.net>
408 * version.c: Upped version to developer-only "1.5.3+dev".
410 2000-02-18 Dan Harkless <dan-wget@dilvish.speed.net>
412 * init.c (backup_converted): Added this new option.
414 * main.c (-K / --backup-converted): Added this new option.
416 * options.h (backup_converted): Added this new option.
418 * url.c (convert_links): When backup_converted is specified, save
419 file X as X.orig before converting.
421 * url.h (urlpos): Fixed typo -- said "Rekative" instead of "Relative".
423 1998-09-21 Hrvoje Niksic <hniksic@srce.hr>
425 * version.c: Wget 1.5.3 is released.
427 1998-09-21 Hrvoje Niksic <hniksic@srce.hr>
429 * host.c (ftp_getaddress): Don't warn when reverse-lookup of local
430 address doesn't yield FQDN.
432 1998-09-21 Andreas Schwab <schwab@issan.informatik.uni-dortmund.de>
434 * cmpt.c (strerror): Fix declaration of sys_errlist.
436 1998-09-11 Hrvoje Niksic <hniksic@srce.hr>
438 * main.c (main): Don't use an array subscript as the first
439 argument to STRDUP_ALLOCA.
442 1998-09-11 Szakacsits Szabolcs <szaka@sienet.hu>
444 * html.c (htmlfindurl): Download table background.
446 1998-09-11 Hans Grobler <grobh@conde.ee.sun.ac.za>
448 * init.c (parse_line): Would free *com before allocating it.
449 (parse_line): Would free com instead of *com.
451 1998-09-10 Howard Gayle <howard@fjst.com>
453 * url.c (get_urls_html): Would drop the last character of the
456 1998-09-10 Hrvoje Niksic <hniksic@srce.hr>
458 * http.c (http_loop): Don't print status code if quiet.
460 1998-09-10 Kaveh R. Ghazi <ghazi@caip.rutgers.edu>
462 * log.c: Use <stdarg.h> only when __STDC__.
464 1998-09-10 Adam D. Moss <adam@foxbox.org>
466 * html.c (htmlfindurl): Download <layer src=...>.
468 1998-09-10 Howard Gayle <howard@fjst.com>
470 * ftp.c (ftp_retrieve_list): Don't update the time stamp of a file
473 1998-06-27 Hrvoje Niksic <hniksic@srce.hr>
475 * utils.c: Include <libc.h> on NeXT.
477 1998-06-26 Heinz Salzmann <heinz.salzmann@intermetall.de>
479 * url.c (get_urls_html): Fix calculation of URL position.
481 1998-06-23 Hrvoje Niksic <hniksic@srce.hr>
483 * version.c: Wget 1.5.2 is released.
485 1998-06-23 Dave Love <d.love@dl.ac.uk>
487 * ftp.c, init.c, netrc.c: Include errno.h.
489 * http.c: Include errno.h and time header.
491 * Makefile.in (exext): Define.
492 (install.bin, uninstall.bin): Use it.
494 1998-06-21 Hrvoje Niksic <hniksic@srce.hr>
496 * http.c (http_loop): Don't attempt to compare local and remote
497 sizes if the remote size is unknown.
499 1998-06-16 Hrvoje Niksic <hniksic@srce.hr>
501 * url.c (get_urls_html): Use malloc() instead of alloca in the
504 1998-06-13 Hrvoje Niksic <hniksic@srce.hr>
506 * version.c: Wget 1.5.2-b4 is released.
508 1998-06-13 Hrvoje Niksic <hniksic@srce.hr>
510 * url.c (get_urls_html): Ignore spaces before and after the URI.
512 1998-06-08 Wanderlei Antonio Cavassin <cavassin@conectiva.com.br>
514 * ftp.c (getftp): Translate `done'.
516 1998-06-06 Hrvoje Niksic <hniksic@srce.hr>
518 * version.c: Wget 1.5.2-b3 is released.
520 1998-06-06 Alexander Kourakos <awk@bnt.com>
522 * init.c (cleanup): Close dfp, don't free it.
524 1998-06-06 Hrvoje Niksic <hniksic@srce.hr>
526 * utils.c (make_directory): Twiddle.
528 * config.h.in: Added template for access().
530 1998-06-05 Mathieu Guillaume <mat@cythere.com>
532 * html.c (htmlfindurl): Download <input src=...>
534 1998-06-03 Hrvoje Niksic <hniksic@srce.hr>
536 * utils.c (file_exists_p): Use access() with two arguments.
538 1998-05-27 Martin Kraemer <Martin.Kraemer@mch.sni.de>
540 * netrc.c (parse_netrc): Correct logic.
542 1998-05-27 Hrvoje Niksic <hniksic@srce.hr>
544 * ftp.c (getftp): Added `break'; suggested by Lin Zhe Min
545 <ljm@ljm.wownet.net>.
547 1998-05-24 Hrvoje Niksic <hniksic@srce.hr>
549 * version.c: Wget 1.5.2-b2 is released.
551 1998-05-18 Juan Jose Rodriguez <jcnsoft@jal1.telmex.net.mx>
553 * mswindows.h: Don't translate mkdir to _mkdir under Borland.
555 1998-05-17 Hrvoje Niksic <hniksic@srce.hr>
557 * retr.c (elapsed_time): Return correct value when
558 HAVE_GETTIMEOFDAY is undefined.
560 1998-05-13 Hrvoje Niksic <hniksic@srce.hr>
562 * version.c: Wget 1.5.2-b1 is released.
564 1998-05-08 Hrvoje Niksic <hniksic@srce.hr>
566 * getopt.c (_getopt_internal): Use exec_name instead of argv[0].
567 (_getopt_internal): Don't translate `#if 0'-ed strings.
569 1998-05-06 Douglas E. Wegscheid <wegscd@whirlpool.com>
571 * mswindows.c (ws_handler): Use fork_to_background().
573 1998-05-05 Hrvoje Niksic <hniksic@srce.hr>
575 * version.c: Wget 1.5.1 is released.
577 1998-05-05 Hrvoje Niksic <hniksic@srce.hr>
579 * http.c (parse_http_status_line): Avoid `minor' and `major'
582 1998-05-02 Hrvoje Niksic <hniksic@srce.hr>
584 * utils.c (mkdirhier): Renamed to make_directory.
586 1998-05-01 Hrvoje Niksic <hniksic@srce.hr>
588 * mswindows.c (fork_to_background): Define under Windows.
590 * utils.c (fork_to_background): New function.
592 * html.c (htmlfindurl): Removed rerdundant casts.
594 1998-05-01 Douglas E. Wegscheid <wegscd@whirlpool.com>
596 * mswindows.c (ws_mypath): Cache the path.
598 1998-04-30 Douglas E. Wegscheid <wegscd@whirlpool.com>
600 * ftp.h: Prefix enum ftype members with FT_.
602 * ftp-ls.c, ftp.c, html.h: Adjust accordingly.
604 * mswindows.h: Use stat under Borland, _stat under MSVC.
606 1998-04-28 Hrvoje Niksic <hniksic@srce.hr>
608 * http.c (known_authentication_scheme_p): New function.
609 (gethttp): Handle authorization more correctly.
611 * ftp-basic.h: Removed.
615 * utils.c: Include <unistd.h> before <pwd.h>; needed under SunOS
617 (numdigit): Use `while' loop.
619 * http.c (create_authorization_line): Detect authentication
620 schemes case-insensitively.
622 * http.c (extract_header_attr): Use strdupdelim().
623 (digest_authentication_encode): Move declaration of local
624 variables to smaller scope.
625 (digest_authentication_encode): Reset REALM, OPAQUE and NONCE.
626 (create_authorization_line): Detect authentication schemes
629 * utils.c (touch): Constify.
631 * http.c (gethttp): Report a nicer error when no data is received.
633 * rbuf.h (RBUF_READCHAR): Ditto.
635 * ftp-basic.c (ftp_response): Use sizeof.
637 1998-04-27 Hrvoje Niksic <hniksic@srce.hr>
639 * retr.c (print_percentage): EXPECTED is long, not int.
640 (print_percentage): Use floating-point arithmetic to avoid
641 overflow with large files' sizes multiplied with 100.
643 1998-04-27 Gregor Hoffleit <flight@mathi.uni-heidelberg.de>
645 * config.h.in: Added pid_t stub.
647 * sysdep.h (S_ISREG): Moved here from mswindows.h (NeXT doesn't
650 1998-04-20 Hrvoje Niksic <hniksic@srce.hr>
652 * version.c: Wget 1.5.0 is released.
654 1998-04-18 Hrvoje Niksic <hniksic@srce.hr>
656 * url.c (str_url): Ditto.
658 * ftp-basic.c (ftp_rest): Use new name.
660 * utils.c (long_to_string): Renamed from prnum().
662 1998-04-16 Hrvoje Niksic <hniksic@srce.hr>
664 * version.c: Wget 1.5-b17 is released.
666 1998-04-08 Hrvoje Niksic <hniksic@srce.hr>
668 * headers.c (header_get): New argument FLAGS.
670 * http.c (gethttp): If request is malformed, bail out of the
672 (gethttp): Check for empty header *after* the status line checks.
673 (gethttp): Disallow continuations for status line.
675 1998-04-08 Hrvoje Niksic <hniksic@srce.hr>
677 * version.c: Wget 1.5-b16 is released.
679 1998-04-08 Hrvoje Niksic <hniksic@srce.hr>
681 * init.c (commands): Renamed `always_rest' to `continue'.
683 1998-04-05 Hrvoje Niksic <hniksic@srce.hr>
687 * log.c (logputs): New argument.
691 1998-04-04 Hrvoje Niksic <hniksic@srce.hr>
693 * http.c (http_atotm): Update comment.
695 * main.c (i18n_initialize): Set LC_MESSAGES, not LC_ALL.
697 * wget.h: Renamed ENABLED_NLS to HAVE_NLS.
699 * main.c (i18n_initialize): New function.
702 * log.c: Include <unistd.h>.
704 * retr.c (show_progress): Cast alloca to char *.
706 1998-04-04 Hrvoje Niksic <hniksic@srce.hr>
708 * version.c: Wget 1.5-b15 is released.
710 1998-04-04 Hrvoje Niksic <hniksic@srce.hr>
712 * utils.h: Declare file_non_directory_p().
714 1998-04-03 Hrvoje Niksic <hniksic@srce.hr>
716 * main.c (main): It's `tries', not `numtries' now.
718 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
720 * init.c (getperms): Removed.
722 1998-04-01 Tim Charron <tcharron@interlog.com>
724 * log.c (logvprintf): Don't use ARGS twice.
726 1998-04-01 John <john@futuresguide.com>
728 * mswindows.c: Cleaned up.
730 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
732 * version.c: Wget 1.5-b14 is released.
734 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
736 * ftp-opie.c (STRLEN4): New macro.
739 1998-04-01 Junio Hamano <junio@twinsun.com>
741 * http.c: Document all the Digest functions.
743 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
745 * utils.c (file_non_directory_p): Renamed from isfile().
747 * mswindows.h (S_ISREG): New macro, suggested by Tim Adam.
749 1998-03-31 Hrvoje Niksic <hniksic@srce.hr>
751 * utils.c (mkdirhier): Use 0777 instead of opt.dirmode.
753 * init.c (cmd_spec_dotstyle): Use 48 dots per line for binary
755 (cmd_permissions): Removed.
757 * config.h.in: Add template for WORDS_BIGENDIAN.
759 1998-03-31 Junio Hamano <junio@twinsun.com>
761 * http.c (HEXD2asc): New macro.
764 1998-03-31 Hrvoje Niksic <hniksic@srce.hr>
766 * version.c: Wget 1.5-b13 is released.
768 1998-03-31 Hrvoje Niksic <hniksic@srce.hr>
770 * main.c (main): Don't try to use `com'.
772 1998-03-30 Hrvoje Niksic <hniksic@srce.hr>
774 * init.c (cmd_permissions): New function.
776 1998-03-30 Hrvoje Niksic <hniksic@srce.hr>
778 * version.c: Wget 1.5-b12 is released.
780 1998-03-30 Hrvoje Niksic <hniksic@srce.hr>
782 * init.c (commands): Renamed `numtries' to `tries'.
783 (cmd_spec_debug): Removed.
784 (home_dir): Under Windows, return `C:\' if HOME is undefined.
786 1998-03-29 Hrvoje Niksic <hniksic@srce.hr>
788 * config.h.in: Define _XOPEN_SOURCE.
790 * init.c (check_user_specified_header): New function.
791 (cmd_spec_header): Use it.
792 (cmd_spec_useragent): New function.
794 1998-03-29 Hrvoje Niksic <hniksic@srce.hr>
796 * version.c: Wget 1.5-b11 is released.
798 1998-03-28 Hrvoje Niksic <hniksic@srce.hr>
800 * wget.h: Include <libintl.h> only if NLS is enabled.
802 1998-03-26 Hrvoje Niksic <hniksic@srce.hr>
804 * options.h (struct options): Made `wait' a long.
805 (struct options): Ditto for `timeout'.
807 1998-03-19 Hrvoje Niksic <hniksic@srce.hr>
809 * utils.c (exists): Renamed to file_exists_p.
810 (file_exists_p): Use access() if available.
812 1998-03-17 Hrvoje Niksic <hniksic@srce.hr>
814 * utils.c (memfatal): Set save_log_p to 0 to avoid potential
817 * log.c: do_logging -> save_log_p.
819 * config.h.in: Added template for HAVE_VSNPRINTF.
821 1998-03-16 Hrvoje Niksic <hniksic@srce.hr>
825 * http.c: Protect declaration against non-ANSI compiler.
827 * log.c (logvprintf): Use vsnprintf() if available.
829 * getopt.c (main): Don't translate test stuff.
831 1998-03-16 Hrvoje Niksic <hniksic@srce.hr>
833 * version.c: Wget 1.5-b10 is released.
835 1998-03-11 Hrvoje Niksic <hniksic@srce.hr>
837 * ftp.c (getftp): Don't translate "CWD %s".
839 * wget.h (GCC_FORMAT_ATTR): Renamed from FORMAT_ATTR.
841 1998-03-07 Hrvoje Niksic <hniksic@srce.hr>
843 * ftp-opie.c (btoe): Use memcpy() instead of strncat().
845 * log.c (logputs): New function.
846 (logvprintf): Renamed from vlogmsg; use logputs().
848 * retr.c (show_progress): Print `[100%]' when the retrieval is
851 * init.c (run_wgetrc): Use FILE, not PATH.
852 (wgetrc_file_name): Ditto.
854 1998-03-07 Tim Adam <tma@osa.com.au>
856 * recur.c (parse_robots): Correctly reset `entries' on empty
859 1998-03-07 Hrvoje Niksic <hniksic@srce.hr>
861 * init.c (cmd_spec_debug): Use cmd_boolean().
863 1998-02-23 Hrvoje Niksic <hniksic@srce.hr>
865 * http.c (gethttp): Create proxy-authorization correctly.
867 1998-02-22 Hrvoje Niksic <hniksic@srce.hr>
871 * getopt.c: Use ANSI function definitions.
873 * ftp-opie.c: New file.
875 * options.h: Don't redefine EXTERN.
877 * init.c: Sort it correctly.
879 1998-02-22 Hrvoje Niksic <hniksic@srce.hr>
881 * version.c: Wget 1.5-b9 is released.
883 1998-02-22 Hrvoje Niksic <hniksic@srce.hr>
885 * recur.c (recursive_retrieve): Reset `first_time'.
887 * ftp.c (getftp): Added `default' clause to switches of uerr_t.
889 * rbuf.c (rbuf_peek): Simplified.
890 (rbuf_flush): Use MINVAL.
892 * wget.h (MINVAL): Moved from url.h.
894 * rbuf.h (RBUF_FD): New macro.
896 * url.c (add_url): Add to the head of the list.
898 * ftp.c (ftp_retrieve_list): Set the permissions to downloaded
900 (getftp): Set the default permissions to 0600.
902 1998-02-21 Hrvoje Niksic <hniksic@srce.hr>
904 * url.c (get_urls_html): Ditto.
905 (convert_links): Ditto.
907 * recur.c (parse_robots): Ditto.
909 * html.c (ftp_index): Ditto.
911 * ftp-ls.c (ftp_parse_unix_ls): Open file as binary.
913 * init.c (defaults): Initialize `opt' to zero via memset.
915 * http.c (digest_authentication_encode): goto considered harmful.
917 1998-02-19 Hrvoje Niksic <hniksic@srce.hr>
919 * ftp.c (delelement): Simplify and fix leak.
921 1998-02-18 Hrvoje Niksic <hniksic@srce.hr>
923 * http.c (dump_hash): Use HEXD2ASC instead of home-grown stuff.
925 * url.h (HEXD2ASC): Removed warning.
927 * init.c (comind): Use binary search.
928 (commands): Reorganized.
930 (cmd_boolean): New function.
932 (cmd_number_inf): Ditto.
935 (cmd_directory_vector): Ditto.
938 (cmd_spec_debug): Ditto.
939 (cmd_spec_dirmode): Ditto.
940 (cmd_spec_dirstruct): Ditto.
941 (cmd_spec_dotstyle): Ditto.
942 (cmd_spec_header): Ditto.
943 (cmd_spec_htmlify): Ditto.
944 (cmd_spec_mirror): Ditto.
945 (cmd_spec_outputdocument): Ditto.
946 (cmd_spec_recursive): Ditto.
947 (settime): Merged with cmd_time().
948 (setbytes): Merged with cmd_bytes().
949 (setonoff): Merged with cmd_boolean().
952 1998-02-17 Hrvoje Niksic <hniksic@srce.hr>
954 * Makefile.in (distclean): Remove `config.h'.
956 1998-02-17 Hrvoje Niksic <hniksic@srce.hr>
958 * version.c: Wget 1.5-b8 is released.
960 1998-02-17 Hrvoje Niksic <hniksic@srce.hr>
962 * http.c (digest_authentication_encode): New function.
963 (create_authorization_line): Use it.
964 (dump_hash): New function.
965 (digest_authentication_encode): Use it.
967 * fnmatch.c: Renamed from `mtch.c'.
969 1998-02-15 Karl Eichwalder <ke@suse.de>
971 * main.c (main): Tag "Written by..." string as translatable.
973 1998-02-15 Hrvoje Niksic <hniksic@srce.hr>
975 * wget.h (FREE_MAYBE): New macro.
977 * http.c (create_authorization_line): Don't use ANSI C string
978 concatenation feature.
979 (basic_authentication_encode): Use alloca() for temporary
988 * ftp-basic.c: Protect declaration against non-ANSI compiler.
990 * http.c (create_authorization_line): Cast `unsigned char *' to
991 `char *' for sprintf, to shut up the noisy Digital Unix cc.
993 1998-02-15 Hrvoje Niksic <hniksic@srce.hr>
995 * version.c: Wget 1.5-b7 is released.
997 1998-02-15 Hrvoje Niksic <hniksic@srce.hr>
999 * cmpt.c (strstr): Synched with glibc-2.0.6.
1001 * ftp-basic.c (calculate_skey_response): Ditto.
1002 (calculate_skey_response): Use alloca().
1004 * http.c (create_authorization_line): Work with FSF's version of
1007 * md5.c: New file, from GNU libc.
1009 1998-02-14 Hrvoje Niksic <hniksic@srce.hr>
1011 * url.h (URL_CLEANSE): Name the temporary variable more carefully.
1013 1998-02-13 Hrvoje Niksic <hniksic@srce.hr>
1015 * http.c (basic_authentication_encode): New function, instead of
1018 1998-02-13 Junio Hamano <junio@twinsun.com>
1020 * http.c: Add HTTP-DA support.
1021 * ftp-basic.c: Add Opie/S-key support.
1022 * config.h.in, Makefile.in: Add HTTP-DA and Opie/S-key support.
1023 * md5.c, md5.h: New files.
1025 1998-02-13 Hrvoje Niksic <hniksic@srce.hr>
1027 * http.c (http_process_range): Renamed from hprocrange().
1028 (http_process_range): Parse the whole header.
1030 * headers.c: New file.
1031 (header_process): New function.
1032 (header_get): Renamed from fetch_next_header.
1034 * all: Include utils.h only where necessary.
1036 * wget.h: Declare xmalloc(), xrealloc() and xstrdup() here.
1038 * wget.h: Add provisions for dmalloc.
1040 1998-02-12 Hrvoje Niksic <hniksic@srce.hr>
1042 * version.c: Wget 1.5-b6 is released.
1044 1998-02-12 Hrvoje Niksic <hniksic@srce.hr>
1046 * ftp.c (ftp_loop): Determine `filename' more precisely.
1048 * init.c (setval): Don't set `opt.quiet' if output-document is
1051 * log.c (log_init): Print to STDERR instead of STDOUT.
1052 (vlogmsg): Use STDERR by default.
1055 1998-02-11 Simon Josefsson <jas@pdc.kth.se>
1057 * host.c: Use addr_in again.
1059 1998-02-08 Karl Eichwalder <karl@suse.de>
1061 * http.c (gethttp): Fixed typo.
1063 1998-02-08 Hrvoje Niksic <hniksic@srce.hr>
1065 * version.c: Wget 1.5-b5 is released.
1067 1998-02-08 Hrvoje Niksic <hniksic@srce.hr>
1069 * retr.c (show_progress): Use it.
1071 * log.c (logflush): New function.
1073 * wget.h: Utilize __attribute__ if on gcc.
1075 1998-02-07 Hrvoje Niksic <hniksic@srce.hr>
1077 * http.c (base64_encode_line): New argument LENGTH.
1078 (BASIC_AUTHENTICATION_ENCODE): Use it.
1079 (BASIC_AUTHENTICATION_ENCODE): Take length of HEADER into account.
1081 * main.c (main): Fixed fprintf() format mismatch.
1083 1998-02-06 Hrvoje Niksic <hniksic@srce.hr>
1085 * version.c: Wget 1.5-b4 is released.
1087 1998-02-03 Simon Josefsson <jas@pdc.kth.se>
1089 * host.c: use sockaddr_in instead of addr_in.
1091 1998-02-04 Hrvoje Niksic <hniksic@srce.hr>
1093 * init.c (cleanup): Use it.
1095 * recur.c (recursive_cleanup): New function.
1097 * retr.c (retrieve_from_file): Ditto.
1099 * main.c (main): Use it.
1101 * recur.c (recursive_reset): New function.
1103 * retr.c (retrieve_from_file): Ditto.
1105 * main.c (main): Simplify call to recursive_retrieve().
1107 * recur.c (recursive_retrieve): Removed FLAGS argument.
1109 * http.c (gethttp): Changed call to iwrite().
1111 1998-02-03 Hrvoje Niksic <hniksic@srce.hr>
1113 * url.c (get_urls_html): Ditto.
1114 (free_urlpos): Ditto.
1118 * retr.c (retrieve_url): Move declaration of local variables to
1121 * url.c (urlproto): Use it.
1124 (get_urls_html): Ditto.
1126 * utils.h (ARRAY_SIZE): New macro.
1128 * url.c (proto): Moved from url.h.
1130 * url.h (URL_CLEANSE): Reformatted.
1131 (USE_PROXY_P): Renamed from USE_PROXY.
1133 * ftp-basic.c: Adjust to the new interface of iwrite().
1135 * ftp-basic.c (ftp_port): Use alloca().
1137 1998-02-03 Hrvoje Niksic <hniksic@srce.hr>
1139 * version.c: Wget 1.5-b3 is released.
1141 * host.c (ftp_getaddress): Don't print to stderr directly.
1143 * init.c (setbytes): Support `g' for gigabytes.
1144 (cmdtype): New specification CTIME.
1145 (setval): Use it with settime().
1146 (commands): Use it for WAIT and TIMEOUT.
1148 1998-02-02 Hrvoje Niksic <hniksic@srce.hr>
1150 * http.c (BASIC_AUTHENTICATION_ENCODE): New macro.
1153 * utils.c (unique_name_1): Moved from url.c.
1154 (unique_name): Ditto.
1156 * url.c (url_filename): Ditto.
1158 * log.c (redirect_output): Changed call to unique_name().
1160 * url.c (unique_name_1): Renamed from unique_name().
1161 (unique_name): Changed interface.
1163 * init.c (enum cmdid): Moved from init.h.
1165 (struct cmd): Ditto.
1167 * main.c (main): Use it.
1168 (main): Moved `--backups' to not have a short option.
1170 * options.h (struct options): New member BACKGROUND.
1172 * main.c (print_help): Rearranged.
1173 (main): New long options for -n* short options: --no-directories,
1174 --no-host-directories, --non-verbose, --no-host-lookup and
1175 --dont-remove-listing.
1177 1998-02-01 Hrvoje Niksic <hniksic@srce.hr>
1179 * main.c (main): Use log_close().
1181 * log.c: New variable LOGFP.
1183 (redirect_output): Don't open /dev/null; set LOGFP to stdin
1185 (log_close): New function.
1187 * options.h (struct options): Removed LFILE.
1189 * log.c (log_enable): Removed.
1191 * main.c (main): Use it.
1193 * log.c (log_init): New function.
1195 * url.c (get_urls_html): Removed needless assignment to BASE.
1197 * host.c (add_hlist): Don't set CMP needlessly.
1199 * utils.c (match_backwards): Ditto.
1200 (in_acclist): Ditto.
1202 * url.c (findurl): Ditto.
1204 * netrc.c (parse_netrc): Ditto.
1206 * log.c (log_dump): Ditto.
1208 * html.c (html_quote_string): Ditto.
1210 * ftp-basic.c (ftp_request): Made static.
1212 * connect.c: Made global variables static.
1214 * url.c (construct): Ditto.
1216 * init.c (init_path): Avoid assignment inside `if'-condition.
1218 * ftp.c: Don't include in.h or winsock.h.
1220 * ftp.c (ftp_loop): Use SZ.
1222 * connect.c (bindport): Cast &addrlen to int *.
1225 * init.c (initialize): Don't use SYSTEM_WGETRC unconditionally.
1227 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
1229 * ftp.c (getftp): Initialize opt.ftp_pass here.
1230 (ftp_retrieve_dirs): Use alloca().
1232 * init.c (defaults): Don't initialize opt.ftp_pass.
1234 * sysdep.h (S_ISLNK): Declare for OS/2; ditto for lstat.
1235 From Ivan F. Martinez <ivanfm@ecodigit.com.br>.
1237 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
1239 * recur.c (parse_robots): Check for comments more correctly.
1241 * host.c (ftp_getaddress): Use STRDUP_ALLOCA.
1242 (ftp_getaddress): Add diagnostics when reverse-lookup yields only
1245 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
1247 * version.c: Wget 1.5-b2 is released.
1249 * netrc.c (NETRC_FILE_NAME): Moved from netrc.h.
1251 * utils.c (proclist): Pass FNM_PATHNAME to fnmatch().
1253 * ftp-basic.c (ftp_pasv): Avoid unnecessary casting to unsigned
1256 * log.c: Don't attempt to hide arguments from ansi2knr.
1258 * cmpt.c: Synched strptime() and mktime() with glibc-2.0.6.
1260 * ansi2knr.c: Use a later version, from fileutils-3.16l alpha.
1262 * ftp.c (getftp): Ditto.
1264 * http.c (gethttp): Use it.
1266 * retr.c (get_contents): New argument EXPECTED; pass it to
1268 (show_progress): New argument EXPECTED; use it to display
1271 * init.c (setval): Ditto.
1273 * http.c (gethttp): Ditto.
1276 * ftp.c (getftp): Ditto.
1277 (ftp_loop_internal): Ditto.
1279 * ftp-ls.c (ftp_parse_unix_ls): Use abort() instead of assert(0).
1281 * sysdep.h (CLOSE): Simplify; use DEBUGP.
1283 * netrc.c (search_netrc): Use alloca().
1285 * init.c (defaults): Initialize no_flush.
1287 * log.c (vlogmsg): Don't flush if no_flush.
1289 * options.h (struct options): New variable no_flush.
1291 * main.c (main): Don't play games with buffering.
1293 * log.c (vlogmsg): Flush the output after every message.
1295 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
1297 * init.c (parse_line): Ditto.
1299 * url.c (get_urls_html): Ditto.
1301 * main.c (main): Don't cast to unsigned char.
1303 * init.c (run_wgetrc): Don't cast to unsigned char.
1304 (parse_line): Accept char instead of unsigned char.
1306 * html.c (htmlfindurl): Use char instead of unsigned char.
1310 * sysdep.h: Add wrappers to ctype macros to make them
1313 1998-01-30 Hrvoje Niksic <hniksic@srce.hr>
1315 * html.c (htmlfindurl): Download <img lowsrc=...>
1317 * main.c (main): Ignore SIGPIPE.
1319 * connect.c (select_fd): New argument WRITEP.
1320 (iwrite): Call select_fd().
1322 1997-02-27 Fila Kolodny <fila@ibi.com>
1324 * ftp.c (ftp_retrieve_list): If retrieving symlink and the proper
1325 one already exists, just skip it.
1327 1998-01-30 Hrvoje Niksic <hniksic@srce.hr>
1329 * http.c (gethttp): Cosmetic changes.
1331 * http.c (check_end): Allow `+D...' instead of `GMT'.
1332 From Fabrizio Pollastri <pollastri@cstv.to.cnr.it>.
1334 * url.c (process_ftp_type): New function.
1337 * connect.c (iwrite): Allow writing in a few chunks.
1338 (bindport): Made SRV static, so addr can point to it.
1339 (select_fd): Removed HPUX kludge.
1341 * host.c (free_hlist): Incorporated into clean_hosts().
1343 1998-01-29 Hrvoje Niksic <hniksic@srce.hr>
1345 * host.c (hlist): Made static.
1346 (search_address): Cosmetic change.
1348 1998-01-29 Hrvoje Niksic <hniksic@srce.hr>
1350 * version.c: Wget v1.5-b1 is released.
1352 * http.c (hgetlen): Use sizeof() to get the header length.
1355 (hgetlocation): Ditto.
1356 (hgetmodified): Ditto.
1357 (haccepts_none): Ditto.
1359 * main.c (main): Updated `--version' and `--help' output, as per
1360 Francois Pinard's suggestions.
1362 * main.c: Include locale.h; call setlocale(), bindtextdomain() and
1365 * config.h.in: Define stubs for I18N3.
1367 * wget.h: Include libintl.h.
1369 1998-01-28 Hrvoje Niksic <hniksic@srce.hr>
1371 * url.c (mkstruct): Check for opt.cut_dirs.
1372 (mkstruct): alloca()-te more, xmalloc() less.
1374 * utils.c (load_file): Check for ferror().
1376 * url.c (get_urls_file): Close only the files we opened.
1377 (get_urls_html): Ditto.
1378 (count_slashes): New function.
1382 * http.c (gethttp): Respect username and password provided by
1384 (base64_encode_line): Write into an existing buffer instead of
1385 malloc-ing a new one.
1386 (struct http_stat): Moved from http.h
1388 * retr.c (retrieve_url): Free SUF.
1390 * all: Removed lots of unnecessary .h dependencies.
1392 * html.c (global_state): Made static.
1394 * utils.h (ALLOCA_ARRAY): New macro.
1396 * main.c (main): New option `--cut-dirs'.
1398 * url.c (construct): Use alloca() for T.
1400 * utils.c (mkdirhier): Use STRDUP_ALLOCA.
1402 * host.c (_host_t): Moved from host.h.
1403 (struct host): Renamed from _host_t.
1404 (store_hostaddress): Use STRDUP_ALLOCA for INET_S.
1407 * host.h: Don't include url.h.
1409 * ftp.c (LIST_FILENAME): Moved from ftp.h.
1411 * init.c (DEFAULT_FTP_ACCT): Moved from ftp.h.
1413 * main.c (main): Enable log if the output goes to a TTY.
1415 * connect.h: Removed unused constant `BACKLOG'.
1417 * config.h.in: Check for isatty().
1419 * Makefile.in (LINK): Use CFLAGS when linking.
1421 1998-01-27 Hrvoje Niksic <hniksic@srce.hr>
1423 * mswindows.c (ws_hangup): Use redirect_output().
1425 * main.c (redirect_output_signal): New function; use
1428 * log.c (redirect_output): New function, based on hangup(), which
1431 * log.c (vlogmsg): New function.
1433 * wget.h (DEBUGP): Use debug_logmsg().
1435 * main.c (hangup): Use it.
1437 * log.c (log_dump): New function.
1439 * utils.h (DO_REALLOC): Use `long' for various sizes.
1441 * http.c (hskip_lws): Use `while', for clarity.
1442 (HTTP_DYNAMIC_LINE_BUFFER): New constant.
1443 (fetch_next_header): Use it instead of DYNAMIC_LINE_BUFFER.
1445 * ftp-basic.c (FTP_DYNAMIC_LINE_BUFFER): New constant.
1446 (ftp_response): Use it instead of DYNAMIC_LINE_BUFFER.
1448 * utils.c (DYNAMIC_LINE_BUFFER): Moved from utils.c.
1449 (LEGIBLE_SEPARATOR): Ditto.
1450 (FILE_BUFFER_SIZE): Ditto.
1452 * retr.c (BUFFER_SIZE): Moved from retr.h.
1455 (logmsg): Moved from utils.c.
1456 (debug_logmsg): New function.
1458 * mswindows.h: Include it here.
1462 * utils.c: Don't include <windows.h>.
1464 1998-01-25 Hrvoje Niksic <hniksic@srce.hr>
1466 * host.c (ftp_getaddress): Ditto.
1468 * main.c (main): Use it.
1470 * utils.h (STRDUP_ALLOCA): New macro.
1472 * init.c: Prepend `wget: ' to error messages printed on stderr.
1474 * utils.c (mkdirhier): Renamed from mymkdir.
1475 (touch): Renamed from my_touch.
1476 (pwd_cuserid): Renamed from my_cuserid().
1478 1998-01-24 Andy Eskilsson <andy.eskilsson@telelogic.se>
1480 * utils.c (accdir): Process wildcards.
1481 (proclist): New function.
1482 (accdir): Use it to avoid code repetition.
1484 1998-01-24 Hrvoje Niksic <hniksic@srce.hr>
1486 * recur.c (parse_robots): Respect opt.useragent; use alloca().
1488 * http.c (gethttp): Construct useragent accordingly.
1490 * version.c: Changed version string to numbers-only.
1492 * main.c (print_help): List all the options.
1494 * mswindows.c (windows_main_junk): Initialize argv0 here.
1496 1998-01-24 Karl Heuer <kwzh@gnu.org>
1498 * netrc.c (search_netrc): Initialize `l' only after processing
1501 * main.c (main): Don't trap SIGHUP if it's being ignored.
1503 1998-01-24 Hrvoje Niksic <hniksic@srce.hr>
1505 * all: Use logmsg().
1507 * utils.c (time_str): Moved from retr.c.
1508 (logmsg): New function.
1509 (logmsg_noflush): Ditto.
1511 * rbuf.c: New file, moved buf_* functions here.
1513 * ftp.c (ftp_expected_bytes): Moved from ftp-basic.c.
1515 * ftp-basic.c (ftp_rest): Use prnum().
1517 * ftp-basic.c: Ditto.
1519 * ftp.c: Use the new reading functions and macros.
1521 * retr.c (buf_initialize): New function.
1522 (buf_initialized_p): Ditto.
1523 (buf_uninitialize): Ditto.
1526 * http.c (fetch_next_header): Use the BUF_READCHAR macro for
1528 (gethttp): Use alloca() where appropriate.
1530 * retr.c (buf_readchar): Use it.
1531 (buf_peek): Use rstreams.
1533 * retr.h (BUF_READCHAR): New macro.
1535 * init.c (home_dir): Rewritten for clarity.
1538 * mswindows.c (ws_backgnd): Made static.
1539 (read_registry): Ditto.
1540 (ws_cleanup): Ditto.
1541 (ws_handler): Ditto.
1543 1998-01-23 Hrvoje Niksic <hniksic@srce.hr>
1545 * alloca.c: New file.
1547 * Makefile.in (ALLOCA): Define.
1549 * mswindows.c (ws_help): Constify.
1550 (ws_help): Use alloca.
1552 * mswindows.c: Reformat.
1554 * all: Added _(...) annotations for I18N snarfing and translation.
1556 * host.c (ftp_getaddress): Nuke SYSINFO.
1557 (ftp_getaddress): Don't use getdomainname().
1558 (ftp_getaddress): Use uname(), where available.
1560 * http.c (gethttp): Protect a stray fprintf().
1562 * init.c (settime): New function.
1563 (setval): Treat WAIT specially, allowing suffixes like `m' for
1566 1998-01-21 Hrvoje Niksic <hniksic@srce.hr>
1568 * url.c (get_urls_html): Use alloca() for TEMP.
1570 1998-01-21 Jordan Mendelson <jordy@wserv.com>
1572 * url.c (rotate_backups): New function.
1574 * http.c (gethttp): Ditto.
1576 * ftp.c (getftp): Rotate backups.
1578 1997-12-18 Hrvoje Niksic <hniksic@srce.hr>
1580 * all: Renamed nmalloc(), nrealloc() and nstrdup() to xmalloc(),
1581 xrealloc() and xstrdup(). Use the new functions.
1583 * url.c (decode_string): Made static.
1586 (parse_uname): Ditto.
1589 (construct_relative): Ditto.
1591 * retr.c (show_progress): Made static.
1593 * recur.c (robots_url): Made static.
1594 (retrieve_robots): Ditto.
1595 (parse_robots): Ditto.
1596 (robots_match): Ditto.
1600 * main.c (printhelp): Made static.
1603 * init.c (comind): Made static.
1606 (run_wgetrc): Ditto.
1614 * http.c (fetch_next_header): Made static.
1615 (hparsestatline): Ditto.
1620 (hgetlocation): Ditto.
1621 (hgetmodified): Ditto.
1622 (haccepts_none): Ditto.
1624 (base64_encode_line): Ditto.
1625 (mktime_from_utc): Ditto.
1626 (http_atotm): Ditto.
1628 * html.c (idmatch): Made static.
1630 * host.c (search_host): Made static.
1631 (search_address): Ditto.
1632 (free_hlist): Ditto.
1634 * ftp.c (getftp): Made static.
1635 (ftp_loop_internal): Ditto.
1636 (ftp_get_listing): Ditto.
1637 (ftp_retrieve_list): Ditto.
1638 (ftp_retrieve_dirs): Ditto.
1639 (ftp_retrieve_glob): Ditto.
1640 (freefileinfo): Ditto.
1641 (delelement): Ditto.
1643 * ftp-ls.c (symperms): Made static.
1644 (ftp_parse_unix_ls): Ditto.
1646 * connect.c (select_fd): Made static.
1648 * utils.c (xmalloc): Renamed from nmalloc.
1649 (xrealloc): Renamed from nrealloc.
1650 (xstrdup): Renamed from nstrdup.
1652 * getopt.c (exchange): Use alloca.
1654 * mswindows.c (mycuserid): Use strncpy.
1656 * New files mswindows.c, mswindows.h, sysdep.h. winjunk.c,
1657 systhings.h, windecl.h and winjunk.h removed.
1659 * mswindows.c (sleep): New function.
1661 * utils.c: Include <windows.h> under Windows.
1663 1997-06-12 Darko Budor <dbudor@zesoi.fer.hr>
1665 * url.h (URL_UNSAFE): Change default under Windows.
1667 * retr.c (retrieve_from_file): Respect opt.delete_after.
1669 * main.c (main): Call ws_help on Windows.
1671 * winjunk.c (windows_main_junk): New function.
1673 * main.c (main): Junk-process argv[0].
1675 * http.c (mktime_from_utc): Return -1 if mktime failed.
1677 * http.c (http_loop): Ditto.
1679 * ftp.c (ftp_loop_internal): Change title on Windows when using a
1682 * winjunk.c (getdomainname): Lots of functions.
1684 1997-06-12 Hrvoje Niksic <hniksic@srce.hr>
1686 * cmpt.c (strptime_internal): Handle years more correctly for
1689 1997-06-09 Mike Thomas <mthomas@reality.ctron.com>
1691 * http.c (gethttp): Allocate enough space for
1692 `Proxy-Authorization' header.
1694 1997-05-10 Hrvoje Niksic <hniksic@srce.hr>
1696 * version.c: Wget/1.4.5 is released.
1698 1997-05-10 Hrvoje Niksic <hniksic@srce.hr>
1700 * retr.c (get_contents): Check return value of fwrite more
1703 1997-03-30 Andreas Schwab <schwab@issan.informatik.uni-dortmund.de>
1705 * cmpt.c (strptime_internal) [case 'Y']: Always subtract 1900 from
1706 year, regardless of century.
1708 1997-03-30 Hrvoje Niksic <hniksic@srce.hr>
1710 * utils.c (isfile): Use `lstat' instead of `stat'.
1712 1997-03-29 Hrvoje Niksic <hniksic@srce.hr>
1714 * utils.c (numdigit): Use explicit test.
1716 1997-03-21 Hrvoje Niksic <hniksic@srce.hr>
1718 * http.c (http_loop): Always use `url_filename' to get u->local.
1720 1997-03-20 Hrvoje Niksic <hniksic@srce.hr>
1722 * url.c: Recognize https.
1724 1997-03-13 Hrvoje Niksic <hniksic@srce.hr>
1726 * recur.c (recursive_retrieve): Lowercase just the host name.
1728 1997-03-09 Hrvoje Niksic <hniksic@srce.hr>
1730 * url.c (get_urls_file): Use the correct test.
1731 (get_urls_html): Ditto.
1733 1997-03-07 Hrvoje Niksic <hniksic@srce.hr>
1735 * connect.c: Reverted addrlen to int.
1737 * init.c (parse_line): Check for -1 instead of NONE.
1739 * version.c: Changed version to 1.4.5.
1741 1997-02-17 Hrvoje Niksic <hniksic@srce.hr>
1743 * init.c: New option netrc.
1744 (initialize): Don't parse .netrc.
1746 * cmpt.c (recursive): Return rp.
1747 (strptime_internal): Match the long strings first, the abbreviated
1750 1997-02-16 Hrvoje Niksic <hniksic@srce.hr>
1752 * http.c (check_end): New function.
1753 (http_atotm): Use it.
1755 1997-02-13 gilles Cedoc <gilles@cedocar.fr>
1757 * http.c (gethttp): Use them.
1759 * init.c: New options proxy_user and proxy_passwd.
1761 1997-02-14 Hrvoje Niksic <hniksic@srce.hr>
1763 * ftp.c (ftp_retrieve_list): Create links even if not relative.
1765 1997-02-10 Hrvoje Niksic <hniksic@srce.hr>
1767 * recur.c (recursive_retrieve): Lowercase the host name, if the
1768 URL is not "optimized".
1770 * host.c (realhost): Return l->hostname, even if it matches with
1773 1997-02-10 Marin Purgar <pmc@asgard.hr>
1775 * connect.c: Make addrlen size_t instead of int.
1778 1997-02-09 Gregor Hoffleit <flight@mathi.uni-heidelberg.DE>
1780 * systhings.h: Define S_ISLNK on NeXT too.
1782 1997-02-09 Hrvoje Niksic <hniksic@srce.hr>
1784 * version.c: Released 1.4.3.
1786 * url.c: Futher update to list of protostrings.
1787 (skip_proto): Skip `//' correctly for FTP and HTTP.
1789 * url.c (get_urls_html): Handle bogus `http:' things a little
1792 * main.c (main): Removed `follow-ftp' from `f'.
1793 (main): Dumped the `prefix-files' and `file-prefix' options and
1794 features; old and bogus.
1795 (main): Exit on failed setval() in `-e'.
1797 * http.c (fetch_next_header): Use it to detect header continuation
1800 * retr.c (buf_peek): New function.
1802 1997-02-08 Hrvoje Niksic <hniksic@srce.hr>
1804 * wget.h: Include time.h and stuff.
1806 1997-02-08 Roger Beeman <beeman@cisco.com>
1808 * ftp.c: Include <time.h>
1810 1997-02-07 Hrvoje Niksic <hniksic@srce.hr>
1812 * url.c (findurl): Would read over buffer limits.
1814 1997-02-06 Hrvoje Niksic <hniksic@srce.hr>
1816 * ftp-ls.c (ftp_parse_unix_ls): Allow spaces in file names.
1818 1997-02-05 Hrvoje Niksic <hniksic@srce.hr>
1820 * http.c (http_atotm): Initialize tm.is_dst.
1822 1997-02-02 Hrvoje Niksic <hniksic@srce.hr>
1824 * http.c (gethttp): Don't print the number of retrieved headers.
1826 * main.c (main): New option `--no-clobber', alias for `-nc'.
1828 * url.c: Recognize `https://'.
1830 1997-02-01 Hrvoje Niksic <hniksic@srce.hr>
1832 * host.c (herrmsg): Don't use h_errno.
1834 1997-01-30 Hrvoje Niksic <hniksic@srce.hr>
1836 * host.c (accept_domain): Use it.
1838 * main.c (main): New option `--exclude-domains'.
1840 * retr.c (retrieve_url): Use it.
1841 (retrieve_url): Bail out when an URL is redirecting to itself.
1843 * url.c (url_equal): New function.
1845 1997-01-29 Hrvoje Niksic <hniksic@srce.hr>
1847 * connect.c: Include arpa/inet.h instead of arpa/nameser.h.
1849 * http.c (mk_utc_time): New function.
1850 (http_atotm): Use it; handle time zones correctly.
1852 1997-01-28 Hrvoje Niksic <hniksic@srce.hr>
1856 * ftp-basic.c: Use it instead of WRITE.
1858 * connect.c (iwrite): New function.
1860 1997-01-27 Hrvoje Niksic <hniksic@srce.hr>
1862 * cmpt.c (mktime): New function.
1864 * netrc.c: Include <sys/types.h>.
1866 * main.c (main): Wouldn't recognize --spider.
1868 * retr.c (rate): Use `B', `KB' and `MB'.
1869 (reset_timer,elapsed_time): Moved from utils.c.
1871 * ftp.c (ftp_retrieve_list): Ditto.
1873 * http.c (http_loop): Don't touch the file if opt.dfp.
1875 1997-01-24 Hrvoje Niksic <hniksic@srce.hr>
1879 * ftp.c (ftp_retrieve_glob): New argument semantics.
1880 (ftp_retrieve_dirs): Use it.
1883 * html.c (htmlfindurl): Recognize `'' as the quote char.
1885 1997-01-23 Hrvoje Niksic <hniksic@srce.hr>
1887 * ftp.c (ftp_loop_internal): Use it.
1889 * utils.c (remove_link): New function.
1891 1997-01-22 Hrvoje Niksic <hniksic@srce.hr>
1893 * retr.c (retrieve_url): Require STRICT redirection URL.
1895 * url.c (parseurl): New argument STRICT.
1897 * http.c (hparsestatline): Be a little-bit less strict about
1900 1997-01-21 Hrvoje Niksic <hniksic@srce.hr>
1902 * http.c (gethttp): Use it.
1904 * main.c (main): Don't use '<digit>' as options.
1906 * init.c: New option ignore_length.
1908 * http.c (gethttp): Ditto.
1909 (http_loop): Check for redirection without Location:.
1910 (gethttp): Don't print Length unless RETROKF.
1912 * ftp.c (getftp): Use it.
1914 * url.c (mkalldirs): New function.
1916 * utils.c (mymkdir): Don't check for existing non-directory.
1918 * url.c (mkstruct): Don't create the directory.
1920 1997-01-20 Hrvoje Niksic <hniksic@srce.hr>
1922 * init.c (setval): Removed NO_RECURSION checks.
1924 1997-01-19 Hrvoje Niksic <hniksic@srce.hr>
1926 * version.c: "Released" 1.4.3-pre2.
1928 * recur.c (recursive_retrieve): Bypass host checking only if URL
1929 is ftp AND parent URL is not ftp.
1931 * ftp-basic.c (ftp_request): Print out Turtle Power.
1933 * ftp.c (ftp_loop): Call ftp_retrieve_glob with 0 if there's no
1935 (ftp_retrieve_glob): Call ftp_loop_internal even on empty list, if
1938 * http.c (gethttp): Be a little bit smarter about status codes.
1940 * recur.c (recursive_retrieve): Always reset opt.recursive when
1943 1997-01-18 Hrvoje Niksic <hniksic@srce.hr>
1945 * retr.c (retrieve_url): New variable location_changed; use it for
1946 tests instead of mynewloc.
1947 (retrieve_url): Allow heuristic adding of html.
1949 * url.c (url_filename): Don't use the `%' in Windows file names.
1951 * http.c (http_loop): Always time-stamp the local file.
1953 * http.c (http_loop): Ditto.
1955 * ftp.c (ftp_retrieve_list): Use it.
1957 * utils.c (my_touch): New function.
1959 * ftp.c (ftp_retrieve_list): Use #ifdef HAVE_STRUCT_UTIMBUF
1960 instead of #ifndef NeXT.
1962 * utils.c (strptime): New version, by Ulrich Drepper.
1964 1997-01-17 Hrvoje Niksic <hniksic@srce.hr>
1966 * http.c (haccepts_none): Renamed from `haccepts_bytes'.
1967 (gethttp): If haccepts_none(), disable ACCEPTRANGES.
1968 (http_loop): Would remove ACCEPTRANGES.
1970 * ftp.c (getftp): Call ftp_list with NULL.
1972 1997-01-15 Hrvoje Niksic <hniksic@srce.hr>
1974 * html.c (ftp_index): Don't print minutes and seconds if we don't
1975 know them; beautify the output.
1977 * ftp.c (getftp): Don't close the socket on FTPNSFOD.
1979 1997-01-14 Hrvoje Niksic <hniksic@srce.hr>
1981 * utils.c (strptime): New function.
1982 (strptime): Don't use get_alt_number.
1983 (strptime): Don't use locale.
1984 (match_string): Made it a function.
1986 1997-01-12 Hrvoje Niksic <hniksic@srce.hr>
1988 * http.c (http_atotm): New function.
1989 (http_loop): Use it.
1991 * atotm.c: Removed from the distribution.
1993 * http.c (base64_encode_line): Rewrite.
1995 1997-01-09 Hrvoje Niksic <hniksic@srce.hr>
1997 * ftp.c (getftp): Use ftp_expected_bytes; print size.
1999 * ftp-basic.c (ftp_response): Use ftp_last_respline.
2000 (ftp_expected_bytes): New function.
2002 * ftp.c (getftp): Print the unauthoritative file length.
2004 * ftp-ls.c: Renamed from ftp-unix.c.
2005 (ftp_parse_ls): Moved from ftp.c.
2006 (ftp_parse_unix_ls): Recognize seconds in time spec.
2007 (ftp_parse_unix_ls): Recognize year-less dates of the previous
2010 1997-01-08 Hrvoje Niksic <hniksic@srce.hr>
2012 * ftp-basic.c: Don't declare errno if #defined.
2014 * host.c (ftp_getaddress): Check for sysinfo legally.
2016 1997-01-08 Darko Budor <dbudor@diana.zems.fer.hr>
2018 * connect.c (iread): Use READ.
2020 1996-12-23 Hrvoje Niksic <hniksic@srce.hr>
2022 * url.c: Recognize finger, rlogin, tn3270, mid and cid as valid
2025 1996-12-22 Hrvoje Niksic <hniksic@srce.hr>
2027 * host.c (ftp_getaddress): Allow `.' in hostname.
2029 1996-12-26 Darko Budor <dbudor@zems.fer.hr>
2031 * wget.h: READ and WRITE macros for use instead of read and write
2032 on sockets, grep READ *.c, grep WRITE *.c
2034 * wsstartup.c: new file - startup for winsock
2036 * wsstartup.h: new file
2038 * win32decl.h: new file - fixup for <errno.h> and winsock trouble
2040 * configure.bat: Configure utility for MSVC
2042 * src/Makefile.ms,config.h.ms: new files for use with MSVC 4.x
2044 1996-12-22 Hrvoje Niksic <hniksic@srce.hr>
2046 * version.c: Released 1.4.3-pre.
2048 * utils.c (prnum): Accept long.
2049 (legible): Use prnum().
2051 * connect.c (make_connection): Accept port as short.
2054 * http.c (gethttp): Use search_netrc.
2056 1996-12-21 Hrvoje Niksic <hniksic@srce.hr>
2058 * ftp.c (getftp): Use search_netrc.
2060 * netrc.c (free_netrc): New function.
2062 * init.c (home_dir): New function.
2064 * url.c (convert_links): Allow REL2ABS changes.
2066 1996-12-21 Gordon Matzigkeit <gord@gnu.ai.mit.edu>
2068 * netrc.c: New file.
2069 (parse_netrc, maybe_add_to_list): New functions.
2071 1996-12-17 Hrvoje Niksic <hniksic@srce.hr>
2073 * retr.c (retrieve_url): Reset opt.recursion before calling
2074 ftp_loop if it is reached through newloc.
2076 * init.c (run_wgetrc): Print the wgetrc path too, when reporting
2077 error; don't use "Syntax error", since we don't know if it is
2078 really a syntax error.
2080 1996-12-16 Hrvoje Niksic <hniksic@srce.hr>
2082 * utils.c (acceptable): Extract the filename part of the path.
2084 * recur.c (recursive_retrieve): Call acceptable() with the right
2085 argument; would bug out on wildcards.
2087 * init.c (parse_line): Likewise.
2089 * html.c (htmlfindurl): Cast to char * when calling stuff.
2091 1996-12-15 Hrvoje Niksic <hniksic@srce.hr>
2093 * ftp.c (getftp): Use ftp_pasv.
2095 * ftp-basic.c (ftp_request): Accept NULL value.
2096 (ftp_pasv): New function.
2098 * options.h (struct options): Add passive FTP option.
2100 1996-12-15 Hrvoje Niksic <hniksic@srce.hr>
2102 * url.c (parseurl): Debug output.
2104 * utils.c (path_simplify): New one, adapted from bash's
2105 canonicalize_pathname().
2107 1996-12-14 Hrvoje Niksic <hniksic@srce.hr>
2109 * ftp.c (getftp): Don't discard the buffer.
2111 * retr.c (get_contents): New parameter nobuf.
2113 1996-12-13 Shawn McHorse <riffraff@txdirect.net>
2115 * html.c (htmlfindurl): Recognize <meta contents="d; URL=...".
2117 * init.c (setval): Strip the trailing slashes on CVECDIR.
2119 1996-12-13 Hrvoje Niksic <hniksic@srce.hr>
2121 * init.c: Make excludes and includes under CVECDIR instead of
2124 1996-12-13 Shawn McHorse <riffraff@txdirect.net>
2126 * url.c (get_urls_html): Skip "http:".
2128 1996-12-13 Hrvoje Niksic <hniksic@srce.hr>
2130 * utils.c (strcasecmp): From glibc.
2131 (strncasecmp): Also.
2134 * url.c: Added javascript: to the list of URLs prefixes.
2136 1996-12-12 Shawn McHorse <riffraff@txdirect.net>
2138 * recur.c (retrieve_robots): Print the warning message only if
2141 1996-12-12 Gregor Hoffleit <flight@mathi.uni-heidelberg.DE>
2143 * ftp.c (ftp_retrieve_list): Use NeXT old utime interface.
2145 1996-12-12 Hrvoje Niksic <hniksic@srce.hr>
2147 * systhings.h: New file.
2149 * ../configure.in: Check for utime.h
2151 * ftp.c: Check whether we have unistd.h.
2153 1996-11-27 Hrvoje Niksic <hniksic@srce.hr>
2155 * recur.c (recursive_retrieve): Send the canonical URL as referer.
2156 (recursive_retrieve): Call get_urls_html with the canonical URL.
2158 1996-12-13 Kaveh R. Ghazi <ghazi@caip.rutgers.edu>
2160 * (configure.in, config.h.in, src/Makefile.in, src/*.[ch]): Add
2161 ansi2knr support for compilers which don't support ANSI style
2162 function prototypes and signatures.
2164 * (aclocal.m4, src/ansi2knr.c, src/ansi2knr.1): New files.
2166 1996-11-26 Hrvoje Niksic <hniksic@srce.hr>
2168 * url.c: Use it; Recognize paths ending with "." and ".." as
2170 (url_filename): Append .n whenever file exists and could be a
2173 * url.h (ISDDOT): New macro.
2175 * init.c (parse_line): Use unsigned char.
2177 * url.c (get_urls_html): Cast to unsigned char * when calling
2180 * html.c (htmlfindurl): Use unsigned char.
2182 * version.c: Changed version to 1.4.3.
2184 1996-11-25 Hrvoje Niksic <hniksic@srce.hr>
2186 * version.c: Released 1.4.2.
2188 * ftp.c (getftp): Simplified assertion.
2189 (ftp_loop_internal): Remove symlink before downloading.
2190 (ftp_retrieve_list): Unlink the symlink name before attempting to
2193 * options.h (struct options): Renamed print_server_response to
2196 * ftp.c (rel_constr): Removed.
2197 (ftp_retrieve_list): Don't use it.
2198 (ftp_retrieve_list): Use opt.retr_symlinks.
2200 1996-11-24 Hrvoje Niksic <hniksic@srce.hr>
2202 * main.c (main): New option retr_symlinks.
2204 * url.c (convert_links): Print verbose message.
2206 1996-11-24 Hrvoje Niksic <hniksic@srce.hr>
2208 * http.c (http_loop): Reset newloc in the beginning of function;
2209 would cause FMR in retrieve_url.
2211 1996-11-23 Hrvoje Niksic <hniksic@srce.hr>
2213 * recur.c (convert_all_links): Find the URL of each HTML document,
2214 and feed it to get_urls_html; would bug out.
2215 (convert_all_links): Check for l2 instead of dl; removed dl.
2217 * url.c (convert_links): Don't refer to freed newname.
2219 * recur.c (recursive_retrieve): Add this_url to urls_downloaded.
2221 * main.c (main): Print the OS_TYPE in the debug output, too.
2223 * recur.c (recursive_retrieve): Check for opt.delete_after.
2225 * main.c (main): New option delete-after.
2227 * init.c (setval): Cleaned up.
2229 1996-11-21 Hrvoje Niksic <hniksic@srce.hr>
2231 * Makefile.in (wget): Make `wget' the default target.
2233 * ftp.c (ftp_loop_internal): Move noclobber checking out of the
2235 (ftp_retrieve_list): Warn about non-matching sizes.
2237 * http.c (http_loop): Made -nc non-dependent on opt.recursive.
2239 * init.c (setnum): Renamed from setnuminf; New argument flags.
2242 * main.c (main): Sorted the options.
2243 (main): New option --wait.
2245 1996-11-21 Shawn McHorse <riffraff@txdirect.net>
2247 * html.c (htmlfindurl): Reset s->in_quote after getting out of
2250 1996-11-20 Hrvoje Niksic <hniksic@srce.hr>
2252 * version.c: Changed version to 1.4.2.
2254 1996-11-20 Hrvoje Niksic <hniksic@srce.hr>
2256 * version.c: Released 1.4.1.
2258 * html.c (html_quote_string): New function.
2259 (ftp_index): Use it.
2260 (htmlfindurl): A more gentle ending debug message.
2262 * ftp.c (ftp_loop): Check for opt.htmlify.
2264 * init.c: New command htmlify.
2266 * ftp.c (getftp): Nicer error messages, with `'-encapsulated
2268 (ftp_loop): Print size of index.html.
2270 * init.c (setval): Implement "styles".
2272 * main.c (main): New option dotstyle.
2274 1996-11-19 Hrvoje Niksic <hniksic@srce.hr>
2276 * ftp.c (getftp): Close the master socket in case of errors, after
2279 * connect.c (bindport): Initialize msock to -1.
2281 * ftp.c (getftp): Initialize dtsock to -1.
2283 * connect.c (closeport): Don't close sock if sock == -1.
2285 1996-11-18 Hrvoje Niksic <hniksic@srce.hr>
2287 * init.c (setnuminf): Nuked default value -- just leave unchanged.
2288 (setval): Don't send default values.
2289 (defaults): Use DEFAULT_TIMEOUT -- aaargh.
2291 * options.h (struct options): Use long for dot_bytes.
2293 * init.c (setquota): Renamed to setbytes.
2294 (setval): Use setbytes on DOTBYTES.
2296 1996-11-17 Hrvoje Niksic <hniksic@srce.hr>
2298 * ftp.c (getftp): Initialize con->dltime.
2300 * recur.c (recursive_retrieve): Use same_host instead of
2301 try_robots; simply load robots_txt whenever the host is changed.
2302 (recursive_retrieve): Free forbidden before calling parse_robots.
2304 1996-11-16 Hrvoje Niksic <hniksic@srce.hr>
2306 * retr.c (show_progress): Use them.
2308 * options.h (struct options): New options dot_bytes, dots_on_line
2311 1996-11-16 Mark Boyns <boyns@sdsu.edu>
2313 * recur.c (recursive_retrieve): Retrieve directories regardless of
2314 acc/rej rules; check for empty u->file.
2316 1996-11-14 Hrvoje Niksic <hniksic@srce.hr>
2318 * init.c (setval): Use it.
2320 * utils.c (merge_vecs): New function.
2322 * init.c (setval): Reset the list-type functions when encountering
2325 1996-11-14 Shawn McHorse <riffraff@txdirect.net>
2327 * recur.c (recursive_retrieve): Use base_url instead of this_url
2330 1996-11-14 Shawn McHorse <riffraff@txdirect.net>
2332 * html.c (htmlfindurl): Reset s->in_quote after exiting the quote.
2334 1996-11-13 Hrvoje Niksic <hniksic@srce.hr>
2336 * utils.c (sepstring): Rewrote; don't use strtok.
2338 * recur.c (recursive_retrieve): Enter assorted this_url to slist
2339 when running the first time.
2340 (retrieve_robots): Warn to ignore errors when robots are loaded.
2342 * utils.c (load_file): Moved from url.c.
2344 * http.c: Made static variables const too in h* functions.
2346 * main.c (main): Renamed --continue-ftp to --continue.
2348 * recur.c (recursive_retrieve): Use it.
2350 * utils.c (frontcmp): New function.
2352 * url.c (accdir): New function.
2354 * html.c (htmlfindurl): Recognize <area href=...>.
2356 * ftp.c (ftp_retrieve_dirs): Implemented opt.includes.
2358 * init.c (setval): Free the existing opt.excludes and
2359 opt.includes, if available.
2361 * main.c (main): New option -I.
2363 1996-11-12 Hrvoje Niksic <hniksic@srce.hr>
2365 * ftp.c (ftp_retrieve_glob): Do not weed out directories.
2367 * version.c: Changed version to 1.4.1.
2369 1996-11-11 Hrvoje Niksic <hniksic@srce.hr>
2371 * version.c: Released 1.4.0.
2373 1996-11-10 Hrvoje Niksic <hniksic@srce.hr>
2375 * main.c (main): Free com and val after parse_line.
2376 (printhelp): Reorder the listing.
2378 * http.c: More robust header parsing.
2380 * http.c: Allow any number of spaces, or no spaces, precede ':'.
2381 (hskip_lws): New function.
2382 (haccepts_bytes): New function.
2385 * init.c (setval): Check header sanity.
2386 (setval): Allow resetting of headers.
2388 1996-11-10 Hrvoje Niksic <hniksic@srce.hr>
2390 * http.c (http_loop): Don't use has_wildcards.
2392 * http.c (gethttp): Free all_headers -- would leak.
2394 * recur.c (recursive_retrieve): Initialize depth to 1 instead of
2395 0 -- this fixes a long-standing bug in -rl.
2397 1996-11-09 Hrvoje Niksic <hniksic@srce.hr>
2399 * ftp.c: Use -1 as "impossible" value for con->fd.
2401 * url.h (URL_SEPARATOR): Don't treat `*' and `+' as separators.
2403 * init.c (parse_line): Use isalpha.
2405 * ftp-unix.c: Use HAVE_UNISTD_H.
2407 * mtch.c (has_wildcards): Don't match \.
2409 * http.c (http_loop): Warn on HTTP wildcard usage.
2411 1996-11-08 Hrvoje Niksic <hniksic@srce.hr>
2413 * url.c (url_filename): Do not create numbered suffixes if
2414 opt.noclobber -- would bug out on -nc.
2416 1996-11-07 Hrvoje Niksic <hniksic@srce.hr>
2418 * recur.c (parse_robots): Don't chuck out the commands without
2419 arguments (`Disallow:<empty>' didn't work).
2420 (parse_robots): Compare versions lowercase.
2421 (parse_robots): Match on base_version, not version_string!
2422 (parse_robots): Handle comments properly.
2423 (parse_robots): Match versions in a sane way.
2425 * init.c: Print nicer error messages.
2427 * version.c: Changed version to 1.4.0.
2429 1996-11-06 Hrvoje Niksic <hniksic@srce.hr>
2431 * version.c: Released 1.4.0-test2.
2433 * init.c (run_wgetrc): Close fp.
2435 * ftp.c (ftp_retrieve_dirs): Allocate the correct length for
2438 1996-11-06 Hrvoje Niksic <hniksic@srce.hr>
2440 * init.c (setquota): Allow inf as quota specification.
2442 1996-11-05 Hrvoje Niksic <hniksic@srce.hr>
2444 * ftp.c (ftp_retrieve_dirs): Return QUOTEXC if quota exceeded.
2445 (ftp_retrieve_glob): Return QUOTEXC on quota exceeded.
2447 * main.c (main): Check for quota by comparison with downloaded
2448 stuff, not from status.
2450 * connect.c (select_fd): Should compile on HPUX without warnings now.
2452 * ftp.c (ftp_get_listing): Check whether ftp_loop_internal
2455 1996-11-04 Hrvoje Niksic <hniksic@srce.hr>
2457 * ftp.c (ftp_retrieve_glob): Print the pattern nicely.
2458 (getftp): Return FTPRETRINT on control connection error.
2460 * html.c (htmlfindurl): Recognize <embed src=...> and
2462 (ftp_index): Handle username and password correctly.
2464 * main.c (main): Made `-np' a synonim for --no-parent.
2466 1996-11-02 Hrvoje Niksic <hniksic@srce.hr>
2468 * ftp.c (ftp_loop): Check for opt.ftp_glob too before calling
2471 * version.c: Changed version to 1.4.0-test2.
2473 1996-11-02 Hrvoje Niksic <hniksic@srce.hr>
2475 * version.c: Released 1.4.0-test1.
2477 * url.c (str_url): Don't use sprintf when creating %2F-prefixed
2479 (convert_links): Removed definition of make_backup.
2481 * http.h: Removed definition of MAX_ERROR_LENGTH.
2483 * host.c (ftp_getaddress): Check for "(none)" domains.
2485 * ftp.c (ftp_retrieve_dirs): Docfix.
2487 * http.c (gethttp): Use ou->referer instead of u->referer.
2489 * retr.c (retrieve_url): Reset u to avoid freeing pointers twice;
2490 this was known to cause coredumps on Linux.
2492 * html.c (ftp_index): Cast the argument to local_time to time_t *.
2494 1996-11-01 Hrvoje Niksic <hniksic@srce.hr>
2496 * connect.c (select_fd): Use exceptfds -- once and for all.
2498 * retr.c (retrieve_from_file): Free filename after
2500 (retrieve_from_file): Send RFIRST_TIME to recursive_retrieve on
2501 first-time retrieval.
2502 (retrieve_from_file): Return uerr_t; new argument, count.
2503 (retrieve_from_file): Break on QUOTEXC.
2505 * init.c (setquota): Fixed a bug that caused rejection of
2506 non-postfixed values..
2508 1996-10-30 Hrvoje Niksic <hniksic@srce.hr>
2510 * version.c: Changed name to wget.
2512 * connect.c (iread): Smarter use of select.
2513 (select_fd): Set errno on timeout. If not timeout, return 1
2516 1996-10-29 Hrvoje Niksic <hniksic@srce.hr>
2518 * ftp.c (ftp_loop_internal): Don't use con->cmd before
2521 1996-10-26 Hrvoje Niksic <hniksic@srce.hr>
2523 * http.c (gethttp): Send correct referer when using proxy.
2524 (gethttp): Use struct urlinfo ou to access the relevant data; send
2525 correct authorization in all cases.
2527 * host.c (same_host): Use skip_uname to skip username and
2530 * url.c (skip_uname): New function.
2533 * host.c (same_host): Do not assume HTTP -- same_host should now
2534 be totally foolproof.
2536 * url.c (skip_proto): New function.
2537 (parse_uname): Use it.
2539 * http.c (gethttp): Create local user and passwd from what is
2542 * url.c (parseurl): Check for HTTP username and password too.
2544 1996-10-25 Hrvoje Niksic <hniksic@srce.hr>
2546 * config.h.in: Removed #define gethostbyname R...
2548 1996-10-22 Hrvoje Niksic <hniksic@srce.hr>
2550 * version.c: Changed version to 1.4.0-test1.
2552 1996-10-21 Hrvoje Niksic <hniksic@srce.hr>
2554 * version.c: "Released" 1.4b29.
2556 * recur.c (recursive_retrieve): Check for no_parent.
2558 * init.c (setval): Option update.
2560 * main.c (main): New option no-parent.
2562 * options.h (struct options): New variable no_parent.
2564 * recur.c (recursive_retrieve): Only files are checked for
2565 opt.accepts and opt.rejects.
2566 (recursive_retrieve): Check directories for opt.excludes.
2567 (recursive_retrieve): Make the dir absolute when checking
2570 * html.c (htmlfindurl): Recognize <applet code=...> and <script
2573 1996-10-18 Hrvoje Niksic <hniksic@srce.hr>
2575 * ftp.c (getftp): Do not line-break assert entries at all.
2576 (ftp_retrieve_dirs): docfix.
2578 * connect.c (select_fd): Use fd + 1 as nfds.
2580 * version.c: Changed version to 1.4b29.
2582 1996-10-18 Hrvoje Niksic <hniksic@srce.hr>
2584 * version.c: "Released" 1.4b28.
2586 * ftp.c (ftp_loop_internal): Check whether f->size == len and
2587 don't continue the loop if it is.
2588 (ftp_get_listing): Remove list_filename on unsuccesful loop.
2590 1996-10-17 Hrvoje Niksic <hniksic@srce.hr>
2592 * ftp.c (ftp_loop_internal): Use strcpy to initialize tmp.
2593 (getftp): Do not use multiline assert.
2595 * http.c (hparsestatline): Use mjr and mnr instead of major and
2596 minor, which don't compile on Ultrix.
2597 (http_loop): Use strcpy() to initialize tmp.
2599 * all: Geturl -> Fetch
2601 1996-10-17 Hrvoje Niksic <hniksic@srce.hr>
2603 * recur.c (parse_robots): Fixed an off-by-one bug when looking for
2606 * html.c (htmlfindurl): Fixed several possible off-by-one bugs by
2607 moving `bufsize &&' to the beginning of each check in for-loops.
2609 * recur.c (parse_robots): Close fp on exit.
2611 * url.c (mymkdir): Check for each directory before creating.
2613 1996-10-16 Hrvoje Niksic <hniksic@srce.hr>
2615 * version.c: Changed version to 1.4b28.
2617 1996-10-16 Hrvoje Niksic <hniksic@srce.hr>
2619 * version.c: "Released" 1.4b27.
2621 * init.c (parse_line): Use isspace.
2622 (parse_line): Free *com on all errors.
2624 * ftp.c (ftp_loop): Change FTPOK to RETROK before exiting.
2625 (delelement): Use next instead of f->next and prev instead of
2627 (delelement): Free the members of the deleted element.
2629 * http.c (http_loop): Do not return RETROK on code != 20x.
2631 * init.c (cleanup): Free opt.user_header.
2632 (cleanup): Free opt.domains.
2634 * url.c (freelists): Moved to cleanup().
2636 * http.c (hparsestatline): Docfix.
2638 * main.c (main): Return with error status on unsuccesful
2641 * init.c (setval): Do not remove listing when mirroring.
2643 * url.c (url_filename): Use opt.fileprefix.
2645 * ftp.c (ftp_get_listing): Use url_filename to get filename for
2648 * main.c (main): New option: -rn.
2650 1996-10-15 Hrvoje Niksic <hniksic@srce.hr>
2652 * Makefile.in (RM): Added RM = rm -f.
2654 * host.c (clean_hosts): New function.
2655 (free_hlist): Just free the list, no reset.
2657 * version.c: Changed version to 1.4b27.
2659 1996-10-13 Hrvoje Niksic <hniksic@srce.hr>
2661 * version.c: "Released" 1.4b26.
2663 * retr.c (retrieve_from_file): If call get_urls_html with
2664 opt.spider to make it silent in spider mode.
2666 * url.c (str_url): Use CLEANDUP instead of URL_CLEANSE.
2668 * url.h (CLEANDUP): New macro.
2670 * http.c (gethttp): Fixed a bug that freed location only when it
2673 * retr.c (retrieve_url): Free url if it will not be stored,
2674 i.e. newloc is NULL.
2676 * html.c (htmlfindurl): Handle exiting from quotes correctly; the
2677 old version would bug out on <a href="x#a"href="y">.
2679 * html.h (state_t): New member in_quote.
2681 * html.c (htmlfindurl): Free s->attr at the beginning of
2684 * recur.c (recursive_retrieve): Recognize RCLEANUP.
2685 (tried_robots): Make hosts a global variable.
2686 (recursive_retrieve): Free constr after URL host optimization.
2687 (tried_robots): Free urlinfo before exiting.
2689 * utils.c (free_slist): New function.
2691 * recur.c (recursive_retrieve): Use flags to add cleanup
2694 * main.c (main): Free filename after recursive_retrieve.
2696 * http.c (gethttp): Store successful responses too.
2698 1996-10-12 Hrvoje Niksic <hniksic@srce.hr>
2700 * all: Constified the whole source. This required some minor
2701 changes in many functions in url.c, possibly introducing bugs -- I
2704 * ftp-basic.c: Removed last_respline.
2706 * http.c (gethttp): Free type.
2708 * host.c (same_host): Free real1 and real2.
2710 * main.c (main): New option --spider.
2712 * retr.c (get_contents): Don't reset errno.
2714 * main.c (main): Sorted the options.
2716 * connect.c (iread): Set errno to ETIMEDOUT only if it was left
2717 uninitialized by select().
2719 * http.c (http_loop): Print the time when the connection is
2721 (gethttp): Debug-print the HTTP request.
2723 1996-10-11 Hrvoje Niksic <hniksic@srce.hr>
2725 * connect.c (iread): Do not try reading after timeout.
2727 * main.c (main): Would bug out on -T.
2729 * connect.c (select_fd): Do not use exceptfds.
2730 (iread): Set ETIMEDOUT on select_fd <= 0.
2732 * version.c: Changed version to 1.4b26.
2734 1996-10-10 Hrvoje Niksic <hniksic@srce.hr>
2736 * version.c: "Released" 1.4b25.
2738 * ftp-unix.c (ftp_parse_unix_ls): Ignore lines without file name
2741 * http.c (gethttp): Add errcode to struct hstat.
2742 (http_loop): Use it.
2744 * url.c (no_proxy_match): Simplify using char** for no_proxy.
2746 * options.h (struct options): Make opt.no_proxy a vector.
2748 * utils.c (sepstring): Use !*s instead of !strlen(s).
2750 * init.c (setval): Set opt.maxreclevel to 0 on --mirror.
2751 (getperms): Use ISODIGIT instead of isdigit.
2753 * ftp.c (getftp): Print time.
2755 * main.c (main): Use legible output of downloaded quantity.
2757 * ftp.c (getftp): Use elapsed_time().
2758 (ftp_loop_internal): Use rate().
2760 * http.c (http_loop): Add download ratio output; Use rate().
2762 * utils.c (rate): New function.
2764 1996-10-09 Hrvoje Niksic <hniksic@srce.hr>
2766 * http.c (http_loop): Use timer.
2768 * ftp.c: Split to ftp-basic.c and ftp.c
2770 * utils.c (reset_timer): New function.
2771 (elapsed_time): New function.
2773 * retr.c (show_progress): Make bytes_in_line and offs long; should
2774 work on 16-bit machines.
2776 1996-10-08 Hrvoje Niksic <hniksic@srce.hr>
2778 * url.c (in_acclist): New argument backward.
2780 * ftp.c (ftp_retrieve_glob): Use acceptable() to determine whether
2781 a file should be retrieved according to suffix.
2782 (ftp_get_listing): Check the return value of unlink; Do not call
2783 ftp_retrieve_dirs if depth reached maxreclevel.
2784 (ftp_retrieve_dirs): Check whether the directory is in
2787 * main.c (main): Print the version number at the beginning of
2789 (main): Use strrchr when creating exec_name.
2791 * ftp.c (ftp_retrieve_glob): Do not close control connection.
2793 * version.c: Changed version to 1.4b25.
2795 1996-10-07 Hrvoje Niksic <hniksic@srce.hr>
2797 * version.c: "Released" 1.4b24.
2799 * Makefile.in: Rewrite.
2801 * ftp.c (ftp_loop_internal): Likewise.
2803 * retr.c (time_str): Check for failed time().
2805 * html.c (htmlfindurl): Recognize <fig src> and <overlay src> from
2808 * retr.c (time_str): Return time_t *.
2810 * connect.c (bindport): Close msock on unsuccesful bind.
2811 (bindport): The same for getsockname and listen.
2813 * retr.c (retrieve_url): Allow any number of retries on
2816 * http.c (gethttp): Do not treat errno == 0 as timeout.
2817 (http_loop): Likewise.
2818 (http_loop): Cosmetic changes.
2820 * connect.c (iread): Set errno to ETIMEDOUT in case of timeout.
2822 * retr.c (get_contents): Reset errno.
2824 * ftp.c (getftp): Minor fixes.
2826 1996-10-06 Hrvoje Niksic <hniksic@srce.hr>
2828 * http.c: Do not use backups.
2830 * geturl.1 (WARNING): Warn that man-page could be obsolete.
2832 * getopt.c (getopt_long): Moved to getopt.c
2834 * geturl.texi: Enhanced.
2836 * main.c (main): Use it.
2838 * recur.c (convert_all_links): New function.
2840 * utils.c (add_slist): New argument flags.
2842 * recur.c (recursive_retrieve): Update a list of downloaded URLs.
2843 (parse_robots): Do not chuck out empty value fields.
2844 (parse_robots): Make yourself welcome on empty Disallow.
2846 * version.c: Changed version to 1.4b24.
2848 1996-10-06 Hrvoje Niksic <hniksic@srce.hr>
2850 * version.c: "Released" 1.4b23.
2852 * ftp.c (ftp_loop_internal): Get the time after getftp.
2854 * Makefile.in (install.info): New target.
2857 * http.c (http_loop): Fix output when doing -O.
2859 1996-10-05 Hrvoje Niksic <hniksic@srce.hr>
2861 * geturl.texi: New file.
2863 * main.c (main): Do not print the warnings and download summary if
2866 * version.c: Changed version to 1.4b23.
2868 1996-10-05 Hrvoje Niksic <hniksic@srce.hr>
2870 * "Released" 1.4b22.
2872 * atotm.c (atotm): Use True and False instead of TRUE and FALSE,
2873 to avoid redefinition warnings.
2875 * host.c (store_hostaddress): Use memcpy() to copy the address
2876 returned by inet_addr.
2878 * version.c: Changed version to 1.4b22.
2880 1996-10-04 Hrvoje Niksic <hniksic@srce.hr>
2882 * version.c: "Released" 1.4b21.
2884 * ftp-unix.c (ftp_parse_ls): Renamed to ftp_parse_unix_ls.
2886 * ftp.c (ftp_port): Use conaddr.
2887 (getftp): Print the file length.
2888 (ftp_retrieve_list): Check the stamps of plain files only.
2890 * connect.c (closeport): Do not call shutdown().
2891 (conaddr): New function.
2893 * html.c (ftp_index): Made it dfp-aware.
2895 * init.c (cleanup): New name of freemem. Close opt.dfp.
2897 * ftp.c (getftp): Use opt.dfp if it is set.
2899 * ftp-unix.c (ftp_parse_ls): Recognize time in h:mm format.
2901 * ftp.c (ftp_retrieve_dirs): Fixed a bug that caused incorrect
2902 CWDs to be sent with recursive FTP retrievals.
2904 1996-10-03 Hrvoje Niksic <hniksic@srce.hr>
2906 * recur.c (parse_robots): Made it more compliant with "official"
2909 * http.c: New function.
2911 * ftp-unix.c (ftp_parse_ls): Added better debug output.
2913 * ftp.c (getftp): Print out the LIST in case of
2914 opt.print_server_response.
2916 * version.c: Changed version to 1.4b21.
2918 1996-10-01 Hrvoje Niksic <hniksic@srce.hr>
2920 * version.c: "Released" 1.4b20.
2924 * http.c (gethttp): Preset lengths of various headers instead of
2925 calculating them dynamically.
2926 (gethttp): Check for 206 partial contents.
2928 1996-09-30 Hrvoje Niksic <hniksic@srce.hr>
2930 * configure.in: Set SYSTEM_GETURLRC to $libdir/geturlrc
2932 * http.c (gethttp): Send the port number in the Host: header.
2934 1996-09-29 Hrvoje Niksic <hniksic@srce.hr>
2936 * http.c (gethttp): Send host: header.
2937 (gethttp): Add the possibility of user-defined headers.
2938 (gethttp): Move decision about pragma: no-cache to http_loop,
2940 (gethttp): Pass a struct instead of enormous argument list.
2941 (http_loop): Use a new, fancier display format.
2942 (ftp_loop): Likewise.
2944 * main.c: (hangup): Turn off buffering of the new log file.
2946 * install-sh: Likewise.
2948 * config.sub: Replace with the one in autoconf-2.10
2952 * init.c: New options httpuser and httppasswd.
2954 * http.c: (base64_encode_line): New function.
2955 (gethttp): Send authentication.
2957 * connect.c (make_connection): Use store_hostaddress.
2959 1996-09-28 Hrvoje Niksic <hniksic@srce.hr>
2961 * host.c (store_hostaddress): New function.
2965 * http.c (hgetrange): New function.
2966 (gethttp): Use ranges.
2968 * utils.c (numdigit): Accept long instead of int.
2970 * http.c (http_loop): Add restart capabilities.
2972 * ftp.c (ftp_retrieve_glob): Fixed a bug that could cause matchres
2973 being used uninitialized.
2974 (ftp_retrieve_list): Similar fix.
2976 * host.c (add_hlist): Fixed a bug that could cause cmp being used
2979 * url.c (construct_relative): New function.
2981 * recur.c (recursive_retrieve): Use it.
2983 * retr.c (convert_links): New function.
2985 1996-09-27 Hrvoje Niksic <hniksic@srce.hr>
2987 * url.c (free_urlpos): New function.
2989 * recur.c (recursive_retrieve): Adapt.
2991 * url.c (get_urls_html): Return a linked list instead of a vector.
2993 * url.c (get_urls_file): Return a linked list instead of a vector.
2997 * http.c (gethttp): Implement it.
2999 * init.c (setval): New option: SAVEHEADERS
3001 * ftp.c (ftp_loop_internal): Do not set restval if listing is to
3002 be retrieved. Lack of this test caused bugs when the connection
3003 was lost during listing.
3005 * retr.c (retrieve_url): Fixed a bug that caused
3006 coredumps. *newloc is now reset by default.
3007 (retrieve_url): Lift the twenty-tries limit on proxies.
3009 * version.c: Changed version to 1.4b20.
3011 1996-09-20 Hrvoje Niksic <hniksic@srce.hr>
3013 * version.c: "Released" 1.4b19.
3015 1996-09-19 Hrvoje Niksic <hniksic@srce.hr>
3017 * ftp.c (ftp_loop_internal): Renamed from ftp_1fl_loop.
3018 (getftp): Changed prototype to accept ccon *.
3020 1996-09-17 Hrvoje Niksic <hniksic@srce.hr>
3022 * ftp.c (ftp_retrieve_list): Fixed a bug that caused setting
3023 incorrect values to files pointed to by symbolic links.
3024 (ftp_1fl_loop): Do not count listings among the downloaded URL-s.
3026 1996-09-16 Hrvoje Niksic <hniksic@srce.hr>
3028 * url.c (mkstruct): Do not prepend "./" in front of a pathname.
3030 * main.c (main): New option: --user-agent.
3036 * init.c (setval): Ditto.
3038 * main.c (main): Rename "server-headers" to "server-response".
3040 * ftp-unix.c (ftp_parse_ls): Check for asterisks at the end of
3041 executables in 'ls -F' listings.
3043 1996-09-15 Hrvoje Niksic <hniksic@srce.hr>
3045 * url.c (parseurl): Remove realloc() and sprintf().
3046 (str_url): Get rid of sprintf().
3048 * recur.c (recursive_retrieve): Enable FTP recursion through proxy
3051 * url.h (URL_CLEANSE): Made it else-resistant.
3052 (USE_PROXY): New macro.
3054 1996-09-14 Hrvoje Niksic <hniksic@srce.hr>
3058 * version.c: Changed version to 1.4b19.
3060 1996-09-14 Hrvoje Niksic <hniksic@srce.hr>
3062 * version.c: "Released" 1.4b18.
3064 * url.c: Made it reallocate space exponentially.
3066 1996-09-14 Drazen Kacar <dave@fly.cc.fer.hr>
3068 * html.c (htmlfindurl): Added <frame src> and <iframe src> among
3069 the list of stuff to fetch.
3071 1996-09-13 Hrvoje Niksic <hniksic@srce.hr>
3073 * url.c (get_urls_html): Fixed a bug that caused SIGSEGV's with
3076 * html.c (htmlfindurl): Rewrite.
3078 * http.c (gethttp): Use opt.proxy_cache.
3080 * main.c (main): Added --cache option.
3082 * ftp.c (ftp_response): Print server response if opt.print_server
3084 (getftp): Print newlines after each request if the server response
3086 (ftp_response): Copy the last response line to last_respline.
3088 * http.c (gethttp): Add Pragma: nocache for retried
3091 * ftp.c (getftp): Use it.
3093 * retr.c (buf_discard): New function.
3095 * ftp.c (ftp_response): Use buf_readchar().
3096 (getftp): Flush the control connection buffer before calling
3099 * retr.c (buf_readchar): New function.
3100 (buf_flush): New function.
3101 (get_contents): Use buf_readchar() instead of read(x, x, 1).
3102 (get_contents): Use buf_flush.
3104 1996-09-12 Hrvoje Niksic <hniksic@srce.hr>
3106 * ftp.c: Incorporate changes to ftp_response.
3108 * ftp.c (ftp_response): Allocate the server response dynamically,
3109 as in read_whole_line and fetch_next_header.
3111 * utils.c (read_whole_line): Fixed a bug that prevented reading
3112 the last line if it is not \n-terminated. Also fixed a possible
3115 * http.c (fetch_next_header): Return malloc-ed string as large as
3117 (gethttp): Use new fetch_next_header.
3119 1996-09-12 Hrvoje Niksic <hniksic@srce.hr>
3121 * http.c (hgetlen): Compute the header length the first time only.
3123 (hgetlocation): Ditto.
3124 (hgetmodified): Ditto.
3126 1996-09-11 Hrvoje Niksic <hniksic@srce.hr>
3128 * sample.geturlrc: Update.
3130 1996-09-10 Hrvoje Niksic <hniksic@srce.hr>
3132 * http.c (http_loop): Ditto.
3134 * ftp.c (getftp): Open the output file as binary.
3136 * version.c: Changed version to 1.4b18.
3138 1996-09-10 Hrvoje Niksic <hniksic@srce.hr>
3140 * version.c: "Released" 1.4b17.
3142 * ftp-unix.c (ftp_parse_ls): If unable to open file, return NULL
3143 instead of failed assertion.
3145 1996-09-09 Hrvoje Niksic <hniksic@srce.hr>
3147 * ftp.c (ftp_get_listing): Add a numbered suffix to LIST_FILENAME
3148 if a file of that name already exists.
3150 1996-09-05 Hrvoje Niksic <hniksic@srce.hr>
3152 * ftp.c (ftp_1fl_loop): Handler FTPPORTERR and FOPENERR correctly.
3154 * config.h.in: Define gethostbyname as Rgethostbyname when using
3157 * configure.in: Check for -lresolv if using Socks.
3159 * version.c: Changed version to 1.4b17.
3161 1996-07-15 Hrvoje Niksic <hniksic@srce.hr>
3163 * version.c: "Released" 1.4b16.
3165 * http.c (gethttp): More intelligent check for first line of HTTP
3167 (gethttp): Would bug out on time-stamping.
3169 * version.c: Changed version to 1.4b16.
3171 1996-07-11 Hrvoje Niksic <hniksic@srce.hr>
3173 * version.c: Released 1.4b15.
3175 * http.c (http_loop): Print \n after the loop entry, not before.
3177 * url.c (url_filename): Use ISDOT.
3179 * url.h (ISDOT): New macro.
3181 * recur.c (recursive_retrieve): Change only opt.recursive for
3184 1996-07-11 Antonio Rosella <antonio.rosella@agip.it>
3186 * socks/geturl.cgi: Fixed version No.
3188 * socks/download-netscape.html: Ditto.
3190 * socks/download.html: Changed socks.html to download.html.
3192 1996-07-11 Hrvoje Niksic <hniksic@srce.hr>
3194 * url.c (url_filename): Check for opt.dirstruct instead for
3195 opt.recursive && opt.dirstruct.
3197 * init.c (defaults): Ditto.
3198 (defaults): Reset dirstruct by default.
3199 (setval): Set opt.dirstruct whenever setting recursive.
3201 * init.h: Removed FORCEDIRHIER.
3203 * INSTALL: Added -L to socks-description.
3205 * version.c: Changed version to 1.4b15.
3207 1996-07-10 Hrvoje Niksic <hniksic@srce.hr>
3209 * version.c: "Released" 1.4b14.
3211 * geturl.1: Update AUTHOR to include Rosella as contributor.
3215 * socks/geturl.cgi: Simplified command creation, nuked <blink>.
3217 * socks/geturl.cgi: Wrap nutscape extensions within if $netscape.
3218 (cal_time): Fix == to eq.
3220 * socks/geturl.cgi: GPL-ized with permission of A. Rosella.
3222 * geturl.1 (hostname): Moved URL CONVENTIONS to the beginning.
3224 * Makefile.in: Use @VERSION@.
3226 * configure.in: Check version from version.c.
3228 * socks/geturl.cgi: Changed /pub/bin/perl to /usr/bin/perl.
3230 * socks/download.html: Created from download-netscape.html, made
3233 * recur.c (recursive_retrieve): Set opt.force_dir_hier when
3234 following FTP links from recursions.
3236 1996-07-09 Hrvoje Niksic <hniksic@srce.hr>
3238 * url.c (mymkdir): Fixed a bug that prevented mymkdir() to create
3239 absolute directories correctly.
3241 * version.c: Changed version to 1.4b14.
3243 1996-07-09 Hrvoje Niksic <hniksic@srce.hr>
3245 * version.c: "Released" 1.4b13.
3247 * url.c (make_backup): New function.
3249 * http.c (http_loop): Make a backup copy of the local file (using
3250 rename(2)) before opening it.
3252 * main.c (main): Added --backups.
3254 * host.c (ftp_getaddress): Bail out on failed mycuserid().
3255 (ftp_getaddress): Check for leading dot on MY_DOMAIN.
3256 (ftp_getaddress): Check for empty, null or (null) domain.
3258 * url.c (get_urls_html): If this_url is NULL, the base must have a
3260 (parseurl): Use has_proto.
3262 * retr.c (retrieve_url): Warn when proxy is used with more than 20
3265 * url.c (mkstruct): Create the directory (calling mymkdir()) only
3266 if it is not already there.
3267 (has_proto): New function.
3268 (get_urls_html): Eliminate the remaining call to findurl -- use
3273 * main.c: Change -X to -x.
3275 * url.c (url_filename): Simplify creation of filename if
3276 prefix_files is set.
3277 (url_filename): Simplify everything. And I do mean *everything*.
3278 (mkstruct): Add dir_prefix before hostname.
3279 (path_simplify): Fixed a bug that caused writing outside the path
3280 string in case of "." and ".." path strings.
3282 1996-07-06 Hrvoje Niksic <hniksic@srce.hr>
3284 * init.c: Added --mirror.
3286 * main.c (main): Added -X to force saving of directory hierarchy.
3288 * ftp.c (ftp_retrieve_list): Added recursion depth counter.
3289 (ftp_retrieve_list): Check whether quota is exceeded.
3291 * url.c (get_urls_html): Skip leading blanks for absolute URIs.
3293 * http.c (gethttp): Use referer if present.
3295 * recur.c (recursive_retrieve): Set u->referer before calling
3298 * url.c (newurl): Use memset to nullify the struct members.
3299 (freeurl): Free the referer field too.
3301 * url.h: Added referer to urlinfo.
3303 * geturl.1: Updated the manual to document some of the new features.
3305 * utils.c (numdigit): Moved from url.c.
3307 * README: Rewritten.
3309 * config.h.in: Add the support for socks.
3311 * configure.in: Add the support for socks.
3313 * url.c (url_filename): If the dir_prefix is ".", work with just
3315 (url_filename): Do not look for .n extensions if timestamping if
3318 * retr.c (show_progress): Skip the over-abundant restval data, and
3319 print the rest of it with ',' instead of '.'.
3321 1996-07-05 Hrvoje Niksic <hniksic@srce.hr>
3323 * retr.c (show_progress): Changed second arg. to long (as it
3325 (show_progress): Moved to retr.c.
3326 (get_contents): Moved to retr.c.
3328 * version.c: Change version to 1.4b13.
3330 1996-07-05 Hrvoje Lacko <hlacko@fly.cc.fer.hr>
3332 * url.c (in_acclist): Would return after the first suffix.
3334 1996-07-04 Hrvoje Niksic <hniksic@srce.hr>
3336 * version.c: "Released" 1.4b12.
3338 * url.c (path_simplify): More kludgifications.
3339 (get_urls_html): Use new parameters for htmlfindurl.
3341 * html.c: Removed memorizing "parser states", since the new
3342 organization does not require them.
3344 * init.c (run_geturlrc): Use read_whole_line.
3346 * ftp-unix.c (ftp_parse_ls): Use read_whole_line.
3348 * recur.c (parse_robots): Use read_whole_line.
3350 * utils.c (read_whole_line): New function.
3352 * recur.c (tried_robots): Use add_slist/in_slist, *much* cleaner.
3354 * host.c (ngethostbyname): Call inet_addr just once. Yet to be
3355 tested on OSF and Ultrix.
3356 (add_hlist): New function.
3357 (free_hlist): New function.
3358 (search_host): New function.
3359 (search_address): New function.
3360 (realhost): Use search_host, search_address and add_hlist.
3361 (same_host): Replaced realloc() with strdupdelim(), made
3362 case-insensitive, fixed a memory leak.
3364 * html.c (ftp_index): Fixed tm_min and tm_sec to be tm_hour and
3365 tm_min, like intended.
3367 * version.c: Change user agent information to
3370 1996-07-03 Hrvoje Niksic <hniksic@srce.hr>
3372 * utils.c: Renamed nmalloc.c to utils.c, .h likewise.
3374 * url.c (acceptable): Always accept directories.
3376 * ftp-unix.c (ftp_parse_ls): Support brain-damaged "ls -F"-loving
3377 servers by stripping trailing @ from symlinks and trailing / from
3380 * ftp.c (ftp_loop): Debugged the "enhanced" heuristics. :-)
3382 * url.c (skip_url): Use toupper instead of UCASE.
3384 * host.c (sufmatch): Made it case-insensitive.
3386 * url.c (match_backwards_or_pattern): Fixed i == -1 to j == -1.
3387 (match_backwards): New function, instead of
3388 match_backwards_or_pattern.
3390 * recur.c (recursive_retrieve): Increased performance by
3391 introducing inl, which reduces number of calls to in_slist to only
3394 * ftp.c (ftp_loop): Enhanced the heuristics that decides which
3397 * main.c (printhelp): Removed the warranty stuff.
3399 1996-07-02 Hrvoje Niksic <hniksic@srce.hr>
3401 * url.c (add_slist): Simplify.
3402 (match_backwards_or_pattern): New function.
3403 (in_acclist): Use match_backwards_or_pattern.
3406 1996-06-30 Hrvoje Niksic <hniksic@srce.hr>
3408 * ftp.c (ftp_loop): Call ftp_index on empty file names, if not
3411 * html.c (ftp_index): Fixed to work. Beautified the output.
3413 * ftp.c (ftp_retrieve_glob): Another argument to control whether
3414 globbing is to be used.
3415 (ftp_retrieve_list): Compare the time-stamps of local and remote
3416 files to determine whether to download.
3418 1996-06-29 Hrvoje Niksic <hniksic@srce.hr>
3420 * ftp.c (rel_constr): New function.
3422 * retr.c (retrieve_from_file): Check for text/html before
3423 retrieving recursively.
3425 * main.c (main): Check whether the file is HTML before going into
3426 recursive HTML retrieving.
3428 * ftp.c (ftp_retrieve_list): Manage directories.
3429 (ftp_retrieve_glob): Pass all the file-types to ftp_retrieve_list.
3430 (ftp_1fl_loop): Fixed a bug that caused con->com to be incorrectly
3431 initialized, causing bugchecks in getftp to fail.
3433 * configure.in: Check for symlink.
3435 * ftp.c (ftp_retrieve_list): Added support for symlinks.
3437 * version.c: "Released" 1.4b10.
3439 * atotm.c (atotm): Redeclared as time_t.
3441 * init.c: New variable "timestamping".
3443 * main.c (main): New option 'N'.
3445 * http.c (hgetlocation): Case-insensitive match.
3446 (hgetmodified): New function.
3447 (http_loop): Implement time-stamping.
3449 1996-06-28 Hrvoje Niksic <hniksic@srce.hr>
3451 * version.c: Changed version to 1.4b10
3453 * atotm.c: New file, from phttpd.
3455 * options.h (struct options): New parameter timestamping.
3457 * version.c: 1.4b9 "released".
3459 * recur.c (recursive_retrieve): Used linked list (ulist) for
3460 faster storing of URLs.
3462 * url.c (get_urls_html): Removed the old kludge with comparing the
3463 outputs of htmlfindurl and findurl.
3464 (get_urls_html): Added better protocol support here.
3465 (create_hash): Removed, as well as add_hash and in_hash.
3466 (addslist): New function.
3469 * version.c: Released 1.4b8, changed version to b9.
3471 1996-06-27 Hrvoje Niksic <hniksic@srce.hr>
3473 * ftp.c (freefileinfo): New function.
3474 (delelement): New function.
3478 * ftp.c (ftp_loop): Use ccon.
3479 (ftp_retrieve_glob): Likewise.
3481 * ftp.h: Define ccon, to define status of control connection.
3483 * ftp.c (ftp_get_listing): New function.
3484 (ftp_retrieve_more): New function.
3485 (ftp_retrieve_glob): New function.
3487 1996-06-25 Hrvoje Niksic <hniksic@srce.hr>
3489 * configure.in: Removed the search for cuserid().
3491 * init.c (getmode): Renamed to getperms.
3493 1996-06-24 Hrvoje Niksic <hniksic@srce.hr>
3495 * version.c: New version.
3497 * main.c (hangup): New function, that handles hangup. Hangup
3498 signal now causes geturl to stop writing on stdout, and to write
3501 * ftp.c (getftp): "Released" 1.4b7.
3503 * html.c (htmlfindurl): Ignore everything inside <head>...</head>.
3504 (ftp_index): Use fileinfo/urlinfo.
3506 * ftp-unix.c (ftp_parse_ls): New function.
3507 (symperms): New function.
3509 * ftp.c (ftp_1fl_loop): New function, to handle 1-file loops.
3511 * retr.c (retrieve_url): Added FTP support.
3513 1996-06-23 Hrvoje Niksic <hniksic@srce.hr>
3515 * geturl.h: Removed NOTFTP2HTML enum.
3516 Added DO_LOGIN, DO_CWD and DO_LIST. LIST_ONLY is obsolete.
3518 * ftp.c (getftp): Resynched with urlinfo.
3519 (getftp): Removed HMTL-ization of index.html from getftp.
3521 * version.c: 1.4b6 "released".
3523 * options.h (options): New struct, to keep options in.
3525 * http.c (http_loop,gethttp): Synched with proxy.
3527 * retr.c (retrieve_url): Implemented proxy retrieval.
3529 * main.c (main): Use retrieve_from_file.
3531 1996-06-22 Hrvoje Niksic <hniksic@srce.hr>
3533 * retr.c (retrieve_from_file): New function.
3535 * url.c (parseurl): Modified to return URLOK if all OK. Protocol
3536 can be found in u->proto.
3538 * ftp.c (ftp_response): Fixed to accept multi-line responses as
3541 * recr.c (recursive_retrieve): Take newloc from retrieve_url.
3543 * url.c (mymkdir): Removed the file of the same name, if one
3545 (isfile): New function.
3546 (mkstruct): Fixed the '/' glitches.
3547 (path_simplify): Hacked to treat something/.. correctly.
3549 1996-06-21 Hrvoje Niksic <hniksic@srce.hr>
3551 * http.c (gethttp): Close the socket after error in headers.
3552 (http_loop): HEOF no longer a fatal header.
3554 * loop.c (retrieve_url): When dt is NULL, do not modify it. This
3555 simplifies the syntax of calling retrieve_url.
3557 * recr.c (recursive_retrieve): Modified to use get_urls_html.
3559 * url.c (get_urls_file): New function.
3560 (get_urls_html): New function.
3562 * recr.c (recursive_retrieve): Patched up to conform to the
3565 * http.c (gethttp): Synched with the rest...
3566 (gethttp): Treat only CONREFUSED specially, with connection
3569 * init.c,geturl.1,http.c (http_loop): Removed kill_error.
3571 1996-06-20 Hrvoje Niksic <hniksic@srce.hr>
3573 * http.c (http_loop): New function.
3575 * loop.c: Removed *lots* of stuff from retrieve_url.
3577 * url.c (parseurl): Changed to work with urlinfo. Integrated
3578 username finding and path parsing.
3579 (newurl): New function.
3580 (freeurl): New function.
3581 (mkstruct): Removed the old bogosities, made it urlinfo-compliant.
3582 (url_filename): Likewise.
3583 (path_simplify): Accept relative paths too.
3584 (opt_url): Made urlinfo-compliant, removed bogosities.
3585 (path_simplify): Expanded to accept relative paths.
3586 (str_url): A replacement for hide_url
3587 (decode_string): Fixed a bug that caused malfunctioning when
3588 encountering an illegal %.. combination.
3589 (opt_url): Removed the argument. Dot-optimizations are now default.
3591 * nmalloc.c (strdupdelim): New function.
3593 * url.h: Added the urlinfo structure
3595 1996-06-19 Hrvoje Niksic <hniksic@srce.hr>
3597 * url.c (hide_url): Thrown out the protocol assertion. Do not
3598 change the URL if the protocol if not recognized.
3599 (findurl): Put continue instead of break.
3601 1996-06-18 Hrvoje Niksic <hniksic@srce.hr>
3603 * sample.geturlrc: Changed the defaults to be commented out and
3604 harmless (previous defaults caused pains if copied to
3607 * http.c (gethttp): Print the HTTP request in debug mode.
3609 * connect.c (iread): Added EINTR check loop to select-ing
3610 too. EINTR is now correctly handled with select().
3614 1996-05-07 Hrvoje Niksic <hniksic@srce.hr>
3616 * host.c (same_host): Made the function a little bit more
3617 intelligent regarding diversified URL syntaxes.
3619 * url.c (skip_url): Spaces are now skipped after URL:
3621 * Released 1.3.1 with the patch to prevent crashing when sending
3622 NULL to robot* functions and the patch to compile "out of the box"
3625 * recr.c (recursive_retrieve): Added checking whether this_url is
3626 NULL when calling the robot functions.
3628 * ChangeLog: New file.