1 1998-12-01 "R. K. Owen" <rkowen@Nersc.GOV>
3 * host.c (store_hostaddress): Fix for big endian 64-bit machines.
5 1998-12-01 Hrvoje Niksic <hniksic@srce.hr>
7 * url.c (UNSAFE_CHAR): New macro.
8 (contains_unsafe): Use it.
9 (encode_string): Ditto.
11 1998-12-01 Hrvoje Niksic <hniksic@srce.hr>
13 * main.c (i18n_initialize): Use LC_MESSAGES only if available.
15 2000-03-31 Hrvoje Niksic <hniksic@srce.hr>
17 * Use TOUPPER/TOLOWER.
19 1998-12-22 Alexander V. Lukyanov <lav@yars.free.net>
21 * ftp-opie.c (btoe): Zero-terminate OSTORE.
23 2000-03-21 Hrvoje Niksic <hniksic@iskon.hr>
25 * wget.h (DO_REALLOC_FROM_ALLOCA): Ditto.
27 * sysdep.h (ISALNUM): New macro.
31 2000-03-10 Dan Harkless <dan-wget@dilvish.speed.net>
33 * html.c (idmatch): Implemented checking of my new --follow-tags
34 and --ignore-tags options.
36 * init.c (commands): Added comment reminding people adding new
37 entries doing allocation to add corresponding freeing in cleanup().
38 (commands): Added new followtags and ignoretags commands.
39 (cleanup): Free storage for new followtags and ignoretags.
41 * main.c: Use of "comma-separated list" was random -- normalized
42 it. Did some alphabetization. Added comments pointing out
43 "Options without arguments" and "Options accepting an argument"
44 sections of long_options[]. Added new options --follow-tags and
45 -G / --ignore-tags. Added comment that Damir's --referer is
46 currently undocumented. Added comment that Heiko's --waitretry is
47 partially undocumented (mentioned in --help but not in
48 wget.texi). Moved improperly sorted 24, 129, and 'G' cases.
50 * options.h (struct options): Added new fields follow_tags and
53 * wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
55 2000-03-02 Dan Harkless <dan-wget@dilvish.speed.net>
57 * ftp.c (ftp_loop_internal): Heiko introduced "suggest explicit
58 braces to avoid ambiguous `else'" warnings. Eliminated them.
60 * http.c (gethttp): Dan Berger's query string patch is totally
61 bogus. If you have two different URLs, gen_page.cgi?page1 and
62 get_page.cgi?page2, they'll both be saved as get_page.cgi and the
63 second will overwrite the first. Also, parameters to implicit
64 CGIs, like "http://www.host.com/db/?2000-03-02" cause the URLs to
65 be printed with trailing garbage characters, and could seg fault.
66 Backing out the patch, which Dan B. informed me by email was just
67 a kludge to download StarOffice from Sun made necessary due to
68 wget's unconditional escaping of certain characters (room for an
70 (http_loop): Heiko introduced "suggest explicit braces to avoid
71 ambiguous `else'" warnings. Eliminated them.
73 * main.c: Heiko's --wait / --waitretry backwards compatibility
74 code looks to have been totally untested -- automatic variable
75 'wr' was used without being initialized, and a long int was passed
76 into setval()'s char* val parameter.
78 * recur.c (parse_robots): Applied Edward J. Sabol
79 <sabol@alderaan.gsfc.nasa.gov>'s patch for Guan Yang's reported
80 problem with "User-agent:<space>*<space>" lines in robots.txt.
82 * url.c (parseurl, str_url): Removing Dan Berger's code (see
83 http.c above for explanation).
85 1999-08-25 Heiko Herold <Heiko.Herold@previnet.it>
87 * ftp.c: Respect new option waitretry.
89 2000-01-30 Damir Dzeko <ddzeko@zesoi.fer.hr>
91 * http.c (gethttp): Send custom Referer, if required.
93 1999-09-24 Charles G Waldman <cgw@fnal.gov>
95 * netrc.c (parse_netrc): Allow passwords to contain spaces.
97 * netrc.c (parse_netrc): New function.
99 1999-09-17 Dan Berger <dberger@ix.netcom.com>
101 * http.c (gethttp): Send it.
103 * url.c (parseurl): Detect query string in HTTP URL-s.
106 2000-03-02 HIROSE Masaaki <hirose31@t3.rim.or.jp>
108 * html.c (html_allow): Add <link href=...> and <script src=...>.
110 1999-05-02 andrew deryabin <djsf@softhome.net>
112 * http.c (gethttp): Specify port in `Host' header only if it's
115 1998-11-03 Edward J. Sabol <sabol@alderaan.gsfc.nasa.gov>
117 * recur.c (recursive_retrieve): If a finite maximum depth is
118 specified, and we're are already at that depth, don't download the
119 HTML file for parsing.
121 2000-03-01 Dan Harkless <dan-wget@dilvish.speed.net>
123 * ftp.c (ftp_loop_internal): Call new downloaded_file() function,
124 even though we don't do conversion on HTML files retrieved via
125 FTP, so _current_ usage of downloaded_file() makes this call unneeded.
126 (ftp_retrieve_list): Added a comment saying where we need to
127 stat() a .orig file if FTP'd HTML file conversion is ever implemented.
128 (ftp_retrieve_list): "Local file '%s' is more recent," is sometimes
129 a lie -- reworded as "Server file no newer than local file '%s' --".
131 * http.c (http_loop): Fixed a typo and clarified a comment.
132 (http_loop): When -K and -N are specified together, compare size
133 and timestamp of server file X against local file X.orig (if
134 extant) rather than converted local file X.
135 (http_loop): "Local file '%s' is more recent," is sometimes a lie
136 -- reworded as "Server file no newer than local file '%s' --".
137 (http_loop): Call new downloaded_file() function to prevent
138 wrongful overwriting of .orig file when -N is specified.
140 * url.c (convert_links): When -K specified, only rename X to
141 X.orig if downloaded_file() returns TRUE. Otherwise when we skip
142 file X due to -N, we clobber an X.orig from a previous invocation.
143 (convert_links): Call the failsafe xstrdup(), not the real strdup().
144 (convert_links): Added a note asking anyone who understands how
145 multiple URLs can correspond to a single file to comment it.
146 (downloaded_file): Added this new function.
148 * url.h (downloaded_file): Added prototype for this new function
149 as well as its downloaded_file_t enum type.
151 * wget.h (boolean): Added this new typedef and TRUE and FALSE #defines.
153 2000-02-29 Dan Harkless <dan-wget@dilvish.speed.net>
155 * version.c: Upped version to developer-only "1.5.3+dev".
157 2000-02-18 Dan Harkless <dan-wget@dilvish.speed.net>
159 * init.c (backup_converted): Added this new option.
161 * main.c (-K / --backup-converted): Added this new option.
163 * options.h (backup_converted): Added this new option.
165 * url.c (convert_links): When backup_converted is specified, save
166 file X as X.orig before converting.
168 * url.h (urlpos): Fixed typo -- said "Rekative" instead of "Relative".
170 1998-09-21 Hrvoje Niksic <hniksic@srce.hr>
172 * version.c: Wget 1.5.3 is released.
174 1998-09-21 Hrvoje Niksic <hniksic@srce.hr>
176 * host.c (ftp_getaddress): Don't warn when reverse-lookup of local
177 address doesn't yield FQDN.
179 1998-09-21 Andreas Schwab <schwab@issan.informatik.uni-dortmund.de>
181 * cmpt.c (strerror): Fix declaration of sys_errlist.
183 1998-09-11 Hrvoje Niksic <hniksic@srce.hr>
185 * main.c (main): Don't use an array subscript as the first
186 argument to STRDUP_ALLOCA.
189 1998-09-11 Szakacsits Szabolcs <szaka@sienet.hu>
191 * html.c (htmlfindurl): Download table background.
193 1998-09-11 Hans Grobler <grobh@conde.ee.sun.ac.za>
195 * init.c (parse_line): Would free *com before allocating it.
196 (parse_line): Would free com instead of *com.
198 1998-09-10 Howard Gayle <howard@fjst.com>
200 * url.c (get_urls_html): Would drop the last character of the
203 1998-09-10 Hrvoje Niksic <hniksic@srce.hr>
205 * http.c (http_loop): Don't print status code if quiet.
207 1998-09-10 Kaveh R. Ghazi <ghazi@caip.rutgers.edu>
209 * log.c: Use <stdarg.h> only when __STDC__.
211 1998-09-10 Adam D. Moss <adam@foxbox.org>
213 * html.c (htmlfindurl): Download <layer src=...>.
215 1998-09-10 Howard Gayle <howard@fjst.com>
217 * ftp.c (ftp_retrieve_list): Don't update the time stamp of a file
220 1998-06-27 Hrvoje Niksic <hniksic@srce.hr>
222 * utils.c: Include <libc.h> on NeXT.
224 1998-06-26 Heinz Salzmann <heinz.salzmann@intermetall.de>
226 * url.c (get_urls_html): Fix calculation of URL position.
228 1998-06-23 Hrvoje Niksic <hniksic@srce.hr>
230 * version.c: Wget 1.5.2 is released.
232 1998-06-23 Dave Love <d.love@dl.ac.uk>
234 * ftp.c, init.c, netrc.c: Include errno.h.
236 * http.c: Include errno.h and time header.
238 * Makefile.in (exext): Define.
239 (install.bin, uninstall.bin): Use it.
241 1998-06-21 Hrvoje Niksic <hniksic@srce.hr>
243 * http.c (http_loop): Don't attempt to compare local and remote
244 sizes if the remote size is unknown.
246 1998-06-16 Hrvoje Niksic <hniksic@srce.hr>
248 * url.c (get_urls_html): Use malloc() instead of alloca in the
251 1998-06-13 Hrvoje Niksic <hniksic@srce.hr>
253 * version.c: Wget 1.5.2-b4 is released.
255 1998-06-13 Hrvoje Niksic <hniksic@srce.hr>
257 * url.c (get_urls_html): Ignore spaces before and after the URI.
259 1998-06-08 Wanderlei Antonio Cavassin <cavassin@conectiva.com.br>
261 * ftp.c (getftp): Translate `done'.
263 1998-06-06 Hrvoje Niksic <hniksic@srce.hr>
265 * version.c: Wget 1.5.2-b3 is released.
267 1998-06-06 Alexander Kourakos <awk@bnt.com>
269 * init.c (cleanup): Close dfp, don't free it.
271 1998-06-06 Hrvoje Niksic <hniksic@srce.hr>
273 * utils.c (make_directory): Twiddle.
275 * config.h.in: Added template for access().
277 1998-06-05 Mathieu Guillaume <mat@cythere.com>
279 * html.c (htmlfindurl): Download <input src=...>
281 1998-06-03 Hrvoje Niksic <hniksic@srce.hr>
283 * utils.c (file_exists_p): Use access() with two arguments.
285 1998-05-27 Martin Kraemer <Martin.Kraemer@mch.sni.de>
287 * netrc.c (parse_netrc): Correct logic.
289 1998-05-27 Hrvoje Niksic <hniksic@srce.hr>
291 * ftp.c (getftp): Added `break'; suggested by Lin Zhe Min
292 <ljm@ljm.wownet.net>.
294 1998-05-24 Hrvoje Niksic <hniksic@srce.hr>
296 * version.c: Wget 1.5.2-b2 is released.
298 1998-05-18 Juan Jose Rodriguez <jcnsoft@jal1.telmex.net.mx>
300 * mswindows.h: Don't translate mkdir to _mkdir under Borland.
302 1998-05-17 Hrvoje Niksic <hniksic@srce.hr>
304 * retr.c (elapsed_time): Return correct value when
305 HAVE_GETTIMEOFDAY is undefined.
307 1998-05-13 Hrvoje Niksic <hniksic@srce.hr>
309 * version.c: Wget 1.5.2-b1 is released.
311 1998-05-08 Hrvoje Niksic <hniksic@srce.hr>
313 * getopt.c (_getopt_internal): Use exec_name instead of argv[0].
314 (_getopt_internal): Don't translate `#if 0'-ed strings.
316 1998-05-06 Douglas E. Wegscheid <wegscd@whirlpool.com>
318 * mswindows.c (ws_handler): Use fork_to_background().
320 1998-05-05 Hrvoje Niksic <hniksic@srce.hr>
322 * version.c: Wget 1.5.1 is released.
324 1998-05-05 Hrvoje Niksic <hniksic@srce.hr>
326 * http.c (parse_http_status_line): Avoid `minor' and `major'
329 1998-05-02 Hrvoje Niksic <hniksic@srce.hr>
331 * utils.c (mkdirhier): Renamed to make_directory.
333 1998-05-01 Hrvoje Niksic <hniksic@srce.hr>
335 * mswindows.c (fork_to_background): Define under Windows.
337 * utils.c (fork_to_background): New function.
339 * html.c (htmlfindurl): Removed rerdundant casts.
341 1998-05-01 Douglas E. Wegscheid <wegscd@whirlpool.com>
343 * mswindows.c (ws_mypath): Cache the path.
345 1998-04-30 Douglas E. Wegscheid <wegscd@whirlpool.com>
347 * ftp.h: Prefix enum ftype members with FT_.
349 * ftp-ls.c, ftp.c, html.h: Adjust accordingly.
351 * mswindows.h: Use stat under Borland, _stat under MSVC.
353 1998-04-28 Hrvoje Niksic <hniksic@srce.hr>
355 * http.c (known_authentication_scheme_p): New function.
356 (gethttp): Handle authorization more correctly.
358 * ftp-basic.h: Removed.
362 * utils.c: Include <unistd.h> before <pwd.h>; needed under SunOS
364 (numdigit): Use `while' loop.
366 * http.c (create_authorization_line): Detect authentication
367 schemes case-insensitively.
369 * http.c (extract_header_attr): Use strdupdelim().
370 (digest_authentication_encode): Move declaration of local
371 variables to smaller scope.
372 (digest_authentication_encode): Reset REALM, OPAQUE and NONCE.
373 (create_authorization_line): Detect authentication schemes
376 * utils.c (touch): Constify.
378 * http.c (gethttp): Report a nicer error when no data is received.
380 * rbuf.h (RBUF_READCHAR): Ditto.
382 * ftp-basic.c (ftp_response): Use sizeof.
384 1998-04-27 Hrvoje Niksic <hniksic@srce.hr>
386 * retr.c (print_percentage): EXPECTED is long, not int.
387 (print_percentage): Use floating-point arithmetic to avoid
388 overflow with large files' sizes multiplied with 100.
390 1998-04-27 Gregor Hoffleit <flight@mathi.uni-heidelberg.de>
392 * config.h.in: Added pid_t stub.
394 * sysdep.h (S_ISREG): Moved here from mswindows.h (NeXT doesn't
397 1998-04-20 Hrvoje Niksic <hniksic@srce.hr>
399 * version.c: Wget 1.5.0 is released.
401 1998-04-18 Hrvoje Niksic <hniksic@srce.hr>
403 * url.c (str_url): Ditto.
405 * ftp-basic.c (ftp_rest): Use new name.
407 * utils.c (long_to_string): Renamed from prnum().
409 1998-04-16 Hrvoje Niksic <hniksic@srce.hr>
411 * version.c: Wget 1.5-b17 is released.
413 1998-04-08 Hrvoje Niksic <hniksic@srce.hr>
415 * headers.c (header_get): New argument FLAGS.
417 * http.c (gethttp): If request is malformed, bail out of the
419 (gethttp): Check for empty header *after* the status line checks.
420 (gethttp): Disallow continuations for status line.
422 1998-04-08 Hrvoje Niksic <hniksic@srce.hr>
424 * version.c: Wget 1.5-b16 is released.
426 1998-04-08 Hrvoje Niksic <hniksic@srce.hr>
428 * init.c (commands): Renamed `always_rest' to `continue'.
430 1998-04-05 Hrvoje Niksic <hniksic@srce.hr>
434 * log.c (logputs): New argument.
438 1998-04-04 Hrvoje Niksic <hniksic@srce.hr>
440 * http.c (http_atotm): Update comment.
442 * main.c (i18n_initialize): Set LC_MESSAGES, not LC_ALL.
444 * wget.h: Renamed ENABLED_NLS to HAVE_NLS.
446 * main.c (i18n_initialize): New function.
449 * log.c: Include <unistd.h>.
451 * retr.c (show_progress): Cast alloca to char *.
453 1998-04-04 Hrvoje Niksic <hniksic@srce.hr>
455 * version.c: Wget 1.5-b15 is released.
457 1998-04-04 Hrvoje Niksic <hniksic@srce.hr>
459 * utils.h: Declare file_non_directory_p().
461 1998-04-03 Hrvoje Niksic <hniksic@srce.hr>
463 * main.c (main): It's `tries', not `numtries' now.
465 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
467 * init.c (getperms): Removed.
469 1998-04-01 Tim Charron <tcharron@interlog.com>
471 * log.c (logvprintf): Don't use ARGS twice.
473 1998-04-01 John <john@futuresguide.com>
475 * mswindows.c: Cleaned up.
477 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
479 * version.c: Wget 1.5-b14 is released.
481 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
483 * ftp-opie.c (STRLEN4): New macro.
486 1998-04-01 Junio Hamano <junio@twinsun.com>
488 * http.c: Document all the Digest functions.
490 1998-04-01 Hrvoje Niksic <hniksic@srce.hr>
492 * utils.c (file_non_directory_p): Renamed from isfile().
494 * mswindows.h (S_ISREG): New macro, suggested by Tim Adam.
496 1998-03-31 Hrvoje Niksic <hniksic@srce.hr>
498 * utils.c (mkdirhier): Use 0777 instead of opt.dirmode.
500 * init.c (cmd_spec_dotstyle): Use 48 dots per line for binary
502 (cmd_permissions): Removed.
504 * config.h.in: Add template for WORDS_BIGENDIAN.
506 1998-03-31 Junio Hamano <junio@twinsun.com>
508 * http.c (HEXD2asc): New macro.
511 1998-03-31 Hrvoje Niksic <hniksic@srce.hr>
513 * version.c: Wget 1.5-b13 is released.
515 1998-03-31 Hrvoje Niksic <hniksic@srce.hr>
517 * main.c (main): Don't try to use `com'.
519 1998-03-30 Hrvoje Niksic <hniksic@srce.hr>
521 * init.c (cmd_permissions): New function.
523 1998-03-30 Hrvoje Niksic <hniksic@srce.hr>
525 * version.c: Wget 1.5-b12 is released.
527 1998-03-30 Hrvoje Niksic <hniksic@srce.hr>
529 * init.c (commands): Renamed `numtries' to `tries'.
530 (cmd_spec_debug): Removed.
531 (home_dir): Under Windows, return `C:\' if HOME is undefined.
533 1998-03-29 Hrvoje Niksic <hniksic@srce.hr>
535 * config.h.in: Define _XOPEN_SOURCE.
537 * init.c (check_user_specified_header): New function.
538 (cmd_spec_header): Use it.
539 (cmd_spec_useragent): New function.
541 1998-03-29 Hrvoje Niksic <hniksic@srce.hr>
543 * version.c: Wget 1.5-b11 is released.
545 1998-03-28 Hrvoje Niksic <hniksic@srce.hr>
547 * wget.h: Include <libintl.h> only if NLS is enabled.
549 1998-03-26 Hrvoje Niksic <hniksic@srce.hr>
551 * options.h (struct options): Made `wait' a long.
552 (struct options): Ditto for `timeout'.
554 1998-03-19 Hrvoje Niksic <hniksic@srce.hr>
556 * utils.c (exists): Renamed to file_exists_p.
557 (file_exists_p): Use access() if available.
559 1998-03-17 Hrvoje Niksic <hniksic@srce.hr>
561 * utils.c (memfatal): Set save_log_p to 0 to avoid potential
564 * log.c: do_logging -> save_log_p.
566 * config.h.in: Added template for HAVE_VSNPRINTF.
568 1998-03-16 Hrvoje Niksic <hniksic@srce.hr>
572 * http.c: Protect declaration against non-ANSI compiler.
574 * log.c (logvprintf): Use vsnprintf() if available.
576 * getopt.c (main): Don't translate test stuff.
578 1998-03-16 Hrvoje Niksic <hniksic@srce.hr>
580 * version.c: Wget 1.5-b10 is released.
582 1998-03-11 Hrvoje Niksic <hniksic@srce.hr>
584 * ftp.c (getftp): Don't translate "CWD %s".
586 * wget.h (GCC_FORMAT_ATTR): Renamed from FORMAT_ATTR.
588 1998-03-07 Hrvoje Niksic <hniksic@srce.hr>
590 * ftp-opie.c (btoe): Use memcpy() instead of strncat().
592 * log.c (logputs): New function.
593 (logvprintf): Renamed from vlogmsg; use logputs().
595 * retr.c (show_progress): Print `[100%]' when the retrieval is
598 * init.c (run_wgetrc): Use FILE, not PATH.
599 (wgetrc_file_name): Ditto.
601 1998-03-07 Tim Adam <tma@osa.com.au>
603 * recur.c (parse_robots): Correctly reset `entries' on empty
606 1998-03-07 Hrvoje Niksic <hniksic@srce.hr>
608 * init.c (cmd_spec_debug): Use cmd_boolean().
610 1998-02-23 Hrvoje Niksic <hniksic@srce.hr>
612 * http.c (gethttp): Create proxy-authorization correctly.
614 1998-02-22 Hrvoje Niksic <hniksic@srce.hr>
618 * getopt.c: Use ANSI function definitions.
620 * ftp-opie.c: New file.
622 * options.h: Don't redefine EXTERN.
624 * init.c: Sort it correctly.
626 1998-02-22 Hrvoje Niksic <hniksic@srce.hr>
628 * version.c: Wget 1.5-b9 is released.
630 1998-02-22 Hrvoje Niksic <hniksic@srce.hr>
632 * recur.c (recursive_retrieve): Reset `first_time'.
634 * ftp.c (getftp): Added `default' clause to switches of uerr_t.
636 * rbuf.c (rbuf_peek): Simplified.
637 (rbuf_flush): Use MINVAL.
639 * wget.h (MINVAL): Moved from url.h.
641 * rbuf.h (RBUF_FD): New macro.
643 * url.c (add_url): Add to the head of the list.
645 * ftp.c (ftp_retrieve_list): Set the permissions to downloaded
647 (getftp): Set the default permissions to 0600.
649 1998-02-21 Hrvoje Niksic <hniksic@srce.hr>
651 * url.c (get_urls_html): Ditto.
652 (convert_links): Ditto.
654 * recur.c (parse_robots): Ditto.
656 * html.c (ftp_index): Ditto.
658 * ftp-ls.c (ftp_parse_unix_ls): Open file as binary.
660 * init.c (defaults): Initialize `opt' to zero via memset.
662 * http.c (digest_authentication_encode): goto considered harmful.
664 1998-02-19 Hrvoje Niksic <hniksic@srce.hr>
666 * ftp.c (delelement): Simplify and fix leak.
668 1998-02-18 Hrvoje Niksic <hniksic@srce.hr>
670 * http.c (dump_hash): Use HEXD2ASC instead of home-grown stuff.
672 * url.h (HEXD2ASC): Removed warning.
674 * init.c (comind): Use binary search.
675 (commands): Reorganized.
677 (cmd_boolean): New function.
679 (cmd_number_inf): Ditto.
682 (cmd_directory_vector): Ditto.
685 (cmd_spec_debug): Ditto.
686 (cmd_spec_dirmode): Ditto.
687 (cmd_spec_dirstruct): Ditto.
688 (cmd_spec_dotstyle): Ditto.
689 (cmd_spec_header): Ditto.
690 (cmd_spec_htmlify): Ditto.
691 (cmd_spec_mirror): Ditto.
692 (cmd_spec_outputdocument): Ditto.
693 (cmd_spec_recursive): Ditto.
694 (settime): Merged with cmd_time().
695 (setbytes): Merged with cmd_bytes().
696 (setonoff): Merged with cmd_boolean().
699 1998-02-17 Hrvoje Niksic <hniksic@srce.hr>
701 * Makefile.in (distclean): Remove `config.h'.
703 1998-02-17 Hrvoje Niksic <hniksic@srce.hr>
705 * version.c: Wget 1.5-b8 is released.
707 1998-02-17 Hrvoje Niksic <hniksic@srce.hr>
709 * http.c (digest_authentication_encode): New function.
710 (create_authorization_line): Use it.
711 (dump_hash): New function.
712 (digest_authentication_encode): Use it.
714 * fnmatch.c: Renamed from `mtch.c'.
716 1998-02-15 Karl Eichwalder <ke@suse.de>
718 * main.c (main): Tag "Written by..." string as translatable.
720 1998-02-15 Hrvoje Niksic <hniksic@srce.hr>
722 * wget.h (FREE_MAYBE): New macro.
724 * http.c (create_authorization_line): Don't use ANSI C string
725 concatenation feature.
726 (basic_authentication_encode): Use alloca() for temporary
735 * ftp-basic.c: Protect declaration against non-ANSI compiler.
737 * http.c (create_authorization_line): Cast `unsigned char *' to
738 `char *' for sprintf, to shut up the noisy Digital Unix cc.
740 1998-02-15 Hrvoje Niksic <hniksic@srce.hr>
742 * version.c: Wget 1.5-b7 is released.
744 1998-02-15 Hrvoje Niksic <hniksic@srce.hr>
746 * cmpt.c (strstr): Synched with glibc-2.0.6.
748 * ftp-basic.c (calculate_skey_response): Ditto.
749 (calculate_skey_response): Use alloca().
751 * http.c (create_authorization_line): Work with FSF's version of
754 * md5.c: New file, from GNU libc.
756 1998-02-14 Hrvoje Niksic <hniksic@srce.hr>
758 * url.h (URL_CLEANSE): Name the temporary variable more carefully.
760 1998-02-13 Hrvoje Niksic <hniksic@srce.hr>
762 * http.c (basic_authentication_encode): New function, instead of
765 1998-02-13 Junio Hamano <junio@twinsun.com>
767 * http.c: Add HTTP-DA support.
768 * ftp-basic.c: Add Opie/S-key support.
769 * config.h.in, Makefile.in: Add HTTP-DA and Opie/S-key support.
770 * md5.c, md5.h: New files.
772 1998-02-13 Hrvoje Niksic <hniksic@srce.hr>
774 * http.c (http_process_range): Renamed from hprocrange().
775 (http_process_range): Parse the whole header.
777 * headers.c: New file.
778 (header_process): New function.
779 (header_get): Renamed from fetch_next_header.
781 * all: Include utils.h only where necessary.
783 * wget.h: Declare xmalloc(), xrealloc() and xstrdup() here.
785 * wget.h: Add provisions for dmalloc.
787 1998-02-12 Hrvoje Niksic <hniksic@srce.hr>
789 * version.c: Wget 1.5-b6 is released.
791 1998-02-12 Hrvoje Niksic <hniksic@srce.hr>
793 * ftp.c (ftp_loop): Determine `filename' more precisely.
795 * init.c (setval): Don't set `opt.quiet' if output-document is
798 * log.c (log_init): Print to STDERR instead of STDOUT.
799 (vlogmsg): Use STDERR by default.
802 1998-02-11 Simon Josefsson <jas@pdc.kth.se>
804 * host.c: Use addr_in again.
806 1998-02-08 Karl Eichwalder <karl@suse.de>
808 * http.c (gethttp): Fixed typo.
810 1998-02-08 Hrvoje Niksic <hniksic@srce.hr>
812 * version.c: Wget 1.5-b5 is released.
814 1998-02-08 Hrvoje Niksic <hniksic@srce.hr>
816 * retr.c (show_progress): Use it.
818 * log.c (logflush): New function.
820 * wget.h: Utilize __attribute__ if on gcc.
822 1998-02-07 Hrvoje Niksic <hniksic@srce.hr>
824 * http.c (base64_encode_line): New argument LENGTH.
825 (BASIC_AUTHENTICATION_ENCODE): Use it.
826 (BASIC_AUTHENTICATION_ENCODE): Take length of HEADER into account.
828 * main.c (main): Fixed fprintf() format mismatch.
830 1998-02-06 Hrvoje Niksic <hniksic@srce.hr>
832 * version.c: Wget 1.5-b4 is released.
834 1998-02-03 Simon Josefsson <jas@pdc.kth.se>
836 * host.c: use sockaddr_in instead of addr_in.
838 1998-02-04 Hrvoje Niksic <hniksic@srce.hr>
840 * init.c (cleanup): Use it.
842 * recur.c (recursive_cleanup): New function.
844 * retr.c (retrieve_from_file): Ditto.
846 * main.c (main): Use it.
848 * recur.c (recursive_reset): New function.
850 * retr.c (retrieve_from_file): Ditto.
852 * main.c (main): Simplify call to recursive_retrieve().
854 * recur.c (recursive_retrieve): Removed FLAGS argument.
856 * http.c (gethttp): Changed call to iwrite().
858 1998-02-03 Hrvoje Niksic <hniksic@srce.hr>
860 * url.c (get_urls_html): Ditto.
861 (free_urlpos): Ditto.
865 * retr.c (retrieve_url): Move declaration of local variables to
868 * url.c (urlproto): Use it.
871 (get_urls_html): Ditto.
873 * utils.h (ARRAY_SIZE): New macro.
875 * url.c (proto): Moved from url.h.
877 * url.h (URL_CLEANSE): Reformatted.
878 (USE_PROXY_P): Renamed from USE_PROXY.
880 * ftp-basic.c: Adjust to the new interface of iwrite().
882 * ftp-basic.c (ftp_port): Use alloca().
884 1998-02-03 Hrvoje Niksic <hniksic@srce.hr>
886 * version.c: Wget 1.5-b3 is released.
888 * host.c (ftp_getaddress): Don't print to stderr directly.
890 * init.c (setbytes): Support `g' for gigabytes.
891 (cmdtype): New specification CTIME.
892 (setval): Use it with settime().
893 (commands): Use it for WAIT and TIMEOUT.
895 1998-02-02 Hrvoje Niksic <hniksic@srce.hr>
897 * http.c (BASIC_AUTHENTICATION_ENCODE): New macro.
900 * utils.c (unique_name_1): Moved from url.c.
901 (unique_name): Ditto.
903 * url.c (url_filename): Ditto.
905 * log.c (redirect_output): Changed call to unique_name().
907 * url.c (unique_name_1): Renamed from unique_name().
908 (unique_name): Changed interface.
910 * init.c (enum cmdid): Moved from init.h.
914 * main.c (main): Use it.
915 (main): Moved `--backups' to not have a short option.
917 * options.h (struct options): New member BACKGROUND.
919 * main.c (print_help): Rearranged.
920 (main): New long options for -n* short options: --no-directories,
921 --no-host-directories, --non-verbose, --no-host-lookup and
922 --dont-remove-listing.
924 1998-02-01 Hrvoje Niksic <hniksic@srce.hr>
926 * main.c (main): Use log_close().
928 * log.c: New variable LOGFP.
930 (redirect_output): Don't open /dev/null; set LOGFP to stdin
932 (log_close): New function.
934 * options.h (struct options): Removed LFILE.
936 * log.c (log_enable): Removed.
938 * main.c (main): Use it.
940 * log.c (log_init): New function.
942 * url.c (get_urls_html): Removed needless assignment to BASE.
944 * host.c (add_hlist): Don't set CMP needlessly.
946 * utils.c (match_backwards): Ditto.
949 * url.c (findurl): Ditto.
951 * netrc.c (parse_netrc): Ditto.
953 * log.c (log_dump): Ditto.
955 * html.c (html_quote_string): Ditto.
957 * ftp-basic.c (ftp_request): Made static.
959 * connect.c: Made global variables static.
961 * url.c (construct): Ditto.
963 * init.c (init_path): Avoid assignment inside `if'-condition.
965 * ftp.c: Don't include in.h or winsock.h.
967 * ftp.c (ftp_loop): Use SZ.
969 * connect.c (bindport): Cast &addrlen to int *.
972 * init.c (initialize): Don't use SYSTEM_WGETRC unconditionally.
974 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
976 * ftp.c (getftp): Initialize opt.ftp_pass here.
977 (ftp_retrieve_dirs): Use alloca().
979 * init.c (defaults): Don't initialize opt.ftp_pass.
981 * sysdep.h (S_ISLNK): Declare for OS/2; ditto for lstat.
982 From Ivan F. Martinez <ivanfm@ecodigit.com.br>.
984 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
986 * recur.c (parse_robots): Check for comments more correctly.
988 * host.c (ftp_getaddress): Use STRDUP_ALLOCA.
989 (ftp_getaddress): Add diagnostics when reverse-lookup yields only
992 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
994 * version.c: Wget 1.5-b2 is released.
996 * netrc.c (NETRC_FILE_NAME): Moved from netrc.h.
998 * utils.c (proclist): Pass FNM_PATHNAME to fnmatch().
1000 * ftp-basic.c (ftp_pasv): Avoid unnecessary casting to unsigned
1003 * log.c: Don't attempt to hide arguments from ansi2knr.
1005 * cmpt.c: Synched strptime() and mktime() with glibc-2.0.6.
1007 * ansi2knr.c: Use a later version, from fileutils-3.16l alpha.
1009 * ftp.c (getftp): Ditto.
1011 * http.c (gethttp): Use it.
1013 * retr.c (get_contents): New argument EXPECTED; pass it to
1015 (show_progress): New argument EXPECTED; use it to display
1018 * init.c (setval): Ditto.
1020 * http.c (gethttp): Ditto.
1023 * ftp.c (getftp): Ditto.
1024 (ftp_loop_internal): Ditto.
1026 * ftp-ls.c (ftp_parse_unix_ls): Use abort() instead of assert(0).
1028 * sysdep.h (CLOSE): Simplify; use DEBUGP.
1030 * netrc.c (search_netrc): Use alloca().
1032 * init.c (defaults): Initialize no_flush.
1034 * log.c (vlogmsg): Don't flush if no_flush.
1036 * options.h (struct options): New variable no_flush.
1038 * main.c (main): Don't play games with buffering.
1040 * log.c (vlogmsg): Flush the output after every message.
1042 1998-01-31 Hrvoje Niksic <hniksic@srce.hr>
1044 * init.c (parse_line): Ditto.
1046 * url.c (get_urls_html): Ditto.
1048 * main.c (main): Don't cast to unsigned char.
1050 * init.c (run_wgetrc): Don't cast to unsigned char.
1051 (parse_line): Accept char instead of unsigned char.
1053 * html.c (htmlfindurl): Use char instead of unsigned char.
1057 * sysdep.h: Add wrappers to ctype macros to make them
1060 1998-01-30 Hrvoje Niksic <hniksic@srce.hr>
1062 * html.c (htmlfindurl): Download <img lowsrc=...>
1064 * main.c (main): Ignore SIGPIPE.
1066 * connect.c (select_fd): New argument WRITEP.
1067 (iwrite): Call select_fd().
1069 1997-02-27 Fila Kolodny <fila@ibi.com>
1071 * ftp.c (ftp_retrieve_list): If retrieving symlink and the proper
1072 one already exists, just skip it.
1074 1998-01-30 Hrvoje Niksic <hniksic@srce.hr>
1076 * http.c (gethttp): Cosmetic changes.
1078 * http.c (check_end): Allow `+D...' instead of `GMT'.
1079 From Fabrizio Pollastri <pollastri@cstv.to.cnr.it>.
1081 * url.c (process_ftp_type): New function.
1084 * connect.c (iwrite): Allow writing in a few chunks.
1085 (bindport): Made SRV static, so addr can point to it.
1086 (select_fd): Removed HPUX kludge.
1088 * host.c (free_hlist): Incorporated into clean_hosts().
1090 1998-01-29 Hrvoje Niksic <hniksic@srce.hr>
1092 * host.c (hlist): Made static.
1093 (search_address): Cosmetic change.
1095 1998-01-29 Hrvoje Niksic <hniksic@srce.hr>
1097 * version.c: Wget v1.5-b1 is released.
1099 * http.c (hgetlen): Use sizeof() to get the header length.
1102 (hgetlocation): Ditto.
1103 (hgetmodified): Ditto.
1104 (haccepts_none): Ditto.
1106 * main.c (main): Updated `--version' and `--help' output, as per
1107 Francois Pinard's suggestions.
1109 * main.c: Include locale.h; call setlocale(), bindtextdomain() and
1112 * config.h.in: Define stubs for I18N3.
1114 * wget.h: Include libintl.h.
1116 1998-01-28 Hrvoje Niksic <hniksic@srce.hr>
1118 * url.c (mkstruct): Check for opt.cut_dirs.
1119 (mkstruct): alloca()-te more, xmalloc() less.
1121 * utils.c (load_file): Check for ferror().
1123 * url.c (get_urls_file): Close only the files we opened.
1124 (get_urls_html): Ditto.
1125 (count_slashes): New function.
1129 * http.c (gethttp): Respect username and password provided by
1131 (base64_encode_line): Write into an existing buffer instead of
1132 malloc-ing a new one.
1133 (struct http_stat): Moved from http.h
1135 * retr.c (retrieve_url): Free SUF.
1137 * all: Removed lots of unnecessary .h dependencies.
1139 * html.c (global_state): Made static.
1141 * utils.h (ALLOCA_ARRAY): New macro.
1143 * main.c (main): New option `--cut-dirs'.
1145 * url.c (construct): Use alloca() for T.
1147 * utils.c (mkdirhier): Use STRDUP_ALLOCA.
1149 * host.c (_host_t): Moved from host.h.
1150 (struct host): Renamed from _host_t.
1151 (store_hostaddress): Use STRDUP_ALLOCA for INET_S.
1154 * host.h: Don't include url.h.
1156 * ftp.c (LIST_FILENAME): Moved from ftp.h.
1158 * init.c (DEFAULT_FTP_ACCT): Moved from ftp.h.
1160 * main.c (main): Enable log if the output goes to a TTY.
1162 * connect.h: Removed unused constant `BACKLOG'.
1164 * config.h.in: Check for isatty().
1166 * Makefile.in (LINK): Use CFLAGS when linking.
1168 1998-01-27 Hrvoje Niksic <hniksic@srce.hr>
1170 * mswindows.c (ws_hangup): Use redirect_output().
1172 * main.c (redirect_output_signal): New function; use
1175 * log.c (redirect_output): New function, based on hangup(), which
1178 * log.c (vlogmsg): New function.
1180 * wget.h (DEBUGP): Use debug_logmsg().
1182 * main.c (hangup): Use it.
1184 * log.c (log_dump): New function.
1186 * utils.h (DO_REALLOC): Use `long' for various sizes.
1188 * http.c (hskip_lws): Use `while', for clarity.
1189 (HTTP_DYNAMIC_LINE_BUFFER): New constant.
1190 (fetch_next_header): Use it instead of DYNAMIC_LINE_BUFFER.
1192 * ftp-basic.c (FTP_DYNAMIC_LINE_BUFFER): New constant.
1193 (ftp_response): Use it instead of DYNAMIC_LINE_BUFFER.
1195 * utils.c (DYNAMIC_LINE_BUFFER): Moved from utils.c.
1196 (LEGIBLE_SEPARATOR): Ditto.
1197 (FILE_BUFFER_SIZE): Ditto.
1199 * retr.c (BUFFER_SIZE): Moved from retr.h.
1202 (logmsg): Moved from utils.c.
1203 (debug_logmsg): New function.
1205 * mswindows.h: Include it here.
1209 * utils.c: Don't include <windows.h>.
1211 1998-01-25 Hrvoje Niksic <hniksic@srce.hr>
1213 * host.c (ftp_getaddress): Ditto.
1215 * main.c (main): Use it.
1217 * utils.h (STRDUP_ALLOCA): New macro.
1219 * init.c: Prepend `wget: ' to error messages printed on stderr.
1221 * utils.c (mkdirhier): Renamed from mymkdir.
1222 (touch): Renamed from my_touch.
1223 (pwd_cuserid): Renamed from my_cuserid().
1225 1998-01-24 Andy Eskilsson <andy.eskilsson@telelogic.se>
1227 * utils.c (accdir): Process wildcards.
1228 (proclist): New function.
1229 (accdir): Use it to avoid code repetition.
1231 1998-01-24 Hrvoje Niksic <hniksic@srce.hr>
1233 * recur.c (parse_robots): Respect opt.useragent; use alloca().
1235 * http.c (gethttp): Construct useragent accordingly.
1237 * version.c: Changed version string to numbers-only.
1239 * main.c (print_help): List all the options.
1241 * mswindows.c (windows_main_junk): Initialize argv0 here.
1243 1998-01-24 Karl Heuer <kwzh@gnu.org>
1245 * netrc.c (search_netrc): Initialize `l' only after processing
1248 * main.c (main): Don't trap SIGHUP if it's being ignored.
1250 1998-01-24 Hrvoje Niksic <hniksic@srce.hr>
1252 * all: Use logmsg().
1254 * utils.c (time_str): Moved from retr.c.
1255 (logmsg): New function.
1256 (logmsg_noflush): Ditto.
1258 * rbuf.c: New file, moved buf_* functions here.
1260 * ftp.c (ftp_expected_bytes): Moved from ftp-basic.c.
1262 * ftp-basic.c (ftp_rest): Use prnum().
1264 * ftp-basic.c: Ditto.
1266 * ftp.c: Use the new reading functions and macros.
1268 * retr.c (buf_initialize): New function.
1269 (buf_initialized_p): Ditto.
1270 (buf_uninitialize): Ditto.
1273 * http.c (fetch_next_header): Use the BUF_READCHAR macro for
1275 (gethttp): Use alloca() where appropriate.
1277 * retr.c (buf_readchar): Use it.
1278 (buf_peek): Use rstreams.
1280 * retr.h (BUF_READCHAR): New macro.
1282 * init.c (home_dir): Rewritten for clarity.
1285 * mswindows.c (ws_backgnd): Made static.
1286 (read_registry): Ditto.
1287 (ws_cleanup): Ditto.
1288 (ws_handler): Ditto.
1290 1998-01-23 Hrvoje Niksic <hniksic@srce.hr>
1292 * alloca.c: New file.
1294 * Makefile.in (ALLOCA): Define.
1296 * mswindows.c (ws_help): Constify.
1297 (ws_help): Use alloca.
1299 * mswindows.c: Reformat.
1301 * all: Added _(...) annotations for I18N snarfing and translation.
1303 * host.c (ftp_getaddress): Nuke SYSINFO.
1304 (ftp_getaddress): Don't use getdomainname().
1305 (ftp_getaddress): Use uname(), where available.
1307 * http.c (gethttp): Protect a stray fprintf().
1309 * init.c (settime): New function.
1310 (setval): Treat WAIT specially, allowing suffixes like `m' for
1313 1998-01-21 Hrvoje Niksic <hniksic@srce.hr>
1315 * url.c (get_urls_html): Use alloca() for TEMP.
1317 1998-01-21 Jordan Mendelson <jordy@wserv.com>
1319 * url.c (rotate_backups): New function.
1321 * http.c (gethttp): Ditto.
1323 * ftp.c (getftp): Rotate backups.
1325 1997-12-18 Hrvoje Niksic <hniksic@srce.hr>
1327 * all: Renamed nmalloc(), nrealloc() and nstrdup() to xmalloc(),
1328 xrealloc() and xstrdup(). Use the new functions.
1330 * url.c (decode_string): Made static.
1333 (parse_uname): Ditto.
1336 (construct_relative): Ditto.
1338 * retr.c (show_progress): Made static.
1340 * recur.c (robots_url): Made static.
1341 (retrieve_robots): Ditto.
1342 (parse_robots): Ditto.
1343 (robots_match): Ditto.
1347 * main.c (printhelp): Made static.
1350 * init.c (comind): Made static.
1353 (run_wgetrc): Ditto.
1361 * http.c (fetch_next_header): Made static.
1362 (hparsestatline): Ditto.
1367 (hgetlocation): Ditto.
1368 (hgetmodified): Ditto.
1369 (haccepts_none): Ditto.
1371 (base64_encode_line): Ditto.
1372 (mktime_from_utc): Ditto.
1373 (http_atotm): Ditto.
1375 * html.c (idmatch): Made static.
1377 * host.c (search_host): Made static.
1378 (search_address): Ditto.
1379 (free_hlist): Ditto.
1381 * ftp.c (getftp): Made static.
1382 (ftp_loop_internal): Ditto.
1383 (ftp_get_listing): Ditto.
1384 (ftp_retrieve_list): Ditto.
1385 (ftp_retrieve_dirs): Ditto.
1386 (ftp_retrieve_glob): Ditto.
1387 (freefileinfo): Ditto.
1388 (delelement): Ditto.
1390 * ftp-ls.c (symperms): Made static.
1391 (ftp_parse_unix_ls): Ditto.
1393 * connect.c (select_fd): Made static.
1395 * utils.c (xmalloc): Renamed from nmalloc.
1396 (xrealloc): Renamed from nrealloc.
1397 (xstrdup): Renamed from nstrdup.
1399 * getopt.c (exchange): Use alloca.
1401 * mswindows.c (mycuserid): Use strncpy.
1403 * New files mswindows.c, mswindows.h, sysdep.h. winjunk.c,
1404 systhings.h, windecl.h and winjunk.h removed.
1406 * mswindows.c (sleep): New function.
1408 * utils.c: Include <windows.h> under Windows.
1410 1997-06-12 Darko Budor <dbudor@zesoi.fer.hr>
1412 * url.h (URL_UNSAFE): Change default under Windows.
1414 * retr.c (retrieve_from_file): Respect opt.delete_after.
1416 * main.c (main): Call ws_help on Windows.
1418 * winjunk.c (windows_main_junk): New function.
1420 * main.c (main): Junk-process argv[0].
1422 * http.c (mktime_from_utc): Return -1 if mktime failed.
1424 * http.c (http_loop): Ditto.
1426 * ftp.c (ftp_loop_internal): Change title on Windows when using a
1429 * winjunk.c (getdomainname): Lots of functions.
1431 1997-06-12 Hrvoje Niksic <hniksic@srce.hr>
1433 * cmpt.c (strptime_internal): Handle years more correctly for
1436 1997-06-09 Mike Thomas <mthomas@reality.ctron.com>
1438 * http.c (gethttp): Allocate enough space for
1439 `Proxy-Authorization' header.
1441 1997-05-10 Hrvoje Niksic <hniksic@srce.hr>
1443 * version.c: Wget/1.4.5 is released.
1445 1997-05-10 Hrvoje Niksic <hniksic@srce.hr>
1447 * retr.c (get_contents): Check return value of fwrite more
1450 1997-03-30 Andreas Schwab <schwab@issan.informatik.uni-dortmund.de>
1452 * cmpt.c (strptime_internal) [case 'Y']: Always subtract 1900 from
1453 year, regardless of century.
1455 1997-03-30 Hrvoje Niksic <hniksic@srce.hr>
1457 * utils.c (isfile): Use `lstat' instead of `stat'.
1459 1997-03-29 Hrvoje Niksic <hniksic@srce.hr>
1461 * utils.c (numdigit): Use explicit test.
1463 1997-03-21 Hrvoje Niksic <hniksic@srce.hr>
1465 * http.c (http_loop): Always use `url_filename' to get u->local.
1467 1997-03-20 Hrvoje Niksic <hniksic@srce.hr>
1469 * url.c: Recognize https.
1471 1997-03-13 Hrvoje Niksic <hniksic@srce.hr>
1473 * recur.c (recursive_retrieve): Lowercase just the host name.
1475 1997-03-09 Hrvoje Niksic <hniksic@srce.hr>
1477 * url.c (get_urls_file): Use the correct test.
1478 (get_urls_html): Ditto.
1480 1997-03-07 Hrvoje Niksic <hniksic@srce.hr>
1482 * connect.c: Reverted addrlen to int.
1484 * init.c (parse_line): Check for -1 instead of NONE.
1486 * version.c: Changed version to 1.4.5.
1488 1997-02-17 Hrvoje Niksic <hniksic@srce.hr>
1490 * init.c: New option netrc.
1491 (initialize): Don't parse .netrc.
1493 * cmpt.c (recursive): Return rp.
1494 (strptime_internal): Match the long strings first, the abbreviated
1497 1997-02-16 Hrvoje Niksic <hniksic@srce.hr>
1499 * http.c (check_end): New function.
1500 (http_atotm): Use it.
1502 1997-02-13 gilles Cedoc <gilles@cedocar.fr>
1504 * http.c (gethttp): Use them.
1506 * init.c: New options proxy_user and proxy_passwd.
1508 1997-02-14 Hrvoje Niksic <hniksic@srce.hr>
1510 * ftp.c (ftp_retrieve_list): Create links even if not relative.
1512 1997-02-10 Hrvoje Niksic <hniksic@srce.hr>
1514 * recur.c (recursive_retrieve): Lowercase the host name, if the
1515 URL is not "optimized".
1517 * host.c (realhost): Return l->hostname, even if it matches with
1520 1997-02-10 Marin Purgar <pmc@asgard.hr>
1522 * connect.c: Make addrlen size_t instead of int.
1525 1997-02-09 Gregor Hoffleit <flight@mathi.uni-heidelberg.DE>
1527 * systhings.h: Define S_ISLNK on NeXT too.
1529 1997-02-09 Hrvoje Niksic <hniksic@srce.hr>
1531 * version.c: Released 1.4.3.
1533 * url.c: Futher update to list of protostrings.
1534 (skip_proto): Skip `//' correctly for FTP and HTTP.
1536 * url.c (get_urls_html): Handle bogus `http:' things a little
1539 * main.c (main): Removed `follow-ftp' from `f'.
1540 (main): Dumped the `prefix-files' and `file-prefix' options and
1541 features; old and bogus.
1542 (main): Exit on failed setval() in `-e'.
1544 * http.c (fetch_next_header): Use it to detect header continuation
1547 * retr.c (buf_peek): New function.
1549 1997-02-08 Hrvoje Niksic <hniksic@srce.hr>
1551 * wget.h: Include time.h and stuff.
1553 1997-02-08 Roger Beeman <beeman@cisco.com>
1555 * ftp.c: Include <time.h>
1557 1997-02-07 Hrvoje Niksic <hniksic@srce.hr>
1559 * url.c (findurl): Would read over buffer limits.
1561 1997-02-06 Hrvoje Niksic <hniksic@srce.hr>
1563 * ftp-ls.c (ftp_parse_unix_ls): Allow spaces in file names.
1565 1997-02-05 Hrvoje Niksic <hniksic@srce.hr>
1567 * http.c (http_atotm): Initialize tm.is_dst.
1569 1997-02-02 Hrvoje Niksic <hniksic@srce.hr>
1571 * http.c (gethttp): Don't print the number of retrieved headers.
1573 * main.c (main): New option `--no-clobber', alias for `-nc'.
1575 * url.c: Recognize `https://'.
1577 1997-02-01 Hrvoje Niksic <hniksic@srce.hr>
1579 * host.c (herrmsg): Don't use h_errno.
1581 1997-01-30 Hrvoje Niksic <hniksic@srce.hr>
1583 * host.c (accept_domain): Use it.
1585 * main.c (main): New option `--exclude-domains'.
1587 * retr.c (retrieve_url): Use it.
1588 (retrieve_url): Bail out when an URL is redirecting to itself.
1590 * url.c (url_equal): New function.
1592 1997-01-29 Hrvoje Niksic <hniksic@srce.hr>
1594 * connect.c: Include arpa/inet.h instead of arpa/nameser.h.
1596 * http.c (mk_utc_time): New function.
1597 (http_atotm): Use it; handle time zones correctly.
1599 1997-01-28 Hrvoje Niksic <hniksic@srce.hr>
1603 * ftp-basic.c: Use it instead of WRITE.
1605 * connect.c (iwrite): New function.
1607 1997-01-27 Hrvoje Niksic <hniksic@srce.hr>
1609 * cmpt.c (mktime): New function.
1611 * netrc.c: Include <sys/types.h>.
1613 * main.c (main): Wouldn't recognize --spider.
1615 * retr.c (rate): Use `B', `KB' and `MB'.
1616 (reset_timer,elapsed_time): Moved from utils.c.
1618 * ftp.c (ftp_retrieve_list): Ditto.
1620 * http.c (http_loop): Don't touch the file if opt.dfp.
1622 1997-01-24 Hrvoje Niksic <hniksic@srce.hr>
1626 * ftp.c (ftp_retrieve_glob): New argument semantics.
1627 (ftp_retrieve_dirs): Use it.
1630 * html.c (htmlfindurl): Recognize `'' as the quote char.
1632 1997-01-23 Hrvoje Niksic <hniksic@srce.hr>
1634 * ftp.c (ftp_loop_internal): Use it.
1636 * utils.c (remove_link): New function.
1638 1997-01-22 Hrvoje Niksic <hniksic@srce.hr>
1640 * retr.c (retrieve_url): Require STRICT redirection URL.
1642 * url.c (parseurl): New argument STRICT.
1644 * http.c (hparsestatline): Be a little-bit less strict about
1647 1997-01-21 Hrvoje Niksic <hniksic@srce.hr>
1649 * http.c (gethttp): Use it.
1651 * main.c (main): Don't use '<digit>' as options.
1653 * init.c: New option ignore_length.
1655 * http.c (gethttp): Ditto.
1656 (http_loop): Check for redirection without Location:.
1657 (gethttp): Don't print Length unless RETROKF.
1659 * ftp.c (getftp): Use it.
1661 * url.c (mkalldirs): New function.
1663 * utils.c (mymkdir): Don't check for existing non-directory.
1665 * url.c (mkstruct): Don't create the directory.
1667 1997-01-20 Hrvoje Niksic <hniksic@srce.hr>
1669 * init.c (setval): Removed NO_RECURSION checks.
1671 1997-01-19 Hrvoje Niksic <hniksic@srce.hr>
1673 * version.c: "Released" 1.4.3-pre2.
1675 * recur.c (recursive_retrieve): Bypass host checking only if URL
1676 is ftp AND parent URL is not ftp.
1678 * ftp-basic.c (ftp_request): Print out Turtle Power.
1680 * ftp.c (ftp_loop): Call ftp_retrieve_glob with 0 if there's no
1682 (ftp_retrieve_glob): Call ftp_loop_internal even on empty list, if
1685 * http.c (gethttp): Be a little bit smarter about status codes.
1687 * recur.c (recursive_retrieve): Always reset opt.recursive when
1690 1997-01-18 Hrvoje Niksic <hniksic@srce.hr>
1692 * retr.c (retrieve_url): New variable location_changed; use it for
1693 tests instead of mynewloc.
1694 (retrieve_url): Allow heuristic adding of html.
1696 * url.c (url_filename): Don't use the `%' in Windows file names.
1698 * http.c (http_loop): Always time-stamp the local file.
1700 * http.c (http_loop): Ditto.
1702 * ftp.c (ftp_retrieve_list): Use it.
1704 * utils.c (my_touch): New function.
1706 * ftp.c (ftp_retrieve_list): Use #ifdef HAVE_STRUCT_UTIMBUF
1707 instead of #ifndef NeXT.
1709 * utils.c (strptime): New version, by Ulrich Drepper.
1711 1997-01-17 Hrvoje Niksic <hniksic@srce.hr>
1713 * http.c (haccepts_none): Renamed from `haccepts_bytes'.
1714 (gethttp): If haccepts_none(), disable ACCEPTRANGES.
1715 (http_loop): Would remove ACCEPTRANGES.
1717 * ftp.c (getftp): Call ftp_list with NULL.
1719 1997-01-15 Hrvoje Niksic <hniksic@srce.hr>
1721 * html.c (ftp_index): Don't print minutes and seconds if we don't
1722 know them; beautify the output.
1724 * ftp.c (getftp): Don't close the socket on FTPNSFOD.
1726 1997-01-14 Hrvoje Niksic <hniksic@srce.hr>
1728 * utils.c (strptime): New function.
1729 (strptime): Don't use get_alt_number.
1730 (strptime): Don't use locale.
1731 (match_string): Made it a function.
1733 1997-01-12 Hrvoje Niksic <hniksic@srce.hr>
1735 * http.c (http_atotm): New function.
1736 (http_loop): Use it.
1738 * atotm.c: Removed from the distribution.
1740 * http.c (base64_encode_line): Rewrite.
1742 1997-01-09 Hrvoje Niksic <hniksic@srce.hr>
1744 * ftp.c (getftp): Use ftp_expected_bytes; print size.
1746 * ftp-basic.c (ftp_response): Use ftp_last_respline.
1747 (ftp_expected_bytes): New function.
1749 * ftp.c (getftp): Print the unauthoritative file length.
1751 * ftp-ls.c: Renamed from ftp-unix.c.
1752 (ftp_parse_ls): Moved from ftp.c.
1753 (ftp_parse_unix_ls): Recognize seconds in time spec.
1754 (ftp_parse_unix_ls): Recognize year-less dates of the previous
1757 1997-01-08 Hrvoje Niksic <hniksic@srce.hr>
1759 * ftp-basic.c: Don't declare errno if #defined.
1761 * host.c (ftp_getaddress): Check for sysinfo legally.
1763 1997-01-08 Darko Budor <dbudor@diana.zems.fer.hr>
1765 * connect.c (iread): Use READ.
1767 1996-12-23 Hrvoje Niksic <hniksic@srce.hr>
1769 * url.c: Recognize finger, rlogin, tn3270, mid and cid as valid
1772 1996-12-22 Hrvoje Niksic <hniksic@srce.hr>
1774 * host.c (ftp_getaddress): Allow `.' in hostname.
1776 1996-12-26 Darko Budor <dbudor@zems.fer.hr>
1778 * wget.h: READ and WRITE macros for use instead of read and write
1779 on sockets, grep READ *.c, grep WRITE *.c
1781 * wsstartup.c: new file - startup for winsock
1783 * wsstartup.h: new file
1785 * win32decl.h: new file - fixup for <errno.h> and winsock trouble
1787 * configure.bat: Configure utility for MSVC
1789 * src/Makefile.ms,config.h.ms: new files for use with MSVC 4.x
1791 1996-12-22 Hrvoje Niksic <hniksic@srce.hr>
1793 * version.c: Released 1.4.3-pre.
1795 * utils.c (prnum): Accept long.
1796 (legible): Use prnum().
1798 * connect.c (make_connection): Accept port as short.
1801 * http.c (gethttp): Use search_netrc.
1803 1996-12-21 Hrvoje Niksic <hniksic@srce.hr>
1805 * ftp.c (getftp): Use search_netrc.
1807 * netrc.c (free_netrc): New function.
1809 * init.c (home_dir): New function.
1811 * url.c (convert_links): Allow REL2ABS changes.
1813 1996-12-21 Gordon Matzigkeit <gord@gnu.ai.mit.edu>
1815 * netrc.c: New file.
1816 (parse_netrc, maybe_add_to_list): New functions.
1818 1996-12-17 Hrvoje Niksic <hniksic@srce.hr>
1820 * retr.c (retrieve_url): Reset opt.recursion before calling
1821 ftp_loop if it is reached through newloc.
1823 * init.c (run_wgetrc): Print the wgetrc path too, when reporting
1824 error; don't use "Syntax error", since we don't know if it is
1825 really a syntax error.
1827 1996-12-16 Hrvoje Niksic <hniksic@srce.hr>
1829 * utils.c (acceptable): Extract the filename part of the path.
1831 * recur.c (recursive_retrieve): Call acceptable() with the right
1832 argument; would bug out on wildcards.
1834 * init.c (parse_line): Likewise.
1836 * html.c (htmlfindurl): Cast to char * when calling stuff.
1838 1996-12-15 Hrvoje Niksic <hniksic@srce.hr>
1840 * ftp.c (getftp): Use ftp_pasv.
1842 * ftp-basic.c (ftp_request): Accept NULL value.
1843 (ftp_pasv): New function.
1845 * options.h (struct options): Add passive FTP option.
1847 1996-12-15 Hrvoje Niksic <hniksic@srce.hr>
1849 * url.c (parseurl): Debug output.
1851 * utils.c (path_simplify): New one, adapted from bash's
1852 canonicalize_pathname().
1854 1996-12-14 Hrvoje Niksic <hniksic@srce.hr>
1856 * ftp.c (getftp): Don't discard the buffer.
1858 * retr.c (get_contents): New parameter nobuf.
1860 1996-12-13 Shawn McHorse <riffraff@txdirect.net>
1862 * html.c (htmlfindurl): Recognize <meta contents="d; URL=...".
1864 * init.c (setval): Strip the trailing slashes on CVECDIR.
1866 1996-12-13 Hrvoje Niksic <hniksic@srce.hr>
1868 * init.c: Make excludes and includes under CVECDIR instead of
1871 1996-12-13 Shawn McHorse <riffraff@txdirect.net>
1873 * url.c (get_urls_html): Skip "http:".
1875 1996-12-13 Hrvoje Niksic <hniksic@srce.hr>
1877 * utils.c (strcasecmp): From glibc.
1878 (strncasecmp): Also.
1881 * url.c: Added javascript: to the list of URLs prefixes.
1883 1996-12-12 Shawn McHorse <riffraff@txdirect.net>
1885 * recur.c (retrieve_robots): Print the warning message only if
1888 1996-12-12 Gregor Hoffleit <flight@mathi.uni-heidelberg.DE>
1890 * ftp.c (ftp_retrieve_list): Use NeXT old utime interface.
1892 1996-12-12 Hrvoje Niksic <hniksic@srce.hr>
1894 * systhings.h: New file.
1896 * ../configure.in: Check for utime.h
1898 * ftp.c: Check whether we have unistd.h.
1900 1996-11-27 Hrvoje Niksic <hniksic@srce.hr>
1902 * recur.c (recursive_retrieve): Send the canonical URL as referer.
1903 (recursive_retrieve): Call get_urls_html with the canonical URL.
1905 1996-12-13 Kaveh R. Ghazi <ghazi@caip.rutgers.edu>
1907 * (configure.in, config.h.in, src/Makefile.in, src/*.[ch]): Add
1908 ansi2knr support for compilers which don't support ANSI style
1909 function prototypes and signatures.
1911 * (aclocal.m4, src/ansi2knr.c, src/ansi2knr.1): New files.
1913 1996-11-26 Hrvoje Niksic <hniksic@srce.hr>
1915 * url.c: Use it; Recognize paths ending with "." and ".." as
1917 (url_filename): Append .n whenever file exists and could be a
1920 * url.h (ISDDOT): New macro.
1922 * init.c (parse_line): Use unsigned char.
1924 * url.c (get_urls_html): Cast to unsigned char * when calling
1927 * html.c (htmlfindurl): Use unsigned char.
1929 * version.c: Changed version to 1.4.3.
1931 1996-11-25 Hrvoje Niksic <hniksic@srce.hr>
1933 * version.c: Released 1.4.2.
1935 * ftp.c (getftp): Simplified assertion.
1936 (ftp_loop_internal): Remove symlink before downloading.
1937 (ftp_retrieve_list): Unlink the symlink name before attempting to
1940 * options.h (struct options): Renamed print_server_response to
1943 * ftp.c (rel_constr): Removed.
1944 (ftp_retrieve_list): Don't use it.
1945 (ftp_retrieve_list): Use opt.retr_symlinks.
1947 1996-11-24 Hrvoje Niksic <hniksic@srce.hr>
1949 * main.c (main): New option retr_symlinks.
1951 * url.c (convert_links): Print verbose message.
1953 1996-11-24 Hrvoje Niksic <hniksic@srce.hr>
1955 * http.c (http_loop): Reset newloc in the beginning of function;
1956 would cause FMR in retrieve_url.
1958 1996-11-23 Hrvoje Niksic <hniksic@srce.hr>
1960 * recur.c (convert_all_links): Find the URL of each HTML document,
1961 and feed it to get_urls_html; would bug out.
1962 (convert_all_links): Check for l2 instead of dl; removed dl.
1964 * url.c (convert_links): Don't refer to freed newname.
1966 * recur.c (recursive_retrieve): Add this_url to urls_downloaded.
1968 * main.c (main): Print the OS_TYPE in the debug output, too.
1970 * recur.c (recursive_retrieve): Check for opt.delete_after.
1972 * main.c (main): New option delete-after.
1974 * init.c (setval): Cleaned up.
1976 1996-11-21 Hrvoje Niksic <hniksic@srce.hr>
1978 * Makefile.in (wget): Make `wget' the default target.
1980 * ftp.c (ftp_loop_internal): Move noclobber checking out of the
1982 (ftp_retrieve_list): Warn about non-matching sizes.
1984 * http.c (http_loop): Made -nc non-dependent on opt.recursive.
1986 * init.c (setnum): Renamed from setnuminf; New argument flags.
1989 * main.c (main): Sorted the options.
1990 (main): New option --wait.
1992 1996-11-21 Shawn McHorse <riffraff@txdirect.net>
1994 * html.c (htmlfindurl): Reset s->in_quote after getting out of
1997 1996-11-20 Hrvoje Niksic <hniksic@srce.hr>
1999 * version.c: Changed version to 1.4.2.
2001 1996-11-20 Hrvoje Niksic <hniksic@srce.hr>
2003 * version.c: Released 1.4.1.
2005 * html.c (html_quote_string): New function.
2006 (ftp_index): Use it.
2007 (htmlfindurl): A more gentle ending debug message.
2009 * ftp.c (ftp_loop): Check for opt.htmlify.
2011 * init.c: New command htmlify.
2013 * ftp.c (getftp): Nicer error messages, with `'-encapsulated
2015 (ftp_loop): Print size of index.html.
2017 * init.c (setval): Implement "styles".
2019 * main.c (main): New option dotstyle.
2021 1996-11-19 Hrvoje Niksic <hniksic@srce.hr>
2023 * ftp.c (getftp): Close the master socket in case of errors, after
2026 * connect.c (bindport): Initialize msock to -1.
2028 * ftp.c (getftp): Initialize dtsock to -1.
2030 * connect.c (closeport): Don't close sock if sock == -1.
2032 1996-11-18 Hrvoje Niksic <hniksic@srce.hr>
2034 * init.c (setnuminf): Nuked default value -- just leave unchanged.
2035 (setval): Don't send default values.
2036 (defaults): Use DEFAULT_TIMEOUT -- aaargh.
2038 * options.h (struct options): Use long for dot_bytes.
2040 * init.c (setquota): Renamed to setbytes.
2041 (setval): Use setbytes on DOTBYTES.
2043 1996-11-17 Hrvoje Niksic <hniksic@srce.hr>
2045 * ftp.c (getftp): Initialize con->dltime.
2047 * recur.c (recursive_retrieve): Use same_host instead of
2048 try_robots; simply load robots_txt whenever the host is changed.
2049 (recursive_retrieve): Free forbidden before calling parse_robots.
2051 1996-11-16 Hrvoje Niksic <hniksic@srce.hr>
2053 * retr.c (show_progress): Use them.
2055 * options.h (struct options): New options dot_bytes, dots_on_line
2058 1996-11-16 Mark Boyns <boyns@sdsu.edu>
2060 * recur.c (recursive_retrieve): Retrieve directories regardless of
2061 acc/rej rules; check for empty u->file.
2063 1996-11-14 Hrvoje Niksic <hniksic@srce.hr>
2065 * init.c (setval): Use it.
2067 * utils.c (merge_vecs): New function.
2069 * init.c (setval): Reset the list-type functions when encountering
2072 1996-11-14 Shawn McHorse <riffraff@txdirect.net>
2074 * recur.c (recursive_retrieve): Use base_url instead of this_url
2077 1996-11-14 Shawn McHorse <riffraff@txdirect.net>
2079 * html.c (htmlfindurl): Reset s->in_quote after exiting the quote.
2081 1996-11-13 Hrvoje Niksic <hniksic@srce.hr>
2083 * utils.c (sepstring): Rewrote; don't use strtok.
2085 * recur.c (recursive_retrieve): Enter assorted this_url to slist
2086 when running the first time.
2087 (retrieve_robots): Warn to ignore errors when robots are loaded.
2089 * utils.c (load_file): Moved from url.c.
2091 * http.c: Made static variables const too in h* functions.
2093 * main.c (main): Renamed --continue-ftp to --continue.
2095 * recur.c (recursive_retrieve): Use it.
2097 * utils.c (frontcmp): New function.
2099 * url.c (accdir): New function.
2101 * html.c (htmlfindurl): Recognize <area href=...>.
2103 * ftp.c (ftp_retrieve_dirs): Implemented opt.includes.
2105 * init.c (setval): Free the existing opt.excludes and
2106 opt.includes, if available.
2108 * main.c (main): New option -I.
2110 1996-11-12 Hrvoje Niksic <hniksic@srce.hr>
2112 * ftp.c (ftp_retrieve_glob): Do not weed out directories.
2114 * version.c: Changed version to 1.4.1.
2116 1996-11-11 Hrvoje Niksic <hniksic@srce.hr>
2118 * version.c: Released 1.4.0.
2120 1996-11-10 Hrvoje Niksic <hniksic@srce.hr>
2122 * main.c (main): Free com and val after parse_line.
2123 (printhelp): Reorder the listing.
2125 * http.c: More robust header parsing.
2127 * http.c: Allow any number of spaces, or no spaces, precede ':'.
2128 (hskip_lws): New function.
2129 (haccepts_bytes): New function.
2132 * init.c (setval): Check header sanity.
2133 (setval): Allow resetting of headers.
2135 1996-11-10 Hrvoje Niksic <hniksic@srce.hr>
2137 * http.c (http_loop): Don't use has_wildcards.
2139 * http.c (gethttp): Free all_headers -- would leak.
2141 * recur.c (recursive_retrieve): Initialize depth to 1 instead of
2142 0 -- this fixes a long-standing bug in -rl.
2144 1996-11-09 Hrvoje Niksic <hniksic@srce.hr>
2146 * ftp.c: Use -1 as "impossible" value for con->fd.
2148 * url.h (URL_SEPARATOR): Don't treat `*' and `+' as separators.
2150 * init.c (parse_line): Use isalpha.
2152 * ftp-unix.c: Use HAVE_UNISTD_H.
2154 * mtch.c (has_wildcards): Don't match \.
2156 * http.c (http_loop): Warn on HTTP wildcard usage.
2158 1996-11-08 Hrvoje Niksic <hniksic@srce.hr>
2160 * url.c (url_filename): Do not create numbered suffixes if
2161 opt.noclobber -- would bug out on -nc.
2163 1996-11-07 Hrvoje Niksic <hniksic@srce.hr>
2165 * recur.c (parse_robots): Don't chuck out the commands without
2166 arguments (`Disallow:<empty>' didn't work).
2167 (parse_robots): Compare versions lowercase.
2168 (parse_robots): Match on base_version, not version_string!
2169 (parse_robots): Handle comments properly.
2170 (parse_robots): Match versions in a sane way.
2172 * init.c: Print nicer error messages.
2174 * version.c: Changed version to 1.4.0.
2176 1996-11-06 Hrvoje Niksic <hniksic@srce.hr>
2178 * version.c: Released 1.4.0-test2.
2180 * init.c (run_wgetrc): Close fp.
2182 * ftp.c (ftp_retrieve_dirs): Allocate the correct length for
2185 1996-11-06 Hrvoje Niksic <hniksic@srce.hr>
2187 * init.c (setquota): Allow inf as quota specification.
2189 1996-11-05 Hrvoje Niksic <hniksic@srce.hr>
2191 * ftp.c (ftp_retrieve_dirs): Return QUOTEXC if quota exceeded.
2192 (ftp_retrieve_glob): Return QUOTEXC on quota exceeded.
2194 * main.c (main): Check for quota by comparison with downloaded
2195 stuff, not from status.
2197 * connect.c (select_fd): Should compile on HPUX without warnings now.
2199 * ftp.c (ftp_get_listing): Check whether ftp_loop_internal
2202 1996-11-04 Hrvoje Niksic <hniksic@srce.hr>
2204 * ftp.c (ftp_retrieve_glob): Print the pattern nicely.
2205 (getftp): Return FTPRETRINT on control connection error.
2207 * html.c (htmlfindurl): Recognize <embed src=...> and
2209 (ftp_index): Handle username and password correctly.
2211 * main.c (main): Made `-np' a synonim for --no-parent.
2213 1996-11-02 Hrvoje Niksic <hniksic@srce.hr>
2215 * ftp.c (ftp_loop): Check for opt.ftp_glob too before calling
2218 * version.c: Changed version to 1.4.0-test2.
2220 1996-11-02 Hrvoje Niksic <hniksic@srce.hr>
2222 * version.c: Released 1.4.0-test1.
2224 * url.c (str_url): Don't use sprintf when creating %2F-prefixed
2226 (convert_links): Removed definition of make_backup.
2228 * http.h: Removed definition of MAX_ERROR_LENGTH.
2230 * host.c (ftp_getaddress): Check for "(none)" domains.
2232 * ftp.c (ftp_retrieve_dirs): Docfix.
2234 * http.c (gethttp): Use ou->referer instead of u->referer.
2236 * retr.c (retrieve_url): Reset u to avoid freeing pointers twice;
2237 this was known to cause coredumps on Linux.
2239 * html.c (ftp_index): Cast the argument to local_time to time_t *.
2241 1996-11-01 Hrvoje Niksic <hniksic@srce.hr>
2243 * connect.c (select_fd): Use exceptfds -- once and for all.
2245 * retr.c (retrieve_from_file): Free filename after
2247 (retrieve_from_file): Send RFIRST_TIME to recursive_retrieve on
2248 first-time retrieval.
2249 (retrieve_from_file): Return uerr_t; new argument, count.
2250 (retrieve_from_file): Break on QUOTEXC.
2252 * init.c (setquota): Fixed a bug that caused rejection of
2253 non-postfixed values..
2255 1996-10-30 Hrvoje Niksic <hniksic@srce.hr>
2257 * version.c: Changed name to wget.
2259 * connect.c (iread): Smarter use of select.
2260 (select_fd): Set errno on timeout. If not timeout, return 1
2263 1996-10-29 Hrvoje Niksic <hniksic@srce.hr>
2265 * ftp.c (ftp_loop_internal): Don't use con->cmd before
2268 1996-10-26 Hrvoje Niksic <hniksic@srce.hr>
2270 * http.c (gethttp): Send correct referer when using proxy.
2271 (gethttp): Use struct urlinfo ou to access the relevant data; send
2272 correct authorization in all cases.
2274 * host.c (same_host): Use skip_uname to skip username and
2277 * url.c (skip_uname): New function.
2280 * host.c (same_host): Do not assume HTTP -- same_host should now
2281 be totally foolproof.
2283 * url.c (skip_proto): New function.
2284 (parse_uname): Use it.
2286 * http.c (gethttp): Create local user and passwd from what is
2289 * url.c (parseurl): Check for HTTP username and password too.
2291 1996-10-25 Hrvoje Niksic <hniksic@srce.hr>
2293 * config.h.in: Removed #define gethostbyname R...
2295 1996-10-22 Hrvoje Niksic <hniksic@srce.hr>
2297 * version.c: Changed version to 1.4.0-test1.
2299 1996-10-21 Hrvoje Niksic <hniksic@srce.hr>
2301 * version.c: "Released" 1.4b29.
2303 * recur.c (recursive_retrieve): Check for no_parent.
2305 * init.c (setval): Option update.
2307 * main.c (main): New option no-parent.
2309 * options.h (struct options): New variable no_parent.
2311 * recur.c (recursive_retrieve): Only files are checked for
2312 opt.accepts and opt.rejects.
2313 (recursive_retrieve): Check directories for opt.excludes.
2314 (recursive_retrieve): Make the dir absolute when checking
2317 * html.c (htmlfindurl): Recognize <applet code=...> and <script
2320 1996-10-18 Hrvoje Niksic <hniksic@srce.hr>
2322 * ftp.c (getftp): Do not line-break assert entries at all.
2323 (ftp_retrieve_dirs): docfix.
2325 * connect.c (select_fd): Use fd + 1 as nfds.
2327 * version.c: Changed version to 1.4b29.
2329 1996-10-18 Hrvoje Niksic <hniksic@srce.hr>
2331 * version.c: "Released" 1.4b28.
2333 * ftp.c (ftp_loop_internal): Check whether f->size == len and
2334 don't continue the loop if it is.
2335 (ftp_get_listing): Remove list_filename on unsuccesful loop.
2337 1996-10-17 Hrvoje Niksic <hniksic@srce.hr>
2339 * ftp.c (ftp_loop_internal): Use strcpy to initialize tmp.
2340 (getftp): Do not use multiline assert.
2342 * http.c (hparsestatline): Use mjr and mnr instead of major and
2343 minor, which don't compile on Ultrix.
2344 (http_loop): Use strcpy() to initialize tmp.
2346 * all: Geturl -> Fetch
2348 1996-10-17 Hrvoje Niksic <hniksic@srce.hr>
2350 * recur.c (parse_robots): Fixed an off-by-one bug when looking for
2353 * html.c (htmlfindurl): Fixed several possible off-by-one bugs by
2354 moving `bufsize &&' to the beginning of each check in for-loops.
2356 * recur.c (parse_robots): Close fp on exit.
2358 * url.c (mymkdir): Check for each directory before creating.
2360 1996-10-16 Hrvoje Niksic <hniksic@srce.hr>
2362 * version.c: Changed version to 1.4b28.
2364 1996-10-16 Hrvoje Niksic <hniksic@srce.hr>
2366 * version.c: "Released" 1.4b27.
2368 * init.c (parse_line): Use isspace.
2369 (parse_line): Free *com on all errors.
2371 * ftp.c (ftp_loop): Change FTPOK to RETROK before exiting.
2372 (delelement): Use next instead of f->next and prev instead of
2374 (delelement): Free the members of the deleted element.
2376 * http.c (http_loop): Do not return RETROK on code != 20x.
2378 * init.c (cleanup): Free opt.user_header.
2379 (cleanup): Free opt.domains.
2381 * url.c (freelists): Moved to cleanup().
2383 * http.c (hparsestatline): Docfix.
2385 * main.c (main): Return with error status on unsuccesful
2388 * init.c (setval): Do not remove listing when mirroring.
2390 * url.c (url_filename): Use opt.fileprefix.
2392 * ftp.c (ftp_get_listing): Use url_filename to get filename for
2395 * main.c (main): New option: -rn.
2397 1996-10-15 Hrvoje Niksic <hniksic@srce.hr>
2399 * Makefile.in (RM): Added RM = rm -f.
2401 * host.c (clean_hosts): New function.
2402 (free_hlist): Just free the list, no reset.
2404 * version.c: Changed version to 1.4b27.
2406 1996-10-13 Hrvoje Niksic <hniksic@srce.hr>
2408 * version.c: "Released" 1.4b26.
2410 * retr.c (retrieve_from_file): If call get_urls_html with
2411 opt.spider to make it silent in spider mode.
2413 * url.c (str_url): Use CLEANDUP instead of URL_CLEANSE.
2415 * url.h (CLEANDUP): New macro.
2417 * http.c (gethttp): Fixed a bug that freed location only when it
2420 * retr.c (retrieve_url): Free url if it will not be stored,
2421 i.e. newloc is NULL.
2423 * html.c (htmlfindurl): Handle exiting from quotes correctly; the
2424 old version would bug out on <a href="x#a"href="y">.
2426 * html.h (state_t): New member in_quote.
2428 * html.c (htmlfindurl): Free s->attr at the beginning of
2431 * recur.c (recursive_retrieve): Recognize RCLEANUP.
2432 (tried_robots): Make hosts a global variable.
2433 (recursive_retrieve): Free constr after URL host optimization.
2434 (tried_robots): Free urlinfo before exiting.
2436 * utils.c (free_slist): New function.
2438 * recur.c (recursive_retrieve): Use flags to add cleanup
2441 * main.c (main): Free filename after recursive_retrieve.
2443 * http.c (gethttp): Store successful responses too.
2445 1996-10-12 Hrvoje Niksic <hniksic@srce.hr>
2447 * all: Constified the whole source. This required some minor
2448 changes in many functions in url.c, possibly introducing bugs -- I
2451 * ftp-basic.c: Removed last_respline.
2453 * http.c (gethttp): Free type.
2455 * host.c (same_host): Free real1 and real2.
2457 * main.c (main): New option --spider.
2459 * retr.c (get_contents): Don't reset errno.
2461 * main.c (main): Sorted the options.
2463 * connect.c (iread): Set errno to ETIMEDOUT only if it was left
2464 uninitialized by select().
2466 * http.c (http_loop): Print the time when the connection is
2468 (gethttp): Debug-print the HTTP request.
2470 1996-10-11 Hrvoje Niksic <hniksic@srce.hr>
2472 * connect.c (iread): Do not try reading after timeout.
2474 * main.c (main): Would bug out on -T.
2476 * connect.c (select_fd): Do not use exceptfds.
2477 (iread): Set ETIMEDOUT on select_fd <= 0.
2479 * version.c: Changed version to 1.4b26.
2481 1996-10-10 Hrvoje Niksic <hniksic@srce.hr>
2483 * version.c: "Released" 1.4b25.
2485 * ftp-unix.c (ftp_parse_unix_ls): Ignore lines without file name
2488 * http.c (gethttp): Add errcode to struct hstat.
2489 (http_loop): Use it.
2491 * url.c (no_proxy_match): Simplify using char** for no_proxy.
2493 * options.h (struct options): Make opt.no_proxy a vector.
2495 * utils.c (sepstring): Use !*s instead of !strlen(s).
2497 * init.c (setval): Set opt.maxreclevel to 0 on --mirror.
2498 (getperms): Use ISODIGIT instead of isdigit.
2500 * ftp.c (getftp): Print time.
2502 * main.c (main): Use legible output of downloaded quantity.
2504 * ftp.c (getftp): Use elapsed_time().
2505 (ftp_loop_internal): Use rate().
2507 * http.c (http_loop): Add download ratio output; Use rate().
2509 * utils.c (rate): New function.
2511 1996-10-09 Hrvoje Niksic <hniksic@srce.hr>
2513 * http.c (http_loop): Use timer.
2515 * ftp.c: Split to ftp-basic.c and ftp.c
2517 * utils.c (reset_timer): New function.
2518 (elapsed_time): New function.
2520 * retr.c (show_progress): Make bytes_in_line and offs long; should
2521 work on 16-bit machines.
2523 1996-10-08 Hrvoje Niksic <hniksic@srce.hr>
2525 * url.c (in_acclist): New argument backward.
2527 * ftp.c (ftp_retrieve_glob): Use acceptable() to determine whether
2528 a file should be retrieved according to suffix.
2529 (ftp_get_listing): Check the return value of unlink; Do not call
2530 ftp_retrieve_dirs if depth reached maxreclevel.
2531 (ftp_retrieve_dirs): Check whether the directory is in
2534 * main.c (main): Print the version number at the beginning of
2536 (main): Use strrchr when creating exec_name.
2538 * ftp.c (ftp_retrieve_glob): Do not close control connection.
2540 * version.c: Changed version to 1.4b25.
2542 1996-10-07 Hrvoje Niksic <hniksic@srce.hr>
2544 * version.c: "Released" 1.4b24.
2546 * Makefile.in: Rewrite.
2548 * ftp.c (ftp_loop_internal): Likewise.
2550 * retr.c (time_str): Check for failed time().
2552 * html.c (htmlfindurl): Recognize <fig src> and <overlay src> from
2555 * retr.c (time_str): Return time_t *.
2557 * connect.c (bindport): Close msock on unsuccesful bind.
2558 (bindport): The same for getsockname and listen.
2560 * retr.c (retrieve_url): Allow any number of retries on
2563 * http.c (gethttp): Do not treat errno == 0 as timeout.
2564 (http_loop): Likewise.
2565 (http_loop): Cosmetic changes.
2567 * connect.c (iread): Set errno to ETIMEDOUT in case of timeout.
2569 * retr.c (get_contents): Reset errno.
2571 * ftp.c (getftp): Minor fixes.
2573 1996-10-06 Hrvoje Niksic <hniksic@srce.hr>
2575 * http.c: Do not use backups.
2577 * geturl.1 (WARNING): Warn that man-page could be obsolete.
2579 * getopt.c (getopt_long): Moved to getopt.c
2581 * geturl.texi: Enhanced.
2583 * main.c (main): Use it.
2585 * recur.c (convert_all_links): New function.
2587 * utils.c (add_slist): New argument flags.
2589 * recur.c (recursive_retrieve): Update a list of downloaded URLs.
2590 (parse_robots): Do not chuck out empty value fields.
2591 (parse_robots): Make yourself welcome on empty Disallow.
2593 * version.c: Changed version to 1.4b24.
2595 1996-10-06 Hrvoje Niksic <hniksic@srce.hr>
2597 * version.c: "Released" 1.4b23.
2599 * ftp.c (ftp_loop_internal): Get the time after getftp.
2601 * Makefile.in (install.info): New target.
2604 * http.c (http_loop): Fix output when doing -O.
2606 1996-10-05 Hrvoje Niksic <hniksic@srce.hr>
2608 * geturl.texi: New file.
2610 * main.c (main): Do not print the warnings and download summary if
2613 * version.c: Changed version to 1.4b23.
2615 1996-10-05 Hrvoje Niksic <hniksic@srce.hr>
2617 * "Released" 1.4b22.
2619 * atotm.c (atotm): Use True and False instead of TRUE and FALSE,
2620 to avoid redefinition warnings.
2622 * host.c (store_hostaddress): Use memcpy() to copy the address
2623 returned by inet_addr.
2625 * version.c: Changed version to 1.4b22.
2627 1996-10-04 Hrvoje Niksic <hniksic@srce.hr>
2629 * version.c: "Released" 1.4b21.
2631 * ftp-unix.c (ftp_parse_ls): Renamed to ftp_parse_unix_ls.
2633 * ftp.c (ftp_port): Use conaddr.
2634 (getftp): Print the file length.
2635 (ftp_retrieve_list): Check the stamps of plain files only.
2637 * connect.c (closeport): Do not call shutdown().
2638 (conaddr): New function.
2640 * html.c (ftp_index): Made it dfp-aware.
2642 * init.c (cleanup): New name of freemem. Close opt.dfp.
2644 * ftp.c (getftp): Use opt.dfp if it is set.
2646 * ftp-unix.c (ftp_parse_ls): Recognize time in h:mm format.
2648 * ftp.c (ftp_retrieve_dirs): Fixed a bug that caused incorrect
2649 CWDs to be sent with recursive FTP retrievals.
2651 1996-10-03 Hrvoje Niksic <hniksic@srce.hr>
2653 * recur.c (parse_robots): Made it more compliant with "official"
2656 * http.c: New function.
2658 * ftp-unix.c (ftp_parse_ls): Added better debug output.
2660 * ftp.c (getftp): Print out the LIST in case of
2661 opt.print_server_response.
2663 * version.c: Changed version to 1.4b21.
2665 1996-10-01 Hrvoje Niksic <hniksic@srce.hr>
2667 * version.c: "Released" 1.4b20.
2671 * http.c (gethttp): Preset lengths of various headers instead of
2672 calculating them dynamically.
2673 (gethttp): Check for 206 partial contents.
2675 1996-09-30 Hrvoje Niksic <hniksic@srce.hr>
2677 * configure.in: Set SYSTEM_GETURLRC to $libdir/geturlrc
2679 * http.c (gethttp): Send the port number in the Host: header.
2681 1996-09-29 Hrvoje Niksic <hniksic@srce.hr>
2683 * http.c (gethttp): Send host: header.
2684 (gethttp): Add the possibility of user-defined headers.
2685 (gethttp): Move decision about pragma: no-cache to http_loop,
2687 (gethttp): Pass a struct instead of enormous argument list.
2688 (http_loop): Use a new, fancier display format.
2689 (ftp_loop): Likewise.
2691 * main.c: (hangup): Turn off buffering of the new log file.
2693 * install-sh: Likewise.
2695 * config.sub: Replace with the one in autoconf-2.10
2699 * init.c: New options httpuser and httppasswd.
2701 * http.c: (base64_encode_line): New function.
2702 (gethttp): Send authentication.
2704 * connect.c (make_connection): Use store_hostaddress.
2706 1996-09-28 Hrvoje Niksic <hniksic@srce.hr>
2708 * host.c (store_hostaddress): New function.
2712 * http.c (hgetrange): New function.
2713 (gethttp): Use ranges.
2715 * utils.c (numdigit): Accept long instead of int.
2717 * http.c (http_loop): Add restart capabilities.
2719 * ftp.c (ftp_retrieve_glob): Fixed a bug that could cause matchres
2720 being used uninitialized.
2721 (ftp_retrieve_list): Similar fix.
2723 * host.c (add_hlist): Fixed a bug that could cause cmp being used
2726 * url.c (construct_relative): New function.
2728 * recur.c (recursive_retrieve): Use it.
2730 * retr.c (convert_links): New function.
2732 1996-09-27 Hrvoje Niksic <hniksic@srce.hr>
2734 * url.c (free_urlpos): New function.
2736 * recur.c (recursive_retrieve): Adapt.
2738 * url.c (get_urls_html): Return a linked list instead of a vector.
2740 * url.c (get_urls_file): Return a linked list instead of a vector.
2744 * http.c (gethttp): Implement it.
2746 * init.c (setval): New option: SAVEHEADERS
2748 * ftp.c (ftp_loop_internal): Do not set restval if listing is to
2749 be retrieved. Lack of this test caused bugs when the connection
2750 was lost during listing.
2752 * retr.c (retrieve_url): Fixed a bug that caused
2753 coredumps. *newloc is now reset by default.
2754 (retrieve_url): Lift the twenty-tries limit on proxies.
2756 * version.c: Changed version to 1.4b20.
2758 1996-09-20 Hrvoje Niksic <hniksic@srce.hr>
2760 * version.c: "Released" 1.4b19.
2762 1996-09-19 Hrvoje Niksic <hniksic@srce.hr>
2764 * ftp.c (ftp_loop_internal): Renamed from ftp_1fl_loop.
2765 (getftp): Changed prototype to accept ccon *.
2767 1996-09-17 Hrvoje Niksic <hniksic@srce.hr>
2769 * ftp.c (ftp_retrieve_list): Fixed a bug that caused setting
2770 incorrect values to files pointed to by symbolic links.
2771 (ftp_1fl_loop): Do not count listings among the downloaded URL-s.
2773 1996-09-16 Hrvoje Niksic <hniksic@srce.hr>
2775 * url.c (mkstruct): Do not prepend "./" in front of a pathname.
2777 * main.c (main): New option: --user-agent.
2783 * init.c (setval): Ditto.
2785 * main.c (main): Rename "server-headers" to "server-response".
2787 * ftp-unix.c (ftp_parse_ls): Check for asterisks at the end of
2788 executables in 'ls -F' listings.
2790 1996-09-15 Hrvoje Niksic <hniksic@srce.hr>
2792 * url.c (parseurl): Remove realloc() and sprintf().
2793 (str_url): Get rid of sprintf().
2795 * recur.c (recursive_retrieve): Enable FTP recursion through proxy
2798 * url.h (URL_CLEANSE): Made it else-resistant.
2799 (USE_PROXY): New macro.
2801 1996-09-14 Hrvoje Niksic <hniksic@srce.hr>
2805 * version.c: Changed version to 1.4b19.
2807 1996-09-14 Hrvoje Niksic <hniksic@srce.hr>
2809 * version.c: "Released" 1.4b18.
2811 * url.c: Made it reallocate space exponentially.
2813 1996-09-14 Drazen Kacar <dave@fly.cc.fer.hr>
2815 * html.c (htmlfindurl): Added <frame src> and <iframe src> among
2816 the list of stuff to fetch.
2818 1996-09-13 Hrvoje Niksic <hniksic@srce.hr>
2820 * url.c (get_urls_html): Fixed a bug that caused SIGSEGV's with
2823 * html.c (htmlfindurl): Rewrite.
2825 * http.c (gethttp): Use opt.proxy_cache.
2827 * main.c (main): Added --cache option.
2829 * ftp.c (ftp_response): Print server response if opt.print_server
2831 (getftp): Print newlines after each request if the server response
2833 (ftp_response): Copy the last response line to last_respline.
2835 * http.c (gethttp): Add Pragma: nocache for retried
2838 * ftp.c (getftp): Use it.
2840 * retr.c (buf_discard): New function.
2842 * ftp.c (ftp_response): Use buf_readchar().
2843 (getftp): Flush the control connection buffer before calling
2846 * retr.c (buf_readchar): New function.
2847 (buf_flush): New function.
2848 (get_contents): Use buf_readchar() instead of read(x, x, 1).
2849 (get_contents): Use buf_flush.
2851 1996-09-12 Hrvoje Niksic <hniksic@srce.hr>
2853 * ftp.c: Incorporate changes to ftp_response.
2855 * ftp.c (ftp_response): Allocate the server response dynamically,
2856 as in read_whole_line and fetch_next_header.
2858 * utils.c (read_whole_line): Fixed a bug that prevented reading
2859 the last line if it is not \n-terminated. Also fixed a possible
2862 * http.c (fetch_next_header): Return malloc-ed string as large as
2864 (gethttp): Use new fetch_next_header.
2866 1996-09-12 Hrvoje Niksic <hniksic@srce.hr>
2868 * http.c (hgetlen): Compute the header length the first time only.
2870 (hgetlocation): Ditto.
2871 (hgetmodified): Ditto.
2873 1996-09-11 Hrvoje Niksic <hniksic@srce.hr>
2875 * sample.geturlrc: Update.
2877 1996-09-10 Hrvoje Niksic <hniksic@srce.hr>
2879 * http.c (http_loop): Ditto.
2881 * ftp.c (getftp): Open the output file as binary.
2883 * version.c: Changed version to 1.4b18.
2885 1996-09-10 Hrvoje Niksic <hniksic@srce.hr>
2887 * version.c: "Released" 1.4b17.
2889 * ftp-unix.c (ftp_parse_ls): If unable to open file, return NULL
2890 instead of failed assertion.
2892 1996-09-09 Hrvoje Niksic <hniksic@srce.hr>
2894 * ftp.c (ftp_get_listing): Add a numbered suffix to LIST_FILENAME
2895 if a file of that name already exists.
2897 1996-09-05 Hrvoje Niksic <hniksic@srce.hr>
2899 * ftp.c (ftp_1fl_loop): Handler FTPPORTERR and FOPENERR correctly.
2901 * config.h.in: Define gethostbyname as Rgethostbyname when using
2904 * configure.in: Check for -lresolv if using Socks.
2906 * version.c: Changed version to 1.4b17.
2908 1996-07-15 Hrvoje Niksic <hniksic@srce.hr>
2910 * version.c: "Released" 1.4b16.
2912 * http.c (gethttp): More intelligent check for first line of HTTP
2914 (gethttp): Would bug out on time-stamping.
2916 * version.c: Changed version to 1.4b16.
2918 1996-07-11 Hrvoje Niksic <hniksic@srce.hr>
2920 * version.c: Released 1.4b15.
2922 * http.c (http_loop): Print \n after the loop entry, not before.
2924 * url.c (url_filename): Use ISDOT.
2926 * url.h (ISDOT): New macro.
2928 * recur.c (recursive_retrieve): Change only opt.recursive for
2931 1996-07-11 Antonio Rosella <antonio.rosella@agip.it>
2933 * socks/geturl.cgi: Fixed version No.
2935 * socks/download-netscape.html: Ditto.
2937 * socks/download.html: Changed socks.html to download.html.
2939 1996-07-11 Hrvoje Niksic <hniksic@srce.hr>
2941 * url.c (url_filename): Check for opt.dirstruct instead for
2942 opt.recursive && opt.dirstruct.
2944 * init.c (defaults): Ditto.
2945 (defaults): Reset dirstruct by default.
2946 (setval): Set opt.dirstruct whenever setting recursive.
2948 * init.h: Removed FORCEDIRHIER.
2950 * INSTALL: Added -L to socks-description.
2952 * version.c: Changed version to 1.4b15.
2954 1996-07-10 Hrvoje Niksic <hniksic@srce.hr>
2956 * version.c: "Released" 1.4b14.
2958 * geturl.1: Update AUTHOR to include Rosella as contributor.
2962 * socks/geturl.cgi: Simplified command creation, nuked <blink>.
2964 * socks/geturl.cgi: Wrap nutscape extensions within if $netscape.
2965 (cal_time): Fix == to eq.
2967 * socks/geturl.cgi: GPL-ized with permission of A. Rosella.
2969 * geturl.1 (hostname): Moved URL CONVENTIONS to the beginning.
2971 * Makefile.in: Use @VERSION@.
2973 * configure.in: Check version from version.c.
2975 * socks/geturl.cgi: Changed /pub/bin/perl to /usr/bin/perl.
2977 * socks/download.html: Created from download-netscape.html, made
2980 * recur.c (recursive_retrieve): Set opt.force_dir_hier when
2981 following FTP links from recursions.
2983 1996-07-09 Hrvoje Niksic <hniksic@srce.hr>
2985 * url.c (mymkdir): Fixed a bug that prevented mymkdir() to create
2986 absolute directories correctly.
2988 * version.c: Changed version to 1.4b14.
2990 1996-07-09 Hrvoje Niksic <hniksic@srce.hr>
2992 * version.c: "Released" 1.4b13.
2994 * url.c (make_backup): New function.
2996 * http.c (http_loop): Make a backup copy of the local file (using
2997 rename(2)) before opening it.
2999 * main.c (main): Added --backups.
3001 * host.c (ftp_getaddress): Bail out on failed mycuserid().
3002 (ftp_getaddress): Check for leading dot on MY_DOMAIN.
3003 (ftp_getaddress): Check for empty, null or (null) domain.
3005 * url.c (get_urls_html): If this_url is NULL, the base must have a
3007 (parseurl): Use has_proto.
3009 * retr.c (retrieve_url): Warn when proxy is used with more than 20
3012 * url.c (mkstruct): Create the directory (calling mymkdir()) only
3013 if it is not already there.
3014 (has_proto): New function.
3015 (get_urls_html): Eliminate the remaining call to findurl -- use
3020 * main.c: Change -X to -x.
3022 * url.c (url_filename): Simplify creation of filename if
3023 prefix_files is set.
3024 (url_filename): Simplify everything. And I do mean *everything*.
3025 (mkstruct): Add dir_prefix before hostname.
3026 (path_simplify): Fixed a bug that caused writing outside the path
3027 string in case of "." and ".." path strings.
3029 1996-07-06 Hrvoje Niksic <hniksic@srce.hr>
3031 * init.c: Added --mirror.
3033 * main.c (main): Added -X to force saving of directory hierarchy.
3035 * ftp.c (ftp_retrieve_list): Added recursion depth counter.
3036 (ftp_retrieve_list): Check whether quota is exceeded.
3038 * url.c (get_urls_html): Skip leading blanks for absolute URIs.
3040 * http.c (gethttp): Use referer if present.
3042 * recur.c (recursive_retrieve): Set u->referer before calling
3045 * url.c (newurl): Use memset to nullify the struct members.
3046 (freeurl): Free the referer field too.
3048 * url.h: Added referer to urlinfo.
3050 * geturl.1: Updated the manual to document some of the new features.
3052 * utils.c (numdigit): Moved from url.c.
3054 * README: Rewritten.
3056 * config.h.in: Add the support for socks.
3058 * configure.in: Add the support for socks.
3060 * url.c (url_filename): If the dir_prefix is ".", work with just
3062 (url_filename): Do not look for .n extensions if timestamping if
3065 * retr.c (show_progress): Skip the over-abundant restval data, and
3066 print the rest of it with ',' instead of '.'.
3068 1996-07-05 Hrvoje Niksic <hniksic@srce.hr>
3070 * retr.c (show_progress): Changed second arg. to long (as it
3072 (show_progress): Moved to retr.c.
3073 (get_contents): Moved to retr.c.
3075 * version.c: Change version to 1.4b13.
3077 1996-07-05 Hrvoje Lacko <hlacko@fly.cc.fer.hr>
3079 * url.c (in_acclist): Would return after the first suffix.
3081 1996-07-04 Hrvoje Niksic <hniksic@srce.hr>
3083 * version.c: "Released" 1.4b12.
3085 * url.c (path_simplify): More kludgifications.
3086 (get_urls_html): Use new parameters for htmlfindurl.
3088 * html.c: Removed memorizing "parser states", since the new
3089 organization does not require them.
3091 * init.c (run_geturlrc): Use read_whole_line.
3093 * ftp-unix.c (ftp_parse_ls): Use read_whole_line.
3095 * recur.c (parse_robots): Use read_whole_line.
3097 * utils.c (read_whole_line): New function.
3099 * recur.c (tried_robots): Use add_slist/in_slist, *much* cleaner.
3101 * host.c (ngethostbyname): Call inet_addr just once. Yet to be
3102 tested on OSF and Ultrix.
3103 (add_hlist): New function.
3104 (free_hlist): New function.
3105 (search_host): New function.
3106 (search_address): New function.
3107 (realhost): Use search_host, search_address and add_hlist.
3108 (same_host): Replaced realloc() with strdupdelim(), made
3109 case-insensitive, fixed a memory leak.
3111 * html.c (ftp_index): Fixed tm_min and tm_sec to be tm_hour and
3112 tm_min, like intended.
3114 * version.c: Change user agent information to
3117 1996-07-03 Hrvoje Niksic <hniksic@srce.hr>
3119 * utils.c: Renamed nmalloc.c to utils.c, .h likewise.
3121 * url.c (acceptable): Always accept directories.
3123 * ftp-unix.c (ftp_parse_ls): Support brain-damaged "ls -F"-loving
3124 servers by stripping trailing @ from symlinks and trailing / from
3127 * ftp.c (ftp_loop): Debugged the "enhanced" heuristics. :-)
3129 * url.c (skip_url): Use toupper instead of UCASE.
3131 * host.c (sufmatch): Made it case-insensitive.
3133 * url.c (match_backwards_or_pattern): Fixed i == -1 to j == -1.
3134 (match_backwards): New function, instead of
3135 match_backwards_or_pattern.
3137 * recur.c (recursive_retrieve): Increased performance by
3138 introducing inl, which reduces number of calls to in_slist to only
3141 * ftp.c (ftp_loop): Enhanced the heuristics that decides which
3144 * main.c (printhelp): Removed the warranty stuff.
3146 1996-07-02 Hrvoje Niksic <hniksic@srce.hr>
3148 * url.c (add_slist): Simplify.
3149 (match_backwards_or_pattern): New function.
3150 (in_acclist): Use match_backwards_or_pattern.
3153 1996-06-30 Hrvoje Niksic <hniksic@srce.hr>
3155 * ftp.c (ftp_loop): Call ftp_index on empty file names, if not
3158 * html.c (ftp_index): Fixed to work. Beautified the output.
3160 * ftp.c (ftp_retrieve_glob): Another argument to control whether
3161 globbing is to be used.
3162 (ftp_retrieve_list): Compare the time-stamps of local and remote
3163 files to determine whether to download.
3165 1996-06-29 Hrvoje Niksic <hniksic@srce.hr>
3167 * ftp.c (rel_constr): New function.
3169 * retr.c (retrieve_from_file): Check for text/html before
3170 retrieving recursively.
3172 * main.c (main): Check whether the file is HTML before going into
3173 recursive HTML retrieving.
3175 * ftp.c (ftp_retrieve_list): Manage directories.
3176 (ftp_retrieve_glob): Pass all the file-types to ftp_retrieve_list.
3177 (ftp_1fl_loop): Fixed a bug that caused con->com to be incorrectly
3178 initialized, causing bugchecks in getftp to fail.
3180 * configure.in: Check for symlink.
3182 * ftp.c (ftp_retrieve_list): Added support for symlinks.
3184 * version.c: "Released" 1.4b10.
3186 * atotm.c (atotm): Redeclared as time_t.
3188 * init.c: New variable "timestamping".
3190 * main.c (main): New option 'N'.
3192 * http.c (hgetlocation): Case-insensitive match.
3193 (hgetmodified): New function.
3194 (http_loop): Implement time-stamping.
3196 1996-06-28 Hrvoje Niksic <hniksic@srce.hr>
3198 * version.c: Changed version to 1.4b10
3200 * atotm.c: New file, from phttpd.
3202 * options.h (struct options): New parameter timestamping.
3204 * version.c: 1.4b9 "released".
3206 * recur.c (recursive_retrieve): Used linked list (ulist) for
3207 faster storing of URLs.
3209 * url.c (get_urls_html): Removed the old kludge with comparing the
3210 outputs of htmlfindurl and findurl.
3211 (get_urls_html): Added better protocol support here.
3212 (create_hash): Removed, as well as add_hash and in_hash.
3213 (addslist): New function.
3216 * version.c: Released 1.4b8, changed version to b9.
3218 1996-06-27 Hrvoje Niksic <hniksic@srce.hr>
3220 * ftp.c (freefileinfo): New function.
3221 (delelement): New function.
3225 * ftp.c (ftp_loop): Use ccon.
3226 (ftp_retrieve_glob): Likewise.
3228 * ftp.h: Define ccon, to define status of control connection.
3230 * ftp.c (ftp_get_listing): New function.
3231 (ftp_retrieve_more): New function.
3232 (ftp_retrieve_glob): New function.
3234 1996-06-25 Hrvoje Niksic <hniksic@srce.hr>
3236 * configure.in: Removed the search for cuserid().
3238 * init.c (getmode): Renamed to getperms.
3240 1996-06-24 Hrvoje Niksic <hniksic@srce.hr>
3242 * version.c: New version.
3244 * main.c (hangup): New function, that handles hangup. Hangup
3245 signal now causes geturl to stop writing on stdout, and to write
3248 * ftp.c (getftp): "Released" 1.4b7.
3250 * html.c (htmlfindurl): Ignore everything inside <head>...</head>.
3251 (ftp_index): Use fileinfo/urlinfo.
3253 * ftp-unix.c (ftp_parse_ls): New function.
3254 (symperms): New function.
3256 * ftp.c (ftp_1fl_loop): New function, to handle 1-file loops.
3258 * retr.c (retrieve_url): Added FTP support.
3260 1996-06-23 Hrvoje Niksic <hniksic@srce.hr>
3262 * geturl.h: Removed NOTFTP2HTML enum.
3263 Added DO_LOGIN, DO_CWD and DO_LIST. LIST_ONLY is obsolete.
3265 * ftp.c (getftp): Resynched with urlinfo.
3266 (getftp): Removed HMTL-ization of index.html from getftp.
3268 * version.c: 1.4b6 "released".
3270 * options.h (options): New struct, to keep options in.
3272 * http.c (http_loop,gethttp): Synched with proxy.
3274 * retr.c (retrieve_url): Implemented proxy retrieval.
3276 * main.c (main): Use retrieve_from_file.
3278 1996-06-22 Hrvoje Niksic <hniksic@srce.hr>
3280 * retr.c (retrieve_from_file): New function.
3282 * url.c (parseurl): Modified to return URLOK if all OK. Protocol
3283 can be found in u->proto.
3285 * ftp.c (ftp_response): Fixed to accept multi-line responses as
3288 * recr.c (recursive_retrieve): Take newloc from retrieve_url.
3290 * url.c (mymkdir): Removed the file of the same name, if one
3292 (isfile): New function.
3293 (mkstruct): Fixed the '/' glitches.
3294 (path_simplify): Hacked to treat something/.. correctly.
3296 1996-06-21 Hrvoje Niksic <hniksic@srce.hr>
3298 * http.c (gethttp): Close the socket after error in headers.
3299 (http_loop): HEOF no longer a fatal header.
3301 * loop.c (retrieve_url): When dt is NULL, do not modify it. This
3302 simplifies the syntax of calling retrieve_url.
3304 * recr.c (recursive_retrieve): Modified to use get_urls_html.
3306 * url.c (get_urls_file): New function.
3307 (get_urls_html): New function.
3309 * recr.c (recursive_retrieve): Patched up to conform to the
3312 * http.c (gethttp): Synched with the rest...
3313 (gethttp): Treat only CONREFUSED specially, with connection
3316 * init.c,geturl.1,http.c (http_loop): Removed kill_error.
3318 1996-06-20 Hrvoje Niksic <hniksic@srce.hr>
3320 * http.c (http_loop): New function.
3322 * loop.c: Removed *lots* of stuff from retrieve_url.
3324 * url.c (parseurl): Changed to work with urlinfo. Integrated
3325 username finding and path parsing.
3326 (newurl): New function.
3327 (freeurl): New function.
3328 (mkstruct): Removed the old bogosities, made it urlinfo-compliant.
3329 (url_filename): Likewise.
3330 (path_simplify): Accept relative paths too.
3331 (opt_url): Made urlinfo-compliant, removed bogosities.
3332 (path_simplify): Expanded to accept relative paths.
3333 (str_url): A replacement for hide_url
3334 (decode_string): Fixed a bug that caused malfunctioning when
3335 encountering an illegal %.. combination.
3336 (opt_url): Removed the argument. Dot-optimizations are now default.
3338 * nmalloc.c (strdupdelim): New function.
3340 * url.h: Added the urlinfo structure
3342 1996-06-19 Hrvoje Niksic <hniksic@srce.hr>
3344 * url.c (hide_url): Thrown out the protocol assertion. Do not
3345 change the URL if the protocol if not recognized.
3346 (findurl): Put continue instead of break.
3348 1996-06-18 Hrvoje Niksic <hniksic@srce.hr>
3350 * sample.geturlrc: Changed the defaults to be commented out and
3351 harmless (previous defaults caused pains if copied to
3354 * http.c (gethttp): Print the HTTP request in debug mode.
3356 * connect.c (iread): Added EINTR check loop to select-ing
3357 too. EINTR is now correctly handled with select().
3361 1996-05-07 Hrvoje Niksic <hniksic@srce.hr>
3363 * host.c (same_host): Made the function a little bit more
3364 intelligent regarding diversified URL syntaxes.
3366 * url.c (skip_url): Spaces are now skipped after URL:
3368 * Released 1.3.1 with the patch to prevent crashing when sending
3369 NULL to robot* functions and the patch to compile "out of the box"
3372 * recr.c (recursive_retrieve): Added checking whether this_url is
3373 NULL when calling the robot functions.
3375 * ChangeLog: New file.