From: hniksic Date: Thu, 30 May 2002 01:51:09 +0000 (-0700) Subject: [svn] New file. X-Git-Tag: v1.13~1744 X-Git-Url: http://sjero.net/git/?p=wget;a=commitdiff_plain;h=ac7a99c6365898d3763b18e96703ec27e6039d8e [svn] New file. --- diff --git a/src/ChangeLog-branches/1.8_branch.ChangeLog b/src/ChangeLog-branches/1.8_branch.ChangeLog new file mode 100644 index 00000000..f73830ee --- /dev/null +++ b/src/ChangeLog-branches/1.8_branch.ChangeLog @@ -0,0 +1,6253 @@ +2002-05-28 Hrvoje Niksic + + * version.c: Wget 1.8.2 is released. + +2002-02-11 Christian Lackas + + * Recurive downloading for https fixed. + +2002-05-24 Hrvoje Niksic + + * progress.c (bar_set_params): Fall back to dot progress if the + terminal type is "emacs". + +2002-05-24 Hack Kampbjorn + + * url.c (mkstruct) [WINDOWS || __CYGWIN__]: Use `_' to separate domain + from port number in filename instead of `:'. + +2002-05-24 Ian Abbott + + * url.c (mkstruct) [WINDOWS || __CYGWIN__]: Replace query character + `?' with `@'. + (compose_file_name) [WINDOWS || __CYGWIN__]: Ditto. + +2002-04-11 Hrvoje Niksic + + * progress.c (struct progress_implementation): Use PARAMS when + declaring the parameters of *create, *update, *finish, and + *set_params. + + * netrc.c: Ditto. + + * http.c: Reformat some function definitions so that ansi2knr can + read them. + + * hash.c (struct hash_table): Use the PARAMS macro around + parameters in the declaration of hash_function and test_function. + (prime_size): Spell 2580823717UL and 3355070839UL as (unsigned + long)0x99d43ea5 and (unsigned long)0xc7fa5177 respectively, so + that pre-ANSI compilers can read them. + (find_mapping): Use PARAMS when declaring EQUALS. + (hash_table_put): Ditto. + + * ftp.h: Wrap the parameters of ftp_index declaration in PARAMS. + + * cookies.c (cookie_new): Use (unsigned long)0 instead of 0UL, + which was unsupported by pre-ANSI compilers. + + From Nelson H. F. Beebe , for the most part. + +2002-04-11 Hrvoje Niksic + + * progress.c (bar_create): If INITIAL is larger than TOTAL, fix + TOTAL. + (bar_finish): Likewise. + +2002-02-19 Andreas Damm + + * wget.h (DO_REALLOC_FROM_ALLOCA): Multiply with sizeof(type) when + calling xmalloc and memcpy. + +2002-01-07 Ian Abbott + + * url.c (uri_merge_1): Deal with "net path" relative URL (one that + starts with "//"). + +2002-01-14 Hrvoje Niksic + + * url.c (parse_uname): URL-decode *USER and *PASSWD. + +2002-02-19 Hrvoje Niksic + + * recur.c (retrieve_tree): Handle the case when start_url doesn't + parse. + +2002-01-17 Hrvoje Niksic + + * netrc.c (parse_netrc): Skip leading whitespace before testing + whether the line is empty. Empty lines still contain the line + terminator. + +2002-01-14 Hrvoje Niksic + + * http.c (gethttp): Invalidate SOCK if get_contents encountered an + error. + +2002-04-16 Hrvoje Niksic + + * http.c (gethttp): If Content-Type is not given, assume + text/html. + +2002-02-01 Hrvoje Niksic + + * html-url.c (tag_handle_meta): Don't crash on where content is missing. + +2002-01-14 Hrvoje Niksic + + * headers.c (header_get): Strip trailing whitespace from the + header. + +2002-01-15 Hrvoje Niksic + + * gen_sslfunc.c (ssl_iread): Call select on the file descriptor + only if no data is pending in SSL buffers. + From tony@bluetail.com. + +2001-12-24 Hrvoje Niksic + + * version.c: Wget 1.8.1 is released. + +2001-12-19 Hrvoje Niksic + + * version.c: Wget 1.8.1-pre3 is released. + +2001-12-19 Hrvoje Niksic + + * recur.c (retrieve_tree): Enqueue the canonical representation of + start_url, so that the test against dl_url_file_map works. + +2001-12-19 Hrvoje Niksic + + * log.c (logputs): Check for requested verbosity before printing + anything. + +2001-12-19 Hrvoje Niksic + + * html-url.c (tag_handle_link): Treat the "shortcut icon" link as + inline. + +2001-12-18 Hrvoje Niksic + + * recur.c (retrieve_tree): Make a copy of file obtained from + dl_url_file_map because the code calls xfree(file) later. + +2001-12-18 Hrvoje Niksic + + * recur.c (register_html): Maintain a hash table of HTML files + along with the list. Disallow duplicates. + (retrieve_tree): Use downloaded_html_set to check whether the file + found in dl_url_file_map is an HTML file, and descend into it if + so. + (convert_all_links): Don't guard against duplicates in + downloaded_html_list, since they are no longer possible. + +2001-12-18 Ian Abbott + + * recur.c (retrieve_tree): Pass on referring URL when retrieving + recursed URL. + +2001-12-17 Hrvoje Niksic + + * version.c: Wget 1.8.1-pre2 is released. + +2001-12-17 Hrvoje Niksic + + * retr.c (sleep_between_retrievals): Simplify indentation. + +2001-12-17 Hrvoje Niksic + + * gen_sslfunc.c (ssl_init_prng): Use random_number to get a byte + of "randomness" at a time. + (ssl_init_prng): Don't seed the PRNG; random_number will do that. + + * retr.c (sleep_between_retrievals): Use it. Make sure that the + random amount averages in opt.wait. + (sleep_between_retrievals): Don't seed the PRNG; random_number + will do that. + + * utils.c (random_number): New function. + +2001-12-14 Hrvoje Niksic + + * url.c (path_simplify): Move here from utils.c, and make static. + +2001-12-13 Hrvoje Niksic + + * init.c (wgetrc_file_name): Print correct message when loading + getenv("WGETRC") fails. + +2001-12-13 Hrvoje Niksic + + * recur.c (register_download): Don't abort when one URL references + two different files. + +2001-12-13 Hrvoje Niksic + + * http.c (gethttp): Check for conn->scheme, not u->scheme, before + calling ssl_iwrite. + +2001-12-13 Hrvoje Niksic + + * version.c: Wget 1.8.1-pre1 is released. + +2001-12-13 Hrvoje Niksic + + * res.c (matches): Fix broken URL in the docstring. + +2001-12-13 Hrvoje Niksic + + * html-url.c (tag_url_attributes): Mark as + external. + +2001-12-12 Hrvoje Niksic + + * url.c (get_urls_file): Cosmetic changes. + +2001-12-12 Hrvoje Niksic + + * html-url.c (append_one_url): Resurrect warning when unable to + resolve a relative link. + +2001-12-12 Hrvoje Niksic + + * html-url.c (collect_tags_mapper): Break into several functions. + (tag_url_attributes): Collect . + +2001-12-11 Hrvoje Niksic + + * host.c: New type ipv4_address. Use it consistently instead of + `unsigned char[4]' and `unsigned char *'. + (pretty_print_address): Accept a `const void *', to require even + less casting. + +2001-12-11 Hrvoje Niksic + + * ftp-ls.c (ftp_parse_vms_ls): Fix obvious memory leaks. + +2001-12-10 Hrvoje Niksic + + * main.c (main): Initialize progress after fork_to_background, so + that it knows when to use dots. + + * mswindows.c (ws_hangup): Call log_request_redirect_output. + + * utils.c (fork_to_background): Print the PID of the child + process. + + * log.c (log_request_redirect_output): Set a flag that output + redirection has been requested. Doing anything else in a signal + handler is unsafe. + (check_redirect_output): New function: check whether redirection + has been requested and, if so, call redirect_output(). + (logputs): Call check_redirect_output. + (logprintf): Ditto. + (debug_logprintf): Ditto. + (redirect_output): Print clearer messages. + + * main.c (redirect_output_signal): Don't call + redirect_output_signal directly. Instead, call + log_request_redirect_output. + + * utils.c (memfatal): Ditto. + + * progress.c (display_image): Use it. + + * log.c (log_set_save_context): New function: allow the caller to + turn off saving log context lines. + +2001-12-10 Hrvoje Niksic + + * host.c (address_list_set_faulty): Uncomment a sanity check. + +2001-12-10 Hrvoje Niksic + + * utils.c (long_to_string): Return a pointer after where the + number ends. + (long_to_string): Rename to number_to_string. + +2001-12-10 Hrvoje Niksic + + * utils.c (path_simplify): Correctly handle the unlikely case that + b starts out as path + 1. + +2001-12-10 Hrvoje Niksic + + * utils.c (path_simplify): Rewrite, with better comments, and + without the use of strcpy to move overlapping blocks. + +2001-12-09 Hrvoje Niksic + + * init.c (cmd_spec_progress): Resurrect. Check whether VAL is a + valid progress type before setting it. + +2001-12-09 Hrvoje Niksic + + * main.c (main): Remove stray debugging message. + +2001-12-09 Hrvoje Niksic + + * progress.c (create_image): Fix ETA padding when hours are prined. + +2001-12-09 Hrvoje Niksic + + * version.c: Wget 1.8 is released. + +2001-12-09 Hrvoje Niksic + + * url.c (reencode_string): Declare static. + + * res.c (registered_specs): Declare static. + + * progress.c (current_impl_locked): Declare static. + + * log.c (flush_log_p): Declare static. + (needs_flushing): Ditto. + + * http.c (digest_authentication_encode): Declare static. + + * html-url.c (init_interesting): Declare static. + + * host.c (host_name_addresses_map): Declare static. + + * cookies.c (find_matching_chains): Declare static. + + * ftp-ls.c (ftp_parse_vms_ls): Warn about the memory leak + indicated by lint. + + * utils.c (path_simplify): Remove unused variable STUB_CHAR. + + * host.c (address_list_set_faulty): Document that INDEX is + currently unused. + + * url.c (rewrite_shorthand_url): Remove unused variable PATH. + +2001-12-08 Hrvoje Niksic + + * version.c: Wget 1.8-pre2 is released. + +2001-12-06 Hrvoje Niksic + + * progress.c (progress_handle_sigwinch): Set up the signal again. + + * utils.c: Include , where Solaris defines + TIOCGWINSZ. + + * progress.c (bar_create): Don't use the last column on the screen. + (create_image): Pad ETA to constant size. Pad SIZE to nine digits + only until it exceeded them. + +2001-12-06 Hrvoje Niksic + + * version.c: Wget 1.8-pre1 is released. + +2001-12-06 Hrvoje Niksic + + * progress.c (progress_create): Make sure that, when the output is + redirected, the progress implementation gets changed to the + fallback one. + (bar_set_params): Set current_impl_locked to 1 when "force" is + specified. + (progress_create): Don't change the progress implementation if + current_impl_locked is non-zero. + + * main.c (redirect_output_signal): Call + progress_schedule_redirect. + + * progress.c (progress_schedule_redirect): New function. + +2001-12-06 Hrvoje Niksic + + * log.c (logvprintf): Restructure to allow being called multiple + times. + (logprintf): Call logvprintf in a loop. + (debug_logprintf): Ditto. + +2001-12-06 Hrvoje Niksic + + * gen_sslfunc.c (ssl_init_prng): Allow the user to disable EGD by + setting egd_file it to empty string. + + * main.c (main): Change the option name from --sslegdsock to + --egd-file. + +2001-12-06 Hrvoje Niksic + + * gen_sslfunc.c (ssl_init_prng): Make the printed message + translatable. + +2001-12-06 Hrvoje Niksic + + * url.c (scheme_disable): New function. + + * main.c (main): Call ssl_init_prng from here rather than from + init_ssl, so that it has a chance to disable support for https + before a URL has been resolved. + + * gen_sslfunc.c (ssl_init_prng): Seed with rand() if all else + failed. + (ssl_init_prng): Disable support for https if seeding the PRNG + fails. + +2001-12-06 Hrvoje Niksic + + * utils.c (read_whole_line): Handle lines beginning with \0. + +2001-12-05 Hrvoje Niksic + + * recur.c (convert_all_links): Guard against duplicates in + downloaded_html_files. + (register_download): Don't invalidate similar-looking URLs. + (match_except_index): New function. + +2001-12-05 Hrvoje Niksic + + * utils.c (path_simplify): Document with test cases. + +2001-12-04 Hrvoje Niksic + + * gen_sslfunc.c: Ditto. + + * rbuf.c: Include . + +2001-12-04 Hrvoje Niksic + + * recur.c (retrieve_tree): Check whether the URL was already + downloaded before downloading it again. + (descend_child_p): Renamed to download_child_p. + (register_download): When one URL is downloaded to a file already + "owned" by another URL, delete all references that map any URL to + that file. + (register_delete_file): New function. + (retrieve_tree): Use it after deleting a file. + + * url.c (url_parse): Re-canonicalize the URL also if the path is + empty, so that e.g. "http://www.server.com" -> + "http://www.server.com/". + (lowercase_str): Use ISUPPER instead of !ISLOWER. + + * retr.c (retrieve_url): Use the canonical URL form when calling + register_download(). + +2001-12-04 Ian Abbott + + * snprintf.c (dopr): Use `unsigned int' as the second argument to + va_arg when casting to `unsigned short' is intended. + +2001-12-04 Herold Heiko + + * gen_sslfunc.c: on windows provide ssl crypto random + initialization through RAND_screen(); could possibly + be not enough for strong ssl communication (see the + relevant manual page from the openssl package). + +2001-12-04 Hrvoje Niksic + + * url.c (local_quote_string): Reenable quoting of question marks, + but only when `--html-extension' is used. + +2001-12-03 Hrvoje Niksic + + * version.c: Wget 1.8-beta3 is released. + +2001-12-03 Hrvoje Niksic + + * snprintf.c (dopr): Cast the result of va_arg to short int and + short unsigned int where these types are expected to be used. + +2001-12-03 Hrvoje Niksic + + * snprintf.c (dopr): Replace `short int' and `unsigned short int' + with `int' when using it as the second argument to `va_arg'. + +2001-12-03 Hrvoje Niksic + + * host.c (address_list_new_one): New function. + (lookup_host): Use it. + +2001-12-03 Andre Majorel + + * host.c (lookup_host): Don't initialize TMPSTORE directly because + it's not legal C. + +2001-12-03 Hrvoje Niksic + + * ftp-basic.c (ftp_port): Don't return HOSTERR if we fail getting + the socket data. + + * ftp.c: Ditto. + + * http.c: No need to declare h_errno. + + * host.c: Declare h_errno. + +2001-12-02 Hrvoje Niksic + + * utils.c (file_merge): If BASE doesn't contain a slash, just + return a copy of FILE. + +2001-12-02 Hrvoje Niksic + + * version.c: Wget 1.8-beta2 is released. + +2001-12-01 Hrvoje Niksic + + * ftp.c (getftp): When PWD fails, assume "/". + + * ftp-basic.c (ftp_syst): Fix indentation. + +2001-12-01 Hrvoje Niksic + + * url.c (get_urls_file): If opt.base_href is specified, merge each + URL with the base. + +2001-12-01 Hrvoje Niksic + + * main.c (print_help): Don't document the removed `-nh'. + +2001-12-01 Hrvoje Niksic + + * url.c (url_full_path): Document better. + + * http.c (gethttp): Use the full path when creating digest + authorization. + +2001-12-01 Hrvoje Niksic + + * cookies.c (path_matches): Return 0 if PREFIX doesn't begin with + '/'. + +2001-12-01 Hrvoje Niksic + + * cookies.c (path_matches): FULL_PATH doesn't begin with '/', but + PREFIX does. + +2001-12-01 Hrvoje Niksic + + * cookies.c (check_domain_match): Reimplement to match Netscape's + "preliminary specification" for cookies. + +2001-12-01 Hrvoje Niksic + + * url.c (replace_attr_refresh_hack): New function. + (convert_links): Call replace_attr_refresh_hack for Refresh + links. It will add the "TMOUT; URL=" junk before the link. + + * html-url.c (collect_tags_mapper): Set ID to the ID of the + "content" attribute, not "http-equiv". + (collect_tags_mapper): Don't use OFFSET to hack the raw_* values; + instead, store the information that this entry belongs to a + "refresh" link. + +2001-12-01 Hrvoje Niksic + + * version.c: Wget 1.8-beta1 is released. + +2001-12-01 Hrvoje Niksic + + * recur.c (retrieve_tree): Allow -p retrievals to exceed maximum + depth by more than one. + +2001-11-30 Hrvoje Niksic + + * retr.c (retrieve_url): Don't allow more than 20 redirections. + +2001-11-30 Hrvoje Niksic + + * recur.c (retrieve_tree): Skip the non-inline entries when + enqueuing the children of a leaf HTML node in -p mode. + (descend_url_p): Ignore opt.no_parent when in -p mode and UPOS is + "inline". + + * html-url.c (get_urls_html): Don't accept dash_p_leaf_HTML. + (collect_tags_mapper): When an entry is "inline", mark it as such. + + * recur.c (descend_url_p): Fix test when checking for + acceptance/rejection rules. + +2001-10-31 Daniel BODEA + + * netrc.c (search_netrc): When slack_default is 0, still look for + an account with matching password, just not the "default account". + HTTP Authorization using .netrc should now work as expected. + +2001-11-30 T. Bharath + + * http.c (persistent_available_p): Call SHUTDOWN_SSL if + test_socket_open fails. + +2001-11-30 Hrvoje Niksic + + * progress.c (display_image): Just print one CR to reset the + cursor position. + +2001-11-30 Christian Fraenkel + + * init.c: New command `ssl_egd_sock'. + + * main.c (main): New option `--sslegdsock'. + + * gen_sslfunc.c (ssl_init_prng): Seed the RNG using EGD. + +2001-11-29 Hrvoje Niksic + + * cmpt.c (memmove): Include a simple memmove implementation. + +2001-11-29 Hrvoje Niksic + + * headers: Guard against header files being included twice. + +2001-11-29 Hrvoje Niksic + + * gen-md5.c: Use unsigned char * as the buffer argument to + gen_md5_update. + +2001-11-29 Hrvoje Niksic + + * connect.h: Declare select_fd. + +2001-11-29 Hrvoje Niksic + + * recur.c (descend_url_p): When resolving no_parent, compare with + start_url, not parent url. Otherwise link from /a/b/ to /a/c/ + wouldn't be followed, although the download started from /a/. + +2001-01-23 Herold Heiko + + * config.h.ms, mswindows.h: defined HAVE_ISATTY, use _isatty for + MS VC; somebody with Borland compiler please check and provide + patch if possible; + + * cmpt.c: provided a usleep emulation. + +2001-11-29 Hrvoje Niksic + + * host.c (address_list_new): Initialize al->faulty. + +2001-11-29 Hrvoje Niksic + + * http.c (http_process_range): Accept the broken output of + "JavaWebServer/1.1.1". + +2001-11-28 Hrvoje Niksic + + * progress.c (dot_set_params): If PARAMS is unspecified, use + dot_style, if available. + + * init.c: Ditto. + + * main.c (main): Resurect --dot-style. + + * progress.c (dot_finish): Print the quantity if we're left at the + beginning of a row. + +2001-11-27 Hrvoje Niksic + + * cmpt.c (random): Removed. + + * retr.c (sleep_between_retrievals): Use the more portable rand() + instead of random(). + +2001-11-27 Ian Abbott + + * retr.c (retrieve_from_file): Initialize `new_file' to NULL to + prevent seg fault. + +2001-11-27 Hrvoje Niksic + + * connect.c (connect_to_many): Use address_list_set_faulty to + prevent the faulty address from being reused. + + * host.c (address_list_set_faulty): New function. + (address_list_get_bounds): New function, instead of + address_list_count. + +2001-11-27 Hrvoje Niksic + + * url.c (convert_links): Don't translate %d-%d. + + * main.c (print_help): Remove stray HAVE_RANDOM code. + +2001-11-27 Hrvoje Niksic + + * ftp.c (getftp): Improve output after sending PASV. Don't + attempt to "look up" the IP address we already know; call + connect_to_one directly. + +2001-11-27 Hrvoje Niksic + + * progress.c: Change the default progress implementation to "bar". + +2001-11-27 Hrvoje Niksic + + * progress.c (bar_create): Print two newlines. + +2001-11-27 Hrvoje Niksic + + * cmpt.c (random): New function, a simple-minded replacement for + random() on systems that don't have it. + +2001-11-26 Hrvoje Niksic + + * config.h.in: Put a HAVE_USLEEP stub. + + * cmpt.c (usleep): Replacement implementation of usleep using + select. + + * init.c: New option init_rate. + + * main.c (main): New option --limit-rate. + + * retr.c (limit_bandwidth): New function. + (get_contents): Call it to limit the bandwidth used when + downloading. + + * progress.c (dot_update): Would print the wrong download speed on + rows other than the first one when the download was continued. + (dot_finish): Ditto. + +2001-11-26 Ian Abbott + + * http.c (gethttp): fix undeclared variable 'err' when compiled + with HAVE_SSL. + +2001-11-26 Hrvoje Niksic + + * progress.c: Don't allocate new timers; use the timing data + propagated from the caller. + + * retr.c (get_contents): Allocate and use a timer. + +2001-11-26 Hrvoje Niksic + + * http.c (last_host_ip): Made into an address_list. + (invalidate_persistent): Release pc_last_host_ip. + (register_persistent): Use lookup_host. + (persistent_available_p): Check for equality of hosts using + address_list_match_all. Call address_list_release. + (http_cleanup): New function. + + * ftp.c (getftp): Use lookup_host and connect_to_many. + + * http.c (gethttp): Use lookup_host and connect_to_many. + + * connect.c (make_connection): Removed. + (connect_to_one): New function. + (connect_to_many): Ditto. + (set_connection_host_name): Ditto. + + * host.c (lookup_host): New function; new return type. + (address_list_new): New function. + (address_list_count): Ditto. + (address_list_copy_one): Ditto. + (address_list_delete): Ditto. + (address_list_release): Ditto. + (pretty_print_address): Ditto. + +2001-11-26 Hrvoje Niksic + + * recur.c (retrieve_tree): In case of followed redirection, + blacklist the pre-redirection URL. + +2001-11-26 Hrvoje Niksic + + * recur.c (descend_redirect_p): New function. + (retrieve_tree): Make sure redirections are not blindly followed. + +2001-11-04 Alan Eldridge + + * config.h.in: added HAVE_RANDOM. + + * options.h: added random_wait to struct options. + + * main.c (print_help [HAVE_RANDOM], main): added arg parsing, help + for --random-wait. + + * retr.c (sleep_between_retrievals) [HAVE_RANDOM]: added + implementation of random wait times. + + * init.c (commands): added "randomwait" keyword. + +2001-11-25 Hrvoje Niksic + + * recur.c (descend_url_p): Be more conservative with blacklisting + URLs. + (convert_all_links): Print how many files have been converted, and + how long it took. + + * progress.c (create_image): Place the number of downloaded bytes + right after the progress bar. + + * utils.c (suffix): Return a pointer into the string. + +2001-11-25 Hrvoje Niksic + + * url.c (convert_links): Handle CO_NULLIFY_BASE. + + * recur.c (retrieve_tree): Ignore download-ignorable children. + (convert_all_links): Specify CO_NULLIFY_BASE when link_base_p. + + * html-url.c (handle_link): Return the newly created urlpos. + (collect_tags_mapper): When dealing with BASE, store the base + reference and mark it as download-ignorable. + +2001-11-25 Hrvoje Niksic + + * url.c (convert_links): Attempt to quote '?' as "%3F" when + linking to local files. Given up on the attempt, as it breaks + local browsing. + +2001-11-25 Hrvoje Niksic + + * main.c (private_initialize): Removed. + (main): Don't call private_initialize. + + * http.c: Call lookup_host. + + * host.c (host_init): Removed. + (add_host_to_cache): Initialize host_name_address_map here, on + demand. + (ngethostbyname): Commented out. + + * connect.c (make_connection): Remove dead code; use lookup_host. + + * host.c (store_hostaddress): Renamed to lookup_host and reversed + the args. + Removed host_address_name_map and host_slave_master_map. + +2001-11-25 Hrvoje Niksic + + * progress.c (dot_create): Align the "[ skipping ... ]" string + with the dots. + + * retr.c (rate): Split into two functions: calc_rate and + retr_rate. + + * progress.c (create_image): Draw a dummy progress bar even when + total size is unknown. + (display_image): Place the text cursor at the end of the "image". + +2001-11-25 Hrvoje Niksic + + * url.c (reencode_string): Use unsigned char, not char -- + otherwise the hex digits come out wrong for 8-bit chars such as + nbsp. + (lowercase_str): New function. + (url_parse): Canonicalize u->url if needed. + (get_urls_file): Parse each URL, and return only the valid ones. + (free_urlpos): Call url_free. + (mkstruct): Add :port if the port is non-standard. + (mkstruct): Append the query string to the file name, if any. + (urlpath_length): Use strpbrk_or_eos. + (uri_merge_1): Handle the cases where LINK is an empty string, + where LINK consists only of query, and where LINK consists only of + fragment. + (convert_links): Count and report both kinds of conversion. + (downloaded_file): Use a hash table, not a list. + (downloaded_files_free): Free the hash table. + + * retr.c (retrieve_from_file): Ditto. + + * main.c (main): Call either retrieve_url or retrieve_tree + for each URL, not both. + + * retr.c (register_all_redirections): New function. + (register_redirections_mapper): Ditto. + (retrieve_url): Register the redirections. + (retrieve_url): Make the string "Error parsing proxy ..." + translatable. + + * res.c (add_path): Strip leading slash from robots.txt paths so + that the path representations are "compatible". + (free_specs): Free each individual path, too. + (res_cleanup): New function. + (cleanup_hash_table_mapper): Ditto. + + * recur.c (url_queue_new): New function. + (url_queue_delete): Ditto. + (url_enqueue): Ditto. + (url_dequeue): Ditto. + (retrieve_tree): New function, replacement for recursive_retrieve. + (descend_url_p): New function. + (register_redirection): New function. + + * progress.c (create_image): Cosmetic changes. + + * init.c (cleanup): Do all those complex cleanups only if + DEBUG_MALLOC is defined. + + * main.c: Removed --simple-check and the corresponding + simple_host_check in init.c. + + * html-url.c (handle_link): Parse the URL here, and propagate the + parsed URL to the caller, who would otherwise have to parse it + again. + + * host.c (xstrdup_lower): Moved to utils.c. + (realhost): Removed. + (same_host): Ditto. + +2001-11-24 Hrvoje Niksic + + * utils.c (path_simplify): Preserver the (non-)existence of + leading slash. Return non-zero if changes were made. + +2001-11-24 Hrvoje Niksic + + * progress.c (bar_update): Don't modify bp->total_length if it is + zero. + +2001-11-24 Hrvoje Niksic + + * retr.c (retrieve_url): When the redirection URL doesn't parse, + print the correct error message rather than "UNKNOWN". + +2001-11-24 Hrvoje Niksic + + * progress.c (bar_finish): If the timer didn't record any time + since the download beginning, fake 1ms. + +2001-11-24 Hrvoje Niksic + + * recur.c (recursive_retrieve): Fix typo. + +2001-11-23 Hrvoje Niksic + + * progress.c (create_image): Don't translate "%ld ". + +2001-11-23 Hrvoje Niksic + + * progress.c (bar_set_params): Allow the user to force the use of + the bar. + +2001-11-23 Lemble Gregory + + * gen_sslfunc.c (ssl_init_prng): New function; seed the SSL RNG. + +2001-11-23 Hrvoje Niksic + + * progress.c: Renamed dp_* functions to dot_* for greater clarity + and consistency with bar_*. + (print_download_speed): Get rid of the unneeded '@' character. + (create_image): Fix download rate geometry. + + * progress.c (print_elapsed): Remove spurious space. + (print_elapsed): Renamed to print_download_speed, since that's + what it does. + +2001-11-23 Hrvoje Niksic + + * progress.c (bar_update): If the downloaded amount becomes larger + than the expected amount, adjust the expected amount accordingly. + +2001-11-23 Hrvoje Niksic + + * utils.c (determine_screen_width): New function. + + * main.c (main): New option `--progress=TYPE'. + (main): Implement compatibility with the old option `--dot-style'. + + * init.c: Removed cmd_spec_dotstyle -- that logic is now in + dp_set_params. + (cmd_spec_progress): New function. + + * retr.c (get_contents): Use the progress_* functions instead of + the old show_progress(). + (show_progress): Removed. + (rate): Print "xxxx.xx K/s" instead of "KB/s". Ditto for MB/s, + etc. + + * progress.c (set_progress_implementation): New function. + (valid_progress_implementation_p): Ditto. + (progress_create): Ditto. + (progress_update): Ditto. + (progress_finish): Ditto. + (dp_create): Ditto. + (dp_update): Ditto. + (dp_finish): Ditto. + (dp_set_params): Ditto. + (print_elapsed): Ditto. + +2001-11-22 Hrvoje Niksic + + * retr.c (show_progress): Use it. + + * log.c (log_set_flush): New function. + +2001-11-22 Hrvoje Niksic + + * utils.c (path_simplify): Don't remove trailing slashes. + + * ftp.c (ftp_get_listing): Use it. + + * utils.c (file_merge): New function. + + * url.c (opt_url): Removed. + + * recur.c (recursive_retrieve): Inline "opt_url" logic. + + * main.c (main): Use xfree(), not free(). + + * url.c (rewrite_url_maybe): Renamed to rewrite_shorthand_url. + + * ftp.c (ccon): Move `ccon' typedef here, since it's only used + internally. + + * config.h.in: Include a stub for HAVE_STRPBRK. + + * cmpt.c (strpbrk): Include a replacement for systems without + strpbrk(). + + * ftp.c: Use url_set_dir and url_set_file when modifying the URL. + + * url.c (url_set_dir): New function. + (url_set_file): Ditto. + + * ftp-basic.c (ftp_process_type): Process FTP type here; the URL + parser makes the URL "params" available, so we can do that in this + function. + + * retr.c: Ditto. + + * ftp.c: Ditto; pass the local file information in `ccon'. + + * http.c: Get rid of the ugly kludge that had URL being replaced + with the proxy URL when proxy retrieval was requested. Use a + separate parameter to http_loop and gethttp for the proxy URL. + + * http.c: Changed to reflect the fact that local file, proxy, and + referer information are no longer stored in struct url. The local + file information is passed in `struct hstat' now. + + * url.c: Reworked URL parsing to be more regular. Reencode the + URL using reencode_string. + Removed non-URL-related information from struct url. This + includes fields `proxy', `local', and `referer'. + +2001-11-22 Jochen Hein + + * main.c (main): Split the copyright notice for easier + translation. + +2001-08-21 Dave Turner + + * ftp-basic.c (ftp_size): New function to send non-standard SIZE + command to server to request file size. + * ftp.h (ftp_size): Export it. + * ftp.c (getftp): Use new ftp_size function if restoring + transfer of a file with unknown size. + +2001-11-20 Hrvoje Niksic + + * url.c (parseurl): Don't depend on the now-obsolete TYPE. + +2001-11-19 Hrvoje Niksic + + * url.c (getproxy): Handle URL shorthands. + +2001-11-19 Hrvoje Niksic + + * main.c: Remove --wait / --waitretry backwards compatibility + code. + +2001-11-19 Hrvoje Niksic + + * main.c (main): Use it. + + * url.c (rewrite_url_maybe): New function. + +2001-11-19 Hrvoje Niksic + + * url.c: Clean up handling of URL schemes. + +2001-05-13 Hrvoje Niksic + + * url.c: Get rid of `protostrings'. + (skip_proto): Don't use protostrings. + (has_proto): Ditto. + +2001-11-18 Hrvoje Niksic + + * Makefile.in: Conditionally compile getopt.o. + +2001-11-18 Hrvoje Niksic + + * md5.h: Renamed to gnu-md5.h. + + * md5.c: Renamed to gnu-md5.c. + + * http.c: Ditto. + + * ftp-opie.c: Use the new macros. + + * sysdep.h: Define md5-related macros. + + * config.h.in: Define HAVE_SOLARIS_MD5 or HAVE_BUILTIN_MD5 + depending on which md5 implementation is used. + +2001-11-18 Hrvoje Niksic + + * res.c (res_register_specs): Initialize OLD and HP_OLD to appease + the compiler. + +2001-11-18 Hrvoje Niksic + + * http.c (gethttp): Print the whole response line when printing + headers is requested. + +2001-05-12 Hrvoje Niksic + + * res.c: New file. Implement all RES-related code here. + +2001-11-18 Hrvoje Niksic + + * version.c: Wget 1.7.1 is released. + +2001-11-16 Hrvoje Niksic + + * headers.c (header_extract_number): Ignore trailing whitespace. + +2001-08-24 Ian Abbott + + * html-url.c (collect_tags_mapper): Fix bug converting links + with -k option for tags with multiple link attributes by + handling links in the order they appear. + +2001-08-15 Ian Abbott + + * ftp.c (ftp_loop_internal): Avoid a potential buffer overflow in + the call to the 'rate' function by moving it past the error + checking for the 'getftp' function return value. + +2001-11-16 Hrvoje Niksic + + * html-parse.c (advance_declaration): Use 0x22 instead of '"' or + '\"'. Different compilers' assert macros are broken in different + ways. + +2001-09-29 Christian Fraenkel + + * http.c (gethttp): print debug output for errors occuring during + the ssl handshake. + +2001-11-16 Chris Seawood + + * init.c: Ditto. + + * host.c: Ditto. + + * connect.c: Ditto. + + * sysdep.h: Support compilation under BEOS. + +2001-06-08 Edward J. Sabol + + * url.c (url_equal): Fix a memory leak when parseurl returns an + error on the second URL. Also, since url_equal is not used at the + moment, do not compile it. + + * url.h: Ditto for the prototype of url_equal. + +2001-11-16 Hrvoje Niksic + + * html-parse.c (map_html_tags): Support XML-style empty tags. + +2001-06-26 Hrvoje Niksic + + * wget.h (DO_REALLOC_FROM_ALLOCA): Check for do_realloc_newsize in + loop condition because we're no longer setting SIZEVAR here. + +2001-06-26 Hrvoje Niksic + + * wget.h (DO_REALLOC_FROM_ALLOCA): Set SIZEVAR after the memcpy() + call because it needs the old value. + +2001-06-18 Hrvoje Niksic + + * cookies.c (ATTR_NAME_CHAR): Allow almost any character to be in + an attribute name. + +2001-06-18 Hrvoje Niksic + + * url.c (url_filename): Make sure that slashes that sneak in to + u->file via query string get protected. + (file_name_protect_query_string): New function. + +2001-06-14 Hrvoje Niksic + + * recur.c (recursive_retrieve): Also check undesirable_urls with + canonicalized URL. + +2001-06-14 Hrvoje Niksic + + * http.c (gethttp): Search `.netrc' with real host, not the proxy + one. + +2001-06-14 Hrvoje Niksic + + * sysdep.h (MAP_FAILED): Provide MAP_FAILED for systems that don't + define it. + +2001-06-09 Jan Prikryl + + * ftp.h: Provide correct prototype for ftp_parse_ls(). + +2001-06-04 Hrvoje Niksic + + * version.c: Wget 1.7 is released. + +2001-06-03 Karl Eichwalder + + * ftp-ls.c (ftp_parse_ls): Fix typo. + +2001-05-27 Hrvoje Niksic + + * all: Update copyright information. + +2001-05-26 Hrvoje Niksic + + * http.c (gethttp): Indicate that the continued download failed + for *this* file. + +2001-05-26 Hrvoje Niksic + + * version.c: Wget 1.7-pre1 is released. + +2001-05-26 Hrvoje Niksic + + * version.c: Updated version to 1.7-pre1. + +2001-05-14 Hrvoje Niksic + + * gen_sslfunc.c: Don't include directly. + +2001-05-14 Hrvoje Niksic + + * http.c (gethttp): Use real URL data for cookies, not the proxy + stuff. + +2001-05-12 Hrvoje Niksic + + * main.c (print_help): Document `--no-http-keep-alive'. + + * utils.c (numdigit): Handle negative numbers *correctly*. + + * hash.c (make_nocase_string_hash_table): Use term "nocase" rather + than the confusing "unsigned". + + * utils.c (string_set_contains): Renamed from string_set_exists. + + * hash.c (hash_table_contains): Renamed from hash_table_exists. + + * cookies.c: Move case-insensitive hash tables to hash.c. + +2001-05-09 Hrvoje Niksic + + * http.c (gethttp): Before concluding that the file is already + fully retrieved, make sure that the file existed and `Range' was + actually requested. + +2001-05-09 Hrvoje Niksic + + * cookies.c (eliminate_dups): New function. + (build_cookies_request): Use it. + (build_cookies_request): Set chain_store_size after reallocating + all_chains. + (check_domain_match): Annotated for easier future debugging. + (store_cookie): In the debug message, print whether the cookie is + permanent. + +2001-05-08 Hrvoje Niksic + + * http.c (http_loop): Reset no_truncate before deciding whether to + set it. + (gethttp): Further clarify "-c conflicts with existing file" error + message, based on input from Herold Heiko. + +2001-05-07 Hrvoje Niksic + + * http.c (http_loop): If restval is set, set no_truncate to 1 + unconditionally. + +2001-05-02 Jan Prikryl + + * ftp-ls.c (ftp_parse_winnt_ls): Assure months are being correctly + converted. Pointed out by . + (ftp_parse_vms_ls): Ditto. + +2001-04-30 Hrvoje Niksic + + * init.c (cmd_address): Zero SIN before using it; apparently + needed on *BSD. + +2001-04-29 Hrvoje Niksic + + * ftp.c (ftp_loop_internal): Don't set NO_TRUNCATE if the file is + empty. + +2001-04-29 Hrvoje Niksic + + * main.c (main): Make `--cookies' respect its argument. + +2001-04-28 Hrvoje Niksic + + * main.c (main): Removed undocumented option `--email-address'. + + * netrc.c: Use the latest read_whole_line. + + * init.c (defaults): Set opt.ftp_pass to "-wget@". + + * mswindows.c (pwd_cuserid): Ditto. + + * utils.c (pwd_cuserid): Removed. + + * host.c (ftp_getaddress): Removed. + +2001-04-28 Hrvoje Niksic + + (http_loop): Allocate space for filename_plus_orig_suffix with + alloca; this is more efficient and removes the need to free it + before each and every return. + +2001-04-28 Hrvoje Niksic + + * http.c (gethttp): Return RETRUNNEEDED when the retrieval is + unneeded because the file is already there and fully downloaded, + and -c is specified. + (http_loop): Handle RETRUNNEEDED. + + * wget.h (uerr_t): New value RETRUNNEEDED. + + * http.c (http_loop): Set no_truncate for files that both exist + and are non-empty. + (gethttp): Consider the download finished when restval >= contlen, + not only when restval==contlen. + (gethttp): Handle redirection before giving up due to -c. + (gethttp): Clarify error message which explains that -c will not + truncate the file. + (gethttp): When returning CONTNOTSUPPORTED, don't forget to free + the stuff that needs freeing and release the socket. + +2001-04-27 Hrvoje Niksic + + * main.c (print_help): Wget booleans accept "off", not "no". + +2001-04-27 Hrvoje Niksic + + * http.c (http_loop): If allow_cache is zero, always disable + caching, not only when retrieving through proxy. + + * init.c: Ditto. + + * options.h (struct options): Rename proxy_cache to allow_cache. + +2001-04-27 Hrvoje Niksic + + * http.c (mktime_from_utc): Improve documentation. + (http_atotm): Put format strings into a separate array. + +2001-04-27 Hrvoje Niksic + + * safe-ctype.h: Instead of throwing #error when isalpha is + defined, redefine it to something that will throw a compile-time + error if actually *used*. Do the same for the rest of the + standard C macros. + +2001-04-26 Hrvoje Niksic + + * url.c (getproxy): Ignore empty proxy vars. + +2001-04-25 Hrvoje Niksic + + * http.c (http_loop): Would load cookies every time. + + * cookies.c (load_cookies): Handle cookies whose values contain + embedded spaces. + +2001-04-25 Hrvoje Niksic + + * utils.c: Define each DIGITS_* in one line. + +2001-04-25 Roger L. Beeman + + * http.c (http_atotm): Initialize t.tm_isdst to 0. + (mktime_from_utc): Prevent mktime() from having discontinuities at + DST transition points. + +2001-04-25 Hrvoje Niksic + + * html-url.c (get_urls_html): Fix documentation. + +2001-04-25 Hrvoje Niksic + + * url.c (UNSAFE_CHAR): Reimplement using a static table. + (url_init): Removed. + (init_unsafe_char_table): Removed. + +2001-04-25 Hrvoje Niksic + + * snprintf.c (dopr): Replace ISDIGIT with '0' <= ch && ch <= '9'. + +2001-04-25 Hrvoje Niksic + + * utils.c: Document timer functions. + + * retr.c (rate): Use it. + (rate): Print in GB/s if transfer rate exceeds 1 GB/s. + + * utils.c (wtimer_granularity): New function. + +2001-04-24 Hrvoje Niksic + + * retr.c (show_progress): Ditto. + + * ftp.c (getftp): Ditto. + + * http.c (gethttp): Use new timer functions. + + * utils.c (wtimer_allocate): New function. + (wtimer_new): Ditto. + (wtimer_delete): Ditto. + (wtimer_reset): Ditto. + (wtimer_elapsed): Ditto. + +2001-04-24 Hrvoje Niksic + + * utils.c (long_to_string): New, faster version. Favors smaller + numbers; much of the calculation is now done at compile-time. + +2001-04-24 Hrvoje Niksic + + * utils.c (numdigit): Handle negative numbers. + +2001-04-23 Hrvoje Niksic + + * retr.c (show_progress): Print the download rate even when the + percentages are not available. + +2001-04-21 Hrvoje Niksic + + * ftp.c (getftp): Adjust expected_bytes if the length is + authoritative. + +2001-04-14 Hrvoje Niksic + + * url.c: Don't declare `construct'. + + * hash.c (grow_hash_table): Speed up rehashing; inline storing of + mappings to new locations. + (hash_table_new): Make resize_threshold a field in the hash table, + so we don't have to recalculate it in each hash_table_put. + (grow_hash_table): Update resize_threshold. + (MAX): Remove unused macro. + (prime_size): Made static. + +2001-04-14 Hrvoje Niksic + + * retr.c (retrieve_url): Call uri_merge, not url_concat. + + * html-url.c (collect_tags_mapper): Call uri_merge, not + url_concat. + + * url.c (mkstruct): Use encode_string instead of xstrdup followed + by URL_CLEANSE. + (path_simplify_with_kludge): Deleted. + (contains_unsafe): Deleted. + (construct): Renamed to uri_merge_1. + (url_concat): Renamed to uri_merge. + +2001-04-13 Hrvoje Niksic + + * url.c (str_url): Use encode_string instead of the unnecessary + CLEANDUP. + (encode_string_maybe): New function, returns input string if no + encoding is needed. + (encode_string): Call encode_string_maybe to do the dirty work, + xstrdup if no work needed. + +2001-04-13 Hrvoje Niksic + + * wget.h (XDIGIT_TO_xchar): Define here. + + * url.c (decode_string): Use new name. + (encode_string): Ditto. + + * http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar. + (dump_hash): Use new name. + + * wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and + XDIGIT_TO_XCHAR respectively. + +2001-04-13 Hrvoje Niksic + + * init.c: Include cookies.h. + + * cookies.h: Declare cookies_cleanup. + + * cookies.c (check_domain_match): Remove unused variable. + (save_cookies): Remove extraneous argument from debug statement. + + * host.c (same_host): Don't call skip_url. + + * url.c (skip_url): Removed. Removed its calls from various + functions in url.c. + +2001-04-13 Hrvoje Niksic + + * cookies.c (unsigned_string_hash): Use the new code in + string_hash as reference. + + * hash.c (hash_table_map): Allow deletion and change of the + element processed by MAPFUN. + (string_hash): Use the function from glib. + +2001-04-12 Hrvoje Niksic + + * config.h.in: Include #undef stub. + + * hash.c (hash_table_remove): Rewrite to actually clear deleted + entries instead of just marking them as deleted. + +2001-04-12 Hrvoje Niksic + + * hash.h: Declare hash_table_get_pair and hash_table_count. + +2001-04-12 Hrvoje Niksic + + * cookies.c: Declare http_atotm. + +2001-04-12 Hrvoje Niksic + + * ftp-ls.c (ftp_parse_unix_ls): Use octal constants for + permissions. A compiler that doesn't accept octal constants is + seriously broken and shouldn't be used -- octal constants were + present in K&R C! + +2001-01-20 Karl Eichwalder + + * Makefile.in: Provide and use DESTDIR according to the Coding + Standards. + +2001-04-12 Hrvoje Niksic + + * ftp-ls.c (ftp_parse_vms_ls): Make seconds optional in time + specification. + +2001-04-11 Hrvoje Niksic + + * url.c (parseurl): Don't strip trailing slash when u->dir is "/" + because that strips the *leading* slash, thus forcing relative + FTP retrieval. + +2001-04-10 Jan Prikryl + + * ftp.c (getftp): Convert initial FTP directory from VMS to UNIX + notation for VMS servers. + (ftp_retrieve_dirs): Do not prepend '/' to f->name when + odir is an empty string. + +2001-04-10 Jan Prikryl + + * ftp-ls.c (ftp_parse_winnt_ls): Made the fix for AM/PM more + effective. Suggested by Edward J. Sabol. + +2001-04-10 Hrvoje Niksic + + * cookies.c (build_cookies_request): Use and sort cookies from all + matching domains. + (build_cookies_request): Check for duplicates before generating + the `Cookies' header. + + * main.c (main): Don't load cookies here. + (main): Make loadcookies and savecookies call the correct command. + + * http.c (http_loop): Load cookies on-demand. + +2001-04-09 Hrvoje Niksic + + * http.c (gethttp): Fix indentation of SSL ifdef. + +2001-04-09 Hrvoje Niksic + + * ftp.c (ftp_retrieve_dirs): Don't forcibly prepend "/" to u->dir; + that hack is no longer necessary. + (getftp): Prepend initial directory to *non*-absolute u->dir's. + +2001-04-09 Hrvoje Niksic + + * init.c (cmd_file): New function. + (enable_tilde_expansion): New variable. + (run_wgetrc): Use it. + (cmd_file): Use it. + +2001-04-08 Hrvoje Niksic + + * init.c: Include cookie-related options. + + * main.c (main): Include cookie-specific options. + (main): Load cookies before download is finished. + (main): Save cookies when done. + + * http.c (gethttp): Process the `Set-Cookie' header. + (gethttp): Include cookies in the response. + + * cookies.c: New file. + +2001-04-08 Hrvoje Niksic + + * utils.c (datetime_str): New function. + +2001-04-08 Jan Prikryl + + * ftp-ls.c (ftp_parse_winnt_ls): The AM/PM change did assume + 12:01PM == 00:01, which was obviously wrong. Taken care of this + anomaly. + + * ChangeLog: Removed an excess conflict marker. Reformatted the + entry by Philipp Thomas from 2001-03-09. + + * ftp-ls.c (ftp_parse_winnt_ls): Ensure that adjusted PM hours lay + between 0 and 23. Elminate unused variable `sec'. + +2001-04-08 Hrvoje Niksic + + * hash.c (hash_table_count): New function. + +2001-04-06 Hrvoje Niksic + + * utils.c (read_file): Cast MAP_FAILED to char *. Enforced by + Digital Unix cc. + +2001-04-06 Hrvoje Niksic + + * config.h.in: Oops, do the namespace tweaks only on systems we + know about. + +2001-04-06 Hrvoje Niksic + + * hash.c: Include . + +2001-04-06 Hrvoje Niksic + + * config.h.in: Define "compilation environment" options that work + under Linux and Solaris. To be reviewed on other OS'es. + +2001-04-06 Hrvoje Niksic + + * http.c (gethttp): Prepend literal newline with `\n\'. + +2001-04-06 Hrvoje Niksic + + * sysdep.h: Don't define VERY_LONG_FORMAT. + + * utils.c (very_long_to_string): New function. + (legible_very_long): Use it; don't use VERY_LONG_FORMAT. + +2001-04-04 Christian Fraenkel + + * url.c (parse_uname): Would run past the end of the string if the + username was present, but the URL did not contain a slash, e.g. + http://foo:bar@myhost. + +2001-04-03 KOJIMA Hajime + + * http.c (http_atotm): Use %A instead of %a to match full + weekday. (On most systems there is no difference.) + +2001-04-03 Paul Bludov + + * mswindows.c (sleep): Use SleepEx() instead of Sleep(). + (ws_changetitle): Use alloca() instead of malloc() to avoid memory + leak. + (ws_mypath): Use GetModuleFileName instead of argv[0]. + (ws_startup): Use data.wVersion for comparison. + +2001-04-02 Hrvoje Niksic + + * http.c (http_loop): Ditto. + + * ftp.c (ftp_loop_internal): Made the check whether to continue + retrieval `-O'-friendly. + +2001-04-02 Hrvoje Niksic + + * netrc.c (parse_netrc): Don't trim the line endings explicitly; + they will be handled as whitespace. + (parse_netrc): Correctly handle lines that end with whitespace. + +2001-04-02 Hrvoje Niksic + + * retr.c (retrieve_url): New variable global_download_count used + to identify first retrieval. + + * ftp.c (getftp): Ditto. + + * http.c (gethttp): Rewind opt.dfp only on first retrieval. + +2001-04-02 Hrvoje Niksic + + * init.c (cmd_address): Heap-allocate the address that gets stored + to CLOSURE. Old code would simply assign an address on the stack. + +2001-04-01 Nicolas Lichtmaier + + * ftp.c (ftp_get_listing): Propagate error status. + (ftp_retrieve_glob): Use it. + (ftp_loop): Ditto. + +2001-04-01 Nicolas Lichtmaier + + * main.c (main): Add -C to the string that is the third arg to + getopt_long(). + +2001-04-01 Hrvoje Niksic + + * ftp.c (getftp): Don't start the download from scratch if `-c' + was specified, but the file is already fully downloaded. + + * http.c (gethttp): Don't truncate a pre-existing file if `-c' was + specified and the server doesn't support continued download. + (gethttp): Don't start the download from scratch if `-c' was + specified, but the file is already fully downloaded. + +2001-03-31 Hrvoje Niksic + + (recursive_retrieve): Don't clear the hash tables at this point at + all; it interferes with the normal operation of register_download. + +2001-03-31 Hrvoje Niksic + + * recur.c (recursive_retrieve): Clear the hash tables only when + they are defined. + +2001-03-31 Hrvoje Niksic + + * http.c (gethttp): Make sure the socket is closed with + CLOSE_INVALIDATE before we have drained the body. + +2001-03-31 Hrvoje Niksic + + * retr.c (retrieve_url): Call register_download() for downloaded + files and register_html() for downloaded HTML files. + + * recur.c (register_download): New function; register here that a + file has been downloaded, rather than in recursive_retrieve(). + (register_html): New function; enqueue the location of HTML files + here rather than in recursive_retrieve(). + +2001-03-31 Hrvoje Niksic + + * main.c (print_help): Use multiple fputs instead of a single ugly + printf(). + (main): Consistently assign numbers >128 to options without a + corresponding character. + +2001-03-09 Philipp Thomas + + * safe-ctype.h: New file. Locale independent ctype.h replacement + taken from libiberty. + + * safe-ctype.c: New file. Tables for above. + + * Makefile.in: Add safe-ctype$o to OBJS. Add dependencies for + safe-ctype$o. + + * cmpt.c: Remove include of ctype.h. Use ISSPACE instead of + isspace. + + * ftp-basic.c: Don't include ctype.h. + + * ftp-ls.c: Likewise. + + * ftp.c: Likewise. + + * headers.c: Likewise. + + * host.c: Likewise. + + * html-parse.c: Likewise. + + * html-url.c: Likewise. + + * http.c: Likewise. + + * init.c: Likewise. + + * main.c: Likewise. Set LC_CTYPE along with LC_MESSAGES. + + * netrc.c: Likewise. + + * recur.c: Likewise. + + * retr.c: Likewise. + + * snprintf.c: Replace ctype.h with safe-ctype.h. Use ISDIGIT + instead of isdigit. + + * sysdep.h: Remove defines of ctype macros as they aren't needed + for safe-ctype-h. + + * url.c: Don't include ctype.h. + + * utils.c: Likewise. + + * wget.h: Include safe-ctype.h. + +2001-03-27 Dan Harkless + + * Makefile.in: Moved top_builddir out of "User configuration + section" of top Makefile and analogous spot in this one. + +2001-03-17 Dan Harkless + + * Makefile.in: Include @SSL_INCLUDES@ substition in INCLUDES. + Define top_builddir. Link wget with libtool so the user doesn't + have to supply a bunch of custom environment variables to + correctly link with the OpenSSL shared libraries. + +2001-03-06 Hack Kampbjorn + + * http.c (gethttp): skip :port in host header if it is the + DEFAULT_HTTPS_PORT when using SSL. + + * url.c: move the #define of DEFAULT_HTTP_PORT, DEFAULT_FTP_PORT + and DEFAULT_HTTPS_PORT to the header file so it can be use in the + rest of the code. + * url.h: Ditto + +2001-03-01 Jonas Jensen + + * retr.c (show_progress): Correctly calculate the number of bytes + in the first line of the download that have been actually + downloaded in this run. + +2001-02-23 Dan Harkless + + * main.c (print_help): --help documentation for -N said it would + re-download files if they had the _same_ timestamp on server. + (print_help): -nr belongs in "FTP options" section of --help + output, not "Recursive retrieval" section. Alphabetized FTP + options by long option name. + +2001-02-16 Dan Harkless + + * init.c (commands): Hack Kampbjørn discovered + that "httpsproxy" had been inserted into commands[] out of + alphabetical order, causing "BUG: unknown command `httpuser'". + +2001-02-13 Jan Prikryl + + * ftp-ls.c (ftp_parse_ls): Added support of ST_MACOS (Unix-like + listing without correct permissons). + + * ftp.h (stype): Added ST_MACOS to identify the NetPresenz MacOS + FTP server. + + * ftp.c (ftp_retrieve_list): New mirroring logic: A remote file + shall be donwloaded only when it's newer than the local copy or + when it has the same timeestamp but its size is different. ST_VMS + and ST_MACOS as special cases that lie about file size. + + * ftp-ls.c (ftp_parse_ls): Support for ST_MACOS. + + * Makefile.in: Removed dependency on ftpparse library due to unclear + copyright issues and absence of any feedback to our queries. + + * ftp-ls.c: Removed dependency on ftpparse library due to unclear + copyright issues and absence of any feedback to our queries. + (ftp_parse_ls): Added a warning message when remote server system + does not seem to be suported by wget. + (ftp_parse_vms_ls): New function for parsing VMS ftp + server listing output. + (clean_line): New function responsible for removing + end-of-line characters from FTP listing texts. + + * ftp.c (getftp): Global variables pwd and host_type are now + member of the ccon structure under names ccon.id and ccon.rs. + + * ftp.h (struct ccon): Added formed global variables from ftp.c, + enum stype rs (remote system identification) and char *id (initial + working directory), as suggested by Hrvoje. + + * url.c (parse_uname): Added support for passwords containing '@' + characters. + (skip_uname): Ditto. + +2001-02-11 Hrvoje Niksic + + * ftp.c (ftp_loop): Reset con. + +2001-01-06 Jan Prikryl + + * url.c (parse_uname): Added support for passwords containing '@' + characters. + (skip_uname): Ditto. + +2001-02-11 Hack Kampbjørn + + * url.c (parseurl): Debug-print u->ftp_type. + +2001-02-11 Hrvoje Niksic + + * ftp.c (ftp_loop_internal): Disable padding. + (getftp): Ditto. + + * http.c (http_loop): Disable padding. + + * retr.c (show_progress): Use it to enable padding. + + * retr.c (rate): Optional parameter PAD for padding the rate. + +2001-02-10 Hrvoje Niksic + + * retr.c (show_progress): Make sure that the last output line + includes progress. + +2001-02-10 Jonas Jensen + + * retr.c (show_progress): Print the download rate along with the + percentages. + Along with Anders Thorsby . + +2001-02-10 Tim Mooney + + * ftp.h: Rename enums `command' to `wget_ftp_command' and + `fstatus' to `wget_ftp_status' because old names clash with Tru64 + net/if.h. + +2001-02-08 Christian Fraenkel + + * gen_sslfunc.c: verify_callback is now static + + * gen_sslfunc.c (init_ssl): load certificate if specified + + * gen_sslfunc.c (ssl_printerr): new function + + * init.c: added new --sslcertfile and --sslcertkey switches + + * main.c: ditto + + * options.h: ditto + + * http.c (gethttp): abort when init_ssl fails + +2001-01-23 Herold Heiko + + * mswindows.h: Include ; it's needed for alloca(). + +2001-01-10 Dan Harkless + + * url.c (str_url): Clarified this function's comment header after + Hrvoje answered my question on the list as to when hide != 1. + Also Hrvoje pointed out I need to use xstrdup() on the string literal. + +2001-01-06 Hrvoje Niksic + + * connect.c (bindport): Declare addrlen as int. Diagnosed by + Drazen Kacar . + (conaddr): Ditto. + +2001-01-09 Dan Harkless + + * html-url.c: A bunch of fixup of `--page-requisites'-related + comments to reflect Hrvoje's changes to my code when transplanting + it into this new file, to fix spelling mistakes, to clarify, etc. + + * url.c (write_backup_file): Clarified a comment. + (str_url): Henrik van Ginhoven pointed out on the list that we + shouldn't give away the number of characters in the password by + replacing each character with a 'x'. Use "" instead. + + * ftp.c (ftp_retrieve_dirs): The bug where recursion into FTP + directories didn't work if logging in put you in a directory other + than "/" is fixed now. Removed the comment here warning of the bug. + + * main.c (print_help): --continue's description was misleading. + We don't "restart", we "resume". Also, better to say + "partially-downloaded file" rather than just "existing file". + +2001-01-06 Dan Harkless + + * ChangeLog: The '[Not in 1.6 branch.]'s were decided not to be + the best way to go about my aim. Removed them in favor of: + + * ChangeLog-branches/1.6_branch.ChangeLog: New file. + +2001-01-04 Hrvoje Niksic + + * url.c (replace_attr): New function, to be used by both + TO_COMPLETE and TO_RELATIVE case in convert_links. + (find_fragment): New function for finding URL fragments. + (replace_attr): Better handle the case where the original string + is not quoted. Use find_fragment. + (convert_links): Use replace_attr(). + +2000-12-31 Dan Harkless + + * ChangeLog: Since this flat file doesn't have multiple branches, + looking at the dates would make you think that things went into + 1.6 that actually just went into the 1.7-dev branch. Added "[Not + in 1.6 branch.]" where appropriate to clarify. + +2000-12-30 Dan Harkless + + * ftp.c, http.c: Applied Hack Kampbjørn 's + patch to deal with h_errno not being defined in netdb.h under Cygwin. + +2000-12-18 Csaba Raduly + + * sysdep.h: Include and under Watcom. + +2000-12-17 Igor Khristophorov + + * http.c (check_end): Fix test for '+' or '-'. + +2000-12-17 Hrvoje Niksic + + * url.c (parseurl): Rename inner loop var from i to ind to avoid + clash with the function top-level-declared variable i. + (str_url): Likewise, rename inner-loop i to j. + + * recur.c (parse_robots): Don't declare LEN at top of function. + (robots_match): Renamed parameter FORBIDDEN to avoid hiding of + global variable. + + * main.c (main): Change erroneous use of bitwise and to logical. + + * init.c (cmd_address): Don't heap-allocate `sin'; it can be on + the stack because it will be copied to closure. + + Thanks to Csaba Raduly's run of PC-LINT over the sources. + +2000-12-17 Hrvoje Niksic + + * http.c (basic_authentication_encode): Use xmalloc(), not + malloc(). Thanks to Csaba Raduly's run of PC-LINT over the + sources. + +2000-12-17 Csaba Raduly + + * sysdep.h: Test for __EMX__ rather than for EMXOS2 for OS/2 + compilation. + +2000-12-17 Hrvoje Niksic + + * mswindows.c: Include . + + * gen_sslfunc.c: Include . + + * ftp-basic.c: Don't attempt to declare errno or h_errno because + they're not used. + + * main.c: Include because errno is used. + + * ftp.c: Ditto. + + * http.c: Include for h_errno. + +2000-12-13 Hrvoje Niksic + + * html-parse.c (advance_declaration): MSVC assert() chokes on + '\"'. Use '"' instead. + +2000-12-11 Hrvoje Niksic + + * utils.c (xfree_real): Removed. + (xfree_debug): Just call free(). + + * wget.h (xfree): Make it an alias for free. + +2000-12-11 Hrvoje Niksic + + * http.c (http_loop): Furthermore, touch output_document only if + it is known to be an existing regular file. + +2000-12-11 Hrvoje Niksic + + * ftp.c (ftp_retrieve_list): Ditto. + + * http.c (http_loop): Touch output_document if that is used for + output. + +2000-12-06 Hrvoje Niksic + + * http.c: Include gen_sslfunc.h after including Wget's headers. + (persistent_available_p): Needed coma before `int ssl'. + +2000-12-06 Hrvoje Niksic + + * ftp.c (ftp_loop_internal): Ditto. + + * http.c (http_loop): Use it. + + * retr.c (sleep_between_retrievals): New function that handles the + logic of opt.wait and opt.waitretry. + +2000-12-06 Hrvoje Niksic + + * rbuf.h: Implement only a single version of RBUF_READCHAR, using + rbuf_read_bufferful when the buffer is depleted. + + * rbuf.c (rbuf_read_bufferful): New function. + +2000-12-06 Hrvoje Niksic + + * gen_sslfunc.h: Use ansi2knr style function declarations. + + * gen_sslfunc.c: Reformat according to the GNU coding standards. + More should be done. + + * http.c (persistent_available_p): Place the cheap SSL test nearer + the top of the function. + (CLOSE_FINISH, CLOSE_INVALIDATE): Define only one version of each. + +2000-12-05 Hrvoje Niksic + + * url.c (init_unsafe_char_table): Reinstate space as an unsafe + char. + +2000-12-03 Christian Fraenkel + + * Makefile.in: added gen_sslfunc object + * config.h.in: added HAVE_SSL define + * connect.c: changed select_fd from static int to int + * connect.h: ditto + * gen_sslfunc.h: New file + * gen_sslfunc.c: ditto + * http.c: added HTTPS fuctionality + * retrc.c: ditto + * url.c: ditto + * init.c: added opt.httpsproxy + * options.h: ditto + * rbuf.h: added alternate rbuf struct + * wget.h: added CONSSLERR + * rbuf.c: ditto + + * http.c: Added HTTPS fuctionality. + + * retrc.c: Ditto. + + * url.c: Ditto. + + * init.c: Added opt.httpsproxy. + + * options.h: Ditto. + + * rbuf.h: Added alternate rbuf struct. + + * wget.h: Added CONSSLERR. + + * rbuf.c: Ditto. + +2000-11-30 Jan Prikryl + + * ftp-ls.c (ftp_parse_unix_ls): Added second parameter + "ignore_perms" to ignore file and directory permissions for + Windows NT FTP server listings. + (ftp_parse_winnt_ls): New function. + (ftp_parse_ls): Parses UNIX and Windows NT listings + separately. Simple heuristics for distinguishing between UNIX and + MS-DOS-like FTP listing provided by Windows NT FTP service. + +2000-11-18 Jan Prikryl + + * ftpparse.c, ftpparse.h: New files. + + * ftp-ls.c (ftp_parse_ls): Use ftp_parse_unix_ls for UNIX servers + only. Use ftp_parse_nonunix_ls otherwise. + (ftp_parse_nonunix_ls): Stub to the ftpparse library handling all + exotic FTP servers. + + * ftp.h (stype): New enum, distinguishes UNIX, VMS, and "other" + FTP servers. + + * ftp.c: New static wariables host_type, pwd, and pwd_len. + (getftp): Support for VMS. Support for FTP servers that do not + place you in the root directory after login. + (ftp_retrieve_list): VMS is silent about the real file size, issue + a more appropriate message. + (ftp_get_listing): Pass host_type to ftp_parse_ls. + + * ftp-basic.c (ftp_pwd, ftp_syst): New functions. + +2000-11-30 Jan Prikryl + + * ftp-ls.c (ftp_parse_unix_ls): Added second parameter + "ignore_perms" to ignore file and directory permissions for + Windows NT FTP server listings. + (ftp_parse_winnt_ls): New function. + (ftp_parse_ls): Parses UNIX and Windows NT listings + separately. Simple heuristics for distinguishing between UNIX + and MS-DOS-like FTP listing provided by Windows NT FTP service. + +2000-11-29 John Summerfield + + * netrc.c (parse_netrc): Get rid of line ending. + +2000-11-25 Hrvoje Niksic + + * ftp.c (ftp_retrieve_list): Undo typo "fix" until resolution by + Dan. + +2000-11-24 Karl Eichwalder + + * main.c (print_help): Untabify. + +2000-11-23 Hrvoje Niksic + + * utils.c (xrealloc_debug): Do the unregister/register thing only + if the pointer has actually changed. + (xmalloc_real): Declare `static' in DEBUG_MALLOC builds. + (xfree_real): Ditto. + (xrealloc_real): Ditto. + (xstrdup_real): Ditto. + +2000-11-22 Hrvoje Niksic + + * ftp.c (getftp): ftp_getaddress() returns a malloc'ed copy of the + string; no need to strdup() it. + (getftp): Make pwd_len a local variable. + (ftp_loop): Free PWD before returning. + + * init.c (cleanup): Free opt.ftp_pass only if it's non-NULL. + +2000-11-22 Hrvoje Niksic + + * all: Use xfree() instead of free. + + * utils.c (xfree): New function. + +2000-11-21 Hrvoje Niksic + + * url.c (convert_links): HTML-quote the converted string. + + * utils.c (html_quote_string): Move here from ftp-ls.c + (html_quote_string): Make non-static; declare in utils.h. + (html_quote_string): Convert SP to . + +2000-11-21 Hrvoje Niksic + + * ftp.c (getftp): Reformat Jan's code according to GNU coding + standards; remove (debugging?) printf's; use '\0' for the ASCII + zero character. Use alloca() instead of malloc() for + inter-function temporary allocations. + +2000-11-18 Jan Prikryl + + * ftpparse.c, ftpparse.h: New files. + + * ftp-ls.c (ftp_parse_ls): Use ftp_parse_unix_ls for UNIX servers + only. Use ftp_parse_nonunix_ls otherwise. + (ftp_parse_nonunix_ls): Stub to the ftpparse library handling all + exotic FTP servers. + + * ftp.h (stype): New enum, distinguishes UNIX, VMS, and "other" + FTP servers. + + * ftp.c: New static wariables host_type, pwd, and pwd_len. + (getftp): Support for VMS. Support for FTP servers that do not + place you in the root directory after login. + (ftp_retrieve_list): VMS is silent about the real file size, issue + a more appropriate message. + (ftp_get_listing): Pass host_type to ftp_parse_ls. + + * ftp-basic.c (ftp_pwd, ftp_syst): New functions. + +2000-11-21 Hrvoje Niksic + + * hash.c (hash_table_put): Don't overwrite deleted mappings. + +2000-11-21 Hrvoje Niksic + + * hash.c (find_mapping): New function. + (hash_table_get): Use it. + (hash_table_get_pair): Ditto. + (hash_table_exists): Ditto. + (hash_table_remove): Ditto. + (hash_table_remove): Really delete the entry if the mapping + following LOCATION is empty. + + * utils.c (string_set_add): Check whether the element has existed + before. + + * hash.c (hash_table_get_pair): New function. + +2000-11-20 Hrvoje Niksic + + * http.c (http_process_type): Ignore trailing whitespace; use + strdupdelim(). + + * recur.c (recursive_retrieve): Use the new `convert' field. + (convert_all_links): Ditto. + (convert_all_links): Don't respect meta_disallow_follow. + + * html-url.c (handle_link): Fill out link_relative_p and + link_complete_p. + + * url.h (struct _urlpos): Make elements more readable. + + * recur.c (recursive_retrieve): Call slist_prepend instead of + slist_append. + (convert_all_links): Call slist_nreverse before iterating through + urls_html. + + * utils.c (slist_prepend): New function. + (slist_nreverse): Ditto. + +2000-11-20 Hrvoje Niksic + + * http.c (check_end): Constify. + +2000-11-20 Hrvoje Niksic + + * http.c (http_loop): If username and password are known, try the + `Basic' authentication scheme by default. + + * connect.h: Declare test_socket_open. + +2000-11-20 Hrvoje Niksic + + * version.c: Bump version from 1.5.3+dev to 1.7-dev. + +2000-11-20 Hrvoje Niksic + + * http.c (gethttp): Don't use the return value of sprintf(). + (gethttp): Inhibit keep-alive if opt.http_keep_alive is 0. + +2000-11-20 Hrvoje Niksic + + * recur.c (recursive_retrieve): Print the "so we don't load" + debugging message only if we really don't load. + + * http.c (gethttp): Inhibit keep-alive if proxy is being used. + (gethttp): Don't request keep-alive if keep-alive is inhibited. + +2000-11-19 Hrvoje Niksic + + * http.c (gethttp): Make the HTTP persistent connections more + robust. + +2000-11-19 Hrvoje Niksic + + * retr.c (get_contents): If use_expected, make sure that the + appropriate amount of data is being read. + + * http.c (gethttp): Check for both `Keep-Alive: ...' and + `Connection: Keep-Alive'. + + * wget.h (DEBUGP): Call debug_logprintf only if opt.debug is + turned on. + +2000-11-19 Hrvoje Niksic + + * http.c (connection_available_p): Use it. + + * connect.c (test_socket_open): New function. + + * http.c (gethttp): Support persistent connections. Based on the + ideas, and partly on code, by Sam Horrocks . + (register_persistent): New function. + (connection_available_p): Ditto. + (invalidate_connection): Ditto. + +2000-11-19 Hrvoje Niksic + + * url.c (convert_links): Handle UREL2ABS case. + + * recur.c (recursive_retrieve): Instead of the list + urls_downloaded, use hash tables dl_file_url_map and + dl_url_file_map. + (convert_all_links): Use them to retrieve data. + + * host.c (clean_hosts): Free the hash tables. + + * main.c (private_initialize): Call host_init(). + + * host.c (store_hostaddress): Use a saner, hash table-based data + model. + (realhost): Ditto. + (host_init): Initialize the hash tables. + +2000-11-18 Hrvoje Niksic + + * utils.c (slist_append): Eviscerate NOSORT. Hash tables are now + used for what the sorted slists used to be used for. + (slist_contains): Don't rely on the list being sorted. + (slist_append): Simplify the code. + + * recur.c (recursive_cleanup): Use free_string_set. + + * utils.c (string_set_add, string_set_exists, string_set_free): + New functions for easier freeing of hash tables whose keys are + strdup'ed strings. + + * recur.c (recursive_retrieve): Use the hash table functions for + storing undesirable URLs. + + * hash.c: New file. + +2000-11-17 Hrvoje Niksic + + * main.c (private_initialize): Call url_init. + (main): Call private_initialize. + + * url.c (unsafe_char_table): New table. + (UNSAFE_CHAR): Use it. + (init_unsafe_char_table): New function. + (url_init): New function; call init_unsafe_char_table. + +2000-11-16 Hrvoje Niksic + + * mswindows.h: Define snprintf and vsnprintf to _snprintf and + _vsnprintf respectively. + +2000-11-15 Hrvoje Niksic + + * config.h.in: Do the _XOPEN_SOURCE and _SVID_SOURCE things only + on Linux. + +2000-11-15 Hrvoje Niksic + + * html-url.c (handle_link): Handle HTML fragment identifiers. + + * recur.c (recursive_retrieve): If norobot info is respected and + the file is specified not to be followed by robots, respect that. + + * html-url.c (collect_tags_mapper): Handle . For us the important cases are where X is NONE or + where X contains NOFOLLOW. + (get_urls_html): Propagate that information to the caller. + +2000-11-13 Hrvoje Niksic + + * url.c (convert_links): Unlink the file we might be reading from + before writing to it. + (convert_links): Use alloca instead of malloc for + filename_plus_orig_suffix. + +2000-11-12 Hrvoje Niksic + + * host.c (realhost): Add HOST to the list with quality==0 only if + it wasn't already there. + Based on analysis by Lu Guohan . + +2000-11-10 Hrvoje Niksic + + * url.c (get_urls_file): Ditto. + (convert_links): Ditto. + + * html-url.c (get_urls_html): Use read_file() instead of + load_file(). + + * utils.c (read_file): New function, instead of the old + load_file(). + (read_file_free): Ditto. + + * url.c (findurl): Search only for the supported protocols. + (convert_links): Use fwrite() when writing out a region of + characters. + +2000-11-10 Hrvoje Niksic + + * ftp-ls.c: Move html_quote_string and ftp_index here. + + * url.c: Remove get_urls_html, since that's now in html-url.c. + + * html-url.c: New file. + + * html-parse.c: New file. + +2000-11-10 Hrvoje Niksic + + * init.c (run_wgetrc): Don't bother killing off '\r' since + pars_line() skips whitespace at end of line anyway. + (parse_line): Oops, it didn't. Now it does. + + * recur.c (parse_robots): Ditto here. + + * ftp-ls.c (ftp_parse_unix_ls): Kill off the newline character + manually because read_whole_line no longer does. + + * utils.c (read_whole_line): Rewrite to: a) use less memory + (reallocates to needed size after work), b) work faster --> + fgets() instead of getc, c) be more correct --> doesn't kill the + newline character at the end of line. + +2000-11-10 Hrvoje Niksic + + * init.c (comind): Initialize MAX to array size - 1. + +2000-11-08 Hrvoje Niksic + + * url.c (construct): Changed last_slash[-1] to *(last_slash - 1). + Suggested by Edward J. Sabol. + +2000-11-08 Hrvoje Niksic + + * url.c (construct): Handle the case where host name is not + followed by a slash. + +2000-11-06 Hrvoje Niksic + + * init.c: commands[] need to be sorted! ("base" wasn't.) + +2000-11-05 Hrvoje Niksic + + * wget.h (DO_REALLOC_FROM_ALLOCA): Use braces to disambiguate + `if'. + +2000-11-05 Hrvoje Niksic + + * url.c (construct): Insert unneeded initialization for the + compiler to shut up. + + * config.h.in: Define _XOPEN_SOURCE to 500 to get the prototype + for strptime() (*duh*). Define _SVID_SOURCE to get S_IFLNK which + otherwise gets lost when you define _XOPEN_SOURCE. + + * utils.c (touch): Include the file name in the error message. + From Debian. + +2000-11-05 Hrvoje Niksic + + * log.c (logvprintf): Use vsnprintf() in all cases. If necessary, + resize the buffer to fit the formated message. That way, messages + of arbitrary size may be printed. + (logvprintf): Use saved_append() to optionally log the last + several lines of output. + (logputs): Ditto. + (log_close): Adapt to new data structures. + (log_dump): Ditto. + (redirect_output): Print messages to stderr, not to stdout. + + * log.c (saved_append_1): New function. Replaces the old logging + system ("log all output until 10M characters") with a new, much + more reasonable one ("log last screenful of text"). + (saved_append): New function; call saved_append_1. + (free_log_line): New function. + +2000-11-05 Hrvoje Niksic + + * url.c (construct): Fix comment. + (find_last_char): Document. + +2000-11-04 Hrvoje Niksic + + * snprintf.c: New file. + +2000-11-03 Hrvoje Niksic + + * wget.h: If HAVE_STDARG_H is not defined, don't declare argument + types to logprintf() and debug_logprintf(). + +2000-11-02 Hrvoje Niksic + + * ftp.c (ftp_loop_internal): Hide the password from the URL when + printing non-verbose. Problem spotted by Dariusz Mlynarczyk + . + +2000-11-02 Junio Hamano + + * ftp-basic.c (ftp_login): Make comparison case-insensitive. + +2000-11-02 Tyler Riddle + + * http.c (known_authentication_scheme_p): Recognize NTML + authentication. + (create_authorization_line): Treat NTML the same as `Basic'. + +2000-11-02 Hrvoje Niksic + + * retr.c (retrieve_url): Free url before returning. + (retrieve_url): Free mynewloc before returning. + Spotted by Mark A. Mankins . + +2000-11-02 Hrvoje Niksic + + * url.c (parseurl): Remove possible reading past the end of + sup_protos[]. Spotted by Mark A. Mankins . + +2000-11-01 Hrvoje Niksic + + * main.c (main): In case of opt.downloaded overflowing, print + instead of a totally bogus random value. + + * retr.c (retrieve_from_file): Ditto. + + * recur.c (recursive_retrieve): Ditto. + + * main.c (main): Ditto. + + * http.c (http_loop): Ditto. + + * ftp.c (ftp_loop_internal): Use downloaded_increase() instead of + `+=', and downloaded_exceeds_quota() instead of the simple-minded + check. + (ftp_retrieve_list): Ditto. + (ftp_retrieve_dirs): Ditto. + (ftp_retrieve_glob): Ditto. + + * retr.c (downloaded_increase): New function. Notice overflows of + opt.downloaded. + (downloaded_exceeds_quota): Make sure that opt.downloaded is not + used if it overflowed. + + * options.h (struct options): New member downloaded_overflow. + +2000-11-01 Hrvoje Niksic + + * wget.h (enum): Remove extra space after last enumeration. + +2000-11-01 Hrvoje Niksic + + * main.c (main): Use legible_very_long() for printing + opt.downloaded. + + * utils.c (legible_1): New function that operates on strings and + does the brunt of legible()'s work. + (legible): Use legible_1(). + (legible_very_long): New function; dump the argument with + sprintf(), and call legible_1(). + + * options.h (struct options): Use VERY_LONG_TYPE for + opt.downloaded. + + * sysdep.h (VERY_LONG_TYPE): Define it to have a 64-bit or greater + type. + + * config.h.in: Make sure that SIZEOF_LONG and SIZEOF_LONG_LONG get + defined. Define HAVE_LONG_LONG if long long is available. + +2000-11-01 Hrvoje Niksic + + * utils.c (long_to_string): Update with a later, better version. + +2000-11-01 Hrvoje Niksic + + * url.c (path_simplify_with_kludge): New function. + (path_simplify_with_kludge): Disable it. Instead... + (parse_dir): ...make sure that at this point the right thing is + done, i.e. that "query" part of the URL (?...) is always assigned + to the file, never to the directory portion of the path. + +2000-11-01 Hrvoje Niksic + + * retr.c (retrieve_url): Detect redirection cycles. + +2000-11-01 Hrvoje Niksic + + * url.c (get_urls_html): Decode HTML entities using + html_decode_entities. + + * html.c (htmlfindurl): Don't count the `#' in numeric entities + (&#NNN;) as an HTML fragemnt. + (html_decode_entities): New function. + +2000-11-01 Hrvoje Niksic + + * html.c (htmlfindurl): Fix recognition of # HTML fragments. + +2000-11-01 Hrvoje Niksic + + * url.c (construct): Rewritten for clarity. Avoids the + unnecessary copying and stack-allocation the old version + performed. + +2000-10-31 Hrvoje Niksic + + * ftp.c (getftp): Ditto. + + * http.c (gethttp): Rewind the stream when retrying from scratch. + +2000-10-31 Hrvoje Niksic + + * retr.c (retrieve_url): Use url_concat() to handle relative + redirections instead of /ad hoc/ code. + + * url.c (url_concat): New function encapsulating weird + construct(). + (urllen_http_hack): New function. + (construct): When constructing new URLs, recognize that `?' does + not form part of the file name in HTTP. + +2000-10-13 Adrian Aichner + + * retr.c: Add msec timing support for WINDOWS. + * retr.c (reset_timer): GetSystemTime() on WINDOWS. + * retr.c (elapsed_time): Calculate delta time to msec on WINDOWS. + +2000-10-27 Dan Harkless + + * retr.c (retrieve_url): Manually applied T. Bharath + 's patch to get wget to grok + illegal relative URL redirects. Reformatted and re-commented it. + +2000-10-23 Dan Harkless + + * connect.c (make_connection and bindport): Manually applied Rob + Mayoff 's 1.5.3 patch to add --bind-address, + changing coding style to GNU's. + + * ftp.c (ftp_loop_internal): --delete-after wasn't implemented for + files downloaded via FTP. Per a comment, .listing files were not + counted towards number of bytes and files downloaded because they're + deleted anyway. Well, they aren't under -nr, so count them then. + + * init.c: Manually applied Rob Mayoff's 1.5.3 patch to add + --bind-address, alphabetizing, changing coding style to GNU's, + commenting, and renaming cmd_ip_address() to cmd_address() to + imply hostnames also okay. + + * main.c (main): --delete-after didn't delete the root of the + tree. Ignore --convert-links if --delete-after was specified. + Manually applied Rob Mayoff's 1.5.3 patch to add --bind-address, + fixing duplicate use of added-since-1.5.3 case value. + (print_help): Clarified that --delete-after deletes local files. + Rob forgot to add a line for his new --bind-address option. + + * options.h (struct options): Manually applied Rob Mayoff's patch + to add --bind-address (bind_address structure member). + + * recur.c (recursive_retrieve): Improved comment; added DEBUGP(). + Ignore --convert-links if --delete-after was specified. + + * retr.c (retrieve_from_file): Just added a DEBUGP(). + +2000-10-19 Dan Harkless + + * ftp.c (ftp_loop_internal): downloaded_file() enumerators changed. + (getftp): Applied Piotr Sulecki 's + patch to work around FTP servers that incorrectly respond to the + "REST" command with the remaining size rather than the total file size. + + * http.c (gethttp): Improved a comment and added code to tack on + ".html" to text/html files without that extension when -E specified. + (http_loop): Use new downloaded_file() enumerators and deal with + the case of gethttp() called xrealloc() on u->local. + + * init.c (commands): Added new "htmlextension" command. + Also renamed John Daily's cmd_quad() to the more descriptive + cmd_lockable_boolean(), alpha-sorted the CMD_DECLARE()s and + removed duplicate cmd_boolean() declaration. + + * main.c (print_help): Added my new -E / --html-extension option. + (main): Undocumented --email-address option previously used -E synonym. + Stole it away for the much more deserving --html-extension's use. + + * options.h (struct options): Added html_extension field. + + * url.c (convert_links): URL X that we saved as X.html locally due + to -E needs to be backed up as X.orig, not X.html.orig. Added comments. + (downloaded_file): Now remembers if we added .html extension to a file. + + * url.h (downloaded_file_t): Added extra enumerators to support above. + (downloaded_file): Now takes and returns a downloaded_file_t. + + * wget.h (unnamed "dt" enum): Added ADDED_HTML_EXTENSION enumerator. + +2000-10-09 Dan Harkless + + * html.c (htmlfindurl): Added unneeded initialization to quiet warning. + + * main.c (print_help): Clarified what --retr-symlinks does. + +2000-09-15 John Daily + + * init.c: Add support for "always" and "never" values to allow + .wgetrc to override commandline (useful e.g. with .pm files + calling `wget --passive-ftp' when your firewall doesn't allow that). + + * ftp.c (getftp): passive_ftp is first option to support always/never. + +2000-08-30 Dan Harkless + + * ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define. + + * html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter. + Wrapped some > 80-column lines. When -p is specified and we're at a + leaf node, do not traverse , , or tags other than + . + + * html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter. + + * init.c: Added new -p / --page-requisites / page_requisites option. + + * main.c (print_help): Clarified that -l inf and -l 0 both allow + infinite recursion. Changed the unhelpful --mirrior description + to simply give the options it's equivalent to. Added new -p option. + (main): Added some comments; handle new -p / --page-requisites. + + * options.h (struct options): Added new page_requisites field. + + * recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs". + Calculate and pass down new `dash_p_leaf_HTML' parameter to + get_urls_html(). Use new INFINITE_RECURSION #define. + + * retr.c: Changed "URL-s" to "URLs". get_urls_html() now takes + final `dash_p_leaf_HTML' parameter. + + * url.c: get_urls_html() and htmlfindurl() now take final + `dash_p_leaf_HTML' parameter. + + * url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter. + + * wget.h: Added some comments and new INFINITE_RECURSION #define. + +2000-08-23 Dan Harkless + + * main.c (print_help): -B / --base was not mentioned. + +2000-08-22 Dan Harkless + + * main.c (print_help): Modified -nc description to mention that it + also prevents the creation of multiple versions of the same file + with "." suffixes. + +2000-07-14 Jan Prikryl + + * retr.c (retrieve_url): Consistently strdup opt.referer when + setting u->referer. + +2000-06-09 Dan Harkless + + * main.c (print_help): --help output for --waitretry was over 80 cols. + +2000-06-09 Hrvoje Niksic + + * url.c (encode_string): Fix comment. + Suggested by Herold Heiko . + +2000-06-01 Const Kaplinsky + + * ftp.c (ftp_retrieve_list): Change permissions only on plain + files. + +2000-06-01 Hrvoje Niksic + + * url.c (str_url): Print the port number only if it's different + from the default port number for that protocol. + +2000-05-22 Dan Harkless + + * main.c (print_help): Added --help line for Damir Dzeko + 's until-now-undocumented --referer option. + Removed comments that --referer and --waitretry were undocumented. + Changed "`.wgetrc' command" to "`.wgetrc'-style command" on --help + line for --execute. + +2000-05-18 Hrvoje Niksic + + * ftp.c (getftp): Ditto. + + * http.c (gethttp): Check for return value of fclose/fflush. + +2000-04-12 Hrvoje Niksic + + * host.c (store_hostaddress): Instead of shifting ADDR, start + copying from the correct address. + +2000-04-12 Hrvoje Niksic + + * http.c (gethttp): Don't free REQUEST -- it was allocated with + alloca(). + Pointed out by Gisle Vanem . + +2000-04-04 Dan Harkless + + * host.c (store_hostaddress): R. K. Owen's patch introduces a + "left shift count >= width of type" warning on 32-bit + architectures. Got rid of it by tricking the compiler w/ a variable. + + * url.c (UNSAFE_CHAR): The macro didn't include all the illegal + characters per RFC1738, namely everything above '~'. It also + generated a warning on OSes where char =~ unsigned char. Fixed. + +1998-10-17 Hrvoje Niksic + + * http.c (http_process_type): Removed needless strdup(), a memory + leak. + +1998-09-25 Hrvoje Niksic + + * html.c (htmlfindurl): Set PH to the first occurrence of `#'. + +1998-09-25 Simon Munton + + * init.c (wgetrc_file_name): Don't free HOME under Windows. + +1998-12-01 "R. K. Owen" + + * host.c (store_hostaddress): Fix for big endian 64-bit machines. + +1998-12-01 Hrvoje Niksic + + * url.c (UNSAFE_CHAR): New macro. + (contains_unsafe): Use it. + (encode_string): Ditto. + +1998-12-01 Hrvoje Niksic + + * main.c (i18n_initialize): Use LC_MESSAGES only if available. + +2000-03-31 Hrvoje Niksic + + * Use TOUPPER/TOLOWER. + +1998-12-22 Alexander V. Lukyanov + + * ftp-opie.c (btoe): Zero-terminate OSTORE. + +2000-03-21 Hrvoje Niksic + + * wget.h (DO_REALLOC_FROM_ALLOCA): Ditto. + + * sysdep.h (ISALNUM): New macro. + (TOLOWER): Ditto. + (TOUPPER): Ditto. + +2000-03-10 Dan Harkless + + * html.c (idmatch): Implemented checking of my new --follow-tags + and --ignore-tags options. + + * init.c (commands): Added comment reminding people adding new + entries doing allocation to add corresponding freeing in cleanup(). + (commands): Added new followtags and ignoretags commands. + (cleanup): Free storage for new followtags and ignoretags. + + * main.c: Use of "comma-separated list" was random -- normalized + it. Did some alphabetization. Added comments pointing out + "Options without arguments" and "Options accepting an argument" + sections of long_options[]. Added new options --follow-tags and + -G / --ignore-tags. Added comment that Damir's --referer is + currently undocumented. Added comment that Heiko's --waitretry is + partially undocumented (mentioned in --help but not in + wget.texi). Moved improperly sorted 24, 129, and 'G' cases. + + * options.h (struct options): Added new fields follow_tags and + ignore_tags. + + * wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ". + +2000-03-02 Dan Harkless + + * ftp.c (ftp_loop_internal): Heiko introduced "suggest explicit + braces to avoid ambiguous `else'" warnings. Eliminated them. + + * http.c (gethttp): Dan Berger's query string patch is totally + bogus. If you have two different URLs, gen_page.cgi?page1 and + get_page.cgi?page2, they'll both be saved as get_page.cgi and the + second will overwrite the first. Also, parameters to implicit + CGIs, like "http://www.host.com/db/?2000-03-02" cause the URLs to + be printed with trailing garbage characters, and could seg fault. + Backing out the patch, which Dan B. informed me by email was just + a kludge to download StarOffice from Sun made necessary due to + wget's unconditional escaping of certain characters (room for an + option there?). + (http_loop): Heiko introduced "suggest explicit braces to avoid + ambiguous `else'" warnings. Eliminated them. + + * main.c: Heiko's --wait / --waitretry backwards compatibility + code looks to have been totally untested -- automatic variable + 'wr' was used without being initialized, and a long int was passed + into setval()'s char* val parameter. + + * recur.c (parse_robots): Applied Edward J. Sabol + 's patch for Guan Yang's reported + problem with "User-agent:*" lines in robots.txt. + + * url.c (parseurl, str_url): Removing Dan Berger's code (see + http.c above for explanation). + +1999-08-25 Heiko Herold + + * ftp.c: Respect new option waitretry. + +2000-01-30 Damir Dzeko + + * http.c (gethttp): Send custom Referer, if required. + +1999-09-24 Charles G Waldman + + * netrc.c (parse_netrc): Allow passwords to contain spaces. + + * netrc.c (parse_netrc): New function. + +1999-09-17 Dan Berger + + * http.c (gethttp): Send it. + + * url.c (parseurl): Detect query string in HTTP URL-s. + (str_url): Print it. + +2000-03-02 HIROSE Masaaki + + * html.c (html_allow): Add and