X-Git-Url: http://sjero.net/git/?p=wget;a=blobdiff_plain;f=NEWS;h=4d3a51b0785507cccf583ef303867a72014d6f11;hp=9350eb7309b999fc08959780bcbe4bc9a9685c67;hb=bcb055884bc0d78d8af37fa2823e67f53a7a9018;hpb=42a80553f62a2a3d36ef6d7c26733452c09fca46

diff --git a/NEWS b/NEWS
index 9350eb73..4d3a51b0 100644
--- a/NEWS
+++ b/NEWS
@@ -1,21 +1,295 @@
 GNU Wget NEWS -- history of user-visible changes.
 
-Copyright (C) 1997, 1998, 2000, 2001 Free Software Foundation, Inc.
+Copyright (C) 1997-2006 Free Software Foundation, Inc.
 See the end for copying conditions.
 
 Please send GNU Wget bug reports to .
 
-* Changes in Wget 1.7-dev
+* Changes in Wget 1.11.
+
+** Wget now saves HTTP downloads using file names specified by the
+`Content-Disposition' header. This is a standard way of specifying
+the file name used by many web dynamically generated pages.
+
+** The GnuTLS library is now also supported for https downloads.
+This is still work-in-progress. OpenSSL is still used by default; use
+--with-ssl=gnutls to build with GnuTLS. OpenSSL is still required for
+NTLM authorization to work, but this should eventually change.
+
+** The new option `--ignore-case' makes Wget ignore case when
+matching files, directories, and wildcards. This affects the -X, -I,
+-A, and -R options, as well as globbing in FTP URLs.
+
+** ETA projection is now displayed in "dot" progress output as well as
+in the default progress bar. (The dot progress is used by default when
+logging Wget's output to file using the `-o' option.)
+
+** The "lockable boolean" argument type is no longer supported. It
+was only used by the passive_ftp .wgetrc setting. If you're running
+broken scripts or Perl modules that unconditionally specify
+`--passive-ftp' and your firewall disallows it, you can override them
+by replacing wget with a script that execs wget "$@" --no-passive-ftp.
+
+** The source code has migrated from CVS to Subversion. The
+repository is available at http://svn.dotsrc.org/repo/wget/; to
+checkout the trunk to a directory named `wget', use something like
+`svn checkout http://svn.dotsrc.org/repo/wget/trunk/ wget'.
+
+* Changes in Wget 1.10.
+
+** Downloading files larger than 2GB, sometimes referred to as "large
+files", now works on systems that support them. This includes the
+majority of modern Unixes, as well as MS Windows.
+
+** IPv6 is now supported by Wget. Unlike the experimental code in
+1.9, this version supports dual-family systems. The new flags
+`--inet4' and `--inet6' (or `-4' and `-6' for short) force the use of
+IPv4 and IPv6 respectively. Note that IPv6 support has not yet been
+tested on Windows.
+
+** Microsoft's proprietary "NTLM" method of HTTP authentication is now
+supported. This authentication method is undocumented and only used
+by IIS. Note that *proxy* authentication is not supported in this
+release; you can only authenticate to the target web site.
+
+** Wget no longer truncates partially downloaded files when download
+has to start over because the server doesn't support Range. Instead,
+with such servers Wget now simply ignores the data up to the byte
+where the last attempt left off, and only then continues appending to
+the file. That way the downloaded file never shrinks, and download
+retries from servers without support for partial downloads work even
+when downloading to stdout.
+
+** SSL/TLS changes:
+
+*** SSL/TLS downloads now attempt to verify the server's certificate
+against the recognized certificate authorities. This requires CA
+certificates to have been installed in a location visible to the
+OpenSSL library. If this is not the case, you can get the bundle
+yourself from a source you trust (for example, the bundle extracted
+from Mozilla available at http://curl.haxx.se/docs/caextract.html),
+and point Wget to the PEM file using the `--ca-certificate'
+command-line option or the corresponding `.wgetrc' command.
+
+*** Secure downloads now verify that the host name in the URL matches
+the "common name" in the certificate presented by the server.
+
+*** Although the above checks provide more secure downloads, they
+unavoidably break interoperability with some sites that worked with
+previous versions, particularly those using self-signed, expired, or
+otherwise invalid certificates. If you encounter "certificate
+verification" errors or complaints that "common name doesn't match
+requested host name" and are convinced of the site's authenticity, you
+can use `--no-check-certificate' to bypass both checks.
+
+*** Talking to SSL/TLS servers over proxies now actually works.
+Previous versions of Wget erroneously sent GET requests for https
+URLs. Wget 1.10 utilizes the CONNECT method designed for this
+purpose.
+
+*** The SSL/TLS-related options have been redesigned and, for the
+first time, documented in the manual. The old, undocumented, options
+are no longer supported.
+
+** Passive FTP is now the default FTP transfer mode. Use
+`--no-passive-ftp' or specify `passive_ftp = off' in your init file to
+revert to the old behavior.
+
+** The `--header' option can now be used to override generated
+headers. For example, `wget --header="Host: foo.bar"
+http://127.0.0.1' tells Wget to connect to localhost, but to specify
+"foo.bar" in the `Host' header. In previous versions such use of
+`--header' lead to duplicate headers in HTTP requests.
+
+** The responses without headers, aka "HTTP 0.9" responses, are
+detected and handled. Although HTTP 0.9 has long been obsolete, it is
+still occasionally used, sometimes by accident.
+
+** The progress bar is now updated regularly even when the data does
+not arrive from the network.
+
+** Wget no longer preserves permissions of files retrieved by FTP by
+default. Anonymous FTP servers frequently use permissions like "664",
+which might not be what the user wants. The new option
+`--preserve-permissions' and the corresponding `.wgetrc' variable can
+be used to revert to the old behavior.
+
+** The new option `--protocol-directories' instructs Wget to also use
+the protocol name as a directory component of local file names.
+
+** Options that previously unconditionally set or unset various flags
+are now boolean options that can be invoked as either `--OPTION' or
+`--no-OPTION'. Options that required an argument "on" or "off" have
+also been changed this way, but they still accept the old syntax for
+backward compatibility. For example, instead of `--glob=off' you can
+write `--no-glob'.
+
+Allowing `--no-OPTION' for every `--OPTION' and the other way around
+is useful because it allows the user to override non-default behavior
+specified via `.wgetrc'.
+
+** The new option `--keep-session-cookies' causes `--save-cookies' to
+save session cookies (normally only kept in memory) along with the
+permanent ones. This is useful because many sites track important
+information, such as whether the user has authenticated, in session
+cookies. With this option multiple Wget runs are treated as a single
+browser session.
+
+** Wget now supports the --ftp-user and --ftp-password command
+switches to set username and password for FTP, and the --user and
+--password command switches to set username and password for both FTP
+and HTTP. The --http-passwd and --proxy-passwd command switches have
+been renamed to --http-password and --proxy-password respectively, and
+the related http_passwd and proxy_passwd .wgetrc commands to
+http_password and proxy_password respectively. The login and passwd
+.wgetrc commands have been deprecated.
+
+* `wget -b' now works correctly under Windows.
+
+* Wget 1.9.1 is a bugfix release with no user-visible changes.
+
+* Changes in Wget 1.9.
+
+** It is now possible to specify that POST method be used for HTTP
+requests. For example, `wget --post-data="id=foo&data=bar" URL' will
+send a POST request with the specified contents.
+
+** IPv6 support is available, although it's still experimental.
+
+** The `--timeout' option now also affects DNS lookup and establishing
+the TCP connection. Previously it only affected reading and writing
+data. Those three timeouts can be set separately using
+`--dns-timeout', `--connection-timeout', and `--read-timeout',
+respectively.
+
+** Download speed shown by the progress bar is based on the data
+recently read, rather than the average speed of the entire download.
+The ETA projection is still based on the overall average.
+
+** It is now possible to connect to FTP servers through FWTK
+firewalls. Set ftp_proxy to an FTP URL, and Wget will automatically
+log on to the proxy as "username@host".
+
+** The new option `--retry-connrefused' makes Wget retry downloads
+even in the face of refused connections, which are otherwise
+considered a fatal error.
+
+** The new option `--no-dns-cache' may be used to prevent Wget from
+caching DNS lookups.
+
+** Wget no longer escapes characters in local file names based on
+whether they're appropriate in URLs. Escaping can still occur for
+nonprintable characters or for '/', but no longer for frequent
+characters such as space. You can use the new option
+--restrict-file-names to relax or strengthen these rules, which can be
+useful if you dislike the default or if you're downloading to
+non-native partitions.
+
+** Handling of HTML comments has been dumbed down to conform to what
+users expect and other browsers do: instead of being treated as SGML
+declaration, a comment is terminated at the first occurrence of "-->".
+Use `--strict-comments' to revert to the old behavior.
+
+** Wget now correctly handles relative URIs that begin with "//", such
+as "//img.foo.com/foo.jpg".
+
+** Boolean options in `.wgetrc' and on the command line now accept
+values "yes" and "no" along with the traditional "on" and "off".
+
+** It is now possible to specify decimal values for timeouts, waiting
+periods, and download rate. For instance, `--wait=0.5' now works as
+expected, as does `--dns-timeout=0.5' and even `--limit-rate=2.5k'.
+
+* Wget 1.8.2 is a bugfix release with no user-visible changes.
+
+* Wget 1.8.1 is a bugfix release with no user-visible changes.
+
+* Changes in Wget 1.8.
+
+** A new progress indicator is now available and used by default.
+You can choose the progress bar type with `--progress=TYPE'. Two
+types are available, "bar" (the new default), and "dot" (the old
+dotted indicator). You can permanently revert to the old progress
+indicator by putting `progress = dot' in your `.wgetrc'.
+
+** You can limit the download rate of the retrieval using the
+`--limit-rate' option. For example, `wget --limit-rate=15k URL' will
+tell Wget not to download the body of the URL faster than 15 kilobytes
+per second.
 
-** Wget can now recognize FTP directory listings from some non-Unix
-servers. Currently Windows NT is supported.
+** Recursive retrieval and link conversion have been revamped:
+
+*** Wget now traverses links breadth-first. This makes the
+calculation of depth much more reliable than before. Also, recursive
+downloads are faster and consume *significantly* less memory than
+before.
+
+*** Links are converted only when the entire retrieval is complete.
+This is the only safe thing to do, as only then is it known what URLs
+have been downloaded.
+
+*** BASE tags are handled correctly when converting links. Since Wget
+already resolves when resolving handling URLs, link
+conversion now makes the BASE tags point to an empty string.
+
+*** HTML anchors are now handled correctly. Links to an anchor in the
+same document (), which used to confuse Wget,
+are now converted correctly.
+
+*** When in page-requisites (-p) mode, no-parent (-np) is ignored when
+retrieving for inline images, stylesheets, and other documents needed
+to display the page.
+
+*** Page-requisites (-p) mode now works with frames. In other words,
+`wget -p URL-THAT-USES-FRAMES' will now download the frame HTML files,
+and all the files that they need to be displayed properly.
+
+** `--base' now works conjunction with `--input-file', providing a
+base for each URL and thereby allowing the URLs in the file to be
+relative.
+
+** If a host has more than one IP address, Wget uses the other
+addresses when accessing the first one fails.
+
+** Host directories now contain port information if the URL is at a
+non-standard port.
+
+** Wget now supports the robots.txt directives specified in
+.
+
+** URL parser has been fixed, especially the infamous overzealous
+quoting. Wget no longer dequotes reserved characters, e.g. `%3F' is
+no longer translated to `?', nor `%2B' to `+'. Unsafe characters
+which are not reserved are still escaped, of course.
+
+** No more than 20 successive redirections are allowed.
+
+* Wget 1.7.1 is a bugfix release with no user-visible changes.
+
+* Changes in Wget 1.7.
+
+** SSL (`https') pages now work if you compile Wget with SSL support;
+use the `--with-ssl' configure flag. You need to have OpenSSL
+installed.
+
+** Cookies are now supported. Wget will accept cookies sent by the
+server and return them in later requests. Additionally, it can load
+and save cookies to disk, in the same format that Netscape uses.
+
+** "Keep-alive" (persistent) HTTP connections are now supported.
+Using keep-alive allows Wget to share one TCP/IP connection for
+many retrievals, making multiple-file downloads faster and less
+stressing for the server and the network.
+
+** Wget now recognizes FTP directory listings generated by NT and VMS
+servers.
 
 ** It is now possible to recurse through FTP sites where logging in
 puts you in some directory other than '/'.
 
-** "Keep-Alive" connections are now supported.
-
-** https (SSL) URLs are now supported if you link wget with OpenSSL.
+** You may now use `~' to mean home directory in `.wgetrc'. For
+example, `load_cookies = ~/.netscape/cookies.txt' works as you would
+expect.
 
 ** The HTML parser has been rewritten. The new one works more
 reliably, allows finer-grained control over which tags and attributes
@@ -29,8 +303,10 @@ also more general.
 where appropriate. This results in huge speedups when retrieving
 large sites (thousands of documents).
 
-** Wget now mmaps the files it needs to read instead of allocating
-heap memory to read them.
+** Wget now has a man page, automatically generated from the Texinfo
+documentation. (The last version that shipped with a man page was
+1.4.5). To get this, you need to have pod2man from the Perl
+distribution installed on your system.
 
 * Changes in Wget 1.6
 
@@ -303,7 +579,7 @@ geturl -vo log http://fly.cc.fer.hr/
 ----------------------------------------------------------------------
 Copyright information:
 
-Copyright (C) 1997, 1998, 2000 Free Software Foundation, Inc.
+Copyright (C) 1997-2005 Free Software Foundation, Inc.
 
 Permission is granted to anyone to make or distribute verbatim copies
 of this document as received, in any medium, provided that