X-Git-Url: http://sjero.net/git/?p=wget;a=blobdiff_plain;f=TODO;h=d55a97b75990ba6fba72d03c23f1ccc5d95f8c72;hp=a1b0f4dc5a0cf78bcb685cd0f24315bed48459b4;hb=d9dd14a995a118806148673b9b60b8569a2e13c7;hpb=737daec8e6bd3e0fa02d8e9f639074b1df5b7980 diff --git a/TODO b/TODO index a1b0f4dc..d55a97b7 100644 --- a/TODO +++ b/TODO @@ -3,8 +3,25 @@ This is the to-do list for Wget. There is no timetable of when we plan to implement these features -- this is just a list of things it'd be nice to see in Wget. Patches to implement any of these items would be gladly accepted. The -items are not listed in any particular order. Not all of them represent -user-visible changes. +items are not listed in any particular order (except that recently-added items +may tend towards the top). Not all of these represent user-visible changes. + +* When spanning hosts, there's no way to say that you are only interested in + files in a certain directory on _one_ of the hosts (-I and -X apply to all). + Perhaps -I and -X should take an optional hostname before the directory? + +* When simplifying paths, wget needs to stop at any '?' character. Also, wget + behaves differently from popular browsers in removing './'s from abs. URLs. + +* Add an option to not encode special characters like ' ' and '~' when saving + local files. Would be good to have a mode that encodes all special characters + (as now), one that encodes none (as above), and one that only encodes a + character if it was encoded in the original URL (e.g. %20 but not %7E). + +* --retr-symlinks should cause wget to traverse links to directories too. + +* Lots of noncompliant webservers issue HTTP redirects to relative URLs, and + browsers follow them, so wget should too. * Make wget return non-zero status in more situations, like incorrect HTTP auth. @@ -16,14 +33,13 @@ user-visible changes. * Make -K compare X.orig to X and move the former on top of the latter if they're the same, rather than leaving identical .orig files laying around. -* Add an option to save all text/html files with a .html extension so that when - grabbing the output of a dynamically-generated remote page, you'll end up with - a filename that will cause _your_ webserver to realize the saved static HTML - file isn't text/plain (or an illegal CGI script in the case of *.cgi). - * Allow mirroring of FTP URLs where logging in puts you somewhere else besides '/'. +* If CGI output is saved to a file, e.g. cow.cgi?param, -k needs to change the + '?' to a "%3F" in links to that file to avoid passing part of the filename as + a parameter. + * Make `-k' convert too. * Make `-k' check for files that were downloaded in the past and convert links @@ -62,7 +78,8 @@ user-visible changes. * Fix Unix directory parser to allow for spaces in file names. -* Allow size limit to files. +* Allow size limit to files (perhaps with an option to download oversize files + up through the limit or not at all, to get more functionality than [u]limit. * Recognize HTML comments correctly. Add more options for handling bogus HTML found all over the 'net. @@ -71,11 +88,9 @@ user-visible changes. * Download to .in* when mirroring. -* Add an option to delete or move no-longer-existent files when - mirroring. +* Add an option to delete or move no-longer-existent files when mirroring. -* Implement a switch to avoid downloading multiple files (e.g. x and - x.gz). +* Implement a switch to avoid downloading multiple files (e.g. x and x.gz). * Implement uploading (--upload URL?) in FTP and HTTP.