sjero.net Git - wget/commit
[svn] A bunch of new features:
author hniksic <devnull@localhost>
Sun, 19 Nov 2000 20:50:10 +0000 (12:50 -0800)
committer hniksic <devnull@localhost>
Sun, 19 Nov 2000 20:50:10 +0000 (12:50 -0800)
commit b0b1c815c15e49c9172f59428810713097a65e37
tree dd394cc1b524410f9e78ddd7c3ef987def940fef
parent ccf31643ab5226a81632329ddc19e1c81b5a85b6
[svn] A bunch of new features:

- use mmap() to read whole files in core instead of allocating memory
  and read()ing them.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists).

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  Very
  simple-minded: caches only the last connection to the server.
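
As an illustration of the mmap() item above, here is a minimal sketch of
reading a whole file into core with mmap(), falling back to a malloc()
and read() loop when mapping is not possible.  The names file_image,
load_file and free_file are hypothetical and chosen for this example;
they are not claimed to match the code in this commit.

```c
#include <assert.h>
#include <fcntl.h>
#include <stdlib.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/* Hypothetical type for this sketch: the bytes of one file in memory. */
struct file_image {
  char *content;   /* start of the file's bytes (not NUL-terminated) */
  long length;     /* number of valid bytes in content */
  int mmap_p;      /* nonzero if content was obtained with mmap() */
};

/* Load PATH into memory.  Returns NULL if the file cannot be opened or
   read.  Tries mmap() first and falls back to read() otherwise. */
struct file_image *load_file(const char *path)
{
  assert(path != NULL);
  int fd = open(path, O_RDONLY);
  if (fd < 0)
    return NULL;

  struct file_image *fi = malloc(sizeof *fi);
  if (fi == NULL) {
    close(fd);
    return NULL;
  }
  fi->content = NULL;
  fi->length = 0;
  fi->mmap_p = 0;

  /* A zero-length mapping is an error, so only map non-empty files. */
  struct stat st;
  if (fstat(fd, &st) == 0 && st.st_size > 0) {
    void *p = mmap(NULL, (size_t)st.st_size, PROT_READ, MAP_PRIVATE, fd, 0);
    if (p != MAP_FAILED) {
      fi->content = p;
      fi->length = (long)st.st_size;
      fi->mmap_p = 1;
      close(fd);               /* the mapping outlives the descriptor */
      return fi;
    }
  }

  /* Fallback: read into a buffer that doubles whenever it fills up. */
  size_t cap = 512, len = 0;
  char *buf = malloc(cap);
  if (buf == NULL) {
    free(fi);
    close(fd);
    return NULL;
  }
  for (;;) {
    if (len == cap) {
      char *bigger = realloc(buf, cap * 2);
      if (bigger == NULL) {
        free(buf); free(fi); close(fd);
        return NULL;
      }
      buf = bigger;
      cap *= 2;
    }
    ssize_t n = read(fd, buf + len, cap - len);
    if (n < 0) {               /* read error: give up */
      free(buf); free(fi); close(fd);
      return NULL;
    }
    if (n == 0)                /* end of file */
      break;
    len += (size_t)n;
  }
  close(fd);
  fi->content = buf;
  fi->length = (long)len;
  return fi;
}

/* Release a file_image using whichever method allocated it. */
void free_file(struct file_image *fi)
{
  if (fi == NULL)
    return;
  if (fi->mmap_p)
    munmap(fi->content, (size_t)fi->length);
  else
    free(fi->content);
  free(fi);
}
```

Recording how the buffer was obtained (mmap_p here) lets the caller
release it with a single call, without knowing which path was taken.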

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
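
The persistent-connection item in the list above describes a cache that
remembers only the last connection.  A minimal sketch of such a one-slot
cache follows, assuming a connection is identified by host name and
port.  The names pconn, pconn_register, pconn_lookup and
pconn_invalidate are hypothetical and are not claimed to match http.c.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#include <strings.h>

/* Hypothetical one-slot cache: at most one reusable connection. */
static struct {
  int active;            /* nonzero if the slot holds a connection */
  char *host;            /* host name the socket is connected to */
  unsigned short port;   /* port the socket is connected to */
  int fd;                /* the connected socket descriptor */
} pconn;

/* Forget the cached connection, e.g. after the server has closed it.
   Does not close the descriptor; that is left to the caller. */
void pconn_invalidate(void)
{
  if (pconn.active)
    free(pconn.host);
  pconn.host = NULL;
  pconn.active = 0;
}

/* Return the cached descriptor if it is connected to HOST:PORT,
   otherwise -1.  Host names are compared case-insensitively. */
int pconn_lookup(const char *host, unsigned short port)
{
  assert(host != NULL);
  if (pconn.active && pconn.port == port
      && strcasecmp(pconn.host, host) == 0)
    return pconn.fd;
  return -1;
}

/* Remember FD as the reusable connection to HOST:PORT, replacing any
   earlier entry.  Returns the evicted descriptor so the caller can
   close it, or -1 if nothing was evicted.  If memory runs out, the
   cache is simply left empty. */
int pconn_register(const char *host, unsigned short port, int fd)
{
  assert(host != NULL);
  int old_fd = pconn.active ? pconn.fd : -1;
  pconn_invalidate();
  if (old_fd == fd)
    old_fd = -1;         /* re-registering the same socket evicts nothing */

  char *copy = malloc(strlen(host) + 1);
  if (copy == NULL)
    return old_fd;
  strcpy(copy, host);

  pconn.host = copy;
  pconn.port = port;
  pconn.fd = fd;
  pconn.active = 1;
  return old_fd;
}
```

A caller would try pconn_lookup() before opening a new socket, and call
pconn_invalidate() when a write to the cached socket fails because the
server has dropped the connection.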
39 files changed:
ChangeLog
TODO
configure
configure.in
doc/ChangeLog
doc/wget.texi
po/cs.gmo
po/de.gmo
po/hr.gmo
po/it.gmo
po/no.gmo
po/pl.gmo
po/pt_BR.gmo
po/ru.gmo
src/ChangeLog
src/Makefile.in
src/config.h.in
src/connect.c
src/ftp-ls.c
src/ftp.c
src/ftp.h
src/hash.c [new file with mode: 0644]
src/hash.h [new file with mode: 0644]
src/headers.c
src/headers.h
src/host.c
src/html-parse.c [new file with mode: 0644]
src/html-parse.h [new file with mode: 0644]
src/html-url.c [new file with mode: 0644]
src/http.c
src/main.c
src/recur.c
src/retr.c
src/retr.h
src/url.c
src/url.h
src/utils.c
src/utils.h
src/wget.h