sjero.net Git - wget/blobdiff - src/ChangeLog
[svn] Don't descend into HTML that was downloaded by following <img src=...>
index e6fb559dcfd9b384d3e195bfd5347843c52f29ff..a554f15070a62347411876fa61c8490fbf19f451 100644
@@ -1,3 +1,40 @@
+2003-10-10  Hrvoje Niksic  <hniksic@xemacs.org>
+
+       * recur.c (retrieve_tree): Don't descend into documents that are
+       not expected to contain HTML, regardless of their content-type.
+
+       * html-url.c (tag_url_attributes): Record which attributes are
+       supposed to yield HTML links that can be followed.
+       (tag_find_urls): Propagate that information to the caller through
+       struct urlpos.
+
+2003-10-10  Hrvoje Niksic  <hniksic@xemacs.org>
+
+       * hash.c (find_mapping): Return the next available mapping when
+       the key is not found, not NULL.
+       (hash_table_put): Use find_mapping to find the storage for the new
+       data.
+       (hash_table_put): Grow the table before exceeding maximum
+       fullness, not afterwards.
+
+2003-10-10  Hrvoje Niksic  <hniksic@xemacs.org>
+
+       * hash.c (hash_table_new): Slightly change the meaning of the
+       first parameter.  Instead of being the minimum initial size, it is
+       now the minimum number of items that the hash table can take
+       without needing to resize.
+
+2003-10-09  Hrvoje Niksic  <hniksic@xemacs.org>
+
+       * html-url.c (init_interesting): Initialize interesting_tags and
+       interesting_attributes as hash tables.  This simplifies the code
+       immensely because hash tables handle allocation and remove
+       duplicates automatically.
+       (find_tag): Removed.
+       (collect_tags_mapper): Instead of calling find_tag, simply get the
+       entry from interesting_tags hash table, which is both simpler and
+       faster.
+
 2003-10-09  Hrvoje Niksic  <hniksic@xemacs.org>
 
        * hash.c (hash_table_get): Declare hash-table argument as const.
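
The hash.c entries above describe three related ideas: a lookup helper that returns the slot where a missing key would be stored instead of NULL, a put routine that reuses that helper and grows the table before the new entry would exceed maximum fullness, and a constructor whose argument is the number of items the table can take without resizing. The following is a minimal sketch of those ideas using linear probing. It is not the code from wget's hash.c: every name here (struct table, find_slot, table_put, table_new, MAX_FULLNESS and its value of 0.75) is hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical names and constants; this is NOT wget's hash.c. */
#define MAX_FULLNESS 0.75

struct slot { const char *key; int value; };  /* key == NULL: empty slot */
struct table { struct slot *slots; int size; int count; };

static unsigned long hash_string(const char *s)
{
  unsigned long h = 5381;
  while (*s)
    h = h * 33 + (unsigned char) *s++;
  return h;
}

static struct slot *alloc_slots(int n)
{
  struct slot *s = malloc((size_t) n * sizeof *s);
  int i;
  for (i = 0; s != NULL && i < n; i++)
    s[i].key = NULL;
  return s;
}

/* Return the slot holding KEY or, when KEY is absent, the empty slot
   where it would be stored.  Never returns NULL; relies on the table
   always having at least one empty slot.  */
static struct slot *find_slot(const struct table *t, const char *key)
{
  int i = (int) (hash_string(key) % (unsigned long) t->size);
  assert(t->count < t->size);
  while (t->slots[i].key != NULL && strcmp(t->slots[i].key, key) != 0)
    i = (i + 1) % t->size;
  return &t->slots[i];
}

/* ITEMS is the number of entries the table can take without resizing,
   not the number of slots.  */
struct table *table_new(int items)
{
  struct table *t = malloc(sizeof *t);
  if (t == NULL)
    return NULL;
  t->size = (int) (items / MAX_FULLNESS) + 1;
  t->count = 0;
  t->slots = alloc_slots(t->size);
  if (t->slots == NULL) { free(t); return NULL; }
  return t;
}

/* Double the slot array and re-insert every entry.  */
static int grow(struct table *t)
{
  struct slot *old = t->slots;
  int old_size = t->size, i;
  struct slot *fresh = alloc_slots(old_size * 2);
  if (fresh == NULL)
    return 0;
  t->slots = fresh;
  t->size = old_size * 2;
  for (i = 0; i < old_size; i++)
    if (old[i].key != NULL)
      *find_slot(t, old[i].key) = old[i];
  free(old);
  return 1;
}

/* Store VALUE under KEY.  The caller keeps ownership of KEY.  */
int table_put(struct table *t, const char *key, int value)
{
  struct slot *s = find_slot(t, key);
  if (s->key != NULL) { s->value = value; return 1; }  /* overwrite */
  /* Grow BEFORE the new entry would exceed maximum fullness, then
     probe again because the slot positions have changed.  */
  if (t->count + 1 > t->size * MAX_FULLNESS)
    {
      if (!grow(t))
        return 0;
      s = find_slot(t, key);
    }
  s->key = key;
  s->value = value;
  t->count++;
  return 1;
}

/* Return 1 and set *VALUE if KEY is present, 0 otherwise.  */
int table_get(const struct table *t, const char *key, int *value)
{
  const struct slot *s = find_slot(t, key);
  if (s->key == NULL)
    return 0;
  *value = s->value;
  return 1;
}
```

In this sketch a single probing routine serves both lookup and insertion, so table_put needs no separate search for free storage. Growing before the insert keeps at least one slot empty at all times, which is what lets find_slot terminate without a bounds check.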