release note
parent
e7bf425df4
commit
a9fc06a916
|
@ -1,5 +1,19 @@
|
||||||
Release Notes
|
Release Notes
|
||||||
----
|
----
|
||||||
|
*2012-9-4* `version:0.3.0`
|
||||||
|
|
||||||
|
* Change default XPath selector from HtmlCleaner to [Xsoup](https://github.com/code4craft/xsoup).
|
||||||
|
|
||||||
|
[Xsoup](https://github.com/code4craft/xsoup) is an XPath selector based on Jsoup written by me. It has much better performance than HtmlCleaner.
|
||||||
|
|
||||||
|
Time of processing a page is reduced from 7~9ms to 0.4ms.
|
||||||
|
|
||||||
|
If Xsoup is not stable for your usage, just use `Spider.xsoupOff()` to turn off it and report an issue to me!
|
||||||
|
|
||||||
|
* Add cycle retry times for Site.
|
||||||
|
|
||||||
|
When cycle retry times is set, Spider will put the url which downloading failed back to scheduler, and retry after a cycle of queue.
|
||||||
|
|
||||||
*2012-8-20* `version:0.2.1`
|
*2012-8-20* `version:0.2.1`
|
||||||
|
|
||||||
ComboExtractor support for annotation.
|
ComboExtractor support for annotation.
|
||||||
|
|
Loading…
Reference in New Issue