Commit Graph

275 Commits (884f51ba3bf336cc79b1487ca4faef644fe4bd76)

Author SHA1 Message Date
yihua.huang cf62d707e0 #36 Spider does not exit when success 2013-11-27 23:33:18 +08:00
yihua.huang a01312930a #39 Parsing html after page.getHtml() 2013-11-27 22:01:34 +08:00
yihua.huang f9daae39cf [maven-release-plugin] prepare for next development iteration 2013-11-11 14:33:11 +08:00
yihua.huang fdb9441519 [maven-release-plugin] prepare release webmagic-0.4.0 2013-11-11 14:33:01 +08:00
yihua.huang 1d75ae7f5b rollback version to 0.4.0 because not deploy success 2013-11-11 11:52:56 +08:00
yihua.huang b838c4e433 #34 Close reader in FileCacheQueueScheduler 2013-11-08 14:59:09 +08:00
yihua.huang 775eb9732f [maven-release-plugin] prepare for next development iteration 2013-11-06 22:17:58 +08:00
yihua.huang 0b4fadc24d [maven-release-plugin] prepare release webmagic-0.4.0 2013-11-06 22:17:47 +08:00
yihua.huang fd6d2fd6f8 try to keepalive TCP connection 2013-11-06 21:19:14 +08:00
yihua.huang 425df08523 update version to 0.4.0 2013-11-06 12:50:45 +08:00
yihua.huang e046bb0723 remove useless code 2013-11-06 12:48:14 +08:00
yihua.huang 6e32a19f80 update api for direct download 2013-11-06 12:46:50 +08:00
yihua.huang 807aefe9df change EntityUtil to IOUtil because some encoding error 2013-11-06 07:37:34 +08:00
yihua.huang 8f774afc84 add direct download 2013-11-06 06:41:04 +08:00
yihua.huang 2e496402dc add more warning for 0.3.3 2013-10-24 13:16:48 +08:00
yihua.huang 1a2c84ea78 #27 add timeout config to site 2013-10-11 07:36:16 +08:00
yihua.huang 3b00190f99 api without implementation for #28: add specific url crawl 2013-10-10 00:40:44 +08:00
yihua.huang 4acbc19cee [maven-release-plugin] prepare for next development iteration 2013-09-23 13:12:32 +08:00
yihua.huang cc3b787991 [maven-release-plugin] prepare release webmagic-0.3.2 2013-09-23 13:12:19 +08:00
yihua.huang 6f18eec77e fix a test error 2013-09-23 13:07:33 +08:00
yihua.huang b131878123 add example 2013-09-23 13:01:28 +08:00
yihua.huang 95ab4edec3 some bugfix 2013-09-23 08:38:54 +08:00
yihua.huang 250cc5e662 change formatter to class 2013-09-23 08:17:21 +08:00
yihua.huang b18216245b add type convert 2013-09-23 07:53:33 +08:00
yihua.huang fb693a4ac4 [maven-release-plugin] prepare for next development iteration 2013-09-08 22:25:07 +08:00
yihua.huang bfaaa042b9 [maven-release-plugin] prepare release webmagic-parent-0.3.1 2013-09-08 22:24:48 +08:00
yihua.huang d7c7a78177 complete test cases 2013-09-08 22:19:02 +08:00
yihua.huang c17a31a21d fix null pointe exception #26 2013-09-08 21:09:49 +08:00
yihua.huang e7bf425df4 [maven-release-plugin] prepare for next development iteration 2013-09-04 10:51:01 +08:00
yihua.huang 77ff252316 [maven-release-plugin] prepare release webmagic-0.3.0 2013-09-04 10:50:50 +08:00
yihua.huang d141541ef3 add retry 2013-09-04 09:57:19 +08:00
yihua.huang aefd0569a5 update version 2013-09-04 09:36:56 +08:00
yihua.huang 194518fd82 add switch 2013-09-04 08:21:34 +08:00
yihua.huang 326b97c65a update 2013-09-04 00:15:54 +08:00
yihua.huang d7cd9e5747 update pom 2013-09-02 11:56:01 +08:00
yihua.huang 478ace7e97 add FilePageModelPipeline 2013-08-22 07:29:18 +08:00
yihua.huang ad66d33f38 [maven-release-plugin] prepare for next development iteration 2013-08-20 23:39:59 +08:00
yihua.huang 9dc6b11954 [maven-release-plugin] prepare release webmagic-parent-0.2.1 2013-08-20 23:37:55 +08:00
yihua.huang 4f62dfc8a4 release 2013-08-20 23:37:20 +08:00
yihua.huang 74c940c758 [maven-release-plugin] prepare for next development iteration 2013-08-20 23:19:58 +08:00
yihua.huang a4bb4e3429 [maven-release-plugin] prepare release webmagic-parent-0.2.1 2013-08-20 23:19:27 +08:00
yihua.huang 194f16aa75 update 2013-08-20 23:16:43 +08:00
yihua.huang 09ffd468c0 fix comments 2013-08-20 22:53:16 +08:00
yihua.huang c70ed57025 remove PriorityScheduler to core 2013-08-20 21:55:58 +08:00
yihua.huang 7003426898 update pom 2013-08-20 21:52:39 +08:00
yihua.huang 606417fdc7 update pom 2013-08-19 09:55:49 +08:00
yihua.huang d460e136ef update version 2013-08-19 09:52:15 +08:00
yihua.huang c79d6ecf09 complete all comments 2013-08-17 23:30:49 +08:00
yihua.huang 5073258237 closable 2013-08-17 21:19:24 +08:00
yihua.huang 5f1f4cbc46 update comments 2013-08-17 20:41:29 +08:00
yihua.huang 6cc1d62a08 bugfix: rawhtml do not work 2013-08-17 19:42:51 +08:00
yihua.huang a994b1c9fd complete extension comments in en 2013-08-17 19:35:45 +08:00
yihua.huang c59c1fe80d update comments 2013-08-17 19:19:27 +08:00
yihua.huang 59aad6a7f4 comments in english 2013-08-17 18:33:05 +08:00
yihua.huang e566a53936 update ignore test 2013-08-17 18:13:13 +08:00
yihua.huang 1148450ff9 update filecache to more useful 2013-08-17 18:12:47 +08:00
yihua.huang 3ba7a76f44 add combo extract to replace Extract2 Extract3... 2013-08-17 17:23:11 +08:00
yihua.huang 5cb45af3a4 +doc 2013-08-17 12:10:34 +08:00
yihua.huang a339e4ab5c add jsonpathselector 2013-08-12 13:36:44 +08:00
yihua.huang 9e82256ce3 update docs 2013-08-12 10:08:20 +08:00
yihua.huang f21097421b add new constructor to redisscheduler 2013-08-11 18:53:13 +08:00
yihua.huang 0f2c5b5723 update redisscheduler 2013-08-11 18:28:12 +08:00
yihua.huang 19229dd855 add JsonFilePageModelPipeline 2013-08-10 08:27:14 +08:00
yihua.huang 21eca688e9 complete docs 2013-08-09 20:56:33 +08:00
yihua.huang 17d2d98cec remove invalid @date 2013-08-09 20:43:06 +08:00
yihua.huang fcfa2c30c7 complete docs 2013-08-09 20:36:27 +08:00
yihua.huang c78de7bcbb update notnull default to false 2013-08-08 13:10:05 +08:00
yihua.huang 521fbad987 move xpath2.0 support to seperate package 2013-08-07 23:21:28 +08:00
yihua.huang 268bd8d0c4 remove saxon to extension 2013-08-07 23:04:10 +08:00
yihua.huang f1573b40a2 set selenium dep to seperate package 2013-08-07 14:44:52 +08:00
yihua.huang cff943f698 fix path format error 2013-08-07 13:05:12 +08:00
yihua.huang 36384246b5 update package structure 2013-08-07 12:51:21 +08:00
yihua.huang 5ef231a768 update version 2013-08-07 12:48:32 +08:00
yihua.huang 570533cce5 update readme 2013-08-07 09:45:38 +08:00
yihua.huang 0c8599e3b2 update packages structure 2013-08-06 23:17:07 +08:00