Commit Graph

62 Commits (90447bff027e1d03794d4d794a7e40e3a85596dd)

Author SHA1 Message Date
yihua.huang 6c11718566 Clean project structure #70 2014-03-14 23:24:38 +08:00
yihua.huang 0e98183f74 Change log4j to slf4j #55 2014-02-12 09:35:57 +08:00
yihua.huang fa33b15843 property loader 2014-02-11 23:07:31 +08:00
yihua.huang 362fdd0662 Merge branch 'master' of github.com:code4craft/webmagic 2014-02-11 22:23:56 +08:00
yihua.huang af809c4d55 update version to 0.5.0-snapshot 2014-02-11 22:16:01 +08:00
jon a722f9bb66 Fix NullPointerException thrown because fileCursor in FileCacheQueueScheduler was not initialized when the file was reopened 2014-01-08 21:24:58 +08:00
yihua.huang 486d9d276f #45 Remove multi in ExtractBy 2013-11-28 18:23:51 +08:00
yihua.huang 18a3af4a0a add more sample for jsonpath #42 2013-11-28 09:58:22 +08:00
yihua.huang 59ad4cad27 #42 Add jsonpath in annotation mode for json result 2013-11-28 08:25:16 +08:00
yihua.huang cf62d707e0 #36 Spider does not exit when success 2013-11-27 23:33:18 +08:00
yihua.huang a01312930a #39 Parsing html after page.getHtml() 2013-11-27 22:01:34 +08:00
yihua.huang b838c4e433 #34 Close reader in FileCacheQueueScheduler 2013-11-08 14:59:09 +08:00
yihua.huang fd6d2fd6f8 try to keepalive TCP connection 2013-11-06 21:19:14 +08:00
yihua.huang e046bb0723 remove useless code 2013-11-06 12:48:14 +08:00
yihua.huang 6e32a19f80 update api for direct download 2013-11-06 12:46:50 +08:00
yihua.huang 807aefe9df change EntityUtil to IOUtil because some encoding error 2013-11-06 07:37:34 +08:00
yihua.huang 8f774afc84 add direct download 2013-11-06 06:41:04 +08:00
yihua.huang 2e496402dc add more warning for 0.3.3 2013-10-24 13:16:48 +08:00
yihua.huang 1a2c84ea78 #27 add timeout config to site 2013-10-11 07:36:16 +08:00
yihua.huang 3b00190f99 api without implementation for #28: add specific url crawl 2013-10-10 00:40:44 +08:00
yihua.huang 6f18eec77e fix a test error 2013-09-23 13:07:33 +08:00
yihua.huang b131878123 add example 2013-09-23 13:01:28 +08:00
yihua.huang 95ab4edec3 some bugfix 2013-09-23 08:38:54 +08:00
yihua.huang 250cc5e662 change formatter to class 2013-09-23 08:17:21 +08:00
yihua.huang b18216245b add type convert 2013-09-23 07:53:33 +08:00
yihua.huang d7c7a78177 complete test cases 2013-09-08 22:19:02 +08:00
yihua.huang c17a31a21d fix null pointer exception #26 2013-09-08 21:09:49 +08:00
yihua.huang d141541ef3 add retry 2013-09-04 09:57:19 +08:00
yihua.huang aefd0569a5 update version 2013-09-04 09:36:56 +08:00
yihua.huang 194518fd82 add switch 2013-09-04 08:21:34 +08:00
yihua.huang 326b97c65a update 2013-09-04 00:15:54 +08:00
yihua.huang d7cd9e5747 update pom 2013-09-02 11:56:01 +08:00
yihua.huang 478ace7e97 add FilePageModelPipeline 2013-08-22 07:29:18 +08:00
yihua.huang 09ffd468c0 fix comments 2013-08-20 22:53:16 +08:00
yihua.huang c70ed57025 remove PriorityScheduler to core 2013-08-20 21:55:58 +08:00
yihua.huang 7003426898 update pom 2013-08-20 21:52:39 +08:00
yihua.huang c79d6ecf09 complete all comments 2013-08-17 23:30:49 +08:00
yihua.huang 5073258237 closable 2013-08-17 21:19:24 +08:00
yihua.huang 5f1f4cbc46 update comments 2013-08-17 20:41:29 +08:00
yihua.huang 6cc1d62a08 bugfix: rawhtml do not work 2013-08-17 19:42:51 +08:00
yihua.huang a994b1c9fd complete extension comments in en 2013-08-17 19:35:45 +08:00
yihua.huang c59c1fe80d update comments 2013-08-17 19:19:27 +08:00
yihua.huang 59aad6a7f4 comments in english 2013-08-17 18:33:05 +08:00
yihua.huang e566a53936 update ignore test 2013-08-17 18:13:13 +08:00
yihua.huang 1148450ff9 update filecache to more useful 2013-08-17 18:12:47 +08:00
yihua.huang 3ba7a76f44 add combo extract to replace Extract2 Extract3... 2013-08-17 17:23:11 +08:00
yihua.huang 5cb45af3a4 +doc 2013-08-17 12:10:34 +08:00
yihua.huang a339e4ab5c add jsonpathselector 2013-08-12 13:36:44 +08:00
yihua.huang 9e82256ce3 update docs 2013-08-12 10:08:20 +08:00
yihua.huang f21097421b add new constructor to redisscheduler 2013-08-11 18:53:13 +08:00