630 Commits (41c2ea94984ed19120d9ef0abe2ef1d0b93135fd)

Author SHA1 Message Date
Yihua Huang 19843c063b Merge pull request #57 from d0ngw/master: Fix chrome driver can't quit 2014-02-17 07:17:45 +08:00
d0ngw 4fc89485b6 Fix chrome driver can't quit 2014-02-16 22:04:12 +08:00
yihua.huang 0e98183f74 Change log4j to slf4j #55 2014-02-12 09:35:57 +08:00
yihua.huang fa33b15843 property loader 2014-02-11 23:07:31 +08:00
yihua.huang 362fdd0662 Merge branch 'master' of github.com:code4craft/webmagic 2014-02-11 22:23:56 +08:00
yihua.huang af809c4d55 update version to 0.5.0-snapshot 2014-02-11 22:16:01 +08:00
Yihua Huang 7e8a5c7dd2 Merge pull request #54 from bitdeli-chef/master: Add a Bitdeli Badge to README 2014-01-21 23:00:07 -08:00
Bitdeli Chef 6cade5ddf3 Add a Bitdeli badge to README 2014-01-22 07:03:10 +00:00
Yihua Huang 31ff4cc404 Merge pull request #53 from xuchaoo/master: Fix fileCursor in FileCacheQueueScheduler not being initialized when the file is reopened, which throws NullPointerExceptio... 2014-01-08 17:49:44 -08:00
jon a722f9bb66 Fix the NullPointerException thrown because fileCursor in FileCacheQueueScheduler is not initialized when the file is reopened 2014-01-08 21:24:58 +08:00
Yihua Huang 090827f124 Merge pull request #52 from d0ngw/master: The SeleniumDownloader should call the setRawText 2013-12-26 19:30:31 -08:00
d0ngw a5a9b141b3 The SeleniumDownloader should call the setRawText 2013-12-27 11:09:04 +08:00
yihua.huang 6933029ea5 update modules 2013-12-25 18:48:31 +08:00
Yihua Huang 5d9fda0614 Merge pull request #51 from code4craft/test: Update RegexSelector.java 2013-12-20 16:43:49 -08:00
Almark Ming 2b46b11e55 Update RegexSelector.java: Optimize regex format check (Conflicts: webmagic-core/src/main/java/us/codecraft/webmagic/selector/RegexSelector.java) 2013-12-21 08:38:17 +08:00
yihua.huang d9a15ad66a 404 page 2013-12-21 07:56:58 +08:00
yihua.huang 12a6390cbd update spring4 configuration 2013-12-18 01:02:59 +08:00
yihua.huang 7fd27d9b22 add spring 4 mvc for worker 2013-12-17 23:57:23 +08:00
yihua.huang 31fb0048a1 add worker 2013-12-07 00:37:07 +08:00
yihua.huang b51fb2696b update ut for cookie 2013-12-06 00:30:01 +08:00
yihua.huang ff2f588c41 #48 nullpointer exception 2013-12-04 22:11:20 +08:00
yihua.huang ac516f9b0e update version in docs 2013-12-03 23:46:31 +08:00
yihua.huang cc241bc0f2 update versions 2013-12-03 23:40:47 +08:00
yihua.huang d274310cb2 [maven-release-plugin] prepare for next development iteration 2013-12-03 23:35:06 +08:00
yihua.huang e8c32a32dc [maven-release-plugin] prepare release webmagic-0.4.2 2013-12-03 23:34:57 +08:00
yihua.huang 93cb4308ef update pom 2013-12-03 23:29:57 +08:00
yihua.huang 6a828e923c #46 Downloader thread hang up when timeout 2013-12-03 09:59:54 +08:00
yihua.huang 486d9d276f #45 Remove multi in ExtractBy 2013-11-28 18:23:51 +08:00
yihua.huang aaa53f58c7 Merge branch 'master' of github.com:code4craft/webmagic 2013-11-28 16:27:49 +08:00
yihua.huang 07bcb06a3f update version in readme 2013-11-28 16:27:36 +08:00
Yihua Huang 4f53b07e47 Merge pull request #44 from supermicah/master: Fetch the httpClient again inside the double-check 2013-11-27 23:28:20 -08:00
shijinping 9a524aa364 Fetch the httpClient again inside the double-check 2013-11-28 14:38:30 +08:00
yihua.huang cb84220d7a update version 2013-11-28 13:37:10 +08:00
yihua.huang e7083dc39d [maven-release-plugin] prepare for next development iteration 2013-11-28 13:04:32 +08:00
yihua.huang ae623567b3 [maven-release-plugin] prepare release webmagic-0.4.1 2013-11-28 13:04:22 +08:00
yihua.huang 7c43b5146e scripts readme 2013-11-28 12:04:05 +08:00
yihua.huang 633e0fe834 document for avalon 2013-11-28 11:39:19 +08:00
yihua.huang 18a3af4a0a add more sample for jsonpath #42 2013-11-28 09:58:22 +08:00
yihua.huang 59ad4cad27 #42 Add jsonpath in annotation mode for json result 2013-11-28 08:25:16 +08:00
yihua.huang c2d6d495b3 #41 add getThreadAlive(),getStatus,getPageCount() to spider 2013-11-28 07:59:24 +08:00
yihua.huang cf62d707e0 #36 Spider does not exit when success 2013-11-27 23:33:18 +08:00
yihua.huang a01312930a #39 Parsing html after page.getHtml() 2013-11-27 22:01:34 +08:00
yihua.huang f63d33b457 update some comments 2013-11-27 21:06:53 +08:00
yihua.huang 04fcf3193f #38 Change algorithm of SmartContentSelector 2013-11-23 13:56:55 +08:00
yihua.huang 296a68920e fix javadoc and add setPipelines() for spider 2013-11-14 13:23:29 +08:00
yihua.huang 948fa094b0 remove console pipeling 2013-11-12 13:19:36 +08:00
yihua.huang 4479428277 add multithread support 2013-11-12 13:11:31 +08:00
yihua.huang b5f2498c99 remove nativeobject for rhino 2013-11-12 11:55:56 +08:00
yihua.huang 6e6b3cc896 add more status code for check 2013-11-12 11:52:34 +08:00
yihua.huang 47a0360783 #35 add status code to page 2013-11-12 11:51:34 +08:00