Commit Graph

846 Commits (407fbb613080102e3658727955d8731e22de3722)

Author SHA1 Message Date
Yihua Huang 5d9fda0614 Merge pull request #51 from code4craft/test
Update RegexSelector.java
2013-12-20 16:43:49 -08:00
Almark Ming 2b46b11e55 Update RegexSelector.java
Optimize regex format check

Conflicts:
	webmagic-core/src/main/java/us/codecraft/webmagic/selector/RegexSelector.java
2013-12-21 08:38:17 +08:00
yihua.huang 2a8e1b654d Merge branch 'master' of git.oschina.net:flashsword20/webmagic into osc
Conflicts:
	pom.xml
2013-12-21 07:59:28 +08:00
yihua.huang d9a15ad66a 404 page 2013-12-21 07:56:58 +08:00
黄亿华 1bdffc1e56 Merge pull request !337 from Almark Ming/master 2013-12-21 07:54:57 +08:00
yihua.huang 12a6390cbd update spring4 configuration 2013-12-18 01:02:59 +08:00
yihua.huang 7fd27d9b22 add spring 4 mvc for worker 2013-12-17 23:57:23 +08:00
Almark Ming 91ed66ecac Update RegexSelector.java 2013-12-17 16:57:22 +08:00
Almark Ming 83926970b2 Check valid left parenthesis 2013-12-17 16:55:53 +08:00
yihua.huang 31fb0048a1 add worker 2013-12-07 00:37:07 +08:00
yihua.huang b51fb2696b update ut for cookie 2013-12-06 00:30:01 +08:00
yihua.huang ff2f588c41 #48 nullpointer exception 2013-12-04 22:11:20 +08:00
yihua.huang 0c3ff3d6b1 remove duplicate logo in readme 2013-12-04 00:05:38 +08:00
yihua.huang fc97cb58c5 update lib and version 2013-12-04 00:04:29 +08:00
yihua.huang 7c41bec92f Merge branch 'master' of github.com:code4craft/webmagic
Conflicts:
	README.md
	webmagic-samples/pom.xml
	webmagic-selenium/pom.xml
2013-12-03 23:50:26 +08:00
yihua.huang ac516f9b0e update version in docs 2013-12-03 23:46:31 +08:00
yihua.huang cc241bc0f2 update versions 2013-12-03 23:40:47 +08:00
yihua.huang d274310cb2 [maven-release-plugin] prepare for next development iteration 2013-12-03 23:35:06 +08:00
yihua.huang e8c32a32dc [maven-release-plugin] prepare release webmagic-0.4.2 2013-12-03 23:34:57 +08:00
yihua.huang 93cb4308ef update pom 2013-12-03 23:29:57 +08:00
yihua.huang 6a828e923c #46 Downloader thread hang up when timeout 2013-12-03 09:59:54 +08:00
yihua.huang 486d9d276f #45 Remove multi in ExtractBy 2013-11-28 18:23:51 +08:00
yihua.huang aaa53f58c7 Merge branch 'master' of github.com:code4craft/webmagic 2013-11-28 16:27:49 +08:00
yihua.huang 07bcb06a3f update version in readme 2013-11-28 16:27:36 +08:00
Yihua Huang 4f53b07e47 Merge pull request #44 from supermicah/master
double-check 中再取次httpClient的内容
2013-11-27 23:28:20 -08:00
shijinping 9a524aa364 double-check 中再取次httpClient的内容 2013-11-28 14:38:30 +08:00
yihua.huang 057b3a530e update jar 2013-11-28 13:42:08 +08:00
yihua.huang fd23cb6dc0 Merge branch 'master' of github.com:code4craft/webmagic
Conflicts:
	README.md
	pom.xml
	webmagic-samples/pom.xml
	webmagic-selenium/pom.xml
2013-11-28 13:40:24 +08:00
yihua.huang cb84220d7a update version 2013-11-28 13:37:10 +08:00
yihua.huang e7083dc39d [maven-release-plugin] prepare for next development iteration 2013-11-28 13:04:32 +08:00
yihua.huang ae623567b3 [maven-release-plugin] prepare release webmagic-0.4.1 2013-11-28 13:04:22 +08:00
yihua.huang 7c43b5146e scripts readme 2013-11-28 12:04:05 +08:00
yihua.huang 633e0fe834 document for avalon 2013-11-28 11:39:19 +08:00
yihua.huang 18a3af4a0a add more sample for jsonpath #42 2013-11-28 09:58:22 +08:00
yihua.huang 59ad4cad27 #42 Add jsonpath in annotation mode for json result 2013-11-28 08:25:16 +08:00
yihua.huang c2d6d495b3 #41 add getThreadAlive(),getStatus,getPageCount() to spider 2013-11-28 07:59:24 +08:00
yihua.huang cf62d707e0 #36 Spider does not exit when success 2013-11-27 23:33:18 +08:00
yihua.huang a01312930a #39 Parsing html after page.getHtml() 2013-11-27 22:01:34 +08:00
yihua.huang f63d33b457 update some comments 2013-11-27 21:06:53 +08:00
yihua.huang 04fcf3193f #38 Change algorithm of SmartContentSelector 2013-11-23 13:56:55 +08:00
yihua.huang 296a68920e fix javadoc and add setPipelines() for spider 2013-11-14 13:23:29 +08:00
yihua.huang 948fa094b0 remove console pipeling 2013-11-12 13:19:36 +08:00
yihua.huang 4479428277 add multithread support 2013-11-12 13:11:31 +08:00
yihua.huang b5f2498c99 remove nativeobject for rhino 2013-11-12 11:55:56 +08:00
yihua.huang 6e6b3cc896 add more status code for check 2013-11-12 11:52:34 +08:00
yihua.huang 47a0360783 #35 add status code to page 2013-11-12 11:51:34 +08:00
yihua.huang f1d5e297bf add scripts 2013-11-12 11:42:42 +08:00
yihua.huang 4cd3e1d871 add build script 2013-11-12 11:14:15 +08:00
yihua.huang 81bb809dba update scripts 2013-11-12 10:38:12 +08:00
yihua.huang 7f26b84439 remove test codes 2013-11-12 08:21:57 +08:00