Commit Graph

67 Commits (d61f65cef84cad0ada34e8744c1d1d5a8e311cf0)

Author SHA1 Message Date
yihua.huang d61f65cef8 update mbean to mxbean #98 2014-04-25 11:31:43 +08:00
yihua.huang ad6a273b12 update test url 2014-04-25 11:28:35 +08:00
yihua.huang 27b37e8164 extension point and sample for JMX support #98 2014-04-17 08:12:37 +08:00
yihua.huang f7950ebcab fix tests 2014-04-13 13:00:31 +08:00
yihua.huang 84b897f83b update AngularJSProcessor 2014-04-13 12:20:57 +08:00
yihua.huang 03c251237b add Json parse support 2014-04-13 10:23:00 +08:00
yihua.huang 22c394e629 [doc] 2014-04-04 20:00:58 +08:00
yihua.huang 01848301d4 encode illegal charactors in url #80 2014-04-01 22:14:30 +08:00
yihua.huang 2780423e60 enable blank space in quotes in UrlUtils.fixAllRelativeHrefs #80 2014-04-01 20:35:11 +08:00
yihua.huang 8d8194bee4 Change HashMap to LinkedHashMap in ResultItems for same order of input and output #76 2014-03-25 08:23:20 +08:00
yihua.huang 8b35d79569 Do not cache document in Selectable for selected Html element #73 2014-03-19 22:19:06 +08:00
yihua.huang 6c11718566 Clean project structure #70 2014-03-14 23:24:38 +08:00
yihua.huang 2768a1cae4 add test for cycleTriedTimes and fix cycleTriedTimes inc error #60 2014-03-01 15:10:38 +08:00
Almark Ming 2b46b11e55 Update RegexSelector.java
Optimize regex format check

Conflicts:
	webmagic-core/src/main/java/us/codecraft/webmagic/selector/RegexSelector.java
2013-12-21 08:38:17 +08:00
yihua.huang b51fb2696b update ut for cookie 2013-12-06 00:30:01 +08:00
yihua.huang ff2f588c41 #48 nullpointer exception 2013-12-04 22:11:20 +08:00
yihua.huang cf62d707e0 #36 Spider does not exit when success 2013-11-27 23:33:18 +08:00
yihua.huang a3f9ad198f refactor multi thread code in Spider 2013-10-31 21:52:43 +08:00
yihua.huang 5a226387e0 #27 nullpointer fix 2013-10-11 11:32:44 +08:00
yihua.huang fba330872b fix a thread pool exception 2013-09-22 23:57:15 +08:00
yihua.huang d2e0f0cd33 #25 use URL api in UrlUtils.canonicalizeUrl() 2013-09-06 21:35:23 +08:00
yihua.huang ef4cf49fee add stop method to spider #24 2013-09-06 21:17:36 +08:00
yihua.huang 194518fd82 add switch 2013-09-04 08:21:34 +08:00
yihua.huang 2c3574537a refactor in selectors 2013-09-02 14:14:24 +08:00
yihua.huang d7abbd0e4b fix compile error 2013-08-25 16:31:00 +08:00
yihua.huang 5e9e8b2541 add TextContentSelector 2013-08-25 16:30:38 +08:00
yihua.huang c1471718df extractors 2013-08-20 22:44:53 +08:00
yihua.huang c70ed57025 remove PriorityScheduler to core 2013-08-20 21:55:58 +08:00
yihua.huang c79d6ecf09 complete all comments 2013-08-17 23:30:49 +08:00
yihua.huang 268bd8d0c4 remove saxon to extension 2013-08-07 23:04:10 +08:00
yihua.huang b40cca1122 move model package to plugin 2013-08-06 20:41:35 +08:00
yihua.huang 619a12b303 add paged support 2013-08-04 21:22:15 +08:00
yihua.huang a5c85c3c8b add annotation ExtractByRaw 2013-08-04 15:12:06 +08:00
yihua.huang 21cae2ff2e update package 2013-08-04 07:53:28 +08:00
yihua.huang cfb8990453 update author 2013-08-04 03:04:30 +08:00
yihua.huang bfadac756a fix an attribute bug 2013-08-03 18:36:03 +08:00
yihua.huang 145628557d update afterextract api 2013-08-03 18:01:17 +08:00
yihua.huang aca165b132 add and or selector 2013-08-03 17:38:36 +08:00
yihua.huang 69245e8c03 fix Class.assinable bug 2013-08-03 17:17:59 +08:00
yihua.huang 65518f7672 add list support 2013-08-03 17:01:25 +08:00
yihua.huang d4de60a562 skip test 2013-08-03 16:35:12 +08:00
yihua.huang d26cd82d59 rename package 2013-08-03 16:29:50 +08:00
yihua.huang f84b53514f complete objectpipeline 2013-08-03 15:55:54 +08:00
yihua.huang 866ab0a056 update email 2013-08-03 14:01:18 +08:00
yihua.huang 7c9e9ce869 xpath2.0 2013-08-03 07:28:46 +08:00
yihua.huang 7f27c28d4c simplify api 2013-08-02 23:45:13 +08:00
yihua.huang d7899e94ae test saxon and invite XPath2.0 support 2013-08-02 23:39:34 +08:00
yihua.huang 3fe3d8f044 update 2013-08-02 13:51:42 +08:00
yihua.huang abba3b7bff add extract by url 2013-08-02 06:59:25 +08:00
yihua.huang f08ffc34fd rename 2013-08-02 06:33:48 +08:00