yihua.huang
|
d61f65cef8
|
update mbean to mxbean #98
|
2014-04-25 11:31:43 +08:00 |
yihua.huang
|
ad6a273b12
|
update test url
|
2014-04-25 11:28:35 +08:00 |
yihua.huang
|
27b37e8164
|
extension point and sample for JMX support #98
|
2014-04-17 08:12:37 +08:00 |
yihua.huang
|
f7950ebcab
|
fix tests
|
2014-04-13 13:00:31 +08:00 |
yihua.huang
|
84b897f83b
|
update AngularJSProcessor
|
2014-04-13 12:20:57 +08:00 |
yihua.huang
|
03c251237b
|
add Json parse support
|
2014-04-13 10:23:00 +08:00 |
yihua.huang
|
22c394e629
|
[doc]
|
2014-04-04 20:00:58 +08:00 |
yihua.huang
|
01848301d4
|
encode illegal charactors in url #80
|
2014-04-01 22:14:30 +08:00 |
yihua.huang
|
2780423e60
|
enable blank space in quotes in UrlUtils.fixAllRelativeHrefs #80
|
2014-04-01 20:35:11 +08:00 |
yihua.huang
|
8d8194bee4
|
Change HashMap to LinkedHashMap in ResultItems for same order of input and output #76
|
2014-03-25 08:23:20 +08:00 |
yihua.huang
|
8b35d79569
|
Do not cache document in Selectable for selected Html element #73
|
2014-03-19 22:19:06 +08:00 |
yihua.huang
|
6c11718566
|
Clean project structure #70
|
2014-03-14 23:24:38 +08:00 |
yihua.huang
|
2768a1cae4
|
add test for cycleTriedTimes and fix cycleTriedTimes inc error #60
|
2014-03-01 15:10:38 +08:00 |
Almark Ming
|
2b46b11e55
|
Update RegexSelector.java
Optimize regex format check
Conflicts:
webmagic-core/src/main/java/us/codecraft/webmagic/selector/RegexSelector.java
|
2013-12-21 08:38:17 +08:00 |
yihua.huang
|
b51fb2696b
|
update ut for cookie
|
2013-12-06 00:30:01 +08:00 |
yihua.huang
|
ff2f588c41
|
#48 nullpointer exception
|
2013-12-04 22:11:20 +08:00 |
yihua.huang
|
cf62d707e0
|
#36 Spider does not exit when success
|
2013-11-27 23:33:18 +08:00 |
yihua.huang
|
a3f9ad198f
|
refactor multi thread code in Spider
|
2013-10-31 21:52:43 +08:00 |
yihua.huang
|
5a226387e0
|
#27 nullpointer fix
|
2013-10-11 11:32:44 +08:00 |
yihua.huang
|
fba330872b
|
fix a thread pool exception
|
2013-09-22 23:57:15 +08:00 |
yihua.huang
|
d2e0f0cd33
|
#25 use URL api in UrlUtils.canonicalizeUrl()
|
2013-09-06 21:35:23 +08:00 |
yihua.huang
|
ef4cf49fee
|
add stop method to spider #24
|
2013-09-06 21:17:36 +08:00 |
yihua.huang
|
194518fd82
|
add switch
|
2013-09-04 08:21:34 +08:00 |
yihua.huang
|
2c3574537a
|
refactor in selectors
|
2013-09-02 14:14:24 +08:00 |
yihua.huang
|
d7abbd0e4b
|
fix compile error
|
2013-08-25 16:31:00 +08:00 |
yihua.huang
|
5e9e8b2541
|
add TextContentSelector
|
2013-08-25 16:30:38 +08:00 |
yihua.huang
|
c1471718df
|
extractors
|
2013-08-20 22:44:53 +08:00 |
yihua.huang
|
c70ed57025
|
remove PriorityScheduler to core
|
2013-08-20 21:55:58 +08:00 |
yihua.huang
|
c79d6ecf09
|
complete all comments
|
2013-08-17 23:30:49 +08:00 |
yihua.huang
|
268bd8d0c4
|
remove saxon to extension
|
2013-08-07 23:04:10 +08:00 |
yihua.huang
|
b40cca1122
|
move model package to plugin
|
2013-08-06 20:41:35 +08:00 |
yihua.huang
|
619a12b303
|
add paged support
|
2013-08-04 21:22:15 +08:00 |
yihua.huang
|
a5c85c3c8b
|
add annotation ExtractByRaw
|
2013-08-04 15:12:06 +08:00 |
yihua.huang
|
21cae2ff2e
|
update package
|
2013-08-04 07:53:28 +08:00 |
yihua.huang
|
cfb8990453
|
update author
|
2013-08-04 03:04:30 +08:00 |
yihua.huang
|
bfadac756a
|
fix an attribute bug
|
2013-08-03 18:36:03 +08:00 |
yihua.huang
|
145628557d
|
update afterextract api
|
2013-08-03 18:01:17 +08:00 |
yihua.huang
|
aca165b132
|
add and or selector
|
2013-08-03 17:38:36 +08:00 |
yihua.huang
|
69245e8c03
|
fix Class.assinable bug
|
2013-08-03 17:17:59 +08:00 |
yihua.huang
|
65518f7672
|
add list support
|
2013-08-03 17:01:25 +08:00 |
yihua.huang
|
d4de60a562
|
skip test
|
2013-08-03 16:35:12 +08:00 |
yihua.huang
|
d26cd82d59
|
rename package
|
2013-08-03 16:29:50 +08:00 |
yihua.huang
|
f84b53514f
|
complete objectpipeline
|
2013-08-03 15:55:54 +08:00 |
yihua.huang
|
866ab0a056
|
update email
|
2013-08-03 14:01:18 +08:00 |
yihua.huang
|
7c9e9ce869
|
xpath2.0
|
2013-08-03 07:28:46 +08:00 |
yihua.huang
|
7f27c28d4c
|
simplify api
|
2013-08-02 23:45:13 +08:00 |
yihua.huang
|
d7899e94ae
|
test saxon and invite XPath2.0 support
|
2013-08-02 23:39:34 +08:00 |
yihua.huang
|
3fe3d8f044
|
update
|
2013-08-02 13:51:42 +08:00 |
yihua.huang
|
abba3b7bff
|
add extract by url
|
2013-08-02 06:59:25 +08:00 |
yihua.huang
|
f08ffc34fd
|
rename
|
2013-08-02 06:33:48 +08:00 |