yihua.huang
|
f39aa435cf
|
add null check #104
|
2014-04-16 19:46:32 +08:00 |
yihua.huang
|
42bbe40a37
|
[Bugfix]Urls will be lost when call setScheduler() #104
|
2014-04-16 19:45:17 +08:00 |
yihua.huang
|
aae1ab2cd6
|
fix compile error
|
2014-04-16 18:14:13 +08:00 |
yihua.huang
|
bc8d0220eb
|
Merge branch 'master' of github.com:code4craft/webmagic
|
2014-04-16 18:13:51 +08:00 |
yihua.huang
|
1fbfc92de2
|
Inherit support of Field annotation in Model #103
|
2014-04-16 18:13:44 +08:00 |
Yihua Huang
|
93c4a2afb7
|
Merge pull request #102 from ccliangbo/waitNewUrl
combine two try-catch block into one, make it cleaner.
|
2014-04-16 16:38:10 +08:00 |
Bo LIANG
|
163773af6b
|
combine two try-catch block into one, make it cleaner.
|
2014-04-16 16:05:08 +08:00 |
yihua.huang
|
c8014a9ae6
|
update readme
|
2014-04-15 15:34:37 +08:00 |
yihua.huang
|
ec446277b1
|
some refactor in httpclientdownloader
|
2014-04-15 15:30:37 +08:00 |
yihua.huang
|
4a035e729a
|
extension point for LocalDuplicatedRemovedScheduler #95
|
2014-04-13 23:31:13 +08:00 |
yihua.huang
|
b249e49748
|
[Bugfix]loop error when add TargetRequest #99
|
2014-04-13 23:04:09 +08:00 |
yihua.huang
|
3a79b1b64a
|
[Bugfix]formatter property does not work when field is String#100
|
2014-04-13 23:02:34 +08:00 |
Yihua Huang
|
cc9d319fd9
|
Merge pull request #94 from sebastian1118/master
update:PatternHandler
|
2014-04-13 13:16:20 +08:00 |
Yihua Huang
|
da2f023c12
|
Merge pull request #96 from ouyanghuangzheng/master
修改了Spider 和site 几处注释
|
2014-04-13 13:12:12 +08:00 |
yihua.huang
|
f7950ebcab
|
fix tests
|
2014-04-13 13:00:31 +08:00 |
yihua.huang
|
b14f0ee479
|
fix jsonpath in AngularJSProcessor
|
2014-04-13 12:54:44 +08:00 |
愤怒的番茄
|
32ba1b8889
|
修复几处注释问题
|
2014-04-13 12:41:15 +08:00 |
yihua.huang
|
84b897f83b
|
update AngularJSProcessor
|
2014-04-13 12:20:57 +08:00 |
yihua.huang
|
03c251237b
|
add Json parse support
|
2014-04-13 10:23:00 +08:00 |
Tian
|
99e12aafaa
|
update:PatternHandler
|
2014-04-13 10:14:39 +08:00 |
愤怒的番茄
|
53184f0390
|
test
|
2014-04-12 23:00:37 +08:00 |
愤怒的番茄
|
644e8d1f72
|
同步官方源码
|
2014-04-12 22:32:22 +08:00 |
愤怒的番茄
|
610ac42c07
|
更新
|
2014-04-12 22:22:07 +08:00 |
愤怒的番茄
|
5b254e446b
|
更新
|
2014-04-12 22:08:53 +08:00 |
yihua.huang
|
843e928c2c
|
comments on sinablogprocessor sample
|
2014-04-12 20:10:24 +08:00 |
yihua.huang
|
be37d8b216
|
sinablogprocessor sample
|
2014-04-12 20:03:44 +08:00 |
yihua.huang
|
094f9d1552
|
rename assets for spell mistake
|
2014-04-12 13:42:32 +08:00 |
yihua.huang
|
2b023c95c2
|
qqmeishi demo
|
2014-04-11 11:43:04 +08:00 |
yihua.huang
|
db65dfafb8
|
add baidunews sample
|
2014-04-09 23:32:07 +08:00 |
yihua.huang
|
3669e73e4a
|
update News163: use Xsoup 0.2.0 syntax instead of ComboExtract
|
2014-04-09 16:43:55 +08:00 |
yihua.huang
|
02b441ad38
|
disable NativeObject in Rhino because it is a hotspot internal api and compile error in OpenJDK #93
|
2014-04-09 15:40:33 +08:00 |
yihua.huang
|
9f5a6494a0
|
add support for JDK6 #93
|
2014-04-09 10:44:52 +08:00 |
yihua.huang
|
c6c56ad511
|
Merge branch 'master' of github.com:code4craft/webmagic
|
2014-04-09 09:54:13 +08:00 |
yihua.huang
|
c2873928c8
|
[prototype] extractrule
|
2014-04-09 09:54:01 +08:00 |
Yihua Huang
|
7cb4e37812
|
Merge pull request #93 from friddle/master
update the script
|
2014-04-07 23:22:35 +08:00 |
friddle
|
933800147b
|
update ruby
|
2014-04-07 23:18:00 +08:00 |
friddle
|
37666a7151
|
update the script
|
2014-04-07 23:04:24 +08:00 |
yihua.huang
|
c1e7207869
|
add FileCacheQueueScheduler support for cycleRetryTimes
|
2014-04-07 11:00:09 +08:00 |
yihua.huang
|
969ad1766b
|
change logger style to slf4j for cleaner code
|
2014-04-06 21:32:20 +08:00 |
yihua.huang
|
9b2cb43f47
|
ConfigurablePageProcessor #91
|
2014-04-05 23:40:10 +08:00 |
Yihua Huang
|
1090d070d9
|
Merge pull request #90 from ccliangbo/removeUnusedLines
Remove unused variable to make the project cleaner.
|
2014-04-05 22:00:30 +08:00 |
Bo LIANG
|
159eeea2f5
|
Remove unused variable to make the project cleaner.
|
2014-04-05 18:32:12 +08:00 |
yihua.huang
|
c143fc662c
|
add SubPageProcessor #86
|
2014-04-05 18:17:48 +08:00 |
Yihua Huang
|
2b2ce9ce13
|
Merge pull request #89 from ccliangbo/slf4jFormat
change the formatter of log.
|
2014-04-05 15:11:58 +08:00 |
Bo LIANG
|
b043ac76d6
|
change the formatter of log.
To use slf4j, we should insert {} into the formatter string.
|
2014-04-05 11:31:56 +08:00 |
Yihua Huang
|
474f785dab
|
Merge pull request #86 from sebastian1118/master
new feature: PatternProcessor
|
2014-04-04 23:41:27 +08:00 |
yihua.huang
|
8fe967ba8d
|
[BugFix]exclude log4j.xml from maven jar plugin #82
|
2014-04-04 23:39:32 +08:00 |
Tian
|
38a12f8641
|
new feature: PatternProcessor
|
2014-04-04 22:02:52 +08:00 |
yihua.huang
|
dafd0b5875
|
[BugFix]multi model in one pageprocessor will be skipped #85
|
2014-04-04 20:36:31 +08:00 |
yihua.huang
|
7aaf837e15
|
change logger to slf4j style for performance #84
|
2014-04-04 20:10:00 +08:00 |