698 Commits (800f66c4cc7e1e4b3e485af5236e3c9b8d54f028)

Author SHA1 Message Date
yihua.huang f39aa435cf add null check #104 2014-04-16 19:46:32 +08:00
yihua.huang 42bbe40a37 [Bugfix]Urls will be lost when call setScheduler() #104 2014-04-16 19:45:17 +08:00
yihua.huang aae1ab2cd6 fix compile error 2014-04-16 18:14:13 +08:00
yihua.huang bc8d0220eb Merge branch 'master' of github.com:code4craft/webmagic 2014-04-16 18:13:51 +08:00
yihua.huang 1fbfc92de2 Inherit support of Field annotation in Model #103 2014-04-16 18:13:44 +08:00
Yihua Huang 93c4a2afb7 Merge pull request #102 from ccliangbo/waitNewUrl: combine two try-catch block into one, make it cleaner. 2014-04-16 16:38:10 +08:00
Bo LIANG 163773af6b combine two try-catch block into one, make it cleaner. 2014-04-16 16:05:08 +08:00
yihua.huang c8014a9ae6 update readme 2014-04-15 15:34:37 +08:00
yihua.huang ec446277b1 some refactor in httpclientdownloader 2014-04-15 15:30:37 +08:00
yihua.huang 4a035e729a extension point for LocalDuplicatedRemovedScheduler #95 2014-04-13 23:31:13 +08:00
yihua.huang b249e49748 [Bugfix]loop error when add TargetRequest #99 2014-04-13 23:04:09 +08:00
yihua.huang 3a79b1b64a [Bugfix]formatter property does not work when field is String#100 2014-04-13 23:02:34 +08:00
Yihua Huang cc9d319fd9 Merge pull request #94 from sebastian1118/master: update:PatternHandler 2014-04-13 13:16:20 +08:00
Yihua Huang da2f023c12 Merge pull request #96 from ouyanghuangzheng/master: Revised several comments in Spider and Site 2014-04-13 13:12:12 +08:00
yihua.huang f7950ebcab fix tests 2014-04-13 13:00:31 +08:00
yihua.huang b14f0ee479 fix jsonpath in AngularJSProcessor 2014-04-13 12:54:44 +08:00
愤怒的番茄 32ba1b8889 Fix several comment issues 2014-04-13 12:41:15 +08:00
yihua.huang 84b897f83b update AngularJSProcessor 2014-04-13 12:20:57 +08:00
yihua.huang 03c251237b add Json parse support 2014-04-13 10:23:00 +08:00
Tian 99e12aafaa update:PatternHandler 2014-04-13 10:14:39 +08:00
愤怒的番茄 53184f0390 test 2014-04-12 23:00:37 +08:00
愤怒的番茄 644e8d1f72 Sync with the official source code 2014-04-12 22:32:22 +08:00
愤怒的番茄 610ac42c07 Update 2014-04-12 22:22:07 +08:00
愤怒的番茄 5b254e446b Update 2014-04-12 22:08:53 +08:00
yihua.huang 843e928c2c comments on sinablogprocessor sample 2014-04-12 20:10:24 +08:00
yihua.huang be37d8b216 sinablogprocessor sample 2014-04-12 20:03:44 +08:00
yihua.huang 094f9d1552 rename assets for spell mistake 2014-04-12 13:42:32 +08:00
yihua.huang 2b023c95c2 qqmeishi demo 2014-04-11 11:43:04 +08:00
yihua.huang db65dfafb8 add baidunews sample 2014-04-09 23:32:07 +08:00
yihua.huang 3669e73e4a update News163: use Xsoup 0.2.0 syntax instead of ComboExtract 2014-04-09 16:43:55 +08:00
yihua.huang 02b441ad38 disable NativeObject in Rhino because it is a hotspot internal api and compile error in OpenJDK #93 2014-04-09 15:40:33 +08:00
yihua.huang 9f5a6494a0 add support for JDK6 #93 2014-04-09 10:44:52 +08:00
yihua.huang c6c56ad511 Merge branch 'master' of github.com:code4craft/webmagic 2014-04-09 09:54:13 +08:00
yihua.huang c2873928c8 [prototype] extractrule 2014-04-09 09:54:01 +08:00
Yihua Huang 7cb4e37812 Merge pull request #93 from friddle/master: update the script 2014-04-07 23:22:35 +08:00
friddle 933800147b update ruby 2014-04-07 23:18:00 +08:00
friddle 37666a7151 update the script 2014-04-07 23:04:24 +08:00
yihua.huang c1e7207869 add FileCacheQueueScheduler support for cycleRetryTimes 2014-04-07 11:00:09 +08:00
yihua.huang 969ad1766b change logger style to slf4j for cleaner code 2014-04-06 21:32:20 +08:00
yihua.huang 9b2cb43f47 ConfigurablePageProcessor #91 2014-04-05 23:40:10 +08:00
Yihua Huang 1090d070d9 Merge pull request #90 from ccliangbo/removeUnusedLines: Remove unused variable to make the project cleaner. 2014-04-05 22:00:30 +08:00
Bo LIANG 159eeea2f5 Remove unused variable to make the project cleaner. 2014-04-05 18:32:12 +08:00
yihua.huang c143fc662c add SubPageProcessor #86 2014-04-05 18:17:48 +08:00
Yihua Huang 2b2ce9ce13 Merge pull request #89 from ccliangbo/slf4jFormat: change the formatter of log. 2014-04-05 15:11:58 +08:00
Bo LIANG b043ac76d6 change the formatter of log. To use slf4j, we should insert {} into the formatter string. 2014-04-05 11:31:56 +08:00
Yihua Huang 474f785dab Merge pull request #86 from sebastian1118/master: new feature: PatternProcessor 2014-04-04 23:41:27 +08:00
yihua.huang 8fe967ba8d [BugFix]exclude log4j.xml from maven jar plugin #82 2014-04-04 23:39:32 +08:00
Tian 38a12f8641 new feature: PatternProcessor 2014-04-04 22:02:52 +08:00
yihua.huang dafd0b5875 [BugFix]multi model in one pageprocessor will be skipped #85 2014-04-04 20:36:31 +08:00
yihua.huang 7aaf837e15 change logger to slf4j style for performance #84 2014-04-04 20:10:00 +08:00