This website requires JavaScript.
5016700658
feat():爬虫框架
master
Saisai
2024-08-12 11:03:09 +0800
9d75cce16d
Merge branch 'release/1.0.0'
Joe Zhou
2024-07-05 00:27:50 +0800
3e9cd9b5c3
Update versions for release
Joe Zhou
2024-07-05 00:20:28 +0800
4d0cdb011f
stopWhenComplete,增加动态修改完成时停止方法。 (#1169 )
Niu_XZ
2024-06-17 17:27:28 +0800
49a5efff46
Add a private constructor to hide the implicit public one.
Joe Zhou
2024-06-04 01:02:45 +0800
5c43e36118
Make sure the contentType of detectCharset could be null.
Joe Zhou
2024-06-04 00:59:30 +0800
d2aebc60a7
Make getCharset to support null parameter.
Joe Zhou
2024-06-04 00:57:28 +0800
7d2d2244b3
Upgrade oxerr-parent from 2.1.0 to 2.2.1.
Sutra Zhou
2024-05-21 12:55:05 +0800
16a4fe3e28
Use oxerr-parent instead.
Sutra Zhou
2024-05-17 13:17:13 +0800
4ee30c0592
Merge tag 'WebMagic-0.10.3' into develop
Sutra Zhou
2024-04-23 23:41:18 +0800
38f240c42e
Update to hotfix version
Sutra Zhou
2024-04-23 23:39:23 +0800
f7fdcd025f
Merge branch 'hotfix/0.10.3'
Sutra Zhou
2024-04-23 23:39:16 +0800
462c60fef2
Fix for entity is null.
Sutra Zhou
2024-04-23 23:37:46 +0800
54aef0f0e0
Update versions for hotfix
Sutra Zhou
2024-04-23 23:34:15 +0800
6be02f3a0f
Merge tag 'WebMagic-0.10.2' into develop
Sutra Zhou
2024-04-23 21:47:57 +0800
a81c4e7627
Update to hotfix version
Sutra Zhou
2024-04-23 21:46:35 +0800
0dc852e219
Merge branch 'hotfix/0.10.2'
Sutra Zhou
2024-04-23 21:46:28 +0800
0dbfaf7284
Remove useless log.
Sutra Zhou
2024-04-23 21:34:59 +0800
bda51537b6
Update versions for hotfix
Sutra Zhou
2024-04-23 21:32:04 +0800
2217b1c8f2
Merge tag 'WebMagic-0.10.1' into develop
Sutra Zhou
2024-04-23 20:41:02 +0800
884f51ba3b
Update to hotfix version
Sutra Zhou
2024-04-23 20:38:03 +0800
2994716ba5
Merge branch 'hotfix/0.10.1'
Sutra Zhou
2024-04-23 20:37:56 +0800
78740a4e28
Fix NPE.
Sutra Zhou
2024-04-23 20:26:38 +0800
5afe3e7b81
Revert "fix: pom.xml to reduce vulnerabilities (#1134 )"
Sutra Zhou
2024-04-23 20:21:45 +0800
a0ff4a2d2e
Fix log message.
Sutra Zhou
2024-04-23 20:09:15 +0800
76ef0332f1
Update versions for hotfix
Sutra Zhou
2024-04-23 20:06:47 +0800
a5144350bd
Upgrade maven plugins to latest versions.
Joe Zhou
2024-04-22 00:45:54 +0800
e34b495625
Upgrade maven-suirefire-plugin from 3.0.0-M7 to 3.2.5.
Joe Zhou
2024-04-22 00:17:58 +0800
5344db0106
Upgrade jacoco-maven-plugin from 0.8.8 to 0.8.12.
Joe Zhou
2024-04-21 23:35:26 +0800
9ab342c3a7
Remove public modifiers from junit5 test methods.
Joe Zhou
2024-04-21 23:31:21 +0800
5196a56ccf
Format code.
Joe Zhou
2024-04-21 23:30:43 +0800
dba1668306
Add tests to test the equals & hashCode of Proxy.
Joe Zhou
2024-04-21 23:23:32 +0800
b7e0d360ec
Upgrade junit from 4.13.2 to 5.10.2.
Joe Zhou
2024-04-21 22:04:48 +0800
d8321baf56
Refactored and implement of a template method pattern for logger config in webmagic-scripts (#1158 )
François Gibier
2024-04-06 01:55:46 +0200
2df7dca871
Changed refactor of processSingle again, this one is a better version (#1157 )
François Gibier
2024-04-05 16:50:21 +0200
05e5eefc7d
Refactor of processSingle in PageModelExtractor (#1155 )
François Gibier
2024-04-05 15:51:08 +0200
f10fabcb58
Update .gitignore, with merging Maven.gitignore & Global/Eclipse.gitignore in github/gitignore.
Joe Zhou
2024-04-03 20:21:02 +0800
383bea32f6
Bump com.jayway.jsonpath:json-path from 2.8.0 to 2.9.0 (#1154 )
dependabot[bot]
2024-04-03 19:44:15 +0800
ed7429c293
Rename webmagic-parent to webmagic.
Sutra Zhou
2024-04-03 19:37:56 +0800
4ebf48f6e3
Replace log4j 1.x with log4j 2.x, refs #534 .
Sutra Zhou
2024-04-03 18:26:01 +0800
2c730eb978
Update Java version from 1.8 to 11, refs #1134 .
Joe Zhou
2024-04-03 01:14:36 +0800
0ceaf14882
Bump version number from 0.10.1-SNAPSHOT to 1.0.0-SNAPSHOT for Java version updating from 1.8 to 11, refs #1134 .
Joe Zhou
2024-04-03 01:00:47 +0800
31548deb93
Revert "Refactored code for increased optimization. (#1139 )" (#1153 )
Sutra Zhou
2024-03-30 14:37:55 +0800
f051d978e2
Refactored code for increased optimization. (#1139 )
Parthgajera056
2024-03-30 03:28:02 -0300
9b9f173c1c
Refactored Code to increase maintainability (#1152 )
ayushi250317
2024-03-30 03:26:41 -0300
28ac8bf9c4
Refactored Code to Resolve Implementation Code Smells (#1151 )
ayushi250317
2024-03-28 13:45:12 -0300
80842d72db
Added test cases for improving line and branch coverage (#1150 )
ayushi250317
2024-03-23 02:22:30 -0300
7f8607b881
update com.fasterxml.jackson.core:jackson-databind 2.15.2 to 2.16.0 (#1149 )
Ch3n4y
2024-03-07 08:41:26 +0800
22a60df6aa
Fix build for selenium upgrading from 3.141.59 to 4.14.1, refs #1134 .
Sutra Zhou
2024-03-03 20:02:09 +0800
e4ab6e27e4
Optimize Request#extras, refs #1148 .
Sutra Zhou
2024-03-03 18:35:25 +0800
95d1f44150
Optimize Request#extras, fix #1148 .
Sutra Zhou
2024-03-03 18:33:00 +0800
65fd8f3779
fix: pom.xml to reduce vulnerabilities (#1146 )
Yihua Huang
2024-01-22 12:50:01 +0800
4281f82352
fix: webmagic-samples/pom.xml to reduce vulnerabilities (#1142 )
Yihua Huang
2024-01-11 08:53:43 +0800
7ededbea1a
Fix javadoc.
Sutra Zhou
2023-12-05 12:56:14 +0800
1e5c6488ff
Update for next development version
Sutra Zhou
2023-12-05 12:48:14 +0800
26e134f9a1
Merge tag 'WebMagic-0.10.0' into develop
Sutra Zhou
2023-12-05 12:47:56 +0800
5d55bf33d2
Merge branch 'release/0.10.0'
Sutra Zhou
2023-12-05 12:47:56 +0800
73dd2ebbac
Update versions for release
Sutra Zhou
2023-12-05 12:28:05 +0800
7c20290ce4
Refactor addTargetRequests method to eliminate redundant code. (#1138 )
Harikrishna
2023-11-26 08:26:06 +0530
a911104076
Refactored to remove multiple calls of getSourceTexts() api (#1137 )
Harikrishna
2023-11-24 17:39:32 +0530
622ed5a17f
Refactor compareLong method using Long.compare, corrected the local variable name (#1136 )
Harikrishna
2023-11-24 10:07:04 +0530
67644de3d9
Expose Page to onSuccess & onError.
Sutra Zhou
2023-11-20 18:26:45 +0800
19288e9c11
fix: pom.xml to reduce vulnerabilities (#1134 )
Yihua Huang
2023-11-17 22:12:43 +0800
eda3be9432
Fix log format.
Joe Zhou
2023-11-06 22:44:53 +0800
73f60f809e
Fix typos (#1131 )
Maciej Walkowiak
2023-10-24 01:50:14 +0200
c0d38a6f1a
Upgrade maven-fluido-skin from 1.9 to 1.11.1.
Joe Zhou
2023-09-23 12:04:56 +0800
8c008563ff
Update for next development version
Joe Zhou
2023-09-10 17:06:55 +0800
780e712b0c
Merge tag 'WebMagic-0.9.1' into develop
Joe Zhou
2023-09-10 17:06:52 +0800
21de4689ab
Merge branch 'release/0.9.1'
Joe Zhou
2023-09-10 17:06:52 +0800
07e83a421b
Update versions for release
Joe Zhou
2023-09-10 16:35:32 +0800
0b62461b7a
Upgrade mapdb from 3.0.9 to 3.0.10.
Joe Zhou
2023-09-10 16:25:10 +0800
238814acb4
update net.sourceforge.htmlcleaner:htmlcleaner 2.26 to 2.29 (#1127 )
dack-su
2023-09-07 07:20:11 +0800
028f1ed4a4
fix: pom.xml to reduce vulnerabilities (#1126 )
Yihua Huang
2023-08-22 13:02:24 +0800
6e8d130113
Update for next development version
Joe Zhou
2023-06-22 11:23:44 +0800
94107bc5b3
Merge tag 'WebMagic-0.9.0' into develop
Joe Zhou
2023-06-22 11:23:41 +0800
fd4a136f9a
Merge branch 'release/0.9.0'
Joe Zhou
2023-06-22 11:23:41 +0800
3688226e32
Update versions for release
Joe Zhou
2023-06-22 11:16:41 +0800
ad010927f6
Fix test.
Joe Zhou
2023-06-22 10:40:46 +0800
a5fb4e0414
Upgrade dependencies.
Joe Zhou
2023-06-22 10:40:37 +0800
58fd08bcf8
Expose Request to ProxyProvider.
Joe Zhou
2023-06-22 00:27:30 +0800
9e59b37834
fix: pom.xml to reduce vulnerabilities (#1119 )
Yihua Huang
2023-06-20 13:39:45 +0800
aeb0e89f02
fix: pom.xml to reduce vulnerabilities (#1114 )
Snyk bot
2023-03-28 00:30:03 +0800
244ade7b4c
feat:update host verify (#1112 )
Tanky-Zhang
2023-03-22 22:25:51 +0800
717931166a
向 webmagic-saxon 组件提供若干新 API,更优雅更灵活更强大 (#1108 )
hooy
2023-02-11 02:14:11 +0800
f47038db63
Merge pull request #1107 from hooyantsing/develop
Sutra Zhou
2023-02-04 14:07:17 +0800
08f4a4046b
Update: 提供测试用例。
hooyantsing
2023-02-03 22:59:56 +0800
12ce86425f
BugFix: Jsoup 和 HtmlCleaner 构建 Dom 时,若缺失 table 标签,则无法正常解析 tr 和 td 标签。
hooyantsing
2023-02-03 22:48:58 +0800
a266df406f
Add Site.defaultCharset. closes #1101 .
Joe Zhou
2022-12-20 23:41:31 +0800
80424b0bd7
Replace List with Iterable, fixed #1099 .
Joe Zhou
2022-12-05 23:26:01 +0800
ef616c999e
Fix warnings.
Joe Zhou
2022-11-27 02:05:31 +0800
faf7e1559a
Update README for the webmagic version.
Joe Zhou
2022-11-24 20:31:43 +0800
cda8467f95
Update for next development version
Joe Zhou
2022-11-24 00:49:43 +0800
3085896743
Merge tag 'WebMagic-0.8.0' into develop
Joe Zhou
2022-11-24 00:49:41 +0800
43ce1a0db9
Merge branch 'release/0.8.0'
Joe Zhou
2022-11-24 00:49:41 +0800
7d091def55
Upgrade fastjson, jruby, slf4j.
Joe Zhou
2022-11-24 00:42:41 +0800
a7a06936f0
Fix requireMavenVersion.
Joe Zhou
2022-11-24 00:33:30 +0800
888682863c
Update versions for release
Joe Zhou
2022-11-24 00:30:07 +0800
64e6a9800a
Add dead-lock note for QueueScheduler.
Joe Zhou
2022-11-24 00:26:09 +0800
e735e4e585
Log the remaining capacity.
Joe Zhou
2022-11-20 18:31:36 +0800
4915431845
Revise logging level from warn to info, as we have passed the exception to onError. refs #1094
Joe Zhou
2022-11-15 22:48:02 +0800