Commit Graph

  • 5016700658 feat():爬虫框架 master Saisai 2024-08-12 11:03:09 +0800
  • 9d75cce16d Merge branch 'release/1.0.0' Joe Zhou 2024-07-05 00:27:50 +0800
  • 3e9cd9b5c3 Update versions for release Joe Zhou 2024-07-05 00:20:28 +0800
  • 4d0cdb011f
    stopWhenComplete,增加动态修改完成时停止方法。 (#1169) Niu_XZ 2024-06-17 17:27:28 +0800
  • 49a5efff46 Add a private constructor to hide the implicit public one. Joe Zhou 2024-06-04 01:02:45 +0800
  • 5c43e36118 Make sure the contentType of detectCharset could be null. Joe Zhou 2024-06-04 00:59:30 +0800
  • d2aebc60a7 Make getCharset to support null parameter. Joe Zhou 2024-06-04 00:57:28 +0800
  • 7d2d2244b3 Upgrade oxerr-parent from 2.1.0 to 2.2.1. Sutra Zhou 2024-05-21 12:55:05 +0800
  • 16a4fe3e28 Use oxerr-parent instead. Sutra Zhou 2024-05-17 13:17:13 +0800
  • 4ee30c0592 Merge tag 'WebMagic-0.10.3' into develop Sutra Zhou 2024-04-23 23:41:18 +0800
  • 38f240c42e Update to hotfix version Sutra Zhou 2024-04-23 23:39:23 +0800
  • f7fdcd025f Merge branch 'hotfix/0.10.3' Sutra Zhou 2024-04-23 23:39:16 +0800
  • 462c60fef2 Fix for entity is null. Sutra Zhou 2024-04-23 23:37:46 +0800
  • 54aef0f0e0 Update versions for hotfix Sutra Zhou 2024-04-23 23:34:15 +0800
  • 6be02f3a0f Merge tag 'WebMagic-0.10.2' into develop Sutra Zhou 2024-04-23 21:47:57 +0800
  • a81c4e7627 Update to hotfix version Sutra Zhou 2024-04-23 21:46:35 +0800
  • 0dc852e219 Merge branch 'hotfix/0.10.2' Sutra Zhou 2024-04-23 21:46:28 +0800
  • 0dbfaf7284 Remove useless log. Sutra Zhou 2024-04-23 21:34:59 +0800
  • bda51537b6 Update versions for hotfix Sutra Zhou 2024-04-23 21:32:04 +0800
  • 2217b1c8f2 Merge tag 'WebMagic-0.10.1' into develop Sutra Zhou 2024-04-23 20:41:02 +0800
  • 884f51ba3b Update to hotfix version Sutra Zhou 2024-04-23 20:38:03 +0800
  • 2994716ba5 Merge branch 'hotfix/0.10.1' Sutra Zhou 2024-04-23 20:37:56 +0800
  • 78740a4e28 Fix NPE. Sutra Zhou 2024-04-23 20:26:38 +0800
  • 5afe3e7b81 Revert "fix: pom.xml to reduce vulnerabilities (#1134)" Sutra Zhou 2024-04-23 20:21:45 +0800
  • a0ff4a2d2e Fix log message. Sutra Zhou 2024-04-23 20:09:15 +0800
  • 76ef0332f1 Update versions for hotfix Sutra Zhou 2024-04-23 20:06:47 +0800
  • a5144350bd Upgrade maven plugins to latest versions. Joe Zhou 2024-04-22 00:45:54 +0800
  • e34b495625 Upgrade maven-suirefire-plugin from 3.0.0-M7 to 3.2.5. Joe Zhou 2024-04-22 00:17:58 +0800
  • 5344db0106 Upgrade jacoco-maven-plugin from 0.8.8 to 0.8.12. Joe Zhou 2024-04-21 23:35:26 +0800
  • 9ab342c3a7 Remove public modifiers from junit5 test methods. Joe Zhou 2024-04-21 23:31:21 +0800
  • 5196a56ccf Format code. Joe Zhou 2024-04-21 23:30:43 +0800
  • dba1668306 Add tests to test the equals & hashCode of Proxy. Joe Zhou 2024-04-21 23:23:32 +0800
  • b7e0d360ec Upgrade junit from 4.13.2 to 5.10.2. Joe Zhou 2024-04-21 22:04:48 +0800
  • d8321baf56
    Refactored and implement of a template method pattern for logger config in webmagic-scripts (#1158) François Gibier 2024-04-06 01:55:46 +0200
  • 2df7dca871
    Changed refactor of processSingle again, this one is a better version (#1157) François Gibier 2024-04-05 16:50:21 +0200
  • 05e5eefc7d
    Refactor of processSingle in PageModelExtractor (#1155) François Gibier 2024-04-05 15:51:08 +0200
  • f10fabcb58 Update .gitignore, with merging Maven.gitignore & Global/Eclipse.gitignore in github/gitignore. Joe Zhou 2024-04-03 20:21:02 +0800
  • 383bea32f6
    Bump com.jayway.jsonpath:json-path from 2.8.0 to 2.9.0 (#1154) dependabot[bot] 2024-04-03 19:44:15 +0800
  • ed7429c293 Rename webmagic-parent to webmagic. Sutra Zhou 2024-04-03 19:37:56 +0800
  • 4ebf48f6e3 Replace log4j 1.x with log4j 2.x, refs #534. Sutra Zhou 2024-04-03 18:26:01 +0800
  • 2c730eb978 Update Java version from 1.8 to 11, refs #1134. Joe Zhou 2024-04-03 01:14:36 +0800
  • 0ceaf14882 Bump version number from 0.10.1-SNAPSHOT to 1.0.0-SNAPSHOT for Java version updating from 1.8 to 11, refs #1134. Joe Zhou 2024-04-03 01:00:47 +0800
  • 31548deb93
    Revert "Refactored code for increased optimization. (#1139)" (#1153) Sutra Zhou 2024-03-30 14:37:55 +0800
  • f051d978e2
    Refactored code for increased optimization. (#1139) Parthgajera056 2024-03-30 03:28:02 -0300
  • 9b9f173c1c
    Refactored Code to increase maintainability (#1152) ayushi250317 2024-03-30 03:26:41 -0300
  • 28ac8bf9c4
    Refactored Code to Resolve Implementation Code Smells (#1151) ayushi250317 2024-03-28 13:45:12 -0300
  • 80842d72db
    Added test cases for improving line and branch coverage (#1150) ayushi250317 2024-03-23 02:22:30 -0300
  • 7f8607b881
    update com.fasterxml.jackson.core:jackson-databind 2.15.2 to 2.16.0 (#1149) Ch3n4y 2024-03-07 08:41:26 +0800
  • 22a60df6aa Fix build for selenium upgrading from 3.141.59 to 4.14.1, refs #1134. Sutra Zhou 2024-03-03 20:02:09 +0800
  • e4ab6e27e4 Optimize Request#extras, refs #1148. Sutra Zhou 2024-03-03 18:35:25 +0800
  • 95d1f44150 Optimize Request#extras, fix #1148. Sutra Zhou 2024-03-03 18:33:00 +0800
  • 65fd8f3779
    fix: pom.xml to reduce vulnerabilities (#1146) Yihua Huang 2024-01-22 12:50:01 +0800
  • 4281f82352
    fix: webmagic-samples/pom.xml to reduce vulnerabilities (#1142) Yihua Huang 2024-01-11 08:53:43 +0800
  • 7ededbea1a Fix javadoc. Sutra Zhou 2023-12-05 12:56:14 +0800
  • 1e5c6488ff Update for next development version Sutra Zhou 2023-12-05 12:48:14 +0800
  • 26e134f9a1 Merge tag 'WebMagic-0.10.0' into develop Sutra Zhou 2023-12-05 12:47:56 +0800
  • 5d55bf33d2 Merge branch 'release/0.10.0' Sutra Zhou 2023-12-05 12:47:56 +0800
  • 73dd2ebbac Update versions for release Sutra Zhou 2023-12-05 12:28:05 +0800
  • 7c20290ce4
    Refactor addTargetRequests method to eliminate redundant code. (#1138) Harikrishna 2023-11-26 08:26:06 +0530
  • a911104076
    Refactored to remove multiple calls of getSourceTexts() api (#1137) Harikrishna 2023-11-24 17:39:32 +0530
  • 622ed5a17f
    Refactor compareLong method using Long.compare, corrected the local variable name (#1136) Harikrishna 2023-11-24 10:07:04 +0530
  • 67644de3d9 Expose Page to onSuccess & onError. Sutra Zhou 2023-11-20 18:26:45 +0800
  • 19288e9c11
    fix: pom.xml to reduce vulnerabilities (#1134) Yihua Huang 2023-11-17 22:12:43 +0800
  • eda3be9432 Fix log format. Joe Zhou 2023-11-06 22:44:53 +0800
  • 73f60f809e
    Fix typos (#1131) Maciej Walkowiak 2023-10-24 01:50:14 +0200
  • c0d38a6f1a Upgrade maven-fluido-skin from 1.9 to 1.11.1. Joe Zhou 2023-09-23 12:04:56 +0800
  • 8c008563ff Update for next development version Joe Zhou 2023-09-10 17:06:55 +0800
  • 780e712b0c Merge tag 'WebMagic-0.9.1' into develop Joe Zhou 2023-09-10 17:06:52 +0800
  • 21de4689ab Merge branch 'release/0.9.1' Joe Zhou 2023-09-10 17:06:52 +0800
  • 07e83a421b Update versions for release Joe Zhou 2023-09-10 16:35:32 +0800
  • 0b62461b7a Upgrade mapdb from 3.0.9 to 3.0.10. Joe Zhou 2023-09-10 16:25:10 +0800
  • 238814acb4
    update net.sourceforge.htmlcleaner:htmlcleaner 2.26 to 2.29 (#1127) dack-su 2023-09-07 07:20:11 +0800
  • 028f1ed4a4
    fix: pom.xml to reduce vulnerabilities (#1126) Yihua Huang 2023-08-22 13:02:24 +0800
  • 6e8d130113 Update for next development version Joe Zhou 2023-06-22 11:23:44 +0800
  • 94107bc5b3 Merge tag 'WebMagic-0.9.0' into develop Joe Zhou 2023-06-22 11:23:41 +0800
  • fd4a136f9a Merge branch 'release/0.9.0' Joe Zhou 2023-06-22 11:23:41 +0800
  • 3688226e32 Update versions for release Joe Zhou 2023-06-22 11:16:41 +0800
  • ad010927f6 Fix test. Joe Zhou 2023-06-22 10:40:46 +0800
  • a5fb4e0414 Upgrade dependencies. Joe Zhou 2023-06-22 10:40:37 +0800
  • 58fd08bcf8 Expose Request to ProxyProvider. Joe Zhou 2023-06-22 00:27:30 +0800
  • 9e59b37834
    fix: pom.xml to reduce vulnerabilities (#1119) Yihua Huang 2023-06-20 13:39:45 +0800
  • aeb0e89f02
    fix: pom.xml to reduce vulnerabilities (#1114) Snyk bot 2023-03-28 00:30:03 +0800
  • 244ade7b4c
    feat:update host verify (#1112) Tanky-Zhang 2023-03-22 22:25:51 +0800
  • 717931166a
    向 webmagic-saxon 组件提供若干新 API,更优雅更灵活更强大 (#1108) hooy 2023-02-11 02:14:11 +0800
  • f47038db63
    Merge pull request #1107 from hooyantsing/develop Sutra Zhou 2023-02-04 14:07:17 +0800
  • 08f4a4046b Update: 提供测试用例。 hooyantsing 2023-02-03 22:59:56 +0800
  • 12ce86425f BugFix: Jsoup 和 HtmlCleaner 构建 Dom 时,若缺失 table 标签,则无法正常解析 tr 和 td 标签。 hooyantsing 2023-02-03 22:48:58 +0800
  • a266df406f Add Site.defaultCharset. closes #1101. Joe Zhou 2022-12-20 23:41:31 +0800
  • 80424b0bd7 Replace List with Iterable, fixed #1099. Joe Zhou 2022-12-05 23:26:01 +0800
  • ef616c999e Fix warnings. Joe Zhou 2022-11-27 02:05:31 +0800
  • faf7e1559a Update README for the webmagic version. Joe Zhou 2022-11-24 20:31:43 +0800
  • cda8467f95 Update for next development version Joe Zhou 2022-11-24 00:49:43 +0800
  • 3085896743 Merge tag 'WebMagic-0.8.0' into develop Joe Zhou 2022-11-24 00:49:41 +0800
  • 43ce1a0db9 Merge branch 'release/0.8.0' Joe Zhou 2022-11-24 00:49:41 +0800
  • 7d091def55 Upgrade fastjson, jruby, slf4j. Joe Zhou 2022-11-24 00:42:41 +0800
  • a7a06936f0 Fix requireMavenVersion. Joe Zhou 2022-11-24 00:33:30 +0800
  • 888682863c Update versions for release Joe Zhou 2022-11-24 00:30:07 +0800
  • 64e6a9800a Add dead-lock note for QueueScheduler. Joe Zhou 2022-11-24 00:26:09 +0800
  • e735e4e585 Log the remaining capacity. Joe Zhou 2022-11-20 18:31:36 +0800
  • 4915431845 Revise logging level from warn to info, as we have passed the exception to onError. refs #1094 Joe Zhou 2022-11-15 22:48:02 +0800