Commit Graph

1292 Commits (50167006588e93493a5f0c0825796945505ab0f2)

Author SHA1 Message Date
yihua.huang 1d86f7c048 compile passed in httpclientDownloader 2017-03-20 22:40:14 +08:00
yihua.huang b71f379512 fix 2017-03-18 12:18:00 +08:00
yihua.huang a7f9e7cad5 重构一部分httpclient 2017-03-18 12:16:21 +08:00
yihua.huang 221c155060 move release connection before return proxy #396 2017-03-18 11:15:36 +08:00
yihua.huang 68beff42c5 add test #493 2017-03-18 11:01:30 +08:00
Yihua Huang d6b4e98a66 Merge pull request #493 from cheiftain/my_change
Bug, add null check to site in HttpClientDownloader & HttpClientGener…
2017-03-18 10:49:19 +08:00
xbynet 76729c9302 Merge pull request #2 from code4craft/master
合并官方最新代码
2017-03-17 16:07:56 +08:00
wuyifan 79522f941e Bug, add null check to site in HttpClientDownloader & HttpClientGenerator 2017-03-17 14:10:54 +08:00
yihua.huang e9341d0291 complete test #447 2017-03-17 07:54:28 +08:00
yihua.huang e7d35c4846 add params to all method of request #447 2017-03-17 07:18:05 +08:00
yihua.huang 75bad591d7 rewrite hashCode and equals for params #447 2017-03-17 07:10:14 +08:00
Yihua Huang 11c32669b2 Merge pull request #447 from xbynet/master
简化POST参数设置.
2017-03-17 07:06:46 +08:00
yihua.huang aa01e27779 change constructor for Proxy to public #490 2017-03-17 07:02:02 +08:00
yihua.huang 0fbf657d86 update fastjson to 1.2.28 #489 2017-03-17 06:59:28 +08:00
Yihua Huang 676758349f Merge pull request #492 from eyougo/master
fix a bug of RegexSelector when regex has zero-width assertions.
2017-03-17 06:55:07 +08:00
mei 791520e6a0 fix a bug of RegexSelector when regex has zero-width assertions. 2017-03-17 00:06:15 +08:00
yihua.huang c175ea88c0 #more test #484 2017-03-11 11:43:18 +08:00
yihua.huang 9b964c0a99 test for #484 2017-03-11 11:41:01 +08:00
yihua.huang fc702fd3b6 introduce mockito for test 2017-03-11 11:31:15 +08:00
yihua.huang 5215a492cc remove duplicate check for POST request #484 2017-03-11 11:26:13 +08:00
yihua.huang 45bf2b6fd7 remove javadoc link because it's out of date 2017-03-11 11:01:25 +08:00
yihua.huang 2a35bb4688 remove contributors because it's hard to maintain the list: see https://github.com/code4craft/webmagic/graphs/contributors instead 2017-03-11 10:59:55 +08:00
yihua.huang 0a1fb19052 add tests #483 2017-03-11 10:56:31 +08:00
yihua.huang a2e7f0004b Merge branch 'master' of github.com:code4craft/webmagic 2017-03-11 10:52:54 +08:00
yihua.huang ef32571821 rewrite Request.equals and hashCode, add Method to check #483 2017-03-11 10:52:39 +08:00
yihua.huang 8b8f535c30 refactor:extract charset detect to utils 2017-03-11 10:43:10 +08:00
Yihua Huang 50247f9bc6 Merge pull request #478 from ckex/develop
fix bug
2017-03-06 09:43:11 +08:00
Ckex.zha e645524ad2 fix bug,set ExecutorService 2017-03-04 20:57:29 +08:00
yihua.huang 11904a4d41 fix huaban demo #475 2017-03-04 11:43:28 +08:00
yihua.huang 895fca9fd7 修复seleniumDownloader配置文件写死的问题 #475 2017-03-04 11:34:06 +08:00
yihua.huang d87c73b472 change check-and-set to atomic sadd for redis DuplicateRemover #368 2017-03-01 22:24:34 +08:00
yihua.huang d6cd92b1a8 LICENSE file 2017-02-27 10:26:26 +08:00
yihua.huang a872a6480e fix code sample for github #348 2017-02-25 22:46:29 +08:00
yihua.huang 1d2171805f add test for #228 2017-02-25 22:30:48 +08:00
yihua.huang bbe0b52ddd remove synchronized in QueueScheduler #410 2017-02-25 19:55:45 +08:00
yihua.huang 00e81bd650 update common-collections to 3.2.2 #456 2017-02-25 19:51:03 +08:00
yihua.huang ad69963005 remove synchronize in Page #411 2017-02-25 19:42:12 +08:00
yihua.huang 3a796b9413 remove duplicate code #421 2017-02-25 12:01:12 +08:00
yihua.huang 42f1018010 remove messy code 2017-02-21 14:08:05 +08:00
xbynet 650468c0e4 解决POST中文参数乱码问题 2017-01-22 18:04:22 +08:00
xbynet 1f85674ae1 Merge pull request #1 from code4craft/master
test
2017-01-22 15:44:39 +08:00
yihua.huang 76076e51d8 update version in readme 2017-01-21 22:19:09 +08:00
yihua.huang aaccc93215 new version 2017-01-21 12:04:12 +08:00
yihua.huang 3e633c6871 version 2017-01-21 11:51:14 +08:00
yihua.huang f45e2f118b for release 2017-01-21 11:38:36 +08:00
yihua.huang d60615f503 修复使用startUrls没有设置domain导致使用cookie空指针的问题#438 2017-01-21 11:29:42 +08:00
yihua.huang 407fbb6130 refactor logger#445 2017-01-21 11:05:54 +08:00
Yihua Huang f29a10472f Merge pull request #414 from jsbd/master
新增构造函数,支持crawl.js路径自定义,因为当其他项目依赖此jar包时,runtime.exec()执行phantomjs命令时无使用法jar包中的crawl.js
2017-01-21 10:50:48 +08:00
Yihua Huang 93e7040fe5 Merge pull request #445 from ckex/develop
optimize code.
2017-01-21 10:45:32 +08:00
Ckex.zha 0dc26c8ca0 optimize code. 2017-01-20 14:03:26 +08:00