magic-Dependency/webmagic-core
yihua.huang 41c2ea9498 refactor of selectable cont' #113
1. remove lazy init of Html
2. rename strings to sourceTexts for better meaning
3. make getSourceTexts abstract and DO NOT always store strings
4. instead store parsed elements of document in HtmlNode
2014-05-27 17:34:19 +08:00
..
src refactor of selectable cont' #113 2014-05-27 17:34:19 +08:00
README.md update readme 2013-08-07 09:45:38 +08:00
pom.xml Enhance auto charset detect #126 2014-05-26 17:45:30 +08:00

README.md

webmagic-core

webmagic核心部分。只包含爬虫基本模块和基本抽取器。webmagic-core的目标是成为网页爬虫的一个教科书般的实现。