gz51837844 d3c7084cf7 上传douban.py, 修改之前的小bug 8 anni fa
..
simpleSpider d3c7084cf7 上传douban.py, 修改之前的小bug 8 anni fa
tmSpider d3c7084cf7 上传douban.py, 修改之前的小bug 8 anni fa
README d3c7084cf7 上传douban.py, 修改之前的小bug 8 anni fa
anjuke.py 9b34e43858 添加实战代码anjuke.py 8 anni fa
crawl_gooseeker_bbs.py 3c5c9b7e21 update class name from gsExtractor to GsExtractor 8 anni fa
douban.py d3c7084cf7 上传douban.py, 修改之前的小bug 8 anni fa
result1.xml 9b34e43858 添加实战代码anjuke.py 8 anni fa
result2.xml 9b34e43858 添加实战代码anjuke.py 8 anni fa
xslt_bbs.xml f14549c2c8 Upload craw_gooseeker_bbs.py , xslt_bbs.xml 8 anni fa

README

# Created at 15:10, May 18,2016
# Updated at 15:20, Jul 6,2016

目录文件说明
================
crawler

- anjuke.py 采集安居客房产经纪人
- result1.xml 安居客房产经纪人结果文件1
- result2.xml 安居客房产经纪人结果文件2
- crawl_gooseeker_bbs.py 采集集搜客论坛内容
- xslt_bbs.xml 集搜客论坛内容提取本地xslt文件
- douban.py 采集豆瓣小组讨论话题

- simpleSpider 一个小爬虫(基于Scrapy开源框架)
- tmSpider 采集天猫商品信息(基于Scrapy开源框架)