PHP-spider

程序名称:PHP-spider

授权协议: GPL

操作系统: 跨平台

开发语言: PHP

PHP-spider 介绍

一个可扩展的PHP WEB 蜘蛛,示例代码

use VDB\Spider\Spider;
use VDB\Spider\discoverer\XPathExpressiondiscoverer;

$spider = new Spider('http://www.oschina.net');

特性:

  • supports two traversal algorithms: breadth-first and depth-first

  • supports depth limiting and queue size limiting

  • supports adding custom URI discovery logic, based on XPath, CSS selectors, or plain old PHP

  • comes with a useful set of URI filters, such as Domain limiting

  • supports custom URI filters, both prefetch (URI) and postfetch (Resource content)

  • supports custom request handling logic

  • comes with a useful set of persistence handlers (memory, file. Redis soon to follow)

  • supports custom persistence handlers

  • collects statistics about the crawl for reporting

  • dispatches useful events, allowing developers to add even more custom behavior

  • supports a politeness policy

  • will soon come with many default discoverers: RSS, Atom, RDF, etc.

  • will soon support multiple queueing mechanisms (file, memcache, redis)

  • will eventually support distributed spidering with a central queue

PHP-spider 官网

http://php-spider.org/

相关编程语言

Pacman 是一个软件包管理器, 作为 ArchLinux 发行版...
Smb4K 是KDE下的网络共享浏览器 更多屏幕截图请看:...
Wine (“Wine Is Not an Emulator” 的首字母缩写)...
虚拟桌面软件,可管理最多9个虚拟桌面,你可以用热键...
UNetbootin (Universal Netboot Installer)为一种跨...
Cobbler 可以用来快速建立 Linux 网络安装环境,它已...