Problem
I want to use wget to download every page of a website, including pages reached through nested links. I tried the following command:
wget --reject php,xml --exclude-domains https://motamem.org/wp-content/plugins/ProProfile/ajax/upme-get-avatar.php?email=' + new_user_email,https://motamem.org/wp-admin/admin-ajax.php,https://wprp.sovrn.com/static/,https:\/\/motamem.org\/wp-admin\/admin-ajax.php,https://motamem.org/xmlrpc.php,https://motamem.org/feed/,https://motamem.org/wp-includes/wlwmanifest.xml,https://motamem.org/xmlrpc.php?rsd,https://motamem.org/wp-json/ -U "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/61.0.3163.79 Safari/537.36" -mkEpnp -l10 -e robots=off --page-requisites --html-extension --adjust-extension --convert-links https://motamem.org/
However, it has these problems:
Solution
Try:
wget -r --mirror --page-requisites --convert-links --span-hosts -U mozilla -F http://example.com
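One thing worth noting about the attempt in the question: wget's --exclude-domains option expects a comma-separated list of domain names, not full URLs, so path-level exclusions are usually expressed with --exclude-directories or --reject-regex instead. The sketch below shows one possible way to combine that with the mirroring flags from the answer. The host name, the excluded paths, the regex, and the DRY_RUN switch are illustrative assumptions, not something taken from the original post, and it has not been tested against the site in the question.

```shell
#!/usr/bin/env bash
# Sketch only: SITE_HOST, the excluded paths, and DRY_RUN are assumptions
# chosen for illustration; adjust them to the site being mirrored.
set -euo pipefail

SITE_HOST="${SITE_HOST:-example.com}"   # hypothetical target host
DRY_RUN="${DRY_RUN:-1}"                 # 1 = only print the command, do not download

wget_args=(
  --mirror                    # recursion with timestamping and unlimited depth
  --page-requisites           # also fetch images, CSS, and scripts needed by each page
  --convert-links             # rewrite links so the local copy is browsable
  --adjust-extension          # save HTML pages with an .html suffix
  --span-hosts                # allow other hosts ...
  --domains="$SITE_HOST"      # ... but only those under this domain
  --exclude-directories=/wp-admin,/wp-json,/feed   # skip whole directories by path
  --reject-regex='(xmlrpc|admin-ajax)\.php'        # skip URLs matching this pattern
  --user-agent="Mozilla/5.0"
  --wait=1                    # pause between requests to go easy on the server
  "https://$SITE_HOST/"
)

if [[ "$DRY_RUN" == "1" ]]; then
  # Print the command in a shell-quoted form for inspection.
  printf 'wget'
  printf ' %q' "${wget_args[@]}"
  printf '\n'
else
  wget "${wget_args[@]}"
fi
```

Running the script as-is only prints the assembled command. Setting DRY_RUN=0 and SITE_HOST to the real host starts the download. Pairing --span-hosts with --domains keeps the crawl from wandering onto unrelated sites, which --span-hosts alone would allow.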