[英]wget, download linked files with specific ending
Want to download all wordlists from this site https://wiki.skullsecurity.org/Passwords 想要从此站点下载所有单词列表https://wiki.skullsecurity.org/Passwords
I tried 我试过了
wget https://wiki.skullsecurity.org/Passwords --no-check-certificate --accept "*.bz2" -r
but does only download the targeted page.. 但只会下载目标网页。
What didn't work either: 什么也不起作用:
wget https://downloads.skullsecurity.org/passwords/ --no-check-certificate -m
(tried different combinations of -m and -r) (尝试使用-m和-r的不同组合)
Tried also with --user-agent
to prevent wget preventing from downloading 也尝试过--user-agent
以防止wget阻止下载
Tried -l 3
, still no success..facepalm 尝试-l 3
,仍然没有成功..facepalm
This works for me: 这对我有用:
wget -e robots=off -r -np -nH --accept "*.bz2" http://downloads.skullsecurity.org/passwords/
Read about Robot Exclusion 了解有关机器人排除的信息
If you know what you are doing and really really wish to turn off the robot exclusion, set the robots variable to 'off' 如果您知道自己在做什么,并且真的希望关闭机器人排除功能,请将机器人变量设置为“关闭”
Site http://downloads.skullsecurity.org/ contains robot.txt with content 网站http://downloads.skullsecurity.org/包含robot.txt及其内容
User-agent: *
Disallow: /
Explanation 说明
The
Disallow: /
tells the robot that it should not visit any pages on the site.Disallow: /
告诉机器人不应访问该网站上的任何页面。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.