简体   繁体   English

wget,下载具有特定结尾的链接文件

[英]wget, download linked files with specific ending

Want to download all wordlists from this site https://wiki.skullsecurity.org/Passwords 想要从此站点下载所有单词列表https://wiki.skullsecurity.org/Passwords

I tried 我试过了

wget https://wiki.skullsecurity.org/Passwords --no-check-certificate --accept "*.bz2" -r

but does only download the targeted page.. 但只会下载目标网页。

What didn't work either: 什么也不起作用:

wget https://downloads.skullsecurity.org/passwords/ --no-check-certificate -m

(tried different combinations of -m and -r) (尝试使用-m和-r的不同组合)

Tried also with --user-agent to prevent wget preventing from downloading 也尝试过--user-agent以防止wget阻止下载

Tried -l 3 , still no success..facepalm 尝试-l 3 ,仍然没有成功..facepalm

This works for me: 这对我有用:

 wget -e robots=off -r -np -nH --accept "*.bz2"  http://downloads.skullsecurity.org/passwords/

Read about Robot Exclusion 了解有关机器人排除的信息

If you know what you are doing and really really wish to turn off the robot exclusion, set the robots variable to 'off' 如果您知道自己在做什么,并且真的希望关闭机器人排除功能,请将机器人变量设置为“关闭”

Site http://downloads.skullsecurity.org/ contains robot.txt with content 网站http://downloads.skullsecurity.org/包含robot.txt及其内容

User-agent: *
Disallow: /

Explanation 说明

The Disallow: / tells the robot that it should not visit any pages on the site. Disallow: /告诉机器人不应访问该网站上的任何页面。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM