Linux下wget命令有哪些

发布时间：2021-12-03 11:30:20 作者：iii
来源：亿速云阅读：174

# Linux下wget命令详解

## 1. wget命令概述

### 1.1 wget简介

wget是GNU项目开发的一个非交互式网络下载工具，全称为"Web Get"，是Linux/Unix系统中最常用的命令行下载工具之一。它支持HTTP、HTTPS和FTP协议，能够递归下载网站内容，具有断点续传、带宽控制、镜像网站等强大功能。

wget的主要特点包括：
- 非交互式操作：可以在后台运行
- 支持断点续传
- 支持递归下载
- 支持代理服务器
- 支持SSL/TLS安全下载
- 支持限速下载
- 支持镜像整个网站

### 1.2 wget的历史与发展

wget最初由Hrvoje Nikšić于1996年开发，后被纳入GNU项目。经过20多年的发展，wget已成为Linux系统默认安装的标准工具之一，几乎所有Linux发行版都预装了wget。

## 2. wget安装与基本使用

### 2.1 安装wget

大多数Linux发行版已经预装了wget。如果没有安装，可以使用以下命令安装：

```bash
# Debian/Ubuntu
sudo apt-get install wget

# CentOS/RHEL
sudo yum install wget

# Fedora
sudo dnf install wget

# Arch Linux
sudo pacman -S wget

2.2 基本语法

wget的基本语法格式为：

wget [选项] [URL]

2.3 最简单的下载示例

下载单个文件：

wget https://example.com/file.zip

3. wget常用命令选项详解

3.1 下载控制选项

3.1.1 断点续传 (-c, –continue)

wget -c https://example.com/large-file.iso

3.1.2 后台下载 (-b, –background)

wget -b https://example.com/large-file.iso

3.1.3 限速下载 (–limit-rate)

wget --limit-rate=200k https://example.com/large-file.iso

3.2 输出控制选项

3.2.1 静默模式 (-q, –quiet)

wget -q https://example.com/file.txt

3.2.2 输出到日志 (-o, –output-file)

wget -o download.log https://example.com/file.txt

3.2.3 追加到日志 (-a, –append-output)

wget -a download.log https://example.com/file.txt

3.3 目录选项

3.3.1 指定下载目录 (-P, –directory-prefix)

wget -P /path/to/directory https://example.com/file.txt

3.3.2 创建目录结构 (-x, –force-directories)

wget -x https://example.com/path/to/file.txt

3.4 HTTP选项

3.4.1 使用HTTP认证 (–http-user, –http-password)

wget --http-user=username --http-password=password https://example.com/protected/file.txt

3.4.2 设置User-Agent (–user-agent)

wget --user-agent="Mozilla/5.0" https://example.com

3.5 FTP选项

3.5.1 FTP匿名登录 (–ftp-user, –ftp-password)

wget --ftp-user=anonymous --ftp-password=user@example.com ftp://example.com/file.txt

3.5.2 被动模式FTP (–passive-ftp)

wget --passive-ftp ftp://example.com/file.txt

3.6 递归下载选项

3.6.1 递归下载 (-r, –recursive)

wget -r https://example.com

3.6.2 递归深度 (-l, –level)

wget -r -l 2 https://example.com

3.6.3 不创建父目录 (-np, –no-parent)

wget -r -np https://example.com/path/

3.7 镜像选项

3.7.1 镜像整个网站 (-m, –mirror)

wget -m https://example.com

3.7.2 转换链接 (-k, –convert-links)

wget -mk https://example.com

3.8 时间戳选项

3.8.1 时间戳检查 (-N, –timestamping)

wget -N https://example.com/file.txt

3.8.2 忽略时间戳 (–no-use-server-timestamps)

wget --no-use-server-timestamps https://example.com/file.txt

3.9 重试选项

3.9.1 重试次数 (–tries)

wget --tries=10 https://example.com/unstable-file.txt

3.9.2 无限重试 (–tries=inf)

wget --tries=inf https://example.com/unstable-file.txt

3.10 代理选项

3.10.1 使用代理 (–proxy)

wget --proxy=on http://example.com

3.10.2 指定代理服务器 (-e, –execute)

wget -e use_proxy=yes -e http_proxy=10.0.0.1:8080 http://example.com

4. wget高级用法

4.1 批量下载

4.1.1 使用-i选项从文件读取URL

wget -i url-list.txt

4.1.2 使用通配符下载

wget https://example.com/files/file{1..10}.jpg

4.2 下载限制

4.2.1 限制下载文件大小 (–quota)

wget --quota=100m https://example.com/large-files/

4.2.2 限制下载文件类型 (-A, –accept)

wget -r -A.jpg,.png https://example.com/images/

4.2.3 排除特定文件类型 (-R, –reject)

wget -r -R.html https://example.com

4.3 网站镜像

4.3.1 完整镜像网站

wget --mirror --convert-links --adjust-extension --page-requisites --no-parent https://example.com

4.3.2 镜像时保持目录结构

wget -mkEpnp https://example.com

4.4 下载FTP目录

4.4.1 递归下载FTP目录

wget -r ftp://example.com/pub/

4.4.2 下载FTP目录但不递归

wget -nd -r ftp://example.com/pub/

4.5 下载受限内容

4.5.1 使用Cookie下载

wget --load-cookies cookies.txt https://example.com/members-only/

4.5.2 模拟浏览器下载

wget --header="Accept: text/html" --user-agent="Mozilla/5.0" https://example.com

5. wget实用技巧

5.1 下载整个网站供离线浏览

wget --mirror --convert-links --adjust-extension --page-requisites --no-parent https://example.com

5.2 下载YouTube视频

wget https://www.youtube.com/watch?v=VIDEO_ID

5.3 下载所有PDF文件

wget -r -l1 -A.pdf https://example.com/documents/

5.4 定时下载

while true; do wget -N https://example.com/updated-file.txt; sleep 3600; done

5.5 下载并自动解压

wget -q -O - https://example.com/archive.tar.gz | tar xz

6. wget配置文件

6.1 全局配置文件 (/etc/wgetrc)

6.2 用户配置文件 (~/.wgetrc)

6.3 常用配置示例

# 设置默认重试次数
tries = 10

# 设置默认限速
limit_rate = 100k

# 使用被动FTP
passive_ftp = on

7. wget常见问题解决

7.1 SSL/TLS证书问题

wget --no-check-certificate https://example.com

7.2 连接超时问题

wget --timeout=30 https://example.com

7.3 重定向问题

wget --max-redirect=5 https://example.com

7.4 大文件下载中断

wget -c --tries=inf https://example.com/large-file.iso

8. wget与其他工具的比较

8.1 wget vs curl

8.2 wget vs aria2

8.3 wget vs axel

9. wget脚本编程

9.1 使用wget编写下载脚本

9.2 结合cron实现定时下载

9.3 使用wget进行网站监控

10. wget安全注意事项

10.1 避免下载恶意文件

10.2 安全使用wget下载

10.3 wget与防火墙设置

11. wget的未来发展

11.1 wget2介绍

11.2 wget的新特性

12. 总结

wget是Linux系统中最强大、最灵活的命令行下载工具之一。通过本文的学习，您应该已经掌握了wget的基本用法和高级技巧，能够利用wget完成各种复杂的下载任务。

无论是简单的文件下载，还是复杂的网站镜像，wget都能胜任。结合脚本编程，wget可以自动化完成许多重复性的下载工作，大大提高工作效率。

”`

注意：由于篇幅限制，这里提供的是文章的大纲和主要内容框架。要扩展到11400字，需要在每个章节中添加更多详细说明、示例、使用场景、注意事项等内容。您可以根据这个框架进一步扩展每个部分的内容。