python3 爬虫怎样处理请求头 - 问答

在Python3中，使用requests库可以方便地处理请求头。以下是一个简单的示例，展示了如何使用requests库发送带有自定义请求头的爬虫请求：

首先，确保已经安装了requests库。如果没有安装，可以使用以下命令安装：

pip install requests

然后，编写一个简单的爬虫程序，如下所示：

import requests

def send_request(url, headers):
    response = requests.get(url, headers=headers)
    return response

def main():
    url = "https://example.com"  # 替换为您要爬取的URL
    headers = {
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3",
        "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8",
        "Accept-Language": "zh-CN,zh;q=0.8,en;q=0.6",
        "Connection": "keep-alive",
    }

    response = send_request(url, headers)
    print(response.text)

if __name__ == "__main__":
    main()

在这个示例中，我们定义了一个名为send_request的函数，该函数接受一个URL和一个请求头字典作为参数。我们使用requests.get()方法发送GET请求，并将请求头传递给该方法。然后，我们定义了一个名为main的函数，其中包含了我们要爬取的URL和自定义请求头。最后，我们调用send_request()函数并打印响应的文本内容。

请注意，您可以根据需要修改请求头字典中的键值对，以模拟不同的浏览器访问。

0 赞

0 踩