python爬取网络小说 中文‘乱码’,因为不知道是否是乱码,所以加了引号
代码如下
# -- coding:utf8 --
from bs4 import BeautifulSoup
import requests
url = "http://www.cishuge.com/read/0/250/"
web_data = requests.get(url)
soup = BeautifulSoup(web_data.text, 'lxml')
titles = soup.select('#readerlist > ul > li > a')
for title in titles:
data = {
'title': title.get('title'),
'link': title.get('href')
}
print(data)
目标网页为http://www.cishuge.com/read/0/250/
运行结果如下图所示
链接能正常显示,文章标题貌似‘乱码’
百度下没找到解决方法,特来求助各位前辈
补充:运行环境 windows10, python3, pycharm
Copyright 2014-2026 https://www.php.cn/ All Rights Reserved | php.cn | 湘ICP备2023035733号
业精于勤,荒于嬉;行成于思,毁于随。