求助!如何爬取当当网畅销榜图书的折扣,就差这一个

import requests
from bs4 import BeautifulSoup
import re
import csv

headers = {‘User-Agent’: ‘Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3314.0 Safari/537.36 SE 2.X MetaSr 1.0’}

def getHTMLText(url):
try:
r = requests.get(url, headers=headers)
r.raise_for_status()
r.encoding = r.apparent_encoding
return r.text
except Exception as e:
print(e)

def getBookInfo(html, lst):
soup = BeautifulSoup(html, ‘html.parser’)
tag_ul = soup.find(‘ul’, attrs={‘class’: ‘bang_list clearfix bang_list_mode’})
tag_li = tag_ul.find_all(‘li’)
for book in tag_li:
book_info = {}

    book_rank = book.find('div', attrs={'class': re.compile('list_num$')})book_name = book.find('div', attrs={'class': 'name'}).abook_comments = boo


本文来自互联网用户投稿,文章观点仅代表作者本人,不代表本站立场,不承担相关法律责任。如若转载,请注明出处。 如若内容造成侵权/违法违规/事实不符,请点击【内容举报】进行投诉反馈!

相关文章

立即
投稿

微信公众账号

微信扫一扫加关注

返回
顶部