爬取豆瓣前25Top好评电影

import requests
from bs4 import BeautifulSoup
from openpyxl import Workbook

wb = Workbook()
sheet = wb.active

url = ‘https://movie.douban.com/top250’

ret = requests.get(url=url)
data = ret.text

soup = BeautifulSoup(data, ‘html.parser’)

ol = soup.find(name=‘ol’, attrs={‘class’: ‘grid_view’})

li_list = ol.find_all(name=‘li’)

sheet.titlt = ‘豆瓣好评电影’

sheet[‘A1’].value = ‘序号’
sheet[‘B1’].value = ‘电影名称’
sheet[‘C1’].value = ‘电影评分’
sheet[‘D1’].value = ‘电影链接’
sheet[‘E1’].value = ‘电影图片’

count = 1
for li in li_list:
name = li.find(name=‘span’, attrs={‘class’:‘title’})
rat = li.find(name=‘span’, attrs={‘class’: ‘rating_num’})
url = li.find(name=‘a’)
img = li.find(name=‘img’)
count += 1
sheet[‘A%s’ % count].value = count - 1
sheet[‘B%s’ % count].value = name.text
sheet[‘C%s’ % count].value = rat.text
sheet[‘D%s’ % count].value = url[‘href’]
sheet[‘E%s’ % count].value = img[‘src’]

wb.save(‘好评电影.xlsx’)


本文来自互联网用户投稿,文章观点仅代表作者本人,不代表本站立场,不承担相关法律责任。如若转载,请注明出处。 如若内容造成侵权/违法违规/事实不符,请点击【内容举报】进行投诉反馈!

相关文章

立即
投稿

微信公众账号

微信扫一扫加关注

返回
顶部