Python crawler keeps crawling the content of the first page repeatedly #2379

Q-yh-ouo · 2020-09-01T04:57:33Z

    import requests
    import urllib.request
    from lxml import html

    # URL of the site to scrape
    j = 0
    #!s-p3
    for k in range(1, 20):
        url = 'https://www.duitang.com/search/?kw=%E6%AD%A3%E5%A4%AA&type=feed' + '#!s-p' + str(k)
        page = requests.Session().get(url)
        tree = html.fromstring(page.text)
        result = tree.xpath('//a[@class="a"]//img/@src')
        # Get the data we need
        for i in result:
            urllib.request.urlretrieve(i, 'C://Users//FangJZ//Desktop//duitan//' + str(j) + '.jpg')
            j = j + 1
    print('true')
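One likely cause of the repeated first page, offered as a sketch rather than a confirmed diagnosis: everything after `#` in a URL is a fragment, and HTTP clients do not send fragments to the server. The snippet below makes no network calls. It only splits the URLs that the loop above builds and shows that they all share one request target. The helper `request_target` is a hypothetical name introduced for this illustration.

```python
from urllib.parse import urlsplit

# Same base URL as in the issue; only the '#!s-p<k>' suffix changes per page.
BASE = 'https://www.duitang.com/search/?kw=%E6%AD%A3%E5%A4%AA&type=feed'


def request_target(url):
    """Hypothetical helper: the path plus query that a client sends to the server.

    The fragment (the part after '#') is deliberately left out, because it is
    handled on the client side and never transmitted in the HTTP request.
    """
    parts = urlsplit(url)
    target = parts.path or '/'
    if parts.query:
        target += '?' + parts.query
    return target


urls = [BASE + '#!s-p' + str(k) for k in range(1, 20)]
targets = {request_target(u) for u in urls}      # unique server-side targets
fragments = [urlsplit(u).fragment for u in urls]  # client-side only

print(len(urls), 'URLs built')
print(len(targets), 'distinct request target(s)')
print('first fragments:', fragments[:3])
```

If this is what is happening, the page number has to reach the server some other way, for example through a query parameter or a separate data request that the page's own JavaScript makes. Which one this site uses is not stated in the issue, so it would need to be checked in the browser's network tools.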


TheAlgorithms / Python
