Looping through multiple pages with changing url

Qaruri · Jan-16-2020, 09:11 PM

Hi, I am new to web scraping and have just managed to write my first working script. However, it is only able to extract data from the first page, and I have not been able to apply the solutions offered online successfully. I would be very glad if someone could help me write a complete script that extracts data from all pages. Below is my current working script:

from urllib.request import urlopen as uReq
from bs4 import BeautifulSoup as soup

my_url = 'https://www.merchantcircle.com/search?q=self-storage&qn='

#opens connection, grabbing the page
uClient = uReq(my_url)
page_html = uClient.read()
uClient.close()

#page parser
page_soup = soup(page_html, "html.parser")

businesses = page_soup.findAll("div",{"class":"hInfo vcard"})


filename = "storage.csv"
f = open(filename, "w")

#headers = "brand, product_name, price, shipping\n"
headers = "biz_name, biz_address, biz_phone_num\n"

f.write(headers)

for business in businesses:

	#grabs business name
	biz_name = business.h2.a.text.strip()

	#grabs business address
	address = business.find("a",{"class":"directions"})
	biz_address = address.text.strip()

	#grabs phone number
	phone_num = business.find("a",{"class":"phone digits tel"})
	biz_phone_num = phone_num.text.strip()

	print("biz_name: " + biz_name)
	print("biz_address: " + biz_address)
	print("biz_phone_num: " + biz_phone_num)

	f.write(biz_name + "," + biz_address.replace(",", "|") + "," + biz_phone_num + "\n")

f.close()

**Larz60+** · Jan-16-2020, 11:11 PM

I'm not sure which links you are interested in, but using the first result in 'businesses' from line 14 (for example):
You can pull the link to kabbage by adding after line 14
next_url = businesses[0].h2.a.get('href')
and the review link for the same result using review_url = businesses[0].div.a.get('href')
(findAll returns a list of results, so index into it before accessing tags.)
I'd also use requests rather than urllib.
See the examples in the following two threads:
web scraping part1
web scraping part2
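
One thing to watch when pulling href values like this: they may be relative to the site rather than full URLs. Here is a minimal sketch, assuming that is the case, of resolving them against the page they came from with the standard library's urljoin. The helper name absolute_link and the path '/business/example' are made up for illustration and are not taken from the actual site.

```python
from urllib.parse import urljoin

# The search page the links were scraped from (taken from the script above).
SEARCH_URL = 'https://www.merchantcircle.com/search?q=self-storage&qn='


def absolute_link(page_url, href):
    """Resolve an href found on page_url into a full URL.

    Hrefs that are already absolute are returned unchanged.
    """
    return urljoin(page_url, href)


# '/business/example' is a hypothetical relative href, for illustration only.
print(absolute_link(SEARCH_URL, '/business/example'))
```

If the hrefs turn out to be absolute already, the call is harmless, so it can stay in place either way.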

Qaruri · Jan-17-2020, 01:55 PM

Oh, sorry for the omission. I'm interested in the page links, so that I can extract all 65,000+ records. Currently I can only extract the 21 records on page one.
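
For looping over the result pages, the usual approach is to rebuild the URL with a page number and stop when a page comes back empty. The sketch below shows only that loop. The query parameter name "p" is an assumption, not something confirmed for this site: open page two in a browser and check what actually changes in the address bar. The names build_page_url and scrape_all_pages are also invented for this example, and fetch and parse are placeholders for the download and BeautifulSoup code from the script above.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def build_page_url(base_url, page, page_param="p"):
    """Return base_url with the page number set in its query string.

    page_param="p" is an assumption; use whatever parameter the site's
    own "next page" link uses.
    """
    parts = urlsplit(base_url)
    # keep_blank_values preserves empty parameters such as "qn="
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query[page_param] = str(page)
    return urlunsplit(parts._replace(query=urlencode(query)))


def scrape_all_pages(base_url, fetch, parse, max_pages=5, page_param="p"):
    """Collect rows from page 1 onward until a page yields no rows.

    fetch(url) must return the page HTML, and parse(html) must return a
    list of rows. Both are supplied by the caller, so this loop does not
    depend on any particular HTTP or parsing library.
    """
    rows = []
    for page in range(1, max_pages + 1):
        html = fetch(build_page_url(base_url, page, page_param))
        page_rows = parse(html)
        if not page_rows:
            # An empty page is treated as the end of the results.
            break
        rows.extend(page_rows)
    return rows
```

To use it with the original script, fetch would wrap the uReq/read/close lines and parse would wrap the findAll loop, returning a list of (biz_name, biz_address, biz_phone_num) tuples instead of writing them out directly. Keep max_pages small while testing, and consider adding a short time.sleep between requests before running it over thousands of pages.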
