Scrape Multiple items from a webpage

Prince_Bhatia · (This post was last modified: Sep-10-2017, 01:13 PM by Prince_Bhatia.)

i am trying to scrape image and table from a wikipedia page and write it into csv but i am confused that how to club them together and write this data into csv.

below are my codes

from urllib.request import urlopen
from bs4 import BeautifulSoup

url = "https://en.wikipedia.org/wiki/Kevin_Bacon"
html = urlopen(url)
soup = BeautifulSoup(html, "html.parser")

newfile = "Newlyout.csv"
f = open(newfile, "w")
Headers = "Year, Association, Category, Nominated, Results, Imagelink\n"
f.write(Headers)

soup1 = soup.find_all("img")
for i in soup1:
    Image = i['src']
    
    #ddprint(Image['src'])
    soup3 = soup.find("table", {"class":"wikitable sortable"})
    for tag in soup3.find_all("tr"):
        cell = tag.find_all("td")
        
        if len(cell) == 5:
            Year = cell[0].find(text=True)
            Association = cell[2].find(text=True)
            Category = cell[3].find(text=True)
            Nominated = cell[4].find(text=True)
            Results = cell[4].find(text=True)
            f.write("{}".format(Year)+ ",{}".format(Association)+ ",{}".format(Category) + ",{}".format(Nominated) + ",{}".format(Results)+ ",{}".format(Image)+"\n")
            
f.close()

i got it solved till here but it is repeating the data..and in images there are multiple images in one single cell....all i need table and against it all images in that page..

**nilamo** · Sep-11-2017, 04:56 PM

Quote:

soup1 = soup.find_all("img")
for i in soup1:
    Image = i['src']
     
    #ddprint(Image['src'])
    soup3 = soup.find("table", {"class":"wikitable sortable"})

So for every image on the page... find all the tables with a certain class, and then do more stuff.
Those sound like two different things.

Prince_Bhatia · Sep-12-2017, 06:08 AM

Problem is that i am not able to club them them together into csv file...but it is repeating the data..and in images there are multiple images in one single cell....all i need table and against it all images in that page..

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Scrape table from multiple pages	Nhattanktnn	1	2,325	Jun-07-2023, 09:35 AM Last Post: Larz60+
	scrape data 1 go to next page scrape data 2 and so on	alkaline3	6	10,407	Mar-13-2020, 07:59 PM Last Post: alkaline3
	Pass multiple items from one parse to another using Scrapy	nazmulfinance	2	6,429	Jan-23-2020, 06:44 PM Last Post: nazmulfinance
	Need advice how to scrape a Chinese webpage	omar	2	3,164	Nov-21-2019, 12:30 PM Last Post: snippsat
	Scrape multiple urls LXML	santdoyle	1	4,596	Oct-26-2019, 09:53 PM Last Post: snippsat
	How to create Flask-Stripe Checkout and Charge for the multiple items	Antares	4	8,931	Jul-05-2019, 10:20 AM Last Post: Antares
	display multiple sensors on webpage python flask jinja	pascale	6	7,834	Jan-29-2019, 10:10 AM Last Post: pascale
	Multiple input box in a webpage have same XPATH	sumandas89	3	7,650	Jul-19-2018, 08:41 AM Last Post: buran
	Scrape multiple lines with regex	greetings	2	4,454	Jul-04-2018, 09:09 PM Last Post: snippsat

Scrape Multiple items from a webpage

User Panel Messages

Announcements