I want to extract only text in list format from tag list extracted by BeautifulSoup

 from bs4 import BeautifulSoup

r=requests.get("***************")

soup = BeautifulSoup(r.content, "html.parser")

class=soup.find_all("div", class_="word")

At this rate, scraping will remain in the list surrounded by html tags.

I don't need tags, so I'd like to keep them in a list format with the tags removed.

python web-scraping beautifulsoup

2022-09-30 14:02

1 Answers

If you only need text, you can do the following.

 from bs4 import BeautifulSoup
import requests

r=requests.get("***************")

soup = BeautifulSoup(r.content, "html.parser")

wordclass=soup.find_all("div", class_="word")

wordlist = [x.text for x in wordclass ]

print(wordlist)

2022-09-30 14:02

If you have any answers or tips

Popular Tags

python x 4647

android x 1593

java x 1494

javascript x 1427

c x 927

c++ x 878

ruby-on-rails x 696

php x 692

python3 x 685

html x 656