King Chun O

BeautifulSoup is one python module for getting content from a web page.

This python module can be used to scrape data from any web data by making calls

pip install beautifulsoup4pip install requests

using the python Beautiful Soup python module.

The syntax for scraping text using python beautifulsoup is as follows:

import bs4 as bsimport requesturl = “www.google.com"content = bs.read(url)

While scraping the data we should handle the following python exceptions:

>> BeautifulSoupSyntaxError: invalid … or tag in line 1 and many more python exceptions.

The below python code will not generate any python exception to scrap the web content.

import bs4 as bsimport requestsurl = "www.google.com"content = bs.read(url)

http://docs.python-requests.org/en/master/user/quickstart/#quick-exceptions

The python code to scrape the python documentation using python beautiful soup

--

--