Happy to be featured on the Alibaba Cloud Tech Share blog.
By Mark Andrews, Alibaba Cloud Tech Share Author. Tech Share is Alibaba Cloud’s incentive program to encourage the sharing of technical knowledge and best practices within the cloud community.
Today we will be writing our own basic headless web scraping “bot” in Python with Beautiful Soup. Headless generally means web browsing with no GUI (Graphical User Interface). In this lesson, we will be doing everything through the terminal command line.
We will deploy an Alibaba Cloud Elastic Compute Service (ECS) burstable type t5 nano instance running CentOS 7. We will be utilizing the Requests and Beautiful Soup 4 modules.
Full Tutorial is below.
https://www.alibabacloud.com/blog/headless-web-scraping-in-python-with-beautiful-soup-4_593881