I want Python scraping on the cloud (preferably in parallel)

Asked 1 years ago, Updated 1 years ago, 113 views

Python is trying to scrape airbnb sites.
I want to keep my PC running regularly even if I close it, so I would like to do the scraping process in parallel on the cloud, if possible.(The scraping itself is coded.)
I would appreciate it if you could let me know if there are any recommendations.

python web-scraping

2022-09-30 21:38

1 Answers

I summarized it briefly under the constraints of the cloud environment.

  • Use cron to launch scripts periodically
    • cron is a daemon that runs jobs periodically, and you can specify when to run them in minutes.
    • cron-ArchWiki
  • Periodically run the acquisition process while starting the application all the time
  • cron is a daemon that runs jobs periodically, and you can specify when to run them in minutes.
  • cron-ArchWiki

These methods are traditional and documentative, but they also incur the cost of constantly starting the server and monitoring and maintaining the server itself.

Therefore, in recent years, the following methods have become more mainstream:

However, this method cannot store data locally, so we need to prepare a separate destination for storing data.You also need to write code according to FaaS style (function name and execution time are limited).

Choose which one you want to choose according to your requirements.


2022-09-30 21:38

If you have any answers or tips


© 2024 OneMinuteCode. All rights reserved.