Relative Content

Tag Archive for pythonweb-scrapingscrapy

How to run a scrapy Crawl Spider from terminal?

i made a code from a tutorial, its practically the same. The guy run it from terminal with a .csv file as output, but when i run it, it appears a lot of options for running the file but doesnt make anything that i want but doesnt throw any error apparently. What am i doing wrong?, i run it like this:

How to debug Scrapy in VS Code?

The problem is that I can’t debug Scrapy crawlers in VS Code. The problem is that always when I start debugging it breaks on one of my imports. Of course, I played a lot with that import in order to fix it, but it didn’t helped. I also tried with venv and without venv, but it doesn’t helped.

Return Multiple HtmlResponse in Scrapy Middleware

I’m trying to scrape a website with a scroll-down loading page. The page does not load new elements but just updates the element’s content when scrolling down. So I’m trying to use selenium in the DownloaderMiddleware to scroll, and I want to return the current page_sorce every time I scroll. (as it is different every time)