scraping-websites
Here are 1,032 public repositories matching this topic...
- Updated Jul 5, 2020 - Python
- Updated Nov 22, 2018 - JavaScript
- Updated Jul 19, 2021 - Python
Currently, in master, logs are always written to a file, and there is no way to turn this off.
I think a good improvement for this would be to make the file logging optional, as not everyone has log rotation configured or needs logging (especially ad-hoc users).
Mentioned first here: elixir-crawly/crawly#155 (comment)
Some improvements I can think of are:
- optional switch to enabl
- Updated Jun 12, 2020 - Go
- Updated Aug 19, 2021 - Python
- Updated Apr 15, 2021 - TypeScript
- Updated Jan 31, 2021 - Python
- Updated Aug 9, 2021 - Python
- Updated Jun 22, 2021 - TypeScript
It would be beneficial to return the URL of the sitemap.xml file directly.
- Updated May 9, 2021 - Python
- Updated Feb 28, 2019 - Python
- Updated Jul 17, 2021 - Go
- Updated Jun 29, 2021 - C#
- Updated Nov 1, 2018 - Java
- Updated Mar 4, 2021 - Python
The idea is to add an option like option 3 (do a Google search, save the URLs found, and search them for emails), but one that searches a list of phrases.
The list can be read from a .txt file.
The option can also ask how many Google search results to use.
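A minimal sketch of the phrase-list input described above, assuming a plain-text file with one phrase per line. The function name `load_phrases`, the blank-line skipping, and the de-duplication are illustrative assumptions, not part of the project; the Google search step itself is left out.

```python
from pathlib import Path


def load_phrases(path):
    """Return the non-empty, de-duplicated phrases from a text file, in file order."""
    phrases = []
    seen = set()
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        phrase = line.strip()
        # Skip blank lines and phrases that were already listed.
        if not phrase or phrase in seen:
            continue
        seen.add(phrase)
        phrases.append(phrase)
    return phrases
```

Each returned phrase could then be passed to the existing single-phrase search in turn, with the requested number of results applied per phrase.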
- Updated Jun 2, 2021 - Python
Is your feature request related to a problem? Please describe.
It would be useful to capture whether a listing is a Buy It Now (BIN) or an Auction listing and, for auctions, how many bids it received.
Describe the solution you'd like
Two more columns on the df, "Listing Type" and "Number of Bids", where the latter should be 0 for BIN listings. Really we could get away with just one column as all BIN wil
Pareto Plot Warning
- Updated Apr 7, 2021 - Python
Download video
Would a PR for a video download feature be welcome, or is the gem only for audio?
- Updated Aug 17, 2021 - Python
- Updated Mar 24, 2021 - Python
- Updated Oct 6, 2020 - Python
- Updated Jan 12, 2021 - Python
- Updated May 9, 2021 - Python
- Updated Aug 17, 2021 - Python
It's been a while since I updated the e2e tests, and some of them are failing (most of these are related to examples).
We also need to add e2e tests that cover headers and cookies for both drivers.