-
-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Study: ML-based scrapers #70
Comments
Hey, It's a good idea to use mlscraper as a backend. But first of all, we need data (inputs and outputs). |
Yes, I can see potential on this one. |
Autoscraper is another one that would be great to have in Dude. It learns the scraping rules and returns similar elements. It just needs a few examples and isn't complicated as mlscraper. Input: Output: Any ideas to add this one to Dude? Should I open a new issue for this? |
The thing is, I've been reading the source code of Autoscraper and it is not actually using Machine Learning or AI. It is just using Please correct me if I am wrong. I cannot categorize it as such, but for sure it learns by saving rules. |
Though it seems Autoscraper does not fall into this category, I believe it is a very powerful tool for web scraping and I'd love to include it. Please open a separate ticket. |
Done ✅ |
Possible format:
Potential backends:
The text was updated successfully, but these errors were encountered: