You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As an end-user, I would like to be able to modify the dupe spotter expression list and be able to update it during runtime like other configuration options.
I would also like to be apply different defaults into more intentional sets of defaults for specific website types.
Right now, it is currently hard-coded into dupespotter.py but may require more thought as to how to expose the list of expressions and keep it up to date (write to file to update).
What happens when the list is updated but an invalid expression is found? I think skipping the line and printing out an error should be sufficient.
The text was updated successfully, but these errors were encountered:
Yeah, it would be nice to be able to customize dupespotter. But because most users won't, it probably makes sense to fix it in grab-site for more websites that anyone would care to crawl.
Also note that it was written a while ago and the site-specific parts of it are probably out of date.
process_body(body, url) in dupespotter.py
As an end-user, I would like to be able to modify the dupe spotter expression list and be able to update it during runtime like other configuration options.
I would also like to be apply different defaults into more intentional sets of defaults for specific website types.
Right now, it is currently hard-coded into dupespotter.py but may require more thought as to how to expose the list of expressions and keep it up to date (write to file to update).
What happens when the list is updated but an invalid expression is found? I think skipping the line and printing out an error should be sufficient.
The text was updated successfully, but these errors were encountered: