Atom feed #845

naquad · 2025-05-14T18:00:36Z

An implementation of the reporter, storing the updates in the Atom feed at the specified location.

naquad · 2025-07-03T10:04:18Z

Erm are we going somewhere with this or it'll just fail time to time because of the formatting settings?

thp · 2025-07-07T04:55:24Z

Erm are we going somewhere with this or it'll just fail time to time because of the formatting settings?

[...]/lib/urlwatch/reporters.py:1173:1: E302 expected 2 blank lines, found 1
[...]/lib/urlwatch/reporters.py:1231:57: W504 line break after binary operator

thp · 2025-07-07T05:13:44Z

The two remaining issues are pycodestyle bugs, fixed in later versions, see: PyCQA/flake8#1845 (comment)

This is taken care of in #847 (and should be fixed once you merge upstream thp:master into your branch).

thp

See comments. I think it's feature-complete, but making the _e() thing a bit less smart would make maintaining this easier.

thp · 2025-08-01T12:32:25Z

lib/urlwatch/reporters.py

+
+                logger.warning("%s: invalid atom feed", self.config['path'])
+        except etree.LxmlError as e:
+            logger.warning("failed to parse %s: %s", self.config['path'], e)


Maybe mention here that a new file will be created. Should the old file be copied somewhere to not cause data loss?

thp · 2025-08-01T12:34:36Z

lib/urlwatch/reporters.py

+                # fix the namespaces
+                for elem in tree.iter():
+                    if hasattr(elem, 'tag') and elem.tag.startswith(nspfx):
+                        elem.tag = elem.tag[len(nspfx):]


Could this be done instead with using namespace features in etree?

.iter() doesn't seem to support namespaces, but .iterfind() seems to do..

thp · 2025-08-01T12:36:20Z

lib/urlwatch/reporters.py

+    def _attrs_equal(self, a, b, exist):
+        for k in a.keys() | b.keys():
+            if (
+                k not in exist and a.get(k) != b.get(k)
+                or k in exist and k not in a
+            ):
+                return False
+
+        return True


Can you add a comment here, explaining in words what this is supposed to do in terms of "a", "b" and "exist"? It seems to compare "a" and "b" for equality, somehow taking "exist" into account.

thp · 2025-08-01T12:38:01Z

lib/urlwatch/reporters.py

+    def _entry_updated(self, entry):
+        """Tries to fetch the updated timestamp from the entry"""
+        updated = entry.find('./updated')
+        return updated is not None and updated.text or '2099-01-01T00:00:00Z'


Why the magical value of 2099? Can we just not have a value here, or use the current time/date?

thp · 2025-08-01T12:39:54Z

lib/urlwatch/reporters.py

+
+        maxitems = self.config.get('maxitems', 0)
+        if maxitems < 0:
+            logger.warning("atom: maxitems can't be negative")


Also, what is the effect here? Ignoring and using all items? Maybe the warning should say that, something like "maxitems can't be negative, not limiting amount of items" or something.

thp · 2025-08-01T12:40:17Z

lib/urlwatch/storage.py

+            'enabled': False,
+            'maxitems': 50,
+            'path': '/path/to/feed.xml',
+            'title': 'URLWatch Updates',


Branding.

Suggested change

'title': 'URLWatch Updates',

'title': 'urlwatch Updates',

thp · 2025-08-01T12:41:17Z

share/man/man5/urlwatch-reporters.5

 .\" indent \\n[an-margin]
 .\" old: \\n[rst2man-indent\\n[rst2man-indent-level]]
 .nr rst2man-indent-level -1
 .\" new: \\n[rst2man-indent\\n[rst2man-indent-level]]


Possibly revert this file change for now. It's updated mechanically on release, and it seems to have more than just the added changes, so we shouldn't do the file update as part of this PR.

thp · 2025-08-01T12:42:39Z

lib/urlwatch/reporters.py

+        e = functools.partial(self._e, entry)
+
+        e("id", self._mkuuid())
+        e("title", f'{job_state.verb}: {job.pretty_name()}')
+
+        if job.location_is_url():
+            e("link", job.get_location(), target='href')
+        else:
+            e("summary", job.get_location())
+
+        content = self._format_content(job_state, cfg['diff'])
+        e("content", str(content), target='cdata', type='html')
+        e("updated", self._tsfmt(timestamp))


I'm not yet sure if I like the whole _e thing. It's probably clever and stuff, but I don't fully get it, and it's probably hard to maintain(?) if I don't understand it. Would it be possible to split it up into multiple functions? If not, why not?

thp · 2025-08-01T12:43:35Z

docs/source/reporters.rst

+     # Optional: Unique feed ID (automatically generated if omitted)
+     id: "urn:uuid:ffa6dc6e-7436-48f6-bc99-020ab1e7d429"
+     # Optional: Title of the feed
+     title: "URLWatch"


Branding.

Suggested change

title: "URLWatch"

title: "urlwatch changes"

naquad added 4 commits April 18, 2025 23:37

atom feed reporter

3dad234

python 3.10 compat, minor cleanup

948acab

ensure the feed always contains the required tags

4c6f7c0

doc update

1610734

thp mentioned this pull request Jun 6, 2025

How can i add a custom reporter in the urlwatch?like wechat? #665

Open

pep8 format

df93703

PEP-8 fixes

43dfe39

Merge branch 'thp:master' into master

3b3c34d

thp requested changes Aug 1, 2025

View reviewed changes

Uh oh!

Atom feed #845

Are you sure you want to change the base?

Atom feed #845

Conversation

naquad commented May 14, 2025

Uh oh!

naquad commented Jul 3, 2025

Uh oh!

thp commented Jul 7, 2025

Uh oh!

thp commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thp left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

thp commented Jul 7, 2025 •

edited

Loading