[WIP]model/messages/api_types: alert_words mark up #1314

Subhasish-Behera · 2023-03-02T18:57:41Z

What does this PR do?

Partial fix for #588

Tested?

Manually
Existing tests (adapted, if necessary)
New tests added (for any new behavior)
Passed linting & tests (each commit)

Commit flow

first commit:Fetches alert words and registers for any alert_word event and updates the alerts words.
second commit: adds transform_content_alert_words which adds <span class=alert-word> around alert-words

Notes

-> This PR(specifically the second commit) is based on the alert word implementation of the Zulip web client which was later adapted by other clients.
link for the web implementation code: link
-> Other helpful documentations for contributors:
1. regex wrt python
2. re module
-> Whats Done?

fetching and registering of alert_words
finding the alert_word patterns and applying the

-> What needs to be done?

Even though applying span logic is ready but the logic for applying markup for alert words inside of multiple elements is not ready. So basically the markup works for the alert-words without any other tags indie soup2markup. Before that we have to decide whether we want to apply different markup when multiple tags are present for a alert_word or we have to choose the markup from any one of them.
Currently the markup applied for alert-words is same as code tag i.e msg_code as the markup for different zt_themes are yet to be discussed.
Tests are not added as the non-test code is yet to reach a stable state.

neiljp

@Subhasish-Behera Thanks for looking at this other piece of the alert words puzzle 👍

I've focused on the first commit, since we need the alert words first in order to make progress, at least in the application itself.

It'd be good to split the first commit into two parts:

getting the initial data
updating the data

While forgetting to include the updating wouldn't be great (and we have that with other data right now), alert words aren't often updated, so you should be able to manually test a new first commit easily - and even the highlighting if you wanted. That first commit could be merged sooner on it's own once it looks good.

Please note my point about the alert words being 'private' to the model; while other parts of the code simply access inside the model, ideally we want to use a method for this (and migrate the others too). That would sit in the new first commit, and could form part of a test that the data from the server comes out via that function. For that, I'd suggest you check out the API link I gave, as well as the fixtures in tests/conftest.py.

The updating part does look very simple, and in a data sense it should be, based on the event data, but note that a change in the data should affect how messages are rendered. That is, not just ones that are constructed based on the message content and alert words at the time, but they need updating too. We could ignore that initially, but we shouldn't forget it :)

As a final quick point, if you've not, I'd suggest installing gitlint as per the README. It should complain about your long commit title ;)
That section should also give you pointers on the general format.
If you find you need to say too much, or include too many files, that can indicate the commit can be split to be simpler.

zulipterminal/api_types.py

zulipterminal/model.py

neiljp

@Subhasish-Behera Let's still address the first commit first, but I just took a look at the second commit as it stands.

I can see how this mirrors web-app code, but we should test it ourselves too, to ensure that it works how we expect in this translated form.

If we can demonstrate this manually and with tests, this should be a good v1 implementation of this feature.

Beyond that, if we have more tests, and can compare what the web app highlights (and tests), we should be able to determine whether we can do the same using this current approach. This will affect if we need to think differently to cover more cases.

We may want to consider an alternative way of formatting this kind of text, and maybe search text too, if it crosses tag/style boundaries.

I expect the web app applies tags dynamically since it uses CSS to style the text, and I'm guessing the spans can overlap? That's complex with urwid - we process the HTML into urwid-styled Text instead.

If so, we might end up considering processing the content into text only, finding matches, then mapping those into adjacent pieces in the HTML or urwid-processed text.

However, let's focus on testing this v1 first :)
(and the v0, ie. the data handling, before that!)

neiljp · 2023-03-10T23:12:24Z

zulipterminal/ui_tools/messages.py

+            self.alerted_words,
+            "has_alert_word" in self.message["flags"]


These new parameters could be combined into one. If the message doesn't have alert words flagged, then the function doesn't need to know/process them.

neiljp · 2023-03-10T23:13:00Z

zulipterminal/ui_tools/messages.py

@@ -796,14 +806,58 @@ def update_message_author_status(self) -> bool:

        return author_is_present

+    def transform_content_alert_words(content: str, alerted_list: List[str]) -> Any:


If this is translated from a server implementation, it would be useful to add a comment to note that, and in the commit text too.

That could refer to the file as it is now, but also a commit ref, so that if code changes later then it is easier to track what has changed in the original code.

Having this function in a commit on it's own first, with detailed tests to cover various cases, would be good before we hook it into the processing in a subsequent commit.

Yeah, that would be great. will do it.
But it's tricky to translate the code changes of js to python directly in this case because of the difference in libraries. In js, the lodash library uses _.escapeRegExp() but in python we have to use re.compile for creating objects and then we call the built-in functions like object.search() etc

Subhasish-Behera · 2023-03-25T17:34:54Z

@zulipbot add "PR needs review"

neiljp

@Subhasish-Behera I focused on the first 3 commits, since I think that was the recent focus?

The current overall PR fails when I run it right now, at line 843 of messages.py, compiling a regex (multiple repeat at position 27).

Otherwise, improving your commit text would improve the speed of merging, and the first three seem close, if needing some adjustments.

zulipterminal/model.py

tests/model/test_model.py

neiljp · 2023-04-09T01:32:40Z

tests/model/test_model.py

+    ):
+        model._handle_alert_words_event(event)
+
+        assert model._alert_words != initial_alerted_words


Is this always true?

Yep, this is always true. But it is more context-dependent. Basically, currently, this test is designed to test the most minimal of changes in the active_conversation_info i.e change(start/stop) of a single user. The inital_alerted_words means just before that change and not at the start of the whole typing event series.
So it is true because of the values that are passed in mark.parametrize i.e it is an approach followed in the test.

If that is true, then it is good to add a suffix to the test function name to indicate that.

I think the name test__handle_alert_words_event_minimal_change would do the job.

Hmm. Do we get an event if the changes aren't made?

I'd suggest a double underscore from the function name to any 'meaning'.

I believe by changes you mean no typing in the writebox(of any client).
That would be a event with op=stop. So yes.

test__handle_alert_words_event_minimal_change sounds bit subjective,so i will be changing it to test__handle_alert_words_event__single_user. Though it's natural as a single event can't inform about more than 1 user in this case.

neiljp · 2023-04-12T18:36:47Z

zulipterminal/model.py

+    def get_alert_words(self) -> List[str]:
+        return self._alert_words.copy()
+


I didn't comment on this before, but this function placement in the file seems strange too - between two related functions?

will fix it asap

Note that strictly this could go into a separate commit with a test to eg. ensure that one can't modify the data externally, but this is fine.
(also to indicate it returns the expected data, as a minimal test)

tests/model/test_model.py

theViz343

@Subhasish-Behera Thanks for working on this 👍! This seems to work well. There are a few cases I found during testing where it seems to be a bit unpredictable.

\something. Prefixing with a backslash seems to not get picked up by the regex if the alert word is \something. If the alert word is just something, it gets picked up when it shouldn't (according to the behavior on the webapp).
wqdnasj\ Suffixing with a backslash when the alert word is also wqdnasj\ seems to crash ZT with the following error - re.error: missing ), unterminated subpattern at position 23.

Apart from this, I've left some suggestions for minor code changes in the comments.

zulipterminal/ui_tools/messages.py

theViz343 · 2023-06-24T17:55:27Z

zulipterminal/ui_tools/messages.py

+
+        clean: str
+
+        def replace_callback(match: Match[Union[str, str, str]]) -> str:


Regarding the typing of match, Union[str, str, str] results in just Union[str] so you could shorten it. Also, a match object's type is generally Union[Match[str], None] for the case that there is no match, which could be in our case because not every alert word is guaranteed to be present.

agreed upto Match[Union[str]] . I agree that the Match object can be None when the pattern is not found but we know from the flag that an alert word is present. Re the case that not every alert word is present, the code iterates over every alert word as configured by the User and then puts span if it identifies that particular word in the content(so in every time one alert word is used to create the pattern). so None doesn't seem to be a possible value in our case.

Also, if I add None in typing of match , the errors shown is

error: Item "None" of "Optional[Match[str]]" has no attribute "group" [union-attr] Item "None" of "Optional[Match[str]]" has no attribute "start"

zulipterminal/ui_tools/messages.py

theViz343 · 2023-06-24T18:31:07Z

zulipterminal/ui_tools/messages.py

+            before = match.group(1)
+            word = match.group(2)
+            after = match.group(3)


Minor: You could use match.group(1,2,3) here, which returns a tuple consisting of the three values.

The only use these 3 have is in here:

if in_tag: return f"{before}{word}{after}" return f"{before}<span class='alert-word'>{word}</span>{after}"

I could have directly used match. group(n) but didn't. I think separating them into 3 variables gives better readability
By following this suggestion of match.group(1,2,3), I need it to be stored in some var and break it using the index to use it in the above code which will not clarify any further of what is being done.
So why group it when it can be accessed individually without creating any tuple

If I follow the above correctly, would it help if you used tuple unpacking?

one, two, three = function_returning_tuple_of three_values()

Yep! Multiple assignments in one line can do the same thing.

before,word,after = match.group(1,2,3)

The alert words are stored in _alert_words and it's copy can be accessed using get_alert_words.

…alert_words" added to event actions and corresponding handler is _handle_alert_words_event which updates _alert_words. Added AlertWordEvent in api_types.py Tests are added for _handle_alert_words_event.

stores alert_words using get_alert_words.Two new parameters for tranform_content- alerted_list and alert_word_present. if the second argument is true then a call is made to transform_content_alert_words. There firstly all the alert_words are converted into a string consisting of 3 parts clean, befor_punctuation and after_puntuation.Secondly, these modified alerted words are compiled to be searched in the content and added with the span using replace_callback.

neiljp · 2023-07-04T21:44:13Z

@Subhasish-Behera Is this awaiting an update from you, or have you addressed the feedback?
Just checking based on my earlier feedback and that from Vishwesh :)
(If so, you can add a 👍 emoji or a comment to specific comments to indicate this)

zulipbot added the size: L [Automatic label added by zulipbot] label Mar 2, 2023

Subhasish-Behera force-pushed the alert8021 branch from cb798d3 to e2325e5 Compare March 2, 2023 19:25

neiljp reviewed Mar 3, 2023

View reviewed changes

zulipterminal/api_types.py Outdated Show resolved Hide resolved

zulipterminal/model.py Outdated Show resolved Hide resolved

zulipterminal/model.py Outdated Show resolved Hide resolved

zulipterminal/model.py Outdated Show resolved Hide resolved

neiljp added area: message rendering area: event handling How events from the server are responded to PR awaiting update PR has been reviewed & is awaiting update or response to reviewer feedback labels Mar 3, 2023

neiljp reviewed Mar 10, 2023

View reviewed changes

Subhasish-Behera force-pushed the alert8021 branch 7 times, most recently from d79dfa1 to 54c72d4 Compare March 21, 2023 19:35

zulipbot added size: XL [Automatic label added by zulipbot] and removed size: L [Automatic label added by zulipbot] labels Mar 21, 2023

Subhasish-Behera force-pushed the alert8021 branch from 54c72d4 to 3c3ba1a Compare March 21, 2023 19:51

zulipbot added the PR needs review PR requires feedback to proceed label Mar 25, 2023

neiljp removed the PR awaiting update PR has been reviewed & is awaiting update or response to reviewer feedback label Mar 29, 2023

neiljp reviewed Apr 9, 2023

View reviewed changes

neiljp added PR awaiting update PR has been reviewed & is awaiting update or response to reviewer feedback and removed PR needs review PR requires feedback to proceed labels Apr 9, 2023

Subhasish-Behera force-pushed the alert8021 branch from 3c3ba1a to 673d974 Compare April 9, 2023 19:43

neiljp reviewed Apr 12, 2023

View reviewed changes

tests/model/test_model.py Outdated Show resolved Hide resolved

Subhasish-Behera force-pushed the alert8021 branch 2 times, most recently from cc78d04 to bc625ef Compare April 13, 2023 17:36

Subhasish-Behera added the PR needs buddy review label Jun 13, 2023

theViz343 reviewed Jun 24, 2023

View reviewed changes

theViz343 removed the PR needs buddy review label Jun 24, 2023

Subhasish-Behera added 3 commits June 28, 2023 02:14

model: Add alert_words to initial_data_to_fetch.Tests adapted.

943182c

The alert words are stored in _alert_words and it's copy can be accessed using get_alert_words.

model/api_types: Update after alert_words event.Test adapted/added. "…

7930632

…alert_words" added to event actions and corresponding handler is _handle_alert_words_event which updates _alert_words. Added AlertWordEvent in api_types.py Tests are added for _handle_alert_words_event.

Subhasish-Behera force-pushed the alert8021 branch from bc625ef to e84e1da Compare June 29, 2023 08:30

commit 1

9928a09

neiljp added the missing feature: user A missing feature for all users, present in another Zulip client label Apr 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP]model/messages/api_types: alert_words mark up #1314

[WIP]model/messages/api_types: alert_words mark up #1314

Subhasish-Behera commented Mar 2, 2023 •

edited

Loading

neiljp left a comment

neiljp left a comment

neiljp Mar 10, 2023

neiljp Mar 10, 2023

Subhasish-Behera Mar 21, 2023

Subhasish-Behera commented Mar 25, 2023

neiljp left a comment

neiljp Apr 9, 2023

Subhasish-Behera Apr 9, 2023

neiljp Apr 12, 2023

Subhasish-Behera Apr 12, 2023

neiljp Apr 12, 2023

Subhasish-Behera Apr 12, 2023

Subhasish-Behera Apr 12, 2023

neiljp Apr 12, 2023

Subhasish-Behera Apr 12, 2023

neiljp Apr 12, 2023 •

edited

Loading

theViz343 left a comment

theViz343 Jun 24, 2023

Subhasish-Behera Jul 6, 2023 •

edited

Loading

theViz343 Jun 24, 2023

Subhasish-Behera Jul 6, 2023 •

edited

Loading

neiljp Jul 7, 2023

Subhasish-Behera Jul 7, 2023

neiljp commented Jul 4, 2023

		self.alerted_words,
		"has_alert_word" in self.message["flags"]

		@@ -796,14 +806,58 @@ def update_message_author_status(self) -> bool:

		return author_is_present

		def transform_content_alert_words(content: str, alerted_list: List[str]) -> Any:

		def get_alert_words(self) -> List[str]:
		return self._alert_words.copy()


		clean: str

		def replace_callback(match: Match[Union[str, str, str]]) -> str:

[WIP]model/messages/api_types: alert_words mark up #1314

Are you sure you want to change the base?

[WIP]model/messages/api_types: alert_words mark up #1314

Conversation

Subhasish-Behera commented Mar 2, 2023 • edited Loading

neiljp left a comment

Choose a reason for hiding this comment

neiljp left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Subhasish-Behera commented Mar 25, 2023

neiljp left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

neiljp Apr 12, 2023 • edited Loading

Choose a reason for hiding this comment

theViz343 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Subhasish-Behera Jul 6, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Subhasish-Behera Jul 6, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

neiljp commented Jul 4, 2023

Subhasish-Behera commented Mar 2, 2023 •

edited

Loading

neiljp Apr 12, 2023 •

edited

Loading

Subhasish-Behera Jul 6, 2023 •

edited

Loading

Subhasish-Behera Jul 6, 2023 •

edited

Loading