Parser: improve error message handling by chqrlie · Pull Request #290 · c2lang/c2compiler

chqrlie · 2025-06-11T12:24:17Z

use single on_error handler with error level and message arguments
remove Warning token type, never returned anyway.
improve #error and #warning message parsing consistency
make num_error messages non fatal
fix #warning behavior

bvdberg · 2025-06-13T11:53:25Z

It seems to make parsing slower... What was it that you were trying to improve?

chqrlie · 2025-06-13T20:47:18Z

It seems to make parsing slower...

This is surprising, the changes should not affect the inner loops, only the error cases.

What was it that you were trying to improve?

I was just trying to improve the consistency:

both warnings and errors should be produced synchronously,
#warning used to output an error with on_warning, itself using diag.error instead of diag.warn
num_error are errors, not warnings, but non fatal.
#error returned an error token without showing the error message.

bvdberg · 2025-06-14T05:38:11Z

parser/c2_tokenizer.c2

-    char[constants.MaxErrorMsgLen+1] msg;
-    string.memcpy(msg, start, len);
-    msg[len] = 0;
+    msg.size();  // ensure null terminator


debug leftover?

More anticipation of future feature :)

String buffers currently null terminate the array for each individual call. I suggest only adding the null terminator when the pointer or the size is actually needed. This would reduce the number of redundant writes and potentially improve the inlining of single character additions.

bvdberg · 2025-06-14T05:43:45Z

parser/c2_tokenizer.c2

-        return true;
+
+    // parse pptokens instead of raw text
+    string_buffer.Buf* msg = string_buffer.create_static(elemsof(t.error_msg), false, t.error_msg);


string_buffer.create_static does a malloc (surprised me as well), but old code seemed to be much simpler. If that used the t.error_msg buffer instead of a stack... The string buffer is now barely used (it does a memcopy internally). Also why parse to Eof instead of newline?

string_buffer.create_static does a malloc (surprised me as well),

The string_buffer.Buf object would not need to be allocated in this case, as well as many other ones, if we had separate string_buffer.Buf.init(Buf* b, const char *buf, u32 size, bool reallocate) function called by both string_buffer.create and string_buffer.create_static.

but old code seemed to be much simpler.

The #error and #warning features should use preprocessed tokens like #if for multiple reasons:

for consistency with the C parser.

to simplify language support in editors and code colorizers (simpler if we follow the already supported C semantics)

so comments are parsed as such on these lines and thus can be used by the tester, or the programmer.

If that used the t.error_msg buffer instead of a stack... The string buffer is now barely used (it does a memcpy internally).

It does a bunch of strcpy and strcat with truncation. We could indeed write and use strlcpy-like functions instead.

Also why parse toEof instead of newline?

Because t.lex_preproc(result) returns Eof on both the actual end of file and the first newline encountered. lex_preproc is a wrapper for lex_internal that sets the stop_at_eol for this very purpose. I shall add a comment to clarify this side-effect.

* use single `on_error` handler with error level and message arguments * remove `Warning` token type, never handled anyway. * improve `#error` and `#warning` message parsing consistency * make `num_error` messages non fatal * fix `#warning` behavior, add tests

bvdberg · 2025-12-22T07:49:45Z

merged

chqrlie force-pushed the errors branch from 56eeb9a to a0d857b Compare June 12, 2025 09:12

bvdberg reviewed Jun 14, 2025

View reviewed changes

chqrlie force-pushed the errors branch 8 times, most recently from c5a8f3e to 0a3ae91 Compare June 22, 2025 17:22

chqrlie force-pushed the errors branch 6 times, most recently from 51f67ec to aae549e Compare June 29, 2025 14:09

chqrlie force-pushed the errors branch 6 times, most recently from 0ce9852 to a9f4a6f Compare July 6, 2025 10:15

chqrlie force-pushed the errors branch 5 times, most recently from f8546b2 to b2f5bb8 Compare July 15, 2025 06:43

chqrlie force-pushed the errors branch 2 times, most recently from 1ce6666 to 019026e Compare October 20, 2025 20:57

chqrlie force-pushed the errors branch 3 times, most recently from ededd13 to 36b7171 Compare November 5, 2025 07:59

chqrlie force-pushed the errors branch 6 times, most recently from 19cfc58 to 7002a43 Compare November 10, 2025 11:09

chqrlie force-pushed the errors branch 7 times, most recently from dd292af to 82609b1 Compare November 23, 2025 21:49

chqrlie force-pushed the errors branch from 82609b1 to feab79f Compare November 25, 2025 07:47

chqrlie force-pushed the errors branch 3 times, most recently from 7579c95 to 30f2deb Compare December 8, 2025 14:23

chqrlie force-pushed the errors branch 3 times, most recently from acc0003 to f6d75e4 Compare December 16, 2025 14:21

chqrlie force-pushed the errors branch from f6d75e4 to ca57fc0 Compare December 18, 2025 22:29

bvdberg closed this Dec 22, 2025

chqrlie deleted the errors branch December 22, 2025 08:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parser: improve error message handling#290

Parser: improve error message handling#290
chqrlie wants to merge 1 commit intoc2lang:masterfrom
chqrlie:errors

chqrlie commented Jun 11, 2025

Uh oh!

bvdberg commented Jun 13, 2025

Uh oh!

chqrlie commented Jun 13, 2025

Uh oh!

bvdberg Jun 14, 2025

Uh oh!

chqrlie Jun 14, 2025

Uh oh!

bvdberg Jun 14, 2025

Uh oh!

chqrlie Jun 14, 2025

Uh oh!

bvdberg commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chqrlie commented Jun 11, 2025

Uh oh!

bvdberg commented Jun 13, 2025

Uh oh!

chqrlie commented Jun 13, 2025

Uh oh!

bvdberg Jun 14, 2025

Choose a reason for hiding this comment

Uh oh!

chqrlie Jun 14, 2025

Choose a reason for hiding this comment

Uh oh!

bvdberg Jun 14, 2025

Choose a reason for hiding this comment

Uh oh!

chqrlie Jun 14, 2025

Choose a reason for hiding this comment

Uh oh!

bvdberg commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants