GSoC 2025 Clang-Doc blog #79

evelez7 · 2025-08-30T01:49:23Z

GSoC 2025 Improving Core Clang-Doc Functionality blog post

evelez7 · 2025-08-30T01:49:52Z

ilovepi

A few meta comments. First, you're underselling how much work you did in many ways. You completely rewrote a large portion of clang-doc, and solidified a new architecture that will allow us to delete a lot of code, with no performance loss and much better testing/maintainability.

second, because of your refactor you were able to implement several features really quickly. I don't think you hardly pointed those out at all.

Next, you wrote a lot of tests.

Lastly, we're almost ready to turn on the mustache backend on by default once we iron out 1-2 small issues w/ names. That's all because of you. Don't be modest, this is the time to brag about all the good things you did, because if you don't tell people, no one will know.

content/posts/2025-gsoc-clang-doc.md

ilovepi · 2025-08-30T02:11:12Z

I'd also consider adding a diagram or 2 if you have time/capacity.

ilovepi · 2025-08-30T02:11:44Z

Oh, and mention that you're presenting at LLVM dev meeting! can't leave that out!

ilovepi · 2025-09-02T15:46:41Z

content/posts/2025-gsoc-clang-doc.md

+<div style="margin:0 auto;">
+  <img src="/img/gsoc-2025-clang-doc-architecture.png"><br/>
+</div>


Not sure what time constraints are for the post, but I'd make a contrasting diagram for the old architecture that shows there were several backends, and we're moving to an architecture where you basically have one backend and then the templating just adjusts the format. from the clang-doc perspective you'll always just go to JSON and then generate the output format based on the template. What those templates are are is kind of irrelevant, since you'll just pick a template and some path names based on the output type (eventually). I didn't really pick that up in the text here or below.

There are no constraints. Take as much time as needed to polish the post :)

ilovepi · 2025-09-02T15:48:23Z

content/posts/2025-gsoc-clang-doc.md

+1. Visit source declarations via Clang's ASTVisitor.
+2. Serialize relevant source information into an Info (Clang-Doc's main data entity).
+3. Once all source declarations are serialized, write them into bitcode, reduce, and read the reduced Infos.
+4. Serialize Infos into the desired format.


Maybe provide more details here? the reader should have some intuition that these backends had a cumbersome job, and repeated lots of code to handle the same structures (and sometimes didn't!).

ilovepi · 2025-09-02T15:54:32Z

content/posts/2025-gsoc-clang-doc.md

+It seems fairly straightforward, but the architecture had a critical flaw.
+If a new C++ construct needed to be supported, it would be visited and serialized, but then support would have to be added to each backend individually.
+If you wanted to serialize something in YAML, you'd have to implement the Markdown logic separately.
+This imposed a very high maintenance cost for extending basic functionality, even if you just wanted to add something simple.
+It also easily led to generator disparity; a construct might be serialized in YAML, but not in Markdown.
+Testing was also in an awkward spot because it was unclear what format would be used to verify if the documentation output was acceptable.


You're sort of talking around the issue: yes you'd need to implement a thing in multiple places, but you're not highlighting that all the basic logic for handling any kind of Info was basically reimplemented for each backend, with almost no sharing (and some of them weren't implemented the same way). That's significantly worse than just handling things per backend. LLVM contributors won't have a feel for why this is bad unless you spell it out in a way that they'll see this isn't just lkie having an x86 backend and an Aarch64 backend w/ a bit of overlap. Its that, plus they each have their own implementations of all the shared logic in CodeGen. In fact, you may want to spell it out that way, since its an analogy that contributors will readily understand.

ilovepi · 2025-09-02T15:56:48Z

content/posts/2025-gsoc-clang-doc.md

+It also easily led to generator disparity; a construct might be serialized in YAML, but not in Markdown.
+Testing was also in an awkward spot because it was unclear what format would be used to verify if the documentation output was acceptable.
+
+## The Good: Mustache


I expected to see sections with The Good:, The Bad:, The Ugly: (or at least the first two). But

content/posts/2025-gsoc-clang-doc.md

ilovepi · 2025-09-12T00:50:40Z

@evelez7 any updates? Several other posts have gone up so far.

evelez7 · 2025-09-12T00:54:59Z

@evelez7 any updates? Several other posts have gone up so far.

Yeah I have a large change for organization and content that addresses the last round of feedback. Gives more context and better flow I think. Pushing it tonight, sorry about the delay.

evelez7 · 2025-09-12T20:08:49Z

I added a new diagram to show the problems with the old architecture, expanded detail in the bitcode refactor section, expanded details in the "bad" architecture section with a code example of the repeated yet dissimilar logic, and rearranged some of the sections a bit.

content/posts/2025-gsoc-clang-doc.md

ilovepi

I think this is a lot better. I'd still love to see some more details, but lets save something for the dev meeting :)

@petrhosek do you have any further thoughts?

content/posts/2025-gsoc-clang-doc.md

ilovepi

I think this is probably in a decent state now. I'd encourage you give it another read through before landing to make sure you don't want to change anything.

asl

This looks a decent post to me!

content/posts/2025-gsoc-clang-doc.md

GSoC 2025 Clang-Doc blog

a6b34ad

ilovepi reviewed Aug 30, 2025

View reviewed changes

content/posts/2025-gsoc-clang-doc.md Outdated Show resolved Hide resolved

content/posts/2025-gsoc-clang-doc.md Outdated Show resolved Hide resolved

content/posts/2025-gsoc-clang-doc.md Outdated Show resolved Hide resolved

content/posts/2025-gsoc-clang-doc.md Outdated Show resolved Hide resolved

address blog feedback, add diagrams

1e19e87

evelez7 requested a review from ilovepi September 1, 2025 08:56

mention dev meeting, spellcheck

034b4df

ilovepi reviewed Sep 2, 2025

View reviewed changes

address feedback, add new diagram

74861ce

evelez7 requested a review from ilovepi September 12, 2025 20:07

ilovepi reviewed Sep 12, 2025

View reviewed changes