Conversation
```python
self.elapsed = elapsed


def store_profile_events(self, packet):
    data = QueryResult([packet]).get_result()
```
Currently we have 'static' attributes that store statistics: https://clickhouse-driver.readthedocs.io/en/latest/features.html#query-execution-statistics: client.last_query.progress.total_rows, client.last_query.progress.total_bytes, etc.
I'd prefer to store statistics in the same way if possible: client.last_query.stats.select_query, client.last_query.stats.selected_rows.
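One way to keep that attribute-style access while storing the raw ProfileEvents in a dict could be a thin wrapper. This is a hypothetical sketch: the `Stats` class and the snake_case names are not part of the driver.

```python
class Stats:
    """Hypothetical attribute-style view over a stats dict,
    mirroring client.last_query.progress.* access."""

    def __init__(self, data):
        self._data = dict(data or {})

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails,
        # so _data itself is never shadowed.
        try:
            return self._data[name]
        except KeyError:
            raise AttributeError(name)


stats = Stats({"selected_rows": 100, "selected_bytes": 4096})
print(stats.selected_rows)  # 100
```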
I use the content of this data for analyzing queries, and it can be very different. The metrics vary with query type and even with the queried table engine. I guess server version may have an effect too (I only use v23 at the moment). So, in the end, this is not a stable list of metrics, and the number of options is too big (possibly more than 100).
I found that ~20 of them are the most common across queries and the most interesting. I use a pydantic model to get them:
```python
class ClickhouseStats(pydantic.BaseModel):
    elapsed: int = pydantic.Field(alias="elapsed")
    is_insert: int | None = pydantic.Field(alias="InsertQuery", default=None)
    read_bytes: int | None = pydantic.Field(alias="ReadCompressedBytes", default=None)
    write_bytes: int | None = pydantic.Field(alias="WriteBufferFromFileDescriptorWriteBytes", default=None)
    network_recv_bytes: int | None = pydantic.Field(alias="NetworkReceiveBytes", default=None)
    network_recv_time: int | None = pydantic.Field(alias="NetworkReceiveElapsedMicroseconds", default=None)
    network_send_bytes: int | None = pydantic.Field(alias="NetworkSendBytes", default=None)
    network_send_time: int | None = pydantic.Field(alias="NetworkSendElapsedMicroseconds", default=None)
    memory_usage: int | None = pydantic.Field(alias="MemoryTrackerUsage", default=None)
    memory_peak: int | None = pydantic.Field(alias="MemoryTrackerPeakUsage", default=None)
    file_open: int | None = pydantic.Field(alias="FileOpen", default=None)
    function_execute: int | None = pydantic.Field(alias="FunctionExecute", default=None)
    write_time: int | None = pydantic.Field(alias="DiskWriteElapsedMicroseconds", default=None)
    insert_rows: int | None = pydantic.Field(alias="InsertedRows", default=None)
    insert_bytes: int | None = pydantic.Field(alias="InsertedBytes", default=None)
    select_rows: int | None = pydantic.Field(alias="SelectedRows", default=None)
    select_bytes: int | None = pydantic.Field(alias="SelectedBytes", default=None)
    insert_parts: int | None = pydantic.Field(alias="InsertedCompactParts", default=None)
    real_time: int | None = pydantic.Field(alias="RealTimeMicroseconds", default=None)
    system_time: int | None = pydantic.Field(alias="SystemTimeMicroseconds", default=None)

    def __init__(self, result: CursorResult | None = None, query_info: QueryInfo | None = None):
        if query_info is None:
            query_info = result.context.query_info
        super().__init__(elapsed=int(query_info.elapsed * 1000), **(query_info.stats or {}))  # TODO: 1000?
```
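For comparison, the same alias lookup could be done without pydantic with a plain mapping. This is just a sketch, and the alias table below only lists a few of the names from the model above:

```python
# Friendly name -> ProfileEvents counter name (illustrative subset).
ALIASES = {
    "select_rows": "SelectedRows",
    "select_bytes": "SelectedBytes",
    "memory_peak": "MemoryTrackerPeakUsage",
}


def extract_stats(raw):
    """Pick the interesting counters out of a raw ProfileEvents dict.

    Missing counters are returned as None, like the pydantic defaults.
    """
    return {name: raw.get(alias) for name, alias in ALIASES.items()}
```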
I can add them here (without pydantic), but wouldn't that be too much?
Yep, it's too much. A dict will be fine.
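With the dict approach, the driver side could aggregate ProfileEvents packets roughly like this. It is a sketch assuming each event row carries a counter name and a value; names repeat across threads, so values are summed:

```python
def aggregate_profile_events(rows, stats=None):
    """Sum ProfileEvents counters into a plain dict.

    rows: iterable of (name, value) pairs extracted from a packet.
    stats: optional dict to accumulate into across packets.
    """
    stats = {} if stats is None else stats
    for name, value in rows:
        stats[name] = stats.get(name, 0) + value
    return stats
```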
Adds support for profile events in the native protocol. The events carry many different parameters, like network timings, locks, and memory usage, that can be very helpful for debugging and monitoring queries.
Not sure whether the docs need updating.
Checklist:
- Run flake8 and fix issues.
- Run pytest: no tests failed.

See https://clickhouse-driver.readthedocs.io/en/latest/development.html.