Copilot AI commented Nov 20, 2025

Description

Tools for querying Azure Log Analytics to analyze HPCC component resource usage and correlate with infrastructure costs. Addresses the need to understand which HPCC components keep Azure VMs active and contribute to operational expenses.

Implementation

azure_log_analytics_fetch.py - KQL query tool (320 lines)

  • Queries KubeNodeInventory and KubePodInventory tables
  • Time range and namespace filtering
  • CSV output with metadata header documenting query parameters
  • Supports Azure CLI and service principal authentication
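The query construction might look something like the following sketch. This is an illustrative reconstruction, not the actual code from azure_log_analytics_fetch.py; the real column set, table joins, and flag handling may differ.

```python
# Hypothetical sketch of KQL query assembly for the fetch tool.
# Column names (TimeGenerated, Name, Namespace, Computer, PodStatus)
# are standard KubePodInventory fields, but the tool's actual
# projection may differ.
def build_pod_query(start: str, end: str, namespaces=None) -> str:
    """Build a KQL query over KubePodInventory for a time range,
    with an optional namespace filter (all namespaces when omitted)."""
    query = (
        "KubePodInventory\n"
        f"| where TimeGenerated between (datetime({start}) .. datetime({end}))\n"
    )
    if namespaces:
        quoted = ", ".join(f'"{ns}"' for ns in namespaces)
        query += f"| where Namespace in ({quoted})\n"
    query += "| project TimeGenerated, Name, Namespace, Computer, PodStatus"
    return query
```

A query built this way can then be submitted via the Azure Monitor query SDK or the Azure CLI, using whichever credential (CLI login or service principal) is available.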

azure_log_analytics_analyze.py - Component categorization (501 lines)

  • Pattern-based pod categorization into HPCC components: dali, thor, roxie, esp, eclagent, eclccserver, eclscheduler, dfuserver, sasha, dafilesrv
  • Non-HPCC system pod categorization: kubernetes-system, azure-system, monitoring, logging, ingress
  • Time-series output suitable for Gantt visualization
  • Summary reports with component distribution and node usage statistics
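The pattern-based categorization could be sketched as below. This is a hedged illustration only; the real pattern list in azure_log_analytics_analyze.py is derived from the helm/hpcc chart and will differ in detail.

```python
import re

# Hypothetical pattern tables. Longer, more specific HPCC names come
# first so that e.g. "eclccserver" is not shadowed by a shorter match.
HPCC_PATTERNS = [
    ("eclccserver", "eclccserver"),
    ("eclscheduler", "eclscheduler"),
    ("eclagent", "eclagent"),
    ("dfuserver", "dfuserver"),
    ("dafilesrv", "dafilesrv"),
    ("sasha", "sasha"),
    ("dali", "dali"),
    ("thor", "thor"),
    ("roxie", "roxie"),
    ("esp", "esp"),
]
SYSTEM_PATTERNS = [
    (r"^(kube-|coredns)", "kubernetes-system"),
    (r"^(azure-|csi-)", "azure-system"),
    (r"(prometheus|grafana|metrics)", "monitoring"),
    (r"(fluentd|fluent-bit|filebeat)", "logging"),
    (r"(ingress|nginx)", "ingress"),
]

def categorize_pod(name: str) -> str:
    """Map a pod name to an HPCC component or a system category,
    falling back to "unknown" when nothing matches."""
    for pattern, component in HPCC_PATTERNS:
        if pattern in name:
            return component
    for pattern, category in SYSTEM_PATTERNS:
        if re.search(pattern, name):
            return category
    return "unknown"
```

Checking HPCC patterns before system patterns means a pod like `thor-thoragent-0` lands in its HPCC component rather than a generic bucket.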

test_categorization.py - Validation suite (164 lines)

  • 29 test cases covering HPCC and system component patterns
  • 100% pass rate

Usage

# Fetch inventory data
./azure_log_analytics_fetch.py \
  --subscription-id <sub-id> \
  --workspace-id <workspace-id> \
  --aks-name <cluster> \
  --start "2024-11-01T00:00:00Z" \
  --end "2024-11-02T00:00:00Z" \
  --output results.csv

# Analyze and categorize
./azure_log_analytics_analyze.py \
  --input results.csv \
  --output analysis.csv \
  --summary summary.txt
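The metadata header at the top of results.csv might look something like this (illustrative only; the actual field names and layout written by the fetch tool may differ):

```
# azure_log_analytics_fetch.py
# Subscription: <sub-id>
# Workspace: <workspace-id>
# AKS cluster: <cluster>
# Time range: 2024-11-01T00:00:00Z .. 2024-11-02T00:00:00Z
# Namespaces: all
TimeGenerated,Name,Namespace,Computer,PodStatus
...
```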

Output enables correlation of component lifecycles with VM costs, resource optimization analysis, and capacity planning based on historical usage patterns.

Type of change:

  • This change is a new feature (non-breaking change which adds functionality).

Checklist:

  • My code follows the code style of this project.
    • My code does not create any new warnings from compiler, build system, or lint.
  • The commit message is properly formatted and free of typos.
    • The commit message title makes sense in a changelog, by itself.
    • The commit is signed.
  • My change requires a change to the documentation.
    • I have updated the documentation accordingly, or...
    • I have created a JIRA ticket to update the documentation.
    • Any new interfaces or exported functions are appropriately commented.
  • I have read the CONTRIBUTORS document.
  • The change has been fully tested:
    • I have added tests to cover my changes.
    • All new and existing tests passed.
    • I have checked that this change does not introduce memory leaks.
    • I have used Valgrind or similar tools to check for potential issues.
  • I have given due consideration to all of the following potential concerns:
    • Scalability
    • Performance
    • Security
    • Thread-safety
    • Cloud-compatibility
    • Premature optimization
    • Existing deployed queries will not be broken
    • This change fixes the problem, not just the symptom
    • The target branch of this pull request is appropriate for such a change.
  • There are no similar instances of the same problem that should be addressed
    • I have addressed them here
    • I have raised JIRA issues to address them separately
  • This is a user interface / front-end modification
    • I have tested my changes in multiple modern browsers
    • The component(s) render as expected

Testing:

Comprehensive test suite validates component categorization logic:

  • 17 HPCC component patterns (dali, thor, roxie, esp, eclagent, eclccserver, eclscheduler, dfuserver, sasha, dafilesrv)
  • 10 system component patterns (kubernetes-system, azure-system, monitoring, logging, ingress)
  • 2 unknown component fallback cases

Integration tested with sample AKS inventory data covering realistic deployment scenarios. Python syntax was validated, and no security vulnerabilities were introduced.

Original prompt

Write a Python program that accesses Azure Log Analytics via KQL for a given date+time range, for a given subscription and AKS name, and pulls information about nodes and pods that were running from KubeNodeInventory and KubePodInventory, and outputs it in a clean CSV format. The output should contain details about the command that was run in a comment block header.
By default it should process all namespaces, but an optional filter should allow selected namespaces to be fetched.
This tool should live in tools/azure.

Write a 2nd Python tool that complements this one, designed to analyze the CSV output from the 1st. This analyze tool should categorize the pods into HPCC components based on their name (from knowledge gleaned from the helm/hpcc chart). It should also categorize non-HPCC pods.

The goal of these analytic tools is to create a time-series of categorized components and the resources (nodes) they were using, such that [separately] this could be visualized as a Gantt chart or similar, giving insight into which components were responsible for keeping the VMs up (and hence costing money).

If you have any questions, ask before proceeding.


Copilot AI self-assigned this Nov 20, 2025
Copilot AI changed the title from "[WIP] Add Azure Log Analytics tools for KQL data extraction" to "Add Azure Log Analytics tools for HPCC component analysis and cost tracking" Nov 20, 2025
Copilot AI requested a review from jakesmith November 20, 2025 02:01
Copilot finished work on behalf of jakesmith November 20, 2025 02:01