feat: add validation rules for reserved words, XML tags, and type checks #54

gotalab · 2025-12-22T16:48:32Z

Description & Motivation

Add missing validation rules to the skillport validate command per the Agent Skills specification. These rules ensure that SKILL.md frontmatter follows Anthropic's official requirements.

Changes

Reserved word check: name field cannot contain 'anthropic' or 'claude' (prevents confusion with official skills)
XML tag check: name and description fields cannot contain XML-like tags (<tag>, </tag>) for security
Type check: name and description must be strings (detects YAML type coercion like name: yes → True)
Added corresponding unit tests for all new validation rules
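
As an illustration of the XML tag check listed above, here is a minimal sketch. The pattern and the helper name `contains_xml_tags` are assumptions made for this example; the PR's actual `_contains_xml_tags` implementation is not shown in this thread and may differ.

```python
import re

# Hypothetical pattern: an opening or closing tag such as <tag>, </tag>,
# or <tag attr="x">. A "<" must be followed directly by a letter (or "/"
# plus a letter), so plain comparisons like "x < 5" are not flagged.
XML_TAG_PATTERN = re.compile(r"</?[A-Za-z][^<>]*>")


def contains_xml_tags(value: str) -> bool:
    """Return True if the string contains something that looks like an XML tag."""
    return XML_TAG_PATTERN.search(value) is not None


print(contains_xml_tags("Summarize <b>reports</b>"))  # True
print(contains_xml_tags("Use when x < 5 or y > 3"))   # False
```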

How to Test

uv run pytest tests/unit/test_validation_rules.py -v

Checklist

Lint passes
Tests pass
Docs updated (if behavior changed) - N/A (validation behavior, no doc changes needed)

Add missing validation rules per Agent Skills specification:
- Reserved word check: name cannot contain 'anthropic' or 'claude'
- XML tag check: name and description cannot contain XML-like tags
- Type check: name and description must be strings (detects YAML type coercion)

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <[email protected]>

Copilot

Pull request overview

This PR adds three new validation rules to the skillport validate command to enforce Anthropic's Agent Skills specification requirements: reserved word checking (prevents names containing "anthropic" or "claude"), XML tag detection (prevents XML-like tags in name and description fields), and type validation (ensures name and description are strings in frontmatter).

Key Changes:

Added reserved word validation to prevent skill names from containing "anthropic" or "claude"
Added XML tag detection using regex pattern to prevent injection attacks
Added type checking to detect YAML type coercion issues (e.g., name: yes becoming True)

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
src/skillport/modules/skills/internal/validation.py	Implements reserved word detection, XML tag pattern matching, and type validation for name/description fields
tests/unit/test_validation_rules.py	Adds unit tests for reserved words, XML tags, and type validation edge cases

Copilot · 2025-12-22T16:51:23Z

src/skillport/modules/skills/internal/validation.py

+    """Return the first reserved word found in name, or None."""
+    name_lower = name.lower()
+    for word in RESERVED_WORDS:
+        if word in name_lower:
+            return word


The reserved word detection uses substring matching, which can produce false positives. For example, a skill named "misanthropic-quotes" would be rejected because it contains "anthropic" as a substring, and "claudette-notes" would be rejected because it contains "claude". Consider using word-boundary matching instead, such as a regex with \b word boundaries or splitting on hyphens and checking each component.

Suggested change

-    """Return the first reserved word found in name, or None."""
-    name_lower = name.lower()
-    for word in RESERVED_WORDS:
-        if word in name_lower:
-            return word
+    """Return the first reserved word found in name, or None.
+
+    Reserved words are matched against hyphen-separated components of the name
+    (rather than arbitrary substrings) to avoid false positives such as
+    "misanthropic-quotes" containing "anthropic" as a substring.
+    """
+    normalized_lower = unicodedata.normalize("NFKC", name).lower()
+    for part in normalized_lower.split("-"):
+        if part and part in RESERVED_WORDS:
+            return part
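
To make the trade-off between the two strategies concrete, here is a small self-contained comparison. The function names are hypothetical and `RESERVED_WORDS` is assumed to hold the two words named in the PR description; this is a sketch, not the code that was merged.

```python
import unicodedata
from typing import Optional

# Assumed contents, taken from the PR description.
RESERVED_WORDS = ("anthropic", "claude")


def find_reserved_substring(name: str) -> Optional[str]:
    """Substring matching: flags any name that contains a reserved word anywhere."""
    lowered = name.lower()
    for word in RESERVED_WORDS:
        if word in lowered:
            return word
    return None


def find_reserved_component(name: str) -> Optional[str]:
    """Component matching: flags only whole hyphen-separated parts."""
    normalized = unicodedata.normalize("NFKC", name).lower()
    for part in normalized.split("-"):
        if part in RESERVED_WORDS:
            return part
    return None


for name in ("my-claude-tool", "claudette-notes", "myclaude"):
    print(name, find_reserved_substring(name), find_reserved_component(name))
# my-claude-tool claude claude
# claudette-notes claude None
# myclaude claude None
```

Component matching removes the "claudette-notes" false positive but also lets "myclaude" through. Whether that is acceptable depends on how strictly the specification's "cannot contain" wording is read.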

Copilot · 2025-12-22T16:51:24Z

tests/unit/test_validation_rules.py

+    def test_name_reserved_word_case_insensitive(self):
+        """Reserved word check should be case-insensitive."""
+        # Note: name validation already fails on uppercase, but reserved check is independent
+        issues = validate_skill_record(
+            {"name": "my-skill", "description": "desc", "path": "/skills/my-skill"}
+        )
+        reserved_issues = [i for i in issues if "reserved" in i.message.lower()]
+        assert len(reserved_issues) == 0


This test doesn't actually verify case-insensitive reserved word detection. The name "my-skill" contains no reserved words (neither "anthropic" nor "claude"), so this test would pass even if the case-insensitive logic was broken. To properly test case insensitivity, use a name like "my-Anthropic-skill" or "my-CLAUDE-tool" and verify it triggers the reserved word check.
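
A sketch of what a stronger test could look like follows. To keep it runnable on its own, it uses a stand-in helper named `find_reserved_word` (a hypothetical name); in the real test suite the assertions would presumably go through `validate_skill_record` and filter the returned issues, as the existing test does.

```python
RESERVED_WORDS = ("anthropic", "claude")  # assumed contents, per the PR description


def find_reserved_word(name):
    """Stand-in for the project's reserved-word helper (hypothetical name)."""
    lowered = name.lower()
    return next((word for word in RESERVED_WORDS if word in lowered), None)


def test_name_reserved_word_case_insensitive():
    # Mixed-case and upper-case names must still trigger the reserved-word check.
    assert find_reserved_word("my-Anthropic-skill") == "anthropic"
    assert find_reserved_word("my-CLAUDE-tool") == "claude"
    # Control case: a name with no reserved word yields no match.
    assert find_reserved_word("my-skill") is None


test_name_reserved_word_case_insensitive()
```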

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-12-22T16:53:27Z

src/skillport/modules/skills/internal/validation.py

+        if _contains_xml_tags(description):
+            issues.append(
+                ValidationIssue(
+                    severity="fatal",
+                    message="frontmatter.description: cannot contain XML tags",


Guard XML-tag check for non-string descriptions

If a SKILL.md frontmatter uses a non-string description (e.g., a YAML list such as description: [item1, item2]), validate_skill_record now appends the new "must be a string" fatal issue but then immediately calls _contains_xml_tags(description) on lines 217-221, which invokes re.search on a list and raises TypeError: expected string or bytes-like object. The validation command crashes before returning issues (in the skillport validate path, the loader passes the raw meta value as description), so the new type-check rule never surfaces and the CLI aborts on this input.

Prevent TypeError when name/description are non-string types (e.g., YAML list or boolean). The XML tag check and other string operations now only run when the value is actually a string.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <[email protected]>
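
The guard described in this commit can be sketched roughly as follows. The function name `check_description`, the regex, and the exact wording of the "must be a string" message are assumptions for illustration; only the "cannot contain XML tags" message appears verbatim in the diff above.

```python
import re

XML_TAG_PATTERN = re.compile(r"</?[A-Za-z][^<>]*>")  # hypothetical pattern


def check_description(description):
    """Return a list of issue messages for a description value."""
    issues = []
    if not isinstance(description, str):
        # Report the type problem and stop: the regex check below would
        # raise TypeError if it were given a list, bool, or None.
        issues.append("frontmatter.description: must be a string")
        return issues
    if XML_TAG_PATTERN.search(description):
        issues.append("frontmatter.description: cannot contain XML tags")
    return issues


print(check_description(["item1", "item2"]))
print(check_description("uses <tag> markup"))
print(check_description("plain text"))
```

Returning early keeps every string-only operation behind the type check, so a non-string value produces a validation issue instead of an exception.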

gotalab · 2025-12-22T17:00:08Z

@codex review

chatgpt-codex-connector · 2025-12-22T17:03:27Z

src/skillport/modules/skills/internal/validation.py

+    if name and isinstance(name, str):
        if len(name) > NAME_MAX_LENGTH:
            issues.append(
                ValidationIssue(


Type validation skipped when meta is absent

The new string-type enforcement is only applied in the meta branch, and the main validation block now runs only when isinstance(name, str). Calls without meta (e.g., validate_skill and skillport validate --id, which pass index records with no _meta and often no path) will treat non-string fields like name=True or description=['desc'] as valid because they bypass this guarded block, so the intended "must be a string" rule is never raised in those flows. This lets YAML type-coerced data pass validation whenever meta is unavailable.

Move type checks from meta-only block to always run on skill values. This ensures non-string name/description are detected even when validating via index (without meta).

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <[email protected]>

gotalab · 2025-12-22T17:14:11Z

@codex review

chatgpt-codex-connector · 2025-12-22T17:16:07Z

src/skillport/modules/skills/internal/validation.py

+    # Required fields (value checks) - only check if type is valid
+    if not name and name_is_str:
+        # Empty string
        issues.append(
            ValidationIssue(severity="fatal", message="frontmatter.name: missing", field="name")


Block falsy non-string names/descriptions from passing validation

In validation.py, the required-field checks now run only when name_is_str/desc_is_str is true (if not name and name_is_str etc.), while the new type checks above are only executed for truthy values. As a result, frontmatter like name: null, name: [], or description: false (falsy non-strings) bypass both the type check and the missing check and return no fatal issues, letting invalid skills pass validation—previous logic would have flagged them as missing. This affects any validation run on SKILL.md data containing YAML null/empty collection values.

Restructure type/required checks using if/elif to ensure falsy non-string values like null, empty list, or False are caught as type errors instead of silently passing validation.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <[email protected]>
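
The if/elif ordering described in this commit can be illustrated with a short sketch. The helper name `check_required_string` and the "must be a string" message text are assumptions; the "missing" message follows the one shown in the diff above. Testing the type first means falsy non-strings can never fall through to the emptiness branch.

```python
def check_required_string(field, value):
    """Return issue messages for a required string field."""
    if not isinstance(value, str):
        # Catches None, [], False, True, lists, and so on, whether truthy or falsy.
        return [f"frontmatter.{field}: must be a string"]
    elif not value:
        # Only an actual empty string reaches this branch.
        return [f"frontmatter.{field}: missing"]
    return []


for value in (None, [], False, "", "my-skill"):
    print(repr(value), check_required_string("name", value))
```

In this sketch a null value is reported as a type error, matching the commit message; the real code may treat an entirely absent key differently.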

gotalab · 2025-12-22T17:19:54Z

@codex review

chatgpt-codex-connector · 2025-12-22T17:23:17Z

Codex Review: Didn't find any major issues. 🚀

Copilot AI review requested due to automatic review settings December 22, 2025 16:48

Copilot started reviewing on behalf of gotalab December 22, 2025 16:48

Copilot AI reviewed Dec 22, 2025

chatgpt-codex-connector bot reviewed Dec 22, 2025

gotalab merged commit 5e41fb2 into main Dec 22, 2025
3 checks passed

gotalab deleted the fix/add-missing-validation-rules branch December 22, 2025 17:29

github-actions bot mentioned this pull request Dec 22, 2025

chore(main): release 1.0.0 #55

Merged

2 participants