Skip to content

Предложения по репозиторию#22

Merged
OBenner merged 5 commits intomasterfrom
cursor/-bc-23857164-4d51-42f8-99cf-e634c4192ffe-65ed
Jan 13, 2026
Merged

Предложения по репозиторию#22
OBenner merged 5 commits intomasterfrom
cursor/-bc-23857164-4d51-42f8-99cf-e634c4192ffe-65ed

Conversation

@OBenner
Copy link
Owner

@OBenner OBenner commented Jan 13, 2026

Fixes TODO links in README.md, corrects content encoding, and adds CI checks with helper scripts to ensure repository quality and maintainability.

This PR addresses immediate content quality issues (placeholder links, mojibake) and introduces automated checks via GitHub Actions and Python scripts (repo_checks.py, generate_full.py) to prevent future regressions, improve content consistency, and validate internal links. A CONTRIBUTING.md file is also added to guide future contributions.


Open in Cursor Open in Web


Note

Expands coverage and hardens repo quality.

  • Adds new topics with Q&A: content/dbt.md, iceberg.md, hudi.md, cdc.md, data-modeling.md, data-quality.md, observability.md, data-governance.md, cost-optimization.md, python.md, system-design.md; updates content/full.md to index them
  • Introduces CI workflow .github/workflows/repo-checks.yml to run Python 3.12 and execute scripts/repo_checks.py
  • Adds scripts/repo_checks.py (checks UTF-8/"mojibake", placeholder links, and validates internal anchors in content/full.md) and scripts/generate_full.py (builds content/full.md from headings)
  • Updates README.md with corrected links, added sections (Iceberg/Hudi/dbt/Theory topics), and useful resource links; adds CONTRIBUTING.md with local checks/guidelines
  • Fixes encoding/typos and anchors (e.g., pseudo-column) in content/sql.md and corresponding entries in content/full.md

Written by Cursor Bugbot for commit ffa29c0. This will update automatically on new commits. Configure here.

- Replace README TODO links with real references and add contributing link
- Fix mojibake in SQL pseudo-column question and synced full index link
- Add scripts and GitHub Actions workflow to prevent regressions
Default repo checks now verify link targets exist without enforcing anchor matching.
Set STRICT_ANCHORS=1 to enable strict heading anchor validation.
@cursor
Copy link

cursor bot commented Jan 13, 2026

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
Learn more about Cursor Agents

Introduce new topic pages (English Q&A) matching existing repo structure and add them to README and the full index.
Introduce four new practice-focused topics with English Q&A in the existing repo format and link them from README and the full index.
Add four new topic pages (English Q&A) in the existing repo format and link them from README and the full index.
@OBenner OBenner marked this pull request as ready for review January 13, 2026 16:50
@OBenner OBenner merged commit 2a27b4e into master Jan 13, 2026
3 checks passed
Copy link

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR is being reviewed by Cursor Bugbot

Details

You are on the Bugbot Free tier. On this plan, Bugbot will review limited PRs each billing cycle.

To receive Bugbot reviews on all of your PRs, visit the Cursor dashboard to activate Pro and start your 14-day free trial.



def check_forbidden_text(problems: list[Problem]) -> None:
paths = [README, FULL, *sorted(CONTENT_DIR.glob("*.md"))]
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Duplicate file checking causes redundant problem reports

Low Severity

The check_forbidden_text function builds a paths list that includes FULL explicitly and also includes it again via CONTENT_DIR.glob("*.md"). Since FULL is defined as CONTENT_DIR / "full.md", the glob matches the same file, causing full.md to appear twice in the list. This results in full.md being checked twice and any issues in that file being reported as duplicates, which wastes the 200-problem output limit and could hide distinct issues in other files.

Fix in Cursor Fix in Web

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants