Skip to content

Adds support for thomas-kocht.de#1894

Open
zdenek-stursa wants to merge 1 commit into
hhursev:mainfrom
zdenek-stursa:site/thomas-kocht-de
Open

Adds support for thomas-kocht.de#1894
zdenek-stursa wants to merge 1 commit into
hhursev:mainfrom
zdenek-stursa:site/thomas-kocht-de

Conversation

@zdenek-stursa
Copy link
Copy Markdown
Contributor

Add scraper for thomas-kocht.de

Adds support for scraping recipes from thomas-kocht.de.

Closes #1477

Site structure

Thomas Kocht uses Avada/Fusion page builder (WordPress). Recipe schema.org only includes name, description, image, and aggregate rating — ingredients, instructions, time and yields require HTML parsing:

  • Title / Description / Image / Ratings: from JSON-LD schema.org/Recipe
  • Ingredients: <h3>Zutaten …> heading followed by a <ul><li> list
  • Instructions: <div class="fusion-title"> divs containing <h2>Schritt N</h2>, each followed by a <div class="fusion-text"> with <h4> sub-title and <p> paragraphs
  • Total time: parsed from a <p>Vorbereitungszeit: X Minuten | Kochzeit: Y Minuten | …</p> paragraph
  • Yields: parsed from the same info paragraph (Ergibt N Stück)
  • Author: hardcoded "Thomas Kocht"

Testing

Tested against the Pfannkuchen recipe from the issue.

All tests pass.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

I'd like to request support for Thomas Kocht

1 participant