How to unarchive 7z format #348

Closed
wendellpiez opened this issue Mar 18, 2025 · 7 comments · Fixed by #357

Comments

@wendellpiez

As documented, XML Calabash should be able to unarchive 7z format.

However, when I try it,

<p:unarchive include-filter="\.xml$" format="7z"/>

I get this error:

Fatal err:XD0036 at file:/some/pipeline.xpl:24:55: Value "7z" cannot be converted to type "xs:NCName".

Oops!

I'd be glad for a tip or workaround. When I leave format='7z' off the step, it complains that the file is not a zip.

@ndw
Contributor

ndw commented Mar 18, 2025

Off the top of my head and from the comfort of the couch, try 'sevenz'. But that’s a guess for the benefit of our time zone differences.

@wendellpiez
Author

I thought something like that might be worth a try - will do that, tomorrow (EDT) ... thanks @ndw

@djbpitt

djbpitt commented Mar 19, 2025

@wendellpiez Does the situation change if you specify XML 1.1 instead of 1.0?

@ndw
Contributor

ndw commented Mar 19, 2025

You want sevenZ, apparently. @djbpitt, the version of XML isn't going to matter here (and XML 1.1 doesn't allow QNames to begin with numbers, does it? I don't recall.)

Please leave this bug open to remind me to improve the documentation.

@djbpitt

djbpitt commented Mar 19, 2025

1.1 allows Unicode characters that 1.0 prohibits, but I don't remember whether it also broadens the range of name-start characters specifically.

@djbpitt

djbpitt commented Mar 19, 2025

Nope. https://www.w3.org/TR/2006/REC-xml11-20060816/#sec-xml11, production 4. Digits still are not name-start characters.

@wendellpiez
Author

This is to confirm that sevenZ works, thanks again. It takes more than a couple of seconds to run, but XML Calabash reports seeing 2612 XML files inside my 900MB 7z archive.

Leaving the ticket open to amend the docs.
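
For anyone landing here with the same error, this is a sketch of the step from the original report with only the format value changed to the one confirmed to work in this thread. It assumes the `p` prefix is bound to the XProc namespace and that the 7z archive arrives on the step's default input, as in the original pipeline; nothing else about that pipeline is known here.

```xml
<!-- format="7z" is rejected with err:XD0036 because the value must be an
     xs:NCName, and an NCName cannot start with a digit. -->
<!-- "sevenZ" is the value reported to work above. -->
<p:unarchive include-filter="\.xml$" format="sevenZ"/>
```

The value is case-sensitive as far as this thread shows: 'sevenz' was only a first guess, and 'sevenZ' is the spelling that was confirmed.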
