
process archive in memory #75

Merged
merged 1 commit into main from process-archive-in-memory on Mar 12, 2025

Conversation

@pablochacin (Contributor) commented Mar 11, 2025

Fixes multiple issues when extracting the archive and processing it with k6pack, including issues with Windows paths.
It also ensures that only the content of the archive is processed.

Closes #70

Signed-off-by: Pablo Chacin <[email protected]>
@pablochacin pablochacin marked this pull request as ready for review March 11, 2025 17:30
@pablochacin pablochacin requested a review from a team as a code owner March 11, 2025 17:30
@pablochacin pablochacin requested review from szkiba and removed request for a team March 11, 2025 17:30
@szkiba (Collaborator) left a comment


LGTM

@olegbespalov (Contributor) left a comment


LGTM!

Left some non-blocking suggestions.

// analizeMetadata extracts the dependencies from the metadata.json file
func analizeMetadata(content []byte) (analyzer, error) {
	metadata := archiveMetadata{}
	if err := json.Unmarshal(content, &metadata); err != nil {
Contributor


If I'm not mistaken, we could use json.NewDecoder; then there is no need to pass bytes, and we could use an io.Reader directly here.

		continue
	}
	content := &bytes.Buffer{}
	if _, err := io.CopyN(content, reader, maxFileSize); err != nil && !errors.Is(err, io.EOF) {
Contributor


It's a corner case, but still, I'm wondering whether that's right for files that are bigger than 10 MB 🤔 Should we maybe write a warning log in that case?

If the other suggestion about analizeMetadata will be accepted, we could probably move these lines closer to the scriptAnalyzer

And perhaps not for this PR, but it's worth investigating whether, instead of copying content.Bytes(), we can just pass readers around; in other words, make it possible for scriptAnalyzer to also work with buffers.

Contributor Author


> It's a corner case, but still, I'm wondering whether that's right for files that are bigger than 10 MB 🤔 Should we maybe write a warning log in that case?

Not sure what you refer to in "isn't that right". Could you please elaborate?

Contributor


So my understanding is that this line will copy at most 10 MB, silently ignoring the rest, and I'm questioning whether that's the right way to do it. Yes, the risk that we're processing bigger files where the module usage is located after 10 MB of data is low, but still.

Contributor Author


> And perhaps not for this PR, but it's worth investigating whether, instead of copying content.Bytes(), we can just pass readers around; in other words, make it possible for scriptAnalyzer to also work with buffers.

I definitely will do this. I started, but realized it requires significant changes in other parts of the code, so I prefer to make this change in a follow-up PR.

@olegbespalov (Contributor) commented Mar 12, 2025


Sure, as I said, both my comments are non-blocking; feel free to merge this as it is and continue in follow-up PRs 👍

@pablochacin pablochacin merged commit 2050984 into main Mar 12, 2025
6 checks passed
@pablochacin pablochacin deleted the process-archive-in-memory branch March 12, 2025 11:35
Successfully merging this pull request may close these issues.

Reduce file system usage
3 participants