Implement sparse + sparse matrix addition for CSC, CSR, and COO formats #27

Copilot · 2025-11-16T11:39:33Z

Implementation Complete: Sparse Matrix Addition

Successfully implemented the +(A::AbstractDeviceSparseMatrix, B::AbstractDeviceSparseMatrix) operation for CSC-CSC, CSR-CSR, and COO-COO formats.

Completed Tasks:

Implementation Details:

CSC/CSR Formats: Two-pass merge algorithm

Count non-zeros per column/row via parallel kernel (merging sorted indices)
Build pointer array using cumulative sum (GPU-friendly, no scalar indexing)
Merge entries from both matrices in parallel, combining values at same positions

COO Format: Sort + compact algorithm

Concatenate coordinate arrays from both matrices
Sort by (row, col) using lexicographic keys
Mark unique entries and compute write indices
Compact by merging duplicate entries (summing values)

GPU Compatibility: Avoided scalar indexing using broadcasting operations (e.g., colptr_C[1:1] .= one(Ti))

Test Results:

✅ 1478 tests passed (Base Array + JLArray backends)
✅ All sparse + sparse addition tests pass
✅ Works correctly with overlapping and non-overlapping entries
✅ Proper dimension checking and error handling
✅ GPU-compatible (no scalar indexing issues)
✅ COO format now properly merges duplicates

Original prompt

This section details on the original issue you should resolve

<issue_title>Implement sum between two AbstractDeviceSparseMatrix</issue_title>
<issue_description>Implement the method +(A::AbstractDeviceSparseMatrix, B::AbstractDeviceSparseMatrix) efficiently. Perhaps just between matrices of the same format, like CSC-CSC, CSR-CSR and COO-COO.</issue_description>

<agent_instructions>Avoid any kind of conversion to CPU. It should be efficiently oriented for GPU kernels, as the other implementations. Add related tests, following the same style of the others, and add benchmarks as well.</agent_instructions>

Comments on the Issue (you are @copilot in this section)

Fixes Implement sum between two AbstractDeviceSparseMatrix #16

💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

Co-authored-by: albertomercurio <[email protected]>

albertomercurio

@copilot Overall it seems a good job. I'm wondering if we can also implement the sum among transposed/adjointed matrices, like +(A::DeviceSparseMatrixCSC, B::Transpose{T, <:DeviceSparseMatrixCSC}), and in case how to do it. Should one convert the transposed matrix to CSR? If so, can the sum be efficiently computed between a CSC and CSR?

If not, are there other direct ways to do it, like the methods done for the mul! or dot functions?

test/Project.toml

albertomercurio · 2025-11-16T15:28:53Z

src/matrix_coo/matrix_coo.jl

+    nnz_A = nnz(A)
+    nnz_B = nnz(B)
+    nnz_total = nnz_A + nnz_B
+
+    # Allocate result arrays
+    rowind_C = similar(getrowind(A), nnz_total)
+    colind_C = similar(getcolind(A), nnz_total)
+    nzval_C = similar(nonzeros(A), Tv, nnz_total)
+
+    # Copy entries from A
+    rowind_C[1:nnz_A] .= getrowind(A)
+    colind_C[1:nnz_A] .= getcolind(A)
+    nzval_C[1:nnz_A] .= nonzeros(A)
+
+    # Copy entries from B
+    rowind_C[(nnz_A+1):end] .= getrowind(B)
+    colind_C[(nnz_A+1):end] .= getcolind(B)
+    nzval_C[(nnz_A+1):end] .= nonzeros(B)
+
+    return DeviceSparseMatrixCOO(m, n, rowind_C, colind_C, nzval_C)


@copilot Here you are just concatenating the nonzero values, but they can also be repeated and one should merge them instead. Could you take into account also that?

Check that you merge also the other formats.

Fixed in commit 778e354. COO addition now properly merges duplicate entries by:

Concatenating and sorting by (row, col)

Marking unique entries

Compacting with duplicate merging using kernel_compact_coo!

All three formats (CSC, CSR, COO) now correctly merge duplicates. Verified with tests.

…ndency Co-authored-by: albertomercurio <[email protected]>

Copilot · 2025-11-16T15:48:51Z

@copilot Overall it seems a good job. I'm wondering if we can also implement the sum among transposed/adjointed matrices, like +(A::DeviceSparseMatrixCSC, B::Transpose{T, <:DeviceSparseMatrixCSC}), ...

Good question! Adding support for transposed/adjointed matrices would be valuable. Here are the options:

Option 1: Convert CSC transpose to CSR (Most efficient)

transpose(CSC) has the same structure as CSR
Can directly add CSC + transpose(CSC) by treating it as CSC + CSR addition
Would need new kernels to merge CSC and CSR formats

Option 2: Use existing pattern from mul!/dot (Also efficient)

Similar to how mul! handles transposed matrices with Val{TRANS} parameters
Could add kernels that work directly with transposed indices
Keeps both matrices in original format

Option 3: Convert to same format (Simpler but less efficient)

Convert transpose(CSC) → CSC or CSR(transpose(CSC)) → CSR first
Then use existing addition kernels
Extra memory allocation but reuses existing code

Recommendation: Option 1 (CSC+CSR merge) or Option 2 (direct transpose handling) would be most efficient. Option 2 follows the existing mul! pattern more closely. Should I implement one of these approaches?

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

https://api.github.com/repos/FluxML/MacroTools.jl/tarball/1e0228a030642014fe5cfe68c2c0a818f9e3f522
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaArrays/ArrayInterface.jl/tarball/d81ae5489e13bc03567d4fbbb06c546a5e53c857
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaArrays/StaticArrays.jl/tarball/b8693004b385c842357406e3af647701fe783f98
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaArrays/StaticArraysCore.jl/tarball/6ab403037779dae8c514bad259f32a447262455a
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaBinaryWrappers/LLVMExtra_jll.jl/tarball/8e76807afb59ebb833e9b131ebf1a8c006510f33
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaCollections/OrderedCollections.jl/tarball/05868e21324cede2207c6f0f466b4bfef6d5e7ee
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaConcurrent/Atomix.jl/tarball/29bb0eb6f578a587a49da16564705968667f5fa8
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaConcurrent/UnsafeAtomics.jl/tarball/b13c4edda90890e5b04ba24e20a310fbe6f249ff
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaDebug/CodeTracking.jl/tarball/9ce926a33a8608421a4d45c012884165b3fcd3ee
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaDebug/JuliaInterpreter.jl/tarball/b6c76964c65ebf8309460fb8f0f437b4a59d809b
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaDebug/LoweredCodeUtils.jl/tarball/e24491cb83551e44a69b9106c50666dea9d953ab
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaGPU/AcceleratedKernels.jl/tarball/0de01460ed11e90b42ce666c8ed0265bad59aa6a
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaGPU/Adapt.jl/tarball/7e35fca2bdfba44d797c53dfe63a51fabf39bfc0
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaGPU/GPUArrays.jl/tarball/6eff50be44a36e068a3c4b558d718feb8c178d00
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaGPU/GPUArrays.jl/tarball/83cf05ab16a73219e5f6bd1bdfa9848fa24ac627
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaGPU/GPUArrays.jl/tarball/8ddb438e956891a63a5367d7fab61550fc720026
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaGPU/KernelAbstractions.jl/tarball/b5a371fcd1d989d844a4354127365611ae1e305f
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaInterop/CEnum.jl/tarball/389ad5c84de1ae7cf0e28e381131c98ea87d54fc
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaLLVM/LLVM.jl/tarball/ce8614210409eaa54ed5968f4b50aa96da7ae543
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaLang/BaseCompiler.jl/tarball/382d79bfe72a406294faca39ef0c3cef6e6ce1f1
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaLang/Compat.jl/tarball/9d8a54ce4b17aa5bdce0ea5c34bc5e7c340d16ad
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaLang/JuliaSyntax.jl/tarball/0d4b3dab95018bcf3925204475693d9f09dc45b8
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaLang/PrecompileTools.jl/tarball/07a921781cab75691315adc645096ed5e370cb77
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaLang/ScopedValues.jl/tarball/c3b2323466378a2ba15bea4b2f73b081e022f473
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaPackaging/JLLWrappers.jl/tarball/0533e564aae234aff59ab625543145446d8b6ec2
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaPackaging/Preferences.jl/tarball/0f27480397253da18fe2c12a4ba4eb9eb208bf3d
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaPackaging/Requires.jl/tarball/62389eeff14780bfe55195b7204c0d8738436d64
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/JuliaStats/Statistics.jl/tarball/ae3bb1eb3bba077cd276bc5cfc337cc65c3075c0
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/JuliaTesting/Aqua.jl/tarball/d57fd255a8932b6509baf43284c416fc44d0b903
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/aviatesk/JET.jl/tarball/ca8ece730f1cd51f313e37e059a7ff827ad48115
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/jw3126/ArgCheck.jl/tarball/f9e9a66c9b7be1ad7372bbd9b062d9230c30c5ce
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (http block)
https://api.github.com/repos/simonster/Reexport.jl/tarball/45e428421666073eab6f2da5c9d310d99bb12f9b
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/timholy/Revise.jl/tarball/85d94c2be31f58728cd69d13f2e0bdd7ecf6dfe9
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
https://api.github.com/repos/vchuravy/HashArrayMappedTries.jl/tarball/2eaa69a7cab70a52b9687c8bf950a5a93ec895ae
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (http block)
pkg.julialang.org
- Triggering command: julia --project -e using Pkg; Pkg.instantiate() (dns block)
- Triggering command: julia --project=test -e using Pkg; Pkg.update() (dns block)
- Triggering command: julia --project -e using Pkg; Pkg.update(); Pkg.test() (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

codecov · 2025-11-16T22:35:16Z

Codecov Report

❌ Patch coverage is 93.44978% with 15 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.04%. Comparing base (59526b2) to head (778e354).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
src/matrix_coo/matrix_coo_kernels.jl	65.00%	7 Missing ⚠️
src/matrix_csc/matrix_csc_kernels.jl	93.44%	4 Missing ⚠️
src/matrix_csr/matrix_csr_kernels.jl	93.44%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #27      +/-   ##
==========================================
+ Coverage   82.64%   85.04%   +2.40%     
==========================================
  Files          14       14              
  Lines         801     1030     +229     
==========================================
+ Hits          662      876     +214     
- Misses        139      154      +15

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions

⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'Benchmark Results'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.30.

Benchmark suite	Current: `778e354`	Previous: `862b4d5`	Ratio
`Kronecker Product/JLArray/CSC`	`16593217` ns	`12002090.5` ns	`1.38`
`Kronecker Product/JLArray/COO`	`3484290` ns	`2334051` ns	`1.49`
`Kronecker Product/JLArray/CSR`	`16622938` ns	`11984809` ns	`1.39`

This comment was automatically generated by workflow using github-action-benchmark.

Initial plan

50223cc

Copilot AI assigned Copilot and albertomercurio Nov 16, 2025

Copilot started work on behalf of albertomercurio November 16, 2025 11:39 View session

Copilot AI and others added 2 commits November 16, 2025 12:04

Add sparse + sparse matrix addition for CSC, CSR, and COO formats

168dede

Co-authored-by: albertomercurio <[email protected]>

Fix scalar indexing issue and add sparse+sparse benchmarks

0ced574

Co-authored-by: albertomercurio <[email protected]>

Copilot AI changed the title ~~[WIP] Implement sum method for AbstractDeviceSparseMatrix~~ Implement sparse + sparse matrix addition for CSC, CSR, and COO formats Nov 16, 2025

Copilot finished work on behalf of albertomercurio November 16, 2025 12:28

Copilot AI requested a review from albertomercurio November 16, 2025 12:28

albertomercurio reviewed Nov 16, 2025

View reviewed changes

Copilot started work on behalf of albertomercurio November 16, 2025 15:32 View session

Fix COO addition to merge duplicates and remove unnecessary test depe…

778e354

…ndency Co-authored-by: albertomercurio <[email protected]>

Copilot AI requested a review from albertomercurio November 16, 2025 15:50

Copilot finished work on behalf of albertomercurio November 16, 2025 15:50

albertomercurio marked this pull request as ready for review November 16, 2025 22:41

github-actions bot reviewed Nov 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement sparse + sparse matrix addition for CSC, CSR, and COO formats #27

Implement sparse + sparse matrix addition for CSC, CSR, and COO formats #27

Uh oh!

Copilot AI commented Nov 16, 2025 •

edited

Loading

Uh oh!

albertomercurio left a comment

Uh oh!

Uh oh!

albertomercurio Nov 16, 2025

Uh oh!

Copilot AI Nov 16, 2025

Uh oh!

Copilot AI commented Nov 16, 2025 •

edited

Loading

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

codecov bot commented Nov 16, 2025 •

edited

Loading

Uh oh!

github-actions bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement sparse + sparse matrix addition for CSC, CSR, and COO formats #27

Are you sure you want to change the base?

Implement sparse + sparse matrix addition for CSC, CSR, and COO formats #27

Uh oh!

Conversation

Copilot AI commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Implementation Complete: Sparse Matrix Addition

Completed Tasks:

Implementation Details:

Test Results:

Comments on the Issue (you are @copilot in this section)

Uh oh!

albertomercurio left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

albertomercurio Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

codecov bot commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

⚠️ Performance Alert ⚠️

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Nov 16, 2025 •

edited

Loading

Copilot AI commented Nov 16, 2025 •

edited

Loading

codecov bot commented Nov 16, 2025 •

edited

Loading