Test QR rules with CUDA by kshyatt · Pull Request #241 · QuantumKitHub/MatrixAlgebraKit.jl

kshyatt · 2026-06-01T09:40:59Z

No description provided.

kshyatt · 2026-06-04T15:48:07Z

OK, this is now working for QR on full CuMatrix, but there are still scalar indexing errors for Diagonal because dispatch for SubArray{T, 2, Diagonal{T, CuVector{T}} is awful. I'd be ok with merging this as-is if the pullback modifications look ok, and then come back for another swing at the Diagonal case.

lkdvos

Would it make sense to introduce a project_triu(!) and/or norm_triu function for this, and just overload the GPU ones to allocate?

As a side note, we might at some point try and not generate any of the gauge dependency checks if the tolerance is set to 0, as a way to disable the warnings, or alternatively put the entire thing in a @warn block so we can disable that, since these checks are starting to feel a bit expensive if we need to start allocating.

kshyatt · 2026-06-08T13:15:24Z

Would it make sense to introduce a project_triu(!) and/or norm_triu function for this, and just overload the GPU ones to allocate?

Yeah I think that could be a nice way to handle it, however I'm unsure if it's that helpful if it's only used in these checks.

codecov · 2026-06-09T12:46:37Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

Files with missing lines	Coverage Δ
src/pullbacks/qr.jl	`97.00% <100.00%> (+1.44%)`	⬆️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Jutho · 2026-06-10T23:12:26Z


    Q₁ = view(Q, :, 1:p)
-    R₁₁ = UpperTriangular(view(R, 1:p, 1:p))
+    R₁₁ = UpperTriangular(R[1:p, 1:p])


This is a subtle and impactful change. The UpperTriangular wrapper is really only necessary to enable the rdiv! call below. If GPUs cannot deal with UpperTriangular of a view of a GPUArray, then maybe we need to call the corresponding BLAS/LAPACK methods directly, or have some intermediate wrapper like rdiv_uppertriangular!.

If GPUs cannot deal with UpperTriangular of a view of a GPUArray

Indeed they can't :(

I even wonder how rdiv!(::Matrix, ::UpperTriangular) is evaluated on the GPU, since you need cuSOLVERDx to access TRSM.

Through https://docs.nvidia.com/cuda/cublas/#cublas-t-trsm I think

BTW cuSOLVERDx is only for device side code, so it can only be called by running CUDA kernels, not from host side code as we are doing here...

https://github.com/JuliaGPU/CUDA.jl/blob/fbb90981cbde21d979087ad518a510f5b38f95b3/lib/cusolver/src/linalg.jl#L43 is this what you're looking for?

Not really, this is internally in dividing using \ by a general matrix, which is then evaluated by computing its QR decomposition and then directly calling cuBLAS.trsm! on the triangular factor. Here we already have the triangular factor, so I indeed want to call trsm!, but using generic code, which is why I was using ldiv!/rdiv!. And I don't see where rdiv!(first_arg, second_arg::UpperTrangiular) is then actually lowered to cuBLAS.trsm! in the CuArray case (and why that only works for a pure CuMatrix and not for a view over it.)

I think I figured it out, it's a combo of:

https://github.com/JuliaLang/LinearAlgebra.jl/blob/1dcf75c70ea5a8d518ce71efe04c6c1e2093628d/src/triangular.jl#L1263

and

https://github.com/JuliaGPU/CUDA.jl/blob/fbb90981cbde21d979087ad518a510f5b38f95b3/lib/cublas/src/linalg.jl#L443

To unblock this: shall we indeed create a helper rdiv!_uppertriangular! that for now just avoids the copy on the CPU and simply copies on the GPU with a # TODO: dispatch to trsm! directly

But Then I don't understand why it doesn't work. The argument B::StridedCuMatrix in https://github.com/JuliaGPU/CUDA.jl/blob/fbb90981cbde21d979087ad518a510f5b38f95b3/lib/cublas/src/linalg.jl#L443 should accept a view of a CuMatrix, no? My apologies for being annoying, my lack of access to a GPU to test these things myself makes me ask these questions.

kshyatt · 2026-06-30T23:20:28Z

The Enzyme windows fail is happening on main and seems unrelated

kshyatt force-pushed the ksh/cuqr branch from 61b5a56 to 58e25ae Compare June 4, 2026 15:46

kshyatt requested review from Jutho and lkdvos and removed request for Jutho June 4, 2026 15:48

Jutho reviewed Jun 5, 2026

View reviewed changes

Comment thread src/pullbacks/qr.jl Outdated

lkdvos reviewed Jun 7, 2026

View reviewed changes

kshyatt force-pushed the ksh/cuqr branch from 58e25ae to 57993e1 Compare June 9, 2026 12:11

Jutho reviewed Jun 10, 2026

View reviewed changes

Comment thread src/pullbacks/qr.jl Outdated

Jutho reviewed Jun 10, 2026

View reviewed changes

kshyatt and others added 5 commits June 30, 2026 14:08

Test QR rules with CUDA

403a1ef

Incremental progress on pb

79bd042

Turn off Diagonal QR tests for CUDA for now

4029531

Working QR

c435b7b

Fix another bad R22upper

ddc689a

kshyatt force-pushed the ksh/cuqr branch from 57993e1 to ddc689a Compare June 30, 2026 18:09

kshyatt added 2 commits June 30, 2026 14:36

Typo

0b344ec

Another fix

2a94261

Jutho reviewed Jul 1, 2026

View reviewed changes

Comment thread src/pullbacks/qr.jl

Remove duplicated comment

46f5f4b

Uh oh!

Conversation

kshyatt commented Jun 1, 2026

Uh oh!

kshyatt commented Jun 4, 2026

Uh oh!

Uh oh!

lkdvos left a comment

Choose a reason for hiding this comment

Uh oh!

kshyatt commented Jun 8, 2026

Uh oh!

codecov Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kshyatt commented Jun 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov Bot commented Jun 9, 2026 •

edited

Loading