[FEAT] Adding `vecdot` implementation by SwayamInSync · Pull Request #86 · numpy/numpy-quaddtype

SwayamInSync · 2026-05-11T23:18:52Z

Implementing NumPy's vecdot for quaddtype

AI declaration:
Claude is used to extend test_dot.py for adding test related vecdot and possible edge cases

SwayamInSync · 2026-05-11T23:21:57Z

This PR also adds some helpers and a new ufunc registering helper function, which will be use for register matvec and vecmat so I will make those PR after this gets merged

ngoldbaum

Overall looks good, a few comments below.

ngoldbaum · 2026-05-15T20:22:14Z

-#ifndef DISABLE_QUADBLAS
-
    PyType_Slot slots[] = {
            {NPY_METH_resolve_descriptors, (void *)&quad_matmul_resolve_descriptors},


Claude points out that quad_matmul_resolve_desctriptors includes an error message that references matmul. It needs to be generalized to use the ufunc name rather than hardcode it:

numpy-quaddtype/src/csrc/umath/matmul.cpp

Lines 38 to 39 in fc52921

"QBLAS-accelerated matmul only supports SLEEF backend. "

"Please raise the issue at SwayamInSync/QBLAS for longdouble support");

Also because resolve_descriptors errors out for non-SLEEF backends, longdouble will never actually be able to call the naive loops. This is a pre-existing issue but it's copied here.

ngoldbaum · 2026-05-15T20:23:01Z

+    if (descr->backend != BACKEND_SLEEF) {
+        PyErr_SetString(PyExc_NotImplementedError,
+                        "QBLAS-accelerated vecdot only supports SLEEF backend.");
+        return -1;


this check is unnecessary because resolve_descriptors would have already triggered the same error.

ngoldbaum · 2026-05-15T20:26:18Z

+            Sleef_quad a_val, b_val;
+            memcpy(&a_val, x + k * x_n_stride, sizeof(Sleef_quad));
+            memcpy(&b_val, y + k * y_n_stride, sizeof(Sleef_quad));
+            sum = Sleef_fmaq1_u05(a_val, b_val, sum);


unaligned matmul does a different thing and calls into qblas_dot:

numpy-quaddtype/src/csrc/umath/matmul.cpp

Lines 234 to 248 in fc52921

Sleef_quad *temp_A_buffer = new Sleef_quad[n];

Sleef_quad *temp_B_buffer = new Sleef_quad[n];

memcpy(temp_A_buffer, A_ptr, n * sizeof(Sleef_quad));

memcpy(temp_B_buffer, B_ptr, n * sizeof(Sleef_quad));

size_t incx = 1;

size_t incy = 1;

result = qblas_dot(n, temp_A_buffer, incx, temp_B_buffer, incy, C_ptr);

delete[] temp_A_buffer;

delete[] temp_B_buffer;

break;

}

Why the difference? Claude seems to think the matmul approach has better numerical accuracy.

ngoldbaum · 2026-05-15T20:27:44Z

+        x = create_quad_array([1, 2])
+        y = create_quad_array([1, 2, 3])
+        with pytest.raises(ValueError):
+            np.vecdot(x, y)


might be worth adding a test that does vecdot on an empty array, e.g. np.vecdot(create_quad_array([]), create_quad_array([])).

SwayamInSync · 2026-05-15T21:19:08Z

Thanks @ngoldbaum , I was thinking to re-patch this and #88 after #95
So if possible we you can next checkout #95 and then I will easily wire the rest ones in.

adding vecdot implementation

5c95b5d

SwayamInSync requested a review from ngoldbaum May 12, 2026 15:48

ngoldbaum reviewed May 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FEAT] Adding `vecdot` implementation#86

[FEAT] Adding `vecdot` implementation#86
SwayamInSync wants to merge 1 commit into
numpy:mainfrom
SwayamInSync:vecdot

SwayamInSync commented May 11, 2026

Uh oh!

SwayamInSync commented May 11, 2026

Uh oh!

ngoldbaum left a comment

Uh oh!

ngoldbaum May 15, 2026

Uh oh!

ngoldbaum May 15, 2026

Uh oh!

ngoldbaum May 15, 2026

Uh oh!

ngoldbaum May 15, 2026

Uh oh!

SwayamInSync commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	"QBLAS-accelerated matmul only supports SLEEF backend. "
	"Please raise the issue at SwayamInSync/QBLAS for longdouble support");

	Sleef_quad *temp_A_buffer = new Sleef_quad[n];
	Sleef_quad *temp_B_buffer = new Sleef_quad[n];

	memcpy(temp_A_buffer, A_ptr, n * sizeof(Sleef_quad));
	memcpy(temp_B_buffer, B_ptr, n * sizeof(Sleef_quad));

	size_t incx = 1;
	size_t incy = 1;

	result = qblas_dot(n, temp_A_buffer, incx, temp_B_buffer, incy, C_ptr);

	delete[] temp_A_buffer;
	delete[] temp_B_buffer;
	break;
	}

Uh oh!

Conversation

SwayamInSync commented May 11, 2026

Uh oh!

SwayamInSync commented May 11, 2026

Uh oh!

ngoldbaum left a comment

Choose a reason for hiding this comment

Uh oh!

ngoldbaum May 15, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum May 15, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum May 15, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum May 15, 2026

Choose a reason for hiding this comment

Uh oh!

SwayamInSync commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants