Skip to content

BLAS API compatibility#159

Draft
simonpintarelli wants to merge 4 commits intomasterfrom
blas-api-compatibility
Draft

BLAS API compatibility#159
simonpintarelli wants to merge 4 commits intomasterfrom
blas-api-compatibility

Conversation

@simonpintarelli
Copy link
Member

@simonpintarelli simonpintarelli commented Jan 14, 2026

Ref #158

Accept unitialized matrix C for case beta==0, fill with zeros on entry to call to multiply

Need to wait for a new release of COSTA, with a routine to fill matrix with given value.

@simonpintarelli simonpintarelli marked this pull request as draft January 14, 2026 13:44
simonpintarelli and others added 3 commits February 4, 2026 12:53
* Fix validation tolerance: use relative error instead of absolute

The validation was using absolute error tolerance (1e-8) which fails for
large matrix multiplication results (magnitude ~1e4). This caused false
negatives where COSMA computed correct results but failed validation.

Changes:
- Switch from absolute error to relative error for validation
- Use 1e-5 tolerance for float32 (appropriate for single precision)
- Use 1e-8 tolerance for float64 (appropriate for double precision)
- Handle small values near zero with absolute error fallback

This fixes issue #153 where K-split strategy was incorrectly reported
as producing 93.6% errors when actual relative errors were < 1e-6.

Tested with:
- 32x896x896 float32: now passes (was 93.8% false errors)
- 32x10000x896 float32: now passes (was 93.6% false errors)
- 32x32x32 float64: still passes (regression test)

* format

---------

Co-authored-by: David Sanftenberg <david.sanftenberg@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments