
Conversation


Titus-von-Koeller and others added 30 commits May 16, 2025 08:41
* Test g5g runner

* Switch L4 to L40S runner; swap GitHub Linux T4 runner for AWS g4dn

* Run tests on last 2 pytorch stable releases
* General cleanup & test improvements

* Tests: WA numpy 2 compat issue for torch<2.3

* Tests: update aarch64 cpu min torch version
* Add torch.compile tests

* Tests: WA aarch64 CPU regressions for torch 2.6.0; add Windows torch==2.7.0+cu118 test config

* Tests: skip torch.compile for cuda on windows
* Start cleaning up docs

* Remove page

* Minor update

* correction

* Minor doc revisions

* Update installation.mdx

* Update _toctree.yml
* enable ipex

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix cpu 8bit quantization

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix int8 and nf4 cpu inference

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
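The int8 CPU-inference fixes above revolve around absmax quantization. As a rough illustration only (this is not the bitsandbytes implementation, and the function names here are made up), row-wise int8 quantization can be sketched like this:

```python
# Hypothetical sketch of row-wise absmax int8 quantization, the basic idea
# behind 8-bit inference. Not the actual bitsandbytes kernels.
def quantize_int8(row):
    # Scale so the largest magnitude maps to 127.
    absmax = max(abs(v) for v in row) or 1.0
    scale = absmax / 127.0
    q = [max(-127, min(127, round(v / scale))) for v in row]
    return q, scale

def dequantize_int8(q, scale):
    # Recover an approximation of the original values.
    return [v * scale for v in q]

row = [0.5, -1.25, 2.0, -0.125]
q, scale = quantize_int8(row)
approx = dequantize_int8(q, scale)
```

The round-trip error per element is bounded by the scale, which is why per-row (or per-block) scaling matters for accuracy.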

* add cpu fp4 and rem

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix dequantize nf4 xpu

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix ipex op

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix dequantize nf4 name

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix dequantize nf4 ipex

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix matmul8bitfp

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* enable cpu tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix quantize blockwise output shape

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix quant_storage bf16 and gemv cpu

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix cpu tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix xpu tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix lib

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* skip xpu dequantize blockwise op check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix matmul8bit

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* skip unused function tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix matmul8bit fp

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* check ipex before MatMul8bitFp

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
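The "check ipex before MatMul8bitFp" commit is an optional-dependency guard: only dispatch to the ipex path when the package imports. A minimal sketch of that pattern, with illustrative names (`HAS_IPEX`, `pick_matmul_path` are not the library's actual identifiers):

```python
# Hypothetical sketch: detect intel_extension_for_pytorch (ipex) before
# dispatching to an ipex-specific kernel. Names here are illustrative.
try:
    import intel_extension_for_pytorch  # noqa: F401  (optional dependency)
    HAS_IPEX = True
except ImportError:
    HAS_IPEX = False

def pick_matmul_path():
    # Fall back to the default int8 path when ipex is unavailable.
    return "MatMul8bitFp (ipex)" if HAS_IPEX else "default int8 matmul"
```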

* update ipex install guide

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update install guide

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix error log

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix error log

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update comment

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* move torch op to default

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* revert ipex check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix code table device

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix code table device

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix xpu ops

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* Tests: xfail opcheck for 4bit quantization with floating storage dtypes

* Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
* Tests: add linux x64 cpu+ipex to nightly CI workflow

* typo

* Tests: guard linear8bit compile test for ipex cpu issue
* Deprecation cleanup: remove histogram_scatter_add_2d

* Deprecation cleanup: vectorwise_mm_dequant

* Deprecation cleanup: vectorwise_quant

* Remove unused test

* Optimizer test cleanup

* Deprecations: remove estimate_quantiles, create_quantile_map

* Move deprecated test
* support hpu backend in main branch

* Update bitsandbytes/backends/hpu/ops.py

updates the assertion message

Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Update bitsandbytes/backends/hpu/ops.py

Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Update ops.py

Fix lint issue

* Update ops.py

---------

Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
matthewdouglas and others added 18 commits June 6, 2025 16:19
…s-foundation#1629)

* [xpu/triton] Add Triton dequantization kernel

This PR adds an XPU backend and a Triton kernel for dequantizing the nf4 dtype.
Triton is an optional import.
Tests:
	tests/test_functional.py::TestQuantize4BitFunctional (supported nf4/fp4 cases)
	tests/test_functional.py::Test8BitBlockwiseQuantizeFunctional (implements quantize_blockwise with a binary search that runs faster on XPU)
	tests/test_linear4bit.py
Signed-off-by: Dmitrii Makarenko <dmitrii.makarenko@intel.com>
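The binary-search idea mentioned for quantize_blockwise can be illustrated in pure Python: each value in a block is normalized by the block's absmax and mapped to the nearest entry of a sorted code table via `bisect`. This is a hedged sketch only; the code-table values below are made up, and the actual kernel is a Triton implementation for XPU:

```python
import bisect

# Illustrative sorted 16-entry code table (NOT the real NF4 constants).
CODE = sorted([-1.0, -0.7, -0.53, -0.39, -0.28, -0.18, -0.09, 0.0,
               0.08, 0.16, 0.25, 0.34, 0.44, 0.56, 0.72, 1.0])

def quantize_block(values):
    # Normalize by the block absmax, then binary-search the code table.
    absmax = max(abs(v) for v in values) or 1.0
    codes = []
    for v in values:
        x = v / absmax
        i = bisect.bisect_left(CODE, x)
        if i == 0:
            codes.append(0)
        elif i == len(CODE):
            codes.append(len(CODE) - 1)
        else:
            # Pick whichever neighbouring code point is closer.
            lo, hi = CODE[i - 1], CODE[i]
            codes.append(i if hi - x < x - lo else i - 1)
    return codes, absmax

def dequantize_block(codes, absmax):
    return [CODE[c] * absmax for c in codes]
```

The binary search makes the per-element lookup O(log 16) instead of a linear scan over the table, which is where the speedup on XPU comes from.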

* align with ipex code

* enable test for ipex

* test_kbit_backprop: skip no longer needed

* remove unused

---------

Signed-off-by: Dmitrii Makarenko <dmitrii.makarenko@intel.com>
* doc fix signature for 8-bit optim

* required changes

* precommit
* Add clang-format rules

* Update clang-format
* Setup XPU CI

* CI: expand XPU matrix

* test

* test

* test

* test

* test

* test

* test

* test

* test

* test

* skip some fp4 tests on hpu

* skip gemv tests on hpu

* test

* Additional test patches for HPU

* HPU test update

* HPU test update

* HPU test update

* HPU test update

* Format
@pnunna93 pnunna93 self-requested a review June 18, 2025 17:15

@pnunna93 pnunna93 left a comment


LGTM

@pnunna93 pnunna93 merged commit 648ecd2 into ROCm:upstream_main_rocm_enabled Jun 18, 2025
2 checks passed