Add frequency-domain SVD projection and dense residual block by mcoughlin · Pull Request #256 · ML4GW/ml4gw

mcoughlin · 2026-02-16T18:46:55Z

New ml4gw.nn.svd module with two components:

FreqDomainSVDProjection: FFT + linear projection layer initialized from precomputed SVD right singular vectors. Supports shared or per-channel (per-IFO) weights, and freeze/unfreeze for two-phase training. Adapted from DINGO's LinearProjectionRB.
DenseResidualBlock: LayerNorm(x + MLP(x)) residual block for processing SVD coefficients. Uses LayerNorm exclusively — BatchNorm causes train/eval output collapse in GW detection where batch composition (signal/noise ratio) differs between train and eval.

Includes 134 parametrized tests covering shapes, gradients, V matrix initialization, freeze/unfreeze, save/load, and train/eval consistency.

New ml4gw.nn.svd module with two components: - FreqDomainSVDProjection: FFT + linear projection layer initialized from precomputed SVD right singular vectors. Supports shared or per-channel (per-IFO) weights, and freeze/unfreeze for two-phase training. Adapted from DINGO's LinearProjectionRB. - DenseResidualBlock: LayerNorm(x + MLP(x)) residual block for processing SVD coefficients. Uses LayerNorm exclusively — BatchNorm causes train/eval output collapse in GW detection where batch composition (signal/noise ratio) differs between train and eval. Includes 134 parametrized tests covering shapes, gradients, V matrix initialization, freeze/unfreeze, save/load, and train/eval consistency. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

wbenoit26 · 2026-02-16T18:55:34Z

@mcoughlin Out of curiosity, how much of this was able to be done by Claude?

mcoughlin · 2026-02-16T18:57:14Z

@wbenoit26 A lot of it, but also a lot of discussion ;)

github-actions · 2026-02-16T19:30:02Z

Coverage report

Click to see where and how coverage changed

File	Statements	Missing	Coverage	Coverage (new stmts)	Lines missing
ml4gw/nn
__init__.py
ml4gw/nn/svd
__init__.py
dense.py
projection.py
Project Total

_{This report was generated by python-coverage-comment-action}

EthanMarx · 2026-02-17T16:22:27Z

Very cool. Not surprised claude is decent at this.

I think a feature that is missing is the actual fitting of the SVD? Or is it assumed that the SVD will be fit outside and saved to a file?

deepchatterjeeligo

Hi @mcoughlin we discussed about this morning. What came out of the discussion was that in addition to unittests, we should add some content in our documentation pages showing a usecase: something I thought out loud was a figure showing how adding more svd components approaches the waveform, but that is up for more discussion.

deepchatterjeeligo · 2026-02-19T16:04:35Z

ml4gw/nn/svd/dense.py

Can we reuse the parts from our ResNet implementation or generalize them for the residual blocks? I worry this will become a duplicate residual block implementation.

I think it's a bit too awkward, and I don't think it would generalize too much.

deepchatterjeeligo · 2026-02-19T16:08:21Z

ml4gw/nn/svd/projection.py

+                if V_tensor is not None:
+                    proj.weight.data = V_tensor.T.contiguous()


I believe the .T syntax for transpose will be deprecated soon. I have seen warnings about it. Can we explicitly supply the axes? Also, I'm curious about the .contiguous why that is needed.

deepchatterjeeligo · 2026-02-19T16:18:19Z

ml4gw/nn/svd/projection.py

+        x_freq = torch.fft.rfft(x, dim=-1)
+
+        # Stack real and imaginary: (batch, channels, 2 * n_freq)
+        x_ri = torch.cat([x_freq.real, x_freq.imag], dim=-1)
+
+        if self.per_channel:
+            proj_list = []
+            for ch in range(self.num_channels):
+                proj_list.append(self.projections[ch](x_ri[:, ch, :]))
+            x_proj = torch.stack(proj_list, dim=1)
+        else:
+            x_proj = self.projection(x_ri)
+
+        return x_proj.reshape(batch_size, -1)


This is a more basic question: I don't think you are computing the SVD here, right? Without the pretrained V matrix, the projections are random linear layers, which is not what we want.

I think it will be great addition to have the SVD creation feature using our own waveforms.

Added an example for this.

Add compute_basis() static method to FreqDomainSVDProjection for computing SVD basis vectors from waveform banks. Fix bug where frequency-domain input had imaginary parts discarded by premature float32 cast. Add comprehensive tests for compute_basis() and documentation with ml4gw IMRPhenomD waveform generation examples. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The compute_basis() static method uses sklearn's randomized_svd. Add scikit-learn to project dependencies and tox test deps so CI can run the TestComputeBasis tests. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

mcoughlin requested a review from deepchatterjeeligo February 16, 2026 18:47

Fix ruff formatting

512931b

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

deepchatterjeeligo reviewed Feb 19, 2026

View reviewed changes

mcoughlin and others added 3 commits February 20, 2026 08:31

Fix ruff formatting and trailing newline

24b5f4b

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add scikit-learn dependency for compute_basis()

8b13dd4

The compute_basis() static method uses sklearn's randomized_svd. Add scikit-learn to project dependencies and tox test deps so CI can run the TestComputeBasis tests. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

mcoughlin requested a review from deepchatterjeeligo February 20, 2026 16:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add frequency-domain SVD projection and dense residual block#256

Add frequency-domain SVD projection and dense residual block#256
mcoughlin wants to merge 5 commits intoML4GW:mainfrom
mcoughlin:svd-projection

mcoughlin commented Feb 16, 2026

Uh oh!

wbenoit26 commented Feb 16, 2026

Uh oh!

mcoughlin commented Feb 16, 2026

Uh oh!

github-actions bot commented Feb 16, 2026 •

edited

Loading

Uh oh!

EthanMarx commented Feb 17, 2026

Uh oh!

deepchatterjeeligo left a comment

Uh oh!

deepchatterjeeligo Feb 19, 2026

Uh oh!

mcoughlin Feb 19, 2026

Uh oh!

deepchatterjeeligo Feb 19, 2026

Uh oh!

mcoughlin Feb 19, 2026

Uh oh!

deepchatterjeeligo Feb 19, 2026

Uh oh!

mcoughlin Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		if V_tensor is not None:
		proj.weight.data = V_tensor.T.contiguous()

Conversation

mcoughlin commented Feb 16, 2026

Uh oh!

wbenoit26 commented Feb 16, 2026

Uh oh!

mcoughlin commented Feb 16, 2026

Uh oh!

github-actions bot commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage report

Uh oh!

EthanMarx commented Feb 17, 2026

Uh oh!

deepchatterjeeligo left a comment

Choose a reason for hiding this comment

Uh oh!

deepchatterjeeligo Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

mcoughlin Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

deepchatterjeeligo Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

mcoughlin Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

deepchatterjeeligo Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

mcoughlin Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Feb 16, 2026 •

edited

Loading