PytatoPyOpenCLArrayContext: use SVM allocator if available, limit arg size for GPUs #189

matthiasdiener · 2022-08-31T03:02:24Z

Needs:

~~LoopyPyOpenCLTarget: pass through loopy.PyOpenCLTarget pytato#359~~ (maybe not needed)
~~PyOpenCL target: Overflow large argument counts into SVM struct loopy#642~~

inducer

This looks good generally. Two style nits below.

Two questions:

Does it work?
How come it's marked as draft?

inducer · 2022-08-31T23:25:22Z

arraycontext/impl/pytato/compile.py


        with ProcessLogger(logger, f"generate_loopy for '{prg_id}'"):
+            import pyopencl as cl
+            dev = self.actx.context.devices[0]


Safer to get it from the queue, which only has one device.

Done in 1e54ce4

arraycontext/impl/pytato/compile.py

inducer · 2022-08-31T23:28:41Z

arraycontext/impl/pytato/compile.py

+                    and cl.characterize.has_coarse_grain_buffer_svm(dev)):
+                limit = dev.max_parameter_size
+                # Leave some extra space since our sizes are estimates
+                target = lp.PyOpenCLTarget(limit_arg_size_nbytes=limit//2)


Hmm, upon second thought: We can only pass this if we're sure that the memory allocated is actually SVM. So this has to get involved with memory pool creation.

Should we do this here or as part of inducer/loopy#642 ?

Done in faba326 and 61038f9

matthiasdiener · 2022-09-07T21:29:11Z

Does it work?

As far as I can tell, yes.

How come it's marked as draft?

This still needs the other PRs to be merged first I think.

matthiasdiener · 2022-09-19T21:47:01Z

This is ready for another review @inducer

inducer · 2022-09-19T22:07:56Z

Thanks!

matthiasdiener self-assigned this Aug 31, 2022

matthiasdiener requested a review from inducer August 31, 2022 03:02

LazilyPyOpenCLCompilingFunctionCaller: limit arg size for GPUs

ba7c205

matthiasdiener force-pushed the limit-arg-size branch from 156e369 to ba7c205 Compare August 31, 2022 03:03

matthiasdiener added 2 commits August 30, 2022 22:23

move limit

07be560

also check for SVM presence

620ac82

inducer reviewed Aug 31, 2022

View reviewed changes

matthiasdiener added 2 commits September 7, 2022 16:05

get_target()

1e54ce4

memoize get_target

f82ba67

matthiasdiener and others added 6 commits September 7, 2022 16:34

UNDO BEFORE MERGE: use dev branches

11924bc

Merge branch 'main' into limit-arg-size

f1aedad

Hackety hack: SVM detection in actx constructor

faba326

check whether passed allocator supports SVM

61038f9

undo loopy branch

c769de1

implement it for the base class

6e912a9

matthiasdiener force-pushed the limit-arg-size branch from 40c3821 to 6e912a9 Compare September 13, 2022 01:12

matthiasdiener mentioned this pull request Sep 13, 2022

LoopyPyOpenCLTarget: pass through loopy.PyOpenCLTarget inducer/pytato#359

Closed

matthiasdiener added 2 commits September 13, 2022 08:33

subclass LoopyPyOpenCLTarget

2768fee

set actual limit

05a75bf

matthiasdiener marked this pull request as ready for review September 13, 2022 13:37

undo pytato branch

5e3bed2

matthiasdiener changed the title ~~LazilyPyOpenCLCompilingFunctionCaller: limit arg size for GPUs~~ PytatoPyOpenCLArrayContext, use SVM allocator if available, limit arg size for GPUs Sep 13, 2022

matthiasdiener changed the title ~~PytatoPyOpenCLArrayContext, use SVM allocator if available, limit arg size for GPUs~~ PytatoPyOpenCLArrayContext: use SVM allocator if available, limit arg size for GPUs Sep 13, 2022

matthiasdiener requested a review from inducer September 13, 2022 23:36

matthiasdiener and others added 4 commits September 14, 2022 10:05

remove unused argument

fe407cb

add type annotations

e14df92

add logging

4271e21

Refactor arg size passing to put less logic in the target

bf459d6

matthiasdiener added 2 commits September 19, 2022 15:11

flake8

025b1cf

add a test

29fe793

inducer enabled auto-merge (squash) September 19, 2022 22:07

inducer merged commit 3c9aee6 into main Sep 19, 2022

inducer deleted the limit-arg-size branch September 19, 2022 22:44

alexfikl mentioned this pull request Sep 20, 2022

Make pytato a soft dependency again #194

Closed

inducer mentioned this pull request Sep 21, 2022

Regression: "Out of host memory" on Nvidia ICD #196

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PytatoPyOpenCLArrayContext: use SVM allocator if available, limit arg size for GPUs #189

PytatoPyOpenCLArrayContext: use SVM allocator if available, limit arg size for GPUs #189

Uh oh!

matthiasdiener commented Aug 31, 2022 •

edited

Loading

Uh oh!

inducer left a comment

Uh oh!

inducer Aug 31, 2022

Uh oh!

matthiasdiener Sep 7, 2022

Uh oh!

Uh oh!

inducer Aug 31, 2022

Uh oh!

matthiasdiener Sep 7, 2022

Uh oh!

matthiasdiener Sep 12, 2022

Uh oh!

matthiasdiener commented Sep 7, 2022 •

edited

Loading

Uh oh!

matthiasdiener commented Sep 19, 2022

Uh oh!

inducer commented Sep 19, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

PytatoPyOpenCLArrayContext: use SVM allocator if available, limit arg size for GPUs #189

PytatoPyOpenCLArrayContext: use SVM allocator if available, limit arg size for GPUs #189

Uh oh!

Conversation

matthiasdiener commented Aug 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

inducer left a comment

Choose a reason for hiding this comment

Uh oh!

inducer Aug 31, 2022

Choose a reason for hiding this comment

Uh oh!

matthiasdiener Sep 7, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

inducer Aug 31, 2022

Choose a reason for hiding this comment

Uh oh!

matthiasdiener Sep 7, 2022

Choose a reason for hiding this comment

Uh oh!

matthiasdiener Sep 12, 2022

Choose a reason for hiding this comment

Uh oh!

matthiasdiener commented Sep 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

matthiasdiener commented Sep 19, 2022

Uh oh!

inducer commented Sep 19, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

matthiasdiener commented Aug 31, 2022 •

edited

Loading

matthiasdiener commented Sep 7, 2022 •

edited

Loading