Skip to content
Discussion options

You must be logged in to vote

If assumptions (1, 2) above are broken, does it mean we get less benefit from cuQuantum (as it currently stands) due to other induced overheads as a result of assumptions (1, 2) no longer hold?

Regarding (1, 2), the more mingled the model is, the more time you will spend in cuQuantum (say custatevec) calculating intermediate gradient data in the backward pass. The more the model looks like a traditional circuit, the more opportunities you will have to optimize and pipeline the circuit gradient with fewer API calls and memory transactions.

The Qiskit example is really useful here since AFAIK it doesn't use JAX. Let's suppose we implement a hybrid model, where all the assumptions (1, 2, …

Replies: 2 comments 5 replies

Comment options

You must be logged in to vote
3 replies
@wcqc
Comment options

@mtjrider
Comment options

@refraction-ray
Comment options

Comment options

You must be logged in to vote
2 replies
@mtjrider
Comment options

Answer selected by wcqc
@wcqc
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants