Conversation
Signed-off-by: Ayush Kamat <ayush@latch.bio>
```python
try:
    parent.mkdir(exist_ok=True, parents=True)
    break
except NotADirectoryError:  # somewhere up the tree is a file
```
huh, shouldn't this eat shit?
not sure what this means
latch_cli/services/cp/main.py
```python
acc_info = execute(gql.gql("""
    query AccountInfo {
        accountInfoCurrent {
            id
        }
    }
"""))["accountInfoCurrent"]

for src in srcs:
    src_remote = is_remote_path(src)
    acc_id = acc_info["id"]
```
what is the purpose of this?
used in the `get_path_error` fn in the `except` for nice error message printing
I would definitely only do this query if we need to (if there is an error)
I would entirely avoid network stuff in the error path if possible.
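One way to keep the network query out of the happy path is to memoize a lazy lookup that only runs when an error message actually needs the account id. A minimal sketch, assuming the real query is the `execute(gql.gql(...))` call from the diff (the `_query_account_info` stand-in below is hypothetical):

```python
from functools import lru_cache

# Hypothetical stand-in for the GraphQL call in the diff; the real code runs
# execute(gql.gql("query AccountInfo { accountInfoCurrent { id } }")).
def _query_account_info():
    return {"id": 1234}

@lru_cache(maxsize=1)
def account_id():
    # The query runs at most once, and only when first called -
    # e.g. inside the except branch that builds the error message.
    return _query_account_info()["id"]
```

With this, `get_path_error` can call `account_id()` inside the `except` and successful transfers never touch the network for it.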
```python
# jitter to not dos nuc-data
await asyncio.sleep(0.1 * random.random())
```
kinda odd, what did we see here before?
for wide directories with small files there's not enough time between start-upload calls, so we end up throttling nuc-data
But like shouldn't we use a semaphore on the call rather than adding jitter? Jitter is worse because it is not aware of how many calls are inflight or how long they are taking.
```python
# exception handling
resp = await sess.post(
    "https://nucleus.latch.bio/ldata/end-upload",
    headers={"Authorization": get_auth_header()},
    json={
        "path": work.dest,
        "upload_id": data["upload_id"],
        "parts": [
            {
                "ETag": part.etag,
                "PartNumber": part.part,
            }
            for part in parts
        ],
    },
)
resp.raise_for_status()

if print_file_on_completion:
    pbar.write(work.src.name)

pbar.reset()
total_pbar.update(1)

pbar.clear()
```
you might want to do a smarter backoff with more tries given that we can 429 on this
makes sense for more retries - two questions:
- what is a smarter backoff method? I'm not super familiar with any other than exponential
- what does this backoff method lack that a smarter method would address?
this one is kinda unbounded in retries but there should be a maximum. Ideally, we have a semaphore which bounds the number of concurrent calls to nuc-data, and then backoffs are less important and we can keep this as is.
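For reference, a common "smarter" scheme is capped exponential backoff with full jitter: the wait grows exponentially per attempt, is capped, is randomized to avoid synchronized retry storms, and the retry count is bounded. A sketch (parameter values are illustrative, not from the diff):

```python
import random

def backoff_delays(max_retries=5, base=0.5, cap=8.0):
    # Capped exponential backoff with "full jitter": each retry waits a
    # random amount in [0, min(cap, base * 2**attempt)], and the number
    # of retries is bounded by max_retries.
    for attempt in range(max_retries):
        yield random.uniform(0, min(cap, base * 2 ** attempt))

delays = list(backoff_delays())
```

Plain exponential backoff without jitter lets many clients retry in lockstep (thundering herd), and without a cap or retry bound a struggling server can be hammered indefinitely.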
Signed-off-by: Ayush Kamat <ayush@latch.bio>
```python
    "https://nucleus.latch.bio/ldata/start-upload": asyncio.BoundedSemaphore(2),
    "https://nucleus.latch.bio/ldata/end-upload": asyncio.BoundedSemaphore(2),
}
```
you're probably fine with like 5 or 10 each
```python
start_upload_sema = asyncio.BoundedSemaphore(2)
end_upload_sema = asyncio.BoundedSemaphore(2)
```
```python
if resp.status == 429:
    raise RateLimitExceeded(
        "The service is currently under load and could not complete your"
        " request - please try again later."
    )
```
wait, this should just back off and retry? why are we failing here?
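A sketch of retrying 429s with bounded, jittered backoff instead of raising immediately; `do_post` is a hypothetical coroutine returning a status code, standing in for the aiohttp call in the diff:

```python
import asyncio
import random

class RateLimitExceeded(Exception):
    pass

async def post_with_retry(do_post, max_retries=5, base=0.5, cap=8.0):
    # Retry 429s with capped, jittered exponential backoff; only give up
    # (and surface RateLimitExceeded) after max_retries attempts.
    for attempt in range(max_retries):
        status = await do_post()
        if status != 429:
            return status
        await asyncio.sleep(random.uniform(0, min(cap, base * 2 ** attempt)))
    raise RateLimitExceeded("gave up after repeated 429s")

# Usage with a fake endpoint that 429s twice and then succeeds.
calls = {"n": 0}

async def fake_post():
    calls["n"] += 1
    return 429 if calls["n"] <= 2 else 200

status = asyncio.run(post_with_retry(fake_post, base=0.01))
```

This keeps the user-facing `RateLimitExceeded` for the genuinely-overloaded case while absorbing transient throttling automatically.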
Signed-off-by: Ayush Kamat <ayush@latch.bio>