Commit f21a679

enable cpu for contiguous batching
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
1 parent 8ac8c21 commit f21a679

1 file changed: 10 additions, 3 deletions

src/transformers/generation/continuous_batching/requests.py

Lines changed: 10 additions & 3 deletions
@@ -50,9 +50,16 @@ def get_device_and_memory_breakdown() -> tuple[torch.device, int, int, int]:
         reserved_memory = 0  # MPS does not track reserved separately
     else:
         device = torch.device("cpu")
-        total_memory = None
-        reserved_memory = 0
-        allocated_memory = 0
+        try:
+            import psutil
+
+            total_memory = psutil.virtual_memory().total
+            allocated_memory = psutil.Process().memory_info().rss
+            reserved_memory = allocated_memory
+        except ImportError:
+            total_memory = 0
+            reserved_memory = 0
+            allocated_memory = 0
     return device, total_memory, reserved_memory, allocated_memory
 
 
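For reviewers who want to try the CPU branch in isolation, the following is a minimal standalone sketch of the same fallback logic, without the `torch` dependency. The function name `cpu_memory_breakdown` is hypothetical and is not part of the commit. It assumes, as the diff does, that `psutil` is optional and that the process RSS is an acceptable stand-in for both "reserved" and "allocated" memory on CPU.

```python
def cpu_memory_breakdown() -> tuple[int, int, int]:
    """Return (total, reserved, allocated) in bytes for the current process.

    Hypothetical helper mirroring the CPU branch of the diff above.
    """
    try:
        import psutil  # optional dependency, as in the commit
    except ImportError:
        # Without psutil there is no portable way to query memory here,
        # so every figure falls back to 0, matching the commit.
        return 0, 0, 0

    # Total physical memory of the machine, in bytes.
    total = psutil.virtual_memory().total
    # Resident set size of this process, in bytes.
    allocated = psutil.Process().memory_info().rss
    # CPU has no separate allocator cache to report, so reserved is
    # set equal to allocated, as the commit does.
    reserved = allocated
    return total, reserved, allocated


if __name__ == "__main__":
    total, reserved, allocated = cpu_memory_breakdown()
    print(f"total={total} reserved={reserved} allocated={allocated}")
```

Note that when `psutil` is missing the total is reported as 0 rather than `None`, which keeps the declared `int` return types honest but means callers cannot tell "unknown" from "no memory" without an explicit check.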
