-
Notifications
You must be signed in to change notification settings - Fork 216
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WSL]"XPU out of memory" error when using to("xpu") method in Intel PyTorch Extension (IPEX) #629
Comments
I learned from the issue that the prompt about exceeding 4GB is normal, but I've noticed that memory requests exceeding 1GB are also not allowed within WSL2, even though my a770 desktop only has Windows 11 installed, so I'm unable to observe the situation under Ubuntu 22.04 for now. Update: |
Not sure if this is related to the device driver or not, but it doesn't seem right to me that users can only allocate <1GB memory here. |
Can you run xpu-smi (https://github.com/intel/xpumanager/releases) to check "Memory pysical size" "Max Mem Alloc Size" |
@jgong5 @feng-intel |
I can't reproduce the issue on my ARC770. |
Describe the bug
Minimum Codes
Traceback (most recent call last):
Additional Details:
When moving the tensor of size (80, 1584, 2048) to the XPU device for the first time, the operation succeeds, and it shows that the tensor occupies approximately 990 MB of memory.
However, when attempting to move a slightly larger tensor (size (83, 1584, 2048)) to the XPU device, the "XPU out of memory" error is thrown, even though the XPU device has a total capacity of 15.56 GB.
Steps to Reproduce:
Install the required dependencies (PyTorch and IPEX) in a WSL2 environment.
Run the provided minimal reproducible code in a Python interactive session or script.
Observe the "XPU out of memory" error when attempting to move the larger tensor to the XPU device.
Expected Behavior:
The larger tensor should be successfully moved to the XPU device without encountering an "XPU out of memory" error, as the XPU device has sufficient total capacity.
Actual Behavior:
An "XPU out of memory" error is thrown when attempting to move the larger tensor to the XPU device, despite the XPU device having enough total capacity.
I would appreciate any assistance or guidance in resolving this issue. Please let me know if you require any additional information or clarification.
Versions
Environment Information:
Operating System: Windows Subsystem for Linux 2 (WSL2) Ubuntu22.04
Python Version: 3.10.14
Driver versison: Intel® Graphics Driver 31.0.101.5522 (WHQL Certified)**
Versions:
The text was updated successfully, but these errors were encountered: