The consumer graphics card market is no longer a two-horse race between NVIDIA’s GeForce and AMD’s Radeon. In 2022, a third contender entered the ring: Intel’s Arc.
It is still a relatively unknown option to most desktop enthusiasts, which may explain why it is priced lower than its competitors. For folks like me, trying to get as much VRAM as possible on a budget, it is hard to dismiss.
But in a world where NVIDIA is the de facto standard and its seasoned rival AMD is struggling to make a dent, will the support for Intel GPUs be there?
Are the drivers stable enough? Are the open-source project maintainers making the effort to support it? Let’s find out.
Downloading and Setting Up Drivers
A pleasant surprise here: there is no need to edit configuration files or install any packages; there is out-of-the-box support with the latest Linux kernel. The support is lackluster, though: I couldn’t get fan speed or temperature sensors to work (someone has already asked for it).
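As a quick sanity check, you can confirm which kernel driver is bound to the card; on recent kernels an Arc card should be handled by the i915 module (or the newer xe driver):

$ lspci -nnk | grep -iA 3 vga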
Checking GPU Usage
To make sure the GPU is working properly and to check its performance, we are going to install:
$ sudo apt install intel-gpu-tools
$ sudo apt install mesa-utils
Now let’s fire up glxgears while watching the GPU activity:
$ sudo intel_gpu_top
$ vblank_mode=0 glxgears -info
Setting “vblank_mode=0” disables vsync, letting glxgears render as fast as possible instead of syncing to the display refresh rate.
We can see activity on the GPU, but it wasn’t enough to make the card’s fans spin. Let’s try with glmark2:
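(glmark2 ships in the Ubuntu repositories under the same name; the invocation below is just the default run.)

$ sudo apt install glmark2
$ glmark2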
That managed to get the GPU fan to spin.
Ollama
After Ollama finished installing, it printed “AMD GPU ready” and then proceeded to use my Ryzen 5 5500 CPU for inference, yielding a mere ~13 tokens per second (for comparison, I get ~100 tokens/s and ~75 tokens/s with my RTX 2060 and RX 7600, respectively).
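(For the curious: a simple way to get a tokens-per-second figure is ollama run with the --verbose flag, which prints an “eval rate” after each response; the model name below is just an example.)

$ ollama run llama3 --verbose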
Fingers crossed for the merge request I found that adds Intel Arc support to get merged soon.
Intel provides a custom Docker image (with custom scripts inside) for running Ollama, but I didn’t have much luck with it either.
Pulling the intelanalytics/ipex-llm-inference-cpp-xpu image and running a container from it:
$ docker run -it --rm --net host --device /dev/dri --privileged --memory 32gb --shm-size 16gb intelanalytics/ipex-llm-inference-cpp-xpu
Argument | Description |
---|---|
-it | starts an interactive terminal session in the Docker container |
--rm | automatically removes the container and its filesystem after it exits |
--net host | uses the host’s network stack |
--device /dev/dri | grants the container access to the Direct Rendering Infrastructure (DRI) devices on the host |
--privileged | gives extended privileges to the container, such as access to all devices on the host system |
--memory 32gb | limits the container’s memory usage to 32 GB |
--shm-size 16gb | sets the size of the shared memory at /dev/shm, which defaults to a mere 64 MB; too little for applications with large datasets or heavy inter-process communication |
Inside the container:
# . ipex-llm-init --gpu --device Arc
(...)
# sh /llm/scripts/start-ollama.sh
Then the error:
Error: llama runner process has terminated: signal: bus error (core dumped)
Unfortunately, I gave up running Ollama on it for now.
PyTorch
Following Intel’s official documentation, we get this for installing the package:
$ pip install torch==2.1.0a0 intel-extension-for-pytorch==2.1.10+xpu --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/
And an error right after importing the package:
$ python3
Python 3.10.12 (main, Mar 22 2024, 16:50:05) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import intel_extension_for_pytorch as ipex
Traceback (most recent call last):
(...)
OSError: libmkl_intel_lp64.so.2: cannot open shared object file: No such file or directory
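The missing libmkl_intel_lp64.so.2 ships with Intel’s oneAPI Math Kernel Library, so a plausible fix (a sketch, assuming oneAPI is installed under its default /opt/intel/oneapi prefix) is to source the oneAPI environment script so the dynamic loader can find the library:

$ source /opt/intel/oneapi/setvars.sh
$ python3 -c 'import intel_extension_for_pytorch as ipex; print(ipex.__version__)'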
Once again, let’s resort to an Intel-provided Docker image:
$ docker run -it --rm --device /dev/dri -v /dev/dri/by-path:/dev/dri/by-path --ipc=host intel/intel-extension-for-pytorch:2.1.30-xpu
Inside the container:
# python3
>>> import intel_extension_for_pytorch as ipex
>>> print(ipex.__version__)
2.1.30+xpu
Looks OK.
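Beyond the version string, it’s worth checking that the extension actually sees the card. Importing IPEX registers an “xpu” device with PyTorch, so a minimal sanity test (the tensor sizes are arbitrary) looks like this:

>>> import torch
>>> import intel_extension_for_pytorch as ipex
>>> torch.xpu.is_available()  # should print True if the Arc card is visible
>>> torch.rand(2, 2).to("xpu") @ torch.rand(2, 2).to("xpu")  # tiny matmul on the GPU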
TensorFlow
Following Intel’s official documentation, we get this for installing the package:
$ pip install intel-extension-for-tensorflow[xpu]
Then the following for checking the installation:
import intel_extension_for_tensorflow as itex
print(itex.__version__)
And, of course, it didn’t work:
tensorflow.python.framework.errors_impl.NotFoundError: libimf.so: cannot open shared object file: No such file or directory
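libimf.so is part of the Intel compiler runtime, which also ships with oneAPI, so the same workaround as in the PyTorch section might apply (again assuming the default /opt/intel/oneapi install prefix):

$ source /opt/intel/oneapi/setvars.sh
$ python3 -c 'import intel_extension_for_tensorflow as itex; print(itex.__version__)'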
Again, Docker to the rescue:
$ docker run -it --rm --device /dev/dri -v /dev/dri/by-path:/dev/dri/by-path --ipc=host intel/intel-extension-for-tensorflow:xpu
Inside the container, it seems to work as expected:
# python3
>>> import intel_extension_for_tensorflow as itex
>>> print(itex.__version__)
2.15.0.0
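As a final check that the extension sees the card, and not just that it imports: ITEX registers the GPU under TensorFlow’s “XPU” device type, so listing physical devices should include it:

>>> import tensorflow as tf
>>> tf.config.list_physical_devices("XPU")  # should list the Arc card as an XPU device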