Vulkan driver

This driver uses libvulkan and clspv to run OpenCL code on GPU devices via Vulkan API.

NOTE: THIS DRIVER IS INCOMPLETE, without an active maintainer. Pull Requests welcomed.

Installation

Required:

  • vulkan drivers (on Ubuntu, “mesa-vulkan-drivers” for opensource vulkan drivers)

  • vulkan development files (on Ubuntu, “vulkan-headers” and “libvulkan-dev”)

  • SPIR-V tools (for clspv; on Ubuntu, package “spirv-tools”)

The Vulkan headers, devices and library must support at least Vulkan version 1.1; 1.0 devices may work but are untested. With 1.0 headers, pocl-vulkan won’t compile.

Optional:

  • “vulkan-validationlayers-dev” for vulkan validation layers

  • “vulkan-tools” or “vulkan-utils” package for vulkaninfo

Note that the Vulkan device MUST support the following extensions (clspv requirements):

  • VK_KHR_variable_pointers

  • VK_KHR_storage_buffer_storage_class

  • VK_KHR_shader_non_semantic_info

Optional extensions:
  • VK_EXT_external_memory_host for CL_MEM_USE_HOST_PTR to be useful

  • VK_KHR_16bit_storage, VK_KHR_8bit_storage, VK_KHR_shader_float16_int8 to be able to use 8 bit and 16 bit integers

Easiest to check is with vulkaninfo utility, they must be listed in ‘Device Extensions’ section.

To build the full pocl-vulkan, first you must build the clspv compiler:

git clone https://github.com/google/clspv.git
cd clspv
python utils/fetch_sources.py
mkdir build ; cd build
cmake /path/to/clspv -DCLSPV_BUILD_TESTS=OFF -DCMAKE_BUILD_TYPE=Release
make -jX
make install

… this will take some time and space, because it compiles its own checkout of LLVM.

After the build, copy “clspv” and “clspv-reflection” binaries to some place CLSPV_BIN_DIR

Then build the vulkan driver:

cmake -DENABLE_HOST_CPU_DEVICES=0 -DENABLE_LLVM=0 -DENABLE_VULKAN=1 -DCLSPV_DIR=${CLSPV_BIN_DIR} <path-to-pocl-source-dir>

You may set VULKAN_SDK env variable before running cmake, then it will look for libvulkan in VULKAN_SDK/lib directory.

After build, libpocl can be tested with (run in the build directory):

OCL_ICD_VENDORS=$PWD/ocl-vendors/pocl-tests.icd POCL_BUILDING=1 POCL_DEVICES=vulkan ./examples/example1/example1

Adding POCL_VULKAN_VALIDATE=1 POCL_DEBUG=vulkan into the environment enables the use of validation layers, this will make output from PoCL much more verbose.

It is possible to build & use pocl-vulkan without clspv, but this limits the usability of the driver to clCreateProgramWithBinaries() with poclbinaries.

What works

  • both integrated and discrete GPUs are supported

  • buffer (cl_mem) kernel arguments

  • POD (plain old data) kernel arguments (int32 and float32; other int/float types are enabled only if indicated by device features; structs with these types)

  • local memory, both as static (in-kernel) and as kernel argument

  • constant memory, both at module-scope and as kernel argument

  • most 1.2 API calls

  • CL_MEM_USE_HOST_PTR with clCreateBuffer(), if the device supports VK_EXT_external_memory_host

  • global offsets to clEnqueueNDRangeKernel

Doesnt work / missing

  • image / sampler support

  • clLinkProgram & clCompileProgram

  • clCreateBuffer(): CL_MEM_USE_HOST_PTR on dGPUs doesn’t work

  • clCreateBuffer(): the CL_MEM_ALLOC_HOST_PTR flag is ignored

Unfinished / non-optimal

  • missing sub-allocator for small allocations

  • statically sized structs that create certain limits

  • descriptor set should be cached (setup once per kernel, then just update)

  • command buffers should be cached

  • kernel library - check what clspv is missing

  • push constants for POD arguments instead of POD UBO

  • stop using deprecated clspv-reflection, instead extract the kernel metadata from the SPIR-V file itself

Known Bugs

Validation layers can print this message:

“After specialization was applied, VkShaderModule 0xXY0000XY[] does not contain valid spirv for stage VK_SHADER_STAGE_COMPUTE_BIT. The Vulkan spec states: module must be a valid VkShaderModule handle (https://www.khronos.org/registry/vulkan/specs/1.1-extensions/html/vkspec.html#VUID-VkPipelineShaderStageCreateInfo-module-parameter)”

This is (AFAIK) caused by Clspv reflection metadata present in SPIR-V, and is harmless.

The pocl vulkan driver will wait indefinitely for a kernel to finish. However GPU drivers have their own “freeze detection” timeouts and could kill the kernel sooner. This would result in PoCL aborting with error -4 (device lost).

Clspv can compile a lot of code, but is still unfinished and has bugs, so pocl-vulkan may fail to compile OpenCL code.

Testing

The tests that should work with Vulkan driver can be run with tools/scripts/run_vulkan_tests.

This driver was tested with these devices:

  • Intel HD 530 integrated GPU

  • AMD Radeon Vega 8 iGPU

  • Nvidia Quadro P600 discrete GPU

  • Raspberry Pi 4 + Ubuntu 22.04