
Severe lack of information on how to build or use ZenDNN #12

@teodorkostov

Overview

Unfortunately, it seems that this project is intended for internal AMD use. To use it, I would apparently have to go to the desk of one of the maintainers and have a conversation about how to work with the framework. The build procedure is only conceptually documented: following it is not sufficient to compile the project. Many of the framework's options are not documented at all. There are no links from this repo to the official AMD project pages (https://www.amd.com/en/developer/zendnn.html and https://docs.amd.com/r/en-US/57300-ZenDNN-user-guide), not that those pages provide much valuable information either.

Build

Here is my experience with this framework:

  1. Opened the main README.md and followed the build instructions for v5.0.
  2. Got the AOCL-BLAS library and read its build instructions.

$ ./configure auto
$ make
$ export ZENDNN_BLIS_PATH=$(pwd)

  3. Got the Composable Kernel. A massive amount of warnings and improperly documented dependencies; however, I managed to build it.
  4. Got FBGEMM and built the appropriate version. Managed to build it with some help.
  5. With the paths configured, on to the main build.

export ZENDNN_BLIS_PATH=/build/blis
export DEPEND_ON_CK=1
export FBGEMM_ENABLE=1
export FBGEMM_PATH=/build/fbgemm

  6. It does not compile. Missing bfloat16, missing blis.h, missing cblas.h, missing functions like *_gelu_*, and so on.
  7. I am using a Zen 2 EPYC, so my configuration was auto-detected correctly; however, an additional zen2 path was added. I figured out that the blis paths had to be modified, and that ZENDNN_STANDALONE_BUILD had to be used.
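The exports in steps 2 and 5 are easy to get subtly wrong, and a stale path fails late and confusingly. A small helper that fails fast on a missing dependency checkout would have saved me time. This is just a sketch: the environment variable names come from the steps above, while the function name is my own.

```shell
# Sketch: set the ZenDNN v5.0 build environment, failing fast on bad paths.
# The variable names are from the build steps above; set_zendnn_env is mine.
set_zendnn_env() {
    local blis_dir="$1" fbgemm_dir="$2"
    [ -d "$blis_dir" ]   || { echo "no such directory: $blis_dir" >&2; return 1; }
    [ -d "$fbgemm_dir" ] || { echo "no such directory: $fbgemm_dir" >&2; return 1; }
    export ZENDNN_BLIS_PATH="$blis_dir"
    export FBGEMM_PATH="$fbgemm_dir"
    export DEPEND_ON_CK=1
    export FBGEMM_ENABLE=1
}
```

Usage would be something like `set_zendnn_env /build/blis /build/fbgemm && make`.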

The screenshot below is all the information there is on ZENDNN_STANDALONE_BUILD. By the way, that LP64 is a complete mystery. Never mind, let's go on.
[screenshot: ZENDNN_STANDALONE_BUILD documentation]
  8. Thought the missing cblas.h could be another dependency that is not properly described here. Plausible, judging from the other AMD projects.
  9. Could not get rid of the other errors. There were so many.
  10. Decided to ditch v5.0, go for v4.2, and use only the mandatory blis dependency.
  11. Followed the instructions again. The User Guide PDF is useless.
  12. ZenDNN does not compile.
  13. Disabled most functionality.
  14. Luckily the errors were now more concentrated, and I went searching for one of the undefined functions: aocl_gelu_tanh_f32.
  15. Found it in a PDF (of course).
  16. The PDF describes how to use an addon for blis. Great!
  17. Finally, I figured out that I was building the blis dependency the wrong way. Here is the correct configuration:

$ ./configure --prefix=/build/blis-dist -a aocl_gemm --enable-cblas --enable-threading=openmp auto
$ make
$ make install
$ cp /build/blis-dist/include/blis/* /build/blis-dist/include
  18. The install step is very important: it properly places all artifacts in the target directory.
  19. There is no mention that this project requires blis to be built with the --enable-cblas flag.
  20. There is no mention that the aocl_gemm addon must be enabled. That is almost impossible to figure out from the blis documentation as well: configure only explains how to enable addons, with no information on what the addons are or when to use them. Just folders with a bunch of code.
  -a NAME --enable-addon=NAME

                Enable the code provided by an addon. An addon consists
                of a separate directory of code that provides additional
                APIs, implementations, and/or operations that would
                otherwise not be present within a build of BLIS. This
                option may be used multiple times to specify the inclusion
                of multiple addons. By default, no addons are enabled.
  21. No mention here that threading has to be enabled in the blis library either.
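Since most of the errors above trace back to a BLIS tree built without --enable-cblas or without a proper `make install`, a pre-flight check before touching ZenDNN would have saved me days. A minimal sketch, assuming the /build/blis-dist layout from step 17 above; the function name and the exact checks are my own:

```shell
# Sketch: sanity-check a BLIS install prefix before building ZenDNN against it.
# check_blis_dist and the specific checks are mine, based on the layout above.
check_blis_dist() {
    local prefix="$1" status=0
    # cblas.h only exists when BLIS was configured with --enable-cblas
    [ -f "$prefix/include/cblas.h" ] || { echo "missing cblas.h: rebuild BLIS with --enable-cblas"; status=1; }
    # blis.h must sit directly under include/ (hence the cp from include/blis/)
    [ -f "$prefix/include/blis.h" ]  || { echo "missing blis.h: run 'make install' and copy include/blis/* up"; status=1; }
    # the library itself lands in lib/ only after 'make install'
    ls "$prefix"/lib/libblis.* >/dev/null 2>&1 || { echo "missing libblis: run 'make install'"; status=1; }
    return "$status"
}
```

Something like `check_blis_dist /build/blis-dist || exit 1` before exporting ZENDNN_BLIS_PATH would turn a cascade of cryptic compile errors into one actionable message.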

Usage

So, we have libamdZenDNN.so; what now? Well, not much. The integration seems quite poor. Again, following the User Guide, it seems that a wrapper has to be written for every model:

import torch
import zentorch  # importing the plugin registers the 'zentorch' backend

model = torch.compile(model, backend='zentorch')
with torch.no_grad():
    output = model(input)

Great! But that makes this entire effort practically useless. Maybe there would be value if I were creating a model from scratch.

Even the ONNX instructions are quite conceptual. The user guide mentions a binary release ONNXRT_v1.17.0_ZenDNN_v4.2_Python_v3.8.zip, but it is a mystery where this is to be found. At this point I am giving up.

Target result

There should be an easy-to-follow build guide. ALL tuning parameters and addons should be properly described, both here and in blis. All related pages should be interlinked.

A Dockerfile would be highly welcome: it would document exactly how this framework is built in a clean environment.
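For reference, here is the kind of Dockerfile I have in mind, reconstructed from the steps that finally worked for me above. Treat it as a sketch only: the base image, repository URLs, branch choices, and the final make invocation are my assumptions and would need to be pinned and verified by the maintainers.

```dockerfile
# Sketch only: reconstructed from the build steps in this issue.
# Base image, clone URLs, and the ZenDNN make line are assumptions.
FROM ubuntu:22.04

RUN apt-get update && apt-get install -y --no-install-recommends \
        build-essential git python3 ca-certificates && \
    rm -rf /var/lib/apt/lists/*

WORKDIR /build

# BLIS with CBLAS, the aocl_gemm addon, and OpenMP threading (see step 17)
RUN git clone https://github.com/amd/blis.git && cd blis && \
    ./configure --prefix=/build/blis-dist -a aocl_gemm --enable-cblas \
        --enable-threading=openmp auto && \
    make && make install && \
    cp /build/blis-dist/include/blis/* /build/blis-dist/include

# ZenDNN itself, built against the BLIS tree prepared above
RUN git clone https://github.com/amd/ZenDNN.git && cd ZenDNN && \
    ZENDNN_BLIS_PATH=/build/blis-dist ZENDNN_STANDALONE_BUILD=1 make
```

Even if the maintainers never publish an image, keeping a file like this in the repo would serve as executable, always-up-to-date build documentation.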

It should be easier to enable ZenDNN optimizations for models without writing Python wrappers. ONNX is a move in the right direction; however, frameworks like vLLM are also getting a lot of traction.

Conclusion

I hope that this gives you a glimpse into the high complexity of your code base and the poor state of its documentation. Because of that, I have encountered opinions that AMD frameworks (like ROCm and ZenDNN) "do not compile" and that users should "just use NVIDIA". I hope that in the future you will make it easier for new users to get started with your projects, and improve the integration with AI frameworks and tools.
