Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query regarding BLAS implementation of GGML #1145

Open
vikasbalaga opened this issue Mar 13, 2025 · 1 comment
Open

Query regarding BLAS implementation of GGML #1145

vikasbalaga opened this issue Mar 13, 2025 · 1 comment

Comments

@vikasbalaga
Copy link

I am trying to implement an application with GGML library on x86_64 architecture. I have built the "default" variant which will utilize GNU libraries for low level math functions.

But when I wanted to use "BLAS" as a backend library, I can see in the source code that it supports only a few operations, even for a basic operation like "MUL", I am seeing the following error

ggml-blas.cpp:250: ggml_backend_blas_graph_compute: unsupported op MUL

@ggerganov, does it mean that I can't use BLAS as a backend library or can you please confirm if there is a way to overcome these issues?

@ggerganov
Copy link
Member

You need to use both the CPU and BLAS backends and a ggml_backend_sched to distribute the operations to the correct backends based on what they support. See the gpt-2, main-sched.cpp example and start from there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants