New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Question]Worse performance on matrix dot product than numpy using cpu backend #3042
Comments
Can you try the techniques listed here: |
@umar456 Thanks for the link. I was finding infos about arrayfire's CPU backend, but could find it. |
ArrayFire uses MKL as the default BLAS library. I am not sure what library is being used by numpy for this task. Because you are using an AMD CPU for this test, you could be running into issues that could be alleviated by the steps listed in that blog post. ArrayFire can also be built using other BLAS libraries but this is not offered through the official installers. |
@umar456 Nice, after adding
And the
|
@HO-COOH perhaps there are some openCL drivers for AMD CPU : if so, could be worth to try afopencl ? |
@WilliamTambellini AMD has dropped OpenCL for their CPU. You can see the discussion here |
@HO-COOH You can use the intel runtime on AMD CPUs. |
@umar456 Tried, but no luck. My CPU is not appeared under |
Using almost the same code as blas.cpp, but slightly modified to test the dot product on 2 randomly initialized matrix like this:
I am getting worse performance than the numpy equivalent code
From arrayfire, I got
From numpy I got
The text was updated successfully, but these errors were encountered: