Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

0.5.2 DML 2x to 4x Slower than 0.4.0 (Big regression) #1114

Open
elephantpanda opened this issue Dec 3, 2024 · 1 comment
Open

0.5.2 DML 2x to 4x Slower than 0.4.0 (Big regression) #1114

elephantpanda opened this issue Dec 3, 2024 · 1 comment

Comments

@elephantpanda
Copy link

elephantpanda commented Dec 3, 2024

genai 0.5.2 appears to be between 2x and 4x slower than genai 0.4.0, in fact it is only about 50% faster than CPU mode.

If have just tested both 0.4.0 and 0.5.2 and it definitely is a vast difference in DML mode.

GPU Quadro P5000.

model: microsoft/Phi-3-mini-4k-instruct-onnx
c#
BTW, I checked it is not the new DirectML.dll library causing this.

@elephantpanda elephantpanda changed the title 0.5.2 0.5.2 Slower? Dec 3, 2024
@elephantpanda elephantpanda changed the title 0.5.2 Slower? 0.5.2 DML Slower than 0.4.0? Dec 3, 2024
@elephantpanda elephantpanda changed the title 0.5.2 DML Slower than 0.4.0? 0.5.2 DML 2x to 4x Slower than 0.4.0 (Big regression) Dec 3, 2024
@hanbitmyths
Copy link

@elephantpanda, can you share genai_config.json and benchmark steps to repro?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants