-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Grouped Quantized Transposed Conv is rejected by trtexec. #1934
Comments
I can reproduce this on 8.4. @ttyio do we support group convtranpose in QAT? |
@tzskp1 , currently we only enabled loop based convtranspose. The limitation is that for the INT8 IO, we have to fit the requirement that channel at least 4 times groups number, like in your case, you have to modify to:
I will create RFE to enable non loop based convtranspose feature. Thanks! |
Thank you for your response. |
Hello @tzskp1 , sorry for the delay response, it is partially because of the loop base implementation, But if you try 1 / 32, you can get more speed up from tensorcore. thanks! |
Closing due to >14 days without activity. Please feel free to reopen if the issue still exists. Thanks |
I've tested with TRT 10.9 and the issue is resolved. I'm not sure what is the earliest version when is was implemented.
|
Description
I tried quantization via pytorch-quantization library.
Then I met the error while atempting to convert grouped quantized transposed convolution layer.
Thank you.
Environment
TensorRT Version: 8003
NVIDIA GPU: GTX 1650
NVIDIA Driver Version: NVIDIA UNIX x86_64 Kernel Module 510.54
CUDA Version: V11.5.50
CUDNN Version:
Operating System: Ubuntu 20.04.3
Python Version (if applicable): 3.8.12
Tensorflow Version (if applicable):
PyTorch Version (if applicable): '1.11.0a0+b6df043'
Baremetal or Container (if so, version): nvcr.io/nvidia/pytorch:21.11-py3
Relevant Files
Steps To Reproduce
trtexec --onnx=./bug_on.onnx --int8
The text was updated successfully, but these errors were encountered: