
CUDA vs Naive Speedup? #1

Closed
glenn-jocher opened this issue Mar 11, 2021 · 1 comment

glenn-jocher commented Mar 11, 2021

@d-li14 hi, thanks for your contributions and for this amazing idea!

I'd like to try your involution() module in a non-mmdetection repo (YOLOv5), and I'm trying to figure out the best technical way to do this using your existing code.

The naive implementation seems easier to integrate into new projects, so I'd like to use that. My main question is:
how much of a speed change do you see in training (and inference) when moving from the naive implementation to the CUDA one? Thanks!

d-li14 (Owner) commented Mar 12, 2021

Thanks for your feedback!
We have not tried involution with the YOLO framework, and the practical speedup may depend on the specific platform and test settings. For reference, we evaluate another one-stage detector, RetinaNet, in our work: the inference speedup on a single NVIDIA V100 GPU is roughly 40%.
Another major drawback of the naive implementation is that it consumes a lot of GPU memory, due to the expensive unfold operation.
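To make the memory cost concrete, here is a minimal sketch of what a naive, unfold-based involution layer looks like in PyTorch. This is an illustrative reimplementation, not the repo's actual code: the class name, hyperparameter defaults, and stride-1 assumption are mine. The `unfold` call materializes a `(B, C*K*K, H*W)` tensor, i.e. K*K copies of the feature map, which is exactly where the extra GPU memory goes; the custom CUDA kernel avoids materializing it.

```python
import torch
import torch.nn as nn

class NaiveInvolution(nn.Module):
    """Illustrative unfold-based involution (stride 1, odd kernel size)."""
    def __init__(self, channels, kernel_size=7, groups=16, reduction=4):
        super().__init__()
        self.k = kernel_size
        self.g = groups
        # kernel-generating branch: two 1x1 convs produce a K*K kernel
        # per spatial position, shared within each channel group
        self.reduce = nn.Conv2d(channels, channels // reduction, 1)
        self.span = nn.Conv2d(channels // reduction,
                              kernel_size * kernel_size * groups, 1)
        self.unfold = nn.Unfold(kernel_size, padding=kernel_size // 2)

    def forward(self, x):
        b, c, h, w = x.shape
        # per-pixel kernels: (B, G, 1, K*K, H, W)
        weight = self.span(self.reduce(x)) \
            .view(b, self.g, 1, self.k * self.k, h, w)
        # unfolded patches: (B, G, C//G, K*K, H, W) -- the memory-heavy step,
        # since every pixel's K*K neighborhood is materialized explicitly
        patches = self.unfold(x).view(b, self.g, c // self.g,
                                      self.k * self.k, h, w)
        # multiply-accumulate over the kernel dimension
        return (weight * patches).sum(dim=3).view(b, c, h, w)
```

For a K=7 kernel this intermediate tensor is 49x the size of the input feature map, which is why the unfold-free CUDA implementation matters at detector-scale resolutions.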
