Embed this proof
Contributed CUDA kernel optimization reducing inference latency by 34%
<iframe src="https://powstik.com/embed/proof/008fdfde-6c93-4e8a-b0d0-d2a6b9c1a170" width="100%" height="220" frameborder="0" style="border-radius:12px;overflow:hidden"></iframe>
Add this to your personal site, Notion, or portfolio →
← Back to proof