Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
FlashAttention-4 (colfax-intl.com)
2 points by maralom 33 days ago | past
Categorical Foundations for CuTe Layouts (colfax-intl.com)
1 point by matt_d 6 months ago | past
Categorical Foundations for Cute Layouts (colfax-intl.com)
39 points by charles_irl 6 months ago | past | 6 comments
Cutlass Tutorial: Sub-Byte GEMM on Nvidia Blackwell GPUs (colfax-intl.com)
2 points by jxmorris12 10 months ago | past
GEMM with Thread Block Clusters on Nvidia Blackwell GPUs (colfax-intl.com)
2 points by ashvardanian 11 months ago | past
Cutlass Tutorial: Writing GEMM Kernels Using Tensor Memory for Blackwell GPUs (colfax-intl.com)
2 points by ashvardanian 11 months ago | past
DeepSeek-R1 and FP8 Mixed-Precision Training (colfax-intl.com)
2 points by skidrow 11 months ago | past
DeepSeek-R1 and FP8 Mixed-Precision Training (colfax-intl.com)
2 points by skidrow 11 months ago | past
DeepSeek-R1 and FP8 Mixed-Precision Training (colfax-intl.com)
1 point by skidrow on April 1, 2025 | past
Cutlass Tutorial: Fast Matrix-Multiplication with Wgmma on Nvidia Hopper GPUs (colfax-intl.com)
1 point by sebg on Sept 26, 2024 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: