Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
skidrow | Dark Hacker News
user:
skidrow
created:
July 2, 2024
karma:
367
submissions
comments
1.
Occupancy Math on the AMD MI355X: A From-First-Principles Guide
(indianspeedster.github.io)
2 points
by
skidrow
2 hours ago
|
0 comments
2.
Computer Vision – Lecture 1.1 (Introduction: Organization) [video]
(youtube.com)
2 points
by
skidrow
1 day ago
|
0 comments
3.
FlashAttention-4: Algorithm and Kernel Pipelining Co-Design
(research.colfax-intl.com)
2 points
by
skidrow
2 days ago
|
0 comments
4.
Cutlass Tutorial: Efficient GEMM Kernel Designs with Pipelining
(research.colfax-intl.com)
2 points
by
skidrow
2 days ago
|
0 comments
5.
Toward Better Hip Kernel Generation for AMD GPUs
(scalingintelligence.stanford.edu)
2 points
by
skidrow
2 days ago
|
0 comments
6.
FP8 GEMM Optimization on AMD CDNA4 Architecture
(rocm.blogs.amd.com)
2 points
by
skidrow
2 days ago
|
0 comments
7.
Occupancy Math on the AMD MI355X: A From-First-Principles Guide
(indianspeedster.github.io)
2 points
by
skidrow
2 days ago
|
0 comments
8.
FP8 GEMM Optimization on AMD CDNA4 Architecture
(rocm.blogs.amd.com)
1 points
by
skidrow
3 days ago
|
0 comments
9.
Occupancy Math on the AMD MI355X
(indianspeedster.github.io)
1 points
by
skidrow
3 days ago
|
0 comments
10.
FP8 GEMM Optimization on AMD CDNA4 Architecture
(rocm.blogs.amd.com)
1 points
by
skidrow
3 days ago
|
0 comments
11.
Occupancy Math on the AMD MI355X: A From-First-Principles Guide
(indianspeedster.github.io)
1 points
by
skidrow
3 days ago
|
0 comments
12.
FP8 GEMM Optimization on AMD CDNA4 Architecture
(rocm.blogs.amd.com)
4 points
by
skidrow
4 days ago
|
0 comments
13.
Occupancy Math on the AMD MI355X: A From-First-Principles Guide
(indianspeedster.github.io)
17 points
by
skidrow
4 days ago
|
0 comments
14.
FP8 GEMM Optimization on AMD CDNA4 Architecture
(rocm.blogs.amd.com)
3 points
by
skidrow
4 days ago
|
0 comments
15.
Deep Dive into 4-Wave Interleave FP8 GEMM
(rocm.blogs.amd.com)
3 points
by
skidrow
4 days ago
|
0 comments
16.
Occupancy Math on the AMD MI355X: A From-First-Principles Guide
(indianspeedster.github.io)
3 points
by
skidrow
5 days ago
|
0 comments
17.
Creating custom kernels for the AMD MI300
(huggingface.co)
2 points
by
skidrow
259 days ago
|
0 comments
18.
Implementing a Fast Tensor Core Matmul on the Ada Architecture
(spatters.ca)
4 points
by
skidrow
259 days ago
|
0 comments
19.
Matrix Core Programming on AMD GPUs
(salykova.github.io)
116 points
by
skidrow
259 days ago
|
5 comments
20.
Implementing a Fast Tensor Core Matmul on the Ada Architecture
(spatters.ca)
3 points
by
skidrow
261 days ago
|
0 comments
21.
Matrix Core Programming on AMD GPUs
(salykova.github.io)
2 points
by
skidrow
261 days ago
|
0 comments
22.
Creating custom kernels for the AMD MI300
(huggingface.co)
1 points
by
skidrow
261 days ago
|
0 comments
23.
Implementing a Fast Tensor Core Matmul on the Ada Architecture
(spatters.ca)
2 points
by
skidrow
262 days ago
|
0 comments
24.
Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture
(salykova.github.io)
24 points
by
skidrow
262 days ago
|
3 comments
25.
Creating custom kernels for the AMD MI300
(huggingface.co)
2 points
by
skidrow
262 days ago
|
0 comments
26.
Implementing a Fast Tensor Core Matmul on the Ada Architecture
(spatters.ca)
2 points
by
skidrow
263 days ago
|
0 comments
27.
Advanced Matrix Multiplication Optimization on Multi-Core Processors (2024)
(salykova.github.io)
85 points
by
skidrow
263 days ago
|
3 comments
28.
Creating custom kernels for the AMD MI300
(huggingface.co)
2 points
by
skidrow
263 days ago
|
0 comments
29.
Introduction to Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture
(salykova.github.io)
2 points
by
skidrow
263 days ago
|
0 comments
30.
Creating custom kernels for the AMD MI300
(huggingface.co)
2 points
by
skidrow
331 days ago
|
0 comments