Top GPUs, CPUs, and AI Accelerators Ranked by TF32 FLOPS of Matrix (Tensor) Cores

This is NOT a ranking of gaming performance; it's a ranking for AI workloads.
This is NOT a complete ranking of processors: many may be missing because their specifications aren't available. If you know them, feel free to comment.
This is NOT a ranking of actual task performance, which may vary with factors such as thermal throttling, driver support, and framework compatibility.
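As a rough illustration of how a ranking like this one can be derived from spec strings, here is a minimal Python sketch. It assumes that "T" stands for 10^12 floating-point operations per second and that "GB/s" and "TB/s" are decimal units; the helper `parse_spec` and the `UNIT_SCALE` table are hypothetical names, and the four entries are copied from the list below. The FLOP-per-byte figure it prints is not part of the ranking. It is shown only as one reason peak numbers alone may not predict actual task performance.

```python
# Hypothetical helper names; spec strings are copied from a few entries in this list.
# Assumption: "T" means 1e12 FLOP/s, and GB/s and TB/s are decimal units.
UNIT_SCALE = {"T": 1e12, "TB/s": 1e12, "GB/s": 1e9}


def parse_spec(text):
    """Convert a spec string such as '494.70 T' or '820 GB/s' to base units."""
    value, unit = text.split()
    return float(value) * UNIT_SCALE[unit]


# (name, peak TF32 throughput, memory bandwidth), deliberately unsorted
entries = [
    ("NVIDIA A100 SXM", "155.93 T", "2.04 TB/s"),
    ("AWS Trainium1", "190 T", "820 GB/s"),
    ("NVIDIA B100", "1800 T", "8 TB/s"),
    ("NVIDIA H100 SXM", "494.70 T", "3.36 TB/s"),
]

# Rank by peak TF32 throughput, highest first
ranked = sorted(entries, key=lambda e: parse_spec(e[1]), reverse=True)

for rank, (name, flops, bandwidth) in enumerate(ranked, start=1):
    # Peak operations available per byte of memory traffic (illustrative only)
    ratio = parse_spec(flops) / parse_spec(bandwidth)
    print(f"#{rank} {name}: {flops}, {bandwidth}, about {ratio:.0f} FLOP per byte")
```

A higher FLOP-per-byte ratio suggests a workload needs more arithmetic per byte moved before the compute units, rather than memory, become the limit.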

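| Rank | Type | Memory | Processor | TF32 FLOPS | Memory bandwidth |
|------|------|--------|-----------|------------|------------------|
| #1 | GPU | 192 GB | NVIDIA B100 | 1800 T | 8 TB/s |
| | | | | 653.70 T | 6 TB/s |
| | | | | 653.70 T | 5.30 TB/s |
| #4 | GPU | 80 GB | NVIDIA H100 SXM | 494.70 T | 3.36 TB/s |
| #5 | GPU | 141 GB | NVIDIA H200 SXM | 494 T | 4.80 TB/s |
| | | | | 490.30 T | 5.30 TB/s |
| #7 | GPU | 141 GB | NVIDIA H200 NVL | 482.55 T | 4.80 TB/s |
| | | | | 459 T | 3.70 TB/s |
| #9 | AIU | 96 GB | AWS Trainium2 | 431 T | 4 TB/s |
| #10 | AIU | 32 GB | AWS Trainium1 | 190 T | 820 GB/s |
| #11 | GPU | 80 GB | NVIDIA A100 PCIe | 155.93 T | 1.94 TB/s |
| #12 | GPU | 80 GB | NVIDIA A100 SXM | 155.93 T | 2.04 TB/s |
| | | | | 104.75 T | 1.79 TB/s |
| #14 | GPU | 48 GB | NVIDIA L40 | 90.52 T | 864 GB/s |
| | | | | 82.58 T | 1.01 TB/s |
| #16 | GPU | 48 GB | NVIDIA RTX A6000 | 77.41 T | 768 GB/s |
| #17 | GPU | 48 GB | NVIDIA A40 PCIe | 74.83 T | 695 GB/s |
| | | | | 56.28 T | 960 GB/s |
| | | | | 52.22 T | 736 GB/s |
| | | | | 48.74 T | 716 GB/s |
| | | | | 44.10 T | 672 GB/s |
| | | | | 43.94 T | 896 GB/s |
| | | | | 40.09 T | 504 GB/s |
| | | | | 40.00 T | 1.01 TB/s |
| | | | | 35.58 T | 936 GB/s |
| | | | | 35.48 T | 504 GB/s |
| | | | | 34.10 T | 912 GB/s |
| | | | | 32.98 T | 576 GB/s |
| | | | | 31.80 T | 896 GB/s |
| | | | | 30.87 T | 672 GB/s |
| | | | | 29.77 T | 760 GB/s |
| | | | | 29.68 T | 768 GB/s |
| | | | | 29.15 T | 504 GB/s |
| | | | | 24.72 T | 432 GB/s |
| | | | | 24.58 T | 896 GB/s |
| | | | | 23.22 T | 512 GB/s |
| | | | | 22.06 T | 288 GB/s |
| | | | | 21.75 T | 608 GB/s |
| | | | | 20.31 T | 448 GB/s |
| | | | | 18.98 T | 448 GB/s |
| | | | | 18.71 T | 512 GB/s |
| | | | | 18.06 T | 512 GB/s |
| | | | | 16.60 T | 448 GB/s |
| | | | | 16.20 T | 448 GB/s |
| | | | | 15.97 T | 448 GB/s |
| | | | | 15.62 T | 256 GB/s |
| | | | | 15.11 T | 272 GB/s |
| | | | | 12.74 T | 360 GB/s |
| | | | | 11.61 T | 256 GB/s |
| | | | | 9.10 T | 224 GB/s |
| | | | | 8.99 T | 192 GB/s |
| | | | | 7.60 T | 192 GB/s |
| | | | | 5.50 T | 192 GB/s |
| | | | | 5.10 T | 112 GB/s |
| | | | | 4.73 T | 96 GB/s |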