Top Desktop & Mobile GPUs Ranked by TF32 FLOPS of Matrix(Tensor) cores

This is NOT a ranking of gaming performance, it's a ranking for AI workloads.
This is NOT a ranking of processors—many of them may be missing since their specifications aren’t available. If you know them, feel free to comment.
This is NOT a ranking of actual task performances, as they may vary due to many factors like thermal throttling, driver support, and framework compatibility...

104.75 T

1.79 TB/s

82.58 T

1.01 TB/s

56.28 T

960 GB/s

52.22 T

736 GB/s

48.74 T

716 GB/s

44.10 T

672 GB/s

43.94 T

896 GB/s

40.09 T

504 GB/s

40.00 T

1.01 TB/s

35.58 T

936 GB/s

35.48 T

504 GB/s

34.10 T

912 GB/s

32.98 T

576 GB/s

31.80 T

896 GB/s

30.87 T

672 GB/s

29.77 T

760 GB/s

29.68 T

768 GB/s

29.15 T

504 GB/s

24.72 T

432 GB/s

24.58 T

896 GB/s

23.22 T

512 GB/s

22.06 T

288 GB/s

21.75 T

608 GB/s

20.31 T

448 GB/s

18.98 T

448 GB/s

18.71 T

512 GB/s

18.06 T

512 GB/s

16.60 T

448 GB/s

16.20 T

448 GB/s

15.97 T

448 GB/s

15.62 T

256 GB/s

15.11 T

272 GB/s

12.74 T

360 GB/s

11.61 T

256 GB/s

9.10 T

224 GB/s

8.99 T

192 GB/s

5.50 T

192 GB/s

5.10 T

112 GB/s

4.73 T

96 GB/s