-
Notifications
You must be signed in to change notification settings - Fork 815
Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix potential out-of-bounds access in 3.x grouped gemm kernel
#1543
opened May 21, 2024 by
kongroo
Loading…
Allow scalar broadcasting in VisitorRowBroadcast and VisitorColBroadcast
feature request
New feature or request
#1539
opened May 16, 2024 by
tlrmchlsmth
Loading…
Fix template parameter
IterationsUnroll
type from int to bool
#1534
opened May 11, 2024 by
peakcrosser7
Loading…
Update half.h - typo at line 138(unnecessary space before '1')
#1527
opened May 8, 2024 by
sjbae1999
Loading…
add publication: ‘EVT: Accelerating Deep Learning Training with Epilo…
#1526
opened May 7, 2024 by
reed-lau
Loading…
feat: support kFactor 8 used in mma tensor op tile iterator
#1512
opened Apr 29, 2024 by
gavinchen430
Loading…
Fix device thread
gemm.h
constructor
inactive-30d
#1473
opened Apr 11, 2024 by
luliyucoordinate
Loading…
Add Faster Neighborhood Attention to PUBLICATIONS
inactive-30d
#1471
opened Apr 11, 2024 by
alihassanijr
Loading…
Add missing #include <memory> for definition of std::addressof.
inactive-30d
#1470
opened Apr 10, 2024 by
Gregory-Meyer
Loading…
Fix B operand variable name and comments
inactive-30d
#1458
opened Apr 6, 2024 by
andylolu2
Loading…
Refactor to use FastDivmod for predicated strided dgrad iterators.
inactive-30d
#1453
opened Apr 3, 2024 by
ZelboK
Loading…
add a new epilogue for the case that the output is not packed
inactive-30d
#1437
opened Mar 28, 2024 by
hwu36
Loading…
Allow setting a custom TmaDescriptor for TMAStore.
inactive-30d
#1428
opened Mar 26, 2024 by
ipiszy
Loading…
Add support for mixed 4-bit/8-bit data types GEMM
#1413
opened Mar 19, 2024 by
alexsamardzic
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.