-
Notifications
You must be signed in to change notification settings - Fork 551
Pull requests: tile-ai/tilelang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[feature][Blackwell] Add SM120 FP4 and A8W4 GEMM support
#2171
opened May 8, 2026 by
TerminusAkivili
Contributor
Loading…
[Example] Add CLC-pipelined 2-CTA GEMM example for sm100
#2169
opened May 8, 2026 by
ighoshsubho
Loading…
[BugFix] Consider non-local store in external call and SIMT producer for warp specialize
#2166
opened May 7, 2026 by
Rachmanino
Collaborator
Loading…
[Refactor] Refactor multiple TensorCoreIntrinEmitter to provide atom-level mma control interface
#2161
opened May 7, 2026 by
Rachmanino
Collaborator
Loading…
5 tasks done
[Autotune] Add pipeline, grouped compilation, and multi-GPU benchmark support
#2159
opened May 7, 2026 by
Wazrrr
Loading…
Fix atomic_load access_ptr lowering for dynamic indices
#2157
opened May 6, 2026 by
VitalyAnkh
Contributor
Loading…
[Metal] FP8 vector cast lanes 2/3/4 (extends storage-only FP8)
#2145
opened May 4, 2026 by
apstenku123
Loading…
6 tasks done
[Metal] FP8 storage-only emulation (uchar storage + LUT decode helpers)
#2144
opened May 4, 2026 by
apstenku123
Loading…
5 of 6 tasks
[Metal] emit Metal builtins directly instead of CUDA-style threadIdx/blockIdx aliases
#2143
opened May 4, 2026 by
apstenku123
Loading…
3 of 4 tasks
tilelang: T.fp8_scaled_matmul DSL intrinsic + Metal lowering
#2142
opened May 4, 2026 by
apstenku123
Loading…
[Metal] thread stage dim through T.access_ptr for T.Pipelined num_stages>1
#2141
opened May 4, 2026 by
apstenku123
Loading…
[Metal] allow mixed-dtype T.gemm via scalar fallback
#2139
opened May 4, 2026 by
apstenku123
Loading…
[Fix] Allow autotuning kernels with scalar value parameters
#2136
opened May 3, 2026 by
yurekami
Contributor
Loading…
2 of 3 tasks
[AMD][CDNA4] Add MXFP4 (FP4 E2M1) support for gfx950
#2132
opened Apr 30, 2026 by
zhangnju
Collaborator
Loading…
Rebase Metal simdgroup GEMM support and runtime coverage
#2130
opened Apr 30, 2026 by
jorgecurious
Loading…
Add TMA tile::gather4 / tile::scatter4 support
#2129
opened Apr 30, 2026 by
ighoshsubho
Loading…
3 of 4 tasks
[Feature] Support mutable TMA descriptor and canonicalize usage in examples
#2113
opened Apr 28, 2026 by
Rachmanino
Collaborator
•
Draft
feat: auto-vectorize bf16/fp16 reduce with packed add2 intrinsics
#2112
opened Apr 28, 2026 by
kurisu6912
Collaborator
Loading…
[Example] Optimize topk selector for
B=1 and large S cases
#2108
opened Apr 27, 2026 by
Rachmanino
Collaborator
•
Draft
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.