mega-kernel

Here is 1 public repository matching this topic...

manishklach / gpu-resident-inference-lab

Research lab for GPU-resident LLM inference loops: persistent kernels, sparse KV selection, tiered residency, speculative decode, and trace-driven scheduling.

runtime cuda kv-cache gpu-systems llm-inference speculative-decoding model-systems persistent-kernel mega-kernel

Updated Jun 13, 2026
Python

Improve this page

Add a description, image, and links to the mega-kernel topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mega-kernel topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mega-kernel

Here is 1 public repository matching this topic...

manishklach / gpu-resident-inference-lab

Improve this page

Add this topic to your repo