I'm Anshuman. I build GPU kernels, profile distributed systems, and care about how the metal actually works.
Currently: hierarchical collectives for HPX (GSoC 2026) · writing every CUDA kernel from scratch to understand what torch.compile does for free.
// CS undergrad · HPC · ML infrastructure · applying for MS Fall 2027