About
I'm Anshuman Agrawal, a systems and AI infrastructure engineer. I spend my time deep in C++ parallel runtimes, CUDA, and the layers between user code and silicon.
What I work on
- HPX: a C++ standard library for parallelism and concurrency. I contribute to its collective communication primitives as part of Google Summer of Code (GSoC).
- CUDA / GPU programming: writing kernels, understanding memory hierarchies, fighting occupancy.
- Distributed systems: how compute and data move across a cluster, and why that's hard.
What this blog is
Long-form deep dives. 30 to 60 minutes of reading per post. Code, diagrams, math. No newsletters, no popups, no fluff. Just notes from someone trying to understand things well enough to write them down.