My main interest lies in machine learning systems — especially building and understanding the training and inference pipelines end-to-end.
Right now, I'm focused on:
- Writing efficient CUDA kernels
- Understanding GPU-level optimizations
- Exploring system-level design for ML training and inference
I'm currently ranked top #50 on the global leaderboard at leetgpu.com, a platform for solving low-level GPU programming challenges.