Hi there, I'm Darshan

A curious mind exploring the intersection of
Computation and Machine Intelligence.

I am a student, a thinker, and an engineer. I craft elegant systems and learn new things (most of the time).

Who I Am

I am a student of everything around me. My roots are humble. I come from a Tier 3 city in Gujarat and a college far from the spotlight, where the path wasn’t already paved for me. I realized early on that knowledge would not be handed to me; I would have to take it. Everything I know—from the fundamentals of systems to the complexities of AI—is the result of relentless self-learning.

My strength is not loud; it lives in my curiosity, in my willingness to keep going when things get complex. I believe that the best work comes from a place of genuine curiosity. While I have a deep background in technical problem-solving, my true passion lies in simplifying complexity.

Currently, I am refining this craft as a Master’s student in Data Science and a Machine Learning Engineer in training. Because I had to teach myself the foundations, I don't just use tools—I deconstruct them. Whether I am optimizing algorithms or architecting systems, I am driven by a single goal: to understand the "why" behind the "how."

I am a student: of code, of math, of systems, of life. Work occupies most of my time, not out of pressure but out of passion. And when I’m not working, I consume ideas: podcasts, philosophy, blogs. I keep learning because it keeps me alive.

I take inspiration from legendary figures like Elon Musk, Srinivasa Ramanujan, Isaac Newton, Albert Einstein, Vikram Sarabhai, and many more :)

Latest Thoughts

I write to clear my mind and share what I learn.

CUDA

The Global GEMM — Putting It All Together

Writing a complete three-level tiled GEMM kernel from scratch using CuTe's TiledCopy, TiledMMA, and swizzled shared memory.

CUDA

Hello, MMA — Your First Tensor Core Instruction

How to use CuTe's TiledMMA to execute a matrix multiply-accumulate on NVIDIA Tensor Cores.

CUDA

Swizzling — Avoiding Shared Memory Bank Conflicts

How CuTe's Swizzle XORs address bits to eliminate shared memory bank conflicts with a single line of code.

CUDA

The Parallel Copy — Orchestrating Threads with TiledCopy

How TiledCopy bundles thread layout, copy atoms, and value layout into one declarative object for coordinated, vectorized parallel copies.

CUDA

The Naive Copy — Scalar vs. Vectorized Memory Movement

Why scalar copies leave 75% of memory bandwidth on the table, and how CuTe's auto-vectorization fixes it.

CUDA

The Art of Slicing — Partitioning Data Across Blocks and Threads

How CuTe's local_tile and local_partition replace manual index math to slice matrices across CTAs and threads.

CUDA

Hello, Layout! — Visualizing Memory in CuTe

Understanding CuTe Layouts: how shape and stride turn flat memory into multidimensional grids.

CUDA

Beating PyTorch: Writing a Faster Softmax Kernel in CUDA

How to write a Softmax kernel in CUDA that outperforms PyTorch's built-in implementation.

Machine Learning

Stable Diffusion 1.5: How I Optimized It

A detailed worklog on optimizing Stable Diffusion 1.5 for performance.

Logic

Propositional Logic

A deep dive into the fundamental building blocks of mathematical logic.

Machine Learning

Raw Dawgging Linear Regression

Understanding Linear Regression by building it from the ground up.