Clangd for CUDA Device Code
Compile CUDA for AMD GPUs with zero code changes—breaks NVIDIA lock-in.

Finally replaces Nvidia's ancient Excel spreadsheet with something that supports Hopper.
CUDA developers and GPU performance engineers
NVIDIA CUDA Occupancy Spreadsheet · Nsight Compute
It's pure client-side JS — no backend, no tracking, no signup. The math lives in formulas.js so you can sanity-check it against the CUDA Programming Guide. Architectures and their per-SM limits are in presets.js.
Things I'd love feedback on:
* Are the per-arch limits correct for [arch you're least sure about]?
* The "compare two configs side-by-side" view - is the diff readable?
* Anything you wish the Excel version did that this doesn't?
Link: https://toolbelt.widgita.xyz/cuda-occupancy-calculator/
(There are a few other small tools at the root, but this one is the reason for the post.)
Compile CUDA for AMD GPUs with zero code changes—breaks NVIDIA lock-in.
Runs 4B-parameter image-to-3D on Mac without CUDA—Microsoft's original required NVIDIA only.
Production-ready CUDA profiling when NSight only works in development.
NVIDIA Broadcast for Linux, but requires Enterprise SDK access and heavy container setup.
Formula recalculation survives serialization round-trips for agent workflows.
Git-diffable spreadsheets beat XLSX binary hell for version control.