Back to browse
GitHub Repository

Profine automatically profiles and optimizes PyTorch training jobs on real GPUs, delivering measurable speedups and lower GPU costs before teams waste days tuning configs by hand.

17 starsPython

Profine – Automated profiling and code rewrites for ML training loops

by aisinghal·May 13, 2026·2 points·0 comments

AI Analysis

●●SolidSolve My ProblemBig Brain

3.11x speedup on minGPT with automated LLM-suggested rewrites.

Strengths
  • Concrete 67.8% speedup and 68.7% memory reduction with verifiable benchmarks
  • Real GPU profiling on Modal backend, not simulated or estimated
  • Works with local LLMs (Ollama, vLLM) avoiding vendor lock-in
Weaknesses
  • Requires Modal account plus LLM API keys before you can test it
  • AI code optimization space getting crowded with similar tools emerging
Category
Target Audience

ML engineers and researchers training PyTorch models

Similar To

PyTorch Profiler · NVIDIA Nsight Systems · CodeTuning

Similar Projects

AI/ML●●Solid

Profine – optimize your PyTorch training script before the run

Automated PyTorch optimizer delivering 3x speedups before you waste cloud credits.

Solve My ProblemBig Brain
aisinghal
301mo ago