Optimization
All Posts
- llm (3)
- cuda (2)
- writing (2)
- tensors (2)
- pytorch (2)
- linear-algebra (2)
- machine-learning (2)
- metal (1)
- gpu (1)
- parallel-programming (1)
- nvcc (1)
- colab (1)
- pretraining (1)
- midtraining (1)
- posttraining (1)
- data-quality (1)
- synthetic-data (1)
- dpo (1)
- energy-based-model (1)
- likelihood (1)
- score-matching (1)
- diffusion (1)
- personal-growth (1)
- webdev (1)
- side-project (1)
- attention (1)
- transformers (1)
- deepseek (1)
- mla (1)
- kv-cache (1)
- inference (1)