3. Scaling Networks

Age of scaling (2020-2025)

Contents

3.1 Scaling Large Language Models

3.2 Accelerating nanoGPT with FlashAttention

3.2 Programming a Compiled and Distributed Tensor

3.2.1 OpCode, OpNode Intermediate Graph Representation

3.2.2 ExecItem Kernelizer/Fuser, Scheduler

3.2.3 Runtime, Allocator, Heterogeneous Runtime