English Knowledge Base

This section is a code-accurate reference for the current repository state: public APIs, runtime contracts, performance tooling, and kernel internals.

Reading paths

First visit

Read Installation and Quick Start.

API integration

Read Core Kernels and Integration.

Performance work

Read Benchmark, Auto-Tuning, and Performance.

Source dive

Read Architecture and Kernel Design.

Boundary reminder

  • Triton kernel execution requires CUDA.
  • CPU-only environments remain useful for import checks, linting, typing, build validation, and CPU-safe tests.
  • The site intentionally keeps only technical knowledge pages; repository process history and changelog content are not part of the published docs.

Table of contents