The model recipes show you how to train a model. The examples below show you how to build something useful with Miles — tools, search, multi-agent, distillation, and async rollout. Each example follows the same template:
  1. What you’ll learn — the takeaway in one sentence.
  2. Prerequisites — what you need installed/downloaded first.
  3. Files — what’s in the example directory.
  4. Quick start — single command to run.
  5. Walkthrough — annotated tour of the key code.
  6. What’s happening underneath — the moving parts you can’t see.
  7. Tuning knobs — the levers that matter.
  8. Troubleshooting — the failure modes we’ve actually hit.
  9. Variations — common adaptations.

The catalog

Fully Async Rollout

Continuous background generation with a queue between rollout and training. Up to 2× end-to-end speedup.
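As a rough illustration of the queue-between-rollout-and-training idea, here is a minimal sketch using only the Python standard library. It is not Miles code: `rollout_worker`, `train`, `run`, and the sample format are hypothetical stand-ins, and a real setup would replace the string samples with model generations and the batch list with optimizer steps.

```python
import queue
import threading


def rollout_worker(out_q, num_samples):
    """Hypothetical generator thread: keeps producing samples in the background."""
    for i in range(num_samples):
        sample = {"id": i, "text": f"sample-{i}"}  # stand-in for model generation
        out_q.put(sample)  # blocks when the queue is full, which gives backpressure
    out_q.put(None)  # sentinel: no more samples


def train(in_q, batch_size):
    """Hypothetical trainer: consumes samples as soon as they arrive."""
    batches, batch = [], []
    while True:
        sample = in_q.get()
        if sample is None:
            break
        batch.append(sample)
        if len(batch) == batch_size:
            batches.append(batch)  # stand-in for one training step
            batch = []
    if batch:
        batches.append(batch)  # final partial batch
    return batches


def run(num_samples=10, batch_size=4, max_queue=8):
    q = queue.Queue(maxsize=max_queue)  # bounded buffer between the two sides
    producer = threading.Thread(target=rollout_worker, args=(q, num_samples))
    producer.start()
    batches = train(q, batch_size)
    producer.join()
    return batches
```

The bounded queue is the design choice worth noting: generation never waits for a training step to finish unless the buffer is full, and the buffer size caps how far ahead of training the rollout side can get.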

Search-R1 (Tool Use)

Multi-turn rollout where the model can issue <search>...</search> actions, get observations from a retrieval server, and produce a final answer.
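The shape of such a multi-turn loop can be sketched as follows. This is an illustrative sketch, not the example's actual code: `multi_turn_rollout`, the `generate` and `retrieve` callables, the closing `</search>` tag, and the `<observation>` wrapper are all assumptions made for the illustration.

```python
import re

# Assumed action format: <search>query</search>
SEARCH_RE = re.compile(r"<search>(.*?)</search>", re.DOTALL)


def multi_turn_rollout(generate, retrieve, prompt, max_turns=4):
    """Alternate between model generation and retrieval until no search is issued.

    generate(context) -> str : hypothetical call into the policy model
    retrieve(query)   -> str : hypothetical call into the retrieval server
    """
    context = prompt
    for _ in range(max_turns):
        output = generate(context)
        context += output
        match = SEARCH_RE.search(output)
        if match is None:
            return context  # no search action: treat the output as the final answer
        query = match.group(1).strip()
        # Append the retrieved text so the next turn can condition on it.
        context += f"\n<observation>{retrieve(query)}</observation>\n"
    return context  # turn budget exhausted
```

With stub callables, a first turn that emits a search action is followed by an observation and then a second turn that answers; `max_turns` bounds how many retrieval round trips a single rollout can make.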

ReTool (Code Execution)

SFT + RL pipeline for tool-augmented reasoning. Sandboxed Python code execution interleaved with thinking.

Multi-Agent Co-Evolution

Two specialized agents (e.g. doctor + patient) train together and improve each other.

Reproducibility Recipe

Bit-stable training across reruns. Determinism flags, seeds, and what to watch.

SFT on OpenHermes

Plain SFT (no RL) — sometimes you just need a quick fine-tune.

Where to start