Dashboard — Live leaderboard, dataset coverage, and activity feed.
User Guide (user/)¶
Documentation for RAs and contributors using qebench.
Getting Started — Install, configure, run your first command
Working with uv — Python package manager intro and common workflows
CLI Reference — All commands, options, and examples
Tutorials
Your First Translation Session — Walk through
qebench translateContributing Entries — Grow the benchmark with
qebench addUpdating Datasets — Keep data current with
qebench updateJudging Translations — Compare translations and build Elo ratings with
qebench judgeRunning LLM Benchmarks — Batch translate with Claude or OpenAI via
qebench run
Developer Guide (developer/)¶
Documentation for developers building and maintaining qebench.
Architecture — Module structure, data flow, design decisions
Data Models — Pydantic schemas, JSON format, validation
Contributing — Development workflow, testing, code style