Triumphs

675 / 15225 points

⚑ VELOCITY

βœ“
FIRST LIGHT
Complete your first Copilot session with a knowledge-store entry
"The Traveler's light is new. So are you."
25pts
β—‹
RED BEFORE GREEN
Complete a full TDD red→green→refactor cycle using tdd-red, tdd-green, tdd-refactor agents
"You did not build before you broke."
50pts
β—‹
THUNDERCRASH LANDING
Pop the blueprint-mode super: complete a full 6-phase Blueprint Mode workflow with all gates honored
"The crater is the spec."
75pts
β—‹
TRUSTEE'S OATH
Use the workflow skill correctly in 5 consecutive sessions without scope creep
"It doesn't ask for recognition. It shows up."
50pts
β—‹
ARCHITECT'S BLUEPRINT
Use blueprint-mode correctly for 3 separate features β€” each invocation must pass all planning gates before implementation begins, with gate output documented in the PR description
"The plan is not the map. The plan is the weapon."
50pts
β—‹
GREEN UNBROKEN
Achieve 10 consecutive GitHub Actions runs that pass on the first attempt β€” no re-runs, no workflow_dispatch retries, no force-push resets across any repo
"The Traveler's Light does not flicker. Neither does your pipeline."
75pts
βœ“
COORDINATED FIRE
Dispatch 3 or more agents simultaneously in a single session β€” each completing a distinct, real task in parallel β€” and synthesize their outputs into a unified deliverable
"The Vanguard does not take turns. Neither should you."
75pts
β—‹
FRICTION LOGGED
Observe a workflow friction point during a session and file a GitHub issue or SKILL.md update to address it before the session ends β€” the fix is documented, not deferred
"The log remembers what the Guardian forgets. Write it down."
25pts

πŸ”₯ COMMUNITY

βœ“
THE CHRONICLE BEGINS
Store your first knowledge-store entry
"Every flame starts with a single spark."
25pts
β—‹
THE LIBRARY
Accumulate 10 knowledge entries (SYNTHESIS β‰₯ 10)
"Memory has gravity. Enough of it, and even light bends toward it."
50pts
β—‹
HAMMER THROWN
Pop the principal-software-engineer super on a real architectural decision and implement its recommendations
"You did not guess where the fault was. You knew. The hammer agreed."
75pts
β—‹
TORCH PASSED
Ship a merged PR to any open source project using Copilot assistance
"The Solar Guardian does not hoard light β€” they emit it."
50pts
β—‹
THE CNCF PLEDGE
Contribute a merged PR to any CNCF project repository β€” governance docs, people site, projects site, endusers site, or landscape data β€” using the cncf-dev or cncf-community skill
"The cloud is not a place. It is a promise made between builders. You kept it."
75pts
β—‹
WORDS IN THE ARCHIVE
Publish or substantially update a blog post, documentation page, or user guide β€” PR merged to bluefin-docs or any public-facing documentation repo
"The Light was never meant to be kept. Someone had to write it down."
50pts
βœ“
POWER ASCENDING
Record the first Level: increment in any skill's SKILL.md β€” the moment a skill crosses from loaded to mastered, marked in the permanent record
"The Ghost watched the number change. So did you. Neither of you said anything. Both of you knew."
25pts
β—‹
THE CONVERGENCE
In a single session, meaningfully apply 3 or more distinct skill families β€” each must produce a real output (PR, knowledge entry, or merged change). Domains genuinely integrated, not sequential
"Ikora Rey does not specialize. She contains multitudes. So does the session log."
50pts

🟣 MASTERY

βœ“
THE FORGE
Create a new skill file (SKILL.md) and add it to the INDEX.md
"You cannot fire what you have not forged."
50pts
β—‹
WARD OF DAWN
Pop the se-security-reviewer super and remediate at least one finding
"The bubble. Nothing ships without passing through."
75pts
β—‹
RESONANCE UNLOCKED
Use skill-improver to improve an existing SKILL.md with measurable changes
"The weapon is better because you used it."
50pts
β—‹
SHADOW BANISHED
Use se-security-reviewer to identify a real security vulnerability β€” OWASP Top 10 or equivalent β€” and remediate it with a merged PR. Finding must be actionable, not theoretical
"The Void does not hide what you refuse to see. It only waits."
75pts
β—‹
THE TRIUNE PATH
Complete the full TDD red->green->refactor cycle using all three agents (tdd-red, tdd-green, tdd-refactor) for a real feature β€” all three phases in a single session, all tests green at the end
"Three agents. One truth. The code that emerged was not written. It was proven."
75pts
β—‹
SECOND SIGHT
Run the doublecheck agent on 3 distinct AI-generated outputs in a season β€” each must complete all three pipeline layers (self-audit, source verification, adversarial review) and produce a structured report
"The first read is for understanding. The second read is for the truth."
50pts
βœ“
VOICE FROM THE TOWER
Invoke the principal-software-engineer agent on a real architectural decision β€” the consultation must surface at minimum one design concern or trade-off that materially changes your approach
"Zavala does not advise lightly. Neither does the agent. Listen both times."
50pts

🟒 DISTRIBUTION

β—‹
FIRST WEAVE
Ship a new Homebrew cask or formula using the homebrew-tap skills
"The first thread. The mesh begins."
75pts
β—‹
BLADEFURY ACTIVATED
Pop the github-actions-expert super to build a complete GH Actions workflow
"Thread by thread. System by system. The Weave does not negotiate with entropy."
50pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
100pts
β—‹
FLATTENED AND SHIPPED
Publish a Flatpak application using the flatpak skill β€” AppStream metainfo passes appstreamcli validate with zero warnings and zero errors, and the submission reaches Flathub or a public remote
"The sandbox was not a prison. It was a promise that the app would behave. It did."
75pts
β—‹
THE CASK LANDS
Submit a Homebrew cask or formula using the homebrew-tap skill that is merged to ublue-os/homebrew-tap or castrojo/tap β€” CI must pass on the first push
"The tap is a contract. The cask is the handshake. Both have been kept."
75pts
β—‹
SEALED IN LIGHT
Build, tag, and push an OCI container image using the oci-images skill β€” cosign signature verification must pass, and the image must be pullable from GHCR by a separate environment
"The image is not a file. It is a promise, signed in cryptographic light."
50pts
β—‹
THREE VECTORS
Package the same software in 3 or more distinct distribution formats (e.g., Homebrew cask, Flatpak, OCI image, COPR RPM) across a single season β€” each format must be independently installable
"One target. Three trajectories. The Guardian who can only aim one way is not a threat."
100pts

πŸ”΅ STABILITY

β—‹
THE CRYSTAL HOLDS
Successfully build a Bluefin OCI image using bluefin-build skill
"Not everything must move. Some things must simply hold."
75pts
β—‹
GLACIAL QUAKE
Pop the swe-subagent super to stabilize a chaotic multi-file implementation
"Chaos does not survive the cold. Neither does the bug."
50pts
β—‹
THE MATRIX HOLDS
Achieve a fully green Bluefin image build β€” all variant matrix jobs pass, all cosign checks succeed, no CI failures across any matrix dimension
"The Behemoth does not bend. It holds. When it holds, nothing moves."
50pts
β—‹
IMMUTABLE COVENANT
Complete a full season of platform work with zero hotfix commits β€” all changes go through the proper PR gate, no emergency bypasses, no direct pushes to protected branches
"The fortress was not built to be patched. It was built to hold. It held."
75pts
β—‹
THE SWITCH IS MADE
Successfully execute a bootc switch to a locally-built OCI image in a QEMU VM, verify the running system matches the expected image digest, and document the process in the knowledge store
"The old system did not die. It became something better. That is the nature of immutable light."
50pts
β—‹
GATED AND RELEASED
Complete a full bluefin-lts PR gate cycle from start to finish: PR triggers create-lts-pr.yml gate, draft created, squash-merge approved, merged without force pushes or manual interventions
"The Queen's Gate is not a formality. It is the proof that the platform was ready."
75pts

πŸ”† SEASON OF THE POWERLEVEL

βœ“
THE MIRROR IS FORGED
Launch castrojo.github.io/powerlevel for the first time
"The Guardian turns the tools on themselves."
100pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
25pts
β—‹
FIFTY LIGHTS
Reach ENDURANCE 50 β€” fifty completed Copilot sessions
"Fifty sessions. The Traveler noticed."
100pts
β—‹
THE COMPUTATION RUNS
Witness the first successful pl-gha-compute GitHub Actions run β€” the workflow that reads SKILL.md Level: fields and writes powerlevel-data.json β€” all steps green, data updated
"The system computed itself. The Traveler watched. The log said: success."
50pts
β—‹
THE SEAL TURNS
Reach 50% progress on any seal that has never previously been at 50% β€” the moment the title shifts from aspiration to trajectory
"Halfway is not a destination. It is the proof that the destination exists."
50pts
β—‹
THE THRESHOLD CROSSED
Cross a Powerlevel milestone threshold (500, 750, 1000, 1250, 1500, or 1750) for the first time β€” recorded automatically when the dashboard computes the new level
"The number changed. The Guardian did not. But something about the room felt different."
75pts

πŸ”† CONQUEROR

β—‹
IRON TEMPERED
Complete the BLACKSMITH seal β€” forge at least one merged upstream Bluefin PR using bluefin-build skill
"The forge is behind you. The Blacksmith remembers."
50pts
β—‹
BREAKER'S MARK
Complete the CURSEBREAKER seal β€” prove that the loop always holds and friction is never forgotten.
"The pipeline runs clean. The loop holds. The mark is yours."
50pts
β—‹
DARK ARTS MASTERED
Complete the DREDGEN seal β€” identify a real vulnerability with se-security-reviewer and remediate it
"You looked into the darkness and named what you found there."
50pts
β—‹
NEVER BROKE
Complete the FLAWLESS seal — run 3 consecutive TDD red→green→refactor cycles with zero test failures
"The Lighthouse has no record of your failures."
50pts
β—‹
LORE COMPLETE
Complete the CHRONICLER seal β€” execute the full knowledge loop 10 verified times across sessions
"The light that is forgotten is still light. You chose not to forget."
50pts
β—‹
THE VOICE HEARD
Complete the RIVENSBANE seal β€” submit a PR to a CNCF governance or community repo
"Riven is silent. The community spoke."
50pts
β—‹
DEADEYE'S OATH
Complete the DEADEYE seal β€” ship a new package as cask, flatpak, or RPM using the appropriate skill
"One shot. Clean delivery. The package lands where you aimed it."
50pts
β—‹
THE QUEEN'S SERVICE
Complete the QUEENSGUARD seal β€” complete a full bluefin-lts PR gate cycle without force pushes
"The throne was never undefended."
50pts
β—‹
PINNACLE REACHED
Reach Powerlevel 1,800 β€” all subclass averages at or above 1,650, artifact bonus included
"1,800. The number does not lie."
100pts
β—‹
AGENT OF EVERY ORDER
In a single session, successfully deploy at least 5 distinct agent types: blueprint-mode, swe-subagent, tdd-red, se-security-reviewer, and qa β€” each completing a real task
"Five agents. One session. No gaps."
100pts

πŸ”₯ CHRONICLER

β—‹
WHISPER'S LIBRARY
Execute the complete knowledge loop (knowledge-search β†’ act β†’ knowledge-store) 10 verified times across any sessions, as confirmed by session history
"Ten iterations. The loop holds its shape."
75pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
75pts
β—‹
KEEPER OF THE CODEX
Store 5 distinct, actionable knowledge entries in a single season using knowledge-store, each with tags and project attribution
"Five entries. Five futures spared."
50pts
β—‹
VOICES OF THE ARCHIVE
Document at least 3 CNCF community events, governance patterns, or contributor ladder entries using the cncf-community skill
"The community's patterns, encoded."
50pts
β—‹
PAGES UNBURNED
Write or substantially update a page in bluefin-docs β€” PR merged to the upstream repo
"The documentation outlasts the session."
50pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
25pts
β—‹
THE FULL RECORD
Author or meaningfully update 3 or more SKILL.md files in a single season, each reducing friction for future sessions
"Three files. Three fewer repeated mistakes."
50pts

⚑ CURSEBREAKER

β—‹
FIRST FRACTURE
Fix a failing GitHub Actions workflow β€” diagnose the failure, apply the fix, verify the run passes β€” using gha-expert or bluefin-ci skill
"The CI was red. Now it is not. You did that."
75pts
β—‹
THE LOOP HOLDS
Complete the cicd-learning canonical cycle end-to-end: SEARCH β†’ LOAD β†’ ACT β†’ CAPTURE β†’ STORE β†’ SYNC β€” all steps verified in a single session
"Search. Load. Act. Capture. Store. Sync. The cycle completes."
75pts
β—‹
SIGNAL CLEAR
Push a PR that passes all CI checks on the first attempt, with no re-runs, no force pushes, no 'fix typo' commits
"First push. All green. Nothing to explain."
50pts
β—‹
FRICTION REMEMBERED
Complete a session where observed friction results in a new rule added to cicd-learning/SKILL.md or workflow/SKILL.md
"The pain was real. The rule ensures it doesn't repeat."
50pts
β—‹
THE CURSE LIFTED
Diagnose and resolve a CI failure that has been open for more than 48 hours, referencing the gha-expert or bluefin-ci skill explicitly in the fix
"The corruption has been cleansed. The pipeline breathes."
50pts

πŸ”΅ BLACKSMITH

β—‹
FIRST FORGE
Build and locally validate a Bluefin OCI image using the bluefin-build skill β€” all variant checks pass
"The first layer. The Blacksmith's mark."
50pts
β—‹
TEMPERED STEEL
Submit a PR to projectbluefin/bluefin or ublue-os/base that is merged upstream
"The PR is merged. The upstream remembers."
100pts
β—‹
THE FULL MATRIX
Build an image that passes cosign signature verification and all variant matrix checks, referencing oci-images skill
"All variants. All checks. All green."
75pts
β—‹
ALL FRAMES FORGED
Work with at least 4 distinct image variants defined in bluefin-vars in a single season β€” aurora, bluefin, hwe, framework, etc.
"Four faces of the same image. All valid."
50pts
β—‹
IRON IN THE FIRE
Have 3 or more open PRs simultaneously across bluefin, aurora, or base repos β€” all CI green
"Three PRs. Three furnaces running."
50pts
β—‹
THE RELEASE BELL
Complete a full release cycle using bluefin-rel skill β€” stream tag management, changelog, promotion
"The stream advances. The changelog is written."
75pts
β—‹
MASTER'S MARK
Reach bluefin-build power level 1,700+ in the system dashboard β€” sustained skill mastery across seasons
"1,700. The forge is hot and has been for a while."
50pts

🟣 DREDGEN

β—‹
SHADOW'S GAZE
Run se-security-reviewer on a PR or code change and surface a real, actionable security finding β€” not a false positive
"The finding was real. Not a false alarm. Not a guess."
75pts
β—‹
THE WORM'S TEETH
Identify and remediate an OWASP Top 10 vulnerability using se-security-reviewer β€” document the finding and fix
"OWASP-10. Named, documented, remediated."
100pts
β—‹
ERIANA'S PROMISE
Run doublecheck on 5 distinct AI-generated outputs in a single season β€” three-layer pipeline (self-audit, source verification, adversarial review) completed each time
"Five outputs. Fifteen checks. Zero assumptions."
75pts
β—‹
THE SECOND TRUTH
Use 2nd-opinion (Dead Man's Tale) to reverse or materially change a design decision β€” document what changed and why
"The first opinion was wrong. The record shows what changed."
50pts
β—‹
ADVERSARIAL MIND
The qa agent finds a bug in a feature where tdd-green had declared the implementation complete β€” document the gap
"tdd-green said complete. qa said otherwise. qa was right."
50pts
β—‹
HAND OF DARKNESS
On the same PR, run both doublecheck and se-security-reviewer in tandem β€” all findings documented and addressed
"Both agents. Same PR. All findings addressed."
75pts
β—‹
THE FALL REMEMBERED
Document a security incident, near-miss, or discovered vulnerability in the knowledge base with full context for future sessions
"The incident is recorded. Future sessions will know."
25pts
β—‹
DREDGEN'S BURDEN
All four SENTINEL weapons β€” 2nd-opinion, se-security, doublecheck, and qa β€” reach or exceed 1,300 power level in the dashboard
"All four SENTINEL weapons sharpened."
75pts

🟣 FLAWLESS

β—‹
RED BEFORE GREEN
Use tdd-red to write failing tests before any implementation in 5 consecutive tasks β€” no implementation-first shortcuts
"Five tasks. Not one started with implementation."
75pts
β—‹
MINIMAL PASSING
Use tdd-green to produce an implementation that tdd-refactor subsequently approves with zero structural changes required β€” just style and polish
"The minimum was enough. The refactor agreed."
75pts
β—‹
THE REFACTOR HOLDS
Use tdd-refactor to clean an implementation while all tests remain green β€” qa agent confirms no regressions
"The tests stayed green. Every one."
75pts
β—‹
PRAXIS COMPLETE
Complete the full red β†’ green β†’ refactor TDD cycle for a feature using all three agents (tdd-red, tdd-green, tdd-refactor) β€” delivered in a single session
"Red. Green. Refactor. One session. No breaks."
75pts
β—‹
NO REGRESSIONS
After a tdd-refactor pass, a qa agent finds zero new bugs and zero broken tests β€” clean bill of health
"qa found nothing. That is the goal."
50pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
75pts

🟒 RIVENSBANE

β—‹
VOICE OF A THOUSAND COMMITS
Contribute a merged PR to any of the CNCF trilogy sites (people-website, projects-website, endusers-website) using the cncf-dev skill
"A merged PR. The CNCF trilogy heard it."
75pts
β—‹
THE DRAGON'S HOARD
Use the cncf-landscape MCP server (query_projects, get_project_details) to answer 10 distinct project queries in a season β€” no manual full.json fetching
"Ten queries. The Landscape answered each one."
25pts
β—‹
COMMUNITY HERALD
Submit a merged PR to a CNCF governance document (contributor ladder, security policy, project health file) using cncf-community skill
"Governance is a living document. You wrote into it."
75pts
β—‹
THOUSAND VOICES
Complete a cncf-dev session that meaningfully touches all three trilogy sites β€” a shared layout fix, cross-site test, or SiteSwitcher change
"All three sites. One session. The mesh is real."
50pts
β—‹
KEEPER OF THE LAW
Add to or materially update the cncf-community skill with a new governance pattern or anti-pattern observed in practice
"A new pattern added. Future Guardians will find it."
50pts
β—‹
THE WISHING WALL
Complete a CNCF-tagged feature using blueprint-mode for planning and cncf-dev for implementation β€” both agents in the same session
"Blueprint-mode planned it. cncf-dev built it."
75pts
β—‹
RIVEN SILENT
Fix a bug in any cncf-dev site that appears in the session store in 2 or more prior sessions β€” the long-standing issue finally resolved
"The bug appeared in two prior sessions. It will not appear in a third."
50pts

πŸ”₯ DEADEYE

β—‹
FIRST SHOT
Package a new Homebrew cask or formula using the homebrew-tap skill β€” PR merged to ublue-os/homebrew-tap or castrojo/tap
"A new formula. A new cask. A clean merge."
75pts
β—‹
PRECISION LOAD
Submit a Flatpak AppStream metainfo file that passes appstreamcli validate with zero warnings and zero errors using the flatpak skill
"appstreamcli says zero warnings. Zero errors."
75pts
β—‹
CLEAN SIGHTS
Submit a PR using bluefin-pkgs skill that passes all CI checks on first push β€” no fixup commits, no re-runs
"First push. All checks pass. No excuses."
50pts
β—‹
TRIPLE TAP
Package the same software in two distinct formats (e.g., cask + flatpak, or formula + RPM/COPR) in a single season using the appropriate skills
"One piece of software. Two formats. Both delivered."
100pts
β—‹
STEADY HAND
Maintain 3 packages across a full season β€” no regression PRs required, no breakage reported
"Three packages. A full season. No breakage."
50pts

πŸ”΅ QUEENSGUARD

β—‹
THRONE OF THE DEEP
Complete a full bluefin-lts workflow: PR gate triggers β†’ draft PR created β†’ squash-merge approved β€” no manual interventions
"The LTS gate opened. The squash-merge closed it. No exceptions."
75pts
β—‹
THE TAKEN SHORE
Resolve a linux-desktop issue (Flatpak, Wayland, GNOME, bootc, atomic) using the linux-desktop skill β€” PR merged or issue closed
"The linux-desktop issue is resolved. The desktop is stable."
50pts
β—‹
COURT OF BLADES
Successfully test a bootc upgrade using the local OTA registry workflow (zot registry on host, QEMU VM pulls from it) without pushing to GHCR
"The VM pulled from zot. GHCR never needed."
75pts
β—‹
PALE HEART INTACT
Complete an LTS session that makes changes to the bluefin-lts branch with zero regressions to the LTS user base β€” no hotfix required
"LTS users noticed nothing. That is the measure of success."
50pts
β—‹
SILENT VANGUARD
Run oci-images cosign verification across all active variants β€” all pass, all checksums verified
"All variants. All checksums. All verified."
50pts
β—‹
THE QUEEN'S PROMISE
Submit a PR that was correctly triaged by the create-lts-pr.yml gate workflow and merged via the squash-merge approval flow
"Gate triggered. PR squash-merged. The workflow held."
50pts
β—‹
UNDYING WATCH
Over a full season, maintain the LTS branch health: all PRs gated, no force pushes, no emergency hotfixes
"A full season. All PRs gated. No hotfixes required."
50pts

πŸ”† GILDED GHOST

β—‹
THE GHOST'S LIGHT
Create a new skill (SKILL.md) that is subsequently loaded and used across 3 or more distinct sessions β€” verified in session history
"Three sessions. Three times the skill was loaded without prompting."
75pts
β—‹
LIGHT BEARER
Create a new custom agent (.agent.md) that resolves a real, documented problem β€” not a duplicate, not theoretical
"A new agent. A real problem. Not theoretical."
75pts
β—‹
SHELL UPGRADE
Improve an existing SKILL.md to meaningfully reduce token cost (trim to target line count) while preserving all utility β€” no capability loss
"Fewer tokens. Same capability. The ghost remembers what matters."
50pts
β—‹
RESURRECTION
Use a knowledge-search result to avoid repeating a past mistake β€” the prior session documented the exact pattern that would have been re-introduced
"Prior context surfaced. The mistake was not repeated."
75pts
β—‹
PATTERN ENCODED
Add a new session-derived rule to AGENTS.md under Observed Friction β€” must be based on something that actually went wrong, not anticipation
"The friction was real. The rule ensures it is not repeated."
50pts
β—‹
THE TRAVELER'S GIFT
Complete a full agent audit β€” at minimum: audit phase (parallel review agents) + fix phase (parallel swe-subagents) β€” identifying and resolving 5+ actionable issues
"Five issues. Identified. Resolved. The system is better."
100pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
100pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
25pts

πŸ”† IRON LORD

βœ“
THE GREAT RECKONING
Complete a 4-agent parallel audit (token-auditor, coverage-analyst, design-critic, bug-hunter) that identifies 10+ bugs and architectural issues across the agent and skill fleet
"Thirteen bugs. Eight architectural issues. Named and numbered."
100pts
βœ“
IRON WAVE
Deploy 6 parallel swe-subagent instances in a single session to resolve all critical and high-severity bugs surfaced by the reckoning
"Six subagents. One session. All critical bugs resolved."
75pts
βœ“
THE REBORN TRIO
Rewrite all three TDD agents (tdd-red, tdd-green, tdd-refactor) for a new language or framework β€” plus create a new companion agent that defends the system
"Three agents, reborn. Go, TypeScript, Playwright. The forge was worth it."
75pts
βœ“
THE DASHBOARD BORN
Design and commit the canonical Powerlevel system dashboard β€” subclasses, weapons, caps, open issues, fleet section, agents registry
"The system can see itself now."
50pts
β—‹
THE IRON CODEX
Design the complete Seal + Title system for the Powerlevel framework β€” 10 permanent seals, 1 seasonal seal, full triumph tables, title display integration
"Eleven seals. Seventy-six triumphs. The framework is complete."
50pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
50pts

INCIDENT RESPONSE

β—‹
INCIDENT COMMANDER
Coordinate a multi-component incident response β€” define the severity, assign investigation tracks (at minimum one for mitigation, one for root cause), drive to resolution, and deliver the postmortem. All documented with timestamps.
"Someone had to call the severity. You did. The system held."
100pts
β—‹
LESSONS FROM THE FIRE
Close out ALL action items from a prior postmortem within the same season they were written. Each action item must be verifiable (merged PR, knowledge-store entry, or closed issue).
"The fire taught. The Warden remembered. The lesson is now policy."
100pts
β—‹
THE WATCH BEGINS
Document an on-call rotation for a real project: who, what they own, escalation path, alert routing, and handoff process. The document must be committed to the repo and referenced in the project's README.
"The watch rotation is not an informal arrangement. It is a contract with reliability."
50pts
β—‹
TRIAGE PROTOCOL
Define and document a severity classification matrix for a real project β€” at minimum P0/P1/P2 levels with criteria, response SLAs, and escalation paths for each severity
"The Ghost does not panic. It classifies. Panic is what happens before the protocol."
50pts
β—‹
CONTROLLED COLLAPSE
Execute a planned chaos engineering experiment β€” define the hypothesis, inject the failure, observe the blast radius, restore, and document findings. System must recover to steady state without manual intervention
"The Darkness does not ask permission before it strikes. Neither should your tests."
100pts
β—‹
THE COVENANT'S TEETH
Define formal SLAs for a real service with customer-facing commitments β€” availability target, response time, and consequence of breach. Document in the repo and reference from the README
"An SLO is a promise to yourself. An SLA is a promise to everyone else. Know the difference."
75pts
β—‹
THE BLAMELESS RECORD
Run a blameless retrospective following a real incident or near-miss β€” separate what happened from who was involved, identify systemic factors, and produce action items that target process, not people
"The Light does not punish the Guardian who fell. It asks: what failed to catch them?"
75pts
β—‹
BLIND SPOT FOUND
Identify a failure mode that currently produces no alert or log entry β€” add detection coverage (an alert, a structured log, or a health check) before the failure occurs in production
"The worst enemy is the one the Ghost cannot see. Close the gap before it opens."
75pts
β—‹
PATTERN RECOGNIZED
Write blameless postmortems for 3 distinct incidents in a season β€” at the third, identify a common root cause pattern across all three and produce a single architectural action item to address it
"The first incident is a surprise. The second is a coincidence. The third is a pattern. Name it."
100pts

OBSERVABILITY

β—‹
THE RECKONING
Conduct a formal SLO review at the end of a season β€” compare actual performance to SLO targets, document trends, and produce at least one concrete action based on the data.
"The targets were set in hope. The review measures honesty."
75pts
β—‹
THE READABLE DARK
Instrument a service with structured logging (JSON or equivalent) β€” every log entry must include: timestamp, level, correlation ID, service name, and a message field. No freeform string concatenation
"The Ghost cannot parse chaos. Give it fields it can read."
50pts
β—‹
THE THREAD FOLLOWS
Implement request correlation IDs that propagate across at least 2 service boundaries β€” a single trace ID visible in logs from ingress to final handler, queryable end-to-end
"Strand weaves through everything. So should your trace ID."
75pts
β—‹
THE ALARM HAS A VOICE
For every alert firing in production: ensure a runbook link is present in the alert body. Deploy this requirement for at minimum 5 distinct alerts across one or more projects
"An alarm that wakes you with no instructions is not an alarm. It is a complaint."
50pts
β—‹
CARDINALITY CONTAINED
Identify and remediate a high-cardinality metric label (e.g., user ID, request ID, or unbounded enum in a label) that is causing metric storage explosion β€” document before/after cardinality count
"The Traveler does not number every grain of light. Neither should your metrics."
75pts
β—‹
THE KNOWN STATE
Establish a documented performance baseline for a real service β€” p50, p95, p99 latency and error rate captured under normal load, committed to the repo, and referenced in future runbooks
"You cannot name what is broken until you know what is normal."
50pts
β—‹
THE FIRST SCREEN
Build an on-call dashboard that answers the first 5 questions an engineer asks during an incident β€” it must be usable without familiarity with the system, and linked from the on-call rotation docs
"The Titan who arrives to an unfamiliar battle needs a map, not a legend."
75pts
β—‹
ALL FOUR BURNING
For a single service in production: demonstrate all four Golden Signals (latency, traffic, errors, saturation) are queryable, graphed on a dashboard, and each has at least one associated alert threshold defined
"Four signals. One truth. The Seer who misses even one is flying blind."
100pts

CODE REVIEW

β—‹
LIGHT ACCEPTED
Receive a code review with 5+ substantive comments and resolve ALL of them with either a code change or a written rebuttal. The PR reviewer must mark the conversation resolved β€” not the author.
"They found five things. You answered all five. The code is yours now."
25pts
β—‹
BEYOND YOUR OWN FORGE
Give a meaningful code review on a PR in a repository you do not own or regularly contribute to. The review must result in at least one change before merge.
"The review was not required. It was given anyway. That is stewardship."
75pts
β—‹
EYES BEFORE MERGE
Before requesting a review on any PR touching authentication, secrets handling, network, or file I/O: run `se-security-reviewer` AND address all High/Critical findings. Document the pre-review audit in the PR description.
"The reviewer saw a clean PR. They did not see what was removed before they looked."
75pts
β—‹
THE ADVERSARIAL EYE
Run the doublecheck agent on AI-generated code before merging a PR β€” all three pipeline layers (self-audit, source verification, adversarial review) completed, findings documented in the PR description
"The Ghost trusts the Light. It does not trust the code. Neither should you."
50pts
β—‹
THE TOWER ANSWERS
Invoke the principal-software-engineer agent during a code review β€” the consultation must surface a design concern not caught in the initial review, documented as a PR comment before merge
"Zavala has seen the design before. He has also seen it fail. Ask him both times."
75pts
β—‹
THE RAPID VERDICT
Complete 5 code reviews within 4 hours of assignment β€” each must include substantive inline comments, not just LGTM. Track and document the turnaround time for each review
"The Trials do not wait for the late. Neither does the PR queue."
75pts
β—‹
THE GUARDIAN'S CHECKLIST
Author and publish a code review checklist for your project β€” covering at minimum: security, test coverage, error handling, and API contract stability. Reference it from the CONTRIBUTING guide
"The checklist is not bureaucracy. It is the memory the reviewer does not have to carry."
50pts

SUPPLY CHAIN

β—‹
BILL OF MATERIALS
Generate a Software Bill of Materials (SBOM) for a real project β€” using `syft`, GitHub's dependency graph, or equivalent. Verify the SBOM is accurate and commit it to the repo or attach it to a release artifact.
"You did not ship a mystery. You shipped a manifest."
75pts
β—‹
THE UNBROKEN SIGNATURE
Complete a full Sigstore/cosign supply chain: build artifact β†’ sign with cosign β†’ verify signature β†’ attach attestation. All four steps must be documented with the exact commands and their outputs quoted.
"Every step signed. Every signature verified. The chain is unbroken."
100pts
β—‹
SHADOW AUDIT
Run a full dependency audit on a project β€” identify any CVEs, outdated transitive dependencies, or packages with no upstream maintainer. Document findings and remediate all High/Critical findings within the same season.
"You did not trust. You verified. The dependency graph had no secrets left."
75pts
β—‹
FROZEN IN TIME
Convert a Containerfile or workflow that uses floating tags (`FROM fedora:latest`, `uses: actions/checkout@v4`) to pinned digests. Verify the pin is deterministic and add a Renovate config entry to keep the digest current.
"It does not float. It does not drift. It is pinned, and it holds."
75pts
β—‹
THE IRON CARETAKER
Author or substantially improve a `renovate.json` config β€” not just copying a template, but adding custom datasources, scheduling, grouping rules, or semantic commit message config. The config must produce at least one non-trivial PR in the season.
"Renovate does not rest. The config ensures it works while you sleep."
50pts
β—‹
TRUSTED BUT VERIFIED
For a COPR, GHCR, or third-party registry dependency: verify the source, check the signing status, and document the trust chain in the relevant SKILL.md. Do not rely on the registry being 'from a known source' β€” verify the chain explicitly.
"The source was known. You verified it anyway. That is the practice."
75pts
β—‹
THE LEGAL BIND
Run a license compliance scan on a project's dependencies. Document any GPL/copyleft dependencies in a project that distributes binaries, and confirm the license strategy is compatible.
"The license was not an afterthought. It was the first question."
50pts

ARCHITECTURE DECISION RECORDS

β—‹
WRITTEN IN ADVANCE
An ADR you authored is referenced by another contributor (or a future session agent) when evaluating a design option β€” and it demonstrably prevents a mistake or avoids re-debating a settled decision.
"You wrote it before the question was asked. The answer was already there."
150pts
βœ“
PLAN BEFORE FORGE
For any session touching 3+ files: complete the scope declaration AND knowledge-search step BEFORE writing a single line of code β€” with the scope, goal, and out-of-scope explicitly stated in the session's first response.
"The cursor did not move first. The plan moved first. The cursor followed."
25pts
β—‹
KEEPER OF DECREES
Maintain an ADR log for a full season: at minimum 3 ADRs written, 1 superseded or updated, and all ADRs indexed in a `docs/decisions/README.md`. The log must be navigable by status: Accepted / Superseded / Proposed.
"The decisions are navigable. The history is not lost. The log holds."
75pts
β—‹
THE DESIGN COMMITTED
Author a design document for a feature touching 3+ components before writing any implementation code β€” must include: problem statement, constraints, proposed solution, alternatives considered, and open questions
"The blueprint is not the building. But without it, the building is not the plan."
75pts
β—‹
THE COUNCIL DECIDES
Have an ADR you authored reviewed and ratified by at least one other contributor β€” the review must result in at least one substantive change to the decision or its consequences section
"The Iron Lords did not build alone. The council sharpens what the individual cannot."
75pts
β—‹
THE WRONG PATH NAMED
Document a decision that was made and subsequently proven wrong β€” write the full reversal ADR: what was decided, what changed, what it cost, and what the new decision is. Tag the original ADR as Deprecated
"The Darkness teaches by winning. The Guardian teaches by writing down how they lost."
100pts

MULTI-ENVIRONMENT

β—‹
THE FIRST GATE
Promote a change from a development environment to staging using an explicit gating mechanism. The gate must verify at minimum: tests pass, no secrets in diff, and the artifact is signed. Document the gate criteria.
"It passed the first gate. Not because it was easy. Because it was ready."
50pts
β—‹
THE FINAL GATE
Promote a change from staging to production with explicit pre-promotion verification: smoke test results quoted, rollback procedure confirmed, and a deployment window documented. Not just 'CI was green' β€” active verification at promotion time.
"The final gate. Active verification. Not hope β€” proof."
75pts
β—‹
TACTICAL RETREAT
Execute a real rollback β€” not a test rollback, but one triggered by a genuine problem discovered post-deployment. Document the trigger, the rollback procedure, the time from detection to resolution, and the postmortem reference.
"Rolling back is not failure. Knowing when to roll back is mastery."
100pts
β—‹
CANARY IN THE MINE
Implement or execute a staged rollout for a real change β€” canary, blue/green, or percentage-based. Document the rollout stages, the signals monitored at each stage, and the criteria for proceeding vs. rolling back.
"Not everyone received it at once. The canary went first. The canary survived."
100pts
β—‹
NONE SHALL PASS UNVERIFIED
Write an explicit promotion policy for a project: what must be true before a PR can be promoted from each environment. Commit it to the repo as `docs/deployment-policy.md` or equivalent. Reference it in at least one subsequent PR.
"The rules exist. They are written. They apply to everyone, including you."
50pts

DOCUMENTATION QUALITY

β—‹
THE NEXT AGENT TEST PASSED
Author a SKILL.md entry that passes the Next-Agent Test explicitly: a fresh session references the entry and navigates directly to the correct solution WITHOUT needing to re-investigate. Document the test in the session that confirms it.
"The new agent did not search. It read your entry. It did not need to search."
75pts
β—‹
FOUR PARTS COMPLETE
Write 5 consecutive knowledge-store entries that each include all four mandatory parts: `[WHAT]`, `[WHY]`, `[FIX]`, and `[NEXT]`. The `[NEXT]` field must be a specific, actionable warning β€” not 'be careful.'
"What. Why. Fix. Next. Every entry. No exceptions."
50pts
β—‹
NEVER AGAIN
A knowledge-store entry you wrote is referenced in a subsequent session to prevent repeating the exact same mistake it documented. The repeat-prevention must be explicit β€” the session must say 'this entry prevented X' β€” not implied.
"The entry was written. The mistake was not made again. The loop works."
100pts
β—‹
THE CONCISE TRUTH
Reduce a SKILL.md file by 20% or more in line count while preserving all utility β€” verified by the Next-Agent Test before and after. The trimmed version must load faster and leave no critical pattern undocumented.
"Shorter. Just as true. The Chronicler knows what to keep."
75pts
β—‹
SHOW, DON'T TELL
Improve a SKILL.md entry by adding a concrete worked example β€” actual commands, actual output quoted, actual failure and fix shown side by side. The example must be real (from a prior session), not fabricated.
"The documentation has a scar in it. That scar is real. That scar teaches."
50pts

TOIL REDUCTION

β—‹
THE WEIGHT OF REPETITION
Identify and quantify a source of operational toil: how many times per week, how long each occurrence, what is the cumulative cost per season. Document it as a backlog item with a priority level.
"Toil that is named can be measured. Toil that is measured can be eliminated."
50pts
β—‹
TIME TO LIGHT
Measure and document Mean Time to Recovery (MTTR) for at minimum 3 real incidents or CI failures across a season. Present the trend β€” is MTTR improving, stable, or degrading? Produce one concrete action based on the trend.
"The time is counted now. The trend has a direction. The Warden acts on it."
75pts
β—‹
THE QUIET NIGHT
Identify a recurring false alert pattern and eliminate it β€” tune the threshold, add a suppression condition, or deprecate the alert entirely if it has zero signal value. Document the before/after alert volume.
"The alert that cried wolf is gone. The silence is now trustworthy."
75pts
β—‹
THE COMPLETE WATCH
Complete a full incident lifecycle end-to-end: detect β†’ contain β†’ resolve β†’ postmortem β†’ prevention action merged. All five phases documented with timestamps. The prevention action must be a merged PR or committed change.
"Detection. Containment. Resolution. Postmortem. Prevention. The Watch is complete."
150pts
β—‹
THE RUNBOOK DIES WELL
Automate a runbook procedure so thoroughly that the runbook becomes unnecessary β€” the automation handles detection, response, and verification. Mark the runbook as Deprecated with a pointer to the automation
"The best runbook is the one that no longer needs to be run."
100pts

STREAKS

β—‹
CIRCUIT UNBROKEN
Execute `knowledge-search` at the START of a session (before writing any plan or touching any file) in 10 consecutive non-trivial sessions. A non-trivial session is one where at least one file is modified.
"Search before you touch anything. Ten times in a row. The circuit holds."
100pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
150pts
β—‹
UNWAVERING RED
In 7 consecutive code-change sessions, the TDD red phase (failing tests written FIRST) precedes any implementation. A session where implementation precedes tests resets the counter to zero.
"Seven sessions. Red first. Every time. The reflex is now permanent."
100pts
?
CLASSIFIED
Complete this triumph to reveal its secret.
200pts
β—‹
IRON STANDARD
Complete ALL 9 items in the Definition of Done checklist for 10 consecutive sessions. Every item β€” knowledge-search at start, scope declared, tests written before implementation, tests run and quoted, output verified, scope respected, learnings captured, knowledge-store called at end, sync dispatched.
"Ten sessions. Nine checkboxes. All verified. The Iron Lords would approve."
200pts
β—‹
FIVE STAR REVIEW
Every PR has at least one substantive review comment before merging β€” 5 consecutive PRs. Not auto-merge, not 'LGTM' without reasoning. Substantive means at minimum 2 sentences of technical feedback.
"Five PRs. Five reviews. None merged without a real comment."
75pts
β—‹
NO AGENT REDO
Zero agent re-dispatches in 7 consecutive sessions β€” every agent call resolves the task on the first invocation. A re-dispatch is defined as calling the same agent type twice for the same task because the first result was wrong or insufficient.
"Seven sessions. Every agent call landed. The brief was always clear."
100pts
β—‹
CLEAN SLATE
Zero pending todos at session end for 10 consecutive sessions β€” every task started is either completed or explicitly deferred with a reason. A session that ends with an unaddressed pending todo resets the counter.
"The todo list was empty at close. Ten times. That is rare."
75pts
β—‹
FIRST TOOL WINS
First tool called resolves the task (no fallback) β€” 5 consecutive tasks. The correct tool is identified and deployed on the first attempt; no secondary tool call is needed to complete what the first was meant to do.
"Five tasks. Five first calls. Five resolutions. The tool was right."
75pts
β—‹
IRON DISCIPLINE
AUTOLOAD completed fully every session β€” 20 consecutive sessions. Every session begins with the full AUTOLOAD sequence: knowledge-search, load relevant skills, declare scope. No partial completions. No skipped steps.
"Twenty sessions. The AUTOLOAD ran. Every time. The discipline became the reflex."
150pts

WAYFARER

β—‹
THREE ROADS WALKED
Touch at least 3 different skill families (bluefin, cncf, tdd, security, etc.) in a single season β€” each family must produce a real deliverable (PR, knowledge entry, or skill update) in that season.
"The Wayfarer does not specialize. The Wayfarer walks every road."
25pts
β—‹
THE MAP IS REAL
Complete the knowledge loop (knowledge-search β†’ act β†’ knowledge-store) in 10 or more distinct sessions across the season. Each session must be a different project or domain.
"The map grew with every session. Now it describes somewhere real."
25pts
β—‹
ARCHITECT OF ROADS
Invoke `blueprint-mode` for 3 or more different project types across the season β€” each invocation for a genuinely different domain (not the same type of feature in the same repo three times).
"The blueprint came before the code. Three times. Three different terrains."
25pts
β—‹
BEYOND THE HORIZON
Complete a cross-repo session that meaningfully touches 3 or more distinct repositories β€” coordinated changes, not just read-only exploration. All repos must receive a real deliverable.
"Three repositories. One session. The Wayfarer sees the whole map."
25pts
β—‹
FIRST STEPS
Load a skill that has never been loaded before in your session history β€” 3 separate occurrences across the season. Each must be a genuinely new skill (not one you've previously loaded and forgotten about).
"You did not know this territory. You learned it anyway."
25pts
β—‹
THE FULL MAP
Complete a session that combines bluefin + cncf + tdd workflows in a single session β€” each must produce a real output (PR, test, knowledge entry). The domains must be genuinely integrated, not sequential and isolated.
"The map is not the territory. The Wayfarer has walked both."
25pts

UNBROKEN

β—‹
FIRST ATTEMPT, FIRST GREEN
CI first-attempt green for 10 consecutive sessions. Every PR in those sessions must pass all CI checks on the initial push β€” no re-runs, no force-pushes, no fixup commits.
"Ten sessions. Green on first push. Every time. The circuit holds."
100pts
β—‹
THE STRAIGHT LINE
Zero force-pushes for an entire season. Every push to every branch is additive. If git history needs rewriting, it is done before the first push β€” never after.
"The history is linear. The history is honest. It was never rewritten."
100pts
β—‹
ZERO REGRESSIONS
Zero regressions introduced across a full season β€” confirmed by the `qa` agent finding no bugs introduced by your changes in any session. Each session where `qa` runs must produce a clean bill of health.
"The season passed. Nothing was broken that was not already broken."
100pts
β—‹
SYNC IN SEQUENCE
`just sync` run in 15 consecutive sessions where any skill or knowledge file was updated. The sync must occur within the session window β€” not batched afterward.
"Fifteen syncs. Fifteen session closes. The knowledge did not wait."
50pts
β—‹
THE `pl` DISCIPLINE
`pl` command run (AUTOLOAD honored) in 20 consecutive sessions. AUTOLOAD must be completed fully β€” all four steps β€” in every qualifying session.
"The circuit holds. It has always held. Twenty sessions confirm it."
50pts
β—‹
ZERO FAILED DISPATCHES
Zero failed agent dispatches across 10 consecutive sessions. A failed dispatch is one where the agent returns an error, produces no output, or is re-dispatched because the first result was unusable.
"Ten sessions. Every agent returned. Every dispatch landed."
50pts
β—‹
GATE HONORED
Blueprint-mode gate honored every time across 5 sessions β€” no implementation started before the blueprint gate passes. The gate output must be documented in each session.
"Five sessions. Five gates. All passed before the forge was lit."
50pts
β—‹
THE LOOP, COMPLETE
Full cicd-learning cycle completed 3 consecutive times β€” SEARCH β†’ LOAD β†’ ACT β†’ CAPTURE β†’ STORE β†’ SYNC β€” each cycle in a distinct session with a real deliverable. Three consecutive completions without a break.
"Three complete loops. The system learned each time. You carried the thread."
100pts
β—‹
THE UNBLEMISHED SEASON
Complete an entire season of work (12+ consecutive sessions) with zero CI regressions introduced β€” every commit leaves the pipeline green
"The Crucible does not remember who almost made it. Neither does the build log."
100pts

HARBINGER

β—‹
BEFORE THE FIRE
Be the first to load a newly-published skill (within its first 3 sessions of existence in the system). The skill must be real, not created by you, and you must complete a real task with it in that session.
"They saw the ember before the fire. They loaded the skill on day one."
75pts
β—‹
THE FIRST PR
Submit the first PR to a repository that did not exist in the previous season. The PR must be merged and must be a substantive contribution β€” not a README placeholder or initial commit.
"The repo did not exist. Now it does. And you left your mark in it."
75pts
β—‹
LIGHT BEARER
Create a custom agent that solves a genuine, documented problem β€” not a duplicate of an existing agent, not theoretical. The agent must be used in at least 2 real sessions before claiming this triumph.
"The problem had no agent. Now it does. You built the tool the system needed."
75pts
β—‹
THE FIRST DOCUMENTED USE
Invoke a skill for the first documented use case in the knowledge base β€” a use that has no prior session history. Document it as the founding entry in that skill's knowledge record.
"No one had done this before. Now someone has. The record begins here."
50pts
β—‹
THE LAWMAKER
Propose and implement an architecture change accepted as a new AGENTS.md rule β€” a rule that changes how all future sessions behave. The rule must be based on observed friction and accepted into the canonical AGENTS.md.
"The law was proposed. The law was accepted. The system runs differently now."
50pts

πŸ”₯ WARDEN

β—‹
FIRST BLOOD
Respond to a real production or CI incident β€” not a routine failure β€” within 1 hour of detection. Document the initial triage steps taken in the knowledge store within the same session.
"The alert fires. The cursor moves. The Warden was already watching."
50pts
β—‹
BLAMELESS RECORD
Write a complete blameless postmortem for a real incident or near-miss: timeline, root cause (not just symptom), contributing factors, action items with owners and due dates. Store in knowledge-store with `type:postmortem` tag.
"No names in the report. Only systems, decisions, and the gaps between them."
75pts
β—‹
THE FIVE WHYS
Apply the 5-why technique to a real failure β€” trace the causal chain to its root (not the proximate cause). Each 'why' must be a distinct, verifiable statement. Document in knowledge-store with the full chain.
"The proximate cause is never the cause. Keep asking."
75pts
β—‹
THE WARDEN'S TOME
Author a runbook for an operational procedure that currently exists only in tribal knowledge or informal SKILL.md notes. Structure: trigger condition β†’ detection β†’ mitigation β†’ escalation β†’ verification. Commit to the skills repo.
"If it isn't written down, it only exists while you're awake."
50pts
β—‹
BY THE BOOK
Execute a pre-written runbook during a real incident or maintenance operation β€” following it step-by-step, not from memory. Annotate any steps that needed updating and submit the improvement.
"The Warden did not improvise. The Warden followed the runbook. The runbook was right."
75pts
β—‹
REVISED IN BLOOD
After any incident where the runbook failed to prevent or accelerate recovery, update the runbook with the missing step/detection signal/escalation path. Document what the runbook missed and why.
"The old runbook would have let it happen again. The new one will not."
75pts
β—‹
THE AUTOMATION IMPERATIVE
Identify a repeated manual operation (toil) that consumes real time each week, automate it completely, and document the before/after time measurement in knowledge-store. The automation must be used in at least 2 subsequent sessions before claiming this triumph.
"The task no longer exists. The time it took no longer disappears."
150pts

⚑ SPIREKEEPER

β—‹
GOLDEN SIGNALS
Instrument a real service or pipeline with all four Golden Signals: latency, traffic, error rate, saturation. Each signal must produce a queryable metric or log line β€” not just a print statement.
"Four signals. One truth. The system speaks; the Seer listens."
75pts
β—‹
TRACE THE PATH
Instrument a multi-step operation with distributed tracing (OpenTelemetry or equivalent) β€” at minimum 3 spans covering distinct stages. Use the trace to identify an actual performance bottleneck or unexpected latency.
"The request left. The Seer watched every step it took."
100pts
β—‹
SIGNAL, NOT NOISE
Define and deploy a monitoring alert that fires for a real condition and NOT for normal variance. The alert must have documented: threshold rationale, runbook link, severity level, and a suppression condition for known maintenance windows.
"The alert fires once. For the right reason. The Seer built that."
100pts
β—‹
THE WATCHER'S EYE
Create an operational dashboard covering at least 3 distinct signal types for a real project. The dashboard must be usable by someone who didn't build it β€” include a title and description for each panel.
"They did not need to ask what was happening. They looked at the board."
50pts
β—‹
THE COVENANT
Define formal SLOs for a real service: at minimum one availability SLO and one latency SLO. Each must include: measurement method, target percentage, measurement window, and the consequence of breach. Store the definition in the knowledge base.
"Success has a number now. The system is accountable to it."
75pts
β—‹
BUDGET BURNED
Make and document an explicit error budget decision: either freeze releases because the budget is exhausted, OR document a risk acceptance to ship despite budget pressure. The decision must reference the SLO and be stored with a `type:slo-decision` tag.
"The budget was finite. The decision was deliberate. The record is honest."
100pts
β—‹
INTO THE DARK NO MORE
Identify a system, pipeline, or workflow that currently fails invisibly. Add at minimum: a failure log with context, an alert or check, and a runbook link. Verified by simulating the failure and confirming the signal fires.
"It failed before. Silently. It will never fail silently again."
100pts

🟒 PATHFINDER

β—‹
THE REVIEWER'S OATH
Give a code review that results in at least one actionable finding being addressed before merge β€” not a spelling fix, but a logic, security, or design concern. The review comment must be substantive (3+ sentences with reasoning).
"The PR was better because you read it. The author knows it now too."
50pts
β—‹
BLUEPRINT BEFORE FORGE
For any feature touching more than 3 files: run `blueprint-mode` for the design phase AND present the design in the PR description BEFORE writing implementation code. The PR must reference the blueprint output.
"The plan came first. The code followed the plan. The PR shows both."
75pts
β—‹
RAPID RESPONSE
Respond to a PR review request within 4 hours of assignment (during a working session) β€” not just 'LGTM' but a complete review with inline comments. Verified by PR timeline showing review within 4 hours of assignment.
"The request came in. The review went out. The queue did not grow."
50pts
β—‹
DESIGN IN THE OPEN
Open a GitHub issue or discussion for a significant architectural change BEFORE writing any code β€” gather at least one external piece of feedback. The final implementation must reference the issue.
"The design was public. The critique was welcomed. The code is better for it."
75pts
β—‹
THE FIRST DECREE
Author your first formal Architecture Decision Record (ADR) β€” in `docs/decisions/` or equivalent. Must include: Title, Status, Context, Decision, Consequences (positive and negative). Not a SKILL.md entry β€” a structured ADR document.
"The first decree. Context. Decision. Consequences. It is written."
50pts
β—‹
LORE APPLIED
Reference a prior ADR in a PR description when making a decision that either follows or consciously deviates from that ADR. The reference must explain WHY the ADR is relevant β€” not just a link.
"The decision was made before. You knew it. You said why it still applied."
75pts
β—‹
THE LAW REWRITTEN
Supersede an existing ADR with a new one β€” because circumstances changed, new data emerged, or the original decision was wrong. The new ADR must explicitly reference the old one and explain why the decision changed.
"The old law was not wrong. It was right, then. This one is right, now."
100pts