Your agent works the backlog.
You read the morning report.

Agents Never Sleep is an open-source harness + Agent Skill that lets a coding agent run a ticket backlog to completion unattended — with a durable per-ticket state machine, deterministic test gates, git-backed reversibility and an autonomy contract that parks what a machine shouldn't decide alone.

$ pip install git+https://github.com/TokonoMix/agents-never-sleep@v1.0.0

Source on GitHub MIT · Python stdlib only · v1.0.0

example run — illustrative

23:4101-fix-flaky-webhook-testdone

00:5802-add-csv-exportdone

02:1603-migrate-billing-columnPARKED · schema decision

02:1904-i18n-missing-keysdone

04:0305-rate-limit-headersdone

06:1206-cleanup-dead-flagsdone

morning report · 08:00

5 completed (each git-snapshotted, gates green), 1 parked — the schema-direction call it wouldn't make without you.

Mechanism, not magic

Six things the harness actually does

Every capability below is a concrete mechanism in the codebase — named here by the thing that implements it. The full walkthrough lives on How it works.

state machine · per ticket

Durable ticket state

Each ticket moves through an explicit state machine that survives crashes and restarts. A run resumes where it stopped — it doesn't start over.

autonomy contract

ASK / PARK / HALT

Decisions a machine shouldn't make alone — money, schema direction, security boundaries — are parked for the human, and the run moves on to the next ticket.

deterministic gates

Tests decide, not vibes

A ticket only counts as done when its test gate passes. A failing gate reverts the work; the failure is classified and reported, not hidden.

git-backed snapshots

Everything reversible

Work is snapshotted per ticket. Anything the night produced can be reverted commit-by-commit in the morning.

heartbeat watchdog

Survives frozen runs

A watchdog wraps the run by default and restarts it resumably when the agent freezes mid-run — for example during a provider overload wave.

night-report.md

Morning report

The run ends with a ranked report: what landed, what parked and why, what needs your judgement. You review decisions, not diffs-at-random.

The press runs through the night; the report is ready by morning.

The autonomy contract

Never block the run on a single ticket

Unattended doesn't mean unaccountable. The contract defines exactly what happens when the agent hits something it shouldn't decide alone.

ASK

Forbidden while unattended. The harness never sits waiting on a question at 3 a.m. — a question that would block becomes a park instead.

PARK

Record and move on. Money, schema direction, security boundaries, public API contracts: parked with the reasoning, surfaced in the morning report for a human decision.

HALT

Stop the whole run. Reserved for conditions where continuing could do harm — the run ends, states are preserved, the report says why.

What it doesn't do

The honest part

It doesn't guarantee correct code. Gates catch what tests catch. The morning report exists precisely because a human still reviews the outcome.
Enforcement hooks are Claude Code-only today. The harness and skill are portable; the hard permission-enforcement layer currently exists for Claude Code. Hermes and Aider adapters are on the roadmap — until they ship, we won't claim them.
No throughput numbers on this page. We publish benchmarks as reproducible methodology or not at all. A controlled agent-vs-harness comparison is planned; its numbers will appear when they exist.
Reduced, not eliminated. The watchdog, gates and revert paths reduce the ways an overnight run can go wrong. They don't make autonomy risk-free — the contract is designed for exactly that reality.

The night shift: one lamp, one agent, a queue of tickets.

Get started

Two minutes to a first supervised run