Agent loop + CI improvements: PRs, fast/slow split, scheduled deploy #156

New Issue

2026-05-22T19:53:01Z

guettlibot commented

2026-05-22 19:53:01 +00:00

(Migrated from codeberg.org)

Background / root cause found

Issue #153 (and likely #148, #150–#155) was closed without any fix. Root cause:

Claude hit its org rate limit immediately — the agent exited after < 1 s.
The loop saw: agent dead → pending_issue=153, latest CI green (from issue #147's push) → closed #153 without verifying a new CI run had occurred.

Fix already committed in b48cb98: the loop now records the CI run ID when the agent starts and only closes the issue if a newer run passes. Same CI run ID → agent pushed nothing → set State/Question instead.

Planned improvements

1. Use PRs instead of pushing directly to main

Currently agents push directly to main. This means:

There is no code-review step, not even a diff in the UI.
If CI fails on main, it blocks all other work.
Merge conflicts are more likely.

Proposed change:

Agents create a feature branch (issue-<N>-fix) and open a PR.
The loop tracks the PR number in state alongside the issue number.
When the PR's CI passes the loop merges the PR (with --squash or --merge) and closes the issue.
If CI fails, the loop starts a fix-CI agent on the same branch.

Agent prompt addition: "Create a branch named issue-<N>-fix, push there, open a PR against main. Do NOT close the issue or merge."

Loop state addition: { "pr": 42, "issue": 153, ... } — _latest_ci_run switches to checking the PR's head commit status.

2. Fast tests only in CI (on push + PR)

ci.yml currently runs the full suite on every push and PR: unit tests, Android build, Play Store publish, linux deploy, website publish. Android + Play Store each take ~30–45 min.

Proposed split:

ci.yml (on: push + pull_request) — keep only the check job:

task check-dagger (unit tests, analysis, formatting)
Removes build-linux, deploy-playstore, publish-website jobs from this workflow.
Target: < 10 min total.

android-emulator-tests.yml — move to scheduled-only (remove push/pull_request triggers).

3. Scheduled hourly run on main for long tests + deploy

New workflow .forgejo/workflows/deploy.yml (or rename android-emulator-tests.yml):

on:
  schedule:
    - cron: '0 * * * *'   # every hour
  workflow_dispatch:

Jobs:

test-android-firebase (Firebase Test Lab)
deploy-playstore (publish to Play Store)
build-linux + deploy-linux
publish-website

4. Set a label on success after scheduled run

When the scheduled hourly run succeeds, mark success visibly so:

The agent loop can check whether the last full deploy passed.
Developers can see the deploy health at a glance.

Option A — label on a persistent tracking issue (e.g., a pinned "Deploy health" issue):

fgj issue edit <tracking-issue> --add-label "CI/Full-Pass" --remove-label "CI/Full-Fail"

This is easy to query from the loop.

Option B — post a comment + set a Forgejo commit status (via Forgejo API POST /repos/.../statuses).

Option C — create a git tag deploy/YYYYMMDD-HHMM on each successful full deploy.

Recommendation: Option A (tracking issue label) is simplest and queryable by the loop.

Acceptance criteria

Agent loop fix (b48cb98) deployed and verified — no more phantom closes.
Agents create PRs; loop merges on CI green.
ci.yml runs only fast tests (< 10 min) on push + PR.
New scheduled workflow runs full suite once per hour on main.
On scheduled success, set CI/Full-Pass label on a tracking issue.

## Background / root cause found Issue #153 (and likely #148, #150–#155) was closed without any fix. Root cause: 1. Claude hit its org rate limit immediately — the agent exited after < 1 s. 2. The loop saw: agent dead → `pending_issue=153`, latest CI green (from issue #147's push) → **closed #153 without verifying a new CI run had occurred**. **Fix already committed** in b48cb98: the loop now records the CI run ID when the agent starts and only closes the issue if a *newer* run passes. Same CI run ID → agent pushed nothing → set `State/Question` instead. --- ## Planned improvements ### 1. Use PRs instead of pushing directly to main Currently agents push directly to `main`. This means: - There is no code-review step, not even a diff in the UI. - If CI fails on `main`, it blocks all other work. - Merge conflicts are more likely. **Proposed change**: - Agents create a feature branch (`issue-<N>-fix`) and open a PR. - The loop tracks the PR number in state alongside the issue number. - When the PR's CI passes the loop merges the PR (with `--squash` or `--merge`) and closes the issue. - If CI fails, the loop starts a fix-CI agent on the same branch. Agent prompt addition: "Create a branch named `issue-<N>-fix`, push there, open a PR against main. Do NOT close the issue or merge." Loop state addition: `{ "pr": 42, "issue": 153, ... }` — `_latest_ci_run` switches to checking the PR's head commit status. ### 2. Fast tests only in CI (on push + PR) `ci.yml` currently runs the full suite on every push and PR: unit tests, Android build, Play Store publish, linux deploy, website publish. Android + Play Store each take ~30–45 min. **Proposed split**: **`ci.yml` (on: push + pull_request)** — keep only the `check` job: - `task check-dagger` (unit tests, analysis, formatting) - Removes `build-linux`, `deploy-playstore`, `publish-website` jobs from this workflow. - Target: < 10 min total. **`android-emulator-tests.yml`** — move to scheduled-only (remove `push`/`pull_request` triggers). ### 3. Scheduled hourly run on main for long tests + deploy New workflow `.forgejo/workflows/deploy.yml` (or rename `android-emulator-tests.yml`): ```yaml on: schedule: - cron: '0 * * * *' # every hour workflow_dispatch: ``` Jobs: - `test-android-firebase` (Firebase Test Lab) - `deploy-playstore` (publish to Play Store) - `build-linux` + `deploy-linux` - `publish-website` ### 4. Set a label on success after scheduled run When the scheduled hourly run succeeds, mark success visibly so: - The agent loop can check whether the last full deploy passed. - Developers can see the deploy health at a glance. **Option A** — label on a persistent tracking issue (e.g., a pinned "Deploy health" issue): ``` fgj issue edit <tracking-issue> --add-label "CI/Full-Pass" --remove-label "CI/Full-Fail" ``` This is easy to query from the loop. **Option B** — post a comment + set a Forgejo commit status (via Forgejo API `POST /repos/.../statuses`). **Option C** — create a git tag `deploy/YYYYMMDD-HHMM` on each successful full deploy. Recommendation: Option A (tracking issue label) is simplest and queryable by the loop. --- ## Acceptance criteria - [ ] Agent loop fix (b48cb98) deployed and verified — no more phantom closes. - [ ] Agents create PRs; loop merges on CI green. - [ ] `ci.yml` runs only fast tests (< 10 min) on push + PR. - [ ] New scheduled workflow runs full suite once per hour on main. - [ ] On scheduled success, set `CI/Full-Pass` label on a tracking issue.

Sign in to join this conversation.

Branches Tags

main

issue-563-agentloop-validation

dummy-pr-test

issue-560-fix-firebase-run-url

issue-539-stable-imap-uid

issue-533-shared-email-list

plan-issue-555

drop-nix

plan-issue-484

plan-issue-539

plan-issue-535

plan-issue-474

plan-issue-533

fix-dagger-engineless-precommit

issue-521-fix-deploy-yml-wait-time-api

issue-502-fix-email-id-collision-mailbox

issue-492-eliminate-duplicate-build-runner

issue-494-website-change-detection

issue-491-parallelize-check

issue-478-fix-stalwart-dual-stack-bind

issue-475-allowed-addresses-glob

issue-473-search-result-reorder

issue-453-update-agentloop-defaults

issue-466-structured-search

issue-505-exclude-chaos-monkey-from-regular-ci

issue-509-fix-search-result-sorting

fix-ink-sparkle-remaining-tests

issue-506-fix-search-emails-tests

issue-504-runner-wait-time

issue-488-search-notes

issue-472-changelog-issue-links

issue-501-folder-search-local-sqlite

issue-486-fix-stale-test-shader-mismatch

fix/prevent-settled-search-rerun-473

issue-467-fix-search-stale-results

issue-446-installed-versions-in-changelog

issue-462-fix-pr

issue-448-chaos-monkey-test

issue-436-notes-on-emails

issue-429-unify-mail-display

issue-422-move-to-folder-create-new

issue-414-ensure-not-run-as-root

issue-424-unify-email-list-views

issue-419-trusted-senders-page

issue-425-fix-prs

test-foo

issue-421-bug-report

issue-383-fix-ci

issue-394-fix-deploy-flutter-version

issue-391-fix-ci-double-trigger

issue-376-combined-inbox-v2

issue-376-combined-inbox

issue-384-fix-open-prs

sops-migrate

issue-339-safe-first-on-imap-fetch

issue-340-try-catch-measure-height

issue-342-pin-intl-version

issue-341-guard-threademails-last

issue-335-agentloop-code-test

issue-329-fix

issue-315-fix

issue-320-fix

issue-325-fix

issue-312-fix

issue-311-fix

issue-305-fix

issue-304-fix

issue-299-fix

issue-300-fix

issue-298-fix

issue-296-fix

issue-294-fix

issue-289-fix

issue-288-fix

issue-287-fix

issue-286-fix

issue-277-fix

issue-282-fix

issue-280-fix

issue-272-fix

issue-268-fix

issue-267-fix

issue-266-fix

issue-258-fix

issue-260-fix

issue-257-fix

issue-253-fix

issue-216-fix

issue-251-fix

issue-249-fix

issue-question-fixes

issue-235-fix

issue-236-fix-v2

issue-237-fix

issue-236-fix

issue-228-fix

issue-217-fix

issue-214-fix

issue-213-fix

issue-208-fix

issue-205-fix

issue-204-fix

issue-203-fix

issue-202-fix

issue-129-fix

issue-161-fix

issue-160-fix

issue-201-fix

issue-210-fix

issue-198-fix

issue-200-fix

issue-144-fix

issue-199-fix

fix/playstore-upload-use-requests

issue-193-fix

issue-186-fix

issue-185-fix

issue-192-fix

issue-183-fix

issue-175-fix

issue-172-fix

issue-171-fix

issue-167-fix

issue-136-fix

issue-162-fix

issue-179-fix

issue-155-fix

issue-154-fix

issue-152-fix

issue-151-fix

issue-141-fix

issue-150-fix

issue-164-fix

migrate-to-dagger

task/d1-ci-matrix

task/a4-typeconverter-json

task/u7-onboarding-walkthrough

task/d3-sync-doc

task/a5-layer-boundary-lint

task/t5-golden-tests

task/p5-date-cache

task/s4-link-handling

task/p3-html-parse-isolate

task/u8-mark-all-read

task/u3-recent-searches

task/a3-jmap-injectable-http-client

task/r5-tls-error-handling

fix/playstore-redirect-retry

task/t3-repository-contract-tests

task/p2-email-list-pagination

task/p1-fts5-search

fix/playstore-upload-timeout

task/a1-email-detail-notifier

fix/upgrade-workmanager-0.9

fix/android-core-library-desugaring

task/p4-db-indexes

task/r3-html-error-boundary

task/d2-check-coverage

task/a2-email-tile

task/t4-migration-tests

task/t2-widget-tests

task/t1-email-repo-coverage

task/u6-connection-status

task/u4-push-notifications

task/u2-draft-sync

task/u1-list-unsubscribe

task/s2-hostname-validation

task/r6-reliability-fuzz-tests

task/r4-sync-error-banner

task/r2-force-resync

task/r1-undo-history-persistence

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: guettli/sharedinbox#156