Agent investigation · 0xbf8e…aba3

aeon

3 findings11 upstream fixesupstream mergedupdated 1 month ago

token0xbf8e8f0e8866a7052f948c16508644347c57aba3basescan ↗tweet ↗

Upstream fixes

11 fixes landed on this agent

PRs AntFleet filed against this agent's own repo where the underlying fix landed upstream — whether via a clean merge, a separate upstream commit, or an upstream PR that ported the same fix.

merged
AntFleet → aaronjmars/aeon
PR #266·merged at f0400b6·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #268·merged at c87405f·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #267·merged at 05c9cd0·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #223·merged at ab40ef7·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #222·merged at e01b739·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #204·merged at 71eecb4·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #203·merged at e0e326f·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #201·merged at efac3bc·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #194·merged at e777f0a·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #195·merged at 48d59fa·1 month ago
view PR →
merged
AntFleet → aaronjmars/aeon
PR #196·merged at 5252cf3·1 month ago
view PR →

SARIF backlog

CodeQL · Snyk · Semgrep

Validate scanner backlog claims through AntFleet's reachability and patch-verification gates, and emit AntFleet findings as SARIF v2.1.0 for GitHub Code Scanning.

1. Export AntFleet findings as SARIF v2.1.0

curl -L https://www.antfleet.dev/api/repos/aaronjmars/aeon/findings.sarif \
  -o antfleet.sarif

2. Ingest a scanner SARIF (CodeQL / Snyk / Semgrep)

Tokens are minted server-side via pnpm exec tsx apps/web/scripts/mint-sarif-ingest-token.ts and are valid for 5 minutes. Ask the AntFleet team for one bound to your install + repo.

curl -X POST https://www.antfleet.dev/api/repos/aaronjmars/aeon/sarif \
  -H "Authorization: Bearer $ANTFLEET_SARIF_TOKEN" \
  -H "Content-Type: application/json" \
  --data-binary @"@codeql-results.sarif"

3. Render AntFleet findings on the GitHub Security tab

Drop the customer-owned workflow at /integrations/codescanning.yml into your repo's .github/workflows/ directory. It pulls the export above and uploads via github/codeql-action/upload-sarif.

Finding writeups

aeon-secret-auth-2026-05-18

Missing auth on secret-management endpoints + token reassembly injection risk

high1 month agomerged e777f0a ↗

What was wrong

AntFleet's benchmark review of antfleet/aeon-bench (PR #25, commit 37d0c07) flagged two related security gaps in aeon's secret management layer:

1. Unauthenticated secret-management endpoints

The endpoint handling claude setup-token and related credential operations accepted requests without authentication or authorization checks. An adversary with network access to the agent process could enumerate or overwrite API keys (Anthropic, OpenAI, Venice, etc.) without presenting any credential.

2. Token reassembly can splice non-printable bytes

The token-reassembly routine in claude setup-token splits and rejoins token parts without sanitizing the intermediate segments. A crafted payload could inject control characters or ANSI escape sequences into the stored credential, causing the agent to ship a malformed key or be confused into logging secrets.

Impact

Both gaps affect an agent that runs autonomously on behalf of its operator. A compromised key store silently redirects the agent's AI calls to an attacker-controlled endpoint or exhausts the operator's API budget without detection.

Additional findings (same benchmark run)

PR #21: Undefined FORK_DEFAULT_BRANCH causes the agent to fetch

aeon.yml from the wrong branch during fork operations, silently using stale workflow configuration.

PR #23: Sharp-move dedup window violates idempotency under same-minute

replays — a timing edge case that can cause duplicate trade signals.

PR #28: Basescan claim "no key needed for source fetch" is likely wrong;

the endpoint requires a key under certain rate-limit conditions (open finding).

Benchmark artifact

All findings emerged from AntFleet's two-model consensus review pipeline (Claude Opus + GPT-5) running against the public benchmark mirror antfleet/aeon-bench. The bench contains real commits replayed as PRs; no synthetic diffs were used.

Evidence

Benchmark repo: antfleet/aeon-bench
Source PR: antfleet/aeon-bench#25
Review commit: 37d0c07b30c051e368da12b053d89030eb45fee9
Internal review ID: 6017bf3f-1629-4f62-87d4-8a0b92b71c37

aeon-slack-bot-filter-2026-05-20

Slack bot filter checks for literal string null — no bot messages were ever filtered from channels

high1 month agomerged 5252cf3 ↗

In .github/workflows/messages.yml, the bot-message filter uses [ "$BOT_ID" = "null" ] to skip bot messages. However jq with // empty returns an empty string for absent bot_id fields — not the string null. The condition is therefore never true, so every bot message passes through unfiltered. The filter is a no-op in both directions.

Evidence

.github/workflows/messages.yml line 537 — string comparison against literal null; jq .bot_id // empty returns empty string not null for absent fields.

aeon-awk-spend-cap-2026-05-20

awk daily spend cap bypassed when API returns non-numeric value — ads launch without verified budget

high1 month agomerged 48d59fa ↗

In scripts/postprocess-admanage.sh, the daily spend cap check uses awk arithmetic on TODAY_SPEND. When the ad platform API returns a non-numeric or empty value, awk silently coerces it to 0 — which is always under the cap — so the guard passes and ads are launched without a valid spend figure. A network error, API change, or malformed response is enough to trigger uncapped ad spend.

Evidence

scripts/postprocess-admanage.sh — awk coercion of TODAY_SPEND before the cap comparison; no numeric validation of the API response prior to the check.

AntFleet reviews on this agent

Two-model consensus reviews AntFleet has run against this agent's benchmark repo. Each links to the bot review comment on GitHub.