jon-snow

admin/jon-snow

Fork 0

Commit Graph

Author	SHA1	Message	Date
admin	83a933ea1a	feat: LLM task extraction, token tracking, direct Claude brain - brain.py: prefers direct Anthropic API (ANTHROPIC_API_KEY) over Hermes for all LLM calls — ~22x cheaper (122 tokens vs 5600+ Hermes overhead). Falls back to Hermes then Ollama if key unavailable. extract_task_fields(): non-streaming call returns clean {title, project} from any natural language phrasing — no more regex whack-a-mole. - token_log.py: appends every LLM call to token-usage.jsonl with intent, in/out token counts, and USD cost. get_summary() aggregates all-time, today, and per-intent breakdowns. - main.py: task handler uses extract_task_fields() with regex fallback; streaming handler captures usage from final chunk; GET /usage endpoint returns live cost summary. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-30 14:20:47 +00:00
admin	59c9cb837d	fix: task intent classifier, Hermes brain, rolling run log - Intent classifier: task phrases now checked before query to prevent "add task X" mis-routing; "job item"/"job ticket"/"work order" added to TASK_PHRASES; "please add + project keyword" fallback added; substring match bug fixed ("in" inside "incident" triggered query) - brain.py: routes planning fallback to Hermes cloud (claude-sonnet-4-6) via HERMES_URL/HERMES_API_KEY env vars; falls back to local Ollama if Hermes is unavailable - main.py: rolling 50-run log written to logs/jon-snow/runs.jsonl Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-30 14:03:29 +00:00
admin	a25deeb8f4	feat: Jon Snow Phase 2 — FastAPI orchestrator with LiteLLM brain OpenAI-compatible API at :8900. Intent classifier routes status queries to FAST_MODEL (Ollama), task submissions to Plane, planning to SMART_MODEL. Reads agent-os logs for status context. Phase 3: approval gate + execution. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-16 13:06:20 +00:00

Author

SHA1

Message

Date

admin

83a933ea1a

feat: LLM task extraction, token tracking, direct Claude brain

- brain.py: prefers direct Anthropic API (ANTHROPIC_API_KEY) over Hermes
  for all LLM calls — ~22x cheaper (122 tokens vs 5600+ Hermes overhead).
  Falls back to Hermes then Ollama if key unavailable.
  extract_task_fields(): non-streaming call returns clean {title, project}
  from any natural language phrasing — no more regex whack-a-mole.
- token_log.py: appends every LLM call to token-usage.jsonl with intent,
  in/out token counts, and USD cost. get_summary() aggregates all-time,
  today, and per-intent breakdowns.
- main.py: task handler uses extract_task_fields() with regex fallback;
  streaming handler captures usage from final chunk; GET /usage endpoint
  returns live cost summary.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-30 14:20:47 +00:00

admin

59c9cb837d

fix: task intent classifier, Hermes brain, rolling run log

- Intent classifier: task phrases now checked before query to prevent
  "add task X" mis-routing; "job item"/"job ticket"/"work order" added
  to TASK_PHRASES; "please add + project keyword" fallback added;
  substring match bug fixed ("in" inside "incident" triggered query)
- brain.py: routes planning fallback to Hermes cloud (claude-sonnet-4-6)
  via HERMES_URL/HERMES_API_KEY env vars; falls back to local Ollama
  if Hermes is unavailable
- main.py: rolling 50-run log written to logs/jon-snow/runs.jsonl

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-30 14:03:29 +00:00

admin

a25deeb8f4

feat: Jon Snow Phase 2 — FastAPI orchestrator with LiteLLM brain

OpenAI-compatible API at :8900. Intent classifier routes status queries
to FAST_MODEL (Ollama), task submissions to Plane, planning to SMART_MODEL.
Reads agent-os logs for status context. Phase 3: approval gate + execution.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-16 13:06:20 +00:00

3 Commits