📋 Todo Report

Generated: 2026-02-22 08:24:32

📚 Commands Cheatsheet ▼

Regenerate report:

python tools/todos/generate_report.py

Enrich todos with LLM metadata (priority/difficulty/utility):

python tools/todos/enrich.py          # Run enrichment
python tools/todos/enrich.py --dry-run # Preview only

Impact analysis (multi-model consensus on highest-impact items):

python tools/todos/analyze.py              # maxthink preset (Opus, GPT-Pro, Grok, Gemini)
python tools/todos/analyze.py -c fast      # Fast/cheap models
python tools/todos/analyze.py --top 5      # Top 5 instead of 3

After enrichment, the report shows colored priority badges (P1-P5), difficulty levels, and utility descriptions.

Open a file in your editor: click the file location to select it, copy (Cmd+C), then press Cmd+P in VS Code and paste.

637 Total Items · 582 Open · 55 Completed · 3 Projects

💰 investor

Investor Replication (2 open)

☐ Replicate top investor frameworks
replication/ — Acquire content, extract structured thesis elements, build company timelines, operationalize into scoring. Three targets: Reeves/Infuse (Substack + letters), Druckenmiller (interviews + 13F), Tepper (interviews + 13F). Start with Reeves — most written content, existing principles in research/infuse_principles.md.
☐ Bond covenant analysis
covenants/ — Extract structured covenants from EDGAR indentures, compute headroom, track amendments. EBITDA definition resolution is the hard part.

Phase 0: Brain Demo (Now) (14 open)

▶

Phase 1: Foundation (3 open)

☐ Price ingestion pipeline
  • Historical data backfill (daily OHLCV)
  • On-demand fetch for analysis
  • Store in SQLite or Redis timeseries
☐ Source Hub integration
  • Connect to existing source MCP server
  • Define source types: filings, transcripts, news, social
  • Basic ingestion → extraction → storage flow
☐ Monitoring scaffold
  • Define watchlist table (names, theses, broad themes)
  • Cron or daemon skeleton for periodic checks
  • Simple "new content detected" alerts

Phase 2: Assimilation Engine (3 open)

☐ Relevance filtering
  • Given new content, classify by watchlist item
  • LLM-based relevance scoring
  • Route to appropriate name/thesis
☐ Extraction pipeline
  • Facts, claims, variable updates from content
  • Structured JSON output (per vision doc)
  • Append to evidence ledger
☐ Executive summary generation
  • Per-name and per-thesis summaries
  • "What's changed since last update?"
  • Highlight thesis-altering signals

Phase 3: Sentiment (4 open)

☐ Research existing tools
  • What APIs exist? (Twitter, Reddit, StockTwits, YouTube)
  • What sentiment libraries work well?
  • Academic papers on sentiment-price relationships
☐ Build sentiment tracker
  • Ingest social mentions per symbol
  • Compute sentiment score (simple first: positive/negative/neutral)
  • Track volume and sentiment over time
☐ Divergence detection
  • Compare sentiment trend vs price trend
  • Flag: "price up, sentiment flat/down" and vice versa
  • Backtest: do divergences predict continuation?
☐ Dashboard widget
  • Social din chart per symbol
  • Highlight divergence periods
  • Quick sentiment snapshot

Phase 4: Research Mode (3 open)

☐ Historical analysis toolkit
  • Given an event, find earliest mentions
  • Timeline reconstruction
  • "Who called it?" search
☐ Present analysis framework
  • Structured prompts for thinking through news
  • Second-order effects template
  • Confirm/refute checklist
☐ Case study format
  • Narrative + structured data output
  • Lessons learned extraction
  • Feed back into monitoring rules

Phase 5: Causal Learning (3 open)

☐ Forecast grading system
  • Track predictions with timestamps
  • Auto-grade when horizon passes
  • Aggregate accuracy metrics
☐ Causal graph experiments
  • Prototype: how to represent causal chains?
  • Options: neo4j, embeddings, rules engine
  • Start with manual curation, then automate
☐ Feedback loops
  • Graded forecasts → update priors
  • Successful patterns → monitoring rules
  • Failed predictions → post-mortems

Ideas / Backlog (6 open)

▶

🌊 rivus

People β€” This Week (2 open)

☐ Connect with SMAI
Figure out dates, schedule for this coming week (week of Feb 10)

Priority (40 open)

▶

Review with User (6 open)

▶

Investor Replication & Covenant Analysis (4 open)

☐ Investor replication system
~/all-code/investor/replication/ — Extract analytical frameworks from top investors (Reeves/Infuse, Druckenmiller, Tepper) by acquiring their content (Substack, interviews, letters, 13F), extracting structured thesis elements per document, building company timelines, and operationalizing into scoring/screening. Design task: tasks/design/investment_philosophy_extraction.md
☐ Bond covenant analysis
~/all-code/investor/covenants/ — Extract structured covenants from EDGAR indentures/credit agreements, compute headroom vs current financials, track amendments over time. Key challenge: resolving nested EBITDA definitions and cross-references.
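The headroom computation itself is simple once the covenanted EBITDA is resolved; a sketch for the most common case (a max net-leverage covenant), which assumes the definition-resolution step the item flags as the hard part has already happened:

```python
def leverage_headroom(net_debt, covenant_ebitda, max_leverage):
    # Turns of EBITDA between current leverage and the covenant ceiling.
    # covenant_ebitda must be the indenture's *defined* EBITDA (with
    # add-backs), not the reported figure -- resolving that definition
    # is the hard part noted above.
    current = net_debt / covenant_ebitda
    return max_leverage - current

# 400 net debt on 100 covenant EBITDA vs a 6.0x ceiling -> 2.0 turns of headroom
headroom = leverage_headroom(400.0, 100.0, 6.0)
```

A negative result means the covenant is already breached; tracking this number over amendments is what surfaces covenant erosion.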

Learning (8 open)

▶

Newsflow: CEO Interviews & Podcasts (10 open)

▶

LLM Tools (2 open)

☐ fetch tool for lib/llm tool registry
lib/llm/tools.py — LLM can fetch URLs from search results. Needs: brain's fetch_escalate with smart proxy escalation, BrightData unlocker/JS rendering for paywalled/dynamic content. High-volume JS fetching may need the existing browser service or a BrightData Browser CDP endpoint. Design considerations: mode param (auto/js/unlocker), rate limiting, content truncation for token efficiency.

Transcription (6 open)

▶

KB & Self-Learning (6 open)

▶

Visual TODO (2 open)

☐ Explore Gradio themes - https://www.gradio.app/guides/theming-guide
  • Built-in: gr.themes.Glass(), gr.themes.Ocean(), gr.themes.Citrus()
  • Pick one consistent theme for all rivus Gradio apps

System (4 open)

☐ Background hook - Meta-hook that checks whether the other hooks need updating
☐ 🔐 admin.jott.ninja + admin-only Cloudflare Access policy
Set up a separate Cloudflare Access policy ("admin-only") restricted to tchklovski@gmail.com only (Google OAuth). Apply to admin.jott.ninja and any future admin-only subdomains. Ensure the existing "allow-friends" policy does NOT grant access to admin-only paths. Move sensitive content (billing, API keys, cost tracking) from the watch dashboard to a dedicated admin page. Eventually: live cost polling from the Anthropic/OpenAI/BD APIs.
☐ 🎙️ Voice UI to Claude Code session
Cloudflare-protected endpoint (voice.jott.ninja or under admin.jott.ninja) that lets you talk to a running Claude Code session on the laptop via voice. Admin-only access (tchklovski@gmail.com).
  • Existing prototype: finance/earnings/live/transcribe_gpt_realtime.py — OpenAI Realtime API via WebSocket (GPT-4o-transcribe, VAD, 24kHz PCM, sub-second latency). Reuse the WebSocket/audio-capture patterns.
  • Architecture: Browser (mic) → WebRTC/WebSocket → Cloudflare tunnel → laptop endpoint → STT (OpenAI Realtime or Deepgram) → text → Claude Code session (via it2api send-text or supervisor) → response → TTS → audio back to browser
  • Key pieces: (1) Web UI with mic capture + audio playback, (2) backend that bridges the audio stream to STT, (3) Claude Code session targeting (pick which session to talk to), (4) TTS for responses (OpenAI TTS or browser SpeechSynthesis), (5) Cloudflare Access admin-only policy
  • Why admin-only: this is a remote shell into your machine — it must be locked to your email only, never friends
☐ ☁️ Always-on serving from GCP box under jott.ninja
Run cloudflared on mcp-server-box (GCP) so it can serve *.jott.ninja subdomains alongside the laptop tunnel. The same Cloudflare Access (Google OAuth) protects everything. Two use cases:
  • Steps: (1) install cloudflared on mcp-server-box, (2) authenticate to the existing rivus tunnel, (3) add ingress rules for GCP-served hostnames in the tunnel config, (4) deploy the Gradio app container, (5) add the DNS route: cloudflared tunnel route dns rivus .jott.ninja
  • Key insight: one tunnel can have connectors on multiple machines — Cloudflare routes by hostname to the right origin. The laptop handles local dev services; GCP handles always-on apps.
  • Existing infra: the GCP box already has Docker, nginx, 4 vCPU / 16 GB RAM. See ai/docs/mcp-box-setup.md for box details and infra/cloudflared.yml for the current tunnel config.

Writing / Substack (4 open)

☐ Parallel & Speculative Development - Write up patterns for developing with cheap parallel workers
  • Speculative execution, fork-and-verify, test assumptions in background, design for parallel dev
  • Real examples from rivus: vario pipeline, background agents, fork-to-check-history
  • Key insight: copies of workers are cheap, waiting is expensive
  • This is a genuine contribution — most dev practices assume serial work
☐ Decide where writeups live - writing/ or design/writing/ in rivus?
  • Substack drafts, learnings, patterns worth sharing
  • Separate from design/drafts (which are LLM review outputs)
  • Should be git-tracked, easy to preview as markdown

Trading / Investor (2 open)

☐ Portfolio news monitoring - Monitor news about portfolio companies, assess market reaction and implications
  • Track news events (earnings, product launches, regulatory, macro) for held positions
  • Assess: how is the market reacting? how should we be reacting?
  • Compare market reaction vs our fundamental view — find mismatches (overreaction, underreaction)
  • Feed into position sizing / exit decisions in moneygun

Self-Learning & Iteration (vario/geneval direction) (10 open)

▶

Refactoring (2 open)

☐ Move smart-fetch logic to browser project - brain/fetcher.py + refusal.py (~400 lines) should move to browser
  • browser exposes a /smart-fetch endpoint with JS retry and refusal detection
  • brain just calls browser, handles caching + LLM analysis

Long-term (4 open)

☐ 🔴 Rapid takeoff company sketch 🔴
What would a rapid-takeoff AI-native company look like? Sketch out:
  • Mission & focus: what problem, what wedge, what makes it defensible
  • Funding: how much, what stages, what milestones unlock each round
  • Team & roles: who to hire first (and last), what each role's mission/focus looks like individually — not just titles but what each person should be obsessing over in months 1-6 vs 6-18
  • Velocity model: what enables rapid iteration — small team, AI leverage, tight feedback loops, what's automated vs human judgment
  • Anti-patterns: what slows down takeoff (premature scaling, wrong hires, too much process, consensus culture)
  • Calibration: study real rapid-takeoff examples (Midjourney: 11 people → $200M ARR, Cursor, Perplexity early days, Instagram pre-acquisition) — what did the org chart actually look like?
☐ Repro my PhD
Reproduce PhD research/results

Phase 1: Search fallback (4 open)

☐ If input isn't URL/event/question → browser search → fetch top result
☐ Add brain search "query" CLI command

Phase 2: Multi-result analysis (vario integration) (8 open)

▶

Unified NL input (CLI + UI) (10 open)

▶

Active Development (6 open)

▶

Research Queries (2 open)

☐ Develop research-oriented precursors (research_* in query_precursors.yaml)
  • These may be better as reusable analysis patterns than one-off prompts
  • Consider: composable prompt fragments vs monolithic prompts

Infrastructure (4 open)

☐ Add CLI command to list precursors by status
☐ Add test harness: run prompt against sample docs, compare outputs

Automation / Integration (8 open)

▶

Top Level (4 open)

☐ Streaming coalesce / incremental synthesis - Coalesce information as it arrives (fetches, LLM streams, chunks):
  • Real-time doc updates as data comes in (e.g., person search → update profile as each source is fetched)
  • Line numbers + content hashes for addressing ranges, detecting overlap
  • N LLMs propose content → shuffle lines into place → edit/unify in real time
  • Use case: parallel research streams merge into a single evolving document
  • Think: a collaborative doc where each source/model contributes lines, and the system detects redundancy and merges
☐ Live audio analysis for Tesla call - Real-time audio stream analysis for today's Tesla earnings call

Next (2 open)

☐ Try judging pipeline - Test the new --each flag end-to-end:

Features (12 open)

▶

Polish (2 open)

☐ Syntax highlighting theme for YAML (CodeMirror CSS overrides)

Explore (2 open)

☐ Collapsible messages in chat - Use Gradio's reasoning_tags or similar for collapsible system prompt display. See https://www.gradio.app/docs/gradio/chatbot#param-chatbot-reasoning-tags and https://www.gradio.app/docs/gradio/chatbot#examples

Completed (0 open)

▶

Ready to Test (8 open)

▶

Auto-Create Ingestion Wisdom (6 open)

▶

Verification Execution Engine (24 open)

▶

Questions to Resolve (6 open)

▶

Implementation (10 open)

▶

Free-Signup Paywall Sites (2 open)

☐ endpoints.news
Biotech/pharma news. Free signup gets limited articles. Test URL: https://endpoints.news/roivants-dealmaker-lands-81m-cash-bonus-following-drug-sale-to-roche/
  • Signup flow: email + password → limited free articles
  • Strategy: create the account once, persist session cookies, reuse them across fetches
  • Ties into Session & Login Management below

Tasks (10 open)

▶

Automation Mode Enhancements (6 open)

▶

Agent Quality (6 open)

▶

Testing (6 open)

▶

Implemented (0 open)

▶

Visual Verification (Priority) (34 open)

▶

Reference Appearance Screenshots (8 open)

▶

Top Priority (3 open)

☐ Newsflow Buildout: Topic-Driven News Intelligence
Transform newsflow from a monitoring pipeline into a browsable news product. Currently: manual search queries, results buried in job tables, no synthesis. Target: define a topic with a description → auto-generate queries → relevance-score articles → browse in a reader UI → get daily digests. Four phases: (1) topic→queries + LLM scoring, (2) browsable feed UI at newsflow.localhost, (3) daily/weekly digests with notifications, (4) semantic dedup + cross-topic connections. Full plan: docs/plans/2026-02-21-newsflow-buildout.md
☐ Section 351 ETFs
Scrape and collect all Section 351 ETFs. Research what's involved: tax-free exchange mechanism, which ETFs use it, fund structures, eligible securities, investor requirements. Build a comprehensive dataset of 351 ETFs with their holdings, launch dates, and conversion details.

VIC Cached Content Improvements (2 open)

☐ VIC styling in cached viewer
Static server serves cached VIC HTML but CSS/JS assets don't load (require VIC authentication). Options: (1) Extract description content only, serve in clean wrapper with basic styling, (2) Use VIC cookies to fetch/cache CSS/JS assets, (3) Inline critical styles directly in cached HTML. Current state: content is readable but unstyled. Related: static/server.py asset caching, jobs/data/vic_ideas/.share base_path config.

Dashboard Improvements (2 open)

☐ Paginate items in large jobs
Jobs with 500+ items are slow to load and unwieldy. Add pagination (page size ~50) to Pending/Done/Failed tabs in the detail view, with next/prev controls and item count display.
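The pagination math is worth pinning down before touching the UI; a sketch mapping a 1-indexed page to LIMIT/OFFSET bounds, using the ~50 page size suggested above (function name is an assumption, not an existing dashboard helper):

```python
def page_bounds(total, page, size=50):
    # 1-indexed page -> (offset, limit, n_pages) for a LIMIT/OFFSET query.
    n_pages = max(1, -(-total // size))   # ceiling division; at least one page
    page = min(max(page, 1), n_pages)     # clamp out-of-range next/prev clicks
    return (page - 1) * size, size, n_pages

offset, limit, n_pages = page_bounds(total=512, page=3)
# -> offset=100, limit=50, n_pages=11
```

Clamping in the helper keeps next/prev controls stateless: the view just asks for page+1 or page-1 and always gets a valid range for the item-count display.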

Runner Improvements (10 open)

▶

Job Event Log (Changelog) (8 open)

▶

Validator Stage Role (10 open)

▶

Success-Rate Circuit Breaker (1 open)

☐ Low success rate CB
Track success/fail ratio over a sliding window (last N items, default 20). Auto-pause when success rate drops below threshold (e.g., success_rate_min: 0.50). Catches intermittent failures that never cluster enough to trip the consecutive CB. Config per-stage in YAML:
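A sketch of the sliding-window breaker described above. The names `window` and `success_rate_min` mirror the YAML keys suggested in the item but are assumptions, not the runner's actual config schema:

```python
from collections import deque

class SuccessRateBreaker:
    # Sliding window over the last N item outcomes; trips (signals
    # auto-pause) when the success rate falls below the floor.
    def __init__(self, window=20, success_rate_min=0.50):
        self.outcomes = deque(maxlen=window)
        self.min_rate = success_rate_min

    def record(self, ok: bool) -> bool:
        """Record one outcome; return True if the breaker should trip."""
        self.outcomes.append(ok)
        # Don't trip before the window fills: early samples are too noisy.
        if len(self.outcomes) < self.outcomes.maxlen:
            return False
        rate = sum(self.outcomes) / len(self.outcomes)
        return rate < self.min_rate

cb = SuccessRateBreaker(window=10, success_rate_min=0.50)
tripped = [cb.record(i % 3 == 0) for i in range(12)]  # ~33% success
```

Because the window slides, three failures spread among successes can trip this breaker even though they would never cluster enough for a consecutive-failure breaker.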

Validation Circuit Breaker (6 open)

▶

Repair Workflow (6 open)

▶

New Job Ideas (6 open)

▶

Investment Research (2 open)

☐ Cheap power / US solar production
Research investment opportunities in cheap electricity and US-based solar manufacturing. Source: https://youtu.be/BYXbuik3dgA?si=6KqftryUmoChEqQa

New Sources (2 open)

☐ Local & Municipal Data: LLM-based Scrape
  • Goal: extract structured data from local/municipal government sites (permits, zoning, property records, council minutes, budgets, public notices).
  • Why: municipal data is high-value but poorly structured — PDFs, inconsistent HTML, no APIs. LLM extraction can normalize it into queryable knowledge.
  • Approach: browser automation (rivus/browser) + LLM extraction (brain/extract). Same pipeline as VIC/supplychain but pointed at gov sites.
  • Examples: building permits, zoning changes, city council agendas, public budget documents, property assessment records.

Cost Control (2 open)

☐ Multiple Max accounts in envs
Rotate/split API usage across accounts

Measure & Validate (2 open)

☐ Measure initial-only variant value: Does "T. Lastname" find any unique URLs that "Timothy Lastname" and "Tim Lastname" don't? Run 5-10 names, compare candidate URLs per variant. If initial-only never adds unique results, drop it to save Serper credits.
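The comparison is set arithmetic over per-variant result URLs; a sketch with fabricated data standing in for real Serper responses:

```python
# Hypothetical per-variant result sets for one name; real data would come
# from Serper responses for the three name variants.
results = {
    "Timothy Lastname": {"a.com/p1", "b.com/bio", "c.com/news"},
    "Tim Lastname":     {"a.com/p1", "d.com/interview"},
    "T. Lastname":      {"a.com/p1", "b.com/bio"},
}

def unique_contribution(variant, results):
    # URLs only this variant surfaced; an empty set over 5-10 names
    # makes the variant a candidate to drop (saving Serper credits).
    others = set().union(*(v for k, v in results.items() if k != variant))
    return results[variant] - others

uniq = unique_contribution("T. Lastname", results)  # empty here: drop candidate
```

Aggregating `unique_contribution` counts per variant across the 5-10 test names gives the drop/keep answer directly.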

Future Phases (16 open)

▶

Data Sources (4 open)

☐ Industry publications list
Thorough list of semiconductor/supply-chain publications
  • Rank by quality
  • Note cost vs free access
  • Categories: news, research, analyst reports, trade journals
  • Examples to evaluate: SemiEngineering, EETimes, DigiTimes, Semiconductor Digest, SEMI reports, TrendForce, IC Insights, Yole, etc.
☐ Paid data sources research
What's available only via subscription/enterprise
  • Capital IQ (S&P) — supply chain relationships, financials, private company data
  • Refinitiv/LSEG — supply chain data, ownership, estimates
  • Bloomberg Terminal — supply chain module (SPLC)
  • FactSet — supply chain relationships
  • Pitchbook — private company valuations
  • Gartner/IDC — market share reports
  • SEMI — industry reports, fab capacity data
  • Evaluate: coverage, cost tiers, API access, data freshness

Data Quality (6 open)

▶

Viewer Improvements (2 open)

☐ Add market cap data for seed companies (via finnhub or discover.py)

Transcript Analysis (6 open)

▶

Dash Explorer (6 open)

▶

Data Pipeline (8 open)

▶

High Priority (8 open)

▶

Medium Priority (6 open)

▶

Low Priority (6 open)

▶

Translation (4 open)

☐ Real-time WebSocket translation
Use OpenAI Realtime API for streaming transcription/translation during video playback. Would show live subtitles as video plays.
☐ Screen text translation
OCR on-screen text (signs, subtitles burned into video) and translate. Could use Tesseract or cloud vision APIs.

Autonomy: Pick TODOs from Sessions (1 open)

☐ Session-driven TODO discovery
Review today's sessions, extract mentioned items that aren't captured or prioritized in TODO.md. Could be bugs spotted, features discussed, ideas floated. Run as a periodic sweep or on-demand.

Jobs: Per-Resource Pacing (Option 2) (1 open)

☐ Pacing per-resource, not per-job
Only fetch stages need rate limiting (residential proxy). Extract/check_enrich read cached data + call Gemini Flash. Runner should skip pacing when reprocessing stages that don't use a rate-limited resource. See jobs/CLAUDE.md "Resource contention" section.

VIC: Paywall Items Need Re-fetch (1 open)

☐ Mark paywalled items for re-fetch
~1,172 items were fetched when account couldn't see content (cached HTML is the paywall page). Mark these with needs_refetch=true in DB, reset fetch stage to pending so they get re-fetched with current credentials. Extract should distinguish "parser failed on good HTML" from "HTML is the paywall page."

Review & Reduce AWS Spending (1 open)

☐ Audit Amazon AWS infra
Review all running services, identify what can be deleted or backed up to reduce costs. Check EC2, S3, Lambda, RDS, etc.

Reduce: Image Iteration with Auto-Refinement (1 open)

☐ Implement reduce image iteration
Extend brain/reduce/ with image modality support, an auto-refinement loop, and cost tracking. Design: docs/plans/2026-02-17-reduce-image-iteration-design.md. Key pieces:
  • [ ] refine.py — Prompt expansion (vague→concrete), auto-refine (critique→improved prompt), steer (user feedback→rewrite)
  • [ ] task.py — Add modality, prompt_history, costs, artifact_path, prompt_version, gen_cost fields
  • [ ] gen.py — Image generation path via lib/llm/image_gen when modality: image
  • [ ] app.py — Image gallery UI, cost display (per-candidate / session / daily), steer input, auto-loop controls
  • [ ] score.py — Read images from artifact_path for scoring (a base64 path already exists)

Learning System (2 open)

☐ Wire up principle application tracking
Effectiveness tab is inert (1 manual entry). Session review should detect when a principle was relevant to a session outcome and auto-call record_application(). Two paths: (1) post-session analysis matches applied patterns to principles, (2) real-time detection during sessions when a principle-aligned action succeeds/fails. Without this, the effectiveness feedback loop never closes.
☐ learn find optimization
Semantic search loads ALL embeddings into memory for cosine similarity. Fine for ~1K items, needs optimization (ANN index, or SQLite vector extension) when approaching 5K+. Also: evaluate whether Gemini title param adds value over structured prefix approach (A/B test with retrieval benchmark).
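For scale intuition, the current approach is an O(N) scan per query; a pure-Python sketch (toy 2-d vectors standing in for real embeddings) of exactly the part an ANN index or SQLite vector extension would replace:

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def find(query_vec, store, top_k=3):
    # Brute force: score every stored embedding, then sort.
    # Fine at ~1K items; this full scan is what needs an ANN index past ~5K.
    scored = sorted(((cosine(query_vec, v), k) for k, v in store.items()),
                    reverse=True)
    return [k for _, k in scored[:top_k]]

store = {"a": [1.0, 0.0], "b": [0.7, 0.7], "c": [0.0, 1.0]}
hits = find([1.0, 0.1], store, top_k=2)
```

With normalized embeddings, cosine reduces to a dot product, so the scan is one matrix-vector multiply in NumPy before an ANN index is even needed.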

Watch / Read (1 open)

☐ David George (a16z): State of Markets
AI markets deep dive, Jan 2026. https://www.youtube.com/watch?v=rSohMpT24SI / https://a16z.com/state-of-markets/

Session Intelligence Server (watch.api) (0 open)

☑ Implement watch.api.localhost
Unified session intelligence server (FastAPI, port 8130). Replaces 3 cold-start subprocess workers with one persistent server. Design: docs/plans/2026-02-19-session-intelligence-server-design.md. Key pieces:
  • [x] FastAPI server with /hook/prompt, /hist/{sid}, /state/{sid}, /health endpoints
  • [x] Hierarchical session tree as the canonical data model (badge + hist derived from it)
  • [x] Single haiku LLM call per prompt (tree + badge + theme in one shot)
  • [x] Server-side JSONL tail reading for rich context
  • [x] Pre-rendered /hist ASCII output (zero Claude tokens)
  • [x] curl one-liner hook (~/.claude/hooks/watch-api-hook.sh) — running alongside the existing handler
  • [x] Caddyfile + registry entry for watch.api.localhost
  • [ ] Extend learning/gyms/badge/ for tree + ablation testing
  • [ ] Cutover: remove the old subprocess workers after a validation period

Servers To Add (3 open)

☐ present server
Gradio UI for present/ (papers, diagrams, demos, blog posts, tweets, slides). Port TBD, present.localhost.
☐ benchmark server
Gradio UI for benchmarks/ (run configs, view results, compare models). Port TBD, bench.localhost.
☐ benchmark LLM backend
Use our hot runner (lib/llm) or litellm --proxy? Hot runner has subscription routing, caching, model aliases; litellm proxy gives OpenAI-compatible endpoint, load balancing, spend tracking. Decide and unify.

#3: Founder Evaluator (`TODO.md:68`) + Bright Data integration (`TODO.md:62`) — Impact 9/10, Effort L+M (1 open)

☐ Add MiniMax 2.5 model
Check litellm support for MiniMax-Text-01 / MiniMax 2.5 (456B MoE). Add to lib/llm model aliases and brain/vario config if available.

Ideas to Present 🎤 (2 open)

☐ Shadow model testing
Run candidate models in parallel on live LLM traffic, with async eval for improvement opportunities. The primary model serves the result; shadow models are logged and compared. Builds a data-driven case for model swaps ("grok-fast matched haiku 94% of the time at 1/5 the cost"). The gym tests prompt variants on replayed sessions × shadow testing evaluates model variants on live traffic → together they optimize both axes (prompt × model). See lib/llm/TODO.md for an implementation sketch.
☐ Session intelligence server
Unified hist/badge/title from one persistent server + a hierarchical session tree. Replaces 3 cold-start Python processes with a single curl hook. Observable tuning via gym (session replay + ablation). See docs/plans/2026-02-19-session-intelligence-server-design.md.

Private Data Access Tools β€” `lib/private_data/` (5 open)

β–Ά

Parallelization (critical β€” 10x speedup) (3 open)

☐ Fix experiment.py outer loops
strategies Γ— problems both sequential; should asyncio.gather across problems (with optional semaphore for rate limits)
☐ Fix fn_temperature_sweep
3 temperatures run sequentially in for-loop; use asyncio.gather
☐ Fix fn_lens_ensemble
5 lenses run sequentially in for-loop; use asyncio.gather
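The three fixes above share one pattern: replace the sequential for-loop with asyncio.gather, fanning out across strategies Γ— problems (or temperatures, or lenses), with an optional semaphore capping in-flight calls for rate limits. A sketch with a stand-in for the real LLM call:

```python
import asyncio

async def run_one(strategy: str, problem: str) -> str:
    """Stand-in for the real per-(strategy, problem) LLM call."""
    await asyncio.sleep(0)
    return f"{strategy}/{problem}"

async def run_all(strategies, problems, max_inflight: int = 8):
    sem = asyncio.Semaphore(max_inflight)  # rate-limit guard

    async def guarded(s, p):
        async with sem:
            return await run_one(s, p)

    # Fan out the full cross product; gather preserves argument order.
    return await asyncio.gather(*(guarded(s, p)
                                  for s in strategies for p in problems))

results = asyncio.run(run_all(["cot", "lens"], ["p1", "p2", "p3"]))
```

The same `guarded` wrapper works unchanged for fn_temperature_sweep and fn_lens_ensemble: swap the cross product for a list of temperatures or lenses.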

Model experiments (3 open)

☐ Run on gemini-3-pro
$2/$12 per 1M tokens, full experiment ~$20. Key question: do strategies still help when baseline is already strong?
☐ Rerun flash with parallelization
previous run killed due to sequential bottleneck + rate limits
☐ Compare across models
same strategies on flash vs pro vs haiku, combined report

Documentation (2 open)

☐ Add brain/strategies to CLAUDE.md
major subsystem (19 strategies, 10 stages, 9 lenses, 16 tags, move library) with zero documentation in project README
☐ Write brain/strategies/README.md
architecture, usage, experiment workflow

Benchmarks β€” Test Vario Strategies (8 open)

β–Ά

Eval (2 open)

☐ Vario A/B eval: benchmark configs against each other
Send identical requests to different vario configs (e.g., fast vs maxthink vs allthink, or custom strategy combos) and evaluate output quality. Sources: sample requests from real Claude Code sessions, ChatGPT/Gemini chat exports, brain app usage, and browser history (for URL-based queries). Pipeline: (1) curate a request corpus via lib/private_data/ importers (see main TODO.md β€” private data access tools), (2) run each request through N vario configs, (3) LLM-as-judge scores each output against a rubric (clarity, correctness, depth, actionability β€” rubric is per-domain, set up in advance), (4) aggregate scores per config, identify which configs win on which request types. Store results in experiments.db alongside regular runs. Goal: data-driven answer to "does vario complexity pay off?" and "which config for which task type?" Related: TODO.md:175 (brain benefit vs vanilla).
☐ Human eval UI for vario outputs
Gradio interface for human review of vario results. Show outputs side-by-side (blinded or labeled), let reviewer score on rubric dimensions, add free-text notes, flag interesting outputs. Features: (1) pull runs from experiments.db, (2) present as review queue (unreviewed first), (3) human scores stored alongside LLM judge scores for calibration, (4) agreement metrics between human and LLM judge (track where they diverge β€” that's where the rubric needs work), (5) export reviewed examples as few-shot calibration data for the LLM judge. This closes the loop: LLM judge does volume, human eval keeps it honest.
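Step (4) of the A/B pipeline, aggregation, is simple enough to pin down now. A sketch under the assumption that the judge emits per-dimension scores per (config, request); the judge itself (an LLM call) is replaced here by precomputed rows, and the field names are illustrative, not the experiments.db schema:

```python
from collections import defaultdict
from statistics import mean

# (config, request_id, {rubric dimension: judge score})
rows = [
    ("fast",     "r1", {"clarity": 4, "depth": 2}),
    ("maxthink", "r1", {"clarity": 4, "depth": 5}),
    ("fast",     "r2", {"clarity": 5, "depth": 3}),
    ("maxthink", "r2", {"clarity": 3, "depth": 4}),
]

def aggregate(rows):
    """Mean judge score per config per rubric dimension."""
    by_config = defaultdict(lambda: defaultdict(list))
    for config, _rid, dims in rows:
        for dim, score in dims.items():
            by_config[config][dim].append(score)
    return {c: {d: mean(v) for d, v in dims.items()}
            for c, dims in by_config.items()}

summary = aggregate(rows)
```

Slicing the same rows by request type instead of config answers the second question ("which config for which task type?").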

TODO: Fetchability Matrix Validation (LLM URL Tool Input) (3 open)

☐ Maintain fetchability contract for mode=auto|js|unlocker and escalation evidence
  • Gym spec: learning/gyms/fetchability/docs/FETCHABILITY_MATRIX_SPEC.md
  • Machine-readable matrix: learning/gyms/fetchability/tests/fixtures/fetchability_matrix.yaml
  • Parameterized tests: learning/gyms/fetchability/tests/test_fetchability_matrix.py
☐ Run live matrix probes with real paid URLs (Substack + Patreon) and record required means
  • Required env: BROWSER_TEST_SUBSTACK_PAID_URL, BROWSER_TEST_PATREON_PAID_URL
  • Optional auth flags: BROWSER_TEST_SUBSTACK_PAID_AUTH=1, BROWSER_TEST_PATREON_PAID_AUTH=1
☐ Capture baseline latency/cost for each first-success mode before wiring into lib/llm/tools.py

The Plan: Make It Go Up (6 open)

β–Ά

Dev / Debug UX (1 open)

☐ Stage timing in dossier view
Show how long each assessment stage took to run (wall-clock time). Useful in a debug/dev view to spot slow stages, compare across entities, and identify optimization targets. Record timestamps in stage result JSON, surface in data viewer.
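The recording half of this is a small wrapper: time the stage with a monotonic clock and write the duration into the stage result JSON. Field names below are assumptions, not the dossier schema:

```python
import json
import time

def timed_stage(name, fn, *args):
    """Run a stage fn and attach wall-clock duration to its result record."""
    t0 = time.monotonic()
    result = fn(*args)
    return {"stage": name,
            "result": result,
            "duration_s": round(time.monotonic() - t0, 3)}

record = timed_stage("extract", lambda text: text.upper(), "hi")
print(json.dumps(record))
```

The data viewer then just sorts stages by `duration_s` per entity to surface the slow ones.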

Assessment Stages (Person Dossier) (4 open)

☐ academic_prowess
Publication record, h-index, citation velocity (rising/declining), star co-authors, patents filed. Sources: OpenAlex API (api.openalex.org/authors?search=X, 90M authors, free), Semantic Scholar API (api.semanticscholar.org, 220M papers, free), ORCID, arXiv, DBLP, PatentsView for inventor lookup. Key: h-index alone is misleading β€” compare to field median; citation velocity and co-author network quality matter more.
☐ engineering_prowess
GitHub stats (stars, repos, contribution consistency, PR reviews given), package maintainership (npm/PyPI downstream dependents), language breadth, Stack Overflow reputation. Sources: GitHub REST/GraphQL API (5K req/hr with token), GH Archive (BigQuery, every public event since 2011), npm/PyPI registries, Stack Overflow API. Key: stars are noisy β€” look at contribution consistency, PR review activity, and maintained packages with real downstream users.
☐ publication_footprint
Books authored, articles/op-eds in major publications, newsletters (Substack/Medium), white papers. Sources: Google Books API (googleapis.com/books/v1/volumes?q=inauthor:X, 40M books, free), Open Library API, Serper site: searches for WSJ/HBR/Forbes/etc. Key: distinguish publication tier β€” op-ed in WSJ vs self-published blog = very different signal.
☐ media_footprint
Podcast appearances, conference keynotes, news mentions, Twitter/X following, YouTube presence. Sources: Listen Notes API (listennotes.com/api/v2/search?q=X&type=episode, 5M+ episodes, free tier 300/mo), Serper news, YouTube search via yt-dlp. Key: measures narrative control β€” does the person own their story (own blog/newsletter/podcast) or only appear when others write about them?
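For the academic_prowess stage, the OpenAlex author search mentioned above is a one-URL call. The sketch below builds that URL and summarizes a trimmed fixture instead of making a live request; the fields used (display_name, summary_stats.h_index, cited_by_count) follow the public OpenAlex author schema:

```python
from urllib.parse import urlencode

def author_search_url(name: str) -> str:
    """Documented OpenAlex author search endpoint."""
    return "https://api.openalex.org/authors?" + urlencode({"search": name})

def summarize(author: dict) -> dict:
    """Pull the fields the stage cares about from one author record."""
    stats = author.get("summary_stats", {})
    return {"name": author["display_name"],
            "h_index": stats.get("h_index"),
            "citations": author.get("cited_by_count")}

# Trimmed fixture standing in for one element of the live API's results list.
fixture = {"display_name": "Jane Doe",
           "summary_stats": {"h_index": 31},
           "cited_by_count": 4200}
print(author_search_url("Jane Doe"))
print(summarize(fixture))
```

Per the note on the item itself, the raw h_index should then be compared to a field median rather than reported alone.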

Phase 1: Error Classifier + Circuit Breaker βœ… DONE (0 open)

β–Ά

Phase 2: Per-Stage Pause (2 open)

☐ Per-stage pause instead of whole-job pause
When doctor pauses for a stage error (e.g. extract has a code bug), only that stage should stop. Items already past the broken stage (in score, check_enrich) should continue processing. Currently set_job_paused() is job-level β€” all stage workers stop.
  • Normal stage errors: pause only the broken stage. Items in downstream stages keep going.
  • Validator stage errors (e.g. check_enrich finds upstream extract is broken): pause the whole job β€” the problem is upstream, not just this stage.
  • Stage role: validator in YAML should signal this behavior.
  • Implementation: stage_paused dict in runner memory (lost on restart, but restart re-evaluates anyway). Or job_stage_state table in DB for persistence.
  • Dashboard: show per-stage pause status, not just job-level.
☐ Tiered auto-repair timing
Risk tiers exist but timing isn't enforced yet:
  • low β†’ act immediately (done)
  • medium β†’ Pushover notify β†’ wait 10 min β†’ act (TODO: delay mechanism)
  • high β†’ Pushover notify β†’ wait for business hours 9am-6pm PT (TODO: schedule check)

Phase 3: Deeper Intelligence (3 open)

☐ Version-aware error tracking
Record handler_version alongside each error. Enables: "all failures from handler v.abc123 deployed 2h ago" β†’ code bug vs "same version worked yesterday, errors at 3am" β†’ external issue.
☐ Success-rate circuit breaker
Track success/fail ratio over sliding window (last N items). Auto-pause below threshold.
☐ Batch diagnosis
doctor.diagnose_group(errors) β€” when multiple errors accumulate, look at them together: "5 errors: 4 are VIC cookie (same fingerprint), 1 timeout β†’ root cause is auth."
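The success-rate breaker above is a fixed-size window of recent outcomes plus a threshold check. A minimal sketch (window and threshold values are placeholders):

```python
from collections import deque

class CircuitBreaker:
    """Auto-pause when success rate over the last N items drops too low."""

    def __init__(self, window: int = 20, threshold: float = 0.5):
        self.outcomes: deque[bool] = deque(maxlen=window)
        self.threshold = threshold

    def record(self, ok: bool) -> None:
        self.outcomes.append(ok)   # deque drops the oldest automatically

    @property
    def tripped(self) -> bool:
        if len(self.outcomes) < self.outcomes.maxlen:
            return False           # not enough data to judge yet
        rate = sum(self.outcomes) / len(self.outcomes)
        return rate < self.threshold

cb = CircuitBreaker(window=4, threshold=0.5)
for ok in [True, False, False, False]:   # 25% success over a full window
    cb.record(ok)
assert cb.tripped
```

When `tripped` flips true, the runner would call the same pause path the doctor uses for systemic errors.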

Phase 4: Integration & Dashboard (4 open)

☐ doctor.watch surfaces job errors
Job runner errors flow through doctor.watch pipeline: fingerprinted, deduplicated, LLM-analyzed, with f to fix and s to silence.
☐ jobs/diagnose.py becomes thin wrapper
Delegates to doctor.classify_error(). Regex stays as offline-only fallback.
☐ Dashboard: doctor actions timeline
Show what doctor did, when, why, outcome. Filter by job, risk tier, action type.
☐ Dashboard: error classification health
Per-job error breakdown by classification. "vic_ideas: 12 systemic, 0 transient, 3 item_specific" at a glance.

Learning Core ↔ Skillz ↔ CC Skills Unification (6 open)

β–Ά

Hybrid Retrieval for `learn find` (5 open)

β–Ά

Skill Consolidation: `/learn`, `/reflect`, `/recall` (4 open)

☐ Rename skill files: learning β†’ learn, apply-learnings β†’ recall
☐ Create /reflect skill
☐ Update howto/skill-triggers.md
☐ Keep old names as aliases during transition

Principle Category Cleanup (3 open)

☐ Add learn rename CLI command (update slug + file in DB, regenerate materialized .md)
☐ Migrate development/internalize-the-verification-loop β†’ dev/
☐ Migrate development/commit-by-logical-intent β†’ dev/

Layout Review Skill (3 open)

☐ Create ~/.claude/skills/layout-review/
workflow: screenshot β†’ identify issues β†’ propose fixes β†’ implement β†’ verify with before/after screenshots
☐ Should reference gradio-layout skill for CSS gotchas
☐ Interface with explorations/gradio_layout_gym/compliance.py for pre-checks

Gemini 3.1 Pro (`gemini-3.1-pro-preview`) β€” 2026-02-19 (3 open)

☐ Run /model-update to swap gemini alias to 3.1 Pro
☐ Re-test subscription route periodically (for flat-rate billing)
☐ Check if Gemini 3.1 Flash is announced

GPT-5.3-Codex (`gpt-5.3-codex`) β€” subscription works, standard API pending (3 open)

☐ Check OpenAI API changelog for gpt-5.3-codex standard API availability
☐ Consider adding codex-spark alias when Spark gets API access
☐ Evaluate upgrading to ChatGPT Pro subscription
needed for gpt-5.3-codex-spark (real-time coding model, currently Pro-only)

Grok Code v2 (multimodal + parallel tools) β€” in training (1 open)

☐ Watch xAI release notes for grok-code-fast-2 or similar

Grok 4.1 Fast (non-reasoning) β€” Use More Aggressively (5 open)

β–Ά

Shadow Model Testing & Async Eval 🎀 (present this!) (6 open)

β–Ά

Autonomous Work Protocol (5 open)

β–Ά

πŸ“Š timdata

To Do (4 open)

☐ Become AI advisor to 10110: Explore advisory/consulting relationship with 10110 on AI strategy.
☐ Create a Family Data MCP Server:
  • Goal: Build a Model Context Protocol (MCP) server to act as a relay for family information.
  • Features:
  • Authentication: Implement secure authentication (likely OAuth 2.0 for Google services).
  • Relay: Provide requested information to the LLM.
  • Data Filling: If information is missing, make a note for the user to provide it later to fill in the blanks and store it in this repository.
  • Integrations: Hook up to Google Calendar (investigate existing MCP servers or build custom using Google Workspace APIs).
☐ Food Automation:
  • Goal: Check whether food ordering can be automated through an API with an LLM assistant.
  • APIs to Check: Uber Eats API, DoorDash Drive API.
☐ Home Monitoring Automation:
  • Goal: Investigate if the Ring camera can be accessed via API (likely unofficial/community-maintained like ring-mqtt or Node-based wrappers).
  • Use Case: Detect and track Tara's morning walks for better routine management.