feat: domain-aware normalizer for assumption validation (Phase 2)#72
Closed
82deutschmark wants to merge 5 commits into
Closed
feat: domain-aware normalizer for assumption validation (Phase 2)#7282deutschmark wants to merge 5 commits into
82deutschmark wants to merge 5 commits into
Conversation
- Loads domain profiles (Carpenter, Dentist, Personal) from YAML - Auto-detects domain from assumption signals (currency, units, keywords) - Normalizes currency to domain default + EUR equivalent - Normalizes units to metric (with conversion tables) - Re-assesses confidence per domain keywords - Batch normalization support - Unit tests cover detection, normalization, conversions Addresses Simon's feedback on hardcoded lists + Mark's requirement for clean, domain-aware outputs for AI agents.
Collaborator
Author
|
Closing this PR per Simon's review. The approach has fundamental issues that need proper architecture first:
This feature is being split into prerequisite components, each as a separate PR:
Each component will be submitted as a focused, reviewable PR. The proposals will be submitted to |
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
Analyzes three approaches (Python module, Docker service, MCP tool) with pros/cons for each. Recommends starting with a Python module with local cache, evolving to hybrid module+MCP. Replaces hardcoded rates from PR PlanExeOrg#72.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
Designs a tiered data architecture for real-world cost lookups: - Tier 1: Authoritative sources (ILO, World Bank, BLS) - Tier 2: Interpolation (geographic, PPP adjustment, economic similarity) - Tier 3: LLM-assisted estimation with confidence scoring Includes data model, interpolation strategy, phased implementation plan, and integration points with PlanExe pipeline. Replaces hardcoded cost assumptions from PR PlanExeOrg#72.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
Analyzes three approaches (Python module, Docker service, MCP tool) with pros/cons for each. Recommends starting with a Python module with local cache, evolving to hybrid module+MCP. Replaces hardcoded rates from PR PlanExeOrg#72.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Documents the original vision of PR PlanExeOrg#72 (domain detection, currency normalization, unit conversion, confidence calibration, Fermi sanity checks), redesigned to build on the prerequisite Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146) proposals. Split into 4 implementation phases, each as a separate future PR. Addresses all review feedback from PR PlanExeOrg#72 (no hardcoded rates, no fragile paths, standard logger convention, configurable profiles).
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
82deutschmark
added a commit
to VoynichLabs/PlanExe2026
that referenced
this pull request
Mar 6, 2026
… intent) Comprehensive design document covering: - Domain detection and assumption extraction - Fermi sanity checking against real-world benchmarks - Currency/unit normalization via Currency Service - Confidence calibration per domain - Ethical framework: fair wage floors, workplace safety standards, child labor detection, working conditions assessment Built on Currency Service (PR PlanExeOrg#147) and Resource/Cost Data (PR PlanExeOrg#146). Split into 4 phased implementation PRs. Addresses all PR PlanExeOrg#72 feedback.
neoneye
added a commit
that referenced
this pull request
Mar 6, 2026
…izer proposal: Domain-aware assumption normalizer (captures PR #72 intent)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Phase 2: Domain-Aware Assumption Normalizer
What this does
Adds a
DomainNormalizerthat auto-detects the domain of a plan (carpenter, dentist, personal, startup, non-profit) and normalizes QuantifiedAssumptions accordingly:Why it matters
PlanExe's role in 2026 is as a trusted auditing layer for autonomous agents — not just plan generation. Agents run in bubbles and hallucinate assumptions. This normalizer ensures FermiSanityCheck speaks the right language for each domain before flagging assumptions as sane or suspect.
Test results
11/11 tests pass (8 normalizer + 3 FermiSanityCheck).
DAG placement
MakeAssumptions → FermiSanityCheck → DomainNormalizer → DistillAssumptionsRelated