Planner Autonomy: ранжировать catalog chain templates

2026-05-01 14:02:55 +03:00 · 2026-05-01 14:02:55 +03:00 · 4dcffef7d6
parent ccfa9283e9
commit 4dcffef7d6
8 changed files with 190 additions and 2 deletions
--- a/planner_autonomy_consolidation_2026-05-01.md
+++ b/planner_autonomy_consolidation_2026-05-01.md
@ -110,6 +110,14 @@ The next local scoring step broadened metadata-surface autonomy without adding a
 - inferred catalog surfaces instantiate `catalog_drilldown`;
 - mixed or ambiguous surfaces still do not guess and continue through clarification / explicit data-need scoring.
 The following consolidation step added catalog-level chain-template scoring:
 - `assistantMcpCatalogIndex` can now score reviewed `chain_templates` directly from fact family, action family, required axes, comparison, ranking, and aggregation needs;
 - comparison-shaped value-flow ranks `value_flow_comparison` above the generic value-flow template;
 - ranking-shaped value-flow ranks `value_flow_ranking` above the generic value-flow template;
 - document/movement/inventory/lifecycle templates can now be inspected as catalog search results, not only as local planner branch constants;
 - `assistantMcpDiscoveryPlanner` now records the top catalog chain-template match in reason codes while preserving existing guarded execution behavior.
 ## Why This Matters
 This reduces the pressure to add one hard route per user wording.
@ -183,6 +191,13 @@ Latest validation after unambiguous metadata-surface lane inference:
 - live inventory full-pack attempt: `inventory_stock_exact_bridge_live_20260501_after_runtime_bridge`, status `partial`
 - live attempt interpretation: route/intent/recipe/capability selection matched, but MCP execution failed with `MCP fetch failed: This operation was aborted`; direct proxy `get_metadata` also timed out while `/health` reported `active_sessions_count=0` and pending commands, so this is an infrastructure/polling-session blocker rather than accepted semantic evidence.
 Latest validation after catalog chain-template scoring:
 - targeted catalog/planner tests: passed, `54 passed`
 - full MCP-discovery suite: passed, `282 passed`, `9 skipped`
 - `npm.cmd run build`: passed
 - graphify rebuild: `5938 nodes`, `12903 edges`, `139 communities`
 ## Next Step
 The next safe step is to re-run live replay once the 1C side is actively polling the proxy, then continue into broader reviewed scoring.
--- a/architecture_turnaround/README.md
+++ b/architecture_turnaround/README.md
@ -80,6 +80,7 @@ It now documents a turnaround that is already operational in code, already mater
  - runtime bridge and answer adapter now keep unsupported inventory route templates behind an explicit user-facing boundary instead of letting template planning look like confirmed stock/supplier/purchase/sale evidence;
  - inventory catalog templates now bridge through existing exact inventory recipes (`41.01` scoped stock, supplier overlap, purchase provenance, and sale trace) inside the bounded MCP discovery pilot, while missing selected-item anchors still clarify instead of guessing;
  - unambiguous metadata surfaces can now infer the next reviewed lane from `Document.*`, `Register.*`, or `Catalog.*` objects even before upstream labels `downstream_route_family`, while mixed surfaces still do not guess;
  - catalog index now scores reviewed chain templates directly from fact/action/axis/comparison/ranking needs, and planner exposes the top catalog chain match in reason codes;
  - live map sync: [20 - planner_autonomy_consolidation_2026-05-01.md](./20%20-%20planner_autonomy_consolidation_2026-05-01.md)
 Current honest status:
@ -91,8 +92,8 @@ Current honest status:
 - open-world bounded-autonomy readiness: `~85%`
 - Post-F semantic integrity module progress: `~99%` operationally closed, with remaining risk now treated as next-slice discovery rather than an open blocker inside the closed slice
 - active inventory-stock breadth slice progress: `100%` for the declared scenario pack, not for arbitrary inventory questions
- Planner Autonomy Consolidation progress: `~78%` for the declared module, with catalog-fabric, value-flow arbitration, lifecycle bounded inference, broad-evaluation bridge, inventory catalog templates, inventory runtime-boundary honesty, exact inventory recipe bridging, and unambiguous metadata-surface lane inference validated locally, but live replay for the new bridge is currently blocked by missing active 1C polling and broader unfamiliar 1C asks still need replay-backed growth
+- Planner Autonomy Consolidation progress: `~80%` for the declared module, with catalog-fabric, value-flow arbitration, lifecycle bounded inference, broad-evaluation bridge, inventory catalog templates, inventory runtime-boundary honesty, exact inventory recipe bridging, unambiguous metadata-surface lane inference, and catalog chain-template scoring validated locally, but live replay for the new bridge is currently blocked by missing active 1C polling and broader unfamiliar 1C asks still need replay-backed growth
- graph snapshot after latest rebuild: `5937 nodes`, `12899 edges`, `138 communities`
+- graph snapshot after latest rebuild: `5938 nodes`, `12903 edges`, `139 communities`
 - current breakpoint:
  - the validated hot paths are no longer structurally broken;
  - flagship continuity collapse is no longer the primary risk;
@ -137,6 +138,7 @@ Latest live proof now includes:
 - inventory exact-runtime bridge accepted locally: runtime-bridge/answer-adapter/pilot-executor slice passed `70/70` with `1` skipped; full MCP-discovery slice passed `279/279` with `9` skipped; build passed; graphify rebuilt to `5930 nodes`, `12884 edges`, `135 communities`
 - unambiguous metadata-surface lane inference accepted locally: planner slice passed `36/36`; full MCP-discovery slice passed `281/281` with `9` skipped; build passed; graphify rebuilt to `5937 nodes`, `12899 edges`, `138 communities`
 - live inventory exact-bridge rerun `inventory_stock_exact_bridge_live_20260501_after_runtime_bridge` is recorded as infrastructure-blocked, not accepted: route/intent/recipe/capability matched, but MCP calls aborted and direct `get_metadata` timed out while proxy health showed `active_sessions_count=0` with pending commands
 - catalog chain-template scoring accepted locally: catalog/planner slice passed `54/54`; full MCP-discovery slice passed `282/282` with `9` skipped; build passed; graphify rebuilt to `5938 nodes`, `12903 edges`, `139 communities`
 Current architectural reading:
--- a/llm_normalizer/backend/dist/services/assistantMcpCatalogIndex.js
+++ b/llm_normalizer/backend/dist/services/assistantMcpCatalogIndex.js
@ -3,6 +3,7 @@ Object.defineProperty(exports, "__esModule", { value: true });
 exports.ASSISTANT_MCP_CATALOG_PLAN_REVIEW_SCHEMA_VERSION = exports.ASSISTANT_MCP_CATALOG_INDEX_SCHEMA_VERSION = void 0;
 exports.searchAssistantMcpCatalogPrimitivesByDecompositionCandidates = searchAssistantMcpCatalogPrimitivesByDecompositionCandidates;
 exports.searchAssistantMcpCatalogPrimitivesByFactAxis = searchAssistantMcpCatalogPrimitivesByFactAxis;
 exports.searchAssistantMcpCatalogChainTemplatesByFactAxis = searchAssistantMcpCatalogChainTemplatesByFactAxis;
 exports.searchAssistantMcpCatalogPrimitivesByMetadataSurface = searchAssistantMcpCatalogPrimitivesByMetadataSurface;
 exports.buildAssistantMcpCatalogIndex = buildAssistantMcpCatalogIndex;
 exports.getAssistantMcpCatalogPrimitive = getAssistantMcpCatalogPrimitive;
@ -700,6 +701,53 @@ function searchAssistantMcpCatalogPrimitivesByFactAxis(input) {
        .sort((left, right) => right.score - left.score)
        .map((item) => item.primitive);
 }
 function searchAssistantMcpCatalogChainTemplatesByFactAxis(input) {
    const requiredAxisSet = toStringSet(input.required_axes ?? []);
    const desiredTags = tagSetFromFactAxisInput({
        business_fact_family: input.business_fact_family,
        action_family: input.action_family,
        required_axes: input.required_axes,
        comparison_need: input.comparison_need,
        ranking_need: input.ranking_need,
        aggregation_need: input.aggregation_need
    });
    const scored = [];
    for (const template of CHAIN_TEMPLATES) {
        const factMatch = matchesPlanningToken(input.business_fact_family, template.supported_fact_families);
        const actionMatch = matchesPlanningToken(input.action_family, template.supported_action_families);
        const tagMatches = template.planning_tags.filter((tag) => desiredTags.has(normalizePlanningToken(tag)));
        const axisOverlap = template.base_required_axes.filter((axis) => requiredAxisSet.has(axis)).length;
        let score = 0;
        if (factMatch) {
            score += 8;
        }
        if (actionMatch) {
            score += 5;
        }
        score += tagMatches.length * 2;
        score += axisOverlap;
        if (input.comparison_need && template.planning_tags.some((tag) => normalizePlanningToken(tag) === "comparison")) {
            score += 6;
        }
        if (input.ranking_need && template.planning_tags.some((tag) => normalizePlanningToken(tag) === "ranking")) {
            score += 6;
        }
        if (input.aggregation_need === "by_month" &&
            template.planning_tags.some((tag) => normalizePlanningToken(tag) === "monthly_aggregation")) {
            score += 4;
        }
        if (score <= 0) {
            continue;
        }
        scored.push({
            chainId: template.chain_id,
            score
        });
    }
    return scored
        .sort((left, right) => right.score - left.score)
        .map((item) => item.chainId);
 }
 function searchAssistantMcpCatalogPrimitivesByMetadataSurface(input) {
    const allowAggregateByAxis = input.allow_aggregate_by_axis !== false;
    const requiredAxisSet = toStringSet(input.required_axes ?? []);
--- a/llm_normalizer/backend/dist/services/assistantMcpDiscoveryPlanner.js
+++ b/llm_normalizer/backend/dist/services/assistantMcpDiscoveryPlanner.js
@ -278,6 +278,20 @@ function selectPrimitivesFromGraphAndCatalog(input) {
    if (factAxisPrimitives.length > 0) {
        reasonCodes.push("planner_selected_catalog_primitives_from_fact_axis_search");
    }
    const chainTemplateMatches = input.dataNeedGraph
        ? (0, assistantMcpCatalogIndex_1.searchAssistantMcpCatalogChainTemplatesByFactAxis)({
            business_fact_family: input.dataNeedGraph.business_fact_family,
            action_family: input.actionFamily ?? input.dataNeedGraph.action_family,
            required_axes: input.requiredAxes,
            comparison_need: input.dataNeedGraph.comparison_need,
            ranking_need: input.dataNeedGraph.ranking_need,
            aggregation_need: input.dataNeedGraph.aggregation_need
        })
        : [];
    if (chainTemplateMatches.length > 0) {
        reasonCodes.push("planner_scored_catalog_chain_templates_from_fact_axis");
        reasonCodes.push(`planner_catalog_chain_template_search_top_${chainTemplateMatches[0]}`);
    }
    const combinedCatalogPrimitives = [];
    for (const primitive of decompositionPrimitives) {
        if (!combinedCatalogPrimitives.includes(primitive)) {
--- a/llm_normalizer/backend/src/services/assistantMcpCatalogIndex.ts
+++ b/llm_normalizer/backend/src/services/assistantMcpCatalogIndex.ts
@ -622,6 +622,15 @@ export interface AssistantMcpCatalogFactAxisSearchInput {
  allow_aggregate_by_axis?: boolean;
 }
 export interface AssistantMcpCatalogChainTemplateSearchInput {
  business_fact_family?: string | null;
  action_family?: string | null;
  required_axes?: string[];
  comparison_need?: string | null;
  ranking_need?: string | null;
  aggregation_need?: string | null;
 }
 export interface AssistantMcpCatalogPrimitiveSearchInput {
  decomposition_candidates: string[];
  allow_aggregate_by_axis?: boolean;
@ -847,6 +856,62 @@ export function searchAssistantMcpCatalogPrimitivesByFactAxis(
    .map((item) => item.primitive);
 }
 export function searchAssistantMcpCatalogChainTemplatesByFactAxis(
  input: AssistantMcpCatalogChainTemplateSearchInput
 ): AssistantMcpCatalogChainTemplateId[] {
  const requiredAxisSet = toStringSet(input.required_axes ?? []);
  const desiredTags = tagSetFromFactAxisInput({
    business_fact_family: input.business_fact_family,
    action_family: input.action_family,
    required_axes: input.required_axes,
    comparison_need: input.comparison_need,
    ranking_need: input.ranking_need,
    aggregation_need: input.aggregation_need
  });
  const scored: Array<{ chainId: AssistantMcpCatalogChainTemplateId; score: number }> = [];
  for (const template of CHAIN_TEMPLATES) {
    const factMatch = matchesPlanningToken(input.business_fact_family, template.supported_fact_families);
    const actionMatch = matchesPlanningToken(input.action_family, template.supported_action_families);
    const tagMatches = template.planning_tags.filter((tag) => desiredTags.has(normalizePlanningToken(tag)));
    const axisOverlap = template.base_required_axes.filter((axis) => requiredAxisSet.has(axis)).length;
    let score = 0;
    if (factMatch) {
      score += 8;
    }
    if (actionMatch) {
      score += 5;
    }
    score += tagMatches.length * 2;
    score += axisOverlap;
    if (input.comparison_need && template.planning_tags.some((tag) => normalizePlanningToken(tag) === "comparison")) {
      score += 6;
    }
    if (input.ranking_need && template.planning_tags.some((tag) => normalizePlanningToken(tag) === "ranking")) {
      score += 6;
    }
    if (
      input.aggregation_need === "by_month" &&
      template.planning_tags.some((tag) => normalizePlanningToken(tag) === "monthly_aggregation")
    ) {
      score += 4;
    }
    if (score <= 0) {
      continue;
    }
    scored.push({
      chainId: template.chain_id,
      score
    });
  }
  return scored
    .sort((left, right) => right.score - left.score)
    .map((item) => item.chainId);
 }
 export function searchAssistantMcpCatalogPrimitivesByMetadataSurface(
  input: AssistantMcpCatalogMetadataSurfaceSearchInput
 ): AssistantMcpDiscoveryPrimitive[] {
--- a/llm_normalizer/backend/src/services/assistantMcpDiscoveryPlanner.ts
+++ b/llm_normalizer/backend/src/services/assistantMcpDiscoveryPlanner.ts
@ -6,6 +6,7 @@ import {
 } from "./assistantMcpDiscoveryPolicy";
 import {
  getAssistantMcpCatalogChainTemplate,
  searchAssistantMcpCatalogChainTemplatesByFactAxis,
  searchAssistantMcpCatalogPrimitivesByDecompositionCandidates,
  searchAssistantMcpCatalogPrimitivesByFactAxis,
  searchAssistantMcpCatalogPrimitivesByMetadataSurface,
@ -457,6 +458,21 @@ function selectPrimitivesFromGraphAndCatalog(input: {
    reasonCodes.push("planner_selected_catalog_primitives_from_fact_axis_search");
  }
  const chainTemplateMatches = input.dataNeedGraph
    ? searchAssistantMcpCatalogChainTemplatesByFactAxis({
        business_fact_family: input.dataNeedGraph.business_fact_family,
        action_family: input.actionFamily ?? input.dataNeedGraph.action_family,
        required_axes: input.requiredAxes,
        comparison_need: input.dataNeedGraph.comparison_need,
        ranking_need: input.dataNeedGraph.ranking_need,
        aggregation_need: input.dataNeedGraph.aggregation_need
      })
    : [];
  if (chainTemplateMatches.length > 0) {
    reasonCodes.push("planner_scored_catalog_chain_templates_from_fact_axis");
    reasonCodes.push(`planner_catalog_chain_template_search_top_${chainTemplateMatches[0]}`);
  }
  const combinedCatalogPrimitives: AssistantMcpDiscoveryPrimitive[] = [];
  for (const primitive of decompositionPrimitives) {
    if (!combinedCatalogPrimitives.includes(primitive)) {
--- a/llm_normalizer/backend/tests/assistantMcpCatalogIndex.test.ts
+++ b/llm_normalizer/backend/tests/assistantMcpCatalogIndex.test.ts
@ -5,6 +5,7 @@ import {
  getAssistantMcpCatalogChainTemplate,
  getAssistantMcpCatalogPrimitive,
  reviewAssistantMcpDiscoveryPlanAgainstCatalog,
  searchAssistantMcpCatalogChainTemplatesByFactAxis,
  searchAssistantMcpCatalogPrimitivesByDecompositionCandidates,
  searchAssistantMcpCatalogPrimitivesByFactAxis,
  searchAssistantMcpCatalogPrimitivesByMetadataSurface
@ -151,6 +152,30 @@ describe("assistant MCP catalog index", () => {
    expect(primitives).toEqual(["resolve_entity_reference", "query_documents", "probe_coverage"]);
  });
  it("can score reviewed chain templates directly from fact family and required axes", () => {
    const documentTemplates = searchAssistantMcpCatalogChainTemplatesByFactAxis({
      business_fact_family: "document_evidence",
      action_family: "list_documents",
      required_axes: ["counterparty", "period", "coverage_target"]
    });
    const comparisonTemplates = searchAssistantMcpCatalogChainTemplatesByFactAxis({
      business_fact_family: "value_flow",
      action_family: "net_value_flow",
      comparison_need: "incoming_vs_outgoing",
      required_axes: ["organization", "period", "amount", "coverage_target"]
    });
    const rankingTemplates = searchAssistantMcpCatalogChainTemplatesByFactAxis({
      business_fact_family: "value_flow",
      action_family: "turnover",
      ranking_need: "top_desc",
      required_axes: ["organization", "period", "aggregate_axis", "amount", "coverage_target"]
    });
    expect(documentTemplates[0]).toBe("document_evidence");
    expect(comparisonTemplates[0]).toBe("value_flow_comparison");
    expect(rankingTemplates[0]).toBe("value_flow_ranking");
  });
  it("can search reviewed primitives for inventory stock snapshot chains", () => {
    const primitives = searchAssistantMcpCatalogPrimitivesByFactAxis({
      business_fact_family: "inventory_stock_snapshot",
--- a/llm_normalizer/backend/tests/assistantMcpDiscoveryPlanner.test.ts
+++ b/llm_normalizer/backend/tests/assistantMcpDiscoveryPlanner.test.ts
@ -51,6 +51,8 @@ describe("assistant MCP discovery planner", () => {
    expect(result.reason_codes).toContain("planner_enabled_chunked_coverage_probe_budget");
    expect(result.reason_codes).toContain("planner_consumed_data_need_graph_v1");
    expect(result.reason_codes).toContain("planner_selected_catalog_primitives_from_decomposition_candidates");
    expect(result.reason_codes).toContain("planner_scored_catalog_chain_templates_from_fact_axis");
    expect(result.reason_codes).toContain("planner_catalog_chain_template_search_top_value_flow");
  });
  it("keeps a value-flow plan in clarification state when period axis is missing", () => {
@ -145,6 +147,7 @@ describe("assistant MCP discovery planner", () => {
    expect(result.proposed_primitives).toEqual(["resolve_entity_reference", "query_documents", "probe_coverage"]);
    expect(result.reason_codes).toContain("planner_selected_catalog_primitives_from_fact_axis_search");
    expect(result.reason_codes).toContain("planner_instantiated_catalog_chain_template_document_evidence");
    expect(result.reason_codes).toContain("planner_catalog_chain_template_search_top_document_evidence");
    expect(result.reason_codes).not.toContain("planner_fell_back_to_recipe_primitives_after_empty_catalog_search");
  });