[3] Retrieval and Nutrition | Eckford on the side

Chapters:

Scrum-master candy version

DONE

four-layer architecture aligned
Chainlit protected
deterministic lookup remains primary
Chroma remains support only
ingredient_lines_raw pinned as ingestion truth
raw lines preserved exactly as list[str]
structured parsing derived from raw lines
fallback extraction not allowed to wipe explicit lines
clean separation between original and modified ingredient state
nutrition can run against original or substituted ingredient array, depending on user intent

Next, but not a redesign

implement ingredient_lines_current cleanly in runtime/editor flow
route nutrition requests to raw vs current explicitly
keep resolution grounded in data/eurofir_mediterranean.csv

Data Maintenance Layer
Index / Build Layer
Nutrition Intelligence Layer
Experience / Runtime Layer

                  NUTRITION INTELLIGENCE PLATFORM
                  ===============================

   [1] Data Maintenance Layer
   --------------------------
   Source of truth:
   data/eurofir_mediterranean.csv

   Purpose:
   - curated nutrient data
   - authoritative maintained dataset

   Status:
   [DONE]


   [2] Index / Build Layer
   -----------------------
   Components:
   rag_setup/*

   Purpose:
   - enrichment
   - compatibility exports
   - Chroma preparation
   - internal shaping only

   Rule:
   - support layer
   - not runtime truth

   Status:
   [DONE]


   [3] Nutrition Intelligence Layer
   --------------------------------
   Components:
   nutrition_lookup.py
   nutrition_calculator.py

   Purpose:
   - deterministic lookup
   - nutrient calculation
   - stable structured outputs

   Rule:
   - runtime must consume this layer
   - do not bypass with retrieval

   Status:
   [DONE]


   [4] Experience / Runtime Layer
   ------------------------------
   Components:
   multi_agent_chatbot/*
   Chainlit
   Flask viewer

   Purpose:
   - user interaction
   - orchestration
   - recipe editing / inspection
   - presentation

   Rule:
   - Chainlit stays
   - consumers only
   - no raw data leakage upward

   Status:
   [DONE]



                  INGREDIENT FLOW
                  ===============

   Upstream provided ingredient lines
                  |
                  v
      ingredient_lines_raw : list[str]
      - preserve exactly
      - preserve order
      - no merge
      - no rewrite
      - no dedupe
      - survives parse failure

      Status:
      [DONE]


                  |
                  v
      structured ingredient parsing
      - derived from ingredient_lines_raw
      - must not overwrite raw lines
      - fallback must not erase explicit lines

      Status:
      [DONE]


                  |
                  v
      ingredient_lines_current : list[str]
      - current editable/substituted state
      - separate from raw
      - used for post-substitution analysis

      Status:
      [DONE as design direction]


                  NUTRITION QUERY PATH
                  ====================

   User asks for nutrition
            |
            v
   Which state is being asked about?
      |                       |
      |                       |
      v                       v
   Original recipe         Modified recipe
      |                       |
      v                       v
   ingredient_lines_raw    ingredient_lines_current
      \                       /
       \                     /
        v                   v
      Nutrition Intelligence Layer
      - deterministic lookup
      - resolve foods against
        data/eurofir_mediterranean.csv

      Status:
      [DONE as agreed direction]


                  IMPORTANT RULES
                  ===============

   Original ingestion truth:
   - ingredient_lines_raw

   Runtime modified truth:
   - ingredient_lines_current

   Must not do:
   - overwrite raw with edited lines
   - bypass deterministic lookup
   - treat Chroma as nutrient truth
   - redesign the architecture

   Status:
   [DONE]

Yes, but only a few, and they should stay small.

Before wiring, I would lock tool-boundary decisions, not new architecture.

Decisions to make now

1. Who owns retrieval?

This should be the Experience / Runtime Layer, not the Nutrition Intelligence Layer.

That means:

Flask / recipe runtime chooses the source
runtime assembles the handoff payload
Nutrition Agent receives the payload, not raw UI state

2. What is the handoff shape?

You already have the core sentence. I’d freeze it into a tiny input contract:

{
    "ingredient_lines": list[str],
    "ingredient_source": "raw" | "current",
    "recipe_id": str | None,
}

Optional later:

servings
request_context
exclude_categories like herbs/aromatics

But not yet unless already needed.

3. What sources are legal?

Decide this explicitly now:

allowed: ingredient_lines_raw
allowed: ingredient_lines_current
not allowed: body text reconstruction
not allowed: Chroma-derived ingredient guesses
not allowed: parsed objects as the primary handoff

That prevents six side doors from quietly appearing.

4. What does the Nutrition tool accept?

Decide whether the Nutrition Agent gets:

one line at a time, or
a batch of lines

For this project, batch is probably cleaner:

run_nutrition_from_ingredient_lines(payload) -> RecipeNutritionResult

Internally it can still process line-by-line.

5. What status vocabulary will the tool use?

Freeze a tiny set now so logs and debug UI do not go feral.

For retrieval:

READY
FALLBACK_TO_RAW
EMPTY
INVALID

For nutrition:

RESOLVED
PARTIAL
UNRESOLVED
INVALID

6. What happens when current is requested but missing?

Decide now:

if ingredient_source="current" and current exists → use it
if ingredient_source="current" and current is absent → fallback to raw, or fail

I would use:

fallback to raw
emit FALLBACK_TO_RAW

That is practical and visible.

7. Where does herb filtering live?

Not in retrieval.

Retrieval should pass ordered list[str].
Herb/non-herb decisions belong inside the nutrition-side logic or a nutrition adapter right before lookup.

That keeps the Retrieval Door boring, which is good.

What you do not need to decide yet

You do not need to decide now:

alias persistence details
Chroma fallback behavior
better substitution logic
perfect ingredient parsing
recipe architecture changes

Those are different doors.

My recommendation

Lock these three things before wiring:

retrieval ownership = runtime layer
handoff payload = ingredient_lines + ingredient_source + recipe_id
fallback rule = current missing → raw with explicit status

That is enough to wire safely without dragging the whole castle into a planning meeting.

Tiny pinned version

Tool decisions before wiring

- Retrieval is owned by the Experience / Runtime Layer.
- Nutrition receives a small handoff payload, not UI state.
- Legal line sources are raw and current only.
- Handoff payload:
  - ingredient_lines: list[str]
  - ingredient_source: raw | current
  - recipe_id: optional
- Retrieval may fallback from current to raw, but must report that status.
- Nutrition-side filtering/classification happens after retrieval, not inside it.

Yes. This is the moment the kitchen actually starts cooking.

You’ve now got:

ingredient lines ✔
retrieval door ✔
herb/spice classification ✔

So now we build the nutrition input pipeline cleanly.

What we are doing now

Convert ingredient lines → nutrition-ready inputs
without touching the source or breaking contracts

The new mini pipeline

ingredient_lines (raw or current)
        ↓
filter_lines_for_nutrition()
        ↓
classify_lines()
        ↓
select_primary_lines()
        ↓
Nutrition Agent (deterministic lookup)

Step-by-step

1. Start with retrieved lines

You already have:

payload = build_nutrition_input(...)
lines = payload["ingredient_lines"]

2. Filter obvious junk (light touch)

Keep this simple:

def is_candidate_line(line: str) -> bool:
    s = line.strip()

    if not s:
        return False

    # keep bullet / ingredient-looking lines
    if s.startswith("*") or any(char.isdigit() for char in s):
        return True

    return False

3. Normalize lightly

You don’t need perfection. Just remove obvious noise:

def normalize_line(line: str) -> str:
    s = line.lower()
    s = s.replace("*", "")
    s = s.replace("**", "")
    return s.strip()

4. Classify (your new power)

Using your sets:

def classify_line(line: str) -> str:
    text = normalize_line(line)

    if any(h in text for h in HERBS):
        return "herb"
    if any(s in text for s in SPICES):
        return "spice"
    if any(s in text for s in SEASONINGS):
        return "seasoning"

    return "primary"

5. Split into buckets

classified = []

for line in lines:
    if not is_candidate_line(line):
        continue

    cls = classify_line(line)

    classified.append({
        "line": line,
        "normalized": normalize_line(line),
        "class": cls,
    })

6. Select nutrition targets

For now:

primary_lines = [x for x in classified if x["class"] == "primary"]
secondary_lines = [x for x in classified if x["class"] != "primary"]

What goes to nutrition

Only:

nutrition_inputs = [x["normalized"] for x in primary_lines]

That’s it.

What stays visible

Do NOT lose this:

Herbs present: cilantro
Spices present: cumin, turmeric

This is great UI value.

What you get

Clean nutrition input

Instead of:

* plain yogurt
* pinch of cumin
* a little cilantro
That’s not weird at all...

You send:

plain yogurt
cucumber
jalapeño

Clean output story

nutrition focuses on real food mass
herbs/spices are acknowledged, not ignored
system feels smart, not pedantic

Important boundary reminder

ingredient_lines_raw       ← untouched
ingredient_lines_current   ← editable

classification + filtering ← derived only
nutrition input            ← derived only

No mutation upstream.

What NOT to do yet

no gram conversion
no quantity parsing
no perfect ingredient extraction
no alias system
no Chroma fallback

This is still Stage 1 plumbing.

Tiny summary for your notes

Nutrition input is derived from ingredient lines by:
- filtering candidate lines
- classifying herb/spice/seasoning vs primary
- selecting primary lines only
- passing normalized lines to deterministic lookup

Source data remains unchanged.

Status board

Source Door              [DONE]
Retrieval Door           [DONE]
Classification Door      [DONE]
Nutrition Input Layer    [NOW BUILDING 🚧]
Nutrition Contract Door  [READY]

Direct wiring plan

1. Keep the handoff narrow

Use the retrieved ingredient lines, then derive nutrition inputs from them:

build_nutrition_input(...)
    -> filter candidate lines
    -> classify herb/spice/seasoning vs primary
    -> send primary lines to Nutrition Agent

2. Do not change locked output contracts

The Nutrition Agent should still return the same locked runtime contract.

So this step is:

new input-side preprocessing
same output-side contract

3. Add the classification step just before lookup

Not in ingestion.
Not in Flask.
Not in the output formatter.

Put it in the nutrition-facing handoff path.

4. First-version behavior

For now:

primary lines go to lookup
herb, spice, seasoning lines are excluded from primary nutrition totals
but keep them visible in debug output

That gives you an honest first pass without pretending cumin is a zucchini.

What to wire

Conceptually:

nutrition_input = build_nutrition_input(recipe, mode="raw")

classified = classify_ingredient_lines(nutrition_input["ingredient_lines"])

primary_lines = [item["line"] for item in classified if item["class"] == "primary"]

result = nutrition_agent.run_from_ingredient_lines(primary_lines)

And for debug/inspection:

{
    "ingredient_source": "raw",
    "status": "FALLBACK_TO_RAW",
    "classified_lines": [...],
    "primary_lines": [...],
    "excluded_lines": [...],
}

What success looks like

You’ll know it’s working when:

chicken, yogurt, cucumber, onion, zucchini, etc. are sent to lookup
cilantro, cumin, turmeric, paprika, salt, pepper are classified separately
headings and obvious junk do not derail the nutrition call
the Nutrition Agent still returns the normal contract result

Suggested ask for Codex

Next step: wire herb/spice classification directly into the Nutrition Agent input path.

Goal:
Use the existing retrieval result as the single source of ingredient lines, classify the lines into primary vs herb/spice/seasoning, and send only primary lines to the Nutrition Agent for deterministic nutrition lookup.

Rules:
- do not modify ingredient_lines_raw
- do not change the locked nutrition output contracts
- keep classification as a derived preprocessing step only
- use the existing Stage 1 HERBS / SPICES / SEASONINGS sets
- anything not classified defaults to primary
- keep excluded herb/spice/seasoning lines visible in debug output if practical

Expected behavior:
- retrieved lines are classified
- primary lines are handed to the Nutrition Agent
- herb/spice/seasoning lines are excluded from primary nutrition totals for now
- existing runtime contract outputs remain unchanged

Add tests for:
- primary ingredient passes through
- herb/spice/seasoning lines are excluded from nutrition input
- unknown ingredient defaults to primary
- output contract remains unchanged

Update

> Ingredient line retrieval is now deterministic and classification-aware:
 - Added Stage 1 herb/spice/seasoning vocabularies and a _normalize_classification helper
 in retrieval.py. build_nutrition_input() now filters the retrieved list into primary
 lines (only these go to nutrition) and records which lines were excluded along with their
 classification. The payload includes ingredient_lines, ingredient_source, status, and the new
 ingredient_lines_excluded.
 - Updated the Flask viewer to rely solely on this retrieval door. The editable textarea always
 shows the working copy, and the “Lines Sent to Nutrition” section now reports the selected
 source, status, count, and any excluded herb/spice/seasoning lines so users can see exactly what
 will be analyzed.
 - Conversation state now attaches the nutrition_input payload (with excluded lines info) so
 runtimes have the contract-ready data, even as we continue to use structured ingredients for the
 calculator. Tests verify the payload is present after add/reset commands.
 - Added tests/test_ingredient_lines.py cases proving that herbs/spices are excluded, unknowns
 stay primary, and metadata is attached. All relevant suites (tests/test_import_recipes.py, tests/
 test_ingredient_lines.py, multi_agent_chatbot/tests/test_conversation_state.py) pass.
 Next runtime consumers can pull nutrition_input["ingredient_lines"] to feed the Nutrition Agent
 while keeping the excluded list visible for debugging.

> Absolutely—that summary reads well and captures the new behavior precisely:

  - Retrieval classifies herb/spice/seasoning lines and keeps only the primary lines in
  nutrition_input.
  - build_nutrition_input() attaches source, status, optional “Fridge is empty…” message, and
  ingredient_lines_excluded metadata.
  - Conversation state now hands off this payload so downstream consumers rely on
  nutrition_input["ingredient_lines"].
  - The Flask viewer renders the same payload, showing the lines that go to nutrition and any
  excluded items.
  - Tests (tests/test_ingredient_lines.py, multi_agent_chatbot/tests/test_conversation_state.py,
  plus the main import suite) keep the contract stable.

  Thanks for the nudge!

 We’re bumping up against the current session’s context limit (“0% context” is the CLI’s way of warning that the
  conversation is at max length), so the safest move is to pause here, summarize what we just attempted, and spin up
  a fresh session before tackling the remaining work. That way the next agent (or a clean restart of me) can pick up
  exactly where we left off without the CLI truncating anything mid‑task.

  Where we stopped

  - Ingestion fixes are in place and re-running python3 rag_setup/import_recipes.py completed, but the normalized dataset
  still shows ingredient_lines_raw=[] for “Black Eyed Peas…” because the CLI is still serving the older JSONL (timestamp
  indicates an older file).
  - Next steps would be to re-run normalize_row() directly on recipes/recipes.csv to confirm the extractor works, then
  double-check the Flask/runtime (or whatever process is reading the JSONL) is actually pointed at the updated file.

  Plan

  1. Wrap here with a concise summary so nothing is lost.
  2. Start a new session—either by reloading the workspace or launching a new agent instance—and pick up the ingestion
  verification from step 1 above.


> Sounds good! Here’s the summary for our next session:

  - We updated the ingredient-section extractor so blank/NBSP rows don’t break it, and added tests to ensure “present”
  sections can’t stay empty silently.
  - python3 -m pytest tests/test_import_recipes.py tests/test_ingredient_lines.py multi_agent_chatbot/tests/
  test_conversation_state.py all pass.
  - Despite re-running python3 rag_setup/import_recipes.py, the normalized file on disk still shows
  ingredient_lines_raw=[] for “Black Eyed Peas Recipe (Greek-Style)”. The runtime is almost certainly still using the
  older JSONL (timestamp shows it was last written far in the past).
  - Next steps for the new session:
      1. Run normalize_row() directly on recipes/recipes.csv to confirm the extractor returns the lines.
      2. Ensure the CLI/Flask viewer is reading the refreshed rag_setup/recipes_normalized.jsonl (e.g., by removing any
  cached copy, re-running the import, and verifying the timestamp and contents).
      3. Once the lines show up in the normalized file, the retrieval + nutrition path will automatically pick them up.

  Pick up there and we’ll be set!

Yes. That sounds like the right kind of implementation plan.

The key reason it feels right is that it is trying to preserve existing behavior and only make the new handoff path reflect the existing effective recipe state.

What is good in this plan

apply_command remains the place where recipe state changes are realized
ingredient_lines_current is derived from the effective recipe, not invented separately
build_nutrition_input(..., mode="current") becomes a reflection of the working state
reset behavior is explicitly included
tests cover update/add/reset, which is exactly where regressions like to hide

The main guardrail

This part matters most:

compute and set these lines only when overrides exist or differ from base

That is the right instinct.

You do not want to always manufacture ingredient_lines_current if nothing changed, because that muddies the meaning of the source.

A good rule is:

no overrides / no meaningful delta → no ingredient_lines_current, use raw
overrides present / effective ingredients differ from base → set ingredient_lines_current

Boundary to protect

Keep this split:

ingredient_lines_raw = ingested source truth
ingredient_lines_current = runtime-derived working copy when state has changed
build_nutrition_input() = reads those sources, does not invent state
apply_command() = owns state mutation

That preserves your architecture.

One caution

Be careful with this phrase:

derive ingredient lines from effective recipe ingredients

That is okay only if you are talking about the runtime working copy, not replacing the ingested raw lines.

So the safe interpretation is:

effective recipe ingredients may be rendered into ingredient_lines_current
for runtime use, but must not overwrite ingredient_lines_raw

What I’d want tested

At minimum:

base recipe, no overrides
- ingredient_lines_current absent or unused
- nutrition input source falls back appropriately
add ingredient
- current lines created
- nutrition input uses current
update ingredient
- current lines reflect the change
- nutrition input reflects the updated line
reset
- current lines removed or reverted cleanly
- nutrition input returns to raw/base behavior
no-op change
- does not create fake drift unnecessarily

Tiny wording polish for the note

Plan:
Update apply_command so runtime ingredient-line state stays aligned with the effective recipe. When overrides create a meaningful delta from the base recipe, derive ingredient_lines_current from that effective state and have build_nutrition_input() read it in current mode. Preserve ingredient_lines_raw unchanged, and extend tests to cover update, add, reset, and no-op cases so existing behavior remains intact while the deterministic nutrition handoff is integrated.

Verdict: good plan, correctly scoped. The important thing is that this is integration work, not a redesign.

This is much better.

This one is not “new feature energy.” It is clearly framed as:

root cause → regression → fix → verification

That is the right posture.

What I like

it identifies the regression precisely:
- forced mode="raw"
- no ingredient_lines_current
it restores the existing conversational behavior rather than replacing it
it explains why overrides were not affecting nutrition handoff
it ties the fix to apply_command, which is the correct place for runtime state mutation
it includes verification, not just code changes

What matters most

This sentence is the heart of it:

conversation_state.apply_command always handed build_nutrition_input the raw catalog lines and never populated ingredient_lines_current, so follow-up overrides never influenced the deterministic ingredient handoff

That is a real diagnosis, not a vague “we improved things.”

One wording adjustment

I would slightly soften this:

store them whenever overrides or serving tweaks (or missing raws) are active

The “or missing raws” part is probably true technically, but it risks muddying the main story. The main story is runtime override/state preservation.

I’d write it like this instead:

I added helpers to turn the computed effective recipe into textual ingredient lines and store them whenever runtime state creates a meaningful delta from the base recipe, including ingredient overrides and serving changes. The nutrition handoff now requests mode="current" so those working lines take precedence when present.

That keeps the logic tight.

I also like the test framing

This is good:

add/update → ingredient_source="current"
serving overrides rescale the lines used for nutrition
reset → falls back to raw catalog data

That is exactly the kind of regression net you want.

My suggested polished version

- Root cause: the new ingredient-line retrieval path introduced a regression in conversation_state. apply_command always passed build_nutrition_input the raw catalog lines and never populated ingredient_lines_current, so follow-up overrides such as “what if I add …” did not affect the deterministic nutrition handoff.
- Fix: added helpers to render the effective recipe state back into textual ingredient lines and store them whenever runtime state creates a meaningful delta from the base recipe, including ingredient overrides and serving changes. The nutrition handoff now uses mode="current" so those working lines take precedence when present (multi_agent_chatbot/conversation_state.py:54-205).
- Result: manual apply_command checks now show the working ingredient list and nutrition_input staying in sync when adding, removing, updating, and resetting ingredients, restoring the prior conversational behavior.
- Tests: updated multi_agent_chatbot/tests/test_conversation_state.py to assert that add/update commands produce ingredient_source="current" with adjusted lines, serving overrides rescale the nutrition-facing lines, and resets fall back to the raw catalog data (multi_agent_chatbot/tests/test_conversation_state.py:16-74).

Tests run:
python3 -m pytest tests/test_import_recipes.py tests/test_ingredient_lines.py multi_agent_chatbot/tests/test_conversation_state.py

Next step:
Run the interactive chat/CLI flow through a sequence like “base recipe → add ingredient → recalculate nutrition” to confirm the live runtime matches the restored behavior proven by the tests.

Verdict

I’d approve this direction.

This is the first proposal in this thread that really sounds like it understands your non-negotiable rule:

do not break existing runtime behavior.

So yes, I’d keep this one on the table.

> Yep, the normalized recipe data changed (we now generate a runtime catalog from rag_setup/recipes_normalized.jsonl and
the ingredient parsing is more strict), so run the ingest again to be safe:

1. python3 rag_setup/import_recipes.py – refresh rag_setup/recipes_normalized.jsonl.
2. python3 rag_setup/build_runtime_catalog.py – regenerate multi_agent_chatbot/runtime_recipes.json.
3. Launch Chainlit (cd multi_agent_chatbot && chainlit run -w agentic_chatbot.py --port 8000) and test with Use recipe:
Black Eyed Peas Recipe (Greek-Style).

That’ll ensure the runtime sees the latest structured ingredients before your end-to-end run.