Architecture¶

Rocco is designed to be modular, separating concerns across evaluation, enhancement, RAG, and LLM integration.

System Diagram¶

Core Processing Pipeline

digraph Pipeline {

rankdir=TB;
fontsize=16;
fontname="Helvetica";
bgcolor="transparent";

node [
shape=box,
style="rounded,filled",
fontname="Helvetica",
fontsize=14,
margin="0.35,0.25",
penwidth=2
];

edge [
fontname="Helvetica",
fontsize=12,
penwidth=2
];

INPUT [
label=<
<TABLE BORDER="0" CELLBORDER="0" CELLSPACING="0">
<TR><TD><B><FONT POINT-SIZE="15">Draft Description</FONT></B></TD></TR>
<TR><TD><FONT POINT-SIZE="13">User-provided description</FONT></TD></TR>
</TABLE>
>,
fillcolor="#e3f2fd",
height=1.4,
width=3.5
];

EVAL [
label=<
<TABLE BORDER="0" CELLBORDER="0" CELLSPACING="0">
<TR><TD><B><FONT POINT-SIZE="15">Evaluator</FONT></B></TD></TR>
<TR><TD><FONT FACE="monospace" POINT-SIZE="12">src/evaluator</FONT></TD></TR>
<TR><TD><FONT POINT-SIZE="13">10-point rubric scoring</FONT></TD></TR>
</TABLE>
>,
fillcolor="#fff3e0",
height=1.4,
width=3.5
];

RAG [
label=<
<TABLE BORDER="0" CELLBORDER="0" CELLSPACING="0">
<TR><TD><B><FONT POINT-SIZE="15">Retriever</FONT></B></TD></TR>
<TR><TD><FONT FACE="monospace" POINT-SIZE="12">src/retriever</FONT></TD></TR>
<TR><TD><FONT POINT-SIZE="13">RAG pipeline</FONT></TD></TR>
</TABLE>
>,
fillcolor="#f3e5f5",
height=1.4,
width=3.5
];

SCREEN [
label=<
<TABLE BORDER="0" CELLBORDER="0" CELLSPACING="0">
<TR><TD><B><FONT POINT-SIZE="15">Content Screener</FONT></B></TD></TR>
<TR><TD><FONT FACE="monospace" POINT-SIZE="12">src/llm</FONT></TD></TR>
<TR><TD><FONT POINT-SIZE="13">Validate feedback</FONT></TD></TR>
</TABLE>
>,
fillcolor="#f3e5f5",
height=1.4,
width=3.5
];

EDIT [
label=<
<TABLE BORDER="0" CELLBORDER="0" CELLSPACING="0">
<TR><TD><B><FONT POINT-SIZE="15">Editor</FONT></B></TD></TR>
<TR><TD><FONT FACE="monospace" POINT-SIZE="12">src/editor</FONT></TD></TR>
<TR><TD><FONT POINT-SIZE="13">Apply feedback + context</FONT></TD></TR>
</TABLE>
>,
fillcolor="#fff3e0",
height=1.4,
width=3.5
];

OUTPUT [
label=<
<TABLE BORDER="0" CELLBORDER="0" CELLSPACING="0">
<TR><TD><B><FONT POINT-SIZE="15">Refined Description</FONT></B></TD></TR>
<TR><TD><FONT POINT-SIZE="12">(Output)</FONT></TD></TR>
<TR><TD><FONT POINT-SIZE="13">With citations</FONT></TD></TR>
</TABLE>
>,
fillcolor="#c8e6c9",
height=1.4,
width=3.5
];

INPUT -> EVAL -> EDIT -> OUTPUT;
RAG -> EDIT;
SCREEN -> EDIT;
}

Core Modules¶

Data Flow¶

Configuration¶

Environment Variables (via .env)

LLM_PROVIDER — Shortcut to endpoint (openai, anthropic, ollama, etc.)
LLM_API_KEY — API key or “ollama” for local
LLM_BASE_URL — Custom endpoint URL (optional)
LLM_MODEL — Model name (defaults to gpt-4o-mini)

Session State (Streamlit)

Stored in st.session_state:

description_text — current description
evaluation — latest evaluation result
vector_store_manager — loaded FAISS index
enhanced_description — improved version
user_feedback — feedback text
screening_result — content screener result
And more…

Extension Points¶

Adding a New LLM Provider

Add provider → base URL mapping to PROVIDER_URLS in src/llm/client.py
Update .env.example with provider config
No code change needed (OpenAI SDK handles compatibility)
Document in README and configuration guide

Adding New Evaluation Criteria

Add criterion to src/evaluator/rubric.json
Update src/evaluator/examples_v3.json with new examples
Update src/prompts/evaluator.yaml to reference new criteria
Bump version in evaluator.yaml (major if score scale changes)

Adding a New Document Type

Create CustomIngestor extending DocumentIngestor
Implement custom chunking logic
Register in rocco_ui.py

Testing¶

Run tests:

pytest tests/

Key test patterns:

Evaluator tests — verify rubric scoring consistency
Retriever tests — verify FAISS indexing and search
Editor tests — verify prompt rendering and citation tracking
Integration tests — end-to-end workflow (evaluate → enhance → screen)

See Also¶

Streamlit App — User-facing workflow
Contributing — Development guidelines
CLAUDE.md — Detailed implementation patterns