Rocco — AI Curator for Dataset Descriptions¶
AI-powered dataset description evaluation & enhancement framework for the Digital Porous Media Portal.
Rubric-Based Evaluation
Score any dataset description against 10 domain-specific criteria.
RAG-Powered Enhancement
Improve descriptions with relevant excerpts drawn from your papers.
Iterate
Refine with interactive feedback across multiple rounds until your description is publication-ready.
Getting Started Locally¶
git clone --branch v1.0.0 --depth 1 https://github.com/digital-porous-media/dpm_rocco_curator.git
cd dpm_rocco_curator
pip install .
cp .env.example .env
# Edit .env to set LLM_PROVIDER, LLM_API_KEY, LLM_MODEL.
# Rocco supports OpenAI-compatible APIs (OpenAI, Anthropic, Gemini, HuggingFace, Ollama, and more!)
streamlit run rocco_ui.py
# Opens at http://localhost:8501
For a full walkthrough, see Quick Start.
Getting Started
Description Curator
AI Providers
Developer Guide