fix: use absolute /app/data/raw path so data survives container exit
../data/raw relative to WORKDIR /app resolves to /data/raw which is outside the volume mount (./data:/app/data). The scraper wrote 583 pages to the ephemeral container filesystem and they were lost on exit.

Changed to /app/data/raw in both scraper and indexer so the JSONL file lands inside the mounted volume and persists between containers.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@@ -31,7 +31,7 @@ log = logging.getLogger(__name__)
 
 BASE_URL = "https://www.odoo.com/documentation/18.0"
 SITEMAP_URL = f"{BASE_URL}/sitemap.xml"
-OUTPUT_DIR = Path("../data/raw")
+OUTPUT_DIR = Path("/app/data/raw")
 OUTPUT_FILE = OUTPUT_DIR / "odoo18_docs_raw.jsonl"
 DELAY_SECONDS = 1.2
 MAX_RETRIES = 3
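
The path resolution described in the commit message can be checked with a short sketch. It is an illustration only, not part of the commit: the helper names `resolve_in_container` and `is_inside_mount` are hypothetical, the working directory and mount point are taken from the commit message, and the resolution is purely lexical (it collapses `..` segments and does not follow symlinks).

```python
import posixpath
from pathlib import PurePosixPath

# Values taken from the commit message; the helpers below are hypothetical.
WORKDIR = "/app"                          # container working directory
MOUNT_POINT = PurePosixPath("/app/data")  # container side of ./data:/app/data


def resolve_in_container(path: str, workdir: str = WORKDIR) -> PurePosixPath:
    """Resolve `path` against `workdir` lexically, collapsing '..' segments."""
    # posixpath.join discards `workdir` when `path` is already absolute.
    return PurePosixPath(posixpath.normpath(posixpath.join(workdir, path)))


def is_inside_mount(path: str, mount: PurePosixPath = MOUNT_POINT) -> bool:
    """Return True when the resolved path is the mount point or lies under it."""
    resolved = resolve_in_container(path)
    return mount == resolved or mount in resolved.parents


if __name__ == "__main__":
    for candidate in ("../data/raw", "/app/data/raw"):
        print(candidate, "->", resolve_in_container(candidate),
              "| inside mount:", is_inside_mount(candidate))
    # ../data/raw -> /data/raw | inside mount: False
    # /app/data/raw -> /app/data/raw | inside mount: True
```

A check of this kind could run at scraper start-up to fail fast when the output directory falls outside the mounted volume, though whether that is worth adding depends on how the containers are launched.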