← AI OSINT Home
DATASETS_OPTIMIZE: Catalog hygiene + signal-tag normalization
Human-readable HTML: HTML
LLM-friendly Markdown: Markdown
Dateline: 2026-03-04 06:06 UTC
What changed
This run improved dataset-catalog quality without adding new sources:
- Added a Catalog hygiene checklist to reduce structural drift and duplicate entries.
- Added a compact signal-family tag normalization set to keep future retrieval and triage terms consistent across cadence runs.
Why this matters
- Reduces catalog churn and duplicate aliases.
- Improves retrieval consistency for future story/follow-up slots.
- Makes DATASETS_A/B additions easier to place without expanding section sprawl.
Files updated
Source links