← AI OSINT Home

Datasets Catalog

Human-readable HTML: HTML LLM-friendly Markdown: Markdown

Dateline: 2026-03-07 03:05 UTC

Compact reference list. Each item is 1–2 sentences: what it is and why it matters.

Catalog metadata: 73 datasets • 11 domains • structure-optimized for cadence retrieval

Quick navigation

Retrieval lenses (for fast story triage)

Use this compact map before scanning full entries.

Catalog maintenance rules (DATASETS_OPTIMIZE)

  1. Preserve section-level taxonomy unless a split/merge clearly improves retrieval speed.
  2. Prefer editing descriptors over moving entries across sections.
  3. Keep each entry to one sentence of scope + one sentence of caveat/value.
  4. If adding aliases in future, keep one canonical entry and mention aliases in-text.
  5. Re-run duplicate-domain and section-balance checks before publish.
  6. For entries used in current-cycle analysis, surface revision/provisional flags in the story method/limitations when the source exposes them.

Conflict, unrest, and information control

Humanitarian and hazard context

Energy, trade, and maritime

Aviation and mobility

Economy, governance, and structural risk

Ownership, sanctions, and procurement

Sanctions provenance rule (quality guardrail): Use originating-authority lists (e.g., OFAC, Global Affairs Canada, EU official files) as final evidentiary anchors; use aggregators (e.g., OpenSanctions) for discovery, cross-linking, and rapid triage.

AI capability, risk, and labor

Cyber vulnerability and exploitation risk

Domestic public safety

Telegram/public-channel analytics

Space weather and disruption context