Skip to main content

Data Model

Documentation Map

Data Model

Version: 2026-04-01

Core Entities

  • Article
    • normalized news record
  • Article Version
    • historical state of an article
  • Dedupe Cluster
    • groups duplicate/near-duplicate variants
  • Job
    • ingest/export/runtime work unit
  • Dataset Snapshot
    • frozen dataset selection for reproducible downstream work
  • Export
    • machine-usable artifact from snapshots or run context
  • Research Run
    • higher-level analytical context
  • Scenario Registry / Scenario Release
    • governance/research publication artifacts
  • Account
    • tenant boundary
  • API Key
    • scoped machine credential
  • Billing Period / Subscription / Invoice
    • commercial/accounting entities
  • Support Case
    • customer/operator support entity
  • Release Notice
    • deprecation/sunset communication entity

Relationships

  • accounts own API keys
  • jobs materialize articles, snapshots, exports, and run-adjacent outputs
  • articles participate in versions and dedupe clusters
  • snapshots and exports anchor reproducible downstream work
  • research runs reference snapshots and comparison/impact artifacts

Persistency Assumptions

  • PostgreSQL is the primary metadata store
  • MinIO stores artifact-like outputs
  • Qdrant stores vectors and retrieval payloads
  • Redis supports runtime/cache behavior

Integration-Relevant IDs

  • job_id
  • article_id
  • snapshot_id
  • export_id
  • run_id
  • account_id
  • invoice_id
  • case_id

License: AGPLv3
Learn more: https://helpifyr.com