CROVIA – AI Training Data Trust & Payout Engine
CROVIA turns raw attribution logs (“royalty receipts”) into:
- monthly payouts per provider from a fixed EUR budget
- trust and priority scores per provider
- a signable Trust Bundle aligned with EU AI Act record-keeping principles and compatible with major AI governance frameworks
All demo data exposed on croviatrust.com are synthetic and do not represent real payments.
1. Problem CROVIA solves
Modern AI models are trained on data coming from many providers and sources.
Companies need to:
- track who contributed what to a model
- calculate fair payouts from a fixed budget
- assign trust / risk bands per provider
- keep audit-ready evidence for regulators, partners and internal governance
Spreadsheets and ad-hoc scripts do not scale and are hard to audit.
CROVIA provides a structured, machine-readable contract between:
- the engine that trains and serves models, and
- the ecosystem of data providers, regulators and auditors.
2. Core concepts
2.1 Royalty receipts (attribution logs)
A royalty receipt is a compact JSON record saying, in essence:
“this model output was influenced by these providers, with these weights.”
Receipts are written as newline-delimited JSON (NDJSON) so they can be streamed, sharded and stored efficiently.
Typical fields include:
- timestamp
- model identifier
- segment (train / eval / inference)
- list of provider IDs
- contribution weight or score
2.2 Trust and priority
From all receipts in a given period (for example 2025-11), CROVIA computes:
- how often each provider appears in high-value outputs
- a trust score per provider (0–1)
- a risk profile and a priority band (HIGH / MED / LOW) combining contribution and risk
These signals drive both payouts and future negotiation priorities.
2.3 Payouts
Given a fixed monthly EUR budget, CROVIA allocates payouts per provider according to:
- observed contribution in the receipts
- policy parameters (caps, thresholds, exclusions)
- concentration targets and governance constraints
The result is a payout table that can be exported as CSV / NDJSON and linked to downstream payment systems.
2.4 Crovian Floors and coverage
A distinctive feature of CROVIA is the concept of Crovian Floor.
For each provider the engine estimates a coverage bound: a conservative lower bound of
“how much this provider really covers the model behaviour”.
Using this bound, policy parameters and the total budget, CROVIA derives a minimum payout floor for eligible providers.
Floors can then be used in contracts, revenue-sharing agreements and compliance reviews.
2.5 Trust Bundle
For each period, CROVIA emits a signed Trust Bundle JSON that includes:
- pointers and checksums for main artifacts (payout tables, charts, hashchain)
- trust / priority tables and coverage bounds
- assumptions and notes (coverage limits, known gaps, excluded providers)
- governance metadata (who ran the engine, when, with which profile)
The bundle is designed to be archived, shared with providers, or attached to internal model cards as evidence.
3. What you see on croviatrust.com
The public website currently exposes a demo configuration.
3.1 Dashboard – period view
The dashboard shows, for the latest available period:
- top providers by payout and share of budget
- payout charts (Top-10 providers, cumulative payout curve)
- basic concentration metrics (for example HHI and Gini)
- a button to download the Trust Bundle JSON for that period
All numbers are synthetic and for illustration only.
3.2 Sandbox – upload your own receipts
The Sandbox lets you upload a small NDJSON file with your own receipts (demo limit ≈ 2 MB) and get:
- a temporary payout preview per provider
- basic QA feedback if the format is wrong
- a view of how the CROVIA pipeline behaves on a private sample
Sandbox runs are meant for experimentation only; they are not stored long-term and do not create contractual obligations.
3.3 CLI reproducibility
The same demo period exposed on the website can be reproduced from the CLI using the open profile and tools described in the internal documentation.
This makes it possible to verify the pipeline end-to-end on a private environment.
4. CROVIA profile and implementation layers
CROVIA is split into three layers:
-
M0 – Open Profile
The data profile and examples (receipts, payouts, Trust Bundles).
Anyone can emit or consume objects that follow this profile. -
M1 – Lite tools
Simple validators and integrity checkers that help partners adopt the profile and verify basic properties. -
M2 – Enterprise Engine
The full CROVIA engine (trust, payouts, floors, compliance pack, charts, hashchain, signatures), offered under commercial terms or as a managed service.
Public documentation focuses on M0 (profile) and on the observable behaviour of the system.
The internal implementation of M2 can evolve without breaking the contract with data providers.
5. How to start a pilot
A typical pilot path is:
- Define a test period and total EUR budget.
- Export a sample of royalty receipts in the CROVIA profile.
- Run a sandbox or private instance of the engine.
- Review payouts, trust bands, floors and the Trust Bundle with:
- internal stakeholders
- selected data providers
- legal / compliance teams
- If results are acceptable, move to regular periodic runs and connect the outputs to real payment systems.
CROVIA is designed to keep the profile open and the engine evolvable, so organisations can adopt a transparent, auditable standard for AI-training data payouts while retaining flexibility in their internal implementations.
6. Contact
For enterprise or partnership enquiries you can reach us at
info@croviatrust.com
For the full open technical profile, see
CROVIA – AI Training Data Trust Profile M0.
For the CLI validation and compliance toolkit see
CROVIA – Lite Tools Pack M1.