Records how a specific field's value was obtained: manual entry, AI extraction, import, OCR, or computation.

Required fields
field, method, extractedAt

Fields

| Field | Type | Req | Description |
| --- | --- | --- | --- |
| confidence | integer | | Confidence in the field value (0-100). |
| dataSource | object | | The third-party source that provided this field value. Enables attribution credits and the data source kill switch. |
| extractedAt | string | Yes | When this value was extracted from the source document. |
| field | string | Yes | The field name this provenance record applies to. |
| method | object | Yes | How the field value was obtained. |
| ocrEngine | string | | OCR engine or extraction tool used to obtain this value. |
| originalScript | object | | Writing system of the source document. Critical for handwritten wills in non-Latin scripts. Values: latin (English, French, German, etc.); kanji (Japanese kanji characters); hiragana/katakana (Japanese syllabaries); arabic (Arabic script, also used for Urdu and Farsi); hebrew (Hebrew script); devanagari (Hindi, Marathi, Sanskrit); tamil (Tamil script); chinese_simplified/traditional (Chinese characters); hangul (Korean); cyrillic (Russian, Serbian, etc.); thai (Thai script); mixed (multiple scripts in one document); other (any other writing system). |
| originalText | string | | The raw text as extracted, before any normalisation or structuring. Preserves exactly what the source document said. |
| sourceDocumentId | string | | Reference to the document.json entity this value was extracted from. |
| sourcePageNumber | integer | | Page number in the source document where this value was found. |
| sourceRegion | string | | Location within the page where this value was found. |
| verifiedAt | string | | When this field was last verified. |
| verifiedBy | string | | Who verified this field. |
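
The field list above can be checked mechanically. The following is a minimal sketch, not part of the schema itself: `validate_provenance` is a hypothetical helper that checks only what this reference states (the three required fields, the string and integer types, and the 0-100 range for `confidence`). The internal shapes of `method`, `dataSource`, and `originalScript` are not described here, so the sketch leaves them unchecked, and every value in `example` is illustrative, including the assumed ISO 8601 timestamp format.

```python
from typing import Any

# Taken from the "Required fields" list in this reference.
REQUIRED_FIELDS = ("field", "method", "extractedAt")

# Fields typed as string / integer in the table above.
STRING_FIELDS = (
    "extractedAt", "field", "ocrEngine", "originalText",
    "sourceDocumentId", "sourceRegion", "verifiedAt", "verifiedBy",
)
INTEGER_FIELDS = ("confidence", "sourcePageNumber")


def _is_int(value: Any) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def validate_provenance(record: dict[str, Any]) -> list[str]:
    """Return a list of problems found; an empty list means none were found."""
    errors = []
    for name in REQUIRED_FIELDS:
        if name not in record:
            errors.append(f"missing required field: {name}")
    for name in STRING_FIELDS:
        if name in record and not isinstance(record[name], str):
            errors.append(f"{name} must be a string")
    for name in INTEGER_FIELDS:
        if name in record and not _is_int(record[name]):
            errors.append(f"{name} must be an integer")
    confidence = record.get("confidence")
    if _is_int(confidence) and not 0 <= confidence <= 100:
        errors.append("confidence must be between 0 and 100")
    return errors


# Illustrative record; all values are made up for this sketch.
example = {
    "field": "testatorName",                # hypothetical field name
    "method": {"type": "ocr"},              # object shape is an assumption
    "extractedAt": "2024-01-15T10:30:00Z",  # timestamp format is an assumption
    "confidence": 87,
    "ocrEngine": "example-ocr",             # hypothetical engine name
    "originalText": "Jane A. Doe",
    "sourcePageNumber": 1,
}

print(validate_provenance(example))
```

Returning a list of messages rather than raising on the first problem lets a caller report every issue in a record at once; a real deployment would more likely validate against the published schema file directly.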
