πGlossary
Overview of commonly used words and terminology
Entity
Field, entity type
An entity is a piece of information to be extracted from a document. Entities can have properties like required or maximum unique occurrences.
Value
Annotation, label, entity value
A specific instance of an entity like an annotation or a prediction. E.g. 2022-02-13
is a value of entity Document date
Annotating
Labeling
The process of marking or highlighting values of entities in a specific document.
Document
In Metamaze, a document is a logical set of pages that is assigned exactly one document type. E-mails are treated as documents as well.
Upload
Batch
An upload is a logical set of one or more files to be treated together.
Text entity
Entity consisting of text that is recognised on the document.β
Image entity
Object detection
Any entity to be recognised that is not text. For example: signatures, handwriting, stamps, logoβs, β¦
Composite entity
Entity group
A group of entities that belong together. For example: address consisting of street, number, city, β¦ or an order line consisting of item description, item amount, item price, total price, β¦
Page management
The process of merging and splitting individual pages and files into one or more document(s).
Parsing
Standardising raw input values into a structured format. For example standardising a date like βApril 24th, 2020β to β2020-04-24β. Applies to dates, numeric values, currencies, β¦β
OCR
Optical Character Recognition is the process of converting scanned images to actual text.
Last updated