πGlossary
Overview of commonly used words and terminology
Term | Synonyms | Description |
---|---|---|
Entity | Field, entity type | An entity is a piece of information to be extracted from a document. Entities can have properties like required or maximum unique occurrences. |
Value | Annotation, label, entity value | A specific instance of an entity like an annotation or a prediction. E.g. |
Annotating | Labeling | The process of marking or highlighting values of entities in a specific document. |
Document | In Metamaze, a document is a logical set of pages that is assigned exactly one document type. E-mails are treated as documents as well. | |
Upload | Batch | An upload is a logical set of one or more files to be treated together. |
Text entity | Entity consisting of text that is recognised on the document.β | |
Image entity | Object detection | Any entity to be recognised that is not text. For example: signatures, handwriting, stamps, logoβs, β¦ |
Composite entity | Entity group | A group of entities that belong together. For example: address consisting of street, number, city, β¦ or an order line consisting of item description, item amount, item price, total price, β¦ |
Page management | The process of merging and splitting individual pages and files into one or more document(s). | |
Parsing | Standardising raw input values into a structured format. For example standardising a date like βApril 24th, 2020β to β2020-04-24β. Applies to dates, numeric values, currencies, β¦β | |
OCR | Optical Character Recognition is the process of converting scanned images to actual text. |
Last updated