Comment on page
🔖
Glossary
Overview of commonly used words and terminology
Term | Synonyms | Description |
---|---|---|
Entity | Field, entity type | An entity is a piece of information to be extracted from a document. Entities can have properties like required or maximum unique occurrences. |
Value | Annotation, label, entity value | A specific instance of an entity like an annotation or a prediction. E.g. 2022-02-13 is a value of entity Document date |
Annotating | Labeling | The process of marking or highlighting values of entities in a specific document. |
Document | | In Metamaze, a document is a logical set of pages that is assigned exactly one document type. E-mails are treated as documents as well. |
Upload | Batch | An upload is a logical set of one or more files to be treated together. |
Text entity | | Entity consisting of text that is recognised on the document. |
Image entity | Object detection | Any entity to be recognised that is not text. For example: signatures, handwriting, stamps, logo’s, … |
Composite entity | Entity group | A group of entities that belong together. For example: address consisting of street, number, city, … or an order line consisting of item description, item amount, item price, total price, … |
Page management | | The process of merging and splitting individual pages and files into one or more document(s). |
Parsing | | Standardising raw input values into a structured format. For example standardising a date like “April 24th, 2020” to “2020-04-24”. Applies to dates, numeric values, currencies, … |
OCR | | Optical Character Recognition is the process of converting scanned images to actual text. |
Last modified 1yr ago