🔖 Glossary

Overview of commonly used words and terminology

| Term | Synonyms | Description |
| --- | --- | --- |
| Entity | Field, entity type | An entity is a piece of information to be extracted from a document. Entities can have properties such as being required or having a maximum number of unique occurrences. |
| Value | Annotation, label, entity value | A specific instance of an entity, such as an annotation or a prediction. For example, 2022-02-13 is a value of the entity Document date. |
| Annotating | Labeling | The process of marking or highlighting values of entities in a specific document. |
| Document | | In Metamaze, a document is a logical set of pages that is assigned exactly one document type. E-mails are treated as documents as well. |
| Upload | Batch | An upload is a logical set of one or more files to be treated together. |
| Text entity | | An entity consisting of text that is recognised on the document. |
| Image entity | Object detection | Any entity to be recognised that is not text. For example: signatures, handwriting, stamps, logos, … |
| Composite entity | Entity group | A group of entities that belong together. For example: an address consisting of street, number, city, … or an order line consisting of item description, item amount, item price, total price, … |
| Page management | | The process of merging and splitting individual pages and files into one or more documents. |
| Parsing | | Standardising raw input values into a structured format. For example, standardising a date like “April 24th, 2020” to “2020-04-24”. Applies to dates, numeric values, currencies, … |
| OCR | | Optical Character Recognition is the process of converting scanned images to machine-readable text. |
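
To make the Parsing entry more concrete, here is a minimal Python sketch of standardising a written-out English date into ISO format. It is an illustration only, not Metamaze's actual parser: the function name `parse_date` and the single supported input layout ("Month day, year") are assumptions made for this example.

```python
import re
from datetime import datetime

# Matches an English ordinal suffix directly after a digit ("24th" -> "24").
# The lookbehind keeps month names such as "August" untouched.
_ORDINAL_SUFFIX = re.compile(r"(?<=\d)(st|nd|rd|th)\b")


def parse_date(raw: str) -> str:
    """Standardise a date like 'April 24th, 2020' to '2020-04-24'.

    Hypothetical helper: only handles the 'Month day, year' layout
    with English month names.
    """
    cleaned = _ORDINAL_SUFFIX.sub("", raw.strip())
    return datetime.strptime(cleaned, "%B %d, %Y").date().isoformat()


print(parse_date("April 24th, 2020"))
```

A real parser would need to handle many more layouts, languages, and ambiguous inputs; numeric values and currencies would follow the same idea of mapping a raw string to one canonical representation.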

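The relationship between entities, values, and composite entities can also be sketched as a small data model. Every class, field, and function name below (`Entity`, `CompositeEntity`, `Value`, `validate`, `max_occurrences`) is hypothetical and chosen for illustration; it does not reflect Metamaze's internal model or API.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Entity:
    """A piece of information to extract, with optional constraints."""
    name: str
    required: bool = False
    max_occurrences: Optional[int] = None  # None means no limit (assumption)


@dataclass
class CompositeEntity:
    """A group of entities that belong together, e.g. an address."""
    name: str
    members: List[Entity] = field(default_factory=list)


@dataclass
class Value:
    """A specific instance of an entity found in a document."""
    entity: Entity
    text: str


def validate(entity: Entity, values: List[Value]) -> List[str]:
    """Check one entity's constraints against the values of one document."""
    unique = {v.text for v in values if v.entity.name == entity.name}
    problems = []
    if entity.required and not unique:
        problems.append(f"{entity.name}: required but missing")
    if entity.max_occurrences is not None and len(unique) > entity.max_occurrences:
        problems.append(
            f"{entity.name}: {len(unique)} unique values, "
            f"at most {entity.max_occurrences} allowed"
        )
    return problems


document_date = Entity("Document date", required=True, max_occurrences=1)
address = CompositeEntity(
    "Address", [Entity("Street"), Entity("Number"), Entity("City")]
)
print(validate(document_date, [Value(document_date, "2022-02-13")]))
```

Here `document_date` mirrors the Document date example from the table, and `address` shows how a composite entity simply groups ordinary entities.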