Uploading data
Last updated
Last updated
To upload files you have two options:
Input via the user interface
Input via the upload API
You can upload data to Metamaze by clicking Menu > Training > Uploads, and then clicking the "+Upload" button in the top right corner:
A new window will appear where you can select the relevant files. You can either drag and drop the files into the window or search for the files via the explorer screen using the 'Browse' button.
A document will be created for each file in the upload.
A document will be created for each page in the upload.
All files will be merged into one document.
Document type and language
You can select the relevant document type and language of the uploaded files in the corresponding dropdown lists. If you do not specify these, the language will be detected automatically, and the document type will be left to specify. The document type and language can be modified later in the document meta data. It is necessary to assign a document type and a language to a document before you can start annotating it.
You can use the Metamaze REST API to upload files to Metamaze. Make sure you only upload files to logically belong together. Often, that means one file is one upload call.
For a tutorial on how to integrate with the REST API, see Rest API (JSON).
The API has a rate limit on how many uploads can be sent per hour depending on your tier license (see Limitations)
Up-to-date API documentation can be found here: https://app.metamaze.eu/docs/index.html. You can also click on the API documentation menu item.
Currently, the following file formats are supported:
JPEG
JPEG 2000
PNG
PDF (v1.4 or higher)
Microsoft Word
Microsoft Excel
OpenOffice
plain text TXT (utf-8-BOM)
TIFF (incl. multipage)
RTF
MSG e-mails (including nested attachments)
EML e-mails (including nested attachments)
OpenOffice Calc (.ods)
For images, the minimum dimensions should be 250px by 250px. Smaller images will not be processed.
For other file formats, please file a feature request at support@metamaze.eu.
The following limitations are enforced:
maximum file size of 2 GB
maximum of 200 pages per upload
maximum 3000 annotations or predictions per document
maximum of 100 document types in one project
maximum of 250 entity types per document type
rate limits for upload per hour depending on the license tier.
maximum size of emails of 25 MB
maximum size of attachments in e-mails is 5MB each
Other limitations:
maximum of 200 pages per document
License tier (pages / month) | Maximum uploads per hour |
---|---|
5.000 | 200 |
7.500 | 200 |
10.000 | 400 |
15.000 | 400 |
20.000 | 400 |
30.000 | 400 |
40.000 | 800 |
50.000 | 800 |
60.000 | 1.200 |
80.000 | 1.200 |
100.000 | 2.000 |
150.000 | 2.000 |
200.000 | 2.000 |
250.000 | 3.000 |
300.000 | 3.500 |
400.000 | 4.000 |
500.000 | 5.000 |
600.000 | 6.000 |
700.000 | 7.000 |
800.000 | 8.000 |
900.000 | 9.000 |
1.000.000 | 10.000 |
1.100.000 | 11.000 |
1.200.000 | 12.000 |
1.300.000 | 13.000 |
1.400.000 | 14.000 |
1.500.000 | 15.000 |
1.600.000 | 16.000 |
1.800.000 | 18.000 |
2.000.000 | 20.000 |
2.100.000 | 21.000 |
2.200.000 | 22.000 |
2.300.000 | 23.000 |
2.400.000 | 24.000 |
2.500.000 | 25.000 |
2.600.000 | 26.000 |
2.700.000 | 27.000 |
2.800.000 | 28.000 |
2.900.000 | 29.000 |
3.000.000 | 30.000 |