Upload treebanks for GrETEL

Upload your treebank

A treebank (in this case) consists of one or more archived folders (.zip), with each folder containing files of one of the following types:

  • plain-text files (with extension *.txt)
  • CHAT files (with extension *.cha)
  • sentences parsed by Alpino, in LASSY-XML format (with extension *.xml)
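
As a minimal sketch of how such an archive could be prepared before uploading, the Python snippet below packs one folder into a .zip-file and refuses folders that mix file types. The function name build_treebank_zip and the folder name my_treebank are hypothetical. Keeping the folder itself as the top level of the archive, and matching extensions case-insensitively, are assumptions rather than documented requirements of the uploader.

```python
import zipfile
from pathlib import Path

# Extensions named in the list above; the set itself is part of this sketch.
ALLOWED_EXTENSIONS = {".txt", ".cha", ".xml"}


def build_treebank_zip(source_dir, zip_path):
    """Pack all files under source_dir into zip_path.

    Returns the single file extension found. Raises ValueError when the
    folder is empty, mixes file types, or contains an unsupported type.
    """
    source = Path(source_dir)
    files = sorted(p for p in source.rglob("*") if p.is_file())
    extensions = {p.suffix.lower() for p in files}
    if len(extensions) != 1 or not extensions <= ALLOWED_EXTENSIONS:
        raise ValueError(
            f"expected files of exactly one type from {sorted(ALLOWED_EXTENSIONS)}, "
            f"found {sorted(extensions)}"
        )
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in files:
            # Assumption: store paths relative to the parent directory,
            # so the folder name is kept inside the archive.
            archive.write(path, arcname=path.relative_to(source.parent).as_posix())
    return extensions.pop()
```

Calling build_treebank_zip("my_treebank", "my_treebank.zip") would then produce a file that can be selected in the upload form. The returned extension indicates which parse flag to set by hand, since the file type is not detected automatically.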

Choose an appropriate title for your treebank and upload your .zip-file below. Then, please set the correct parse flags for your files (they will not be detected automatically).

Upload your treebank
Please do not use spaces in your title.
If you want your treebank to be available only to you, please deselect this option.
Parse flags
If you check this box, your .zip-file contains plain-text files (see definition above), and they will be parsed using the Alpino parser.
If you check this box, your .zip-file contains files in the FoLiA XML format. The text will be extracted and parsed using the Alpino parser.
If you check this box, your .zip-file contains files in the TEI XML format. The text will be extracted and parsed using the Alpino parser.
If you check this box, your .zip-file contains files in the CHAT format, and they will be pre-processed and then parsed using the Alpino parser.
If you check this box, your .zip-file contains Alpino-parsed files. These require no further preprocessing.
If you check this box, your .txt-files are already sentence-tokenized. Otherwise, the files will be sentence-tokenized during import.
If you check this box, your .txt-files are already word-tokenized. Otherwise, the files will be word-tokenized during import.
If you check this box, your input has labels. Otherwise, labels for each sentence will be auto-generated during import.
version 0.2.3 Fri Sep 10 2021