DK-CLARIN LSP Corpus - Construction domain
Please use the following text to cite this item:
Centre for Language Technology, NorS, University of Copenhagen and The Danish Language Council, 2011,
DK-CLARIN LSP Corpus - Construction domain, CLARIN-DK-UCPH Centre Repository,
http://hdl.handle.net/20.500.12115/9.
Authors
Item identifier
Date issued
2011
Size
35 files,
577,392 tokens
Language(s)
Description
Texts in the Construction domain come from Statens Byggeforskningsinstitut, Erhvervs- og byggestyrelsen and Murerfagets Oplysningsråd, and were collected in the DK-CLARIN project (WP2.2), 2008-2011.
The corpus consists of 577,392 tokens in 35 files.
Communicative setting (number of files): expert->expert (18), expert->advanced (6), expert->basic (11).
All texts are in TEI P5 XML (the TEIP5DKCLARIN format). Tokenisation, sentence and paragraph segmentation, POS tagging, lemmatisation and termhood annotation are stored in separate span groups external to the text.
"DK-CLARIN LSP Corpus - Construction domain" is part of the Danish DK-CLARIN LSP corpus, which consists of seven sub-corpora from the following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology.
Acknowledgement
n/a
Project code: n/a
Project name: DK-CLARIN
Subject(s)
Collections
Files in this item
| Name | Size | Format | Description | MD5 |
|---|---|---|---|---|
| text-format.pdf | 111.77 KB | application/pdf | Documentation | c4c4b5f1cd83ff232c44bc7692621da7 |
| text-header.pdf | 375.79 KB | application/pdf | Documentation | 47825d0010a398bf10ce1564da2a15f0 |
| README_construction.txt | 3.04 KB | text/plain | README | 11ed9229c6f8755eaa19d8bc15673ef2 |
| textCorpusProfile.xsd | 142.26 KB | text/xml | CMDI schema | 7d6b452b88175041133ea8020e453cd8 |
| DKCLARIN_fagsprogligt_korpus_dokumentation_2011.pdf | 361.81 KB | application/pdf | Documentation | e1752deaa6888e2f856811c8d933e655 |
| dkclarin-LSPConstruction-cmdi_textCorpus.xml | 16.3 KB | text/xml | CMDI metadata | 3e436cf3b457d8d132fb24fd5d671e20 |
| erhvervsOgByggestyrelsen_1.zip | 3.72 MB | application/zip | Corpus 1 | dea50e92118686c9dad9e20238d8adaf |
| erhvervsOgByggestyrelsen_2.zip | 2.02 MB | application/zip | Corpus 2 | 3767db815f1b24a2b15b2186ed7eff79 |
| Muro.zip | 3.62 MB | application/zip | Corpus 3 | 6a40bd1d3ce17983d4b2d7fe3881a01b |
| SBI.zip | 13.3 MB | application/zip | Corpus 4 | d48c1d676cc4d1a4990d36832f0c0e7c |
| teiHeader.xsd | 59.88 KB | text/xml | TEI schema | 9fc5374ad34319278f437b963454f972 |
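
The MD5 values listed for each file above can be used to check that a download arrived intact. The following is a minimal sketch in Python, not part of the repository's own tooling: the function names `md5_of_file` and `verify_downloads` are hypothetical, and it assumes the four corpus archives were saved under their original names in a single local directory.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing above (corpus archives only).
EXPECTED_MD5 = {
    "erhvervsOgByggestyrelsen_1.zip": "dea50e92118686c9dad9e20238d8adaf",
    "erhvervsOgByggestyrelsen_2.zip": "3767db815f1b24a2b15b2186ed7eff79",
    "Muro.zip": "6a40bd1d3ce17983d4b2d7fe3881a01b",
    "SBI.zip": "d48c1d676cc4d1a4990d36832f0c0e7c",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch' or 'missing'."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify_downloads("downloads")` (the directory name is an assumption) returns one status per archive. A `mismatch` suggests a corrupted or incomplete download, and the file should be fetched again.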

